[llvm] [DA] do not handle array accesses of different offsets (PR #123436)
Michael Kruse via llvm-commits
llvm-commits at lists.llvm.org
Fri Feb 28 03:44:55 PST 2025
================
@@ -3569,6 +3569,123 @@ bool DependenceInfo::invalidate(Function &F, const PreservedAnalyses &PA,
Inv.invalidate<LoopAnalysis>(F, PA);
}
+// Check that memory access offsets in V are multiples of array element size
+// EltSize. Param records the first parametric expression. If the scalar
+// evolution V contains two or more parameters, we check that the subsequent
+// parametric expressions are multiples of the first parametric expression
+// Param.
+static bool checkOffsets(ScalarEvolution *SE, const SCEV *V, const SCEV *&Param,
+ uint64_t EltSize) {
+ if (auto *AddRec = dyn_cast<SCEVAddRecExpr>(V)) {
+ if (!checkOffsets(SE, AddRec->getStart(), Param, EltSize))
+ return false;
+ return checkOffsets(SE, AddRec->getStepRecurrence(*SE), Param, EltSize);
+ }
+ if (auto *Cst = dyn_cast<SCEVConstant>(V)) {
+ APInt C = Cst->getAPInt();
+
+ // For example, alias_with_different_offsets in
+ // test/Analysis/DependenceAnalysis/DifferentOffsets.ll accesses "%A + 2":
+ // %arrayidx = getelementptr inbounds i8, ptr %A, i64 2
+ // store i32 42, ptr %arrayidx, align 1
+ // which is writing an i32, i.e., EltSize = 4 bytes, with an offset C = 2.
+ // checkOffsets returns false, as the offset C=2 is not a multiple of 4.
+ return C.srem(EltSize) == 0;
+ }
+
+ // Use a lambda helper function to check V for parametric expressions.
+ // Param records the first parametric expression. If the scalar evolution V
+ // contains two or more parameters, we check that the subsequent parametric
+ // expressions are multiples of the first parametric expression Param.
+ auto checkParamsMultipleOfSize = [&](const SCEV *V,
+ const SCEV *&Param) -> bool {
+ if (EltSize == 1)
+ return true;
+ if (!Param) {
+ Param = V;
+ return true;
----------------
Meinersbur wrote:
> If a parameter (ssa name invariant in loop) name %n is used as offset, then we assume that %n is a multiple of the array element size. There isn't even a check that `%n` is loop-invariant.
That ssumption that obviously does not necessarily hold true.
I think what you mean is that DA is assuming that `%n` is part of all index expressions and therefore has the same offset everywhere. If some index expressions use `%m` instead, they may have different relative offsets.
I do not think this is sound. Some index expressions may also use `%n + %n`, `%n + 1`, or no `%n` at all in which case they may have different relative offsets already. Alternatively, I may not have understood the logic here.
If `%n` (or any other subexpression) is invariant to all loops and occurs in every index, I think it could considered part of the base pointer. `ScalarEvolution::getPointerBase` atm looks into all SCEVAddExpr disregarding whether operands are loop invariant. Maybe that's a viable approach.
https://github.com/llvm/llvm-project/pull/123436
More information about the llvm-commits
mailing list