[llvm] [LoopVectorizer][AArch64] Move getMinTripCountTailFoldingThreshold later. (PR #132170)

Florian Hahn via llvm-commits llvm-commits at lists.llvm.org
Wed Mar 26 04:27:17 PDT 2025


================
@@ -4025,11 +4025,8 @@ LoopVectorizationCostModel::computeMaxVF(ElementCount UserVF, unsigned UserIC) {
       MaxPowerOf2RuntimeVF = std::nullopt; // Stick with tail-folding for now.
   }
 
-  if (MaxPowerOf2RuntimeVF && *MaxPowerOf2RuntimeVF > 0) {
-    assert((UserVF.isNonZero() || isPowerOf2_32(*MaxPowerOf2RuntimeVF)) &&
-           "MaxFixedVF must be a power of 2");
-    unsigned MaxVFtimesIC =
-        UserIC ? *MaxPowerOf2RuntimeVF * UserIC : *MaxPowerOf2RuntimeVF;
+  auto ScalarEpilogueNeeded = [this, &UserIC](unsigned MaxVF) {
----------------
fhahn wrote:

The function returns true if no scalar epilogue is needed, so the name should be adjusted?

```suggestion
  auto NoScalarEpilogueNeeded  = [this, &UserIC](unsigned MaxVF) {
```



https://github.com/llvm/llvm-project/pull/132170


More information about the llvm-commits mailing list