[llvm] [LoopVectorizer][AArch64] Move getMinTripCountTailFoldingThreshold later. (PR #132170)
Florian Hahn via llvm-commits
llvm-commits at lists.llvm.org
Wed Mar 26 04:27:17 PDT 2025
================
@@ -4025,11 +4025,8 @@ LoopVectorizationCostModel::computeMaxVF(ElementCount UserVF, unsigned UserIC) {
MaxPowerOf2RuntimeVF = std::nullopt; // Stick with tail-folding for now.
}
- if (MaxPowerOf2RuntimeVF && *MaxPowerOf2RuntimeVF > 0) {
- assert((UserVF.isNonZero() || isPowerOf2_32(*MaxPowerOf2RuntimeVF)) &&
- "MaxFixedVF must be a power of 2");
- unsigned MaxVFtimesIC =
- UserIC ? *MaxPowerOf2RuntimeVF * UserIC : *MaxPowerOf2RuntimeVF;
+ auto ScalarEpilogueNeeded = [this, &UserIC](unsigned MaxVF) {
----------------
fhahn wrote:
The function returns true if no scalar epilogue is needed, so the name should be adjusted?
```suggestion
auto NoScalarEpilogueNeeded = [this, &UserIC](unsigned MaxVF) {
```
https://github.com/llvm/llvm-project/pull/132170
More information about the llvm-commits
mailing list