[llvm] [LoopVectorize] Add cost of generating tail-folding mask to the loop (PR #130565)
Florian Hahn via llvm-commits
llvm-commits at lists.llvm.org
Wed Mar 19 13:38:59 PDT 2025
================
@@ -4601,8 +4601,32 @@ VectorizationFactor LoopVectorizationPlanner::selectVectorizationFactor() {
continue;
InstructionCost C = CM.expectedCost(VF);
- VectorizationFactor Candidate(VF, C, ScalarCost.ScalarCost);
+ // Add on other costs that are modelled in VPlan, but not in the legacy
+ // cost model.
+ VPCostContext CostCtx(CM.TTI, *CM.TLI, CM.Legal->getWidestInductionType(),
+ CM, CM.CostKind);
+ if (VPRegionBlock *VectorRegion = P->getVectorLoopRegion()) {
+ for (VPBlockBase *Block :
+ vp_depth_first_shallow(VectorRegion->getEntry())) {
+ if (!isa<VPBasicBlock>(Block))
+ continue;
+ for (VPRecipeBase &R : *cast<VPBasicBlock>(Block)) {
+ if (auto *VPI = dyn_cast<VPInstruction>(&R)) {
----------------
fhahn wrote:
might be worth
```suggestion
auto *VPI = dyn_cast<VPInstruction>(&R);
if (!VPI)
continue
```
to reduce the already high nesting level slightly
https://github.com/llvm/llvm-project/pull/130565
More information about the llvm-commits
mailing list