[llvm] [LoopVectorize] Add cost of generating tail-folding mask to the loop (PR #130565)

Florian Hahn via llvm-commits llvm-commits at lists.llvm.org
Wed Mar 19 13:38:59 PDT 2025


================
@@ -4601,8 +4601,32 @@ VectorizationFactor LoopVectorizationPlanner::selectVectorizationFactor() {
         continue;
 
       InstructionCost C = CM.expectedCost(VF);
-      VectorizationFactor Candidate(VF, C, ScalarCost.ScalarCost);
 
+      // Add on other costs that are modelled in VPlan, but not in the legacy
+      // cost model.
+      VPCostContext CostCtx(CM.TTI, *CM.TLI, CM.Legal->getWidestInductionType(),
+                            CM, CM.CostKind);
+      if (VPRegionBlock *VectorRegion = P->getVectorLoopRegion()) {
+        for (VPBlockBase *Block :
+             vp_depth_first_shallow(VectorRegion->getEntry())) {
+          if (!isa<VPBasicBlock>(Block))
+            continue;
+          for (VPRecipeBase &R : *cast<VPBasicBlock>(Block)) {
+            if (auto *VPI = dyn_cast<VPInstruction>(&R)) {
----------------
fhahn wrote:

might be worth 

```suggestion
            auto *VPI = dyn_cast<VPInstruction>(&R);
            if (!VPI) 
              continue
```

to reduce the already high nesting level slightly 

https://github.com/llvm/llvm-project/pull/130565


More information about the llvm-commits mailing list