[llvm] [VPlan] Move FOR splice cost into VPInstruction::FirstOrderRecurrenceSplice (PR #129645)

Luke Lau via llvm-commits llvm-commits at lists.llvm.org
Wed Mar 5 02:24:26 PST 2025


lukel97 wrote:

My gut feeling 

> > > The move makes sense. Curious if it would be easy to add a test where this leads to different VF to be picked?
> > 
> > 
> > I gave this a try there but I couldn't really find a way. Even by doubling the vector cost with `+tune-dlen-factor2`/adding predication the vector costs seem to outweigh the scalar. We also can't seem to EVL vectorize a first order recurrence of i64 yet either, which I thought might increase the vector cost, because it ends up creating a VPWidenIntOrFpInductionRecipe, which we can't handle yet without #115274 / #118638
> 
> I didn't get this point. Why would FOR be related to VPWidenIntOrFpInductionRecipe? Could you provide a more details or examples?

For this loop where `%for1` is i64:


```llvm
define void @first_order_recurrence(ptr noalias %A, ptr noalias %B, i64 %TC) {

entry:
  br label %for.body

for.body:
  %indvars = phi i64 [ 0, %entry ], [ %indvars.next, %for.body ]
  %for1 = phi i64 [ 33, %entry ], [ %0, %for.body ]
  %arrayidx = getelementptr inbounds nuw i64, ptr %A, i64 %indvars
  %0 = load i64, ptr %arrayidx, align 4
  %add = add nsw i64 %for1, %0
  %arrayidx2 = getelementptr inbounds nuw i64, ptr %B, i64 %indvars
  store i64 %add, ptr %arrayidx2, align 4
  %indvars.next = add nuw nsw i64 %indvars, 1
  %exitcond.not = icmp eq i64 %indvars.next, %TC
  br i1 %exitcond.not, label %for.end, label %for.body

for.end:
  ret void
}

```

The VPlan coming into `VPlanTransforms::tryAddExplicitVectorLength` seems to have a VPWidenIntOrFpInductionRecipe, used for predicating?

```
VPlan 'Initial VPlan for VF={vscale x 1,vscale x 2},UF>=1' {
Live-in vp<%0> = VF
Live-in vp<%1> = VF * UF
Live-in vp<%2> = vector-trip-count
Live-in vp<%3> = backedge-taken count
Live-in ir<%TC> = original trip-count

ir-bb<entry>:
Successor(s): vector.ph

vector.ph:
Successor(s): vector loop

<x1> vector loop: {
  vector.body:
    EMIT vp<%4> = CANONICAL-INDUCTION ir<0>, vp<%index.next>
    ir<%indvars> = WIDEN-INDUCTION  ir<0>, ir<1>, vp<%0>
    FIRST-ORDER-RECURRENCE-PHI ir<%for1> = phi ir<33>, vp<%6>
    EMIT vp<%5> = icmp ule ir<%indvars>, vp<%3>
    WIDEN-GEP Inv[Var] ir<%arrayidx> = getelementptr inbounds nuw ir<%A>, ir<%indvars>
  Successor(s): pred.load

  <xVFxUF> pred.load: {
    pred.load.entry:
      BRANCH-ON-MASK vp<%5>
    Successor(s): pred.load.if, pred.load.continue

    pred.load.if:
      REPLICATE ir<%0> = load ir<%arrayidx> (S->V)
    Successor(s): pred.load.continue

    pred.load.continue:
      PHI-PREDICATED-INSTRUCTION vp<%6> = ir<%0>
    No successors
  }
  Successor(s): for.body.0

  for.body.0:
    EMIT vp<%7> = first-order splice ir<%for1>, vp<%6>
    WIDEN ir<%add> = add nsw vp<%7>, vp<%6>
    WIDEN-GEP Inv[Var] ir<%arrayidx2> = getelementptr inbounds nuw ir<%B>, ir<%indvars>
  Successor(s): pred.store

  <xVFxUF> pred.store: {
    pred.store.entry:
      BRANCH-ON-MASK vp<%5>
    Successor(s): pred.store.if, pred.store.continue

    pred.store.if:
      REPLICATE store ir<%add>, ir<%arrayidx2>
    Successor(s): pred.store.continue

    pred.store.continue:
    No successors
  }
  Successor(s): for.body.1

  for.body.1:
    EMIT vp<%index.next> = add vp<%4>, vp<%1>
    EMIT branch-on-count vp<%index.next>, vp<%2>
  No successors
}
```

I'm not sure why we're predicating this anyway.

https://github.com/llvm/llvm-project/pull/129645


More information about the llvm-commits mailing list