[PATCH] D124961: [riscv] Use X0 for destination of VSETVLI instruction if result unused

Philip Reames via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed May 4 14:19:58 PDT 2022


reames added a comment.

In D124961#3492328 <https://reviews.llvm.org/D124961#3492328>, @craig.topper wrote:

> In D124961#3492309 <https://reviews.llvm.org/D124961#3492309>, @reames wrote:
>
>> In D124961#3492285 <https://reviews.llvm.org/D124961#3492285>, @craig.topper wrote:
>>
>>>> Since after the core insertion/lowering algorithm has run, basically all user written VSETVLIs will have their GPR result unused (as VTYPE/VLEN is now explicitly read instead), this kicks in for most tests which involve a vsetvli intrinsic. When inserting VSETVLIs to lower psuedos, we prefer the X0 form anyways.
>>>
>>> Almost any real scenario that processes more data than fits in a register will need the GPR output to update memory pointers and to control a loop. Which I guess means we're lacking realistic tests.
>>
>> How so?  An idiomatic tail folded loop doesn't need to explicitly read VL.  It's passed via VL reg itself to all the vector ops.  It changes across iterations, but that doesn't mean the GPR is used.
>
> Don't you need to know how many elements were processed to advance the pointer for the next iteration. That is done is scalar registers. You also need to update a remaining element count to be passed to the vsetvli for the next iteration. Assuming we're talking about a loop like this example from the spec

You are of course correct.

The actual example I was looking at was fixed length vectorization.  Because you know the VL in advance, you don't need to read it back.

So we have realistic test for fixed length loop vectorization at least.  :)


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D124961/new/

https://reviews.llvm.org/D124961



More information about the llvm-commits mailing list