[PATCH] D138101: [RISCV] Lower unmasked zero-stride vector load to (scalar load + splat)

Tue Nov 22 08:23:57 PST 2022

reames accepted this revision.
reames added a comment.
This revision is now accepted and ready to land.

LGTM, but please land the new test and the opcode change in a separate commit (using old llc), then rebase and land this over it.  Will make the test diff less confusing in the change log.

================
Comment at: llvm/test/CodeGen/RISCV/rvv/fixed-vector-strided-load-store-asm.ll:273
 ; NOT-OPTIMIZED-NEXT:    vle8.v v8, (a6)
-; NOT-OPTIMIZED-NEXT:    vadd.vx v8, v8, a5
+; NOT-OPTIMIZED-NEXT:    vmv.v.x v9, a5
+; NOT-OPTIMIZED-NEXT:    vdivu.vv v8, v9, v8
----------------
pcwang-thead wrote:
> reames wrote:
> > This looks like a regression.
> The diff result seems weird, but I think it's what we expected.
> I added a new test `gather_zero_stride_unfold` (copied from `gather_zero_stride`) in which I changed `%4 = add <32 x i8> %wide.load, %wide.masked.gather` to `%4 = udiv <32 x i8>  %wide.masked.gather, %wide.load` so that splat can't be folded since division is not commutative. So here are scalar load(`lbu a5, 0(a1)`) and vector splat(`vmv.v.x v9, a5`).
I got confused here by the diff.  I was responding as if this was the optimized check, which it isn't.  So ignore me here.  

Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D138101/new/

https://reviews.llvm.org/D138101