[PATCH] D139648: [RISCV] Use vmv.v.i for insertion into lane 0 of undef vector when profitable

Thu Dec 8 11:45:42 PST 2022

craig.topper added inline comments.

================
Comment at: llvm/lib/Target/RISCV/RISCVISelLowering.cpp:2835
+  // done by the vmv.v.i is linear in LMUL.  For the vmv.s.x sequence, we
+  // need two instructions (to materialize constant and then insrt), so
+  // lmul2 vmv.v.i requires about the same work.
----------------
insrt -> insert

================
Comment at: llvm/lib/Target/RISCV/RISCVISelLowering.cpp:2836
+  // need two instructions (to materialize constant and then insrt), so
+  // lmul2 vmv.v.i requires about the same work.
+  if (ConstantSDNode *Const = dyn_cast<ConstantSDNode>(Scalar)) {
----------------
This might not be strictly true on an architecture with data path width less than VLEN. If SEW fits within the datapath length, vmv.x.s could be done in less than cycles than an LMUL=1 vmv.v.i. And LMUL=2 vmv.v.i would be 4x or more cycles than than vmv.x.s in such an architecture.

But I'm not sure how big of an issue that is.

Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D139648/new/

https://reviews.llvm.org/D139648