[PATCH] D118215: [RISCV] Add support for matching vwmulsu from fixed vectors.
Chenbing.Zheng via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Jan 26 22:50:48 PST 2022
Chenbing.Zheng marked 3 inline comments as done.
Chenbing.Zheng added inline comments.
================
Comment at: llvm/lib/Target/RISCV/RISCVISelLowering.cpp:7269
+ // For VWMULSU.vx vd, vs2, rs1. The signed extern op must be vs2,
+ // so if Op0 IsSignExt it can not be a vector extended from scalar.
----------------
craig.topper wrote:
> Chenbing.Zheng wrote:
> > craig.topper wrote:
> > > This code doesn't make sense to me. Is it still needed even after removing the `SDNPCommutative` flag?
> > >
> > > Once you've identified a VSEXT and VZEXT everything should be fine. Tablegen should only be able to select the splat on zero extend operand.
> > I try to delete it after removing the SDNPCommutative flag. "vwmulsu_vx_v2i16_swap" test failed and genetate vwmulsu.vx v8, v9, a1.
> > This code I add here for blocking combine this stuation in "vwmulsu_vx_v2i16_swap" func.
> > Is there any other better way to block it ?
> >
> > Optimized vector-legalized selection DAG: %bb.0 'vwmulsu_vx_v2i16_swap:'
> > SelectionDAG has 20 nodes:
> > t0: ch = EntryToken
> > t4: i32,ch = CopyFromReg t0, Register:i32 %1
> > t2: i32,ch = CopyFromReg t0, Register:i32 %0
> > t8: v2i8,ch = load<(load (s16) from %ir.x)> t0, t2, undef:i32
> > t25: nxv1i8 = insert_subvector undef:nxv1i8, t8, Constant:i32<0>
> > t28: nxv1i16 = RISCVISD::VZEXT_VL t25, t27, Constant:i32<2>
> > t23: v2i8 = BUILD_VECTOR t4, t4
> > t30: nxv1i8 = insert_subvector undef:nxv1i8, t23, Constant:i32<0>
> > t31: nxv1i16 = RISCVISD::VSEXT_VL t30, t27, Constant:i32<2>
> > t33: nxv1i16 = RISCVISD::MUL_VL t28, t31, t27, Constant:i32<2>
> > t18: ch,glue = CopyToReg t0, Register:nxv1i16 $v8, t33
> > t27: nxv1i1 = RISCVISD::VMSET_VL Constant:i32<2>
> > t19: ch = RISCVISD::RET_FLAG t18, Register:nxv1i16 $v8, t18:1
> >
> > After combine:
> >
> > SelectionDAG has 18 nodes:
> > t0: ch = EntryToken
> > t4: i32,ch = CopyFromReg t0, Register:i32 %1
> > t23: v2i8 = BUILD_VECTOR t4, t4
> > t30: nxv1i8 = insert_subvector undef:nxv1i8, t23, Constant:i32<0>
> > t2: i32,ch = CopyFromReg t0, Register:i32 %0
> > t8: v2i8,ch = load<(load (s16) from %ir.x)> t0, t2, undef:i32
> > t25: nxv1i8 = insert_subvector undef:nxv1i8, t8, Constant:i32<0>
> > t27: nxv1i1 = RISCVISD::VMSET_VL Constant:i32<2>
> > t35: nxv1i16 = RISCVISD::VWMULSU_VL t30, t25, t27, Constant:i32<2>
> > t18: ch,glue = CopyToReg t0, Register:nxv1i16 $v8, t35
> > t19: ch = RISCVISD::RET_FLAG t18, Register:nxv1i16 $v8, t18:1
> With this code removed I get this
>
> ```
> vwmulsu_vx_v2i16_swap: # @vwmulsu_vx_v2i16_swap
> .cfi_startproc
> # %bb.0:
> vsetivli zero, 2, e8, mf8, ta, mu
> vle8.v v9, (a0)
> vmv.v.x v10, a1
> vwmulsu.vv v8, v10, v9
> ret
> ```
>
> That looks correct to me. The splat is done as a separate vmv.v.x.
Thanks,you are right ~
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D118215/new/
https://reviews.llvm.org/D118215
More information about the llvm-commits
mailing list