[PATCH] D157077: [RISCV] Teach VSETVLIInserter to not demand tail policy when there is no tail element
Jianjian Guan via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Aug 4 01:01:06 PDT 2023
jacquesguan created this revision.
jacquesguan added reviewers: craig.topper, asb, luismarques, frasercrmck, reames, luke.
Herald added subscribers: jobnoorman, VincentWu, vkmr, apazos, sameer.abuasal, s.egerton, Jim, benna, psnobl, jocewei, PkmX, the_o, brucehoult, MartinMosbeck, rogfer01, edward-jones, zzheng, jrtc27, shiva0217, kito-cheng, niosHD, sabuasal, simoncook, johnrusso, rbar, hiraditya, arichardson.
Herald added a project: All.
jacquesguan requested review of this revision.
Herald added subscribers: llvm-commits, wangpc, eopXD, MaskRay.
Herald added a project: LLVM.
If avl is vlmax and lmul is not fractional, there is no tail element so that no need to demand tail policy.
Repository:
rG LLVM Github Monorepo
https://reviews.llvm.org/D157077
Files:
llvm/lib/Target/RISCV/RISCVInsertVSETVLI.cpp
llvm/test/CodeGen/RISCV/rvv/rvv-peephole-vmerge-vops.ll
llvm/test/CodeGen/RISCV/rvv/vsetvli-insert.ll
Index: llvm/test/CodeGen/RISCV/rvv/vsetvli-insert.ll
===================================================================
--- llvm/test/CodeGen/RISCV/rvv/vsetvli-insert.ll
+++ llvm/test/CodeGen/RISCV/rvv/vsetvli-insert.ll
@@ -579,6 +579,32 @@
ret <vscale x 1 x double> %2
}
+define <vscale x 1 x i64> @test21(<vscale x 1 x i64> %a, <vscale x 1 x i64> %b, <vscale x 1 x i1> %mask) nounwind {
+; CHECK-LABEL: test21:
+; CHECK: # %bb.0: # %entry
+; CHECK-NEXT: vsetvli a0, zero, e64, m1, tu, mu
+; CHECK-NEXT: vadd.vv v8, v8, v8, v0.t
+; CHECK-NEXT: vadd.vv v9, v9, v8, v0.t
+; CHECK-NEXT: vmv1r.v v8, v9
+; CHECK-NEXT: ret
+entry:
+ %x = call <vscale x 1 x i64> @llvm.riscv.vadd.mask.nxv1i64.nxv1i64(
+ <vscale x 1 x i64> %a,
+ <vscale x 1 x i64> %a,
+ <vscale x 1 x i64> %a,
+ <vscale x 1 x i1> %mask,
+ i64 -1,
+ i64 0)
+ %y = call <vscale x 1 x i64> @llvm.riscv.vadd.mask.nxv1i64.nxv1i64(
+ <vscale x 1 x i64> %b,
+ <vscale x 1 x i64> %b,
+ <vscale x 1 x i64> %x,
+ <vscale x 1 x i1> %mask,
+ i64 -1,
+ i64 1)
+ ret <vscale x 1 x i64> %y
+}
+
; This used to fail the machine verifier due to the vsetvli being removed
; while the add was still using it.
define i64 @bad_removal(<2 x i64> %arg) {
Index: llvm/test/CodeGen/RISCV/rvv/rvv-peephole-vmerge-vops.ll
===================================================================
--- llvm/test/CodeGen/RISCV/rvv/rvv-peephole-vmerge-vops.ll
+++ llvm/test/CodeGen/RISCV/rvv/rvv-peephole-vmerge-vops.ll
@@ -385,7 +385,6 @@
; CHECK: # %bb.0:
; CHECK-NEXT: vsetvli a0, zero, e32, m1, ta, ma
; CHECK-NEXT: vfadd.vv v9, v9, v10
-; CHECK-NEXT: vsetvli zero, zero, e32, m1, tu, ma
; CHECK-NEXT: vmerge.vvm v8, v8, v9, v0
; CHECK-NEXT: ret
%a = call <vscale x 2 x float> @llvm.experimental.constrained.fadd(<vscale x 2 x float> %x, <vscale x 2 x float> %y, metadata !"round.dynamic", metadata !"fpexcept.strict") strictfp
Index: llvm/lib/Target/RISCV/RISCVInsertVSETVLI.cpp
===================================================================
--- llvm/lib/Target/RISCV/RISCVInsertVSETVLI.cpp
+++ llvm/lib/Target/RISCV/RISCVInsertVSETVLI.cpp
@@ -344,6 +344,22 @@
Res.MaskPolicy = false;
}
+ // VLMAX does not need tail policy.
+ if (RISCVII::hasVLOp(MI.getDesc().TSFlags)) {
+ RISCVII::VLMUL VLMul = RISCVII::getLMul(MI.getDesc().TSFlags);
+ // Fractional LMULs always require tail policy.
+ if (VLMul < RISCVII::LMUL_RESERVED) {
+ const MachineOperand &VLOp = MI.getOperand(getVLOpNum(MI));
+ if (VLOp.isImm()) {
+ int64_t Imm = VLOp.getImm();
+ if (Imm == RISCV::VLMaxSentinel)
+ Res.TailPolicy = false;
+ } else if (VLOp.getReg() == RISCV::X0) {
+ Res.TailPolicy = false;
+ }
+ }
+ }
+
// If this is a mask reg operation, it only cares about VLMAX.
// TODO: Possible extensions to this logic
// * Probably ok if available VLMax is larger than demanded
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D157077.547129.patch
Type: text/x-patch
Size: 2971 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20230804/a5a67983/attachment.bin>
More information about the llvm-commits
mailing list