[PATCH] D136059: [AMDGPU][DAG] Fix trunc/shift combine condition
Pierre van Houtryve via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Oct 17 00:56:47 PDT 2022
Pierre-vh created this revision.
Pierre-vh added a reviewer: arsenm.
Herald added subscribers: kosarev, foad, kerbowa, hiraditya, t-tye, tpr, dstuttard, yaxunl, jvesely, kzhuravl.
Herald added a project: All.
Pierre-vh requested review of this revision.
Herald added subscribers: llvm-commits, wdng.
Herald added a project: LLVM.
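The condition guarding the trunc/shift combine needs to be different for right shifts: narrowing the shift to 32 bits first can drop high source bits that the shift would otherwise move into the truncated result.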
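To illustrate the failure mode, here is a minimal sketch in plain C++ integers, not LLVM code. The helper names (truncToBits, wideShiftThenTrunc, narrowShiftThenTrunc, rightShiftNarrowingIsSafe) are hypothetical and only model the two forms of the expression, assuming a 64-bit source and a result type of at most 32 bits.

```cpp
#include <cassert>
#include <cstdint>

// Keep only the low `bits` bits of v (models `trunc` to iN, with N <= 32).
constexpr uint32_t truncToBits(uint64_t v, unsigned bits) {
  return static_cast<uint32_t>(v & ((uint64_t{1} << bits) - 1));
}

// Original form: trunc (lshr i64 x, amt) to iSize.
constexpr uint32_t wideShiftThenTrunc(uint64_t x, unsigned amt, unsigned size) {
  return truncToBits(x >> amt, size);
}

// Narrowed form: trunc (lshr (trunc x to i32), amt) to iSize.
// Assumes amt < 32 so the 32-bit shift is well defined.
constexpr uint32_t narrowShiftThenTrunc(uint64_t x, unsigned amt, unsigned size) {
  return truncToBits(static_cast<uint32_t>(x) >> amt, size);
}

// Bound used by the patch for right shifts: amt <= 32 - size.
constexpr bool rightShiftNarrowingIsSafe(unsigned amt, unsigned size) {
  return size <= 32 && amt <= 32 - size;
}

// amt = 25, size = 26 passes the old check (25 <= 26), but bit 40 of the
// source lands in bit 15 of the result and is lost by the early trunc to i32.
static_assert(wideShiftThenTrunc(uint64_t{1} << 40, 25, 26) == 32768u, "");
static_assert(narrowShiftThenTrunc(uint64_t{1} << 40, 25, 26) == 0u, "");
static_assert(!rightShiftNarrowingIsSafe(25, 26), "");

// amt = 6, size = 26 satisfies the new bound (6 <= 32 - 26): the result only
// reads source bits 6..31, so both forms agree.
static_assert(rightShiftNarrowingIsSafe(6, 26), "");
static_assert(wideShiftThenTrunc(~uint64_t{0}, 6, 26) ==
              narrowShiftThenTrunc(~uint64_t{0}, 6, 26), "");
```

The bound follows from which source bits the result reads: a right shift by amt followed by a truncation to size bits uses source bits amt through amt + size - 1, and those all sit in the low 32 bits only when amt + size <= 32.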
Repository:
rG LLVM Github Monorepo
https://reviews.llvm.org/D136059
Files:
llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
llvm/test/CodeGen/AMDGPU/partial-shift-shrink.ll
Index: llvm/test/CodeGen/AMDGPU/partial-shift-shrink.ll
===================================================================
--- llvm/test/CodeGen/AMDGPU/partial-shift-shrink.ll
+++ llvm/test/CodeGen/AMDGPU/partial-shift-shrink.ll
@@ -149,3 +149,19 @@
%trunc = trunc i64 %shift to i16
ret i16 %trunc
}
+
+define i32 @trunc_srl_i64_25_to_i26(i64 %x) {
+; GCN-LABEL: trunc_srl_i64_25_to_i26:
+; GCN: ; %bb.0:
+; GCN-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
+; GCN-NEXT: v_and_b32_e32 v0, 0xa000000, v0
+; GCN-NEXT: v_alignbit_b32 v0, 0, v0, 25
+; GCN-NEXT: v_add_u32_e32 v0, 55, v0
+; GCN-NEXT: s_setpc_b64 s[30:31]
+ %value.knownbits2 = and i64 %x, 167772160 ; 0xA000000
+ %shift = lshr i64 %value.knownbits2, 25
+ %trunc = trunc i64 %shift to i26
+ %add = add i26 %trunc, 55
+ %ext = zext i26 %add to i32
+ ret i32 %ext
+}
Index: llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
===================================================================
--- llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
+++ llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
@@ -3246,7 +3246,15 @@
SDValue Amt = Src.getOperand(1);
KnownBits Known = DAG.computeKnownBits(Amt);
unsigned Size = VT.getScalarSizeInBits();
- if ((Known.isConstant() && Known.getConstant().ule(Size)) ||
+
+ // When the shift amount is known:
+ // - For left shifts, do the transform if ShiftAmt <= Size
+ // - For right shift, do it if ShiftAmt <= (32 - Size) to avoid
+ // losing information stored in the high bits when truncating.
+ const unsigned MaxCstSize = (Src.getOpcode() == ISD::SHL)
+ ? Size
+ : (32 - VT.getScalarSizeInBits());
+ if ((Known.isConstant() && Known.getConstant().ule(MaxCstSize)) ||
(Known.countMaxActiveBits() <= Log2_32(Size))) {
EVT MidVT = VT.isVector() ?
EVT::getVectorVT(*DAG.getContext(), MVT::i32,