[llvm] [AMDGPU][True16][CodeGen] srl pattern for true16 mode (PR #132987)

Brox Chen via llvm-commits llvm-commits at lists.llvm.org
Thu Mar 27 08:04:37 PDT 2025


================
@@ -2425,6 +2425,13 @@ def : GCNPat <(i1 imm:$imm),
   let WaveSizePredicate = isWave32;
 }
 
+let True16Predicate = UseRealTrue16Insts in
+foreach vt = [i32, v2i16] in
+def : GCNPat <
+  (vt (DivergentBinFrag<srl> VGPR_32:$src, (i32 16))),
+  (REG_SEQUENCE VGPR_32, (i16 (EXTRACT_SUBREG $src, hi16)), lo16, (V_MOV_B16_t16_e64 0, (i16 0x0000), 0), hi16)
----------------
broxigarchen wrote:

Hi Matt. Thanks for the review. I checked the performSrlCombine function. I think it's currently checking for 64bit SRL with a dynamic right shift bits, which probably cannot be put in a isel pattern.

I think for this case since the right shift bit is a constant 16, it's probably better to directly place it in the isel pattern. (If I understand correctly we prefer isel pattern better than customization in the pass)

https://github.com/llvm/llvm-project/pull/132987


More information about the llvm-commits mailing list