[llvm] [AMDGPU][True16][CodeGen] srl pattern for true16 mode (PR #132987)

Matt Arsenault via llvm-commits llvm-commits at lists.llvm.org
Fri Mar 28 09:03:41 PDT 2025


================
@@ -2425,6 +2425,13 @@ def : GCNPat <(i1 imm:$imm),
   let WaveSizePredicate = isWave32;
 }
 
+let True16Predicate = UseRealTrue16Insts in
+foreach vt = [i32, v2i16] in
+def : GCNPat <
+  (vt (DivergentBinFrag<srl> VGPR_32:$src, (i32 16))),
+  (REG_SEQUENCE VGPR_32, (i16 (EXTRACT_SUBREG $src, hi16)), lo16, (V_MOV_B16_t16_e64 0, (i16 0x0000), 0), hi16)
----------------
arsenm wrote:

But the registers shouldn't matter? The problem isn't any different than the current 16-bit vs. 32-bit uniform problem. As a proxy you can just check if the value is uniform? 

https://github.com/llvm/llvm-project/pull/132987


More information about the llvm-commits mailing list