[llvm] [AMDGPU][True16][CodeGen] srl pattern for true16 mode (PR #132987)
Brox Chen via llvm-commits
llvm-commits at lists.llvm.org
Fri Mar 28 08:53:15 PDT 2025
================
@@ -2425,6 +2425,13 @@ def : GCNPat <(i1 imm:$imm),
let WaveSizePredicate = isWave32;
}
+let True16Predicate = UseRealTrue16Insts in
+foreach vt = [i32, v2i16] in
+def : GCNPat <
+ (vt (DivergentBinFrag<srl> VGPR_32:$src, (i32 16))),
+ (REG_SEQUENCE VGPR_32, (i16 (EXTRACT_SUBREG $src, hi16)), lo16, (V_MOV_B16_t16_e64 0, (i16 0x0000), 0), hi16)
----------------
broxigarchen wrote:
Hi Matt. I tried put this lowering into SrlCombine, but seems not working. The sdagcombine happens before ISel selection, and thus we don't have info about register at that time. Putting it in SrlCombine might get it lowered to sgpr_hi/lo16 which does not work.
https://github.com/llvm/llvm-project/pull/132987
More information about the llvm-commits
mailing list