[llvm] [AMDGPU][True16][CodeGen] Add patterns to reduce intermediates (PR #162047)

Brox Chen via llvm-commits llvm-commits at lists.llvm.org
Mon Oct 6 07:33:44 PDT 2025


================
@@ -3717,6 +3717,23 @@ def : GCNPat <
 } // End foreach Ty = ...
 } // End AddedComplexity = 1
 
+let True16Predicate = UseRealTrue16Insts in {
+def : GCNPat<
+  (i32 (DivergentBinFrag<or>
+    (i32 (zext i16:$src_lo)),
+    (i32 (bitconvert (v2i16 (build_vector (i16 0), (i16 VGPR_32:$src_hi)))))
+  )),
+  (REG_SEQUENCE VGPR_32, $src_lo, lo16, $src_hi, hi16)
+>;
+def : GCNPat<
+  (i32 (DivergentBinFrag<or>
+    (i32 (bitconvert (v2i16 (build_vector (i16 0), (i16 VGPR_32:$src_hi))))),
----------------
broxigarchen wrote:

same as above

https://github.com/llvm/llvm-project/pull/162047


More information about the llvm-commits mailing list