[llvm] [AMDGPU][True16][CodeGen] build_vector pattern in true16 (PR #118904)

Joe Nash via llvm-commits llvm-commits at lists.llvm.org
Mon Feb 10 14:12:13 PST 2025


================
@@ -3424,6 +3441,8 @@ def : GCNPat <
 
 // Take the upper 16 bits from V[0] and the lower 16 bits from V[1]
 // Special case, can use V_ALIGNBIT (always uses encoded literal)
+foreach p = [NotHasTrue16BitInsts, UseFakeTrue16Insts] in
+let True16Predicate = p in
 def : GCNPat <
   (vecTy (DivergentBinFrag<build_vector>
     (Ty !if(!eq(Ty, i16),
----------------
Sisyph wrote:

Perhaps the PERM_B32 pattern at new line 3456 should also be disabled in real True16 mode? It seems redundant with the new REG_SEQUENCE pattern.

https://github.com/llvm/llvm-project/pull/118904

