[llvm] [AMDGPU][True16][CodeGen] support for true16 for vinterp 16bit instructions (PR #116702)
Brox Chen via llvm-commits
llvm-commits at lists.llvm.org
Thu Nov 21 13:36:07 PST 2024
================
@@ -181,9 +181,43 @@ multiclass VInterpF16Pat <SDPatternOperator op, Instruction inst,
def : VInterpF16Pat<op, inst, dst_type, 1, high_pat>;
}
+class VInterpF16Pat_t16 <SDPatternOperator op, Instruction inst,
+ ValueType dstVT, bit high, bit isP2> : GCNPat <
+ (dstVT (op
+ (VINTERPMods f32:$src0, i32:$src0_modifiers),
+ (VINTERPMods f32:$src1, i32:$src1_modifiers),
+ (VINTERPMods f32:$src2, i32:$src2_modifiers),
+ !if(high, (i1 -1), (i1 0)))),
+ (inst $src0_modifiers,
+ (f16 (EXTRACT_SUBREG VGPR_32:$src0, !if(high, hi16, lo16))),
----------------
broxigarchen wrote:
I see. I think this is not extracting from a 32bits float to a 16bits float. The input is always a 16bits float and this is just selecting the high half or low half of a 32 bit register
https://github.com/llvm/llvm-project/pull/116702
More information about the llvm-commits
mailing list