[llvm] [AMDGPU][True16][MC][CodeGen] true16 mode for v_cvt_pk_bf8/fp8_f32 (PR #141881)

Stanislav Mekhanoshin via llvm-commits llvm-commits at lists.llvm.org
Thu May 29 12:31:58 PDT 2025


================
@@ -570,6 +570,36 @@ def VOP3_CVT_PK_F8_F32_Profile : VOP3_Profile<VOP_I32_F32_F32, VOP3_OPSEL> {
   let HasExtVOP3DPP = 1;
 }
 
+def VOP3_CVT_PK_F8_F32_Profile_fake16 : VOP3_Profile_Fake16<VOP_I16_F32_F32, VOP3_OPSEL> {
----------------
rampitec wrote:

I'd prefer avoid this and let these to be generated automatically, then use VOP3_Profile_True16/VOP3_Profile_Fake16 wrappers directly or indirectly. It should probably already work, if not some changes may need to be pulled from downstream.

https://github.com/llvm/llvm-project/pull/141881


More information about the llvm-commits mailing list