[llvm-branch-commits] [clang] [llvm] AMDGPU: Define v_mfma_f32_32x32x16_bf16 for gfx950 (PR #116679)

Shilei Tian via llvm-branch-commits llvm-branch-commits at lists.llvm.org
Mon Nov 18 18:10:20 PST 2024


================
@@ -437,6 +437,8 @@ TARGET_BUILTIN(__builtin_amdgcn_cvt_sr_fp8_f32, "ifiiIi", "nc", "fp8-conversion-
 TARGET_BUILTIN(__builtin_amdgcn_mfma_f32_16x16x32_f16, "V4fV8hV8hV4fIiIiIi", "nc", "gfx950-insts")
 TARGET_BUILTIN(__builtin_amdgcn_mfma_f32_32x32x16_f16, "V16fV8hV8hV16fIiIiIi", "nc", "gfx950-insts")
 
----------------
shiltian wrote:

nit: maybe no need for a blank line

https://github.com/llvm/llvm-project/pull/116679


More information about the llvm-branch-commits mailing list