[llvm-branch-commits] [clang] [llvm] AMDGPU: Define v_mfma_f32_32x32x16_bf16 for gfx950 (PR #116679)
Shilei Tian via llvm-branch-commits
llvm-branch-commits at lists.llvm.org
Mon Nov 18 18:10:20 PST 2024
================
@@ -437,6 +437,8 @@ TARGET_BUILTIN(__builtin_amdgcn_cvt_sr_fp8_f32, "ifiiIi", "nc", "fp8-conversion-
TARGET_BUILTIN(__builtin_amdgcn_mfma_f32_16x16x32_f16, "V4fV8hV8hV4fIiIiIi", "nc", "gfx950-insts")
TARGET_BUILTIN(__builtin_amdgcn_mfma_f32_32x32x16_f16, "V16fV8hV8hV16fIiIiIi", "nc", "gfx950-insts")
----------------
shiltian wrote:
nit: maybe no need for a blank line
https://github.com/llvm/llvm-project/pull/116679
More information about the llvm-branch-commits
mailing list