[clang] [AMDGPU] - Add clang builtins for tied WMMA intrinsics (PR #70669)

Stanislav Mekhanoshin via cfe-commits cfe-commits at lists.llvm.org
Thu Nov 9 10:02:02 PST 2023


================
@@ -292,13 +292,17 @@ TARGET_BUILTIN(__builtin_amdgcn_wmma_f32_16x16x16_f16_w32, "V8fV16hV16hV8f", "nc
 TARGET_BUILTIN(__builtin_amdgcn_wmma_f32_16x16x16_bf16_w32, "V8fV16sV16sV8f", "nc", "gfx11-insts")
 TARGET_BUILTIN(__builtin_amdgcn_wmma_f16_16x16x16_f16_w32, "V16hV16hV16hV16hIb", "nc", "gfx11-insts")
 TARGET_BUILTIN(__builtin_amdgcn_wmma_bf16_16x16x16_bf16_w32, "V16sV16sV16sV16sIb", "nc", "gfx11-insts")
+TARGET_BUILTIN(__builtin_amdgcn_wmma_f16_16x16x16_f16_tied_w32, "V16hV16hV16hV16hIb", "nc", "gfx11-insts")
----------------
rampitec wrote:

Please add a negative test checking that the last operand is always a constant integer. We do this every time the 'I' modifier is used.
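
For readers unfamiliar with the type strings in the hunk above: in clang's builtin encoding, an 'I' prefix marks an argument that must fold to an integer constant expression, 'V<n>' introduces a vector of n elements, and the letter that follows is the base type (here 'h' for half, 's' for short, 'f' for float, 'b' for boolean). The sketch below is a toy decoder for just that subset, meant only to show which operand of "V16hV16hV16hV16hIb" the requested negative test would target. The function name constantIntegerParams is hypothetical and this is not how clang itself parses the strings.

```cpp
#include <cctype>
#include <cstddef>
#include <string>
#include <vector>

// Toy decoder (hypothetical helper, not part of clang) for the subset of
// the builtin type-string encoding seen in the quoted TARGET_BUILTIN lines:
// an optional 'I' prefix, an optional vector prefix 'V<count>', then a
// single base-type letter. Other modifiers are assumed not to occur.
// Returns the zero-based indices of parameters that carry the 'I' prefix.
std::vector<std::size_t> constantIntegerParams(const std::string &sig) {
  std::vector<std::size_t> result;
  std::size_t pos = 0;
  // typeIndex 0 is the return type; parameters start at typeIndex 1.
  for (std::size_t typeIndex = 0; pos < sig.size(); ++typeIndex) {
    bool requiresConstant = false;
    if (sig[pos] == 'I') {
      requiresConstant = true;
      ++pos;
    }
    if (pos < sig.size() && sig[pos] == 'V') {
      ++pos;  // skip 'V', then the element count digits
      while (pos < sig.size() &&
             std::isdigit(static_cast<unsigned char>(sig[pos]))) {
        ++pos;
      }
    }
    if (pos < sig.size()) {
      ++pos;  // consume the base-type letter
    }
    if (requiresConstant && typeIndex > 0) {
      result.push_back(typeIndex - 1);
    }
  }
  return result;
}
```

Under these assumptions, calling constantIntegerParams("V16hV16hV16hV16hIb") reports parameter index 3, the trailing boolean, which is the operand a negative test would pass a non-constant value to. The f32 signature "V8fV16hV16hV8f" has no 'I' prefix, so it yields an empty list.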

https://github.com/llvm/llvm-project/pull/70669

