[llvm] [AMDGPU] Replace `isInlinableLiteral16` with specific version (PR #84402)

Shilei Tian via llvm-commits llvm-commits at lists.llvm.org
Fri Mar 8 06:41:34 PST 2024


================
@@ -112,7 +112,7 @@ bb:
 define amdgpu_ps void @test_wmma_bf16_16x16x16_bf16_imm(<8 x i16> %A, <8 x i16> %B, ptr addrspace(1) %out) {
 ; GFX12-LABEL: test_wmma_bf16_16x16x16_bf16_imm:
 ; GFX12:       ; %bb.0: ; %bb
-; GFX12-NEXT:    v_wmma_bf16_16x16x16_bf16 v[10:13], v[0:3], v[4:7], 1.0
+; GFX12-NEXT:    v_wmma_bf16_16x16x16_bf16 v[10:13], v[0:3], v[4:7], 0
----------------
shiltian wrote:

> Also just as a note pulled from downstream: this shall use BF16 in the profile. As a minimum in the profile, but then also in the intrinsic. But downstream it still maintains 1.0, so something is missing.

Yes. That is on the top of my list. I'll fix the downstream today.

https://github.com/llvm/llvm-project/pull/84402


More information about the llvm-commits mailing list