[llvm] [AMDGPU] Fix folding of v2i16/v2f16 splat imms (PR #72709)
Stanislav Mekhanoshin via llvm-commits
llvm-commits at lists.llvm.org
Mon Nov 27 13:56:24 PST 2023
================
@@ -41,9 +41,11 @@ entry:
ret void
}
+; FIXME: GFX940 fails to fold 0x1001 with op_sel
+
; GCN-LABEL: {{^}}test_llvm_amdgcn_udot2_op_sel:
; GFX906: v_dot2_u32_u16 v{{[0-9]+}}, 1, v{{[0-9]+}}, s{{[0-9]+}} op_sel:[0,1,0] op_sel_hi:[0,0,1]{{$}}
-; GFX940: v_dot2_u32_u16 v{{[0-9]+}}, 1, v{{[0-9]+}}, s{{[0-9]+}}{{$}}
----------------
rampitec wrote:
And this is still a bug even if bf16 needs a different handling.
https://github.com/llvm/llvm-project/pull/72709
More information about the llvm-commits
mailing list