[llvm] [AMDGPU] Implement vop3p complex pattern optmization for gisel (PR #130234)
Shilei Tian via llvm-commits
llvm-commits at lists.llvm.org
Fri Mar 21 09:10:38 PDT 2025
================
@@ -263,8 +262,7 @@ define i32 @v_sdot2_fnegf32_c(<2 x i16> %a, <2 x i16> %b, float %c) {
; GFX10-LABEL: v_sdot2_fnegf32_c:
; GFX10: ; %bb.0:
; GFX10-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
-; GFX10-NEXT: v_xor_b32_e32 v2, 0x80000000, v2
-; GFX10-NEXT: v_dot2_i32_i16 v0, v0, v1, v2
+; GFX10-NEXT: v_dot2_i32_i16 v0, v0, v1, v2 neg_lo:[0,0,1] neg_hi:[0,0,1]
----------------
shiltian wrote:
We don't. These changes are not right.
https://github.com/llvm/llvm-project/pull/130234
More information about the llvm-commits
mailing list