[llvm] [AMDGPU][MC] Allow opsel for v_max_i16 etc in GFX10 (PR #143982)
Joe Nash via llvm-commits
llvm-commits at lists.llvm.org
Fri Jun 13 11:48:39 PDT 2025
================
@@ -64,8 +64,10 @@ define i8 @test_vector_reduce_or_v2i8(<2 x i8> %v) {
; GFX10-SDAG-LABEL: test_vector_reduce_or_v2i8:
; GFX10-SDAG: ; %bb.0: ; %entry
; GFX10-SDAG-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
+; GFX10-SDAG-NEXT: v_lshrrev_b16 v2, 8, v1
; GFX10-SDAG-NEXT: v_or_b32_e32 v0, v0, v1
-; GFX10-SDAG-NEXT: v_or_b32_sdwa v1, v1, v1 dst_sel:BYTE_1 dst_unused:UNUSED_PAD src0_sel:DWORD src1_sel:BYTE_1
+; GFX10-SDAG-NEXT: v_or_b32_e32 v2, v1, v2
+; GFX10-SDAG-NEXT: v_lshlrev_b16 v1, 8, v2
; GFX10-SDAG-NEXT: v_or_b32_sdwa v0, v0, v1 dst_sel:DWORD dst_unused:UNUSED_PAD src0_sel:BYTE_0 src1_sel:DWORD
----------------
Sisyph wrote:
Example of using extra instructions and registers.
https://github.com/llvm/llvm-project/pull/143982
More information about the llvm-commits
mailing list