[llvm] AMDGPU/GlobalISel: Partially move constant selection to patterns (PR #100786)
Joe Nash via llvm-commits
llvm-commits at lists.llvm.org
Mon Jul 29 10:57:50 PDT 2024
================
@@ -269,13 +276,14 @@ define double @v_rsq_clamp_f64_non_ieee(double %src) #2 {
; GFX12-NEXT: s_wait_bvhcnt 0x0
; GFX12-NEXT: s_wait_kmcnt 0x0
; GFX12-NEXT: v_rsq_f64_e32 v[0:1], v[0:1]
-; GFX12-NEXT: s_mov_b32 s0, -1
-; GFX12-NEXT: s_mov_b32 s1, 0x7fefffff
-; GFX12-NEXT: s_delay_alu instid0(TRANS32_DEP_1) | instid1(SALU_CYCLE_1)
-; GFX12-NEXT: v_min_num_f64_e32 v[0:1], s[0:1], v[0:1]
-; GFX12-NEXT: s_mov_b32 s1, 0xffefffff
-; GFX12-NEXT: s_delay_alu instid0(VALU_DEP_1) | instid1(SALU_CYCLE_1)
-; GFX12-NEXT: v_max_num_f64_e32 v[0:1], s[0:1], v[0:1]
+; GFX12-NEXT: v_mov_b32_e32 v2, -1
+; GFX12-NEXT: v_mov_b32_e32 v3, 0x7fefffff
+; GFX12-NEXT: s_delay_alu instid0(TRANS32_DEP_1) | instid1(VALU_DEP_1)
+; GFX12-NEXT: v_min_num_f64_e32 v[0:1], v[0:1], v[2:3]
----------------
Sisyph wrote:
Same question, is this regressing the 64 bit case by using vgprs instead of sgprs? I don't see any fix for that in later patches in the stack.
https://github.com/llvm/llvm-project/pull/100786
More information about the llvm-commits
mailing list