[llvm] [AMDGPU] Lazily emit waitcnts on function entry (PR #73122)

Diana Picus via llvm-commits llvm-commits at lists.llvm.org
Wed Dec 6 04:18:18 PST 2023


================
@@ -8,24 +8,24 @@
 define void @br_false() {
 ; WAVE64-LABEL: br_false:
 ; WAVE64:       ; %bb.0: ; %.exit
-; WAVE64-NEXT:    s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
 ; WAVE64-NEXT:  .LBB0_1: ; %bb0
 ; WAVE64-NEXT:    ; =>This Inner Loop Header: Depth=1
 ; WAVE64-NEXT:    s_mov_b32 s4, 1
 ; WAVE64-NEXT:    s_cmp_lg_u32 s4, 0
 ; WAVE64-NEXT:    s_cbranch_scc1 .LBB0_1
 ; WAVE64-NEXT:  ; %bb.2: ; %.exit5
+; WAVE64-NEXT:    s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
----------------
rovka wrote:

Why exactly do we need to wait here? We only wrote some SGPRs that we're not returning

https://github.com/llvm/llvm-project/pull/73122


More information about the llvm-commits mailing list