[llvm] [AMDGPU] Lazily emit waitcnts on function entry (PR #73122)

Matt Arsenault via llvm-commits llvm-commits at lists.llvm.org
Tue Nov 28 06:00:15 PST 2023


================
@@ -223,9 +223,9 @@ define amdgpu_kernel void @kernel_caller_byval() {
 define void @func_caller_stack() {
 ; MUBUF-LABEL: func_caller_stack:
 ; MUBUF:       ; %bb.0:
-; MUBUF-NEXT:    s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
 ; MUBUF-NEXT:    s_mov_b32 s4, s33
 ; MUBUF-NEXT:    s_mov_b32 s33, s32
----------------
arsenm wrote:

This assumes that SP/FP are never written by a memory instruction. That is probably a safe assumption, but we should document it.
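
To make the assumption concrete, here is a rough toy model of the check. It is not the actual waitcnt insertion code: `ToyInst`, `AssumedNoPendingWrite`, and `needsEntryWait` are hypothetical names, and the model only looks at registers an instruction reads, ignoring everything else a real pass would track. It treats s32 (SP) and s33 (FP) as never having an outstanding memory write on function entry, so an instruction that only reads them does not force the deferred entry wait.

```cpp
#include <set>
#include <string>
#include <vector>

// Hypothetical toy instruction: an opcode plus the registers it reads.
struct ToyInst {
  std::string Opcode;
  std::vector<std::string> Reads;
};

// Assumption under discussion: SP (s32) and FP (s33) are never the
// destination of an in-flight memory instruction at function entry.
static const std::set<std::string> AssumedNoPendingWrite = {"s32", "s33"};

// Returns true if, in this toy model, the deferred entry wait would have
// to be emitted before I. Any register outside the assumed-safe set is
// conservatively treated as possibly having a pending write.
bool needsEntryWait(const ToyInst &I) {
  for (const std::string &R : I.Reads)
    if (AssumedNoPendingWrite.count(R) == 0)
      return true;
  return false;
}
```

In this model, `s_mov_b32 s4, s33` reads only s33 and so needs no wait, whereas an instruction reading an arbitrary VGPR would. If SP/FP could ever be loaded by a memory instruction, the set above would be wrong, which is why the assumption is worth writing down.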

https://github.com/llvm/llvm-project/pull/73122

