[llvm] [AMDGPU][SIInsertWaitcnts] Wait on all LDS DMA operations when no aliasing store is found (PR #170660)
Pierre van Houtryve via llvm-commits
llvm-commits at lists.llvm.org
Thu Dec 4 07:46:35 PST 2025
================
@@ -223,6 +223,7 @@ define amdgpu_kernel void @buffer_load_lds_dword_10_arrays(<4 x i32> %rsrc, i32
; GFX9-NEXT: s_waitcnt vmcnt(2)
; GFX9-NEXT: ds_read_b32 v7, v9 offset:1792
; GFX9-NEXT: ; wave barrier
+; GFX9-NEXT: s_waitcnt vmcnt(0)
----------------
Pierre-vh wrote:
Yes, that's precisely what I was trying to fix. The wait is required here because we ran out of tracking slots.
#162077 increases the number of tracking slots, so this will improve.
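
To illustrate why running out of tracking slots forces a full wait, here is a minimal sketch of slot-limited tracking. It is not the SIInsertWaitcnts implementation: the class `LdsDmaTracker`, its methods, the integer array IDs, and the slot count passed to the constructor are all hypothetical, and the model assumes LDS DMA operations complete in issue order, so a single counter value is enough to describe a wait.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_map>

// Hypothetical model of slot-limited LDS DMA tracking. All names and the
// slot limit are illustrative assumptions, not LLVM's actual data structures.
class LdsDmaTracker {
public:
  explicit LdsDmaTracker(std::size_t NumSlots) : NumSlots(NumSlots) {
    assert(NumSlots > 0 && "need at least one tracking slot");
  }

  // Record an LDS DMA write into the array identified by ArrayId.
  void issueDma(int ArrayId) {
    ++Issued;
    auto It = Slots.find(ArrayId);
    if (It != Slots.end())
      It->second = Issued;            // refresh the existing slot
    else if (Slots.size() < NumSlots)
      Slots.emplace(ArrayId, Issued); // claim a free slot
    else
      HasUntracked = true;            // out of slots: this write is untracked
  }

  // Largest number of DMA operations that may still be outstanding when
  // ArrayId is read (the operand of a vmcnt-style wait in this model).
  unsigned waitBeforeRead(int ArrayId) const {
    auto It = Slots.find(ArrayId);
    if (It != Slots.end())
      return Issued - It->second; // only wait for this array's last write
    // No slot for this array. If any write went untracked, it may have
    // targeted this array, so conservatively wait for everything.
    return HasUntracked ? 0u : Issued;
  }

private:
  std::size_t NumSlots;
  unsigned Issued = 0;                   // DMA operations issued so far
  bool HasUntracked = false;             // did any write miss a slot?
  std::unordered_map<int, unsigned> Slots; // array id -> issue index
};
```

In this model, with 8 slots and ten writes to ten distinct arrays, a read of a tracked array only has to wait for that array's own write, while a read of either of the last two arrays has to wait for all outstanding operations. Increasing the number of slots lets more reads take the cheaper path.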
https://github.com/llvm/llvm-project/pull/170660