[llvm-branch-commits] [llvm] [AMDGPU][InsertWaitCnts] Track global_wb/inv/wbinv (PR #135340)

Jay Foad via llvm-branch-commits llvm-branch-commits at lists.llvm.org
Tue Apr 22 05:47:42 PDT 2025


================
@@ -19,7 +19,7 @@ body: |
     ; GFX12-NEXT: {{  $}}
     ; GFX12-NEXT: renamable $vgpr0 = GLOBAL_LOAD_DWORD_SADDR renamable $sgpr2_sgpr3, killed $vgpr0, 0, 0, implicit $exec :: (load (s32), addrspace 1)
     ; GFX12-NEXT: GLOBAL_INV 16, implicit $exec
-    ; GFX12-NEXT: S_WAIT_LOADCNT 0
+    ; GFX12-NEXT: S_WAIT_LOADCNT 1
----------------
jayfoad wrote:

> the data return & counter decrement can be done out of order.

No, generally the counter decrement *is* in order, otherwise it would never be useful to wait with a non-zero value. There are some specific cases where the decrement is not in order, which are handled by `counterOutOfOrder`.

https://github.com/llvm/llvm-project/pull/135340


More information about the llvm-branch-commits mailing list