[llvm-branch-commits] [llvm] [AMDGPU][InsertWaitCnts] Track global_wb/inv/wbinv (PR #135340)
Jay Foad via llvm-branch-commits
llvm-branch-commits at lists.llvm.org
Tue Apr 22 05:47:42 PDT 2025
================
@@ -19,7 +19,7 @@ body: |
; GFX12-NEXT: {{ $}}
; GFX12-NEXT: renamable $vgpr0 = GLOBAL_LOAD_DWORD_SADDR renamable $sgpr2_sgpr3, killed $vgpr0, 0, 0, implicit $exec :: (load (s32), addrspace 1)
; GFX12-NEXT: GLOBAL_INV 16, implicit $exec
- ; GFX12-NEXT: S_WAIT_LOADCNT 0
+ ; GFX12-NEXT: S_WAIT_LOADCNT 1
----------------
jayfoad wrote:
> the data return & counter decrement can be done out of order.
No, generally the counter decrement *is* in order, otherwise it would never be useful to wait with a non-zero value. There are some specific cases where the decrement is not in order, which are handled by `counterOutOfOrder`.
https://github.com/llvm/llvm-project/pull/135340
More information about the llvm-branch-commits
mailing list