[llvm-branch-commits] [clang] [llvm] [AMDGPU] Introduce asyncmark/wait intrinsics (PR #180467)
Krzysztof Drewniak via llvm-branch-commits
llvm-branch-commits at lists.llvm.org
Mon Feb 9 19:06:15 PST 2026
================
@@ -75,17 +91,23 @@ define float @struct.ptr.buffer.load(ptr addrspace(8) inreg %rsrc, ptr addrspace
; CHECK-NEXT: s_mov_b32 m0, s20
; CHECK-NEXT: v_mov_b32_e32 v0, 8
; CHECK-NEXT: buffer_load_dword v0, s[16:19], 0 idxen lds
+; CHECK-NEXT: ; asyncmark
; CHECK-NEXT: buffer_load_dword v0, s[16:19], 0 idxen offset:4 glc lds
+; CHECK-NEXT: ; asyncmark
; CHECK-NEXT: buffer_load_dword v0, s[16:19], 0 idxen offset:8 slc lds
; CHECK-NEXT: v_mov_b32_e32 v0, s20
-; CHECK-NEXT: s_waitcnt vmcnt(0)
+; CHECK-NEXT: ; wait_asyncmark(1)
+; CHECK-NEXT: s_waitcnt vmcnt(2)
; CHECK-NEXT: ds_read_b32 v0, v0
-; CHECK-NEXT: s_waitcnt lgkmcnt(0)
+; CHECK-NEXT: s_waitcnt vmcnt(0) lgkmcnt(0)
----------------
krzysz00 wrote:
Just to check, this is the implicit vmem wait on function exist reasserting itself?
https://github.com/llvm/llvm-project/pull/180467
More information about the llvm-branch-commits
mailing list