[clang] [llvm] [AMDGPU] Introduce asyncmark/wait intrinsics (PR #173259)
Pierre van Houtryve via cfe-commits
cfe-commits at lists.llvm.org
Mon Jan 5 02:46:31 PST 2026
================
@@ -0,0 +1,179 @@
+===============================
+ AMDGPU Asynchronous Operations
+===============================
+
+.. contents::
+ :local:
+
+Introduction
+============
+
+Asynchronous operations are memory transfers (usually between the global memory
+and LDS) that are completed independently at an unspecified scope. A thread that
+requests one or more asynchronous transfers can use *async markers* to track
+their completion. The thread waits for each marker to be *completed*, which
+indicates that requests initiated in program order before this marker have also
+completed.
+
+Operations
+==========
+
+``async_load_to_lds``
+---------------------
+
----------------
Pierre-vh wrote:
```suggestion
Legacy LDS DMA (GFX9-10)
---------------------------------------
```
I wonder if this should be separated in two sections ? e.g.
* Synchronization Operations
* asyncmark
* wait.asyncmark(N)
* Memory Operations
* Legacy LDS DMA (GFX9-10)
https://github.com/llvm/llvm-project/pull/173259
More information about the cfe-commits
mailing list