[llvm] [Doc][AMDGPU] Add barrier execution & memory model (PR #170447)

Pierre van Houtryve via llvm-commits llvm-commits at lists.llvm.org
Fri Dec 5 02:05:14 PST 2025


================
@@ -6797,6 +7110,10 @@ only accessed by a single thread, and is always write-before-read, there is
 never a need to invalidate these entries from the L1 cache. Hence all cache
 invalidates are done as ``*_vol`` to only invalidate the volatile cache lines.
 
+A wave waiting on a ``s_barrier`` is unable to handle traps or exceptions,
+thus a ``s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)`` is required before entering
+the barrier so that no memory exception can occur during the barrier.
----------------
Pierre-vh wrote:

I think targets <= GFX9 (excluding gfx90a+) do not wait implicitly

https://github.com/llvm/llvm-project/pull/170447


More information about the llvm-commits mailing list