[llvm] [Doc][AMDGPU] Add barrier execution & memory model (PR #170447)
Pierre van Houtryve via llvm-commits
llvm-commits at lists.llvm.org
Fri Dec 5 02:05:14 PST 2025
================
@@ -6797,6 +7110,10 @@ only accessed by a single thread, and is always write-before-read, there is
never a need to invalidate these entries from the L1 cache. Hence all cache
invalidates are done as ``*_vol`` to only invalidate the volatile cache lines.
+A wave waiting on a ``s_barrier`` is unable to handle traps or exceptions,
+thus a ``s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)`` is required before entering
+the barrier so that no memory exception can occur during the barrier.
----------------
Pierre-vh wrote:
I think targets <= GFX9 (excluding gfx90a+) do not wait implicitly
https://github.com/llvm/llvm-project/pull/170447
More information about the llvm-commits
mailing list