[llvm] AMDGPU/GFX12: Insert waitcnts before stores with scope_sys (PR #82996)
Jay Foad via llvm-commits
llvm-commits at lists.llvm.org
Mon Feb 26 05:08:33 PST 2024
================
@@ -2467,6 +2501,10 @@ bool SIMemoryLegalizer::expandStore(const SIMemOpInfo &MOI,
Changed |= CC->enableVolatileAndOrNonTemporal(
MI, MOI.getInstrAddrSpace(), SIMemOp::STORE, MOI.isVolatile(),
MOI.isNonTemporal());
+
+ // GFX12 specific, scope(desired coherence domain in cache hierarchy) is
+ // instruction field, do not confuse it with atomic scope.
+ Changed |= CC->expandSystemScopeStore(MI);
----------------
jayfoad wrote:
Is this also needed for atomic stores? They returned early on line 2495 so they won't hit this code.
https://github.com/llvm/llvm-project/pull/82996
More information about the llvm-commits
mailing list