[clang] [llvm] [AMDGPU] Emit a waitcnt instruction after each memory instruction (PR #79236)

Matt Arsenault via cfe-commits cfe-commits at lists.llvm.org
Mon Jan 29 00:57:59 PST 2024


================
@@ -2561,6 +2567,70 @@ bool SIMemoryLegalizer::expandAtomicCmpxchgOrRmw(const SIMemOpInfo &MOI,
   return Changed;
 }
 
+bool SIMemoryLegalizer::GFX9InsertWaitcntForPreciseMem(MachineFunction &MF) {
----------------
arsenm wrote:

can you just make this happen as a consequence of the existing flow rather than using an entirely separate pass over the function?

https://github.com/llvm/llvm-project/pull/79236


More information about the cfe-commits mailing list