[clang] [llvm] [AMDGPU] Emit a waitcnt instruction after each memory instruction (PR #79236)

Pierre van Houtryve via cfe-commits cfe-commits at lists.llvm.org
Sun Jan 28 22:41:11 PST 2024


================
@@ -641,6 +644,9 @@ class SIMemoryLegalizer final : public MachineFunctionPass {
   bool expandAtomicCmpxchgOrRmw(const SIMemOpInfo &MOI,
                                 MachineBasicBlock::iterator &MI);
 
+  bool GFX9InsertWaitcntForPreciseMem(MachineFunction &MF);
----------------
Pierre-vh wrote:

Agreed, this should definitely be a virtual function such as `insertWaitcntForPreciseMem` and let the CacheControl implementation do what is needed. This is just emulating what `CacheControl` already does

https://github.com/llvm/llvm-project/pull/79236


More information about the cfe-commits mailing list