[clang] [llvm] [AMDGPU] Emit a waitcnt instruction after each memory instruction (PR #79236)
Pierre van Houtryve via cfe-commits (cfe-commits at lists.llvm.org)
Sun Jan 28 22:41:11 PST 2024

================
@@ -641,6 +644,9 @@ class SIMemoryLegalizer final : public MachineFunctionPass {
   bool expandAtomicCmpxchgOrRmw(const SIMemOpInfo &MOI,
                                 MachineBasicBlock::iterator &MI);
 
+  bool GFX9InsertWaitcntForPreciseMem(MachineFunction &MF);
----------------
Pierre-vh wrote:
Agreed, this should definitely be a virtual function, such as `insertWaitcntForPreciseMem`, so that each `CacheControl` implementation can do what is needed for its target. As written, this just emulates what `CacheControl` already does.
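
To illustrate the shape of that suggestion, here is a minimal, self-contained sketch. It does not use the real LLVM classes: `Instr`, `CacheControlSketch`, `GFX9Sketch` and `runPreciseMem` are hypothetical stand-ins, and the opcode strings are only labels. The point is that the pass calls one virtual hook and the per-generation subclass decides what to insert, instead of the pass carrying a `GFX9`-prefixed helper.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for a machine instruction (not llvm::MachineInstr).
struct Instr {
  std::string Opcode;
  bool IsMemOp;
};

// Simplified stand-in for the per-generation cache control hierarchy.
class CacheControlSketch {
public:
  virtual ~CacheControlSketch() = default;

  // Insert whatever wait instruction(s) this target needs right after the
  // memory instruction at index Pos. Returns true if the block was changed.
  virtual bool insertWaitcntForPreciseMem(std::vector<Instr> &Block,
                                          std::size_t Pos) const = 0;
};

// Assumed GFX9-like behaviour for illustration: a single wait after the op.
class GFX9Sketch : public CacheControlSketch {
public:
  bool insertWaitcntForPreciseMem(std::vector<Instr> &Block,
                                  std::size_t Pos) const override {
    assert(Pos < Block.size() && "position must refer to an instruction");
    auto It = Block.begin() + static_cast<std::ptrdiff_t>(Pos) + 1;
    Block.insert(It, Instr{"s_waitcnt", false});
    return true;
  }
};

// The pass-level driver only knows about the virtual interface.
bool runPreciseMem(const CacheControlSketch &CC, std::vector<Instr> &Block) {
  bool Changed = false;
  for (std::size_t I = 0; I < Block.size(); ++I) {
    // Inserted waits are not memory ops, so they are skipped on later
    // iterations and no instruction gets a second wait.
    if (Block[I].IsMemOp)
      Changed |= CC.insertWaitcntForPreciseMem(Block, I);
  }
  return Changed;
}
```

With this layout, supporting another generation would mean adding another subclass that overrides the hook, leaving the driver untouched.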
https://github.com/llvm/llvm-project/pull/79236
    
    