[clang] [llvm] [AMDGPU] Emit a waitcnt instruction after each memory instruction (PR #79236)

Tony Tye via cfe-commits cfe-commits at lists.llvm.org
Tue Feb 6 13:36:50 PST 2024


================
@@ -605,12 +606,197 @@ class SIGfx12CacheControl : public SIGfx11CacheControl {
                                       bool IsNonTemporal) const override;
 };
 
+class SIPreciseMemorySupport {
----------------
t-tye wrote:

My initial thought had been that this would be part of the existing cache control functions. It seems it is the same kind of waitcnt as needs to be inserted after a store release. That also requires the right waitcnt to be generated according to the kind of memory instruction.

https://github.com/llvm/llvm-project/pull/79236


More information about the cfe-commits mailing list