[llvm] [AMDGPU][SIInsertWaitcnts] Do not add s_waitcnt when the counters are known to be 0 already (PR #72830)

Jay Foad via llvm-commits llvm-commits at lists.llvm.org
Wed Dec 13 08:15:07 PST 2023


================
@@ -292,6 +292,11 @@ class WaitcntBrackets {
     VgprVmemTypes[GprNo] = 0;
   }
 
+  void setNonKernelFunctionInitialState() {
+    setScoreUB(VS_CNT, getWaitCountMax(VS_CNT));
+    PendingEvents |= WaitEventMaskForInst[VS_CNT];
+  }
----------------
jayfoad wrote:

Could you please split out a prerequisite patch that just adds this functionality? I think there are only a couple of tests that it will affect, to do with barriers or release vgprs.

https://github.com/llvm/llvm-project/pull/72830


More information about the llvm-commits mailing list