[PATCH] D101380: [AMDGPU] Move insertion of function entry waitcnt later

Austin Kerbow via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Sun May 2 14:16:43 PDT 2021


kerbowa added inline comments.


================
Comment at: llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp:1601
+
+  if (!MFI->isEntryFunction()) {
+    // Wait for any outstanding memory operations that the input registers may
----------------
arsenm wrote:
> I thought we made this explicitly a property of the calling convention, so we could have this not be inserted for future conventions. I guess if this isn't checking the function for that already, it's a preexisting problem
When would that waitcnt be added if it's a property of the CC? It seems like these changes could help avoid it being added twice since they would be combined. 


================
Comment at: llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp:1606
+
+    // TODO: Could insert earlier and schedule more liberally with operations
+    // that only use caller preserved registers.
----------------
rampitec wrote:
> Is this comment still relevant?
It might be, if what it's saying is that we could move this waitcnt later until the function uses callee saved registers.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D101380/new/

https://reviews.llvm.org/D101380



More information about the llvm-commits mailing list