[PATCH] D101380: [AMDGPU] Move insertion of function entry waitcnt later
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Apr 27 10:22:21 PDT 2021
arsenm added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp:1601
+
+ if (!MFI->isEntryFunction()) {
+ // Wait for any outstanding memory operations that the input registers may
----------------
I thought we made this explicitly a property of the calling convention, so we could have this not be inserted for future conventions. I guess if this isn't checking the function for that already, it's a preexisting problem
================
Comment at: llvm/test/CodeGen/AMDGPU/waitcnt-preexisting.mir:188-190
+ ; GFX9: S_WAITCNT 0
+ ; GFX9-NOT: S_WAITCNT 0
+ ; GFX9: S_ENDPGM 0
----------------
-NEXTs would be a bit more dependable
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D101380/new/
https://reviews.llvm.org/D101380
More information about the llvm-commits
mailing list