[PATCH] D115747: [AMDGPU] Hoist waitcnt out of loops when they unnecessarily wait for stores

Jay Foad via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Dec 20 08:41:51 PST 2021


foad added a comment.

Can't we always set the state at the start of the loop to the merged state from the loop backedges *only*, and then have a final pass that inserts extra waits in preheaders to reconcile the state outside the loop with the state that was assumed at the start of the loop? (Sorry, I know that is a bit simplistic and hand-wavy.)
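
To make the suggestion a little more concrete, here is a rough, self-contained sketch of the idea. It is a toy model and not the pass's actual data structures: `WaitState`, `NumCounters`, `mergeStates`, `assumedHeaderState` and `reconcilePreheader` are all hypothetical names, and the state is reduced to one upper bound on outstanding operations per counter, ignoring which registers those operations touch.

```cpp
#include <algorithm>
#include <array>
#include <cassert>
#include <cstddef>
#include <optional>
#include <vector>

// Toy model (assumption): per counter, an upper bound on the number of
// operations that may still be outstanding at a program point.
constexpr std::size_t NumCounters = 3; // hypothetical number of counters
using WaitState = std::array<unsigned, NumCounters>;

// Conservative merge at a join: keep the larger bound for each counter.
WaitState mergeStates(const WaitState &A, const WaitState &B) {
  WaitState R{};
  for (std::size_t I = 0; I < NumCounters; ++I)
    R[I] = std::max(A[I], B[I]);
  return R;
}

// State assumed at the loop header: merged from the backedges only,
// deliberately ignoring whatever flows in from the preheader.
WaitState assumedHeaderState(const std::vector<WaitState> &BackedgeStates) {
  WaitState R{};
  for (const WaitState &S : BackedgeStates)
    R = mergeStates(R, S);
  return R;
}

// Final pass: if the preheader may leave more operations outstanding than
// the header assumed, report the count to wait down to in the preheader.
// An empty optional means no extra wait is needed for that counter.
std::array<std::optional<unsigned>, NumCounters>
reconcilePreheader(const WaitState &PreheaderOut, const WaitState &Assumed) {
  std::array<std::optional<unsigned>, NumCounters> Waits{};
  for (std::size_t I = 0; I < NumCounters; ++I)
    if (PreheaderOut[I] > Assumed[I])
      Waits[I] = Assumed[I];
  return Waits;
}
```

In this model the loop body is processed against `assumedHeaderState(...)`, so any wait that is only needed because of the state on entry ends up in the preheader rather than inside the loop.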


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D115747/new/

https://reviews.llvm.org/D115747


