[PATCH] D117544: [AMDGPU] Fix missing waitcnt issue

Jay Foad via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Jan 18 01:15:19 PST 2022


foad added a comment.

As a bit of background, the `OldOutOfOrder` test was introduced by @nhaehnle in a large refactoring in D54231 <https://reviews.llvm.org/D54231>. I think it's just a performance optimisation: if the old state had events that completed out of order, then any use of the corresponding registers would have to be preceded by a "waitcnt 0", so there's no need to reprocess that block because the waitcnts are already as strict as they can be. But this goes wrong in the case where the merge introduces a wait on a particular register that had no wait in the old state (so no waitcnt would have been generated for it the last time the block was processed).
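
To make the failure mode concrete, here is a minimal toy sketch of it. This is not the code in SIInsertWaitcnts: `ToyState`, `toyMerge`, `UseShortcut` and the four-register layout are all hypothetical names invented for illustration, and the real pass tracks score brackets rather than booleans. The sketch only assumes that a merge reports whether the block needs reprocessing, and that the shortcut suppresses that report when the old state already had out-of-order events.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Hypothetical, heavily simplified stand-in for a block's incoming wait state.
constexpr std::size_t NumRegs = 4;

struct ToyState {
  std::array<bool, NumRegs> Pending{}; // Pending[R]: register R has an outstanding event
  bool OutOfOrder = false;             // outstanding events may complete out of order
};

// Merge Other into Old and return true if the block must be reprocessed.
// With UseShortcut set, a state that was already out of order never asks for
// reprocessing, which is the assumption described in the comment above.
bool toyMerge(ToyState &Old, const ToyState &Other, bool UseShortcut) {
  const bool OldOutOfOrder = Old.OutOfOrder;
  bool NeedsReprocessing = false;
  for (std::size_t R = 0; R < NumRegs; ++R) {
    // A register that had no pending event in the old state got no waitcnt
    // the last time the block was processed.
    const bool NewWait = Other.Pending[R] && !Old.Pending[R];
    Old.Pending[R] = Old.Pending[R] || Other.Pending[R];
    if (NewWait && !(UseShortcut && OldOutOfOrder))
      NeedsReprocessing = true;
  }
  Old.OutOfOrder = Old.OutOfOrder || Other.OutOfOrder;
  return NeedsReprocessing;
}

// Old state: out-of-order events on register 0 only.
// Incoming predecessor state: a pending event on register 1.
void checkToyExample() {
  ToyState Old;
  Old.Pending[0] = true;
  Old.OutOfOrder = true;

  ToyState Pred;
  Pred.Pending[1] = true;

  ToyState WithShortcut = Old;
  // The shortcut skips reprocessing even though register 1 is newly pending.
  assert(!toyMerge(WithShortcut, Pred, /*UseShortcut=*/true));
  assert(WithShortcut.Pending[1]);

  ToyState WithoutShortcut = Old;
  // Without the shortcut the new wait on register 1 forces reprocessing.
  assert(toyMerge(WithoutShortcut, Pred, /*UseShortcut=*/false));
}
```

In this toy model the old state's "waitcnt 0" only ever covered uses of register 0, so skipping the reprocess leaves uses of register 1 without any wait.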


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D117544/new/

https://reviews.llvm.org/D117544


