[PATCH] D64508: AMDGPU: Force s_waitcnt after GWS instructions

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Jul 10 15:42:25 PDT 2019


arsenm added a comment.

In D64508#1579479 <https://reviews.llvm.org/D64508#1579479>, @rampitec wrote:

> Is there any documentation describing it? Also this may be a better job for wait count insertion pass or hazard recognizer, depending on the nature of the issue.


The documentation for these is really not great. It needs to be the literal next instruction, since it will jump back 8 bytes and retry in some situations. I'm not sure why it needs to be a full wait. I didn't want to rely on some other pass later to insert it, since it must be there. Having the waitcnt already exist also avoids teaching the waitcnt pass about this special case, and instead the general pre-existing waitcnt handling will take care of it.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D64508/new/

https://reviews.llvm.org/D64508





More information about the llvm-commits mailing list