[PATCH] D87674: [AMDGPU] Insert waitcnt after returning from call

Madhur Amilkanthwar via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Sep 23 09:53:45 PDT 2020


madhur13490 added a comment.

In D87674#2290419 <https://reviews.llvm.org/D87674#2290419>, @arsenm wrote:

> In D87674#2290418 <https://reviews.llvm.org/D87674#2290418>, @madhur13490 wrote:
>
>> In D87674#2290396 <https://reviews.llvm.org/D87674#2290396>, @mareko wrote:
>>
>>> Yes, this commit is incorrect. It completely breaks code linking in Mesa OpenGL. s_waitcnt is required at the end of all global functions that return values.
>>>
>>> Please revert. @nhaehnle
>>
>> I don't understand why would it fail. This patch just moves s_waitcnt to the caller so they would be executed anyway. I think I am missing something. It would be helpful to root cause if we can isolate to a small test case.
>
> Shader returns aren't real returns and the "caller" doesn't wait

I see. So how should this be implemented? May be we conditionalize this patch just for compute?


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D87674/new/

https://reviews.llvm.org/D87674



More information about the llvm-commits mailing list