[PATCH] D145524: [AMDGPU] Skip buffer_wbl2 before atomic fence acquire

Tony Tye via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Mar 7 16:23:10 PST 2023


t-tye accepted this revision.
t-tye added a comment.
This revision is now accepted and ready to land.

After offline discussion, the extra waitcnt is needed because this is a fence, and an acquire needs a waitcnt to ensure a proceeding load atomic that pairs with the fence has completed before invalidating the cache. The memory model on AMDGPUUsage does show the extra waitcnt for the fence. Previously, the waitcnt was being generated as part of the release which is not required if the fence is just an acquire.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D145524/new/

https://reviews.llvm.org/D145524



More information about the llvm-commits mailing list