[PATCH] D120544: [AMDGPU] Omit unnecessary waitcnt before barriers

Jay Foad via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Feb 25 01:29:05 PST 2022


foad added a comment.

In D120544#3344977 <https://reviews.llvm.org/D120544#3344977>, @kerbowa wrote:

> The test waitcnt-vscnt.ll is kind of muddied by this patch. It was relying on the behavior of s_barrier always adding s_waitcnt_vscnt at barriers if there were any outstanding events. The singlethread fences were not actually doing anything. I've added workgroup fences instead. With workgroup fences we currently always add the waitcnt. The test will look as it did before after my memory legalizer/waitcnt overhaul so that these are optimized. Consider this a precommit of this test for now.

Can you split this into two patches? First commit the test change on its own, then rebase this patch on top of it.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D120544/new/

https://reviews.llvm.org/D120544


