[PATCH] D120544: [AMDGPU] Omit unnecessary waitcnt before barriers

Austin Kerbow via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Jun 30 21:56:32 PDT 2022


kerbowa added a comment.
Herald added a subscriber: kosarev.

JFYI, I'm planning to reintroduce this change in the next few weeks. Before this change, the compiler would ALWAYS wait for outstanding VMEM/LGKM operations at barriers, even though there is no HW requirement for this. As an optimization, we will omit these waitcnts on Navi/MI200. To retain the same behavior as before the change, fences must be added before barriers.
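
For code that relied on the implicit wait, a rough sketch of "fence before barrier" could look like the following. This is an illustration only, not part of the patch: the helper name `workgroup_barrier_with_fence` is hypothetical, the workgroup scope and release ordering are assumptions about what a given kernel needs, and the host fallback exists only so the snippet compiles outside a device build.

```cpp
#include <atomic>

// Hypothetical helper, not from D120544: issue an explicit fence before the
// barrier instead of relying on the compiler to insert a waitcnt there.
#if defined(__AMDGCN__)
// Device compilation with clang targeting AMDGPU.
inline void workgroup_barrier_with_fence() {
  // Assumed ordering/scope: release at workgroup scope, so earlier memory
  // operations are ordered before the barrier. The backend emits whatever
  // waits the memory model requires for this fence on the target.
  __builtin_amdgcn_fence(__ATOMIC_RELEASE, "workgroup");
  // The barrier itself; after this change it no longer implies a wait.
  __builtin_amdgcn_s_barrier();
}
#else
// Host fallback so the sketch builds anywhere: a plain release fence and
// no barrier, since there is no workgroup on the host.
inline void workgroup_barrier_with_fence() {
  std::atomic_thread_fence(std::memory_order_release);
}
#endif
```

A kernel that previously called the barrier directly after writing to memory would call this helper instead. Whether a matching acquire fence after the barrier is also needed depends on the kernel.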


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D120544/new/

https://reviews.llvm.org/D120544


