[PATCH] D89618: [AMDGPU] Optimize waitcnt insertion for flat memory operations
Tony Tye via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Oct 20 00:59:27 PDT 2020
t-tye added a comment.
In D89618#2340966 <https://reviews.llvm.org/D89618#2340966>, @rampitec wrote:
> JFYI how much it will help actual programs after it is fixed is unclear. It will likely change a lot of lit tests, but actual effect on real programs would depend on FE and language rules. And inlining of course, as usual.
It did change 46 lit tests. I agree it is unclear how much it will help actual programs. However, it looks like the GLOBAL and SCRATCH flat operations may be able to avoid the pessimistic waitcnt 0.
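To illustrate why the GLOBAL and SCRATCH cases might be cheaper, here is a toy model of which hardware counters each kind of operation can leave outstanding. It is only a sketch and not the code in the patch or in SIInsertWaitcnts: `FlatKind`, `PendingCounters` and `countersFor` are hypothetical names made up for this example. It assumes that a true FLAT access may resolve to either LDS or vector memory, while GLOBAL and SCRATCH accesses only go through vector memory.

```cpp
#include <cassert>

// Toy model only; these names are hypothetical and do not exist in LLVM.
enum class FlatKind { Flat, Global, Scratch };

// Which counters an outstanding operation may be tracked by.
struct PendingCounters {
  bool VmCnt;   // vector memory counter
  bool LgkmCnt; // LDS/GDS/constant/message counter
};

PendingCounters countersFor(FlatKind Kind) {
  switch (Kind) {
  case FlatKind::Flat:
    // Generic address: may turn out to be LDS or vector memory,
    // so both counters have to be considered pending.
    return {true, true};
  case FlatKind::Global:
  case FlatKind::Scratch:
    // Assumed to touch vector memory only.
    return {true, false};
  }
  return {true, true}; // conservative fallback for unknown kinds
}
```

Under this model only the `FlatKind::Flat` case would force a conservative wait on both counters; the other two would only need a wait on vmcnt.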
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D89618/new/
https://reviews.llvm.org/D89618