[PATCH] D115747: [AMDGPU] Hoist waitcnt out of loops when they unecessarily wait for stores
Jay Foad via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Feb 17 06:40:40 PST 2022
foad added a comment.
> Refactoring of the pass. We compute the brackets for both the flushed and non-flushed versions of each outer loop
I am nervous about the compile time impact of this, and about adding the extra code to implement this. (The pass is pretty complicated already.) Is the refactoring really required for the GFX9 improvement, or only for the planned GFX10 improvement?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D115747/new/
https://reviews.llvm.org/D115747
More information about the llvm-commits
mailing list