[PATCH] D116053: [MachineSink] Allow sinking of constant or ignorable physreg uses
Vang Thao via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Dec 21 12:52:34 PST 2021
vangthao added a comment.
> IR is essentially a single-thread representation. The implicit exec use is our way to model multithreaded divergence. Consider this transformation, which would now become legal:
>
>   int lid = get_local_id(0);        int lid = get_local_id(0);
>   int i = 0;                        int i = 0;
>   x = def();                        do {
>   do {                         =>     x = def();
>     use1(x);                          use1(x);
>   } while(i++ < lid);               } while(i++ < lid);
>   use2(x);                          use2(x);
>
> def dominates use2 in both cases, but in the second case not with every lane. All lanes except the first will use an undef.
We will not sink into a loop if the def is outside of the loop. In the test case `loop_sink_fmac`, the def was already inside the loop, which is why it could be sunk.
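
To make the rule concrete, here is a rough, self-contained sketch of the kind of check being described. It is a toy model only, not the actual MachineSink code: `ToyLoop`, `ToyBlock`, `loopContains` and `sinkWouldEnterLoop` are hypothetical names made up for illustration, and the real pass works with LLVM's own loop analysis rather than these structs.

```cpp
// Toy model of loop nesting; NOT LLVM's data structures.
struct ToyLoop {
  const ToyLoop *Parent = nullptr; // enclosing loop, or nullptr if outermost
};

struct ToyBlock {
  const ToyLoop *Loop = nullptr; // innermost loop holding the block, or nullptr
};

// Returns true if Outer is Inner itself or one of Inner's enclosing loops.
bool loopContains(const ToyLoop *Outer, const ToyLoop *Inner) {
  for (const ToyLoop *L = Inner; L; L = L->Parent)
    if (L == Outer)
      return true;
  return false;
}

// Returns true if moving an instruction from block From to block To would
// carry it into a loop that From is not already part of. Under the rule
// stated above, such a sink would be rejected.
bool sinkWouldEnterLoop(const ToyBlock &From, const ToyBlock &To) {
  if (!To.Loop)
    return false; // destination is not in any loop
  // From lies inside To.Loop only if To.Loop encloses From's innermost loop.
  return !(From.Loop && loopContains(To.Loop, From.Loop));
}
```

In this model, the quoted example corresponds to a def in a block with no loop and a destination inside the loop, so `sinkWouldEnterLoop` returns true. A case like `loop_sink_fmac` corresponds to both blocks being in the same loop, where it returns false.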
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D116053/new/
https://reviews.llvm.org/D116053