[PATCH] D113291: [AggressiveInstCombine] Lower Table Based CTTZ and enable it for AARCH64 in -O3
Dave Green via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu May 26 06:23:06 PDT 2022
dmgreen added a comment.
Hello - We were having a discussion about a very similar patch in D125755 <https://reviews.llvm.org/D125755>. I think the outcome for this patch is that either:
- We need to do this later (maybe in CodeGenPrepare).
- We need to do this unconditionally without the call to TTI.preferCTTZLowering() and have the reverse transform later for targets that do not have a cheaper alternative.
- We need to argue some more :)
There are more details about why in D125755 <https://reviews.llvm.org/D125755>. I would go for the first option if it doesn't lead to worse performance, as for the second I'm not sure when it would be profitable to transform back and emit the table. You may not want to do that for non-hot ctzs? It sounds like it may be difficult to get right, but maybe I'm overestimating it.
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D113291/new/
https://reviews.llvm.org/D113291
More information about the llvm-commits
mailing list