[PATCH] D117502: [LV] Combine vector reductions parts in tree instead of serially.
Florian Hahn via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Jan 17 11:40:29 PST 2022
fhahn added a comment.
In D117502#3249197 <https://reviews.llvm.org/D117502#3249197>, @dmgreen wrote:
> Does this alter much? Or do we end up redistributing them anyway? https://godbolt.org/z/z4nf5hPna
It won't have a massive impact in general, but it shaves off a few cycles, depending on the interleave count.
AFAICT the redistributions done in the https://godbolt.org/z/z4nf5hPna are done by `ReassoicatePass`, which likes to turn parallel reduction trees into serial ones (? but that's a separate issue I think), like for `@float2`, which looks like it got serialized. I don't think any passes that run after the vectorizer try to improve the length of reduction chains: https://godbolt.org/z/v4K4aK3a1
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D117502/new/
https://reviews.llvm.org/D117502
More information about the llvm-commits
mailing list