[PATCH] D111460: [X86][LoopVectorize] "Fix" `X86TTIImpl::getAddressComputationCost()`
Renato Golin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Nov 29 10:09:32 PST 2021
rengolin added a comment.
I can't predict if the new shuffle patterns will be better or worse on each affected platform, least of all looking at IR, not ASM.
But if those changes have positive changes in code generation (translating to better benchmark numbers), then this looks good to me.
================
Comment at: llvm/lib/Target/X86/X86TargetTransformInfo.cpp:4076
+ // TODO: AVX2 is the current cut-off because we don't have correct
+ // interleaving costs for prior ISA's.
if (!BaseT::isStridedAccess(Ptr))
----------------
This makes a lot of sense to me. I expect none of the pre-AVX2 benchmark numbers to change at all and those for AVX2+ to change positively. If that's the case, then I don't see why this would be a bad choice.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D111460/new/
https://reviews.llvm.org/D111460
More information about the llvm-commits
mailing list