[PATCH] D114071: [X86][Costmodel] `getReplicationShuffleCost()`: promote 1 bit-wide elements to 8 bit when have AVX512BW+AVX512VBMI
Roman Lebedev via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Nov 19 05:00:54 PST 2021
lebedev.ri added a comment.
In D114071#3142360 <https://reviews.llvm.org/D114071#3142360>, @RKSimon wrote:
> LGTM (VBMI)
Thank you for the review!
================
Comment at: llvm/lib/Target/X86/X86TargetTransformInfo.cpp:3677
}
return bailout();
default:
----------------
RKSimon wrote:
> lebedev.ri wrote:
> > RKSimon wrote:
> > > AVX512F will use (pretty awful) vXi32 shuffles: https://simd.godbolt.org/z/YYzjaf7Wh
> > Yes, those are pretty awful, i'm not sure if there's much hope for plain AVX512F,
> > we need AVX512BW or AVX512DQ for this i'd say.
> Its probably worth adding them instead of scalarization bailout though.
I mean, yes, it is just not obvious to me how to do that without hardcoding them.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D114071/new/
https://reviews.llvm.org/D114071
More information about the llvm-commits
mailing list