[PATCH] D114071: [X86][Costmodel] `getReplicationShuffleCost()`: promote 1 bit-wide elements to 8 bit when have AVX512BW+AVX512VBMI

Simon Pilgrim via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Nov 19 02:35:55 PST 2021


RKSimon accepted this revision.
RKSimon added a comment.
This revision is now accepted and ready to land.

LGTM (VBMI)



================
Comment at: llvm/lib/Target/X86/X86TargetTransformInfo.cpp:3677
     }
     return bailout();
   default:
----------------
lebedev.ri wrote:
> RKSimon wrote:
> > AVX512F will use (pretty awful) vXi32 shuffles: https://simd.godbolt.org/z/YYzjaf7Wh
> Yes, those are pretty awful. I'm not sure there's much hope for plain AVX512F;
> we need AVX512BW or AVX512DQ for this, I'd say.
It's probably worth adding them instead of the scalarization bailout, though.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D114071/new/

https://reviews.llvm.org/D114071


