[PATCH] D103818: [X86] lowerVECTOR_SHUFFLE(): allow widening shuffle to have i128 (YMM half) element type

Simon Pilgrim via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Jun 7 08:43:10 PDT 2021


RKSimon added a comment.

Why is this better? vinsertf128 tends to be faster than broadcasts



================
Comment at: llvm/test/CodeGen/X86/vector-shuffle-256-v4.ll:1016
 ; AVX2-NEXT:    vblendps {{.*#+}} ymm0 = ymm0[0,1,2,3],ymm1[4,5],ymm0[6,7]
 ; AVX2-NEXT:    retq
 ;
----------------
The AVX1 shuffle looks to be much better............


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D103818/new/

https://reviews.llvm.org/D103818



More information about the llvm-commits mailing list