[PATCH] D57059: [SLP] Initial support for the vectorization of the non-power-of-2 vectors.

Alexey Bataev via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Feb 28 11:01:24 PST 2020


ABataev marked an inline comment as done.
ABataev added inline comments.


================
Comment at: llvm/test/Transforms/SLPVectorizer/X86/zext.ll:263
+; SLM-NEXT:    [[TMP20:%.*]] = extractelement <2 x i16> [[TMP12]], i32 1
+; SLM-NEXT:    [[V7:%.*]] = insertelement <8 x i16> [[V6]], i16 [[TMP20]], i32 7
 ; SLM-NEXT:    ret <8 x i16> [[V7]]
----------------
RKSimon wrote:
> Its really odd that SLM fails to use zext <8 x i8> to <8 x i16> like SSE2, I think the custom SLM extract/insert costs are affecting something unexpected?
Yes, the cost of ExtractElement instructions affects vectorization, must be fixed in another patch for the vectorization of the building of aggregates.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D57059/new/

https://reviews.llvm.org/D57059





More information about the llvm-commits mailing list