[PATCH] D57059: [SLP] Initial support for the vectorization of the non-power-of-2 vectors.

Craig Topper via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Nov 20 11:07:19 PST 2020


craig.topper added inline comments.


================
Comment at: llvm/test/Transforms/SLPVectorizer/X86/pr47623.ll:26
 ;
 ; AVX-LABEL: @foo(
+; AVX-NEXT:    [[TMP1:%.*]] = load i32, i32* getelementptr inbounds ([8 x i32], [8 x i32]* @b, i64 0, i64 0), align 16
----------------
ABataev wrote:
> xbolva00 wrote:
> > Regression on avx?
> Yes, looks like the issue with the cost of `@llvm.masked.gather` for masked gather with some undefs in the mask
Gather is slow on CPUs prior to AVX512. And its cost is proportional to the number of elements. I don't think the value of the mask should be a factor.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D57059/new/

https://reviews.llvm.org/D57059



More information about the llvm-commits mailing list