[PATCH] D24681: Optimize patterns of vectorized interleaved memory accesses for X86.

Elena Demikhovsky via llvm-commits llvm-commits at lists.llvm.org
Thu Oct 6 08:34:24 PDT 2016


delena added inline comments.


> X86InterleavedAccess.cpp:70
> +///   %v2 = shuffle %intrshuffvec3, %intrshuffvec4, <0, 4, 2, 6>;
> +///   %v3 = shuffle %intrshuffvec3, %intrshuffvec4, <1, 5, 3, 7>;
> +///

AVX512 probably has another set of shuffles

> X86InterleavedAccess.cpp:89
> +    Value *NewBasePtr =
> +        Builder.CreateGEP(nullptr, VecBasePtr, Builder.getInt32(Part));
> +    Instruction *NewLoad =

inbounds GEP?

> X86InterleavedAccess.cpp:143
> +
> +  return lower(LI, Shuffles, Indices, Factor);
> +}

It is not a good name for function. I think that you don't need additional function call here at all.

https://reviews.llvm.org/D24681





More information about the llvm-commits mailing list