[PATCH] D94974: [SLP] Try doubled MaxElts for stores vectorization
Anton Afanasyev via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Feb 2 07:20:07 PST 2021
anton-afanasyev added inline comments.
================
Comment at: llvm/test/Transforms/SLPVectorizer/X86/arith-mul.ll:183
+; AVX2_PREFER128-NEXT: [[TMP5:%.*]] = mul <4 x i64> [[TMP1]], [[TMP3]]
+; AVX2_PREFER128-NEXT: [[TMP6:%.*]] = mul <4 x i64> [[TMP2]], [[TMP4]]
+; AVX2_PREFER128-NEXT: store <4 x i64> [[TMP5]], <4 x i64>* bitcast ([8 x i64]* @c64 to <4 x i64>*), align 8
----------------
RKSimon wrote:
> We're setting prefer-128-bit and yet still generating <4 x i64> ops?
Hmm, yes, you're right, that's strange to generate `<4 x i64>` for the case with preferable width (=128). But we can't check this at the abstract llvm level. Generally we don't know the target constraints, so this my patch looks too tricky for such cases.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D94974/new/
https://reviews.llvm.org/D94974
More information about the llvm-commits
mailing list