[PATCH] D99980: [SLP]Improve cost model for the vectorized extractelements.
Alexey Bataev via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Apr 22 07:21:31 PDT 2021
ABataev added inline comments.
================
Comment at: llvm/test/Transforms/SLPVectorizer/X86/extractelement.ll:53
+; THRESH1-NEXT: [[TMP4:%.*]] = extractelement <2 x float> [[TMP2]], i32 1
+; THRESH1-NEXT: [[ADD:%.*]] = fadd float [[TMP3]], [[TMP4]]
; THRESH1-NEXT: store float [[ADD]], float* @a, align 4
----------------
RKSimon wrote:
> Missed fadd reduction opportunity
We even do not try to detect reductions if we have less than 4 elements, here we're going to have shuffle + vector fadd + extractelement.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D99980/new/
https://reviews.llvm.org/D99980
More information about the llvm-commits
mailing list