[PATCH] D99980: [SLP]Improve cost model for the vectorized extractelements.

Thu Apr 22 07:21:31 PDT 2021

ABataev added inline comments.

================
Comment at: llvm/test/Transforms/SLPVectorizer/X86/extractelement.ll:53
+; THRESH1-NEXT:    [[TMP4:%.*]] = extractelement <2 x float> [[TMP2]], i32 1
+; THRESH1-NEXT:    [[ADD:%.*]] = fadd float [[TMP3]], [[TMP4]]
 ; THRESH1-NEXT:    store float [[ADD]], float* @a, align 4
----------------
RKSimon wrote:
> Missed fadd reduction opportunity
We even do not try to detect reductions if we have less than 4 elements, here we're going to have shuffle + vector fadd + extractelement.

Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D99980/new/

https://reviews.llvm.org/D99980