[PATCH] D99980: [SLP]Improve cost model for the vectorized extractelements.

Alexey Bataev via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Apr 22 07:21:31 PDT 2021


ABataev added inline comments.


================
Comment at: llvm/test/Transforms/SLPVectorizer/X86/extractelement.ll:53
+; THRESH1-NEXT:    [[TMP4:%.*]] = extractelement <2 x float> [[TMP2]], i32 1
+; THRESH1-NEXT:    [[ADD:%.*]] = fadd float [[TMP3]], [[TMP4]]
 ; THRESH1-NEXT:    store float [[ADD]], float* @a, align 4
----------------
RKSimon wrote:
> Missed fadd reduction opportunity
We even do not try to detect reductions if we have less than 4 elements, here we're going to have shuffle + vector fadd + extractelement.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D99980/new/

https://reviews.llvm.org/D99980



More information about the llvm-commits mailing list