[PATCH] D87416: [SLPVectorizer][X86][AMDGPU] Remove fcmp+select to fmin/fmax reduction support.

Wed Sep 9 14:13:11 PDT 2020

craig.topper created this revision.
craig.topper added reviewers: RKSimon, spatel, nikic, arsenm, efriedma, vdmitrie.
Herald added subscribers: kerbowa, hiraditya, t-tye, tpr, dstuttard, yaxunl, nhaehnle, jvesely, kzhuravl.
Herald added a project: LLVM.
craig.topper requested review of this revision.
Herald added a subscriber: wdng.

Previously we could match fcmp+select to a reduction if the fcmp had
the nonans fast math flag. But if the select had the nonans fast
math flag, InstCombine would turn it into a fminnum/fmaxnum intrinsic
before SLP gets to it. Seems fairly likely that if one of the
fcmp+select pair have the fast math flag, they both would.

My plan is to start vectorizing the fmaxnum/fminnum version soon,
but I wanted to get this code out as it had some of the strangest
fast math flag behaviors.

https://reviews.llvm.org/D87416

Files:
  llvm/lib/Transforms/Vectorize/SLPVectorizer.cpp
  llvm/test/Transforms/SLPVectorizer/AMDGPU/horizontal-store.ll
  llvm/test/Transforms/SLPVectorizer/AMDGPU/reduction.ll
  llvm/test/Transforms/SLPVectorizer/X86/horizontal-list.ll
  llvm/test/Transforms/SLPVectorizer/X86/horizontal-minmax.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D87416.290826.patch
Type: text/x-patch
Size: 61732 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20200909/01aa879e/attachment.bin>