[PATCH] D61263: [X86][SSE] Fold scalar horizontal add/sub for non-0/1 element extractions
Simon Pilgrim via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed May 1 07:05:22 PDT 2019
RKSimon added inline comments.
================
Comment at: test/CodeGen/X86/haddsub.ll:1012
; AVX-NEXT: vpermilps {{.*#+}} xmm0 = xmm0[3,1,2,3]
; AVX-NEXT: vaddss %xmm0, %xmm1, %xmm0
; AVX-NEXT: vzeroupper
----------------
We still miss the fold to extractf128 + hadd + permilps here, but the cost-benefit isn't great.
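As a rough illustration of the two sequences being compared, here is a minimal scalar model in plain C++, with no intrinsics. It assumes the test adds elements 6 and 7 of an 8 x float vector, and that xmm1 holds element 6 in lane 0; neither is visible in the quoted lines. The helper names (`hadd`, `extract_hi`, `permil`, `scalar_seq`, `hadd_seq`) are hypothetical and only mimic the lane behaviour of vhaddps, vextractf128 and vpermilps.

```cpp
#include <array>
#include <cassert>

using V4 = std::array<float, 4>;
using V8 = std::array<float, 8>;

// Lane model of a 128-bit vhaddps: [a0+a1, a2+a3, b0+b1, b2+b3].
V4 hadd(const V4 &a, const V4 &b) {
  return {a[0] + a[1], a[2] + a[3], b[0] + b[1], b[2] + b[3]};
}

// Lane model of vextractf128 $1: the upper four lanes of a 256-bit vector.
V4 extract_hi(const V8 &v) { return {v[4], v[5], v[6], v[7]}; }

// Lane model of vpermilps: result lane k takes source lane ik.
V4 permil(const V4 &v, int i0, int i1, int i2, int i3) {
  return {v[i0], v[i1], v[i2], v[i3]};
}

// Sequence resembling the quoted output: shuffle lane 3 down to lane 0,
// then do a scalar add with the other operand (assumed to be lane 2).
float scalar_seq(const V8 &v) {
  V4 hi = extract_hi(v);
  V4 shuffled = permil(hi, 3, 1, 2, 3); // xmm0 = xmm0[3,1,2,3]
  return hi[2] + shuffled[0];           // vaddss on lane 0 only
}

// The missed fold: extractf128 + hadd + permilps.
float hadd_seq(const V8 &v) {
  V4 hi = extract_hi(v);
  V4 h = hadd(hi, hi);                  // lane 1 holds hi[2] + hi[3]
  return permil(h, 1, 1, 2, 3)[0];      // move lane 1 down to lane 0
}
```

Under these assumptions both functions return the same value for any input. The hadd form only swaps the scalar add for a horizontal add and still needs a shuffle afterwards, which fits the remark that the cost-benefit isn't great.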
Repository:
rL LLVM
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D61263/new/
https://reviews.llvm.org/D61263