[PATCH] D74088: [x86] form broadcast of scalar memop even with >1 use
Sanjay Patel via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Feb 5 13:48:57 PST 2020
spatel marked an inline comment as done.
spatel added inline comments.
================
Comment at: llvm/test/CodeGen/X86/vector-reduce-fadd.ll:1109
define double @test_v16f64(double %a0, <16 x double> %a1) {
-; SSE-LABEL: test_v16f64:
-; SSE: # %bb.0:
-; SSE-NEXT: movapd {{[0-9]+}}(%rsp), %xmm8
-; SSE-NEXT: addsd %xmm1, %xmm0
-; SSE-NEXT: unpckhpd {{.*#+}} xmm1 = xmm1[1,1]
-; SSE-NEXT: addsd %xmm1, %xmm0
-; SSE-NEXT: addsd %xmm2, %xmm0
-; SSE-NEXT: unpckhpd {{.*#+}} xmm2 = xmm2[1,1]
-; SSE-NEXT: addsd %xmm2, %xmm0
-; SSE-NEXT: addsd %xmm3, %xmm0
-; SSE-NEXT: unpckhpd {{.*#+}} xmm3 = xmm3[1,1]
-; SSE-NEXT: addsd %xmm3, %xmm0
-; SSE-NEXT: addsd %xmm4, %xmm0
-; SSE-NEXT: unpckhpd {{.*#+}} xmm4 = xmm4[1,1]
-; SSE-NEXT: addsd %xmm4, %xmm0
-; SSE-NEXT: addsd %xmm5, %xmm0
-; SSE-NEXT: unpckhpd {{.*#+}} xmm5 = xmm5[1,1]
-; SSE-NEXT: addsd %xmm5, %xmm0
-; SSE-NEXT: addsd %xmm6, %xmm0
-; SSE-NEXT: unpckhpd {{.*#+}} xmm6 = xmm6[1,1]
-; SSE-NEXT: addsd %xmm6, %xmm0
-; SSE-NEXT: addsd %xmm7, %xmm0
-; SSE-NEXT: unpckhpd {{.*#+}} xmm7 = xmm7[1,1]
-; SSE-NEXT: addsd %xmm7, %xmm0
-; SSE-NEXT: addsd %xmm8, %xmm0
-; SSE-NEXT: unpckhpd {{.*#+}} xmm8 = xmm8[1,1]
-; SSE-NEXT: addsd %xmm8, %xmm0
-; SSE-NEXT: retq
+; SSE2-LABEL: test_v16f64:
+; SSE2: # %bb.0:
----------------
I haven't stepped through this or the next test diff to see why it changed. Anyone know why SSE2/41 would differ here?
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D74088/new/
https://reviews.llvm.org/D74088
More information about the llvm-commits
mailing list