[PATCH] D33728: [X86][SSE] Improve handling of non-temporal aligned loads
Simon Pilgrim via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Jun 5 09:04:05 PDT 2017
RKSimon added inline comments.
================
Comment at: test/CodeGen/X86/nontemporal-loads.ll:642
; SSE: # BB#0:
; SSE-NEXT: addps (%rdi), %xmm0
; SSE-NEXT: retq
----------------
craig.topper wrote:
> Why is sse4.1 still folding here? Is this because sse4.1 patterns uses memopv4f32 and not loadv4f32?
Ah - missed that one - yes its because its using the 'SSE-only' memory fragments.
Repository:
rL LLVM
https://reviews.llvm.org/D33728
More information about the llvm-commits
mailing list