[PATCH] D33728: [X86][SSE] Improve handling of non-temporal aligned loads

Craig Topper via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Jun 5 08:53:52 PDT 2017


craig.topper added inline comments.


================
Comment at: test/CodeGen/X86/nontemporal-loads.ll:642
 ; SSE:       # BB#0:
 ; SSE-NEXT:    addps (%rdi), %xmm0
 ; SSE-NEXT:    retq
----------------
Why is sse4.1 still folding here? Is this because sse4.1 patterns uses memopv4f32 and not loadv4f32?


Repository:
  rL LLVM

https://reviews.llvm.org/D33728





More information about the llvm-commits mailing list