[PATCH] D33728: [X86][SSE] Improve handling of non-temporal aligned loads

Simon Pilgrim via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed May 31 09:29:41 PDT 2017


RKSimon created this revision.

PR32743 - Avoid folding non-temporal aligned loads (when the CPU supports them) into other instructions, since folding turns them into ordinary temporal vector loads that pollute the caches.
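
To illustrate the PR32743 condition, here is a minimal standalone sketch of the decision, not the code from the patch. The types `CpuFeatures` and `VectorLoad` and both function names are hypothetical and are not LLVM APIs. The only hardware facts assumed are that the 128-bit MOVNTDQA needs SSE4.1, the 256-bit form needs AVX2, the 512-bit form needs AVX512F, and all of them need naturally aligned memory.

```cpp
// Hypothetical stand-in for subtarget feature queries (not LLVM's API).
struct CpuFeatures {
  bool sse41 = false;   // 128-bit MOVNTDQA
  bool avx2 = false;    // 256-bit VMOVNTDQA
  bool avx512f = false; // 512-bit VMOVNTDQA
};

// Hypothetical description of a vector load.
struct VectorLoad {
  unsigned sizeInBytes;  // 16, 32 or 64 for full vector loads
  unsigned alignInBytes; // known alignment of the address
  bool nonTemporal;      // load carries the non-temporal hint
};

// True if the CPU has a non-temporal load instruction of this width.
bool hasNonTemporalLoadInstr(unsigned sizeInBytes, const CpuFeatures &cpu) {
  switch (sizeInBytes) {
  case 16: return cpu.sse41;
  case 32: return cpu.avx2;
  case 64: return cpu.avx512f;
  default: return false;
  }
}

// True if folding this load into the memory operand of another
// instruction would lose a non-temporal hint that could be honoured.
bool foldWouldDropNonTemporalHint(const VectorLoad &ld,
                                  const CpuFeatures &cpu) {
  return ld.nonTemporal && ld.alignInBytes >= ld.sizeInBytes &&
         hasNonTemporalLoadInstr(ld.sizeInBytes, cpu);
}
```

In this model a load is kept as a separate instruction only when the hint is present, the alignment is sufficient, and a matching instruction exists; in every other case the non-temporal hint gives no reason to block folding.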

PR32744 - Split 256-bit vector non-temporal aligned loads on AVX1 targets to keep them non-temporal.
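
The PR32744 split can be sketched the same way. This is an illustrative model under stated assumptions, not the lowering code from the patch: `NtLoadSupport` and `planAligned256BitNtLoad` are hypothetical names. It relies only on the facts that AVX1 targets include SSE4.1 (so the 128-bit MOVNTDQA is available) and that the 256-bit form needs AVX2.

```cpp
#include <vector>

// Hypothetical feature summary (not LLVM's API).
struct NtLoadSupport {
  bool sse41; // 128-bit non-temporal load available
  bool avx2;  // 256-bit non-temporal load available
};

// Returns the widths, in bits, of the non-temporal loads that could be
// emitted for one aligned 256-bit non-temporal load. An empty result
// means no non-temporal load instruction is available at all.
std::vector<unsigned> planAligned256BitNtLoad(const NtLoadSupport &cpu) {
  if (cpu.avx2)
    return {256};      // one full-width non-temporal load
  if (cpu.sse41)
    return {128, 128}; // AVX1 case: two halves, later recombined
  return {};           // hint cannot be honoured
}
```

On an AVX1-only target the plan is two 128-bit loads, which keeps both halves non-temporal at the cost of recombining them into a 256-bit register afterwards.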

These can be committed separately, but they are so interrelated that I thought it better to have them reviewed together.


Repository:
  rL LLVM

https://reviews.llvm.org/D33728

Files:
  lib/Target/X86/X86ISelLowering.cpp
  lib/Target/X86/X86InstrFragmentsSIMD.td
  test/CodeGen/X86/fast-isel-nontemporal.ll
  test/CodeGen/X86/nontemporal-loads.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D33728.100877.patch
Type: text/x-patch
Size: 37670 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20170531/2964d57c/attachment-0001.bin>

