[PATCH] [X86] transform insertps to blendps when possible for better performance

Sanjay Patel spatel at rotateright.com
Mon Mar 2 15:47:25 PST 2015


Updated the patch to:

1. Only handle the low-element-to-low-element insertps case (immediate == 0)
2. Add a test case to confirm that we're only transforming the low-element insertps case
3. Remove the use of 'auto'
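
For readers unfamiliar with the two instructions, the restriction in item 1 can be illustrated with a scalar model. This is only a sketch of the register-source instruction semantics as I understand them from the ISA reference, not code from the patch; the names Vec4, modelInsertps, modelBlendps, and checkLowElementCase are hypothetical and exist only for this illustration.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

using Vec4 = std::array<float, 4>;

// Scalar model of INSERTPS with a register source (assumed semantics):
// imm[7:6] selects the source element, imm[5:4] selects the destination
// element, and imm[3:0] is a mask of result elements to zero.
Vec4 modelInsertps(Vec4 dst, const Vec4 &src, uint8_t imm) {
  unsigned srcIdx = (imm >> 6) & 3;
  unsigned dstIdx = (imm >> 4) & 3;
  dst[dstIdx] = src[srcIdx];
  for (unsigned i = 0; i < 4; ++i)
    if (imm & (1u << i))
      dst[i] = 0.0f;
  return dst;
}

// Scalar model of BLENDPS: bit i of imm set means "take element i from src".
Vec4 modelBlendps(Vec4 dst, const Vec4 &src, uint8_t imm) {
  for (unsigned i = 0; i < 4; ++i)
    if (imm & (1u << i))
      dst[i] = src[i];
  return dst;
}

// With immediate 0, insertps copies src[0] into dst[0] and zeroes nothing,
// which matches blendps with immediate 1 in this model.
void checkLowElementCase() {
  Vec4 a = {1.0f, 2.0f, 3.0f, 4.0f};
  Vec4 b = {5.0f, 6.0f, 7.0f, 8.0f};
  assert(modelInsertps(a, b, 0x00) == modelBlendps(a, b, 0x01));
}
```

Any other insertps immediate either moves an element across lanes or zeroes lanes, which a single blendps with the same two operands cannot reproduce; that is presumably why the patch limits itself to immediate == 0. The model covers only the register-source form; the memory-operand form of insertps loads a single scalar and is not modeled here.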


http://reviews.llvm.org/D7866

Files:
  lib/Target/X86/X86ISelLowering.cpp
  lib/Target/X86/X86InstrSSE.td
  test/CodeGen/X86/avx-load-store.ll
  test/CodeGen/X86/sse41.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D7866.21055.patch
Type: text/x-patch
Size: 8305 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20150302/d9c6ab31/attachment.bin>
