[PATCH] Optimization for certain shufflevector by using insertps.

Filipe Cabecinhas filcab+llvm.phabricator at gmail.com
Wed Apr 23 14:46:54 PDT 2014


Hi nadav,

If we're doing a v4f32 shuffle on x86 with SSE4.1, we can lower certain
shufflevectors to an insertps instruction:
When most of the shufflevector result's elements come from one vector (and
keep their index), and one element comes from another vector or a memory
operand.

Added tests for insertps optimizations on shufflevector.

http://reviews.llvm.org/D3475

Files:
  lib/Target/X86/X86ISelLowering.cpp
  test/CodeGen/X86/sse41.ll
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D3475.8781.patch
Type: text/x-patch
Size: 5653 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20140423/bdbac681/attachment.bin>


More information about the llvm-commits mailing list