[PATCH] [x86] instcombine more cases of insertps into a shufflevector

Sanjay Patel spatel at rotateright.com
Fri Apr 24 10:44:31 PDT 2015


Hi RKSimon, andreadb, craig.topper,

This is a follow-on to D8833 (insertps optimization when the zero mask is not used). 

In this patch, we check for the case where the zmask is used, but both input vectors to the insertps intrinsic are the same operand. This lets us replace the 2nd shuffle input operand with the zero vector.

I confirmed that the x86 backend generates the expected insertps instructions for the shuffles created here.

http://reviews.llvm.org/D9257

Files:
  lib/Transforms/InstCombine/InstCombineCalls.cpp
  test/Transforms/InstCombine/x86-insertps.ll

EMAIL PREFERENCES
  http://reviews.llvm.org/settings/panel/emailpreferences/
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D9257.24399.patch
Type: text/x-patch
Size: 4178 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20150424/76d287f3/attachment.bin>


More information about the llvm-commits mailing list