[PATCH] [x86] instcombine more cases of insertps into a shufflevector
Sanjay Patel
spatel at rotateright.com
Fri Apr 24 10:44:31 PDT 2015
Hi RKSimon, andreadb, craig.topper,
This is a follow-on to D8833 (insertps optimization when the zero mask is not used).
In this patch, we check for the case where the zmask is used, but both input vectors to the insertps intrinsic are the same operand. This lets us replace the 2nd shuffle input operand with the zero vector.
I confirmed that the x86 backend generates the expected insertps instructions for the shuffles created here.
http://reviews.llvm.org/D9257
Files:
lib/Transforms/InstCombine/InstCombineCalls.cpp
test/Transforms/InstCombine/x86-insertps.ll
EMAIL PREFERENCES
http://reviews.llvm.org/settings/panel/emailpreferences/
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D9257.24399.patch
Type: text/x-patch
Size: 4178 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20150424/76d287f3/attachment.bin>
More information about the llvm-commits
mailing list