[PATCH] [X86] tranform insertps to blendps when possible for better performance
Chandler Carruth
chandlerc at gmail.com
Tue Mar 3 16:43:10 PST 2015
On Mon, Mar 2, 2015 at 3:47 PM, Sanjay Patel <spatel at rotateright.com> wrote:
> Updated patch to:
>
> 1. Only handle the low element to low element insertps case (immediate ==
> 0)
> 2. Added test case to confirm that we're only transforming the low element
> insertps case
> 3. Removed use of 'auto'
>
Thanks, and I'll look at the updated code once we sort out whether this can
be done directly during lowering with an adjustment to the lowering and/or
patterns for instruction selection.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20150303/4ea427c5/attachment.html>
More information about the llvm-commits
mailing list