[PATCH][X86] Add target specific combine rules to fold SSE/AVX/AVX2 blend intrinsics.

Andrea Di Biagio andrea.dibiagio at gmail.com
Thu May 15 08:26:09 PDT 2014


Thanks Nadav!

Committed revision 208895.

On Thu, May 15, 2014 at 4:04 PM, Nadav Rotem <nrotem at apple.com> wrote:
> LGTM.  Thanks Andrea.
>
>> On May 15, 2014, at 7:41, Andrea Di Biagio <andrea.dibiagio at gmail.com> wrote:
>>
>> Hi,
>>
>> This patch teaches the x86 backend how to fold SSE4.1/AVX/AVX2 blend
>> intrinsics in the following trivial cases:
>>  1)  fold (blend A, A, Mask) -> A;
>>  2)  fold (blend A, B, <allZeros>) -> A;
>>  3) fold (blend A, B, <allOnes>) -> B;
>>
>> Added two new tests to verify that the new folding rules work for all
>> the optimized blend intrinsics.
>>
>> Please let me know if ok to submit.
>>
>> Thanks,
>> Andrea Di Biagio
>> <patch-combine-x86-blend.diff>



More information about the llvm-commits mailing list