[PATCH][InstCombine][X86] Teach how to fold calls to SSE2/AVX2 psrl intrinsics.

Michael Spencer bigcheesegs at gmail.com
Fri Apr 25 12:59:49 PDT 2014


On Fri, Apr 25, 2014 at 9:43 AM, Andrea Di Biagio
<andrea.dibiagio at gmail.com> wrote:
> Hi,
>
> this patch extends the fix committed at revision 20758
> (http://llvm.org/viewvc/llvm-project?view=revision&revision=207058) to
> also teach InstCombine how to fold calls to SSE2/AVX2 packed logical
> shift right intrinsics.
>
> A packed logical shift right with a shift count bigger than or equal
> to the element size always produces a zero vector. In all other cases,
> it can be safely replaced by a 'lshr' instruction.
>
> Please let me know if ok to submit.
>
> Thanks,
> Andrea Di Biagio
> SN Systems - Sony Computer Entertainment Group.

lgtm.

- Michael Spencer



More information about the llvm-commits mailing list