[LLVMdev] Bug #16941

Nadav Rotem nrotem at apple.com
Tue Oct 22 14:41:14 PDT 2013


On Oct 21, 2013, at 12:09 PM, Dmitry Babokin <babokin at gmail.com> wrote:

> By the way, I'm curious, is the any reason why you focus on SSE4, not AVX? Seems that vectorizer should care the most about the latest silicon.
> 

I am interested in looking at the SSE4 code because lowering of AVX code is more complicated, especially for masks.  The problem that <8 x i1> can be legalized to <8 x i32> for YMM, or <8 x i16> for XMM.  ISPC worked around this limitation by explicitly extending the mask. The SEXT canonicalization reverted the code pattern that ISPC generated. 

Thanks,
Nadav   
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20131022/f6e399de/attachment.html>


More information about the llvm-dev mailing list