[PATCH][PR21138] Lower <u, u, u, u, idx1, idx2, idx3, idx4> using 256bit shuffle instead of splitting

Quentin Colombet qcolombet at apple.com
Wed Oct 15 17:44:12 PDT 2014


With the patch!
On Oct 15, 2014, at 5:42 PM, Quentin Colombet <qcolombet at apple.com> wrote:

> Hi Chandler,
> 
> Attached is a tentative patch to fix PR21138: http://llvm.org/bugs/show_bug.cgi?id=21138
> 
> I have not tested it against the test suite yet; I first wanted to know whether the direction matches what you expected.
> In particular, the check that guards the transformation is simpler than I would have expected, but I did not find cases where the transformation is not beneficial (though I haven’t tried hard).
> 
> Anyhow, let me know what you think of the patch before I make the measurements.
> 
> Thanks,
> -Quentin
-------------- next part --------------
A non-text attachment was scrubbed...
Name: pr21138.patch
Type: application/octet-stream
Size: 4346 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20141015/4ec6177a/attachment.obj>

