[PATCH][PR21138] Lower <u, u, u, u, idx1, idx2, idx3, idx4> using 256bit shuffle instead of splitting

Quentin Colombet qcolombet at apple.com
Wed Oct 15 17:42:31 PDT 2014


Hi Chandler,

Attached is a tentative patch to fix PR21138 http://llvm.org/bugs/show_bug.cgi?id=21138

I have not tested it against the test suite, I wanted to know if the direction matches what you expected first.
In particular, the check that guards the transformation is simple than I would have expected at first, but I did not find cases where the transformation is actually not beneficial (though, I haven’t tried hard).

Anyhow, let me know what you think on the patch before I make the measurements.

Thanks,
-Quentin



More information about the llvm-commits mailing list