[patch] Simplify a vpermilps with constant mask
grosbach at apple.com
Mon Apr 21 10:47:48 PDT 2014
Why not change the clang IR codegen to just get this right in the first place?
On Apr 21, 2014, at 7:20 AM, Rafael Espíndola <rafael.espindola at gmail.com> wrote:
> On 11 April 2014 07:54, Rafael Espíndola <rafael.espindola at gmail.com> wrote:
>> With a constant mask a vpermilps is just a shufflevector. The attached
>> patch implements the simplification. This allows us to produce denser
>> code. It should also allow more folding down the line.
More information about the llvm-commits