[PATCH] Optimize insertqi when we copy all the lower 64 bits.
Rafael Avila de Espindola
rafael.espindola at gmail.com
Wed Apr 23 09:16:51 PDT 2014
> On Apr 23, 2014, at 12:08, Nadav Rotem <nrotem at apple.com> wrote:
>
>
>> On Apr 23, 2014, at 6:56 AM, Rafael Espíndola <rafael.espindola at gmail.com> wrote:
>>
>>> On 15 April 2014 14:04, Nadav Rotem <nrotem at apple.com> wrote:
>>> Hi Filipe,
>>>
>>> Why is this an IR-level transform? Could you implement this in SelectionDAG?
>>
>> What is the advantage of doing this at SelectionDAG? Since this is an
>> intrinsic, we know all that we need at the IR level already. IR also
>> has the advantage of opening the potential for further optimizations
> It is not clear to me why we represent this intrinsic as an IR-level intrinsic and not as a regular insertelement instruction. We already have IR-level optimizations on insertelement and I prefer not to duplicate all of them.
>
That I fully agree with. If the operation can be represented with generic IR, that is by far the best solution.
>> and has much better testing than SelectionDAG.
>>
>> Cheers,
>> Rafael
>
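For illustration only, here is a rough scalar model of why the "copy all the lower 64 bits" case needs no target-specific intrinsic. It assumes my reading of the SSE4a INSERTQ semantics (a length field of 0 encodes 64 bits, and only the low 64-bit lane of the result is defined); the function name insertLowLane is hypothetical and is not part of the patch or of LLVM.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model of the low 64-bit lane of an INSERTQ-style bit-field
// insert: copy the low `len` bits of `src` into `dst` starting at bit `idx`.
// Assumption: len == 0 encodes a full 64-bit field, as in SSE4a.
uint64_t insertLowLane(uint64_t dst, uint64_t src, unsigned len, unsigned idx) {
  unsigned width = (len & 63) == 0 ? 64 : (len & 63);
  // Fields that run past bit 63 are undefined on hardware; reject them here.
  assert(idx + width <= 64 && "bit field must fit in the low lane");
  uint64_t mask = width == 64 ? ~0ULL : ((1ULL << width) - 1);
  // Clear the destination field, then drop in the source bits.
  return (dst & ~(mask << idx)) | ((src & mask) << idx);
}
```

With len == 0 and idx == 0 the mask covers the whole lane, so the result is just src. Under those assumptions that is the same as replacing element 0 of a two-element i64 vector, which is what generic IR can already express.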