[PATCH] ARM/NEON: Improve codegen for long sext/zext operations.
Pete Couperus
pjcoup at gmail.com
Fri Mar 15 19:51:05 PDT 2013
Hi Renato,
Thanks for the review. I'd be happy to modify the cost values.
Is there a guideline for what these should be? Looking at the
comments and the other entries, I surmise that this isn't as simple
as counting the number of vmovl's. Or should I just remove the
entries corresponding to the cases covered in the patch?
Thanks!
Pete
On Fri, Mar 15, 2013 at 2:19 AM, Renato Golin <renato.golin at linaro.org> wrote:
> On 15 March 2013 06:43, Pete Couperus <pjcoup at gmail.com> wrote:
>>
>> The ARM backend currently has poor codegen for long sext/zext
>> operations, such as v8i8 -> v8i32.
>> This patch addresses this by performing a custom expansion in
>> ARMISelLowering.
>> This partially addresses PR14867
>> (http://llvm.org/bugs/show_bug.cgi?id=14867).
>
>
> Hi Pete,
>
> Great patch, thanks! And lots of test cases, which is always great.
>
> Can you also update the cost of sign/zero extend in ARMTTI?
>
> cheers,
> --renato