[PATCH] ARM/NEON: Improve codegen for long sext/zext operations.

Pete Couperus pjcoup at gmail.com
Fri Mar 15 19:51:05 PDT 2013


Hi Renato,

Thanks for the review.  I'd be happy to modify the cost values.
Is there a guideline for what this should be?  I surmise looking at
the comments and the other entries that this isn't quite as simple as
something like the number of vmovl's.  Or should I just remove the
entries corresponding to the cases covered in the patch?
Thanks!

Pete


On Fri, Mar 15, 2013 at 2:19 AM, Renato Golin <renato.golin at linaro.org> wrote:
> On 15 March 2013 06:43, Pete Couperus <pjcoup at gmail.com> wrote:
>>
>> The ARM backend currently has poor codegen for long sext/zext
>> operations, such as v8i8 -> v8i32.
>> This patch addresses this by performing a custom expansion in
>> ARMISelLowering.
>> This partially addresses PR14867
>> (http://llvm.org/bugs/show_bug.cgi?id=14867).
>
>
> Hi Pete,
>
> Great patch, thanks! And lots of test cases, which is always great.
>
> Can you also update the cost of sign/zero extend in ARMTTI?
>
> cheers,
> --renato



More information about the llvm-commits mailing list