[llvm-commits] [PATCH] ARM: Custom lower scalar ctpop

Pete Couperus pjcoup at gmail.com
Sat Dec 8 15:35:58 PST 2012


Hello,

I think you are right; the sequence is probably a vcnt followed by a couple of vaddl's.  I will verify and resubmit shortly.
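
To make the idea concrete, here is a rough scalar model in C of what such a sequence would compute for an i32. It is only a sketch, not the patch itself. It assumes the adds are pairwise widening adds (the behaviour of vpaddl), which is my assumption about what "a couple of vaddl's" would amount to. The helper names byte_popcount and popcount32_model are made up for illustration.

```c
#include <stdint.h>

/* Per-byte population count: models what vcnt.8 produces in one 8-bit lane. */
static uint8_t byte_popcount(uint8_t b) {
    uint8_t n = 0;
    while (b) {
        n += (uint8_t)(b & 1u);
        b >>= 1;
    }
    return n;
}

/* Hypothetical model of the assumed sequence for a 32-bit value:
 *   step 1: count bits in each of the four bytes      (vcnt.8)
 *   step 2: add adjacent bytes into 16-bit lanes      (assumed pairwise widening add)
 *   step 3: add adjacent 16-bit lanes into one 32-bit (assumed pairwise widening add)
 */
static unsigned popcount32_model(uint32_t x) {
    uint8_t lanes[4];
    uint16_t pairs[2];
    int i;

    for (i = 0; i < 4; i++)
        lanes[i] = byte_popcount((uint8_t)(x >> (8 * i)));

    pairs[0] = (uint16_t)(lanes[0] + lanes[1]);
    pairs[1] = (uint16_t)(lanes[2] + lanes[3]);

    return (unsigned)pairs[0] + (unsigned)pairs[1];
}
```

For example, popcount32_model(0xF0F0F0F0u) yields 16: each byte contributes 4, the two pairwise sums are 8 each, and the final sum is 16. An i64 would presumably need one more pairwise step, but I have not verified that yet.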

Pete


Evan Cheng <evan.cheng at apple.com> wrote:

>This seems wrong to me. I think a scalar popcount should be 3-4 instructions. Owen, I believe you know the right code sequence for popcount with vcnt. What do you suggest?
>
>Evan
>
>On Dec 8, 2012, at 9:06 AM, Pete Couperus <pjcoup at gmail.com> wrote:
>
>> Hello,
>> 
>> I forgot the test case.  Reattached with the test case included.
>> 
>> Pete
>> 
>> 
>> On Sat, Dec 8, 2012 at 8:39 AM, Pete Couperus <pjcoup at gmail.com> wrote:
>>> Hello,
>>> 
>>> This patch builds on the vector support for ARM/NEON ctpop lowering
>>> (r169325) to give i32/i64 custom lowering.
>>> It does not tie into the ctpop idiom recognition, as that patch seems
>>> to be under discussion right now.
>>> Please review!
>>> Thanks!
>>> 
>>> Pete
>> <0001-scalar-ctpop.diff>



