[llvm-commits] [PATCH] ARM/NEON: Custom lower ctpop for appropriate vector types

Pete Couperus pjcoup at gmail.com
Thu Nov 29 20:38:22 PST 2012


Hello,

The population count intrinsic (ctpop) is supported on ARM/NEON for v8i8
and v16i8 types via NEON's vcnt instruction.
This patch leverages vcnt and other NEON instructions to custom lower ctpop
for v2i32/v4i32 and v4i16/v8i16 types.
As you'll see, a fair chunk of this patch is comments describing the
lowering, which I am happy to adjust to people's liking.
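
For readers without the diff at hand, here is a rough scalar C++ model of
the general idea: count bits per byte (which is what vcnt provides on i8
lanes), then sum adjacent byte counts to get the count for each wider lane.
This is only a sketch under assumptions. The helper names (popcount8,
ctpop_v4i16, ctpop_v2i32) are hypothetical, and the actual NEON instruction
sequence chosen in the attached patch may well differ.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Scalar stand-in for what vcnt computes on a single i8 lane.
// (Hypothetical helper, not part of the patch.)
static uint8_t popcount8(uint8_t b) {
    uint8_t n = 0;
    while (b) {
        b &= b - 1;  // clear the lowest set bit
        ++n;
    }
    return n;
}

// Model of ctpop on v4i16: count each byte, then add the two byte
// counts that belong to the same 16-bit lane.
std::array<uint16_t, 4> ctpop_v4i16(const std::array<uint16_t, 4>& v) {
    std::array<uint16_t, 4> out{};
    for (std::size_t i = 0; i < v.size(); ++i) {
        uint8_t lo = popcount8(static_cast<uint8_t>(v[i] & 0xFF));
        uint8_t hi = popcount8(static_cast<uint8_t>(v[i] >> 8));
        out[i] = static_cast<uint16_t>(lo + hi);
    }
    return out;
}

// Model of ctpop on v2i32: treat each 32-bit lane as two 16-bit halves,
// count those as above, then add the two half counts.
std::array<uint32_t, 2> ctpop_v2i32(const std::array<uint32_t, 2>& v) {
    std::array<uint32_t, 2> out{};
    for (std::size_t i = 0; i < v.size(); ++i) {
        std::array<uint16_t, 4> halves = {
            static_cast<uint16_t>(v[i] & 0xFFFF),
            static_cast<uint16_t>(v[i] >> 16),
            0, 0};
        std::array<uint16_t, 4> counts = ctpop_v4i16(halves);
        out[i] = static_cast<uint32_t>(counts[0]) + counts[1];
    }
    return out;
}
```

The v8i16 and v4i32 cases follow the same pattern over twice as many lanes.
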
Please review!
Thanks!

Pete
-------------- next part --------------
A non-text attachment was scrubbed...
Name: 0001-ctpop.diff
Type: application/octet-stream
Size: 9131 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20121129/33358ad2/attachment.obj>