[PATCH] Implement aarch64 neon instruction class SIMD lsone and lsone-post - LLVM

Tim Northover t.p.northover at gmail.com
Mon Nov 25 02:45:28 PST 2013


> This sounds good, and we are suggesting using EXTRACT_SUBREG and friends, so
> we are on the same page, right?

I'm not suggesting creating the EXTRACT_SUBREGs as a special-case in
AArch64 code, but leaving it to generic handling of shuffles. The less
special-cases we have to deal with, the better.

> I don't understand about this. Can you explicitly point out what intrinsics
> names are really unnecessary?

These two are distinct intrinsics:

{ <16 x i8>, <16 x i8> } @llvm.arm.neon.vld2lane.v16i8(i8*, <16 x i8>,
<16 x i8>, i32, i32)
{ <8 x i8>, <8 x i8> } @llvm.arm.neon.vld2lane.v8i8(i8*, <8 x i8>, <8
x i8>, i32, i32)

The first maps directly to the AArch64 instruction, the second needs
backend hacks to put it into a form suitable for precisely the same
instruction as the first.

Cheers.

Tim.



More information about the cfe-commits mailing list