[PATCH] [AArch64] Implement 128-bit register copy with NEON.
Tim Northover
t.p.northover at gmail.com
Mon Nov 25 02:25:30 PST 2013
Hi Kevin,
This is definitely a good idea (the existing "solution" was an
appalling hack), but I think the instructions chosen are sub-optimal.
ARM has designated the instruction "ORR vD.16b, vN.16b, vN.16b" as the
official 128-bit move instruction.
CPU designers are likely to optimise around this (for example, it might
never even reach the ALU and instead be handled directly by the register
renaming hardware), so that's what LLVM should be generating, I think.
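
To make the suggestion concrete, here is a rough sketch of the copy I have in mind. The helper names are hypothetical and not part of LLVM's API; the sketch assumes the standard A64 encoding of ORR (vector, register) with Q=1, where the source register is placed in both the Rn and Rm fields.

```cpp
#include <cstdint>
#include <string>

// Hypothetical helper (not LLVM API): assembly text that copies the 128-bit
// vector register v<src> into v<dst> by ORing the source with itself.
std::string neonCopy128Asm(unsigned dst, unsigned src) {
  const std::string s = "v" + std::to_string(src) + ".16b";
  return "orr v" + std::to_string(dst) + ".16b, " + s + ", " + s;
}

// Hypothetical helper: 32-bit encoding of "ORR Vd.16B, Vn.16B, Vn.16B".
// 0x4EA01C00 is the ORR (vector, register) opcode with Q=1;
// Rm occupies bits 20:16, Rn bits 9:5 and Rd bits 4:0.
constexpr std::uint32_t neonCopy128Encoding(unsigned dst, unsigned src) {
  return 0x4EA01C00u | ((src & 31u) << 16) | ((src & 31u) << 5) | (dst & 31u);
}
```

For example, neonCopy128Asm(0, 1) gives "orr v0.16b, v1.16b, v1.16b", which should assemble to the word 0x4EA11C20. In the patch itself this would of course go through the normal instruction-building machinery rather than strings.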
Cheers.
Tim.
http://llvm-reviews.chandlerc.com/D2264