[PATCH] [AArch64] Implement 128 bit register copy with NEON.

Tim Northover t.p.northover at gmail.com
Mon Nov 25 02:25:30 PST 2013


  Hi Kevin,

  This is definitely a good idea (the existing "solution" was an
  appalling hack), but I think the instructions chosen are sub-optimal.
  ARM has designated the instruction "ORR vD.16b, vN.16b, vN.16b" as the
  official 128-bit move instruction.

  CPU designers are likely to optimise around this (for example, it might
  never reach the ALU and instead be handled directly by the register
  renaming hardware), so I think that's what LLVM should be generating.
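
  To make the suggested form concrete, here is a minimal C++ sketch of
  what a 128-bit copy would look like as assembly text and as a 32-bit
  instruction word. The helper names are hypothetical and are not part of
  LLVM or of the patch under review; the base opcode constant is my
  reading of the ORR (vector, register) encoding with Q=1 and should be
  checked against the architecture manual before being relied on.

```cpp
#include <cassert>
#include <cstdint>
#include <string>

// Hypothetical helper (not an LLVM API): assembly text for a 128-bit
// vector register copy, written as "ORR vD.16b, vN.16b, vN.16b".
std::string formatVectorCopy128(unsigned dst, unsigned src) {
  assert(dst < 32 && src < 32 && "AArch64 vector registers are v0..v31");
  const std::string d = "v" + std::to_string(dst) + ".16b";
  const std::string s = "v" + std::to_string(src) + ".16b";
  // The source register appears twice: OR-ing a value with itself
  // yields the value unchanged, which is what makes this a move.
  return "orr " + d + ", " + s + ", " + s;
}

// Hypothetical helper: the same copy as an instruction word.
// Assumed field layout: Rm in bits 20..16, Rn in bits 9..5, Rd in bits 4..0.
uint32_t encodeVectorCopy128(unsigned dst, unsigned src) {
  assert(dst < 32 && src < 32 && "AArch64 vector registers are v0..v31");
  const uint32_t kOrrVec16B = 0x4EA01C00u; // assumed base opcode, Q=1
  return kOrrVec16B | (uint32_t(src) << 16) | (uint32_t(src) << 5) |
         uint32_t(dst);
}
```

  For instance, formatVectorCopy128(0, 1) produces the text
  "orr v0.16b, v1.16b, v1.16b". Passing the same register as both source
  operands is the whole trick, so any emitter along these lines has to
  set Rn and Rm from the one source register.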

  Cheers.

  Tim.

http://llvm-reviews.chandlerc.com/D2264


