[PATCH] D118394: [AArch64][NEON][SVE] Lower FCOPYSIGN using AArch64ISD::BSP
David Truby via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Feb 1 06:36:30 PST 2022
DavidTruby added inline comments.
================
Comment at: llvm/test/CodeGen/AArch64/vector-fcopysign.ll:303
+; NOFP16-NEXT: bsl.16b v1, v2, v4
+; NOFP16-NEXT: fcvt h1, s1
; NOFP16-NEXT: mov.h v0[3], v1[0]
----------------
efriedma wrote:
> It looks like the generated code here got worse; is there something wrong with the way we lower AArch64ISD::BSP?
This looks like an oddity of register allocation to me. I'm not sure why the bsl instruction doesn't just use v3, in which case the extra mov wouldn't be necessary. By the time we get to AArch64ISD::BSP lowering register allocation has already happened though. I'm not sure what the solution to that is.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D118394/new/
https://reviews.llvm.org/D118394
More information about the llvm-commits
mailing list