[PATCH] D141439: [AARCH64][SVE] Do not optimize vector conversions
zino benaissa via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sun Jan 15 13:19:09 PST 2023
bzinodev marked an inline comment as done.
bzinodev added inline comments.
================
Comment at: llvm/lib/Target/AArch64/AArch64ISelLowering.cpp:14229
bool AArch64TargetLowering::optimizeExtendOrTruncateConversion(Instruction *I,
Loop *L) const {
----------------
paulwalker-arm wrote:
> For clarification, is this a bad optimisation when SVE is available? or is it the case the current code generation for SVE is suboptimal?
In a nutshell I don't know. Is the zext peep that rely on tbl instruction is always profitable even when targeting NEON? I have limited access to ARM HW, this peep performs slower on my small toy test.
On SVE, disabling the peep, uunpklo is generated and it is fast enough. Please feel free to propose better code generation options. thanks
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D141439/new/
https://reviews.llvm.org/D141439
More information about the llvm-commits
mailing list