[PATCH] D141439: [AARCH64][SVE] Do not optimize vector conversions

zino benaissa via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Sun Jan 15 13:19:09 PST 2023


bzinodev marked an inline comment as done.
bzinodev added inline comments.


================
Comment at: llvm/lib/Target/AArch64/AArch64ISelLowering.cpp:14229
 
 bool AArch64TargetLowering::optimizeExtendOrTruncateConversion(Instruction *I,
                                                                Loop *L) const {
----------------
paulwalker-arm wrote:
> For clarification, is this a bad optimisation when SVE is available? or is it the case the current code generation for SVE is suboptimal? 
In a nutshell I don't know. Is the zext peep that rely on tbl instruction is always profitable even when targeting NEON? I have limited access to ARM HW, this peep performs slower on my small toy test.

On SVE, disabling the peep,  uunpklo is generated and it is fast enough. Please feel free to propose better code generation options. thanks




CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D141439/new/

https://reviews.llvm.org/D141439



More information about the llvm-commits mailing list