[PATCH] D130618: [AArch64][LoopVectorize] Enable tail-folding of simple loops on neoverse-v1

Dave Green via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon May 15 00:20:16 PDT 2023


dmgreen added a comment.

Thanks, a threshold of 15 will certainly help mitigate some of the issues in common cases. The results I have are still not looking great in places, but as we discussed, some of this will be dependent on alignment and on other issues, such as multiple IR instructions becoming a single AArch64 instruction. And there are certainly cases where this is making improvements, even if it still makes me nervous.

On a more technical note, I don't think this works without setting -sve-tail-folding=default, as nothing will set NeedsDefault.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D130618/new/

https://reviews.llvm.org/D130618


