[PATCH] D130618: [AArch64][LoopVectorize] Enable tail-folding of simple loops on neoverse-v1
Dave Green via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon May 15 00:20:16 PDT 2023
dmgreen added a comment.
Thanks, a threshold of 15 will certainly help mitigate some of the issues in common cases. The results I have are still not looking great in places but, as we discussed, some of this will be dependent on alignment and on other issues such as multiple IR instructions becoming a single AArch64 instruction. There are certainly cases where this makes improvements, even if it still makes me nervous.
On a more technical note, I don't think this works without setting -sve-tail-folding=default, as nothing else will set NeedsDefault.
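A toy model of that concern, for illustration only. It assumes NeedsDefault starts out false and is only set while the option string is parsed; the type and function names (ToyTailFoldingOption, parse, getBits) are hypothetical and this is not the code from the patch or from LLVM:

```cpp
#include <cstdint>
#include <string>

// Toy model only: names echo the discussion but are hypothetical,
// not the LLVM implementation.
enum TailFoldingBits : uint8_t { Disabled = 0, Simple = 1 };

struct ToyTailFoldingOption {
  uint8_t EnableBits = Disabled;
  bool NeedsDefault = false; // assumed initial state

  // Assumed to run only when the flag appears on the command line.
  void parse(const std::string &Val) {
    if (Val == "default")
      NeedsDefault = true;
    else if (Val == "simple")
      EnableBits |= Simple;
  }

  // The per-CPU default is merged in only if NeedsDefault was set.
  uint8_t getBits(uint8_t CPUDefault) const {
    return (NeedsDefault ? CPUDefault : Disabled) | EnableBits;
  }
};
```

Under these assumptions, a CPU default of Simple is dropped unless the user passes the flag explicitly, which is the behaviour the comment is worried about.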
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D130618/new/
https://reviews.llvm.org/D130618