[PATCH] D121788: [AArch64] Increase MaxInterleaveFactor to 4
JinGu Kang via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Mar 22 08:51:08 PDT 2022
jaykang10 added a comment.
Thanks for comment! @dmgreen
> Do you have any performance results for changing this? I've not had much luck with trying it in the past, and it obviously can change quite a lot. It can certainly help in places, but I've found that if you turn it up too high you just end up over-unrolling loops, not getting into the fast loop body as much. It can obviously depend on the input code and loop counts tough. Perhaps it needs some better costmodelling?
I was able to see the overall performance number slightly up for an internal benchmark on neoverse-n1 but we would need to tune something like cost model according to micro architectures.
> I was still hoping to get D118979 <https://reviews.llvm.org/D118979> in because it should help quite a bit - and it on it's own increases the number of items processed per vector element, and this will increase it further. We have cleaned up quite a few of the places it doesn't do as well, there are just a few that have been stuck in review a while. Perhaps it makes sense to try and push that through, then re-evaluate this on top?
I agree with you. Let's visit this patch later.
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D121788/new/
https://reviews.llvm.org/D121788
More information about the llvm-commits
mailing list