[PATCH] D155355: [AArch64] Set maximum vscale VF with shouldMaximizeVectorBandwidth
Dave Green via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Jul 17 00:13:43 PDT 2023
dmgreen added a comment.
For Neon we enabled shouldMaximizeVectorBandwidth so that the backend could make use of instructions like umull/umull2 and the narrowing instructions. Extending into larger types for Neon is quite natural in places, and can lead to fewer instructions overall. SVE has instructions like UMULLB/T that work on the top/bottom lanes in a pair, but I don't believe the backend makes any use of them at the moment.
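To make the lane layout concrete, here is a rough scalar sketch of the difference. The helper names (umull_model, umull2_model, umullb_model, umullt_model) are made up for illustration and are not LLVM or ACLE APIs, and the fixed 16 x i8 vector is an assumption for readability, since real SVE vectors are length-agnostic.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Hypothetical fixed-width vector types, for illustration only.
using U8x16 = std::array<std::uint8_t, 16>;
using U16x8 = std::array<std::uint16_t, 8>;

// Widening multiply of one pair of 8-bit lanes into a 16-bit lane.
static std::uint16_t wmul(std::uint8_t x, std::uint8_t y) {
    return static_cast<std::uint16_t>(static_cast<unsigned>(x) * y);
}

// Scalar model of Neon UMULL: uses the low half of the inputs (lanes 0..7).
U16x8 umull_model(const U8x16 &a, const U8x16 &b) {
    U16x8 r{};
    for (std::size_t i = 0; i < 8; ++i)
        r[i] = wmul(a[i], b[i]);
    return r;
}

// Scalar model of Neon UMULL2: uses the high half of the inputs (lanes 8..15).
U16x8 umull2_model(const U8x16 &a, const U8x16 &b) {
    U16x8 r{};
    for (std::size_t i = 0; i < 8; ++i)
        r[i] = wmul(a[i + 8], b[i + 8]);
    return r;
}

// Scalar model of UMULLB: uses the bottom (even-numbered) lane of each pair.
U16x8 umullb_model(const U8x16 &a, const U8x16 &b) {
    U16x8 r{};
    for (std::size_t i = 0; i < 8; ++i)
        r[i] = wmul(a[2 * i], b[2 * i]);
    return r;
}

// Scalar model of UMULLT: uses the top (odd-numbered) lane of each pair.
U16x8 umullt_model(const U8x16 &a, const U8x16 &b) {
    U16x8 r{};
    for (std::size_t i = 0; i < 8; ++i)
        r[i] = wmul(a[2 * i + 1], b[2 * i + 1]);
    return r;
}
```

With umull/umull2 the two results are the contiguous low and high halves of the widened product, so they can be stored back to back. With UMULLB/T the two results hold the even and odd elements, so they would need to be interleaved again before a contiguous store.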
The description is a bit light on details. What is the reasoning behind enabling this for SVE too? And do you have any benchmark results?
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D155355/new/
https://reviews.llvm.org/D155355