[llvm] [AArch64] Set MaxInterleaving to 4 for Neoverse V2 (PR #100385)
David Green via llvm-commits
llvm-commits at lists.llvm.org
Mon Aug 19 13:08:18 PDT 2024
davemgreen wrote:
> One issue with interleaving is that epilogue vectorization only considers VF, but not VF x UF. There are a number of cases where epilogue vectorization would be beneficial on AArch64 when the VF < 16 but UF > 1. @juliannagele is currently looking into adjusting the cost model for that.
I hadn't realised that we didn't account for the UF already. That sounds like a good thing to fix, thanks for the info. @juliannagele @fhahn do you have a timeframe for when such a patch will be ready? I would like to avoid the regressions if we can, and would not want to end up relying on a patch that never materializes. Otherwise perhaps @sjoerdmeijer could increase the limit for cases where the UF is 4, and we can improve it further in the future where needed.
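To make the VF x UF point concrete, here is a rough arithmetic sketch. It is only an illustration, not the vectorizer's actual cost model, and it assumes the main vector loop consumes VF * UF elements per iteration with no tail folding. The helper names `remainderIterations` and `scalarTailAfterEpilogue` are made up for this example and are not LLVM APIs.

```cpp
#include <cstdint>

// Hypothetical helper, not an LLVM API: iterations left over after a main
// vector loop that consumes VF * UF elements per iteration (assumes no
// tail folding and no forced scalar epilogue iteration).
constexpr std::uint64_t remainderIterations(std::uint64_t tripCount,
                                            std::uint64_t vf,
                                            std::uint64_t uf) {
  return tripCount % (vf * uf);
}

// Hypothetical helper: iterations still left for the scalar loop if an
// epilogue vector loop of width epilogueVF runs on that remainder.
constexpr std::uint64_t scalarTailAfterEpilogue(std::uint64_t tripCount,
                                                std::uint64_t vf,
                                                std::uint64_t uf,
                                                std::uint64_t epilogueVF) {
  return remainderIterations(tripCount, vf, uf) % epilogueVF;
}

// Trip count 63 with VF = 4: without interleaving at most 3 iterations
// are left, but with UF = 4 the remainder grows to 15.
static_assert(remainderIterations(63, 4, 1) == 3);
static_assert(remainderIterations(63, 4, 4) == 15);
// An epilogue vector loop of width 4 would bring that back down to 3.
static_assert(scalarTailAfterEpilogue(63, 4, 4, 4) == 3);
```

With VF = 4 and UF = 4 the leftover can be as large as with VF = 16 and UF = 1, which is why a check that looks at VF alone can miss cases where an epilogue vector loop would help.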
https://github.com/llvm/llvm-project/pull/100385