[llvm] [AArch64] Set MaxInterleaving to 4 for Neoverse V2 (PR #100385)

David Green via llvm-commits llvm-commits at lists.llvm.org
Mon Aug 19 13:08:18 PDT 2024


davemgreen wrote:

> One issue with interleaving is that epilogue vecotrizaiton only considers VF, but not VF x UF. There are a number of cases where epilogue vectorization would be beneficial on AArch64 when the VF < 16 but UF > 1. @juliannagele is currently looking into adjusting the cost model for that.

I hadn't realised that we didn't account for the UF already. That sounds like a good thing to fix, thanks for the info. @juliannagele @fhahn do you have a timeframe for when such a patch will be ready? I would like to avoid the regressions if we can, and would not want to end up relying on a patch the never materializes. Otherwise perhaps @sjoerdmeijer could increase the limit for cases where the UF is 4, and we can improve it further in the future where needed.

https://github.com/llvm/llvm-project/pull/100385


More information about the llvm-commits mailing list