guy-david wrote: Unrolling doesn't quite fit. Unlike vectorization, the loads of the next iterations are not grouped together, because the loop's body is repeated so it's nontrivial to widen any of them. https://github.com/llvm/llvm-project/pull/173420