https://github.com/davemgreen commented: Can you make this a subtarget feature and add a test. I'm a little surprised this is profitable (or worth the codesize gain), but these cores do like extra unrolling. https://github.com/llvm/llvm-project/pull/169825