https://github.com/arsenm approved this pull request. LGTM with nits. I think the load/store costs need to be revisited, we're treating them as as cheap as cheap ALU instructions which they certainly are not (also ignoring unaligned cost) https://github.com/llvm/llvm-project/pull/134934