MacDue wrote: Maybe add zip2 as well? (In the 2/4-way lowerings I think `zip1, zip2`, would use half the registers two `zip1`s, which might be something we'd like to try). https://github.com/llvm/llvm-project/pull/79270