https://github.com/krzysz00 approved this pull request. Approved, assuming tests pass. (Side note: this version now works for `i4` and friends, so long as the bitwidth of the elements is a power of 2.) https://github.com/llvm/llvm-project/pull/135982