pasaulais wrote: Is it worth separating out the `i8` extraction part from these changes and keeping only the `v16i8` load part for now? I have not updated this PR with your suggestions yet, as it may need reworking anyway after your `v4i8` work: https://github.com/llvm/llvm-project/pull/67322