Artem-B wrote: A logical follow-up to this patch would be to replace bfe -> prmt, too for byte extraction. We should also take a look at v4i8 <-> v4i16 conversions. If we're not using prmt there, we should. https://github.com/llvm/llvm-project/pull/110766