krzysz00 wrote: I claim that subgroup reduction is a fundamental primitive separate from shuffles and that decomposing it should be a very low-level decision. (See also, SPIR-V has it as a primitive) https://github.com/llvm/llvm-project/pull/137109