[Mlir-commits] [mlir] [mlir][gpu] Pattern to promote `gpu.shuffle` to specialized AMDGPU ops (PR #137109)

Thu Apr 24 15:34:58 PDT 2025

Hardcode84 wrote:

> Why can't we emit gpu.subgroup_reduce instead of shuffle and use the lowering patterns from https://github.com/llvm/llvm-project/pull/133204 ? I'd like to understand why you are approaching it from the form that's already decomposed to shuffles.

We decompose reductions early in Wave, so by the time we reach this point it's just a bunch of `gpu.shuffle xor` + `arith` ops.

And in general, `gpu.shuffle` is a fundamental building block, so it would be nice to have a good lowering for it.
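For readers unfamiliar with the decomposed form, here is a rough Python simulation of a subgroup reduction expressed as xor shuffles plus arithmetic. It is only an illustrative sketch, not code from Wave or from this PR: the names `shuffle_xor` and `butterfly_reduce` are hypothetical, the subgroup is modeled as a plain list with one entry per lane, and addition stands in for whatever `arith` op the reduction uses.

```python
def shuffle_xor(values, offset):
    """Model of an xor-mode shuffle: lane i reads the value held by lane i ^ offset."""
    return [values[lane ^ offset] for lane in range(len(values))]


def butterfly_reduce(values):
    """Sum across all lanes using log2(width) xor-shuffle + add steps.

    Assumes the subgroup width is a power of two. After the last step
    every lane holds the full sum.
    """
    width = len(values)
    assert width > 0 and (width & (width - 1)) == 0, "width must be a power of two"
    offset = width // 2
    while offset >= 1:
        partner = shuffle_xor(values, offset)               # the shuffle step
        values = [a + b for a, b in zip(values, partner)]   # the arith step
        offset //= 2
    return values


# Usage: a 64-lane subgroup where lane i holds the value i.
result = butterfly_reduce(list(range(64)))
print(result[0], all(v == result[0] for v in result))
```

Each loop iteration corresponds to one shuffle/arith pair, which is the shape a shuffle-promotion pattern would see when the reduction has already been decomposed.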

I will probably add DPP support later, so the overall uplifting flow will be `shuffle` -> `swizzle` -> `dpp`.

https://github.com/llvm/llvm-project/pull/137109