[Mlir-commits] [mlir] [AMDGPU] Implement gpu.subgroup_reduce with DPP intrinsics on AMD GPUs (PR #133204)

Krzysztof Drewniak llvmlistbot at llvm.org
Tue Apr 15 08:14:56 PDT 2025


================
@@ -362,6 +365,131 @@ struct VectorSubgroupReduceToShuffles final
   unsigned shuffleBitwidth = 0;
   bool matchClustered = false;
 };
+
+Value createSubgroupDPPReduction(OpBuilder &b, Location loc, Value input,
+                                 gpu::AllReduceOperation mode,
+                                 const ClusterInfo &ci,
+                                 amdgpu::Chipset chipset) {
+  Value dppResult;
+  Value result = input;
+  const int allRows = 0xf;
+  const int allBanks = 0xf;
----------------
krzysz00 wrote:

Nit: these can be constexpr

https://github.com/llvm/llvm-project/pull/133204


More information about the Mlir-commits mailing list