[Mlir-commits] [mlir] [AMDGPU] Implement gpu.subgroup_reduce with DPP intrinsics on AMD GPUs (PR #133204)
Krzysztof Drewniak
llvmlistbot at llvm.org
Tue Apr 15 08:14:56 PDT 2025
================
@@ -246,6 +267,8 @@ gpu.module @kernels {
// CHECK-SHFL-LABEL: gpu.func @kernel5_clustered(
// CHECK-SHFL-SAME: %[[ARG0:.+]]: i16)
+ // CHECK-DPP-LABEL: gpu.func @kernel5_clustered
----------------
krzysz00 wrote:
Can we get some negative tests? I'd like to see that DPP is not used for strided clusters and other cases where it can't apply.
https://github.com/llvm/llvm-project/pull/133204