[Mlir-commits] [mlir] [AMDGPU] Implement gpu.subgroup_reduce with DPP intrinsics on AMD GPUs (PR #133204)
llvmlistbot at llvm.org
Wed Apr 2 02:05:25 PDT 2025
stefankoncarevic wrote:
Are there plans to make this implementation compatible with RDNA architectures, specifically Navi3 and Navi4?
If so, note that bcast_15 and bcast_31 are not supported on Navi3 and Navi4. Consider using permlane instead on these architectures.
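A rough sketch of the kind of target check this would imply, in case it helps the discussion. Every name here (`CrossRowStrategy`, `pickCrossRowStrategy`) is hypothetical and not taken from the PR or from any MLIR API. It also assumes that Navi3 maps to gfx11 and Navi4 to gfx12, and that all other targets keep the broadcast path, which would need checking.

```cpp
#include <cassert>

// Hypothetical names: not part of the PR or of any MLIR/LLVM API.
enum class CrossRowStrategy {
  DppRowBcast, // combine rows with the bcast_15 / bcast_31 DPP controls
  Permlane     // combine rows with permlane-style lane exchanges instead
};

// Picks how partial results are combined across rows of a subgroup.
// Assumption: Navi3 is gfx11 and Navi4 is gfx12, so majors 11 and 12
// take the permlane path.
// Assumption: every other target keeps the broadcast path. The comment
// above only covers Navi3 and Navi4, so this default is unverified.
CrossRowStrategy pickCrossRowStrategy(unsigned gfxMajor) {
  if (gfxMajor == 11 || gfxMajor == 12)
    return CrossRowStrategy::Permlane;
  return CrossRowStrategy::DppRowBcast;
}
```

A lowering could call such a helper once per pattern and branch on the result when emitting the final cross-row steps, leaving the in-row DPP steps shared between both paths.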
https://github.com/llvm/llvm-project/pull/133204