[Mlir-commits] [mlir] [mlir][gpu] Add `subgroup_broadcast` op (PR #152808)
Krzysztof Drewniak
llvmlistbot at llvm.org
Tue Aug 26 08:02:03 PDT 2025
================
@@ -171,6 +171,27 @@ struct GPUSubgroupSizeOpToROCDL : ConvertOpToLLVMPattern<gpu::SubgroupSizeOp> {
const amdgpu::Chipset chipset;
};
+struct GPUSubgroupBroadcastOpToROCDL
+ : public ConvertOpToLLVMPattern<gpu::SubgroupBroadcastOp> {
+ using ConvertOpToLLVMPattern::ConvertOpToLLVMPattern;
+
+ LogicalResult
+ matchAndRewrite(gpu::SubgroupBroadcastOp op, OpAdaptor adaptor,
----------------
krzysz00 wrote:
We'll want to make sure we don't need type mangling or duplication here - ex. I don't know if readfirstlane or readlane of <4 x i32> or <32 x i8> works natively
https://github.com/llvm/llvm-project/pull/152808
More information about the Mlir-commits
mailing list