[Mlir-commits] [mlir] [mlir][gpu] Support Cluster of Thread Blocks in `gpu.launch_func` (PR #72871)
llvmlistbot at llvm.org
llvmlistbot at llvm.org
Wed Nov 22 05:53:14 PST 2023
================
@@ -1065,6 +1081,11 @@ KernelDim3 LaunchFuncOp::getBlockSizeOperandValues() {
return KernelDim3{operands[3], operands[4], operands[5]};
}
+KernelDim3 LaunchFuncOp::getClusterSizeOperandValues() {
+ auto operands = getOperands().drop_front(getAsyncDependencies().size());
+ return KernelDim3{operands[6], operands[7], operands[8]};
----------------
qcolombet wrote:
Could we use `getClusterSizeX`, ..., here instead of hardcoding the indices?
**edit**: I see this is a common pattern in this file so fell free to ignore.
https://github.com/llvm/llvm-project/pull/72871
More information about the Mlir-commits
mailing list