[Mlir-commits] [mlir] [MLIR][GPU] Add gpu.cluster_dim_blocks Op to represent number of blocks per cluster (PR #95245)
Pradeep Kumar
llvmlistbot at llvm.org
Wed Jun 12 07:40:13 PDT 2024
================
@@ -86,6 +86,11 @@ static std::optional<uint64_t> getKnownLaunchDim(Op op, LaunchDims type) {
void ClusterDimOp::inferResultRanges(ArrayRef<ConstantIntRanges>,
SetIntRangeFn setResultRange) {
+ setResultRange(getResult(), getIndexRange(1, kMaxDim));
----------------
schwarzschild-radius wrote:
The PTX ISA or the CUDA Programming Guide does not mention of any upper bound other than the maximum portable cluster size which can be overriden using `CU_FUNC_ATTRIBUTE_NON_PORTABLE_CLUSTER_SIZE_ALLOWED`
https://github.com/llvm/llvm-project/pull/95245
More information about the Mlir-commits
mailing list