[all-commits] [llvm/llvm-project] a8b511: [MLIR][NVGPU] Fix the cga_cluster.mlir test (#112191)
Durgadoss R via All-commits
all-commits at lists.llvm.org
Mon Oct 14 07:14:36 PDT 2024
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: a8b51154417c7ead2e1bfffee0b70812270e5730
https://github.com/llvm/llvm-project/commit/a8b51154417c7ead2e1bfffee0b70812270e5730
Author: Durgadoss R <durgadossr at nvidia.com>
Date: 2024-10-14 (Mon, 14 Oct 2024)
Changed paths:
M mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp
M mlir/test/Integration/GPU/CUDA/sm90/cga_cluster.mlir
Log Message:
-----------
[MLIR][NVGPU] Fix the cga_cluster.mlir test (#112191)
This patch fixes the sm90 cluster test by:
* Fixing a typo in LowerGpuOpsToNVVMOps where one of the ClusterDim Op
conversion pattern should actually be for the
ClusterDimBlocks Op. This addresses the compilation error for this test.
* The grid-size should be (4,4,1) instead of (2,2,1). This passes the
scf-if check against the threshold of 3 below and actually
generates the required prints from the GPU.
Signed-off-by: Durgadoss R <durgadossr at nvidia.com>
To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications
More information about the All-commits
mailing list