[all-commits] [llvm/llvm-project] a8b511: [MLIR][NVGPU] Fix the cga_cluster.mlir test (#112191)

Mon Oct 14 07:14:36 PDT 2024

  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: a8b51154417c7ead2e1bfffee0b70812270e5730
      https://github.com/llvm/llvm-project/commit/a8b51154417c7ead2e1bfffee0b70812270e5730
  Author: Durgadoss R <durgadossr at nvidia.com>
  Date:   2024-10-14 (Mon, 14 Oct 2024)

  Changed paths:
    M mlir/lib/Conversion/GPUToNVVM/LowerGpuOpsToNVVMOps.cpp
    M mlir/test/Integration/GPU/CUDA/sm90/cga_cluster.mlir

  Log Message:
  -----------
  [MLIR][NVGPU] Fix the cga_cluster.mlir test (#112191)

This patch fixes the sm90 cluster test by:
* Fixing a typo in LowerGpuOpsToNVVMOps, where one of the ClusterDim Op
   conversion patterns should actually be for the
   ClusterDimBlocks Op. This addresses the compilation error for this test.
* Changing the grid-size from (2,2,1) to (4,4,1). With this change, the
   scf-if check against the threshold of 3 below passes and the GPU
   actually generates the required prints.

Signed-off-by: Durgadoss R <durgadossr at nvidia.com>
