[all-commits] [llvm/llvm-project] 13d623: [MLIR][NVGPU] Fix nvgpu_arrive syntax in matmulBui...

Durgadoss R via All-commits all-commits at lists.llvm.org
Fri Oct 25 22:46:11 PDT 2024


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 13d6233e77982f2a596922a79365373e1466a968
      https://github.com/llvm/llvm-project/commit/13d6233e77982f2a596922a79365373e1466a968
  Author: Durgadoss R <durgadossr at nvidia.com>
  Date:   2024-10-26 (Sat, 26 Oct 2024)

  Changed paths:
    M mlir/test/Integration/GPU/CUDA/sm90/python/tools/matmulBuilder.py

  Log Message:
  -----------
  [MLIR][NVGPU] Fix nvgpu_arrive syntax in matmulBuilder.py (#113713)

This patch updates the syntax for nvgpu_arrive Op
in matmulBuilder.py. This fixes the compilation
error for this test.

For the warp-specialized matmul_kernel implementation,
removing the WaitGroupSyncOp (after the mma-main-loop)
fixes the hang observed.

With these two fixes, the test compiles and
executes successfully on an sm90a machine.

Signed-off-by: Durgadoss R <durgadossr at nvidia.com>



To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications


More information about the All-commits mailing list