[all-commits] [llvm/llvm-project] 13d623: [MLIR][NVGPU] Fix nvgpu_arrive syntax in matmulBui...
Durgadoss R via All-commits
all-commits at lists.llvm.org
Fri Oct 25 22:46:11 PDT 2024
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: 13d6233e77982f2a596922a79365373e1466a968
https://github.com/llvm/llvm-project/commit/13d6233e77982f2a596922a79365373e1466a968
Author: Durgadoss R <durgadossr at nvidia.com>
Date: 2024-10-26 (Sat, 26 Oct 2024)
Changed paths:
M mlir/test/Integration/GPU/CUDA/sm90/python/tools/matmulBuilder.py
Log Message:
-----------
[MLIR][NVGPU] Fix nvgpu_arrive syntax in matmulBuilder.py (#113713)
This patch updates the syntax for nvgpu_arrive Op
in matmulBuilder.py. This fixes the compilation
error for this test.
For the warp-specialized matmul_kernel implementation,
removing the WaitGroupSyncOp (after the mma-main-loop)
fixes the hang observed.
With these two fixes, the test compiles and
executes successfully on an sm90a machine.
Signed-off-by: Durgadoss R <durgadossr at nvidia.com>
To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications
More information about the All-commits
mailing list