[Mlir-commits] [mlir] Extend GPU and NVVM mma ops to support fp64 (PR #165380)
Giacomo Castiglioni
llvmlistbot at llvm.org
Tue Oct 28 06:48:56 PDT 2025
================
@@ -463,6 +463,43 @@ llvm.func @nvvm_wmma_mma(%0 : i32, %1 : i32, %2 : i32, %3 : i32, %4 : i32, %5 :
llvm.return
}
+// CHECK-LABEL: @nvvm_wmma_load_a_f64
+llvm.func @nvvm_wmma_load_a_f64(%arg0: !llvm.ptr, %arg1 : i32) {
+ // CHECK: call double @llvm.nvvm.wmma.m8n8k4.load.a.row.stride.f64.p0(ptr %{{.*}}, i32 %{{.*}})
+ %0 = nvvm.wmma.load %arg0, %arg1
----------------
castigli wrote:
OK, I was wondering whether I should split it myself. This PR will be about the NVVM dialect; I'll open a separate one later for the GPU dialect.
https://github.com/llvm/llvm-project/pull/165380