[Mlir-commits] [mlir] Extend GPU and NVVM mma ops to support fp64	(PR #165380)
    Guray Ozen 
    llvmlistbot at llvm.org
       
    Tue Oct 28 06:36:02 PDT 2025
    
    
  
================
@@ -463,6 +463,43 @@ llvm.func @nvvm_wmma_mma(%0 : i32, %1 : i32, %2 : i32, %3 : i32, %4 : i32, %5 :
   llvm.return
 }
 
+// CHECK-LABEL: @nvvm_wmma_load_a_f64
+llvm.func @nvvm_wmma_load_a_f64(%arg0: !llvm.ptr, %arg1 : i32) {
+  // CHECK: call double @llvm.nvvm.wmma.m8n8k4.load.a.row.stride.f64.p0(ptr %{{.*}}, i32 %{{.*}})
+  %0 = nvvm.wmma.load %arg0, %arg1
----------------
grypp wrote:
I recommend splitting the PR into 2 pieces. First PR is about `f64` in nvvm dialect, and second PR could using them in gpu dialect.
https://github.com/llvm/llvm-project/pull/165380
    
    
More information about the Mlir-commits
mailing list