[all-commits] [llvm/llvm-project] 7c49ab: [MLIR][NVVM] Add `dot.accumulate.4way` OP (#139043)

Srinivasa Ravi via All-commits all-commits at lists.llvm.org
Mon May 12 00:04:22 PDT 2025


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 7c49ab0f62ba461ad38b021552b9ea727aa0b492
      https://github.com/llvm/llvm-project/commit/7c49ab0f62ba461ad38b021552b9ea727aa0b492
  Author: Srinivasa Ravi <srinivasar at nvidia.com>
  Date:   2025-05-12 (Mon, 12 May 2025)

  Changed paths:
    M mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
    M mlir/lib/Dialect/LLVMIR/IR/NVVMDialect.cpp
    M mlir/test/Dialect/LLVMIR/nvvm.mlir
    M mlir/test/Target/LLVMIR/nvvmir.mlir

  Log Message:
  -----------
  [MLIR][NVVM] Add `dot.accumulate.4way` OP  (#139043)

This change adds the `dot.accumulate.4way` Op to the NVVM dialect to
perform four-way byte dot product-accumulate operation.

PTX Spec Reference:
https://docs.nvidia.com/cuda/parallel-thread-execution/#integer-arithmetic-instructions-dp4a



To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications


More information about the All-commits mailing list