[all-commits] [llvm/llvm-project] 7c49ab: [MLIR][NVVM] Add `dot.accumulate.4way` OP (#139043)
Srinivasa Ravi via All-commits
all-commits at lists.llvm.org
Mon May 12 00:04:22 PDT 2025
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: 7c49ab0f62ba461ad38b021552b9ea727aa0b492
https://github.com/llvm/llvm-project/commit/7c49ab0f62ba461ad38b021552b9ea727aa0b492
Author: Srinivasa Ravi <srinivasar at nvidia.com>
Date: 2025-05-12 (Mon, 12 May 2025)
Changed paths:
M mlir/include/mlir/Dialect/LLVMIR/NVVMOps.td
M mlir/lib/Dialect/LLVMIR/IR/NVVMDialect.cpp
M mlir/test/Dialect/LLVMIR/nvvm.mlir
M mlir/test/Target/LLVMIR/nvvmir.mlir
Log Message:
-----------
[MLIR][NVVM] Add `dot.accumulate.4way` OP (#139043)
This change adds the `dot.accumulate.4way` Op to the NVVM dialect to
perform four-way byte dot product-accumulate operation.
PTX Spec Reference:
https://docs.nvidia.com/cuda/parallel-thread-execution/#integer-arithmetic-instructions-dp4a
To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications
More information about the All-commits
mailing list