[all-commits] [llvm/llvm-project] 41a07e: [mlir][sparse] recognize NVidia 2:4 type for matmu...
Aart Bik via All-commits
all-commits at lists.llvm.org
Tue Jan 2 14:44:39 PST 2024
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: 41a07e668c29e219ed2f26d61da8b6b3295ff967
https://github.com/llvm/llvm-project/commit/41a07e668c29e219ed2f26d61da8b6b3295ff967
Author: Aart Bik <39774503+aartbik at users.noreply.github.com>
Date: 2024-01-02 (Tue, 02 Jan 2024)
Changed paths:
M mlir/lib/Dialect/SparseTensor/Transforms/SparseGPUCodegen.cpp
M mlir/lib/ExecutionEngine/CudaRuntimeWrappers.cpp
A mlir/test/Dialect/SparseTensor/GPU/gpu_matmul24_lib.mlir
R mlir/test/Dialect/SparseTensor/GPU/gpu_matmul_lib_2to4.mlir
A mlir/test/Integration/Dialect/SparseTensor/GPU/CUDA/sm80-lt/sparse-matmul-2-4-hand.mlir
R mlir/test/Integration/Dialect/SparseTensor/GPU/CUDA/sm80-lt/sparse-matmul-2-4-lib-from-linalg.mlir
M mlir/test/Integration/Dialect/SparseTensor/GPU/CUDA/sm80-lt/sparse-matmul-2-4-lib.mlir
M mlir/test/Integration/Dialect/SparseTensor/GPU/CUDA/sm80-lt/sparse-matmul-2-4-prune.mlir
Log Message:
-----------
[mlir][sparse] recognize NVidia 2:4 type for matmul (#76758)
This removes the temporary DENSE24 attribute and replaces it with proper
recognition of dense-to-2:4 conversion. The compression will be
performed on the device prior to performing the matrix multiplication.
Note that we no longer need to start with the linalg version; we can
lift this to the proper named linalg op. Also renames some files to more
consistent names.
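
For readers unfamiliar with the 2:4 pattern named in the log message: it allows
at most two nonzeros in every group of four consecutive elements along a row.
The sketch below is only an illustration of that idea in plain Python. It is
not the code from this commit, and it does not reflect the actual device-side
compressed layout. The function names compress_2to4 and sparse_dot are
hypothetical and chosen for this example.

```python
def compress_2to4(row):
    """Compress one dense row that already satisfies 2:4 sparsity.

    Returns (values, positions): for each group of four elements, exactly
    two stored values and their positions (0..3) inside the group.
    Illustrative layout only; real hardware formats differ.
    """
    if len(row) % 4 != 0:
        raise ValueError("row length must be a multiple of 4")
    values, positions = [], []
    for start in range(0, len(row), 4):
        group = row[start:start + 4]
        kept = [i for i, v in enumerate(group) if v != 0]
        if len(kept) > 2:
            raise ValueError("group has more than two nonzeros; not 2:4")
        # Pad with zero-valued slots so every group stores exactly two entries.
        for i in range(4):
            if len(kept) == 2:
                break
            if i not in kept:
                kept.append(i)
        kept.sort()
        for i in kept:
            values.append(group[i])
            positions.append(i)
    return values, positions


def sparse_dot(values, positions, dense_col):
    """Dot product of a compressed 2:4 row with a dense column."""
    total = 0
    for k, (v, p) in enumerate(zip(values, positions)):
        group = k // 2  # two stored entries per group of four
        total += v * dense_col[4 * group + p]
    return total


if __name__ == "__main__":
    row = [1, 0, 2, 0, 0, 0, 0, 3]
    col = [1, 2, 3, 4, 5, 6, 7, 8]
    vals, pos = compress_2to4(row)
    # The compressed product should match the dense product.
    assert sparse_dot(vals, pos, col) == sum(a * b for a, b in zip(row, col))
```

The compressed row stores half as many values as the dense row plus a small
position index per value, which is what makes the pattern attractive for
matrix multiplication on hardware that supports it.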