[all-commits] [llvm/llvm-project] 4df01d: [mlir][sparse][gpu][nvidia] add pruning step and c...
Aart Bik via All-commits
all-commits at lists.llvm.org
Fri Jul 14 12:08:31 PDT 2023
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: 4df01dc27079b70d2fdec8d795e525b7955c60f7
https://github.com/llvm/llvm-project/commit/4df01dc27079b70d2fdec8d795e525b7955c60f7
Author: Aart Bik <ajcbik at google.com>
Date: 2023-07-14 (Fri, 14 Jul 2023)
Changed paths:
M mlir/lib/ExecutionEngine/CudaRuntimeWrappers.cpp
A mlir/test/Integration/Dialect/SparseTensor/GPU/CUDA/sm80-lt/sparse-matmul-2-4-prune.mlir
Log Message:
-----------
[mlir][sparse][gpu][nvidia] add pruning step and check to 2:4 matrix multiplication
(1) without the check, the results may silently be wrong, so check is needed
(2) add pruning step to guarantee 2:4 property
Note, in the longer run, we may want to split out the pruning step somehow,
or make it optional.
Reviewed By: K-Wu
Differential Revision: https://reviews.llvm.org/D155320
More information about the All-commits
mailing list