[all-commits] [llvm/llvm-project] c4ba84: [mlir][nvgpu] Fix packing accumlator matrix (#69316)
Guray Ozen via All-commits
all-commits at lists.llvm.org
Tue Oct 17 03:46:24 PDT 2023
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: c4ba84d6555148fb7469fd44412a49d9d66eb4cf
https://github.com/llvm/llvm-project/commit/c4ba84d6555148fb7469fd44412a49d9d66eb4cf
Author: Guray Ozen <guray.ozen at gmail.com>
Date: 2023-10-17 (Tue, 17 Oct 2023)
Changed paths:
M mlir/lib/Conversion/NVGPUToNVVM/NVGPUToNVVM.cpp
Log Message:
-----------
[mlir][nvgpu] Fix packing accumlator matrix (#69316)
The #68728 significantly simplified the accumulator matrix type, making
it easier to work with the nvgpu dialect without worrying about the
number of required structs, as this information is abstracted away in
the nvgpu-to-nvvm transformation.
However, we forgot packing the structs after initialization, causing the
accumulator matrix to hold undefined values, which is wrong. This PR
addresses that.
More information about the All-commits
mailing list