[all-commits] [llvm/llvm-project] c4ba84: [mlir][nvgpu] Fix packing accumlator matrix (#69316)

Guray Ozen via All-commits all-commits at lists.llvm.org
Tue Oct 17 03:46:24 PDT 2023


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: c4ba84d6555148fb7469fd44412a49d9d66eb4cf
      https://github.com/llvm/llvm-project/commit/c4ba84d6555148fb7469fd44412a49d9d66eb4cf
  Author: Guray Ozen <guray.ozen at gmail.com>
  Date:   2023-10-17 (Tue, 17 Oct 2023)

  Changed paths:
    M mlir/lib/Conversion/NVGPUToNVVM/NVGPUToNVVM.cpp

  Log Message:
  -----------
  [mlir][nvgpu] Fix packing accumlator matrix (#69316)

PR #68728 significantly simplified the accumulator matrix type, making
it easier to work with the nvgpu dialect without worrying about the
number of required structs, as this information is abstracted away in
the nvgpu-to-nvvm transformation.

However, we forgot to pack the structs after initialization, so the
accumulator matrix held undefined values. This PR adds the missing
packing step.
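
To illustrate the class of bug described above, here is a minimal sketch in
plain C++. It is not the code from NVGPUToNVVM.cpp and uses no MLIR APIs: the
types `Inner` and `Packed` and the helpers `insertElement`, `insertInner`,
`initAccumulator`, and `fullyDefined` are hypothetical stand-ins. It assumes
aggregates behave as immutable values, so an insert returns a new aggregate
and leaves its input untouched, and it models an undefined element as an
empty `std::optional`.

```cpp
#include <cstddef>
#include <optional>
#include <utility>
#include <vector>

// Hypothetical toy types, not MLIR: an empty optional models an undefined
// element; an accumulator is a struct of inner structs.
using Inner = std::vector<std::optional<float>>;
using Packed = std::vector<Inner>;

// Value-semantic insert: returns a new inner struct, the argument is a copy.
Inner insertElement(Inner agg, std::size_t idx, float value) {
  agg[idx] = value;
  return agg;
}

// Value-semantic insert of an inner struct into the outer struct.
Packed insertInner(Packed agg, std::size_t idx, Inner inner) {
  agg[idx] = std::move(inner);
  return agg;
}

// Zero-initializes every inner struct. When packBack is false, the
// initialized inner structs are never inserted back into the outer struct,
// which reproduces the "forgot to pack" situation.
Packed initAccumulator(std::size_t numStructs, std::size_t elemsPerStruct,
                       bool packBack) {
  Packed packed(numStructs, Inner(elemsPerStruct)); // everything undefined
  std::vector<Inner> inners;
  for (std::size_t i = 0; i < numStructs; ++i) {
    Inner s = packed[i]; // extract: a copy, not a reference
    for (std::size_t j = 0; j < elemsPerStruct; ++j)
      s = insertElement(s, j, 0.0f);
    inners.push_back(s);
  }
  if (packBack) {
    for (std::size_t i = 0; i < numStructs; ++i)
      packed = insertInner(packed, i, inners[i]);
  }
  return packed;
}

// True only if no element of the accumulator is still undefined.
bool fullyDefined(const Packed &p) {
  for (const Inner &s : p)
    for (const std::optional<float> &e : s)
      if (!e.has_value())
        return false;
  return true;
}
```

In this model, `initAccumulator(2, 4, false)` returns an accumulator whose
elements are all still undefined, while `initAccumulator(2, 4, true)` returns
one that is fully zero-initialized.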



