[all-commits] [llvm/llvm-project] 22a32f: [mlir][gpu] Add dump-ptx option

Thu Jul 13 12:15:10 PDT 2023

  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 22a32f7d9cb1beed4c37eed6ebea30c60f4d4a3b
      https://github.com/llvm/llvm-project/commit/22a32f7d9cb1beed4c37eed6ebea30c60f4d4a3b
  Author: Guray Ozen <guray.ozen at gmail.com>
  Date:   2023-07-13 (Thu, 13 Jul 2023)

  Changed paths:
    M mlir/include/mlir/Dialect/GPU/Transforms/Passes.h
    M mlir/lib/Dialect/GPU/Transforms/SerializeToCubin.cpp
    A mlir/test/Integration/Dialect/SparseTensor/GPU/CUDA/dump-ptx.mlir

  Log Message:
  -----------
  [mlir][gpu] Add dump-ptx option

When targeting NVIDIA GPUs, seeing the generated PTX is important. Currently, we don't have simple way to do it.

This work adds dump-ptx to gpu-to-cubin pass. One can use it like `gpu-to-cubin{chip=sm_90 features=+ptx80 dump-ptx}`.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D155166