[all-commits] [llvm/llvm-project] 22a32f: [mlir][gpu] Add dump-ptx option
Guray Ozen via All-commits
all-commits at lists.llvm.org
Thu Jul 13 12:15:10 PDT 2023
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: 22a32f7d9cb1beed4c37eed6ebea30c60f4d4a3b
https://github.com/llvm/llvm-project/commit/22a32f7d9cb1beed4c37eed6ebea30c60f4d4a3b
Author: Guray Ozen <guray.ozen at gmail.com>
Date: 2023-07-13 (Thu, 13 Jul 2023)
Changed paths:
M mlir/include/mlir/Dialect/GPU/Transforms/Passes.h
M mlir/lib/Dialect/GPU/Transforms/SerializeToCubin.cpp
A mlir/test/Integration/Dialect/SparseTensor/GPU/CUDA/dump-ptx.mlir
Log Message:
-----------
[mlir][gpu] Add dump-ptx option
When targeting NVIDIA GPUs, it is important to be able to inspect the generated PTX. Currently, there is no simple way to do so.
This change adds a dump-ptx option to the gpu-to-cubin pass. It can be used like `gpu-to-cubin{chip=sm_90 features=+ptx80 dump-ptx}`.
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D155166
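
The option string from the log message can be assembled programmatically when scripting a lowering flow. The sketch below is illustrative only: `pass_options` is a hypothetical helper, `input.mlir` is a placeholder file name, and nesting the pass under `builtin.module(gpu.module(...))` is an assumption about how the pass is anchored; only the `gpu-to-cubin{chip=sm_90 features=+ptx80 dump-ptx}` text itself comes from the commit.

```python
def pass_options(name, **opts):
    """Format an MLIR pass spec such as name{key=value flag}.

    Hypothetical helper, not part of MLIR. Underscores in keyword
    names become hyphens (dump_ptx -> dump-ptx).
    """
    parts = []
    for key, value in opts.items():
        key = key.replace("_", "-")
        if value is True:
            # Boolean option written bare, as in the commit message.
            parts.append(key)
        elif value is not False and value is not None:
            parts.append(f"{key}={value}")
    return f"{name}{{{' '.join(parts)}}}" if parts else name


# Pass spec exactly as given in the log message.
spec = pass_options("gpu-to-cubin", chip="sm_90", features="+ptx80", dump_ptx=True)

# Assumption: the pass runs on gpu.module ops nested in the top-level module.
pipeline = f"builtin.module(gpu.module({spec}))"

# Argument list suitable for subprocess.run; input.mlir is a placeholder.
command = ["mlir-opt", "input.mlir", f"--pass-pipeline={pipeline}"]

print(spec)
```

The resulting `command` list could be handed to `subprocess.run` on a machine with a CUDA-enabled MLIR build; where the PTX is printed is not stated in the log message.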