[all-commits] [llvm/llvm-project] 71bdd2: mlir/lib/Dialect/GPU/Transforms: improve context m...
Rohan Yadav via All-commits
all-commits at lists.llvm.org
Fri Oct 20 10:35:23 PDT 2023
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: 71bdd2c2380d3e0cc96898bd7908ac8c99d10a8f
https://github.com/llvm/llvm-project/commit/71bdd2c2380d3e0cc96898bd7908ac8c99d10a8f
Author: Rohan Yadav <rohany at alumni.cmu.edu>
Date: 2023-10-20 (Fri, 20 Oct 2023)
Changed paths:
M mlir/lib/Dialect/GPU/Transforms/SerializeToCubin.cpp
Log Message:
-----------
mlir/lib/Dialect/GPU/Transforms: improve context management in SerializeToCubin (#65779)
This commit adjusts the CUDA context management in the SerializeToCubin
pass. In particular, it uses the device 0 primary context instead of
creating a new CUDA context on each invocation of SerializeToCubin. This
yields very large improvements in compile time, especially if an
application (like a JIT compiler) is calling SerializeToCubin
repeatedly.
Differential Revision: https://reviews.llvm.org/D159487
Co-authored-by: Rohan Yadav <rohany at cs.stanford.edu>
More information about the All-commits
mailing list