[llvm] [mlir] [mlir][gpu] Change GPU modules to globals (PR #135478)
Christian Sigg via llvm-commits
llvm-commits at lists.llvm.org
Sun Apr 13 22:25:25 PDT 2025
chsigg wrote:
> do you have a pointer to what you have in mind here?
This environment variable controls lazy loading of kernels in the CUDA driver:
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#lazy-loading
The documentation is not consistent whether to use `CUDA_MODULE_LOADING` or `CUDA_MODULE_DATA_LOADING` (see e.g. [here](https://forums.developer.nvidia.com/t/the-kernel-warm-up-phase-in-gpu-packet-processing-example-and-concurrent-kernels/326034)). I checked that the former works though: with two different kernels, the test fails without env variable setting and passes with `CUDA_MODULE_LOADING=EAGER`.
https://github.com/llvm/llvm-project/pull/135478
More information about the llvm-commits
mailing list