[all-commits] [llvm/llvm-project] 54b5c7: [Offload] Use flat array for cuLaunchKernel (#95116)
Johannes Doerfert via All-commits
all-commits at lists.llvm.org
Wed Jun 12 23:44:08 PDT 2024
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: 54b5c76d3b98c864daf39c24cda35d4a03d60747
https://github.com/llvm/llvm-project/commit/54b5c76d3b98c864daf39c24cda35d4a03d60747
Author: Johannes Doerfert <johannes at jdoerfert.de>
Date: 2024-06-13 (Thu, 13 Jun 2024)
Changed paths:
M offload/include/Shared/APITypes.h
M offload/plugins-nextgen/amdgpu/src/rtl.cpp
M offload/plugins-nextgen/common/include/PluginInterface.h
M offload/plugins-nextgen/common/src/PluginInterface.cpp
M offload/plugins-nextgen/cuda/dynamic_cuda/cuda.h
M offload/plugins-nextgen/cuda/src/rtl.cpp
M offload/plugins-nextgen/host/src/rtl.cpp
Log Message:
-----------
[Offload] Use flat array for cuLaunchKernel (#95116)
We already used a flat array of kernel launch parameters for the AMD GPU
launch but now we also use this scheme for the NVIDIA GPU launch. The
only remaining/required use of the indirection is the host plugin (due
to ffi). This allows us to simplify the use for non-OpenMP kernel
launches.
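
For readers unfamiliar with the two argument layouts, here is a minimal sketch of the difference, not the code from this commit. A "flat" layout packs all kernel arguments back to back into one byte buffer. The "indirect" layout is an array holding one pointer per argument, which is the form libffi's ffi_call expects. The CUDA driver API can take a flat buffer through the `extra` parameter of cuLaunchKernel (CU_LAUNCH_PARAM_BUFFER_POINTER, CU_LAUNCH_PARAM_BUFFER_SIZE, CU_LAUNCH_PARAM_END); whether the commit uses exactly that path is an assumption here. The helper names `appendArg` and `makeIndirect` are hypothetical and do not exist in the offload plugins.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <type_traits>
#include <vector>

// Hypothetical helper (not from the offload plugins): copy one argument
// into the flat buffer at the next suitably aligned offset and return
// that offset.
template <typename T>
std::size_t appendArg(std::vector<unsigned char> &Flat, const T &Value) {
  static_assert(std::is_trivially_copyable<T>::value,
                "kernel arguments are copied bytewise");
  // Round the current size up to the alignment of T.
  std::size_t Offset =
      (Flat.size() + alignof(T) - 1) / alignof(T) * alignof(T);
  Flat.resize(Offset + sizeof(T)); // padding bytes are zero-initialized
  std::memcpy(Flat.data() + Offset, &Value, sizeof(T));
  return Offset;
}

// Hypothetical helper: derive the indirect form (one pointer per argument)
// from the flat buffer. Only a consumer that needs an array of pointers,
// such as an ffi-based host launch, would require this step.
std::vector<void *> makeIndirect(std::vector<unsigned char> &Flat,
                                 const std::vector<std::size_t> &Offsets) {
  std::vector<void *> Ptrs;
  Ptrs.reserve(Offsets.size());
  for (std::size_t Off : Offsets) {
    assert(Off < Flat.size() && "offset must lie inside the flat buffer");
    Ptrs.push_back(Flat.data() + Off);
  }
  return Ptrs;
}
```

In this sketch the flat buffer is the single source of truth and the pointer array is built on demand, so a launch path that accepts the flat form never has to construct the indirection. The pointers are only valid as long as the buffer is not resized afterwards.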