[all-commits] [llvm/llvm-project] 54b5c7: [Offload] Use flat array for cuLaunchKernel (#95116)

Wed Jun 12 23:44:08 PDT 2024

  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 54b5c76d3b98c864daf39c24cda35d4a03d60747
      https://github.com/llvm/llvm-project/commit/54b5c76d3b98c864daf39c24cda35d4a03d60747
  Author: Johannes Doerfert <johannes at jdoerfert.de>
  Date:   2024-06-13 (Thu, 13 Jun 2024)

  Changed paths:
    M offload/include/Shared/APITypes.h
    M offload/plugins-nextgen/amdgpu/src/rtl.cpp
    M offload/plugins-nextgen/common/include/PluginInterface.h
    M offload/plugins-nextgen/common/src/PluginInterface.cpp
    M offload/plugins-nextgen/cuda/dynamic_cuda/cuda.h
    M offload/plugins-nextgen/cuda/src/rtl.cpp
    M offload/plugins-nextgen/host/src/rtl.cpp

  Log Message:
  -----------
  [Offload] Use flat array for cuLaunchKernel (#95116)

We already used a flat array of kernel launch parameters for the AMD GPU
launch but now we also use this scheme for the NVIDIA GPU launch. The
only remaining/required use of the indirection is the host plugin (due
ot ffi). This allows to us simplify the use for non-OpenMP kernel
launch.

To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications