[Openmp-commits] [openmp] [clang] [OpenMP] Cleanup and fixes for ABI agnostic DeviceRTL (PR #71234)

Joseph Huber via Openmp-commits openmp-commits at lists.llvm.org
Fri Nov 3 14:28:10 PDT 2023


================
@@ -3086,10 +3139,14 @@ Error AMDGPUKernelTy::launchImpl(GenericDeviceTy &GenericDevice,
   // Only COV5 implicitargs needs to be set. COV4 implicitargs are not used.
   if (getImplicitArgsSize() == sizeof(utils::AMDGPUImplicitArgsTy)) {
     ImplArgs->BlockCountX = NumBlocks;
+    ImplArgs->BlockCountY = 1;
+    ImplArgs->BlockCountZ = 1;
     ImplArgs->GroupSizeX = NumThreads;
     ImplArgs->GroupSizeY = 1;
     ImplArgs->GroupSizeZ = 1;
     ImplArgs->GridDims = 1;
+    ImplArgs->HeapV1Ptr =
----------------
jhuber6 wrote:

Why do we need to set `HeapV1Ptr` here? I'm assuming its user is `__ockl_dm_alloc`, but we don't use that anywhere because it requires the AMD RPC implementation.

https://github.com/llvm/llvm-project/pull/71234

