[Openmp-commits] [PATCH] D135444: [OpenMP] Utilize the "non-uniform-workgroup" to simplify DeviceRTL

Johannes Doerfert via Phabricator via Openmp-commits openmp-commits at lists.llvm.org
Wed Nov 2 07:19:21 PDT 2022


jdoerfert added a comment.

@estewart08 Can you try the suggested fix below?



================
Comment at: openmp/libomptarget/DeviceRTL/src/Mapping.cpp:82
 
-uint32_t getNumberOfBlocks() {
-  return getGridDim(__builtin_amdgcn_grid_size_x(),
-                    __builtin_amdgcn_workgroup_size_x());
-}
+uint32_t getNumberOfBlocks() { return __builtin_amdgcn_grid_size_x(); }
 
----------------
This should be `__builtin_amdgcn_grid_size_x()/__builtin_amdgcn_workgroup_size_x()`. 


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D135444/new/

https://reviews.llvm.org/D135444



More information about the Openmp-commits mailing list