[Openmp-commits] [PATCH] D87084: [OpenMP][AMDGPU] Use DS_Max_Warp_Number instead of WARPSIZE

Pushpinder Singh via Phabricator via Openmp-commits openmp-commits at lists.llvm.org
Thu Sep 3 05:10:09 PDT 2020


pdhaliwal created this revision.
pdhaliwal added reviewers: JonChesterfield, ABataev, jdoerfert, grokos.
Herald added subscribers: openmp-commits, guansong, t-tye, tpr, dstuttard, yaxunl, kzhuravl.
Herald added a project: OpenMP.
pdhaliwal requested review of this revision.
Herald added subscribers: sstefan1, wdng.

The size of worker_rootS should have been DS_Max_Warp_Number.
This reduces memory usage by deviceRTL on AMDGPU from around 2.3GB
to around 770MB.


Repository:
  rG LLVM Github Monorepo

https://reviews.llvm.org/D87084

Files:
  openmp/libomptarget/deviceRTLs/common/omptarget.h


Index: openmp/libomptarget/deviceRTLs/common/omptarget.h
===================================================================
--- openmp/libomptarget/deviceRTLs/common/omptarget.h
+++ openmp/libomptarget/deviceRTLs/common/omptarget.h
@@ -252,7 +252,7 @@
       workDescrForActiveParallel; // one, ONLY for the active par
 
   ALIGN(16)
-  __kmpc_data_sharing_worker_slot_static worker_rootS[WARPSIZE];
+  __kmpc_data_sharing_worker_slot_static worker_rootS[DS_Max_Warp_Number];
   ALIGN(16) __kmpc_data_sharing_master_slot_static master_rootS[1];
 };
 


-------------- next part --------------
A non-text attachment was scrubbed...
Name: D87084.289691.patch
Type: text/x-patch
Size: 552 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/openmp-commits/attachments/20200903/72b7e0f3/attachment.bin>


More information about the Openmp-commits mailing list