[Openmp-commits] [openmp] [OpenMP][libomptarget] Enable parallel copies via multiple SDMA engines (PR #71801)

Joseph Huber via Openmp-commits openmp-commits at lists.llvm.org
Thu Nov 9 04:29:12 PST 2023


================
@@ -130,6 +130,40 @@ Error iterateAgentMemoryPools(hsa_agent_t Agent, CallbackTy Cb) {
                        "Error in hsa_amd_agent_iterate_memory_pools: %s");
 }
 
+/// Dispatches an asynchronous memory copy
+/// Enables different SDMA engines for the dispatch in a round-robin fashion.
+Error asyncMemCopy(void *Dst, hsa_agent_t DstAgent, const void *Src,
+                   hsa_agent_t SrcAgent, size_t Size, uint32_t NumDepSignals,
+                   const hsa_signal_t *DepSignals,
+                   hsa_signal_t CompletionSignal) {
+  static BoolEnvar OMPX_UseMultipleSdmaEngines{
----------------
jhuber6 wrote:

This will emit a global constructor which we really don't want to have scattered around in the plugin. We need to initialize this in the AMDGPUDeviceTy or somewhere else.

https://github.com/llvm/llvm-project/pull/71801


More information about the Openmp-commits mailing list