[Openmp-commits] [PATCH] D115279: [OpenMP][AMDGPU] Switch host-device memory copy to asynchronous version

Carlo Bertolli via Phabricator via Openmp-commits openmp-commits at lists.llvm.org
Wed Dec 8 10:19:15 PST 2021


carlo.bertolli added a comment.

In D115279#3179975 <https://reviews.llvm.org/D115279#3179975>, @ye-luo wrote:

> I know what happened to my machine. Some CMake change caused the offload plugins not to be compiled. Sigh. Broken upstream.
> My intention is to check the pinned status: before the first lock (not pinned), after the first lock (pinned), after the offload region (pinned), and after the unlock (unpinned).
> Could you also verify with a rocprof HSA trace that lock and unlock are each called twice?

That makes sense.
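
For reference, the four checkpoints above can be sketched as a small self-contained mock. This is only an illustration, not the plugin code and not the HSA API: `MockPinRegistry`, `mockOffloadRegion`, `PinTrace` and `runPinCheck` are hypothetical names, and the sketch assumes that nested locks on the same host pointer are reference-counted, which this thread does not confirm.

```cpp
#include <cassert>
#include <map>

// Hypothetical stand-in for the runtime's pinning bookkeeping (NOT the HSA API).
// Assumption: nested locks on the same pointer are reference-counted.
struct MockPinRegistry {
  std::map<const void *, int> Refs; // host pointer -> outstanding lock count
  int LockCalls = 0;
  int UnlockCalls = 0;

  void lock(const void *P) {
    ++LockCalls;
    ++Refs[P];
  }

  void unlock(const void *P) {
    ++UnlockCalls;
    auto It = Refs.find(P);
    assert(It != Refs.end() && "unlock without a matching lock");
    if (--It->second == 0)
      Refs.erase(It);
  }

  bool isPinned(const void *P) const { return Refs.count(P) != 0; }
};

// Hypothetical offload region: the data transfer locks and unlocks the
// buffer itself, in addition to the lock the user already holds.
void mockOffloadRegion(MockPinRegistry &R, const void *P) {
  R.lock(P);
  // ... the asynchronous host-device copy would be issued here ...
  R.unlock(P);
}

// Pinned status recorded at each of the four checkpoints, plus call counts.
struct PinTrace {
  bool BeforeLock, AfterLock, AfterRegion, AfterUnlock;
  int LockCalls, UnlockCalls;
};

PinTrace runPinCheck() {
  MockPinRegistry R;
  int Buf[4] = {0};
  PinTrace T{};
  T.BeforeLock = R.isPinned(Buf);  // before the first lock
  R.lock(Buf);                     // user lock
  T.AfterLock = R.isPinned(Buf);   // after the first lock
  mockOffloadRegion(R, Buf);       // second lock/unlock pair happens inside
  T.AfterRegion = R.isPinned(Buf); // after the offload region
  R.unlock(Buf);                   // user unlock
  T.AfterUnlock = R.isPinned(Buf); // after the unlock
  T.LockCalls = R.LockCalls;
  T.UnlockCalls = R.UnlockCalls;
  return T;
}
```

Under that reference-counting assumption, `runPinCheck()` reports the buffer as not pinned, pinned, pinned, unpinned at the four checkpoints, with two lock calls and two unlock calls. A real check would query the runtime and compare against an HSA trace instead of this mock.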

I ran it under gdb (with debug symbols for impl/impl.cpp in the plugin), and all calls to memory_lock/unlock return success.
I am now expanding dynamic_hsa to include the missing calls, following @JonChesterfield's suggestions.

Thanks!


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D115279/new/

https://reviews.llvm.org/D115279


