[Openmp-commits] [PATCH] D115279: [OpenMP][AMDGPU] Switch host-device memory copy to asynchronous version
Carlo Bertolli via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Wed Dec 8 10:19:15 PST 2021
carlo.bertolli added a comment.
In D115279#3179975 <https://reviews.llvm.org/D115279#3179975>, @ye-luo wrote:
> I know what happened to my machine. Some CMake change caused offload plugins are not compiled. Sign. broken upstream.
> My intention is to check pinned status. Before the first lock(not pinned), after the first lock(pinned), after the offload region(pinned), after the unlock(unpinned).
> Could you also verify with rocprof hsa trace that the lock and unlock are both called twice?
That makes sense.
I ran it with gdb (running with debug symbols for impl/impl.cpp in the plugin) and all calls to memory_lock/unlock return success.
I am now expanding dynamic_hsa to include the missing calls - following @JonChesterfield suggestions.
Thanks!
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D115279/new/
https://reviews.llvm.org/D115279
More information about the Openmp-commits
mailing list