[all-commits] [llvm/llvm-project] 840d0b: [amdgpu] D2D memcpy via streams and HSA (#69977)
Jon Chesterfield via All-commits
all-commits at lists.llvm.org
Mon Oct 23 16:05:18 PDT 2023
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: 840d0b7e03045c30b052dce8d0bb82f3fa0c5d6d
https://github.com/llvm/llvm-project/commit/840d0b7e03045c30b052dce8d0bb82f3fa0c5d6d
Author: Jon Chesterfield <JonChesterfield at users.noreply.github.com>
Date: 2023-10-24 (Tue, 24 Oct 2023)
Changed paths:
M openmp/libomptarget/plugins-nextgen/amdgpu/src/rtl.cpp
Log Message:
-----------
[amdgpu] D2D memcpy via streams and HSA (#69977)
hsa_amd_memory_async_copy can handle device to device copies if passed
the corresponding parameters.
No functional change - currently D2D copy goes through a fallback in
libomptarget that stages through a host malloc, after this it goes
directly through HSA.
Works under exactly the situations that HSA works. Verified locally on a
performance benchmark. Hoping to attract further testing from internal
developers after it lands.
More information about the All-commits
mailing list