[Openmp-commits] [openmp] [amdgpu] Implement D2D memcpy as HSA call (PR #69955)

Mon Oct 23 12:36:38 PDT 2023

jhuber6 wrote:

> > I'm guessing it's impossible to make a test for this? Could we do a D2D memcpy on the same device and make sure that they are the same via a `memcmp`?
> 
> It's already implemented (and possibly tested) on cuda but there's no functional change from what we have at the moment that bounces through host memory. It mostly doesn't have a test because I don't think and of the CI have multiple GPUs and I haven't spent the time working out how to write an openmp test which behaves sanely with one or two gpus in this context.

So we already have test coverage, but it's just falling back to a D2H + H2D copy instead, okay. I don't think the test suite is really set up in a way to use multiple devices, even if you're on a machine that supports it you'd need to use the `device` clause to check all of them. However a lot of testing could probably be simplified if we did `--offload-arch=native` and then just ran the test on each device, unrelated aside.

https://github.com/llvm/llvm-project/pull/69955