[Openmp-commits] [PATCH] D81989: [OpenMP] Introduce low level dependency process to target offloading
Shilei Tian via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Wed Jun 17 08:03:42 PDT 2020
tianshilei1992 added a comment.
In D81989#2098113 <https://reviews.llvm.org/D81989#2098113>, @JonChesterfield wrote:
> Is there some design documentation on this? It's tricky to distinguish intent from quirks of cuda.
>
> Amdgcn is built on the 'heterogenous system architecture' model which has a fair amount of support for managing graphs of tasks but also has challenging forward progress properties. I'm not immediately sure it would share much code with the nvptx implementation.
I'll add some documentation.
The high level idea is:
1. Add events to a queue. This operation is not blocking.
2. Add following operations into the queue.
3. Save the event from the second step.
Does AMD GCN support this pattern? The record event thing can be optional because I know some device RT generate the event when pushing an operation into a queue, like OpenCL.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D81989/new/
https://reviews.llvm.org/D81989
More information about the Openmp-commits
mailing list