[Openmp-commits] [PATCH] D81989: [OpenMP] Introduce low level dependency process to target offloading

Shilei Tian via Phabricator via Openmp-commits openmp-commits at lists.llvm.org
Wed Jun 17 08:03:42 PDT 2020

tianshilei1992 added a comment.

In D81989#2098113 <https://reviews.llvm.org/D81989#2098113>, @JonChesterfield wrote:

> Is there some design documentation on this? It's tricky to distinguish intent from quirks of cuda.
> Amdgcn is built on the 'heterogenous system architecture' model which has a fair amount of support for managing graphs of tasks but also has challenging forward progress properties. I'm not immediately sure it would share much code with the nvptx implementation.

I'll add some documentation.

The high level idea is:

1. Add events to a queue. This operation is not blocking.
2. Add following operations into the queue.
3. Save the event from the second step.

Does AMD GCN support this pattern? The record event thing can be optional because I know some device RT generate the event when pushing an operation into a queue, like OpenCL.

  rG LLVM Github Monorepo



More information about the Openmp-commits mailing list