[PATCH] D59331: [OpenMP][Offloading][3/3] OpenMP GPU kernel optimizations
Johannes Doerfert via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Wed Mar 13 13:59:24 PDT 2019
jdoerfert created this revision.
jdoerfert added reviewers: ABataev, arpith-jacob, guraypp, gtbercea, hfinkel.
jdoerfert added projects: OpenMP, LLVM.
NOTE: Also see the '[RFC] Late (OpenMP) GPU code "SPMD-zation"' email thread.
This patch introduces an LLVM optimization for target regions (TRegions)
that tries to enable SPMD-mode or use custom state machines for the
TRegion. TRegions have been introduced in https://reviews.llvm.org/D59319 .
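To illustrate what a "custom state machine" can buy over a generic one, here
is a minimal CPU-side sketch. It is a toy model written under assumptions, not
code from this patch: the names WorkFn, region_a, region_b,
generic_worker_step, custom_worker_step and simulate are all hypothetical, and
the reading that a customized state machine replaces the worker's indirect
call with direct calls to the parallel regions known at compile time is an
assumption about the technique, not something stated in the summary above.

```cpp
#include <cassert>
#include <vector>

// Hypothetical signature of an outlined parallel region.
// This is NOT the TRegion interface, only a stand-in for illustration.
using WorkFn = void (*)(int tid, std::vector<int> &out);

static void region_a(int tid, std::vector<int> &out) { out[tid] += 1; }
static void region_b(int tid, std::vector<int> &out) { out[tid] += 10; }

// Generic state machine step: the worker only receives a function pointer,
// so it has to perform an indirect call.
void generic_worker_step(WorkFn fn, int tid, std::vector<int> &out) {
  assert(fn != nullptr);
  fn(tid, out);
}

// Customized step (assumed technique): compare against the regions that are
// known statically and call them directly, which makes them inlinable.
// Unknown regions still go through the indirect-call fallback.
void custom_worker_step(WorkFn fn, int tid, std::vector<int> &out) {
  assert(fn != nullptr);
  if (fn == region_a)
    region_a(tid, out);
  else if (fn == region_b)
    region_b(tid, out);
  else
    fn(tid, out);
}

// Sequentially simulates num_threads workers that each execute a fixed queue
// of parallel regions using the given step function.
std::vector<int> simulate(void (*step)(WorkFn, int, std::vector<int> &),
                          int num_threads) {
  assert(num_threads > 0);
  std::vector<int> out(num_threads, 0);
  const WorkFn queue[] = {region_a, region_b, region_a};
  for (WorkFn fn : queue)
    for (int tid = 0; tid < num_threads; ++tid)
      step(fn, tid, out);
  return out;
}
```

Calling simulate(generic_worker_step, 4) and simulate(custom_worker_step, 4)
yields the same per-thread results; the two variants differ only in how the
call to the parallel region is dispatched.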
The interface is deliberately simple to be easily analyzable in the
middle end. Design decisions included:
- Hide all (complex) implementation choices in the runtime library but allow complete removal of the abstraction once the runtime is inlined.
- Provide all runtime calls with sufficient, easily encoded information.
- Make the LLVM optimization, even though tied to the TRegion interface, as general as possible.
Positive and negative examples for the LLVM optimization are provided in the
Repository:
  rG LLVM Github Monorepo