[Openmp-dev] OpenMP in LLVM Multi-company Telecom Meeting Minutes Feb 12th
Narayanaswamy, Ravi via Openmp-dev
openmp-dev at lists.llvm.org
Wed Feb 12 14:02:38 PST 2020
Next Meeting : Feb 26th
Opens:
- Discussion on automatically detecting target pragmas in the code without user using command line option -fopenmp-targets="..." and invoking the necessary host and target compilation.
Question was asked if there is documentation on how to use the existing -fopenmp-targets option. Alexey said there is a documentation explaining the option.
- Francesco wanted to know the fate of VecClone pass which Intel had proposed. Intel will look into the feedback on the original proposal and see what is needed to commit the functionality.
- Deepak asked about the issue with declaration conflicts in system header which is different for target and apps which uses the host header outside offload region.
This should be addressed when scoped declare variant is implemented.
Development Activity:
- The libomptarget has switched to using C++14 from C++11 for building the libraries. Now it is same as LLVM and allows us to use LLVM ADT.
- Support for multiple streams has been added so that multiple kernels can be executed concurrently. Each offload execution or data transfer is assigned to a stream in a round robin fashion.
At startup time 256 streams are created as default which can be controlled with an environment variable.
- Sheila continued working on un-shackled thread to enable asynchronous offload.
- Johannes has merged the OpenMPOpt transformation path. De-duplication of some openmp calls have been implemented.
Also attributed to the calls have been added, which could enable many optimization.
Implementation of "declare variant"
- Parser part for declare variant has been posted for review.
- Resolution for variant representation at call site has be agreed upon. Both the original function and the variant function will be represented in the AST
- Next plan on prototyping support for dynamic declare variant to provide feedback to OpenMP where this issue is being discussed for 5.1
User-defined mapper function status
- The functional implementation has been accepted.
- Alexey wants the changes to be split into multiple patches. Lingda will be working on splitting and committing the changes.
DeviceRTL redesign to support sharing code
- Trunk clang compiles for everything needed for AMD.
- Can target specific code be written in inline IR as it is not supported by Clang.
Best to typeup a list of builtins needed and add them to clang.
Roll Call :
Company
Attendees
8-Jan
29-Jan
12-Feb
AMD
Greg Rodgers
x
Ashwin Aji
Jan Sjodin
x
Ron Lieberman
x
sameer Sahasrabuddhe
Jonathan Chesterfield
x
x
x
Andrey Kasaurov
ANL
Hal Finkel
Michael Kruse
x
Johannes Doerfert
x
x
x
ARM
Andrzej Warzynski
x
Francesco Petrogalli
x
x
IBM
Alexandre Eichenberger
Carlo Bertolli
Kelvin Li
x
Doru
Alexey Bataev
x
x
x
Intel
Andrey Churbanov
Ravi Narayanaswamy
x
x
x
Serguei Dmitriev
Rajiv Deodhar
Lorri Menard
Terry Wilmarth
Rao, Prem
Hansang Bae
Xinmin Tian
Sirram Aananthakrishnan
x
Jeff Hammond
George Rokos
x
x
x
Cray
Deepak Eachempati
x
x
Micron
John Leidel
Nvidia
James Beyer
ORNL
Graham Lopez
Joel Denny
x
Geoffroy Vallee
Fady Ghanim
x
x
x
Oscar Hernandez
SBU/BNL
Lingda Li
x
Jose Monlsave
Vivek Kale
x
Martin Kong
TU Darmstadt
Lukas Sommer
x
x
BrookHavens
Abid Malik
x
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/openmp-dev/attachments/20200212/e773e2bb/attachment-0001.html>
More information about the Openmp-dev
mailing list