[Openmp-dev] OpenMP in LLVM Multi-company Telecom	Meeting	Minutes	Feb 12th
    Rokos, Georgios via Openmp-dev 
    openmp-dev at lists.llvm.org
       
    Wed Feb 12 18:22:59 PST 2020
    
    
  
I think QUEUES or DEVICE_QUEUES sounds good. Queue is a "neutral" name (i.e. it doesn't come from any particular architecture) and quite descriptive at the same time. Nvidia's documentation itself describes the stream as a "sequence of operations", so it can be called a queue.
OpenCL also calls them queues: https://www.khronos.org/registry/OpenCL/sdk/1.0/docs/man/xhtml/clCreateCommandQueue.html
OpenCL is supposed to be the architecture-agnostic programming standard, so that's another plus in favor of "queue".
LIBOMPTARGET_NUM_QUEUES
LIBOMPTARGET_NUM_DEVICE_QUEUES
LIBOMPTARGET_NUM_COMMAND_QUEUES
George
-----Original Message-----
From: Openmp-dev <openmp-dev-bounces at lists.llvm.org> On Behalf Of Doerfert, Johannes via Openmp-dev
Sent: Wednesday, 12 February, 2020 14:46
To: openmp-dev at lists.llvm.org; Narayanaswamy, Ravi <ravi.narayanaswamy at intel.com>
Subject: Re: [Openmp-dev] OpenMP in LLVM Multi-company Telecom Meeting Minutes Feb 12th
Quick update:
I looked in TR8 and unfortunately we do not specify a name for "streams". We only expose the device "context" and then you go from there.
Suggestions on how to rename the environment variable are welcome (see https://reviews.llvm.org/D74145#1871282 for context).
________________________________________
From: Openmp-dev <openmp-dev-bounces at lists.llvm.org> on behalf of Narayanaswamy, Ravi via Openmp-dev <openmp-dev at lists.llvm.org>
Sent: Wednesday, February 12, 2020 16:02
To: openmp-dev at lists.llvm.org
Subject: [Openmp-dev] OpenMP in LLVM Multi-company Telecom Meeting Minutes      Feb 12th
Next Meeting :  Feb 26th
Opens:
   -  Discussion on automatically detecting target pragmas in the code without user using command line option –fopenmp-targets=”…” and invoking the necessary host and target compilation.
        Question was asked if there is documentation on how to use the existing –fopenmp-targets option.    Alexey said there is a documentation explaining the option.
  -   Francesco wanted to know the fate of VecClone pass which Intel had proposed.   Intel will look into the  feedback on the original proposal and see what is needed to commit the functionality.
  -   Deepak asked about the issue with declaration conflicts in system header which is different for target and apps which uses the host header outside offload region.
       This should be addressed when scoped declare variant is implemented.
Development Activity:
   -   The libomptarget has switched to using  C++14 from C++11 for building the libraries.  Now it is same as LLVM and allows us to use LLVM ADT.
   -    Support for multiple streams has been added so that multiple kernels can be executed concurrently.   Each offload execution or data transfer is assigned to a stream  in a round robin fashion.
         At startup time 256 streams are created as default which can be controlled with an environment variable.
    - Sheila continued working  on un-shackled thread to enable asynchronous offload.
    -  Johannes has merged the OpenMPOpt transformation  path.  De-duplication of some openmp calls have been implemented.
        Also attributed to the calls have been added, which could enable many optimization.
Implementation of “declare variant”
    -    Parser part for declare variant has been posted for review.
    -    Resolution for variant representation at call site has be agreed upon.  Both the original function and the variant function will be represented in the AST
    -    Next plan on prototyping support for dynamic declare variant to provide feedback to OpenMP where this issue is being discussed for 5.1
User-defined mapper function status
    -   The functional implementation has been accepted.
    -    Alexey wants the changes to be split into multiple patches.  Lingda will be working on splitting and committing  the changes.
DeviceRTL redesign to support sharing code
    -   Trunk clang compiles for everything needed for AMD.
    -   Can target specific code be written in inline IR as it is not supported by Clang.
            Best to typeup a list of builtins needed and add them to clang.
Roll Call :
Company
Attendees
8-Jan
29-Jan
12-Feb
AMD
Greg Rodgers
x
Ashwin Aji
Jan Sjodin
x
Ron Lieberman
x
sameer Sahasrabuddhe
Jonathan Chesterfield
x
x
x
Andrey Kasaurov
ANL
Hal Finkel
Michael Kruse
x
Johannes Doerfert
x
x
x
ARM
Andrzej Warzynski
x
Francesco Petrogalli
x
x
IBM
Alexandre Eichenberger
Carlo Bertolli
Kelvin Li
x
Doru
Alexey Bataev
x
x
x
Intel
Andrey Churbanov
Ravi Narayanaswamy
x
x
x
Serguei Dmitriev
Rajiv Deodhar
Lorri Menard
Terry Wilmarth
Rao, Prem
Hansang Bae
Xinmin Tian
Sirram Aananthakrishnan
x
Jeff Hammond
George Rokos
x
x
x
Cray
Deepak Eachempati
x
x
Micron
John Leidel
Nvidia
James Beyer
ORNL
Graham Lopez
Joel Denny
x
Geoffroy Vallee
Fady Ghanim
x
x
x
Oscar Hernandez
SBU/BNL
Lingda Li
x
Jose Monlsave
Vivek Kale
x
Martin Kong
TU Darmstadt
Lukas Sommer
x
x
BrookHavens
Abid Malik
x
_______________________________________________
Openmp-dev mailing list
Openmp-dev at lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/openmp-dev
    
    
More information about the Openmp-dev
mailing list