[cfe-dev] [RFC] Moving (parts of) the Cling REPL in Clang

Hal Finkel via cfe-dev cfe-dev at lists.llvm.org
Fri Jul 10 14:10:39 PDT 2020


On 7/10/20 4:00 PM, JF Bastien wrote:
>
>
>> On Jul 10, 2020, at 1:55 PM, Hal Finkel <hfinkel at anl.gov> wrote:
>>
>>
>> On 7/10/20 1:57 PM, Vassil Vassilev wrote:
>>> On 7/10/20 6:43 AM, JF Bastien wrote:
>>>> I like cling, and having it integrated with the rest of the project 
>>>> would be neat. I agree with Hal’s suggestion to explain the design 
>>>> of what remains. It sounds like a pretty small amount of code.
>>>
>>>
>>> JF, Hal, did you mean you want a design document of how cling works 
>>> in general, or a design RFC for the patches we have? A design 
>>> document for cling would be quite large and would take us some time 
>>> to write up. OTOH, we could relatively easily give a rationale for 
>>> each patch.
>>
>>
>> I had in mind something that's probably in between. Something that 
>> explains the patches and enough about how they fit into a larger 
>> system that we can reason about the context.
>
> Maybe a purpose would be more useful to understand your request? I 
> assume you meant “I’d like us to understand what we’re signing up to 
> maintain, and why it’s useful to do things this way”. In particular, 
> if there’s undue burden in a particular component, and the code could 
> be changed to work differently with less support overhead, then we’d 
> want to identify this fact ahead of time.
>
> I’m guessing at what Hal is asking, LMK if that’s not what you had in 
> mind!


Yes. To understand how all of the pieces fit together to enable support 
for incremental compilation of C++ code. Once everything is in place, if 
I wanted to use the infrastructure to do some kind of incremental 
compilation of C++, what would I do? And what does the set of patches 
aim to do to get us there?

  -Hal


>
>
>>  -Hal
>>
>>
>>>
>>>
>>>>
>>>>
>>>>> On Jul 9, 2020, at 7:25 PM, Hal Finkel via cfe-dev 
>>>>> <cfe-dev at lists.llvm.org> wrote:
>>>>>
>>>>> I think that it would be great to have infrastructure for 
>>>>> incremental C++ compilation, supporting interactive use, 
>>>>> just-in-time compilation, and so on. I think that the best way to 
>>>>> deal with the patches, etc., as well as IncrementalAction, is to 
>>>>> first send an RFC explaining the overall design.
>>>>>
>>>>> -Hal
>>>>>
>>>>> On 7/9/20 3:46 PM, Vassil Vassilev via cfe-dev wrote:
>>>>>> Motivation
>>>>>> ===
>>>>>>
>>>>>> Over the last decade we have developed an interactive, 
>>>>>> interpretative C++ (aka REPL) as part of the high-energy physics 
>>>>>> (HEP) data analysis project -- ROOT [1-2]. We invested a 
>>>>>> significant effort to replace the CINT C++ interpreter with a 
>>>>>> newly implemented REPL based on llvm -- cling [3]. The cling 
>>>>>> infrastructure is a core component of the data analysis framework 
>>>>>> of ROOT and has been running in production for approximately 5 
>>>>>> years.
>>>>>>
>>>>>> Cling is also a standalone tool, which has a growing community 
>>>>>> outside of our field. Cling’s user community includes users in 
>>>>>> finance, biology and in a few companies with proprietary 
>>>>>> software. For example, there is a xeus-cling jupyter kernel [4]. 
>>>>>> One of the major challenges we face in fostering that community 
>>>>>> is that our cling-related patches live in llvm and clang forks. 
>>>>>> The benefits of using the LLVM community standards for code 
>>>>>> reviews, release cycles and integration have been mentioned a 
>>>>>> number of times by our "external" users.
>>>>>>
>>>>>> Last year we were awarded an NSF grant to improve cling's 
>>>>>> sustainability and make it a standalone tool. We thank the LLVM 
>>>>>> Foundation Board for supporting us with a non-binding letter of 
>>>>>> collaboration which was essential for getting this grant.
>>>>>>
>>>>>>
>>>>>> Background
>>>>>> ===
>>>>>>
>>>>>> Cling is a C++ interpreter built on top of clang and llvm. In a 
>>>>>> nutshell, it uses clang's incremental compilation facilities to 
>>>>>> process code chunk-by-chunk by assuming an ever-growing 
>>>>>> translation unit [5]. The code is then lowered into llvm IR and 
>>>>>> run by the llvm jit. Cling has implemented some language 
>>>>>> "extensions" such as executing statements at the global scope 
>>>>>> and error recovery. Cling is at the core of HEP -- it is heavily 
>>>>>> used during data analysis of exabytes of particle physics data 
>>>>>> coming from the Large Hadron Collider (LHC) and other particle 
>>>>>> physics experiments.
>>>>>>
>>>>>>
>>>>>> Plans
>>>>>> ===
>>>>>>
>>>>>> The project foresees three main directions -- move parts of cling 
>>>>>> upstream along with the clang and llvm features that enable them; 
>>>>>> extend and generalize the language interoperability layer around 
>>>>>> cling; and extend and generalize the OpenCL/CUDA support in 
>>>>>> cling. We are at the early stages of the project and this email 
>>>>>> intends to be an RFC for the first part -- upstreaming parts of 
>>>>>> cling. Please do share your thoughts on the rest, too.
>>>>>>
>>>>>>
>>>>>> Moving Parts of Cling Upstream
>>>>>> ---
>>>>>>
>>>>>> Over the years we have slowly moved some patches upstream. 
>>>>>> However, we still have around 100 patches in the clang fork. Most 
>>>>>> of them are in the context of extending the incremental 
>>>>>> compilation support for clang. Incremental compilation poses 
>>>>>> some challenges for the clang infrastructure. For example, we 
>>>>>> need to tune CodeGen to work with multiple llvm::Module 
>>>>>> instances, and to finalize at each end of translation unit (we 
>>>>>> have several of them). Other changes include small adjustments 
>>>>>> in the FileManager's caching mechanism, and bug fixes in the 
>>>>>> SourceManager (code which can be reached mostly from within our 
>>>>>> setup). One conclusion we can draw from our research is that the 
>>>>>> clang infrastructure adapts amazingly well to a use case which 
>>>>>> was not its main one. The grand total of our diffs against 
>>>>>> clang-9 is: `62 files changed, 1294 insertions(+), 231 
>>>>>> deletions(-)`. Cling is currently being upgraded from llvm-5 to 
>>>>>> llvm-9.
>>>>>>
>>>>>> A major weakness of cling's infrastructure is that it does not 
>>>>>> work with the clang Action infrastructure due to the lack of an 
>>>>>> IncrementalAction. A possible way forward would be to implement 
>>>>>> a clang::IncrementalAction as a starting point. This way we 
>>>>>> should be able to reduce the amount of setup necessary to use the 
>>>>>> incremental infrastructure in clang. However, this will be a bit 
>>>>>> of a testing challenge -- cling lives downstream and some of the 
>>>>>> new code may be impossible to pick up and use straight away. 
>>>>>> Building a mainline example tool such as clang-repl, which would 
>>>>>> give us a way to test the incremental case, or repurposing the 
>>>>>> already existing clang-interpreter may address the issue. The 
>>>>>> major risk of the task is landing code in the clang mainline 
>>>>>> which is untested by its HEP production environment.
>>>>>>
>>>>>> There are several other types of patches to the ROOT fork of 
>>>>>> Clang, including ones in the context of performance, towards C++ 
>>>>>> modules support (D41416), and storage (does not have a patch yet 
>>>>>> but has an open projects entry and somebody working on it). These 
>>>>>> patches can be considered in parallel, independently of the rest.
>>>>>>
>>>>>> Extend and Generalize the Language Interoperability Layer Around 
>>>>>> Cling
>>>>>> ---
>>>>>>
>>>>>> HEP has extensive experience with on-demand Python 
>>>>>> interoperability using cppyy [6], which is built around the type 
>>>>>> information provided by cling. Unlike tools with custom parsers 
>>>>>> such as swig and sip and tools built on top of C-APIs such as 
>>>>>> boost.python and pybind11, cling can provide information about 
>>>>>> memory management patterns (e.g. refcounting) and instantiate 
>>>>>> templates on the fly. We feel that functionality may not be of 
>>>>>> general interest to the llvm community but we will prepare 
>>>>>> another RFC and send it here later on to gather feedback.
>>>>>>
>>>>>>
>>>>>> Extend and Generalize the OpenCL/CUDA Support in Cling
>>>>>> ---
>>>>>>
>>>>>> Cling can incrementally compile CUDA code [7-8] allowing easier 
>>>>>> set up and enabling some interesting use cases. There are a 
>>>>>> number of planned improvements including talking to HIP [9] and 
>>>>>> SYCL to support more hardware architectures.
>>>>>>
>>>>>>
>>>>>>
>>>>>> The primary focus of our work is to upstream the functionality 
>>>>>> required to build an incremental compiler and to rework cling to 
>>>>>> build against vanilla clang and llvm. The last two points are to 
>>>>>> give the scope of the work which we will be doing over the next 
>>>>>> 2-3 years. We will send RFCs for both of them here to trigger 
>>>>>> technical discussion if there is interest in pursuing this 
>>>>>> direction.
>>>>>>
>>>>>>
>>>>>> Collaboration
>>>>>> ===
>>>>>>
>>>>>> Open source development nowadays relies on reviewers. LLVM is no 
>>>>>> different and we will probably disturb a good number of people in 
>>>>>> the community ;) We would like to invite anybody interested in 
>>>>>> joining our incremental C++ activities to our open calls, held 
>>>>>> every second week. Announcements will be made via the google 
>>>>>> group compiler-research-announce 
>>>>>> (https://groups.google.com/g/compiler-research-announce).
>>>>>>
>>>>>>
>>>>>>
>>>>>> Many thanks!
>>>>>>
>>>>>>
>>>>>> David & Vassil
>>>>>>
>>>>>> References
>>>>>> ===
>>>>>> [1] ROOT GitHub, https://github.com/root-project/root
>>>>>> [2] ROOT, https://root.cern
>>>>>> [3] Cling, https://github.com/root-project/cling
>>>>>> [4] Xeus-Cling, 
>>>>>> https://blog.jupyter.org/xeus-is-now-a-jupyter-subproject-c4ec5a1bf30b
>>>>>> [5] Cling – The New Interactive Interpreter for ROOT 6, 
>>>>>> https://iopscience.iop.org/article/10.1088/1742-6596/396/5/052071
>>>>>> [6] High-performance Python-C++ bindings with PyPy and Cling, 
>>>>>> https://dl.acm.org/doi/10.5555/3019083.3019087
>>>>>> [7] 
>>>>>> https://indico.cern.ch/event/697389/contributions/3085538/attachments/1712698/2761717/2018_09_10_cling_CUDA.pdf
>>>>>> [8] CUDA C++ in Jupyter: Adding CUDA Runtime Support to Cling, 
>>>>>> https://zenodo.org/record/3713753#.Xu8jqvJRXxU
>>>>>> [9] HIP Programming Guide, 
>>>>>> https://rocmdocs.amd.com/en/latest/Programming_Guides/HIP-GUIDE.html
>>>>>>
>>>>>> _______________________________________________
>>>>>> cfe-dev mailing list
>>>>>> cfe-dev at lists.llvm.org
>>>>>> https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev
>>>>> --
>>>>> Hal Finkel
>>>>> Lead, Compiler Technology and Programming Languages
>>>>> Leadership Computing Facility
>>>>> Argonne National Laboratory
>>>>>
>>>
>>>
>
-- 
Hal Finkel
Lead, Compiler Technology and Programming Languages
Leadership Computing Facility
Argonne National Laboratory
