[LLVMdev] [RFC] Parallelization metadata and intrinsics in LLVM (for OpenMP, etc.)
Hal Finkel
hfinkel at anl.gov
Wed Oct 3 10:49:00 PDT 2012
On Tue, 2 Oct 2012 14:28:25 +0000
"Adve, Vikram Sadanand" <vadve at illinois.edu> wrote:
> Hal, Andrey, Alexey,
>
> From the LLVM design viewpoint, there is a fundamental problem with
> both Hal's approach and the Intel approach: both are quite
> language-specific. OpenMP is a particular parallel language, with
> particular constructs (e.g., parallel regions) and semantics.
Also, after thinking about it, I object to this characterization.
OpenMP is a parallelization model and runtime interface (with some
specific language bindings pre-specified). There is very little in the
standard that is language specific. It would not be difficult to create
OpenMP for Python, or Go, etc. OpenMP covers both task-based and
region/loop-based parallelism. As far as choosing a conceptual
substrate on which to base LLVM's parallelism, using OpenMP would not
be a bad idea. If LLVM has parallelization support, such support will
also be a, "a particular parallel language, with particular constructs
(e.g., parallel regions) and semantics."
I think that there is a dangerous inclination to oversimply the
parallelization interface. While it is true that parallel algorithms
can often be modeled using a few distinct operators, implementing a
programming language this way is not a good idea. You would not, for
example, argue that C should eliminate all integer types except for the
largest. It is true that integer operations on shorter types can be
implemented in terms of the larger types combined with appropriate
masking operations. However, it would neither be convenient,
aesthetically pleasing, nor easy to optimize in practice.
-Hal
> LLVM
> is a language-neutral IR and infrastructure and OpenMP-specific
> concepts should not creep into it. I've included an excerpt from
> Hal's proposal below, which shows what I mean: the design is couched
> in terms of OpenMP parallel regions. Other parallel languages, e.g,
> Cilk, have no such notion. The latest Intel proposal is at least as
> OpenMP-specific.
>
> I do agree with the general goal of trying to support parallel
> programming languages in a more first-class manner in LLVM than we do
> today. But the right approach for that is to be as language-neutral
> as possible. For example, any parallelism primitives in the LLVM IR
> and any LLVM- or machine-level optimizations that apply to parallel
> code should be applicable not only to OpenMP but also to languages
> like Cilk, Java/C#, and several others. I think "libraries" like
> Intel's TBB should be supported, too: they have a (reasonably)
> well-defined semantics, just like languages do, and are become widely
> used.
>
> I also do not think LLVM metadata is the way to represent the
> primitives, because they are simply too fragile. But you don't have
> to argue that one with me :-), others have argued this already. You
> really need more first class, language-neutral, LLVM mechanisms for
> parallelism. I'm not pretending I know how to do this, though there
> are papers on the subject, including one from an Intel team (Pillar:
> A Parallel Implementation Language, LCPC 2007).
>
> --Vikram
> Professor, Computer Science
> University of Illinois at Urbana-Champaign
> http://llvm.org/~vadve
>
>
>
>
> > To mark this function as a parallel region, a module-level
> > 'parallel' metadata entry is created. The call site(s) of this
> > function are marked with this metadata,. The metadata has entries:
> > - The string "region"
> > - A reference to the parallel-region function
> > - If applicable, a list of metadata references specifying
> > special-handling child regions (parallel loops and
> > serialized/critical regions)
> >
> > If the special-handling region metadata is no longer referenced by
> > code within the parallel region, then the region has become
> > invalid, and will be removed (meaning all parallelization metadata
> > will be removed) by the ParallelizationCleanup. The same is true
> > for all other cross-referenced metadata below.
> >
> > Note that parallel regions can be nested.
> >
> > As a quick example, something like:
> > int main() {
> > int a;
> > #pragma omp parallel firstprivate(a)
> > do_something(a)
> > ...
> > }
> >
> > becomes something like:
> >
> > define private void @parreg(i32 %a) {
> > entry:
> > call void @do_something(i32 %a)
> > ret
> > }
> >
> > define i32 @main() {
> > entry:
> > ...
> > call void @parreg1(i32 %a) !parallel !0
> > ...
> >
> > !0 = metadata !{ metadata !"region", @parreg }
> >
>
>
> --Vikram
> Professor, Computer Science
> University of Illinois at Urbana-Champaign
> http://llvm.org/~vadve
>
>
>
>
>
>
--
Hal Finkel
Postdoctoral Appointee
Leadership Computing Facility
Argonne National Laboratory
More information about the llvm-dev
mailing list