[Openmp-commits] [PATCH] D51875: [OPENMP][NVPTX] Add support for lastprivates/reductions handling in SPMD constructs with lightweight runtime.
Jonas Hahnfeld via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Fri Sep 28 08:23:04 PDT 2018
Hahnfeld added a comment.
In https://reviews.llvm.org/D51875#1249162, @ABataev wrote:
> In https://reviews.llvm.org/D51875#1249159, @Hahnfeld wrote:
> > In https://reviews.llvm.org/D51875#1249153, @ABataev wrote:
> > > Say, last distribute chunk is `[L, U]`. In the inner `for` directive it is split into `[L,U1], [U1+1, U2], ..., [Un-1 + 1, U]`. `Distribute` marks all these chunks as last, not the last `[Un-1 + 1, U]`.
> > I got that. This is why the outer `distribute` only passes the global address for its last chunk. Then the inner `for` decides which thread executes `[Un-1 + 1, U]` and writes the lastprivate value.
> Yes, that's right! You got it.
So now you are agreeing to "my" solution which is different than what Clang currently does - I'm confused.
More information about the Openmp-commits