[Openmp-commits] [PATCH] D65013: [OPENMP][NVPTX]Fix parallel level counter in Cuda 9.0.

Alexey Bataev via Phabricator via Openmp-commits openmp-commits at lists.llvm.org
Tue Jul 23 01:51:23 PDT 2019


ABataev added a comment.

In D65013#1596821 <https://reviews.llvm.org/D65013#1596821>, @jdoerfert wrote:

> I'm confused, partly about the "convergent" part.
>
> The code looks vastly different but no tests are affected?
>  Could you please point out how to reproduce the problem?
>  Where did the shuffles go?
>  Why is there a threadfence and syncwrap now?
>  Which old accesses were problematic and why?


There is a problem with at least 1 test in Cuda 9+: spmd_parallel_regions.cpp. To fix this problem we need 3 things: fix the test itself (see D65112 <https://reviews.llvm.org/D65112>), fix the runtime part (this patch) and fix the handling of critical sections in compiler (the 3rd patch that depends on this one).


Repository:
  rOMP OpenMP

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D65013/new/

https://reviews.llvm.org/D65013





More information about the Openmp-commits mailing list