[Openmp-commits] [PATCH] D62393: [OPENMP][NVPTX]Mark parallel level counter as volatile.
Alexey Bataev via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Sat Oct 5 14:59:48 PDT 2019
ABataev added a comment.
In D62393#1696341 <https://reviews.llvm.org/D62393#1696341>, @jdoerfert wrote:
> So this changed over time, we for sure need to update the name and maybe commit message.
>
> As @JonChesterfield pointed out, the wrapping of the accesses in function calls is fine.
> The two functions with inline assembly should live in the target_impl.h though.
>
> At some point it was said this fixes a problem that only occurs with D62318 <https://reviews.llvm.org/D62318> applied.
> Is this still the case? If not, we need to specify exactly how we can reproduce this issue, e.g., in a comment next to the target offload region.
Yes, it still the case. The cuda8 compiler is too optimistic and makes some incorrect optimizations with D62318 <https://reviews.llvm.org/D62318>. This patch prevents these optimizations. Cuda9 and later have no such problems.
Repository:
rOMP OpenMP
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D62393/new/
https://reviews.llvm.org/D62393
More information about the Openmp-commits
mailing list