[Openmp-commits] [PATCH] D62393: [OPENMP][NVPTX]Mark parallel level counter as volatile.

Alexey Bataev via Phabricator via Openmp-commits openmp-commits at lists.llvm.org
Tue Oct 8 07:48:17 PDT 2019


ABataev added a comment.

In D62393#1699629 <https://reviews.llvm.org/D62393#1699629>, @jlebar wrote:

> > No, I don't try to add a workaround for the bug in LLVM since there are no problems in LLVM optimizations. The same code works correctly with Cuda9.2. But I would like to add it for cuda8 since we need to fully support it unless we drop the support for cuda8.
>
> Based on this and my comment above, I strongly oppose this patch.
>
> ptxas from CUDA 8 has many, many known bugs.  I oppose working around them in LLVM; it will eat up a huge amount of maintainers' time, and you will never cover all or even many of the bugs.
>
> If you want to un-support CUDA 8 in LLVM, fine by me.  If in your own personal setup you want to use the ptxas from CUDA 10.1 with the rest of the CUDA 8 toolkit, that should also work.


Fine, then no more optimizations in the runtime unless we drop support for Cuda8.


Repository:
  rOMP OpenMP

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D62393/new/

https://reviews.llvm.org/D62393





More information about the Openmp-commits mailing list