[Openmp-commits] [PATCH] D103648: [OpenMP] libomp: fix dynamic loop dispatcher
Andrey Churbanov via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Fri Aug 13 06:01:45 PDT 2021
AndreyChurbanov added inline comments.
================
Comment at: openmp/runtime/src/kmp_dispatch.cpp:911
+ gtid, my_buffer_index, sh->buffer_index));
+ __kmp_wait<kmp_uint32>(&sh->buffer_index, my_buffer_index,
+ __kmp_eq<kmp_uint32> USE_ITT_BUILD_ARG(NULL));
----------------
rogfer01 wrote:
> Sometimes (not always, so it seems a data race) running this test in an Arm 64-bit machine with 46 cores (and in a Power9 machine with 40 cores) all the threads end waiting here, so the test doesn't progress anymore.
>
> All the cases I've seen happen with `KMP_DISP_NUM_BUFFERS=3` and `-DMY_SCHEDULE=guided`.
>
> Any idea how I could debug this further?
>
> A quick look about `sh->buffer_index` shows it is a `volatile` and it is updated in
>
> ```
> sh->buffer_index += __kmp_dispatch_num_buffers;
> KD_TRACE(100, ("__kmp_dispatch_next: T#%d change buffer_index:%d\n",
> gtid, sh->buffer_index));
>
> KMP_MB(); /* Flush all pending memory write invalidates. */
> ```
>
> Given that this is not an atomic operation (yet it goes followed by a memory barrier) my only hypothesis is that the original load of `sh->buffer_index` might have read an old value but that would suggest `KMP_MB()` is not effective in these targets? So I am at loss here.
>
> Thanks!
Which test(s) you see hanging?
There indeed may be a data race somewhere in the code, I will try to take a look. Still better to know the exact test case.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D103648/new/
https://reviews.llvm.org/D103648
More information about the Openmp-commits
mailing list