[Openmp-dev] Failed to read local memory on device

Itaru Kitayama via Openmp-dev openmp-dev at lists.llvm.org
Sun Feb 2 17:31:20 PST 2020


Alexey,
Am still seeing the issue on POWER8 at JSC:
[New Thread 0x10008b9ff180 (LWP 79809)]
[Switching focus to CUDA kernel 0, grid 2, block (0,0,0), thread (0,0,0),
device 0, sm 0, warp 1, lane 0]
0x00000000145f0868 in
__omp_offloading_3b_8077e6c5__ZN4nest20EventDeliveryManager15deliver_events_INS_9SpikeDataEEEbiRKSt6vectorIT_SaIS4_EE_l666<<<(1,1,1),(128,1,1)>>>
(
    nranks=<error reading variable: Error: Failed to read local memory at
address 0xc8f0308 on device 0 sm 0 warp 1 lane 0,
error=CUDBG_ERROR_INVALID_MEMORY_ACCESS(0x8).
>,
    vla=<error reading variable: Error: Failed to read local memory at
address 0xc8f0310 on device 0 sm 0 warp 1 lane 0,
error=CUDBG_ERROR_INVALID_MEMORY_ACCESS(0x8).
>,
    r_buf=<error reading variable: Error: Failed to read local memory at
address 0xc8f0318 on device 0 sm 0 warp 1 lane 0,
error=CUDBG_ERROR_INVALID_MEMORY_ACCESS(0x8).
>,
    send_recv_count_spike_data_per_rank=<error reading variable: Error:
Failed to read local memory at address 0xc8f0320 on device 0 sm 0 warp 1
lane 0, error=CUDBG_ERROR_INVALID_MEMORY_ACCESS(0x8).
>,
    are_others_completed=<error reading variable: Error: Failed to read
local memory at address 0xc8f0328 on device 0 sm 0 warp 1 lane 0,
error=CUDBG_ERROR_INVALID_MEMORY_ACCESS(0x8).
>,
    spike_data=<error reading variable: Error: Failed to read local memory
at address 0xc8f0330 on device 0 sm 0 warp 1 lane 0,
error=CUDBG_ERROR_INVALID_MEMORY_ACCESS(0x8).
>,
    tid=<error reading variable: Error: Failed to read local memory at
address 0xc8f0338 on device 0 sm 0 warp 1 lane 0,
error=CUDBG_ERROR_INVALID_MEMORY_ACCESS(0x8).
>,
    se=<error reading variable: Error: Failed to read local memory at
address 0xc8f0340 on device 0 sm 0 warp 1 lane 0,
error=CUDBG_ERROR_INVALID_MEMORY_ACCESS(0x8).
>,
    vla=<error reading variable: Error: Failed to read local memory at
address 0xc8f0310 on device 0 sm 0 warp 1 lane 0,
error=CUDBG_ERROR_INVALID_MEMORY_ACCESS(0x8).
>,
    prepared_timestamps=<error reading variable: Error: Failed to read
local memory at address 0xc8f0350 on device 0 sm 0 warp 1 lane 0,
error=CUDBG_ERROR_INVALID_MEMORY_ACCESS(0x8).
>,

How do I work-around this?
Thanks,
Itaru.

On Fri, Jan 17, 2020 at 12:46 PM Alexey Bataev <a.bataev at outlook.com> wrote:

> There is a small issue with the debig info, will try to fix it soon. You
> can make a single step and after that you should be in the context of the
> function with correct debug info. (such functions has suffix _debug__).
>
> Best regards,
> Alexey Bataev
>
> 16 янв. 2020 г., в 15:38, Itaru Kitayama <itaru.kitayama at gmail.com>
> написал(а):
>
> 
> Sure, if I code it in a sample code that works (offloaded properly), as
> expected.
>
> On Fri, Jan 17, 2020 at 10:37 AM Itaru Kitayama <itaru.kitayama at gmail.com>
> wrote:
>
>> In CUDA-GDB.
>>
>> On Fri, Jan 17, 2020 at 10:34 AM Alexey Bataev <a.bataev at outlook.com>
>> wrote:
>>
>>> This is what you see in cuda-gdb? Or at the runtime when you run the
>>> program directly?
>>>
>>> -------------
>>> Best regards,
>>> Alexey Bataev
>>>
>>> 16.01.2020 3:22 PM, Itaru Kitayama via Openmp-dev пишет:
>>>
>>> Hi,
>>> Adding this small loop to my app:
>>>
>>> #pragma omp target parallel for map(to: nranks)
>>>         for (int i=0;i<0;i++) {
>>>         printf("%d\n", nranks);
>>>         }
>>>
>>> leads to a run-time error on device. See below:
>>>
>>> [New Thread 0x10008b9ff180 (LWP 103236)]
>>> [Switching focus to CUDA kernel 0, grid 2, block (0,0,0), thread
>>> (0,0,0), device 0, sm 0, warp 0, lane 0]
>>> 0x0000000014588068 in
>>> __omp_offloading_3b_801278cf__ZN4nest20EventDeliveryManager15deliver_events_INS_9SpikeDataEEEbiRKSt6vectorIT_SaIS4_EE_l666<<<(1,1,1),(128,1,1)>>>
>>> (
>>>     nranks=<error reading variable: Error: Failed to read local memory
>>> at address 0xc8ee158 on device 0 sm 0 warp 0 lane 0,
>>> error=CUDBG_ERROR_INVALID_MEMORY_ACCESS(0x8).
>>> >) at
>>> /p/project/cpcp0/pcp0151/projects/nest-simulator/nestkernel/event_delivery_manager.cpp:667
>>> 667 for (int i=0;i<0;i++) {
>>>
>>> I am using the latest trunk Clang for this.
>>>
>>> _______________________________________________
>>> Openmp-dev mailing listOpenmp-dev at lists.llvm.orghttps://lists.llvm.org/cgi-bin/mailman/listinfo/openmp-dev
>>>
>>>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/openmp-dev/attachments/20200203/804a7af2/attachment.html>


More information about the Openmp-dev mailing list