[Parallel_libs-commits] [PATCH] D24528: [SE] Pack global dev handle addresses
Jason Henline via Parallel_libs-commits
parallel_libs-commits at lists.llvm.org
Tue Sep 13 16:16:51 PDT 2016
jhen created this revision.
jhen added a reviewer: jlebar.
jhen added subscribers: parallel_libs-commits, jprice.
Herald added a subscriber: jlebar.
We were packing global device memory handles in
`PackedKernelArgumentArray`, but as I was implementing the CUDA
platform, I realized that CUDA wants the address of the handle, not the
handle itself. So this patch switches to packing the address of the
handle.
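To illustrate the difference, here is a minimal sketch, not the code from this patch. `DeviceBuffer`, `packAddresses`, and `readPointerArgument` are hypothetical names made up for the example, and the "launcher" is a host-side stand-in that assumes the CUDA-style convention where each argument slot points at the memory from which the argument value is copied.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical stand-in for a global device memory object. It only stores an
// opaque handle; the real StreamExecutor class is not reproduced here.
class DeviceBuffer {
public:
  explicit DeviceBuffer(const void *Handle) : Handle(Handle) {}

  // The handle itself (what was packed before this change).
  const void *getHandle() const { return Handle; }

  // The address of the handle (what gets packed after this change).
  const void *getHandleAddress() const { return &Handle; }

private:
  const void *Handle;
};

// Packs one slot per argument. Each slot points AT the handle rather than
// holding the handle value.
std::vector<const void *> packAddresses(const DeviceBuffer &X,
                                        const DeviceBuffer &Y) {
  return {X.getHandleAddress(), Y.getHandleAddress()};
}

// Mimics a launcher that copies a pointer-sized argument out of the memory
// that the slot points to (an assumption modeled on CUDA-style launches).
const void *readPointerArgument(const std::vector<const void *> &Slots,
                                std::size_t Index) {
  const void *Value = nullptr;
  std::memcpy(&Value, Slots[Index], sizeof(Value));
  return Value;
}
```

With this layout, a launcher that dereferences each slot recovers the original handle. The packed addresses are only valid while the `DeviceBuffer` objects are alive, so the buffers must outlive the packed array.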
https://reviews.llvm.org/D24528
Files:
streamexecutor/examples/HostSaxpy.cpp
streamexecutor/include/streamexecutor/DeviceMemory.h
streamexecutor/include/streamexecutor/PackedKernelArgumentArray.h
streamexecutor/unittests/CoreTests/PackedKernelArgumentArrayTest.cpp
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D24528.71256.patch
Type: text/x-patch
Size: 6182 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/parallel_libs-commits/attachments/20160913/dc247ef0/attachment-0001.bin>