[Openmp-commits] [PATCH] D91746: [libomptarget] Fail on missing symbols in device image
Jon Chesterfield via Phabricator via Openmp-commits
openmp-commits at lists.llvm.org
Wed Nov 18 15:20:05 PST 2020
JonChesterfield added a comment.
Cuda reads from _exec_mode, and writes to omptarget_device_environment, after the image has been loaded onto the gpu. Amdgpu read/writes the device image before it is loaded onto the gpu to avoid the extra round trip. This means amdgpu has an elf symbol table parsing function and makes a possibly avoidable copy of the device image (I haven't checked whether libomptarget handles a plugin mutating the image gracefully). Do we want the same optimisation for cuda?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D91746/new/
https://reviews.llvm.org/D91746
More information about the Openmp-commits
mailing list