[cfe-dev] [RFC][OpenMP] Usability improvement, allow dropping offload targets
Dmitriev, Serguei N via cfe-dev
cfe-dev at lists.llvm.org
Tue Jul 31 15:09:34 PDT 2018
1. I think there is no interaction with that patch. As I understand it mainly solves the problem of supporting OpenMP offloads in static libraries for NVPTX target, but that is a different problem. Doru will correct me if my understanding in wrong.
2. Yes, it is possible to solve link problem that way. Alexey has already suggested such change (and I guess has committed such fix), but I believe it would be a short term solution and offload initialization will still require changes in future. I have explained my concerns in the response to Alexey's email.
From: Hal Finkel [mailto:hfinkel at anl.gov]
Sent: Tuesday, July 31, 2018 10:12 AM
To: Dmitriev, Serguei N <serguei.n.dmitriev at intel.com>; cfe-dev at lists.llvm.org
Cc: gheorghe-teod.bercea at ibm.com; Alexey Bataev <a.bataev at hotmail.com>
Subject: Re: [cfe-dev] [RFC][OpenMP] Usability improvement, allow dropping offload targets
I agree that this is a problem worth solving. Two questions:
1. Is there any interaction between your proposal and https://reviews.llvm.org/D47394, and if so, how might that work?
2. Might it be possible to solve this problem by providing weak versions of the required symbols, and then having the linker find the strong versions when linking for each target requested when linking the executable?
On 07/30/2018 06:50 PM, Dmitriev, Serguei N via cfe-dev wrote:
> The existing OpenMP offloading implementation in clang does not allow
> dropping offload targets at link time. That is, if an object file is
> created with one set of offload targets you must use exactly the same
> set of offload targets at the link stage. Otherwise, linking will fail
> $ clang -fopenmp
> -fopenmp-targets=x86_64-pc-linux-gnu,nvptx64-nvidia-cuda foo.c -c $
> clang -fopenmp -fopenmp-targets=x86_64-pc-linux-gnu foo.o
> /tmp/foo-dd79f7.o:(.rodata..omp_offloading.device_images[.omp_offloading.descriptor_reg]+0x20): undefined reference to `.omp_offloading.img_start.nvptx64-nvidia-cuda'
> /tmp/foo-dd79f7.o:(.rodata..omp_offloading.device_images[.omp_offloading.descriptor_reg]+0x28): undefined reference to `.omp_offloading.img_end.nvptx64-nvidia-cuda'
> clang-7: error: linker command failed with exit code 1 (use -v to see
> invocation) $
> This limits OpenMP offload usability. So far, this has not been a high
> priority issue but the importance of this problem will grow once clang
> offload starts supporting static libraries with offload functionality.
> For instance, this limitation won't allow creating general purpose
> static libraries targeting multiple types of offload devices and later
> linking them into a program that uses only one offload target.
> Problem description
> Offload targets cannot be dropped at the link phase because object
> files produced by the compiler for the host have dependencies on the
> offload targets specified during compilation. These dependencies arise
> from the offload initialization code.
> The clang front-end adds offload initialization code to each host
> object in addition to all necessary processing of OpenMP constructs.
> This initialization code is intended to register target binaries for
> all offload targets in the runtime library at program startup. This
> code consists of two compiler-generated routines. One of these
> routines is added to the list of global constructors and the other to
> the global destructors. The constructor routine calls a libomptarget
> API which registers the target binaries and the destructor correspondingly calls a similar API for unregistering target binaries.
> Both these APIs accept a pointer to the target binary descriptor
> object which specifies the number of offload target binaries to
> register and the start/end addresses of target binary images. Since
> the start/end addresses of target binaries are not available at
> compile time, the target binary descriptors are initialized using
> link-time constants which reference (undefined) symbols containing the
> start/end addresses of all target images. These symbols are created by
> the dynamically-generated linker script which the clang driver creates for the host link action.
> References to the target specific symbols from host objects make them
> dependent on particular offload targets and prevents dropping offload
> targets at the link step. Therefore, the OpenMP offload initialization
> needs to be redesigned to make offload targets discardable.
> Proposed change
> Host objects should be independent of offload targets in order to
> allow dropping code for offload targets. That can be achieved by
> removing offload initialization code from host objects. The compiler
> should not inject this code into host objects.
> However, offload initialization should still be done, so it is
> proposed to move the initialization code into a special dynamically
> generated object file (referred to as 'wrapper object' here onwards),
> which, besides the initialization code, will also contain embedded images for offload targets.
> The wrapper object file will be generated by the clang driver with the
> help of a new tool: clang-offload-wrapper. This tool will take offload
> target binaries as input and produce bitcode files containing offload
> initialization code and embedded target images. The output bitcode is
> then passed to the backend and assembler tools from the host toolchain
> to produce the wrapper object which is then added as an input to the linker for host linking.
> The offload action builder in the clang driver needs to be changed to
> use this tool while building the actions graph for OpenMP offload compilations.
> Current status
> A patch with initial implementation of the proposed changes has been
> uploaded to phabricator for review - https://reviews.llvm.org/D49510.
> Looking for a feedback for this proposal.
> cfe-dev mailing list
> cfe-dev at lists.llvm.org
Lead, Compiler Technology and Programming Languages Leadership Computing Facility Argonne National Laboratory
More information about the cfe-dev