[PATCH] D42922: [CUDA] Register relocatable GPU binaries

Jonas Hahnfeld via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Mon Feb 12 04:08:13 PST 2018


Hahnfeld planned changes to this revision.
Hahnfeld added a comment.

Still no regression tests.

I did some functional tests though (https://reviews.llvm.org/F5822023): With this patch Clang can generate valid object files with relocatable device code. For linking I still defer to `nvcc` and I'm not sure if I'm interested in reverse-engineering the needed tools to make this fully work with Clang's Driver: I think the biggest advantage of CUDA in Clang is using LLVM's CodeGen. Note that (in my simple tests) Clang's object files had enough compatibility to mix them with other objects generated by `nvcc` (see `Makefile.mixed`)!


https://reviews.llvm.org/D42922





More information about the cfe-commits mailing list