[PATCH] D42922: [CUDA] Register relocatable GPU binaries
Jonas Hahnfeld via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Mon Feb 12 04:08:13 PST 2018
Hahnfeld planned changes to this revision.
Hahnfeld added a comment.
Still no regression tests.
I did some functional tests though (https://reviews.llvm.org/F5822023): With this patch Clang can generate valid object files with relocatable device code. For linking I still defer to `nvcc` and I'm not sure if I'm interested in reverse-engineering the needed tools to make this fully work with Clang's Driver: I think the biggest advantage of CUDA in Clang is using LLVM's CodeGen. Note that (in my simple tests) Clang's object files had enough compatibility to mix them with other objects generated by `nvcc` (see `Makefile.mixed`)!
https://reviews.llvm.org/D42922
More information about the cfe-commits
mailing list