[PATCH] D127901: [LinkerWrapper] Add PTX output to CUDA fatbinary in LTO-mode
Joseph Huber via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Wed Jun 22 15:01:09 PDT 2022
jhuber6 added a comment.
In D127901#3603006 <https://reviews.llvm.org/D127901#3603006>, @tra wrote:
> Then we do need a knob controlling whether we do want to embed PTX or not. The default should be "off" IMO.
> We currently have `--[no-]cuda-include-ptx=` we may reuse for that purpose.
We could definitely re-use that. It's another option that probably need to go inside the binary itself since normally those options aren't passed to the linker. We'll probably just use the same default as that flag (which is on I think).
> This brings another question -- which GPU variant will we generate PTX for? One? All (if more than one is specified)? The ones specified by `--[no-]cuda-include-ptx=` ?
Right now, it'll be the one that's attached to the LTO job. So if the user specified `sm_70` they'll get PTX for `sm_70`.
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
More information about the cfe-commits