[PATCH] D127901: [LinkerWrapper] Add PTX output to CUDA fatbinary in LTO-mode

Wed Jun 22 15:01:09 PDT 2022

jhuber6 added a comment.

In D127901#3603006 <https://reviews.llvm.org/D127901#3603006>, @tra wrote:

> Then we do need a knob controlling whether we do want to embed PTX or not. The default should be "off" IMO.
> We currently have `--[no-]cuda-include-ptx=` we may reuse for that purpose.

We could definitely re-use that. It's another option that probably need to go inside the binary itself since normally those options aren't passed to the linker. We'll probably just use the same default as that flag (which is on I think).

> This brings another question -- which GPU variant will we generate PTX for? One? All (if more than one is specified)? The ones specified by `--[no-]cuda-include-ptx=` ?

Right now, it'll be the one that's attached to the LTO job. So if the user specified `sm_70` they'll get PTX for `sm_70`.

Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D127901/new/

https://reviews.llvm.org/D127901