[PATCH] D149451: [NVPTX] Add NVPTXCtorDtorLoweringPass to handle global ctors / dtors

Joseph Huber via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Apr 28 11:42:36 PDT 2023


jhuber6 added a comment.

In D149451#4306204 <https://reviews.llvm.org/D149451#4306204>, @tra wrote:

> In D149451#4306185 <https://reviews.llvm.org/D149451#4306185>, @jhuber6 wrote:
>
>> I'm still very unhappy that you can't emit sections in `ptxas`. maybe we need to make a `ptxas` wrapper that compiles it in debug mode and regular mode then objcopies the section from the debug mode one into the regular one. I'm sure nothing could go wrong there :).
>
> Considering the already-known quirkiness of ELF handling by other CUDA tools, I would not hold my breath for objcopy keeping CUDA tools/runtime/driver happy, either.
>
> That's another point for growing our own ability to deal with NVIDIA GPU binaries in LLD and other LLVM's binary utilities. Everything else will require playing this endless game of whack-a-mole chasing version-dependent quirks in NVIDIA's tools.

Yes, I remember when putting a SHT_NOTE section in caused it to no longer loader the module. I have no clue why they seemed to have implemented the ELF target so poorly. But what can you do.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D149451/new/

https://reviews.llvm.org/D149451



More information about the llvm-commits mailing list