[PATCH] D85223: [CUDA][HIP] Support accessing static device variable in host code for -fgpu-rdc
Jon Chesterfield via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Tue Feb 9 10:38:35 PST 2021
JonChesterfield accepted this revision.
JonChesterfield added a comment.
This works around, in the compiler, the limitations of the binary format that nvptx and amdgpu use. The compiler is the wrong place in the stack for this fix: we could instead introduce another symbol table in the binary to capture the per-TU, cross-arch scoping.
However, if we later reach consensus on what to do in the ELF instead, we can still do that. In particular, embedding an ELF for one arch in a named section of an ELF for the host arch is crude. This workaround seems acceptable in the meantime.
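To make the scoping problem concrete, here is a minimal C++ sketch. It assumes the workaround amounts to externalizing each static device variable under a name made unique per translation unit. The struct, the function names, and the "." plus id suffix format are all hypothetical and are not taken from the patch. A flat, name-keyed symbol table cannot hold two same-named statics from different TUs, whereas uniquified names can coexist.

```cpp
#include <cassert>
#include <set>
#include <string>

// Hypothetical model of a static device variable: the translation unit
// that defines it and its source-level name.
struct StaticDeviceVar {
    std::string tu;    // e.g. "a.cu"
    std::string name;  // e.g. "counter"
};

// Models a flat, name-keyed symbol table of the kind the host side would
// consult to find a device variable. Returns false if the name is taken.
bool addSymbol(std::set<std::string>& table, const std::string& symbol) {
    return table.insert(symbol).second;
}

// Assumed shape of the workaround: expose the variable under a name made
// unique by a per-TU identifier. The "." separator and the id format are
// illustrative only, not the patch's actual naming scheme.
std::string externalizedName(const StaticDeviceVar& v,
                             const std::string& tuId) {
    return v.name + "." + tuId;
}
```

With two variables both named `counter`, one from `a.cu` and one from `b.cu`, the second `addSymbol` call on the plain name fails, while the two externalized names insert without conflict. A per-TU symbol table in the binary would make the renaming unnecessary, which is the alternative suggested above.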
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D85223/new/
https://reviews.llvm.org/D85223