[PATCH] D91516: [AMDGPU] Support for device scope shared variables
Jon Chesterfield via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Dec 23 13:37:46 PST 2020
JonChesterfield added a comment.
I believe the intended effect of this pass is:
- Allow functions called directly to access LDS without being inlined
- Silently induce undefined behaviour (UB) in programs containing recursive functions
- Silently induce UB in some programs containing indirect calls, probably those with complicated data flow
The benefit is that some applications that meet those constraints perform better than they would with the functions inlined, perhaps due to code size effects.
If that is an accurate characterisation, I don't think it is an improvement on the status quo. Perhaps one to keep in the internal repo?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D91516/new/
https://reviews.llvm.org/D91516