[PATCH] D120566: [OpenCL][AMDGPU]: Do not allow a call to kernel
Yaxun Liu via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Fri Feb 25 07:11:57 PST 2022
yaxunl added a comment.
One of my concerns is that all kernels are duplicated which may cause code object size doubled.
Do we need to make the clone always_inline and let the kernel call its clone to avoid duplicate function bodies? Or LLVM has some pass to do that?
Another concern is that the duplicate non-kernel functions have actual kernel ABI. Not sure if that can cause any issues.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D120566/new/
https://reviews.llvm.org/D120566
More information about the cfe-commits
mailing list