[PATCH] D120566: [OpenCL][AMDGPU]: Do not allow a call to kernel

Yaxun Liu via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Fri Feb 25 07:11:57 PST 2022


yaxunl added a comment.

One of my concerns is that all kernels are duplicated which may cause code object size doubled.

Do we need to make the clone always_inline and let the kernel call its clone to avoid duplicate function bodies? Or LLVM has some pass to do that?

Another concern is that the duplicate non-kernel functions have actual kernel ABI. Not sure if that can cause any issues.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D120566/new/

https://reviews.llvm.org/D120566



More information about the cfe-commits mailing list