[clang] [Clang][OpenCL][AMDGPU] Allow a kernel to call another kernel (PR #115821)
Aniket Lal via cfe-commits
cfe-commits at lists.llvm.org
Mon Apr 14 08:11:54 PDT 2025
lalaniket8 wrote:
> > this change makes sense and it doesn't bring (known to me) regressions
>
> Actually, there is an incorrect behavior in the following test case: https://godbolt.org/z/dc3T7Mo3G , note __clang_ocl_kern_imp_sample_kernel_float was generated, but was never called. @lalaniket8 can this be addressed?
The call was placed and inlined since -O2 optimization flag was passed. Default inlining happens with -O1 and higher flags.
You can change it to -O0 and see the call to `__clang_ocl_kern_imp_sample_kernel_float`.
https://github.com/llvm/llvm-project/pull/115821
More information about the cfe-commits
mailing list