[all-commits] [llvm/llvm-project] c3ce56: [Clang][OpenCL][AMDGPU] OpenCL Kernel stubs shoul...
Aniket Lal via All-commits
all-commits at lists.llvm.org
Wed May 7 03:12:44 PDT 2025
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: c3ce5684a8b408220eed983d065edba0e6ed5016
https://github.com/llvm/llvm-project/commit/c3ce5684a8b408220eed983d065edba0e6ed5016
Author: Aniket Lal <lalaniket8 at gmail.com>
Date: 2025-05-07 (Wed, 07 May 2025)
Changed paths:
M clang/lib/CodeGen/CodeGenModule.cpp
M clang/test/CodeGenOpenCL/amdgpu-enqueue-kernel.cl
M clang/test/CodeGenOpenCL/cl-uniform-wg-size.cl
M clang/test/CodeGenOpenCL/cl20-device-side-enqueue.cl
M clang/test/CodeGenOpenCL/convergent.cl
M clang/test/CodeGenOpenCL/enqueue-kernel-non-entry-block.cl
Log Message:
-----------
[Clang][OpenCL][AMDGPU] OpenCL Kernel stubs should be assigned alwaysinline attribute (#137769)
The body of an OpenCL kernel is emitted as a stub function, and the
kernel itself is emitted as a call to that stub
(https://github.com/llvm/llvm-project/pull/115821).
The stub function should be marked alwaysinline, since a call to the
stub can cause a performance drop.
Co-authored-by: anikelal <anikelal at amd.com>
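The stub-plus-call arrangement described in the log message can be pictured with a small host-side C++ analogy. This is only a sketch, not the code Clang emits: the names scale_stub, scale_kernel and run_scale are hypothetical, and the GCC/Clang always_inline function attribute stands in here for the alwaysinline attribute the commit assigns to the stub.

```cpp
// Illustrative analogy only; this is not the IR that Clang generates.
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the stub that carries the kernel body.
// always_inline (GCC/Clang attribute) asks the compiler to inline
// every direct call to this function, even without optimization.
__attribute__((always_inline)) inline void scale_stub(float *data,
                                                      std::size_t n,
                                                      float factor) {
  for (std::size_t i = 0; i < n; ++i)
    data[i] *= factor;
}

// Hypothetical stand-in for the kernel entry point: it only forwards
// its arguments to the stub. With the stub inlined, no call remains.
void scale_kernel(float *data, std::size_t n, float factor) {
  scale_stub(data, n, factor);
}

// Small helper (also hypothetical) that runs the "kernel" on a copy
// of the input and returns the result.
std::vector<float> run_scale(std::vector<float> values, float factor) {
  scale_kernel(values.data(), values.size(), factor);
  return values;
}
```

Calling run_scale on a vector multiplies each element by the given factor. Because the stub is forced inline, the entry point ends up containing the loop directly rather than a call, which is the effect the commit aims for on the kernel side.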