[all-commits] [llvm/llvm-project] c3ce56: [Clang][OpenCL][AMDGPU] OpenCL Kernel stubs shoul...

Aniket Lal via All-commits all-commits at lists.llvm.org
Wed May 7 03:12:44 PDT 2025


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: c3ce5684a8b408220eed983d065edba0e6ed5016
      https://github.com/llvm/llvm-project/commit/c3ce5684a8b408220eed983d065edba0e6ed5016
  Author: Aniket Lal <lalaniket8 at gmail.com>
  Date:   2025-05-07 (Wed, 07 May 2025)

  Changed paths:
    M clang/lib/CodeGen/CodeGenModule.cpp
    M clang/test/CodeGenOpenCL/amdgpu-enqueue-kernel.cl
    M clang/test/CodeGenOpenCL/cl-uniform-wg-size.cl
    M clang/test/CodeGenOpenCL/cl20-device-side-enqueue.cl
    M clang/test/CodeGenOpenCL/convergent.cl
    M clang/test/CodeGenOpenCL/enqueue-kernel-non-entry-block.cl

  Log Message:
  -----------
  [Clang][OpenCL][AMDGPU] OpenCL Kernel stubs should be assigned alwaysinline attribute (#137769)

The body of an OpenCL kernel is emitted as a stub function, and the kernel
itself is emitted as a call to that stub
(https://github.com/llvm/llvm-project/pull/115821).
The stub function should be marked alwaysinline, since a call to the stub
can cause a performance drop.
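
The stub-and-wrapper arrangement described above can be illustrated with a rough host-side analogy in plain C. This is only a sketch, not the code Clang generates: the names `scale_stub` and `scale_kernel` are hypothetical, and the GCC/Clang `always_inline` function attribute stands in for the LLVM IR `alwaysinline` attribute that this commit assigns to the stub.

```c
#include <stddef.h>

/* Hypothetical stand-in for the stub that carries the kernel body.
 * always_inline asks the compiler to inline it into every caller,
 * so the wrapper below pays no call overhead. */
static inline __attribute__((always_inline))
void scale_stub(float *data, float factor, size_t n) {
    for (size_t i = 0; i < n; ++i)
        data[i] *= factor;
}

/* Hypothetical stand-in for the kernel entry point, which does
 * nothing except forward its arguments to the stub. */
void scale_kernel(float *data, float factor, size_t n) {
    scale_stub(data, factor, n);
}
```

With the attribute in place, the entry point is expected to contain the loop directly rather than a call instruction; without it, inlining is left to the optimizer's heuristics.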

Co-authored-by: anikelal <anikelal at amd.com>
