[all-commits] [llvm/llvm-project] 283b9e: [Clang][OpenCL][AMDGPU] Allow a kernel to call ano...

Aniket Lal via All-commits all-commits at lists.llvm.org
Wed Mar 12 22:18:32 PDT 2025


  Branch: refs/heads/users/lalaniket8/emit-device-version-of-openCL-kernel
  Home:   https://github.com/llvm/llvm-project
  Commit: 283b9efba388c36a68e8f2bc5e128b7c65a92686
      https://github.com/llvm/llvm-project/commit/283b9efba388c36a68e8f2bc5e128b7c65a92686
  Author: anikelal <anikelal at amd.com>
  Date:   2025-03-13 (Thu, 13 Mar 2025)

  Changed paths:
    M clang/include/clang/AST/GlobalDecl.h
    M clang/lib/AST/Expr.cpp
    M clang/lib/AST/ItaniumMangle.cpp
    M clang/lib/AST/Mangle.cpp
    M clang/lib/AST/MicrosoftMangle.cpp
    M clang/lib/CodeGen/CGBlocks.cpp
    M clang/lib/CodeGen/CGCall.cpp
    M clang/lib/CodeGen/CGExpr.cpp
    M clang/lib/CodeGen/CGOpenCLRuntime.cpp
    M clang/lib/CodeGen/CGOpenCLRuntime.h
    M clang/lib/CodeGen/CodeGenModule.cpp
    A clang/test/CodeGenOpenCL/opencl-kernel-call.cl
    M clang/test/CodeGenOpenCL/spir-calling-conv.cl
    M clang/test/CodeGenOpenCL/visibility.cl

  Log Message:
  -----------
  [Clang][OpenCL][AMDGPU] Allow a kernel to call another kernel

This feature is currently not supported in the compiler.
To facilitate this we emit a stub version of each kernel
function body with different name mangling scheme, and
replaces the respective kernel call-sites appropriately.

Fixes https://github.com/llvm/llvm-project/issues/60313

D120566 was an earlier attempt made to upstream a solution
for this issue.


  Commit: 64cbbae4263bf3d06031af548c8816ad3b36bbcf
      https://github.com/llvm/llvm-project/commit/64cbbae4263bf3d06031af548c8816ad3b36bbcf
  Author: anikelal <anikelal at amd.com>
  Date:   2025-03-13 (Thu, 13 Mar 2025)

  Changed paths:
    M clang/include/clang/AST/GlobalDecl.h
    M clang/lib/CodeGen/CGOpenCLRuntime.cpp

  Log Message:
  -----------
  [Clang][OpenCL][AMDGPU] Allow a kernel to call another kernel

Simplifying isDeclOpenCLKernel() and removing resolved comments


  Commit: aa66d866a7a777fa715fbc252a552dd396dfe192
      https://github.com/llvm/llvm-project/commit/aa66d866a7a777fa715fbc252a552dd396dfe192
  Author: anikelal <anikelal at amd.com>
  Date:   2025-03-13 (Thu, 13 Mar 2025)

  Changed paths:
    M clang/include/clang/AST/Decl.h
    M clang/include/clang/AST/GlobalDecl.h
    M clang/lib/AST/Decl.cpp
    M clang/lib/AST/Expr.cpp
    M clang/lib/AST/ItaniumMangle.cpp
    M clang/lib/AST/Mangle.cpp
    M clang/lib/CodeGen/CGBlocks.cpp
    M clang/lib/CodeGen/CGCall.cpp
    M clang/lib/CodeGen/CGExpr.cpp
    M clang/lib/CodeGen/CGOpenCLRuntime.cpp
    M clang/lib/CodeGen/CGOpenCLRuntime.h
    M clang/lib/CodeGen/CodeGenFunction.cpp
    M clang/lib/CodeGen/CodeGenModule.cpp
    M clang/lib/CodeGen/CodeGenTypes.h
    M clang/lib/CodeGen/TargetInfo.h
    M clang/lib/CodeGen/Targets/AMDGPU.cpp
    M clang/lib/CodeGen/Targets/SPIR.cpp
    M clang/lib/CodeGen/Targets/X86.cpp
    M clang/test/CodeGenOpenCL/addr-space-struct-arg.cl
    M clang/test/CodeGenOpenCL/amdgpu-abi-struct-arg-byref.cl
    M clang/test/CodeGenOpenCL/amdgpu-abi-struct-coerce.cl
    M clang/test/CodeGenOpenCL/amdgpu-enqueue-kernel.cl
    M clang/test/CodeGenOpenCL/amdgpu-printf.cl
    M clang/test/CodeGenOpenCL/cl-uniform-wg-size.cl
    M clang/test/CodeGenOpenCL/cl20-device-side-enqueue-attributes.cl
    M clang/test/CodeGenOpenCL/cl20-device-side-enqueue.cl
    M clang/test/CodeGenOpenCL/convergent.cl
    M clang/test/CodeGenOpenCL/enqueue-kernel-non-entry-block.cl
    M clang/test/CodeGenOpenCL/kernel-arg-info.cl
    M clang/test/CodeGenOpenCL/opencl-kernel-call.cl
    M clang/test/CodeGenOpenCL/reflect.cl
    M clang/test/CodeGenOpenCL/spir-calling-conv.cl
    M clang/test/CodeGenOpenCL/visibility.cl
    M clang/test/CodeGenOpenCLCXX/addrspace-of-this.clcpp

  Log Message:
  -----------
  Emitting OpenCL Kernels as calls to their respective inlined stubs to avoid double emission of kernel body.


  Commit: fc791489ff592ca3ce2c390db1bcc9aecdcbe901
      https://github.com/llvm/llvm-project/commit/fc791489ff592ca3ce2c390db1bcc9aecdcbe901
  Author: anikelal <anikelal at amd.com>
  Date:   2025-03-13 (Thu, 13 Mar 2025)

  Changed paths:
    M clang/lib/CodeGen/TargetInfo.cpp
    M clang/lib/CodeGen/TargetInfo.h
    M clang/lib/CodeGen/Targets/AMDGPU.cpp
    M clang/lib/CodeGen/Targets/X86.cpp
    M clang/test/CodeGenOpenCL/cl20-device-side-enqueue.cl
    M clang/test/CodeGenOpenCL/opencl-kernel-call.cl

  Log Message:
  -----------
  Set correct calling convention for stubs in X86 and SPIR targets
Test aggregate paratemers passed byref and byval


  Commit: 926cff45e520c70f84c348a788995d53a7e918c1
      https://github.com/llvm/llvm-project/commit/926cff45e520c70f84c348a788995d53a7e918c1
  Author: anikelal <anikelal at amd.com>
  Date:   2025-03-13 (Thu, 13 Mar 2025)

  Changed paths:
    M clang/test/CodeGenOpenCL/addr-space-struct-arg.cl
    M clang/test/CodeGenOpenCL/amdgpu-abi-struct-arg-byref.cl
    M clang/test/CodeGenOpenCL/amdgpu-enqueue-kernel.cl
    M clang/test/CodeGenOpenCL/implicit-addrspacecast-function-parameter.cl

  Log Message:
  -----------
  Update tests


Compare: https://github.com/llvm/llvm-project/compare/665db4b399e3...926cff45e520

To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications


More information about the All-commits mailing list