[clang] [llvm] [clang][CUDA] Avoid accounting for tail padding in LLVM offloading (PR #156229)

Joseph Huber via llvm-commits llvm-commits at lists.llvm.org
Thu Sep 18 12:52:15 PDT 2025


================
@@ -3655,11 +3655,6 @@ Error AMDGPUKernelTy::launchImpl(GenericDeviceTy &GenericDevice,
                                  KernelArgsTy &KernelArgs,
                                  KernelLaunchParamsTy LaunchParams,
                                  AsyncInfoWrapperTy &AsyncInfoWrapper) const {
-  if (ArgsSize != LaunchParams.Size &&
----------------
jhuber6 wrote:

The implicit arguments are placed directly after what HSA reports as the argument size, does this change that?

https://github.com/llvm/llvm-project/pull/156229


More information about the llvm-commits mailing list