[clang] [llvm] [clang][CUDA] Avoid accounting for tail padding in LLVM offloading (PR #156229)
Joseph Huber via llvm-commits
llvm-commits at lists.llvm.org
Thu Sep 18 12:52:15 PDT 2025
================
@@ -3655,11 +3655,6 @@ Error AMDGPUKernelTy::launchImpl(GenericDeviceTy &GenericDevice,
KernelArgsTy &KernelArgs,
KernelLaunchParamsTy LaunchParams,
AsyncInfoWrapperTy &AsyncInfoWrapper) const {
- if (ArgsSize != LaunchParams.Size &&
----------------
jhuber6 wrote:
The implicit arguments are placed directly after what HSA reports as the argument size. Does this change that?
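
For readers following along, here is a minimal sketch of why the size used for the explicit arguments matters for where implicit arguments land. It is not the plugin's code: `ExplicitArgs`, `sizeWithoutTailPadding`, `alignTo`, and `implicitArgsOffset` are hypothetical names, and rounding the offset up to an alignment is an assumption made only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical explicit-argument layout for a kernel taking (void *, int32_t).
// On a typical 64-bit target sizeof(ExplicitArgs) includes tail padding.
struct ExplicitArgs {
  void *Ptr;
  int32_t N;
};

// Size of the explicit arguments measured to the end of the last field,
// i.e. without the struct's tail padding.
constexpr std::size_t sizeWithoutTailPadding() {
  return offsetof(ExplicitArgs, N) + sizeof(int32_t);
}

// Round Value up to the next multiple of Align (Align must be non-zero).
constexpr std::size_t alignTo(std::size_t Value, std::size_t Align) {
  return (Value + Align - 1) / Align * Align;
}

// Hypothetical helper: offset at which implicit arguments would begin if they
// follow the reported explicit-argument size. The alignment step is an
// assumption for illustration, not taken from the PR.
constexpr std::size_t implicitArgsOffset(std::size_t ReportedSize,
                                         std::size_t ImplicitAlign) {
  return alignTo(ReportedSize, ImplicitAlign);
}
```

If the host passes `sizeWithoutTailPadding()` while the device side expects `sizeof(ExplicitArgs)` (or the reverse), the two sides can disagree on where the implicit arguments start, which is the concern raised above.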
https://github.com/llvm/llvm-project/pull/156229