[clang] [llvm] [clang][CUDA] Avoid accounting for tail padding in LLVM offloading (PR #156229)
Kevin Sala Penades via llvm-commits
llvm-commits at lists.llvm.org
Thu Sep 25 14:55:15 PDT 2025
================
@@ -3655,11 +3655,6 @@ Error AMDGPUKernelTy::launchImpl(GenericDeviceTy &GenericDevice,
KernelArgsTy &KernelArgs,
KernelLaunchParamsTy LaunchParams,
AsyncInfoWrapperTy &AsyncInfoWrapper) const {
- if (ArgsSize != LaunchParams.Size &&
----------------
kevinsala wrote:
It shouldn't change that. There is the `ImplArgsOffset` computed later, which rounds up the size to ImplicitArgsTy's alginment.
https://github.com/llvm/llvm-project/pull/156229
More information about the llvm-commits
mailing list