Artem-B wrote: @castigli would it be possible for you to post LLVM IR and PTX generated in your case tor the kernel in question? We'll try to do the same on our end, and see if there are any obvious differences. https://github.com/llvm/llvm-project/pull/169061