[llvm] [Offload] Add olGetKernelMaxGroupSize (PR #142950)

Ross Brunton via llvm-commits llvm-commits at lists.llvm.org
Mon Aug 4 07:59:49 PDT 2025


RossBrunton wrote:

I want to expose the number directly to UR, I guess so that users can try out different local memory sizes and see which works best. I don't think a launch option for "determine the best option" would cover this, although I'm not entirely opposed to such an option existing.

The query is currently specific to Nvidia, but an implementation can be made for AMD (as it is in HIP), and users will probably appreciate having it available in liboffload rather than having to roll it themselves.

https://github.com/llvm/llvm-project/pull/142950

