[clang] [llvm] [AMDGPU] Adding the amdgpu-num-work-groups function attribute (PR #75647)

Jun Wang via llvm-commits llvm-commits at lists.llvm.org
Tue Dec 19 11:25:57 PST 2023


jwanggit86 wrote:

Two possible optimizations mentioned by the requester are:

"1. This'll let the backend know the maximum size of the workgroup ID, and so we can do things like infer nsw or the ability to use a 16-bit add, and so on.

2. This could be used to optimize global sync stuff in the future."
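
To make the first point concrete, here is a rough sketch of the kind of range reasoning a known workgroup-count bound would allow. This is not code from the PR and not an LLVM API: the helper names (maxWorkGroupId, addIsNoSignedWrap, addFitsIn16Bits) are hypothetical, and it assumes the attribute supplies a nonzero upper bound on the number of workgroups in a dimension, so that the workgroup ID lies in [0, bound - 1].

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Hypothetical helper (not an LLVM API): largest workgroup ID in one
// dimension, assuming IDs range over [0, maxNumWorkGroups - 1].
inline uint64_t maxWorkGroupId(uint32_t maxNumWorkGroups) {
  assert(maxNumWorkGroups > 0 && "assumed: bound is known and nonzero");
  return static_cast<uint64_t>(maxNumWorkGroups) - 1;
}

// True if `id + addend` stays within the signed 32-bit range for every
// valid id, which is the condition under which an add could carry nsw.
inline bool addIsNoSignedWrap(uint32_t maxNumWorkGroups, uint32_t addend) {
  const uint64_t limit =
      static_cast<uint64_t>(std::numeric_limits<int32_t>::max());
  return maxWorkGroupId(maxNumWorkGroups) + addend <= limit;
}

// True if `id + addend` fits in an unsigned 16-bit value for every valid
// id, so the add could in principle be narrowed to 16 bits.
inline bool addFitsIn16Bits(uint32_t maxNumWorkGroups, uint32_t addend) {
  const uint64_t limit =
      static_cast<uint64_t>(std::numeric_limits<uint16_t>::max());
  return maxWorkGroupId(maxNumWorkGroups) + addend <= limit;
}
```

With a bound of 1024 workgroups, for example, adding a small constant to the ID passes both checks, whereas without any bound the backend has to assume the ID can occupy the full 32-bit range.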

https://github.com/llvm/llvm-project/pull/75647

