[clang] [AMDGPU] Introduce 'amdgpu_num_workgroups_{xyz}' builtin (PR #83927)

Matt Arsenault via cfe-commits cfe-commits at lists.llvm.org
Tue Mar 5 06:50:30 PST 2024


https://github.com/arsenm commented:

I think we would be better off teaching an IR optimizer pass to recognize the divide pattern and remap it to the load from the location, rather than forcing the complexity into every frontend 

https://github.com/llvm/llvm-project/pull/83927


More information about the cfe-commits mailing list