[PATCH] D18340: AMDGPU: allow specifying a workgroup size that needs to fit in a compute unit

Bas Nieuwenhuizen via llvm-commits llvm-commits at lists.llvm.org
Mon Mar 21 16:34:51 PDT 2016

bnieuwenhuizen added a comment.

On second thought I think I would prefer a function attribute, as we get exact size information per program. Similarly we have in OpenCL the reqd_work_group_size to specify that a specific kernel will always be used with that size. As we can have multiple kernels in OpenCL and as far as I understand a subtarget is global, I think a function attribute would be more suited.


More information about the llvm-commits mailing list