[PATCH] D18340: AMDGPU: allow specifying a workgroup size that needs to fit in a compute unit
Bas Nieuwenhuizen via llvm-commits
llvm-commits at lists.llvm.org
Mon Mar 21 16:34:51 PDT 2016
bnieuwenhuizen added a comment.
On second thought, I think I would prefer a function attribute, as we get exact size information per program. OpenCL similarly has reqd_work_group_size to specify that a given kernel will always be launched with that size. Since an OpenCL program can contain multiple kernels and, as far as I understand, a subtarget is global, I think a function attribute would be better suited.
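To make the difference concrete, here is a rough sketch of the argument. It is a toy model only: Kernel, Subtarget, workGroupSizeAttr, effectiveWorkGroupSize and the sizes 64 and 256 are all hypothetical names and values chosen for illustration, not anything from LLVM or from the patch.

```cpp
#include <cassert>
#include <cstdint>
#include <string>

// Hypothetical model for illustration only; none of these types are LLVM's.
struct Kernel {
    std::string name;
    uint32_t workGroupSizeAttr; // per-function attribute value; 0 means "not set"
};

struct Subtarget {
    uint32_t maxWorkGroupSize; // single value shared by every kernel in the module
};

// Prefer the per-kernel attribute; fall back to the module-wide subtarget value.
uint32_t effectiveWorkGroupSize(const Kernel &k, const Subtarget &st) {
    return k.workGroupSizeAttr != 0 ? k.workGroupSizeAttr : st.maxWorkGroupSize;
}

// Two kernels in the same module can resolve to different sizes.
void demo() {
    Subtarget st{256};          // assumed module-wide limit
    Kernel small{"reduce", 64}; // kernel carrying its own size
    Kernel plain{"copy", 0};    // kernel without the attribute
    assert(effectiveWorkGroupSize(small, st) == 64);
    assert(effectiveWorkGroupSize(plain, st) == 256);
}
```

With only the subtarget value, both kernels would have to be compiled for 256; the per-function value lets each kernel state its own size.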