[PATCH] D81991: [AMDGPU] Control num waves per EU for implicit work-group size
Pushpinder Singh via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Jun 17 18:54:50 PDT 2020
pdhaliwal added a comment.
My understanding is that clang already emits amdgpu-flat-work-group-size with a default range of [1, 256], while HIP defaults to [1, 1024]. The logic that calculates min-waves-per-eu depends on the maximum work-group size, not the minimum.
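To illustrate why only the upper bound matters, here is a rough sketch of how a minimum waves-per-EU value could follow from the maximum flat work-group size. The helper names (divideCeil, wavesPerWorkGroup, minWavesPerEU) are hypothetical and are not LLVM's API. The wave size of 64 and 4 EUs per CU are assumptions that vary by subtarget.

```cpp
#include <cassert>

// Hypothetical helpers for illustration only; not LLVM's actual API.
static unsigned divideCeil(unsigned a, unsigned b) {
  assert(b != 0 && "divisor must be non-zero");
  return (a + b - 1) / b;
}

// Number of waves needed to hold one work-group of the given size.
static unsigned wavesPerWorkGroup(unsigned flatWorkGroupSize,
                                  unsigned waveSize) {
  return divideCeil(flatWorkGroupSize, waveSize);
}

// Minimum waves per EU implied by the *maximum* flat work-group size,
// assuming the waves of one work-group are spread evenly over the EUs of a CU.
// Defaults (wave size 64, 4 EUs per CU) are assumptions, not universal values.
static unsigned minWavesPerEU(unsigned maxFlatWorkGroupSize,
                              unsigned waveSize = 64,
                              unsigned eusPerCU = 4) {
  return divideCeil(wavesPerWorkGroup(maxFlatWorkGroupSize, waveSize),
                    eusPerCU);
}
```

Under these assumptions, an upper bound of 256 implies 1 wave per EU and an upper bound of 1024 implies 4. The lower bound of 1 in either range does not enter the calculation.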
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D81991/new/
https://reviews.llvm.org/D81991