[PATCH] D81991: [AMDGPU] Control num waves per EU for implicit work-group size

Pushpinder Singh via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Jun 17 18:54:50 PDT 2020


pdhaliwal added a comment.

My understanding is that clang already emits amdgpu-flat-work-group-size in range of [1, 256] as default value and HIP defaults to [1,1024]. And logic to calculate min-waves-per-eu is dependent on the maximum value of work-group size not minimum.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D81991/new/

https://reviews.llvm.org/D81991





More information about the llvm-commits mailing list