[PATCH] D18340: AMDGPU: allow specifying a workgroup size that needs to fit in a compute unit
Bas Nieuwenhuizen via llvm-commits
llvm-commits at lists.llvm.org
Wed Mar 23 09:57:31 PDT 2016
bnieuwenhuizen added inline comments.
================
Comment at: lib/Target/AMDGPU/SIRegisterInfo.cpp:40
@@ +39,3 @@
+ AllowedSGPRCount = 800;
+ AllowedSGPRCount = (AllowedSGPRCount / MaxWaveCountPerSIMD) & ~15;
+
----------------
arsenm wrote:
> There is a alignTo function you should use
Note that alignTo rounds up, this rounds down.
http://reviews.llvm.org/D18340
More information about the llvm-commits
mailing list