[PATCH] D18340: AMDGPU: allow specifying a workgroup size that needs to fit in a compute unit

Bas Nieuwenhuizen via llvm-commits llvm-commits at lists.llvm.org
Wed Mar 23 09:57:31 PDT 2016


bnieuwenhuizen added inline comments.

================
Comment at: lib/Target/AMDGPU/SIRegisterInfo.cpp:40
@@ +39,3 @@
+    AllowedSGPRCount = 800;
+    AllowedSGPRCount = (AllowedSGPRCount / MaxWaveCountPerSIMD) & ~15;
+
----------------
arsenm wrote:
> There is a alignTo function you should use
Note that alignTo rounds up, this rounds down.


http://reviews.llvm.org/D18340





More information about the llvm-commits mailing list