[PATCH] D145586: [AMDGPU] Tweak PromoteAlloca limits

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Mar 17 09:42:58 PDT 2023


arsenm added a comment.

Unrelated but PromoteAllocaToVectorLimit  should really move to a new PM pass parameter



================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp:402
 
-  // Use up to 1/4 of available register budget for vectorization.
+  // Use up to 1/2 of available register budget for vectorization if we have
+  // >=64 MaxVGPRs, otherwise use 1/4.
----------------
Half feels pretty aggressive


================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUPromoteAlloca.cpp:405-406
+  // If PromoteAllocaToVectorLimit is used, also use 1/4.
   unsigned Limit = PromoteAllocaToVectorLimit ? PromoteAllocaToVectorLimit * 8
                                               : (MaxVGPRs * 32);
+  const unsigned SizeFactor =
----------------
The largest register class we support is <32 x i32>, do we definitely never introduce larger vectors?


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D145586/new/

https://reviews.llvm.org/D145586



More information about the llvm-commits mailing list