[PATCH] D120346: [AMDGPU] Extend SILoadStoreOptimizer to handle global stores

Stanislav Mekhanoshin via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Feb 23 10:31:24 PST 2022


rampitec added inline comments.


================
Comment at: llvm/test/CodeGen/AMDGPU/soft-clause-exceeds-register-budget.ll:7
+; CHECK:         s_load_dwordx16
+; CHECK-NEXT:    s_load_dwordx16
+
----------------
foad wrote:
> What happened here? Autogenerating the checks for this file might make it clearer what actually changed.
There are a bunch of global stores at the end of the test. We have combined 2 more of them, which changed the register pressure: the estimate for the last block went from 40 to 41. Consequently the scheduling changed: a pack of 4 global loads was scheduled lower, while a couple of scalar x16 loads were scheduled higher.

I actually do not think that generating checks for ~400 instructions is beneficial. We would have to regenerate them often, and they would obscure the actual clauses.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D120346/new/

https://reviews.llvm.org/D120346


