[PATCH] D120346: [AMDGPU] Extend SILoadStoreOptimizer to handle global stores

Jay Foad via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Feb 24 01:24:07 PST 2022


foad accepted this revision.
foad added inline comments.
This revision is now accepted and ready to land.


================
Comment at: llvm/test/CodeGen/AMDGPU/soft-clause-exceeds-register-budget.ll:7
+; CHECK:         s_load_dwordx16
+; CHECK-NEXT:    s_load_dwordx16
+
----------------
rampitec wrote:
> foad wrote:
> > What happened here? Autogenerating the checks for this file might make it clearer what actually changed.
> There are a number of global stores at the end of the test. We now combine two more of them, which changes the register pressure: the estimate for the last block went from 40 to 41. Consequently the scheduling changed: a group of 4 global loads was scheduled lower, while a couple of scalar x16 loads were scheduled higher.
> 
> I actually do not think that generating checks for ~400 instructions is beneficial. We would have to regenerate them often, and they would obscure the actual clauses.
Well, it's still not clear to me that the test is still testing whatever it was supposed to test. But I don't really mind.
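
For reference, the kind of store merge discussed above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the SILoadStoreOptimizer code: the `Store` struct, the `tryMerge` helper, and the 4-dword width limit are all hypothetical names and choices made for the sketch. It models two stores off a common base address and combines them only if they are contiguous.

```cpp
#include <cstdint>

// Hypothetical model of a store: a byte offset from a shared base address
// and a width in 32-bit dwords. Not an LLVM type.
struct Store {
  int64_t offset;   // byte offset from the common base
  unsigned dwords;  // width of the store in dwords
};

// Sketch of an adjacency check: merge two stores into one wider store if
// they are contiguous in memory and the result is at most 4 dwords wide.
// The 4-dword cap is an assumption of this sketch.
bool tryMerge(const Store &a, const Store &b, Store &merged) {
  // Order the pair by offset so the check is symmetric in its arguments.
  const Store &lo = (a.offset <= b.offset) ? a : b;
  const Store &hi = (a.offset <= b.offset) ? b : a;

  // The higher store must start exactly where the lower one ends.
  if (lo.offset + static_cast<int64_t>(lo.dwords) * 4 != hi.offset)
    return false;

  unsigned width = lo.dwords + hi.dwords;
  if (width > 4)
    return false;

  merged.offset = lo.offset;
  merged.dwords = width;
  return true;
}
```

In this model a merged store needs all of its data operands live at the same point, which is one plausible way a merge could nudge a register pressure estimate, as with the 40 to 41 change mentioned above.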


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D120346/new/

https://reviews.llvm.org/D120346


