[PATCH] D79792: [AMDGPU] New SIInsertHardClauses pass

Stanislav Mekhanoshin via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu May 14 09:44:02 PDT 2020


rampitec added a comment.

In D79792#2035964 <https://reviews.llvm.org/D79792#2035964>, @foad wrote:

> Yes, shouldClusterMemOps also imposes a limit on the length of the cluster. If that is *only* to help with register pressure, then perhaps I can bypass that check by always calling it with NumLoads=2 instead of NumLoads=CI.Size+1. What do you think?


That's actually a very good idea. Just add a comment explaining it!
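
For reference, a minimal self-contained sketch of the idea being discussed. Everything here is an assumption made for illustration: `shouldClusterMemOpsStub`, `ClauseInfo`, the two helper functions and the limit of 4 are hypothetical stand-ins, not the real LLVM hook or the code in D79792. It only shows how passing a constant NumLoads=2 sidesteps a length limit that passing CI.Size+1 would eventually hit.

```cpp
// Hypothetical stand-in for a target hook such as shouldClusterMemOps.
// The real hook takes more arguments; this stub models only a length limit.
static bool shouldClusterMemOpsStub(unsigned NumLoads) {
  const unsigned MaxClusterLoads = 4; // made-up limit, for illustration only
  return NumLoads <= MaxClusterLoads;
}

// Hypothetical record of the clause built so far.
struct ClauseInfo {
  unsigned Size; // number of memory instructions already in the clause
};

// Passing the real size: the length limit eventually rejects the extension.
static bool canExtendWithRealSize(const ClauseInfo &CI) {
  return shouldClusterMemOpsStub(CI.Size + 1);
}

// Always passing NumLoads=2: the length limit never triggers, so only the
// hook's other checks (not modelled here) would decide. The assumption is
// that the limit exists only to help register pressure, which is the point
// a comment at the call site should explain.
static bool canExtendBypassingLimit(const ClauseInfo &) {
  return shouldClusterMemOpsStub(2);
}
```

With the made-up limit above, a clause of size 4 can no longer be extended through the first helper but can through the second.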


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D79792/new/

https://reviews.llvm.org/D79792
