[PATCH] D79792: [AMDGPU] New SIInsertHardClauses pass
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu May 14 09:44:02 PDT 2020
rampitec added a comment.
In D79792#2035964 <https://reviews.llvm.org/D79792#2035964>, @foad wrote:
> Yes, shouldClusterMemOps also imposes a limit on the length of the cluster. If that is *only* to help with register pressure, then perhaps I can bypass that check by always calling it with NumLoads=2 instead of NumLoads=CI.Size+1. What do you think?
That's actually a very good idea. Just add a comment explaining it!
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D79792/new/
https://reviews.llvm.org/D79792
More information about the llvm-commits
mailing list