[PATCH] D65961: AMDGPU/SILoadStoreOptimizer: Optimize scanning for mergeable instructions

Jay Foad via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Dec 3 03:36:53 PST 2019


foad added a comment.

In D65961#1765026 <https://reviews.llvm.org/D65961#1765026>, @tstellar wrote:

> In D65961#1763143 <https://reviews.llvm.org/D65961#1763143>, @foad wrote:
>
> > Hi @tstellar, I'm looking into a case where this patch slowed down a shader by 10%. Before I go too far, was this patch supposed to change the behaviour at all, or was it supposed to be purely a compile time improvement?
>
>
> The intention was to not change the behavior at all.
>
> > In the case I'm looking at it seems to do the same amount of load merging as before, but the merged loads are inserted at different places in the basic block.
>
> Do you have a MIR or .ll dump of the shader I could look at ?  Also, does https://reviews.llvm.org/D65966 help?


P8173 <https://reviews.llvm.org/P8173> is a MIR test case. See the RUN line for how to run it. I see significant differences in the placing of the merged BUFFER_LOAD instructions with/without D65961 <https://reviews.llvm.org/D65961> (or before/after it was committed).

I can't easily try D65966 <https://reviews.llvm.org/D65966>. It looks like it needs some significant changes to apply to HEAD.


Repository:
  rL LLVM

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D65961/new/

https://reviews.llvm.org/D65961





More information about the llvm-commits mailing list