[PATCH] D54882: [AMDGPU] Add sdwa support for ADD|SUB U64 decomposed Pseudos

Ron Lieberman via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Nov 27 06:45:18 PST 2018


ronlieb added a comment.

In D54882#1309605 <https://reviews.llvm.org/D54882#1309605>, @rampitec wrote:

> In D54882#1309588 <https://reviews.llvm.org/D54882#1309588>, @rampitec wrote:
>
> > In D54882#1309583 <https://reviews.llvm.org/D54882#1309583>, @ronlieb wrote:
> >
> > > In D54882#1308240 <https://reviews.llvm.org/D54882#1308240>, @rampitec wrote:
> > >
> > > > Essentially this is a limited version of shrinking. So I have several questions:
> > > >
> > > > 1. Why not to run shrink pass before sdwa instead?
> > >
> > >
> > > I tried adding Shrink pass before PeepholeSDWA and observed 88 lit test failures.
> > >  i tried moving Shrink pass before Peephole SDWA and observed 25 lit test failures
> >
> >
> > Which may be a good thing if these failures are progressions (as I suspect) and not regressions. Are they progressions?
> >  That is the point of other comments too, this patch is limited to handle just two instructions while there is a clear possibility to do it for almost any VOP3.
>
>
> I would also assume many of these failures are just commute which is attempted by shrink pass. That is normal and would only need to change the tests.


i tried an experiment of simply invoking the Shrink pass a 2nd time.

  addPass(createSIShrinkInstructionsPass());
  addPass(createSIShrinkInstructionsPass());

which resulted in 74 failures, and they do seem to be commute changes primarily (did not look at them all)
So then, i added a 3rd invocation and zero failures (i'm still laughing at this one).


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D54882/new/

https://reviews.llvm.org/D54882





More information about the llvm-commits mailing list