[PATCH] D88577: [AArch64] Generate udot for v16i8 sum reduction to i32

Vinay Madhusudan via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Oct 1 03:29:41 PDT 2020


mivnay added a comment.

In D88577#2305658 <https://reviews.llvm.org/D88577#2305658>, @dmgreen wrote:

> Hello
>
> Can you update with full context (-U999999)? It makes Phabricator reviews easier to follow.
>
> I had thought about this somewhat in reference to in-loop reductions. I had presumed that it would need some form of partial reduction though, as you would want part of the reduction to happen outside the loop (I think).
>
> Improving codegen on its own is good, but I'm interested in seeing how this fits with the other patches.

Hi,

I am working on the performance-related issues mentioned in the bug <https://bugs.llvm.org/show_bug.cgi?id=46888> and the meta ticket <https://bugs.llvm.org/show_bug.cgi?id=46929>. I have three unrelated patches (i.e., patterns) for codegen improvements. This is the first one.
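
For readers without the diff at hand, here is a rough scalar sketch of the idea in the patch title and of the partial-reduction shape raised in the quoted comment. It is only an illustration, not the code in D88577: `udot_model`, `sum_bytes_partial` and `sum_bytes_reference` are hypothetical names, and the multiple-of-16 length restriction is an assumption made to keep the example short. The UDOT instruction adds, into each of four 32-bit lanes, the dot product of four adjacent unsigned bytes, so a dot product with a vector of ones yields four partial sums of a v16i8 input.

```c
#include <stddef.h>
#include <stdint.h>

/* Scalar model of one UDOT step on a 16-byte vector: each of the four
   32-bit accumulator lanes gains the dot product of four adjacent
   unsigned bytes of a and b. */
static void udot_model(uint32_t acc[4], const uint8_t a[16],
                       const uint8_t b[16]) {
    for (int lane = 0; lane < 4; ++lane)
        for (int k = 0; k < 4; ++k)
            acc[lane] += (uint32_t)a[4 * lane + k] * (uint32_t)b[4 * lane + k];
}

/* Sum n bytes into a 32-bit result. Assumption for this sketch: n is a
   multiple of 16; any tail shorter than 16 bytes is ignored. */
static uint32_t sum_bytes_partial(const uint8_t *p, size_t n) {
    static const uint8_t ones[16] = {1, 1, 1, 1, 1, 1, 1, 1,
                                     1, 1, 1, 1, 1, 1, 1, 1};
    uint32_t acc[4] = {0, 0, 0, 0};

    /* In-loop part: only the four-lane partial sums are updated. */
    for (size_t i = 0; i + 16 <= n; i += 16)
        udot_model(acc, p + i, ones);

    /* Out-of-loop part: one final reduction across the four lanes. */
    return acc[0] + acc[1] + acc[2] + acc[3];
}

/* Plain byte-by-byte sum, used only to check the sketch. */
static uint32_t sum_bytes_reference(const uint8_t *p, size_t n) {
    uint32_t s = 0;
    for (size_t i = 0; i < n; ++i)
        s += p[i];
    return s;
}
```

For an input whose length is a multiple of 16, both functions should return the same value. The point of the partial form is that the cross-lane add happens once after the loop instead of once per iteration.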


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D88577/new/

https://reviews.llvm.org/D88577


