[PATCH] D88577: [AArch64] Generate udot for v16i8 sum reduction to i32

Thu Oct 1 02:51:11 PDT 2020

mivnay added a comment.

In D88577#2305592 <https://reviews.llvm.org/D88577#2305592>, @SjoerdMeijer wrote:

> I haven't  looked in much detail at this patch, but this looks like some straightforward lowering of `llvm.experimental.vector.reduce.add`. Absolutely nothing wrong with that, but I am curious who's going to produce this intrinsic? The vectoriser, the matrix pass? In other words, any ideas on the bigger picture?

Thanks for looking into the patch. The pattern added in lit test gets generated from the below C code:

  #include <stdint.h>
  #include <stdlib.h>

  int func(uint8_t *a) {
    int sum = 0;
    for (int i = 0; i < 16; i++) {
      sum += a[i];
    }

    return sum;
  }

Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D88577/new/

https://reviews.llvm.org/D88577