[PATCH] D115143: [AArch64][Analysis] Add on overhead costs for SVE gathers and scatters

David Sherwood via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Dec 7 01:30:42 PST 2021


david-arm added a comment.

In D115143#3175743 <https://reviews.llvm.org/D115143#3175743>, @dmgreen wrote:

> 10 sounds high, but from looking at some of the software optimization guides it does not seem like a bad worse case value.
>
> Is it worth making it a option (that can default to 10), to allow experimentation with other values?

I guess that might be useful?

>From the benchmarks I ran for tight loops with a single gather or scatter with a variety of different strides the vector loops with SVE gathers and scatters overall didn't perform better than a scalar loop! Even worse, in loops with a high density of gathers and scatters the runtime can even be 2x-3x worse than a scalar loop. The optimisation guides do suggest there is a very low throughput.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D115143/new/

https://reviews.llvm.org/D115143



More information about the llvm-commits mailing list