[PATCH] D106653: [LoopVectorize][AArch64] Enable ordered reductions by default for AArch64

David Sherwood via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Aug 18 05:17:33 PDT 2021


david-arm added a comment.

Hi @dmgreen, so I have run SPEC2006 on a neoverse-n1 9 times without my patch and 9 times with when building with -O3, then compared the averages. Here is a summary of the results showing a few outliers (slowest at the top, fastest at the bottom):

Benchmark          Percentage Runtime Change (<0 = faster with ordered reductions)




453.povray:          0.3
464.h264ref:         0.1
462.libquantum:   0
...
450.soplex:          -0.95
429.mcf:              -1.22
482.sphinx3:       -1.27
471.omnetpp:     -1.51

Geometric mean: -0.45

Overall it looks like this slightly faster with ordered reductions enabled by default for AArch64.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D106653/new/

https://reviews.llvm.org/D106653



More information about the llvm-commits mailing list