[PATCH] D68667: [SLP] respect target register width for GEP vectorization (PR43578)

Dávid Bolvanský via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Oct 9 03:05:50 PDT 2019


xbolva00 added a comment.

Generally, I think there are more bugs for -march=haswell. Only in rare cases the perf of binaries with  -march=haswell is better than plain -O3.
I tried this patch with zstd but nothing improved.

Plain -O3
./zstd  -b selesiafiles/* -f
 3# 13 files         : 251919670 ->  97724903 (2.578), 182.0 MB/s , 923.2 MB/s

-O3 -march=haswell
/zstd  -b selesiafiles/* -f
 3# 13 files         : 251919670 ->  97724903 (2.578), 185.7 MB/s , 866.9 MB/s

-O3 -march=haswell -mprefer-vector-width=128
./zstd  -b bench/* -f
 3# 13 files         : 251919670 ->  97724903 (2.578), 188.5 MB/s , 806.8 MB/s

for example gcc-10's results for -march=haswell
./zstd  -b bench/* -f
 3# 13 files         : 251919670 ->  97724903 (2.578), 188.7 MB/s ,1032.8 MB/s


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D68667/new/

https://reviews.llvm.org/D68667





More information about the llvm-commits mailing list