[PATCH] D68667: [SLP] respect target register width for GEP vectorization (PR43578)
Dávid Bolvanský via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Oct 9 03:05:50 PDT 2019
xbolva00 added a comment.
Generally, I think there are more bugs for -march=haswell. Only in rare cases the perf of binaries with -march=haswell is better than plain -O3.
I tried this patch with zstd but nothing improved.
Plain -O3
./zstd -b selesiafiles/* -f
3# 13 files : 251919670 -> 97724903 (2.578), 182.0 MB/s , 923.2 MB/s
-O3 -march=haswell
/zstd -b selesiafiles/* -f
3# 13 files : 251919670 -> 97724903 (2.578), 185.7 MB/s , 866.9 MB/s
-O3 -march=haswell -mprefer-vector-width=128
./zstd -b bench/* -f
3# 13 files : 251919670 -> 97724903 (2.578), 188.5 MB/s , 806.8 MB/s
for example gcc-10's results for -march=haswell
./zstd -b bench/* -f
3# 13 files : 251919670 -> 97724903 (2.578), 188.7 MB/s ,1032.8 MB/s
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D68667/new/
https://reviews.llvm.org/D68667
More information about the llvm-commits
mailing list