[PATCH] D20443: [PowerPC] - Combine loads of v4i8 to loads of i32 followed by bitcast
Nemanja Ivanovic via llvm-commits
llvm-commits at lists.llvm.org
Mon May 30 12:12:58 PDT 2016
nemanjai added a comment.
I've done some lightweight performance testing (the LNT tests on a quiet machine) and here are the results with benchmarks that take less than 5s to execute omitted (BEFORE_TIME is without this patch and AFTER_TIME is with this patch):
BENCHMARK_NAME BEFORE_TIME AFTER_TIME ABS_DIFF PCT_DIFF
MultiSource/Applications/SPASS/Output/SPASS 6.050 6.049 -0.001 -0.02%
MultiSource/Applications/JM/lencod/Output/lencod 5.102 5.138 0.036 0.70%
MultiSource/Applications/lambda-0.1.3/Output/lambda 5.891 5.889 -0.002 -0.04%
MultiSource/Applications/hexxagon/Output/hexxagon 13.814 13.808 -0.007 -0.05%
MultiSource/Applications/lua/Output/lua 22.924 22.919 -0.004 -0.02%
MultiSource/Benchmarks/SciMark2-C/Output/scimark2 72.514 72.508 -0.006 -0.01%
MultiSource/Benchmarks/nbench/Output/nbench 19.489 19.189 -0.300 -1.54%
MultiSource/Benchmarks/NPB-serial/is/Output/is 10.415 10.417 0.002 0.02%
MultiSource/Benchmarks/ASC_Sequoia/IRSmk/Output/IRSmk 32.249 32.987 0.739 2.29%
MultiSource/Benchmarks/ASC_Sequoia/AMGmk/Output/AMGmk 10.481 10.482 0.000 0.00%
MultiSource/Benchmarks/ASC_Sequoia/CrystalMk/Output/CrystalMk 7.537 7.537 -0.001 -0.01%
MultiSource/Benchmarks/TSVC/Reductions-flt/Output/Reductions-flt 5.548 5.548 0.000 0.00%
MultiSource/Benchmarks/TSVC/Searching-flt/Output/Searching-flt 6.452 6.452 -0.001 -0.01%
MultiSource/Benchmarks/TSVC/GlobalDataFlow-dbl/Output/GlobalDataFlow-dbl 6.823 6.825 0.002 0.03%
MultiSource/Benchmarks/TSVC/Reductions-dbl/Output/Reductions-dbl 5.509 5.509 0.001 0.01%
MultiSource/Benchmarks/TSVC/Searching-dbl/Output/Searching-dbl 6.253 6.253 0.000 0.00%
MultiSource/Benchmarks/PAQ8p/Output/paq8p 110.282 73.201 -37.080 -33.62%
MultiSource/Benchmarks/Bullet/Output/bullet 5.242 5.209 -0.033 -0.63%
MultiSource/Benchmarks/7zip/Output/7zip-benchmark 7.853 7.842 -0.011 -0.14%
MultiSource/Benchmarks/mafft/Output/pairlocalalign 26.034 26.056 0.022 0.08%
SingleSource/Benchmarks/CoyoteBench/Output/almabench 30.836 30.827 -0.009 -0.03%
SingleSource/Benchmarks/CoyoteBench/Output/huffbench 21.345 21.350 0.005 0.02%
SingleSource/Benchmarks/Shootout/Output/sieve 5.268 5.267 -0.001 -0.01%
SingleSource/Benchmarks/Shootout/Output/lists 7.607 7.605 -0.002 -0.03%
SingleSource/Benchmarks/Shootout/Output/methcall 10.474 10.474 0.000 0.00%
SingleSource/Benchmarks/Shootout-C++/Output/lists 11.707 11.733 0.026 0.22%
SingleSource/Benchmarks/Shootout-C++/Output/methcall 11.177 11.175 -0.002 -0.02%
SingleSource/Benchmarks/Misc/Output/flops 6.340 6.343 0.003 0.05%
SingleSource/Benchmarks/Misc/Output/salsa20 6.292 6.291 -0.001 -0.01%
SingleSource/Benchmarks/Misc/Output/ReedSolomon 6.903 6.905 0.002 0.03%
SingleSource/Benchmarks/Misc-C++/Large/Output/sphereflake 6.154 6.154 -0.000 -0.00%
SingleSource/Benchmarks/Misc-C++/Output/stepanov_v1p2 9.809 9.809 -0.000 -0.00%
SingleSource/Benchmarks/Adobe-C++/Output/stepanov_abstraction 5.733 5.733 -0.000 -0.01%
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/2mm/Output/2mm 18.724 16.745 -1.980 -10.57%
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/3mm/Output/3mm 29.825 28.844 -0.981 -3.29%
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/gemm/Output/gemm 8.889 8.518 -0.371 -4.18%
SingleSource/Benchmarks/Polybench/linear-algebra/kernels/symm/Output/symm 13.260 14.198 0.937 7.07%
SingleSource/Benchmarks/Linpack/Output/linpack-pc 13.215 13.214 -0.001 -0.01%
SingleSource/Benchmarks/SmallPT/Output/smallpt 12.783 12.781 -0.001 -0.01%
SingleSource/UnitTests/Vector/Altivec/Output/alti.expandfft 23.281 23.291 0.009 0.04%
SingleSource/UnitTests/Vectorizer/Output/gcc-loops 7.511 7.511 0.000 0.00%
Repository:
rL LLVM
http://reviews.llvm.org/D20443
More information about the llvm-commits
mailing list