[LLVMdev] -fplugin-arg-dragonegg-enable-gcc-optzns status
Jack Howarth
howarth at bromo.med.uc.edu
Thu Jun 9 17:47:26 PDT 2011
Duncan,
Here are the complete benchmarks rerun against gcc 4.5.4 built with...
Using built-in specs.
COLLECT_GCC=gfortran-fsf-4.5
COLLECT_LTO_WRAPPER=/sw/lib/gcc4.5/libexec/gcc/x86_64-apple-darwin11.0.0/4.5.4/lto-wrapper
Target: x86_64-apple-darwin11.0.0
Configured with: ../gcc-4.5.4/configure --prefix=/sw --prefix=/sw/lib/gcc4.5 --mandir=/sw/share/man --infodir=/sw/lib/gcc4.5/info --enable-languages=c,c++,fortran,objc,obj-c++,java --with-gmp=/sw --with-libiconv-prefix=/sw --with-ppl=/sw --with-cloog=/sw --with-mpc=/sw --with-system-zlib --x-includes=/usr/X11R6/include --x-libraries=/usr/X11R6/lib --program-suffix=-fsf-4.5 --enable-lto --enable-checking=release
Thread model: posix
gcc version 4.5.4 20110608 (prerelease) (GCC)
x86_64 darwin
A) gcc 4.5.4svn using -msse3 -ffast-math -O3 -fno-tree-vectorize
B) gcc 4.5.4svn/dragonegg using -msse3 -ffast-math -O3 -fno-tree-vectorize -fplugin-arg-dragonegg-enable-gcc-optzns
C) gcc 4.5.4svn/dragonegg using -msse3 -ffast-math -O3 -fno-tree-vectorize
D) gcc 4.5.4svn/dragonegg using -msse3 -ffast-math -O3 -fno-tree-vectorize -fplugin-arg-dragonegg-enable-gcc-optzns -fplugin-arg-dragonegg-llvm-ir-optimize=2
E) gcc 4.5.4svn/dragonegg using -msse3 -ffast-math -O3 -fno-tree-vectorize -fplugin-arg-dragonegg-llvm-ir-optimize=2
Run Time (seconds)
Benchmark A) stock B) gcc 4.5.4/ C) gcc 4.5.4/ D) gcc 4.5.4/ E) gcc 4.5.4/
gcc 4.5.4 dragonegg/optzns dragonegg dragonegg/optzns/ dragonegg/optimize=2
optimize=2
ac 9.58 9.11 12.28 9.12 12.73
aermod 20.99 16.18 17.86 16.30 17.89
air 6.06 6.58 7.69 6.51 7.64
capacita 35.76 39.86 46.10 39.58 45.89
channel 2.03 2.04 1.96 2.04 1.96
doduc 28.16 28.50 30.34 28.53 30.42
fatigue 8.12 7.09 10.34 7.06 10.25
gas_dyn 10.16 9.92 11.67 9.96 11.81
induct 20.14 20.76 48.75 20.78 48.75
linpk 15.43 15.41 15.64 15.41 15.64
mdbx 11.41 11.72 12.11 11.72 12.07
nf 27.90 28.52 29.26 28.42 29.13
protein 38.65 38.72 41.31 38.75 39.49
rnflow 27.22 28.18 31.81 28.15 31.98
test_fpu 11.49 11.23 11.57 11.17 11.52
tfft 1.91 1.95 2.15 1.95 2.16
Mean 12.72 12.60 14.73 12.59 14.72
Compile Time (seconds)
Benchmark A) stock B) gcc 4.5.4/ C) gcc 4.5.4/ D) gcc 4.5.4/ E) gcc 4.5.4/
gcc 4.5.4 dragonegg/optzns dragonegg dragonegg/optzns/ dragonegg/optimize=2
optimize=2
ac 0.86 0.44 0.31 0.41 0.28
aermod 31.13 25.81 20.94 25.44 20.87
air 1.74 1.48 0.81 1.46 0.78
capacita 0.86 0.74 0.44 0.71 0.42
channel 0.35 0.32 0.23 0.30 0.23
doduc 3.08 2.63 1.63 2.60 1.58
fatigue 1.04 1.05 0.89 0.90 0.70
gas_dyn 0.94 0.94 0.75 0.84 0.62
induct 3.30 2.52 1.84 2.36 1.66
linpk 0.33 0.28 0.20 0.28 0.20
mdbx 1.09 1.02 0.60 0.99 0.59
nf 0.41 0.40 0.28 0.40 0.28
protein 1.56 1.28 0.98 1.21 0.82
rnflow 1.75 1.70 1.24 1.61 1.13
test_fpu 1.38 1.41 1.05 1.31 0.95
tfft 0.31 0.28 0.19 0.28 0.19
Executable Size (bytes)
Benchmark A) stock B) gcc 4.5.4/ C) gcc 4.5.4/ D) gcc 4.5.4/ E) gcc 4.5.4/
gcc 4.5.4 dragonegg/optzns dragonegg dragonegg/optzns/ dragonegg/optimize=2
optimize=2
ac 26344 30896 26704 30896 26824
aermod 1145924 1043816 1052056 1027680 1031880
air 57404 57700 53532 53556 53532
capacita 40864 41008 37064 41008 37064
channel 22448 22664 22664 22664 22664
doduc 127340 124108 120124 124372 120484
fatigue 61152 65352 65664 61256 61568
gas_dyn 647864 58768 59024 54672 54960
induct 162360 180440 175312 168304 163176
linpk 18112 18848 18864 18848 18896
mdbx 53464 57652 49516 57652 49516
nf 22560 23784 24080 23784 24080
protein 74320 74440 74816 70344 66624
rnflow 66040 71488 71648 67416 67616
test_fpu 52624 58224 58320 54128 54256
tfft 18416 18456 18600 18456 18600
More information about the llvm-dev
mailing list