[LLVMdev] Polyhedron 2005 results for dragonegg 3.3svn
Duncan Sands
duncan.sands at gmail.com
Fri May 31 21:45:48 PDT 2013
Hi Jack,
On 29/05/13 22:04, Jack Howarth wrote:
> On Wed, May 29, 2013 at 03:25:30PM +0200, Duncan Sands wrote:
>> Hi Jack, I pulled the loop vectorizer and fast math changes into the 3.3 branch,
>> so hopefully they will be part of 3.3 rc3 (and 3.3 final!). It would be great
>> if you could redo the benchmarks rc3.
>>
>
> Duncan,
> As requested, appended are the updated Polyhedron 2005 benchmark results with both RC1 and RC3 llvm 3.3 testing.
thanks for doing this. As rc3 hasn't been tagged yet, I assume you used latest
3.3svn?
> There is a small improvement in the dragonegg results (without -fplugin-arg-dragonegg-enable-gcc-optzns) in RC3. I assume
> we still only have partial coverage of all of the -ffast-math optimizations performed by FSF gcc in llvm's fast-math
> support, correct?
These results are very disappointing, I was hoping to see a big improvement
somewhere instead of no real improvement anywhere (except for gas_dyn) or a
regression (eg: mdbx). I think LLVM now has a reasonable array of fast-math
optimizations. I will try to find time to poke at gas_dyn and induct: since
turning on gcc's optimizations there halve the run-time, LLVM's IR optimizers
are clearly missing something important.
Ciao, Duncan.
> Jack
>
> Tested on x86_apple-darwin12
>
> Compile Flags: -ffast-math -funroll-loops -O3
>
> de-gfc47: /sw/lib/gcc4.7/bin/gfortran -fplugin=/sw/lib/gcc4.7/lib/dragonegg.so -specs=/sw/lib/gcc4.7/lib/integrated-as.specs
> de-gfc48: /sw/lib/gcc4.8/bin/gfortran -fplugin=/sw/lib/gcc4.8/lib/dragonegg.so -specs=/sw/lib/gcc4.8/lib/integrated-as.specs
> de-gfc47+optzns: /sw/lib/gcc4.7/bin/gfortran -fplugin=/sw/lib/gcc4.7/lib/dragonegg.so -specs=/sw/lib/gcc4.7/lib/integrated-as.specs
> +-fplugin-arg-dragonegg-enable-gcc-optzns
> de-gfc48+optzns: /sw/lib/gcc4.8/bin/gfortran -fplugin=/sw/lib/gcc4.8/lib/dragonegg.so -specs=/sw/lib/gcc4.8/lib/integrated-as.specs
> +-fplugin-arg-dragonegg-enable-gcc-optzns
> gfortran47: /sw/bin/gfortran-fsf-4.7
> gfortran48: /sw/bin/gfortran-fsf-4.8
>
> Run time (secs)
>
> Benchmark de-gfc47 de-gfc47 de-gfc48 de-gfc48 de-gfc47 de-gfc47 de-gfc48 de-gfc48 gfortran47 gfortran48
> +optzns +optzns +optzns +optzns
> RC1 RC3 RC1 RC3 RC1 RC3 RC1 RC3
> ac 11.39 11.66 11.39 11.58 8.09 8.07 8.14 8.14 8.18 8.05
> aermod 16.35 16.47 16.00 16.44 14.50 14.61 15.28 14.43 16.45 16.23
> air 6.88 6.87 6.77 6.77 5.42 5.42 5.28 5.27 5.83 5.73
> capacita 39.85 37.80 39.83 37.86 34.71 34.81 33.47 33.53 32.51 33.02
> channel 2.05 2.06 2.05 2.06 2.15 2.15 1.99 1.99 1.83 1.83
> doduc 27.10 27.43 27.37 27.39 26.75 27.03 26.31 26.24 25.91 25.76
> fatigue 8.85 8.84 8.81 8.88 7.72 7.75 5.60 5.42 8.26 5.60
> gas_dyn 11.76 8.25 11.50 7.94 4.51 4.52 4.21 4.20 3.88 3.59
> induct 24.01 24.45 24.04 24.04 11.86 11.90 11.85 11.85 12.08 12.21
> linpk 15.43 15.48 15.48 15.49 15.40 15.47 15.83 15.81 15.37 15.64
> mdbx 11.92 12.14 11.91 12.15 11.30 11.29 11.27 11.27 11.18 11.42
> nf 29.57 30.08 30.04 30.11 29.50 29.82 29.59 29.86 27.21 27.25
> protein 36.15 36.15 35.21 35.17 35.93 36.02 34.16 34.06 31.88 31.81
> rnflow 27.02 27.08 25.92 26.12 26.77 26.83 22.20 22.21 24.67 21.21
> test_fpu 11.49 11.55 11.47 11.52 9.11 9.11 9.30 9.30 7.90 8.01
> tfft 1.92 1.94 1.92 1.92 1.92 1.92 1.89 1.90 1.86 1.90
>
> Geom. Mean 13.19 12.95 13.10 12.83 10.99 11.02 10.52 10.47 10.60 10.22
>
> Compile time (secs)
>
> Benchmark de-gfc47 de-gfc47 de-gfc48 de-gfc48 de-gfc47 de-gfc47 de-gfc48 de-gfc48 gfortran47 gfortran48
> +optzns +optzns +optzns +optzns
> RC1 RC3 RC1 RC3 RC1 RC3 RC1 RC3
> ac 0.62 1.63 0.29 0.93 2.20 1.02 0.71 0.73 2.88 2.08
> aermod 35.19 35.57 20.44 35.86 43.50 43.39 42.90 43.08 42.75 55.97
> air 1.16 1.23 1.11 1.26 2.72 2.68 2.40 2.35 4.48 4.28
> capacita 0.52 0.60 0.52 0.62 1.02 0.94 1.04 0.96 1.90 1.89
> channel 0.26 0.28 0.23 0.30 0.47 0.45 0.50 0.47 0.65 0.75
> doduc 1.74 1.89 1.74 1.91 3.78 3.71 3.53 3.55 6.03 5.68
> fatigue 0.91 0.91 0.87 0.91 1.33 1.30 1.49 1.49 1.97 2.04
> gas_dyn 0.70 0.87 0.63 0.88 1.40 1.37 1.39 1.39 3.39 2.44
> induct 1.95 1.83 1.77 1.83 2.87 2.81 2.99 3.02 4.08 4.42
> linpk 0.25 0.32 0.21 0.32 0.53 0.52 0.72 0.73 0.92 1.25
> mdbx 0.66 0.73 0.61 0.75 1.30 1.26 1.24 1.15 2.16 1.90
> nf 0.39 0.55 0.35 0.55 0.80 0.80 0.74 0.74 2.12 1.67
> protein 1.12 1.18 1.03 1.20 2.01 1.99 1.79 1.77 4.39 3.62
> rnflow 1.26 1.55 1.19 1.55 2.93 2.84 2.72 2.73 6.43 5.47
> test_fpu 0.91 1.12 0.85 1.13 2.27 5.06 2.22 2.23 5.28 4.26
> tfft 0.22 0.24 0.18 0.22 0.39 0.40 0.46 0.46 0.59 0.78
>
> Executable (bytes)
>
> Benchmark de-gfc47 de-gfc47 de-gfc48 de-gfc48 de-gfc47 de-gfc47 de-gfc48 de-gfc48 gfortran47 gfortran48
> +optzns +optzns +optzns +optzns
> RC1 RC3 RC1 RC3 RC1 RC3 RC1 RC3
> ac 26776 30896 26792 30912 47160 47160 34928 34928 59120 42784
> aermod 1023024 1035312 1023064 1031248 1052728 1052728 1031576 1031568 1392840 1286136
> air 61940 61940 61948 61948 65964 65964 61876 61876 110768 106680
> capaci 41344 45440 41144 41144 45440 45440 45040 45040 77920 73248
> channe 22736 22600 22744 22608 26696 22600 22552 22552 34704 34656
> doduc 128376 120188 128384 120196 140580 140580 136296 136296 205320 189040
> fatigu 65648 69744 65640 69736 69808 69808 73848 73848 90240 82040
> gas_dy 54840 58936 54936 59032 63144 63144 71304 71304 123680 99184
> induct 163064 163064 158792 162888 163192 167288 166920 171024 179080 170872
> linpk 18680 22896 18688 22904 22896 22896 34920 34920 42640 50936
> mdbx 49492 57684 49508 57700 57692 57692 53604 53604 90232 78032
> nf 23880 32080 23888 27984 32088 32088 32104 32104 84072 67744
> protei 74960 79056 75048 79144 87144 87144 83128 83128 131976 115688
> rnflow 67704 79992 67712 80000 88248 88248 96152 96152 205584 176912
> test_f 50000 62296 50008 62304 70440 70440 78456 78456 179464 142608
> tfft 18568 18568 18576 18576 18416 18416 22544 22544 30680 34832
>
>
More information about the llvm-dev
mailing list