[LLVMdev] Polyhedron 2005 results for dragonegg 3.3svn

Jack Howarth howarth at bromo.med.uc.edu
Wed May 29 13:04:37 PDT 2013


On Wed, May 29, 2013 at 03:25:30PM +0200, Duncan Sands wrote:
> Hi Jack, I pulled the loop vectorizer and fast math changes into the 3.3 branch,
> so hopefully they will be part of 3.3 rc3 (and 3.3 final!).  It would be great
> if you could redo the benchmarks rc3.
>

Duncan,
    As requested, appended are the updated Polyhedron 2005 benchmark results with both RC1 and RC3 llvm 3.3 testing.
There is a small improvement in the dragonegg results (without -fplugin-arg-dragonegg-enable-gcc-optzns) in RC3. I assume 
we still only have partial coverage of all of the -ffast-math optimizations performed by FSF gcc in llvm's fast-math
support, correct?
                      Jack

Tested on x86_apple-darwin12

Compile Flags: -ffast-math -funroll-loops -O3

de-gfc47: /sw/lib/gcc4.7/bin/gfortran -fplugin=/sw/lib/gcc4.7/lib/dragonegg.so -specs=/sw/lib/gcc4.7/lib/integrated-as.specs
de-gfc48: /sw/lib/gcc4.8/bin/gfortran -fplugin=/sw/lib/gcc4.8/lib/dragonegg.so -specs=/sw/lib/gcc4.8/lib/integrated-as.specs
de-gfc47+optzns: /sw/lib/gcc4.7/bin/gfortran -fplugin=/sw/lib/gcc4.7/lib/dragonegg.so -specs=/sw/lib/gcc4.7/lib/integrated-as.specs
+-fplugin-arg-dragonegg-enable-gcc-optzns
de-gfc48+optzns: /sw/lib/gcc4.8/bin/gfortran -fplugin=/sw/lib/gcc4.8/lib/dragonegg.so -specs=/sw/lib/gcc4.8/lib/integrated-as.specs
+-fplugin-arg-dragonegg-enable-gcc-optzns
gfortran47: /sw/bin/gfortran-fsf-4.7
gfortran48: /sw/bin/gfortran-fsf-4.8

Run time (secs)

Benchmark         de-gfc47  de-gfc47  de-gfc48  de-gfc48  de-gfc47  de-gfc47  de-gfc48  de-gfc48 gfortran47 gfortran48
                                                          +optzns   +optzns   +optzns   +optzns
                     RC1       RC3     RC1       RC3       RC1       RC3        RC1       RC3
ac                  11.39     11.66   11.39     11.58      8.09      8.07       8.14      8.14       8.18        8.05
aermod              16.35     16.47   16.00     16.44     14.50     14.61      15.28     14.43      16.45       16.23
air                  6.88      6.87    6.77      6.77      5.42      5.42       5.28      5.27       5.83        5.73
capacita            39.85     37.80   39.83     37.86     34.71     34.81      33.47     33.53      32.51       33.02
channel              2.05      2.06    2.05      2.06      2.15      2.15       1.99      1.99       1.83        1.83
doduc               27.10     27.43   27.37     27.39     26.75     27.03      26.31     26.24      25.91       25.76
fatigue              8.85      8.84    8.81      8.88      7.72      7.75       5.60      5.42       8.26        5.60
gas_dyn             11.76      8.25   11.50      7.94      4.51      4.52       4.21      4.20       3.88        3.59
induct              24.01     24.45   24.04     24.04     11.86     11.90      11.85     11.85      12.08       12.21
linpk               15.43     15.48   15.48     15.49     15.40     15.47      15.83     15.81      15.37       15.64
mdbx                11.92     12.14   11.91     12.15     11.30     11.29      11.27     11.27      11.18       11.42
nf                  29.57     30.08   30.04     30.11     29.50     29.82      29.59     29.86      27.21       27.25
protein             36.15     36.15   35.21     35.17     35.93     36.02      34.16     34.06      31.88       31.81
rnflow              27.02     27.08   25.92     26.12     26.77     26.83      22.20     22.21      24.67       21.21
test_fpu            11.49     11.55   11.47     11.52      9.11      9.11       9.30      9.30       7.90        8.01
tfft                 1.92      1.94    1.92      1.92      1.92      1.92       1.89      1.90       1.86        1.90

Geom. Mean          13.19     12.95   13.10     12.83     10.99     11.02      10.52     10.47      10.60       10.22

Compile time (secs)

Benchmark         de-gfc47  de-gfc47  de-gfc48  de-gfc48  de-gfc47  de-gfc47  de-gfc48  de-gfc48 gfortran47 gfortran48
                                                          +optzns   +optzns   +optzns   +optzns
                     RC1       RC3       RC1       RC3      RC1       RC3       RC1       RC3
ac                   0.62     1.63       0.29      0.93     2.20      1.02      0.71      0.73      2.88        2.08
aermod              35.19    35.57      20.44     35.86    43.50     43.39     42.90     43.08     42.75       55.97
air                  1.16     1.23       1.11      1.26     2.72      2.68      2.40      2.35      4.48        4.28
capacita             0.52     0.60       0.52      0.62     1.02      0.94      1.04      0.96      1.90        1.89
channel              0.26     0.28       0.23      0.30     0.47      0.45      0.50      0.47      0.65        0.75
doduc                1.74     1.89       1.74      1.91     3.78      3.71      3.53      3.55      6.03        5.68
fatigue              0.91     0.91       0.87      0.91     1.33      1.30      1.49      1.49      1.97        2.04
gas_dyn              0.70     0.87       0.63      0.88     1.40      1.37      1.39      1.39      3.39        2.44
induct               1.95     1.83       1.77      1.83     2.87      2.81      2.99      3.02      4.08        4.42
linpk                0.25     0.32       0.21      0.32     0.53      0.52      0.72      0.73      0.92        1.25
mdbx                 0.66     0.73       0.61      0.75     1.30      1.26      1.24      1.15      2.16        1.90
nf                   0.39     0.55       0.35      0.55     0.80      0.80      0.74      0.74      2.12        1.67
protein              1.12     1.18       1.03      1.20     2.01      1.99      1.79      1.77      4.39        3.62
rnflow               1.26     1.55       1.19      1.55     2.93      2.84      2.72      2.73      6.43        5.47
test_fpu             0.91     1.12       0.85      1.13     2.27      5.06      2.22      2.23      5.28        4.26
tfft                 0.22     0.24       0.18      0.22     0.39      0.40      0.46      0.46      0.59        0.78

Executable (bytes)

Benchmark         de-gfc47  de-gfc47  de-gfc48  de-gfc48  de-gfc47  de-gfc47  de-gfc48  de-gfc48 gfortran47 gfortran48
                                                          +optzns   +optzns   +optzns   +optzns 
                     RC1       RC3       RC1       RC3      RC1       RC3       RC1       RC3 
ac                  26776     30896     26792     30912    47160     47160     34928     34928       59120      42784
aermod            1023024   1035312   1023064   1031248  1052728   1052728   1031576   1031568     1392840    1286136
air                 61940     61940     61948     61948    65964     65964     61876     61876      110768     106680
capaci              41344     45440     41144     41144    45440     45440     45040     45040       77920      73248
channe              22736     22600     22744     22608    26696     22600     22552     22552       34704      34656
doduc              128376    120188    128384    120196   140580    140580    136296    136296      205320     189040
fatigu              65648     69744     65640     69736    69808     69808     73848     73848       90240      82040
gas_dy              54840     58936     54936     59032    63144     63144     71304     71304      123680      99184
induct             163064    163064    158792    162888   163192    167288    166920    171024      179080     170872
linpk               18680     22896     18688     22904    22896     22896     34920     34920       42640      50936
mdbx                49492     57684     49508     57700    57692     57692     53604     53604      90232       78032
nf                  23880     32080     23888     27984    32088     32088     32104     32104       84072      67744
protei              74960     79056     75048     79144    87144     87144     83128     83128      131976     115688
rnflow              67704     79992     67712     80000    88248     88248     96152     96152      205584     176912
test_f              50000     62296     50008     62304    70440     70440     78456     78456      179464     142608
tfft                18568     18568     18576     18576    18416     18416     22544     22544       30680      34832




More information about the llvm-dev mailing list