[llvm-dev] LLD: time to enable --threads by default
Rui Ueyama via llvm-dev
llvm-dev at lists.llvm.org
Wed Nov 16 16:13:58 PST 2016
I should've said that do you know if there's an optimized SHA1
implementation that we can use?
On Wed, Nov 16, 2016 at 4:11 PM, Mehdi Amini <mehdi.amini at apple.com> wrote:
> The current implementation was “copy/pasted” from somewhere (it was
> explicitly public domain).
>
> On Nov 16, 2016, at 4:05 PM, Rui Ueyama <ruiu at google.com> wrote:
>
> Can we just copy-and-paste optimized code from somewhere?
>
> On Wed, Nov 16, 2016 at 4:03 PM, Mehdi Amini <mehdi.amini at apple.com>
> wrote:
>
>> SHA1 in LLVM is *very* naive, any improvement is welcome there!
>> It think Amaury pointed it originally and he had an alternative
>> implementation IIRC.
>>
>> —
>> Mehdi
>>
>> On Nov 16, 2016, at 3:58 PM, Rui Ueyama via llvm-dev <
>> llvm-dev at lists.llvm.org> wrote:
>>
>> By the way, while running benchmark, I found that our SHA1 function seems
>> much slower than the one in gold. gold slowed down by only 1.3 seconds to
>> compute a SHA1 of output, but we spent 6.0 seconds to do the same thing (I
>> believe). Something doesn't seem right.
>>
>> Here is a table to link the same binary with -no-threads and
>> -build-id={none,md5,sha1}. The numbers are in seconds.
>>
>> LLD gold
>> none 7.82 13.78
>> MD5 9.68 14.56
>> SHA1 13.85 15.05
>>
>>
>> On Wed, Nov 16, 2016 at 1:46 PM, Rafael Espíndola <
>> rafael.espindola at gmail.com> wrote:
>>
>>> On 16 November 2016 at 15:52, Rafael Espíndola
>>> <rafael.espindola at gmail.com> wrote:
>>> > I will do a quick benchmark run.
>>>
>>>
>>> On a mac pro (running linux) the results I got with all cores available:
>>>
>>> firefox
>>> master 7.146418217
>>> patch 5.304271767 1.34729488437x faster
>>> firefox-gc
>>> master 7.316743822
>>> patch 5.46436812 1.33899174824x faster
>>> chromium
>>> master 4.265597914
>>> patch 3.972218527 1.07385781648x faster
>>> chromium fast
>>> master 1.823614026
>>> patch 1.686059427 1.08158348205x faster
>>> the gold plugin
>>> master 0.340167513
>>> patch 0.318601465 1.06768973269x faster
>>> clang
>>> master 0.579914119
>>> patch 0.520784947 1.11353855817x faster
>>> llvm-as
>>> master 0.03323043
>>> patch 0.041571719 1.251013574x slower
>>> the gold plugin fsds
>>> master 0.36675887
>>> patch 0.350970944 1.04498356992x faster
>>> clang fsds
>>> master 0.656180056
>>> patch 0.591607603 1.10914743602x faster
>>> llvm-as fsds
>>> master 0.030324313
>>> patch 0.040045353 1.32056917497x slower
>>> scylla
>>> master 3.23378908
>>> patch 2.019191831 1.60152642773x faster
>>>
>>> With only 2 cores:
>>>
>>> firefox
>>> master 7.174839911
>>> patch 6.319808477 1.13529388384x faster
>>> firefox-gc
>>> master 7.345525844
>>> patch 6.493005841 1.13129820362x faster
>>> chromium
>>> master 4.180752414
>>> patch 4.129515199 1.01240756179x faster
>>> chromium fast
>>> master 1.847296843
>>> patch 1.78837299 1.0329483018x faster
>>> the gold plugin
>>> master 0.341725451
>>> patch 0.339943222 1.0052427255x faster
>>> clang
>>> master 0.581901114
>>> patch 0.566932481 1.02640284955x faster
>>> llvm-as
>>> master 0.03381059
>>> patch 0.036671392 1.08461260215x slower
>>> the gold plugin fsds
>>> master 0.369184003
>>> patch 0.368774353 1.00111084189x faster
>>> clang fsds
>>> master 0.660120583
>>> patch 0.641040511 1.02976422187x faster
>>> llvm-as fsds
>>> master 0.031074029
>>> patch 0.035421531 1.13990789543x slower
>>> scylla
>>> master 3.243011681
>>> patch 2.630991522 1.23261958615x faster
>>>
>>>
>>> With only 1 core:
>>>
>>> firefox
>>> master 7.174323116
>>> patch 7.301968002 1.01779190649x slower
>>> firefox-gc
>>> master 7.339104117
>>> patch 7.466171668 1.01731376868x slower
>>> chromium
>>> master 4.176958448
>>> patch 4.188387233 1.00273615003x slower
>>> chromium fast
>>> master 1.848922713
>>> patch 1.858714219 1.00529578978x slower
>>> the gold plugin
>>> master 0.342383846
>>> patch 0.347106743 1.01379415838x slower
>>> clang
>>> master 0.582476955
>>> patch 0.600524655 1.03098440178x slower
>>> llvm-as
>>> master 0.033248459
>>> patch 0.035622988 1.07141771593x slower
>>> the gold plugin fsds
>>> master 0.369510236
>>> patch 0.376390506 1.01861997133x slower
>>> clang fsds
>>> master 0.661267753
>>> patch 0.683417482 1.03349585535x slower
>>> llvm-as fsds
>>> master 0.030574688
>>> patch 0.033052779 1.08105041006x slower
>>> scylla
>>> master 3.236604638
>>> patch 3.325831407 1.02756801617x slower
>>>
>>> Given that we have an improvement even with just two cores available,
>>> LGTM.
>>>
>>> Cheers,
>>> Rafael
>>>
>>
>> _______________________________________________
>> LLVM Developers mailing list
>> llvm-dev at lists.llvm.org
>> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>>
>>
>>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20161116/b3385bbf/attachment.html>
More information about the llvm-dev
mailing list