[llvm-dev] gnu X sysv hash performance
Brian Cain via llvm-dev
llvm-dev at lists.llvm.org
Fri Dec 1 14:07:03 PST 2017
On Fri, Dec 1, 2017 at 3:55 PM, Rui Ueyama via llvm-dev <
llvm-dev at lists.llvm.org> wrote:
> On Fri, Dec 1, 2017 at 1:26 PM, Rafael Avila de Espindola <
> rafael.espindola at gmail.com> wrote:
>
>>
>> I got curious how the lld produced gnu hash tables compared to gold. To
>> test that I timed "perf record ninja check-llvm" (just the lit run) in a
>> BUILD_SHARED_LIBS build.
>>
>> The performance was almost identical, so I decided to try sysv versus
>> gnu (both produced by lld). The results are interesting:
>>
>> % grep -v '^#' perf-gnu/perf.report-by-dso-sym | head
>> 38.77% ld-2.24.so [.] do_lookup_x
>> 8.08% ld-2.24.so [.] strcmp
>> 2.66% ld-2.24.so [.]
>> _dl_relocate_object
>> 2.58% ld-2.24.so [.]
>> _dl_lookup_symbol_x
>> 1.85% ld-2.24.so [.] _dl_name_match_p
>> 1.46% [kernel.kallsyms] [k] copy_page
>> 1.38% ld-2.24.so [.] _dl_map_object
>> 1.30% [kernel.kallsyms] [k] unmap_page_range
>> 1.28% [kernel.kallsyms] [k]
>> filemap_map_pages
>> 1.26% libLLVMSupport.so.6.0.0svn [.] sstep
>> % grep -v '^#' perf-sysv/perf.report-by-dso-sym | head
>> 42.18% ld-2.24.so [.] do_lookup_x
>> 17.73% ld-2.24.so [.] check_match
>> 14.41% ld-2.24.so [.] strcmp
>> 1.22% ld-2.24.so [.]
>> _dl_relocate_object
>> 1.13% ld-2.24.so [.]
>> _dl_lookup_symbol_x
>> 0.91% ld-2.24.so [.] _dl_name_match_p
>> 0.67% ld-2.24.so [.] _dl_map_object
>> 0.65% [kernel.kallsyms] [k] unmap_page_range
>> 0.63% [kernel.kallsyms] [k] copy_page
>> 0.59% libLLVMSupport.so.6.0.0svn [.] sstep
>>
>> So the gnu hash table helps a lot, but BUILD_SHARED_LIBS is still crazy
>> inefficient.
>
>
> What is "100%" in these numbers? If 100% means all execution time,
> ld-2.24.so takes more than 70% of execution time. Is this real?
>
>
>
perf usually measures cycles ("CPU_CLK_UNHALTED" for core/xeon, e.g.). So
it's not time but cycles. This is a critical distinction when the thing
being measured has delays/synchronization/disk/network I/O.
Also it looks like this report might be decomposed by some other attribute
(DSO-at-a-time?) that would affect what "100%" means.
Doing perf on "ninja check-llvm" seems like it would measure cycles
contributed by lots of non-lld things, in fact it's worth ruling out
whether it's dominated by non-lld things. Doesn't testing itself perhaps
spend more cycles than the linking being done here?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20171201/b499fdd6/attachment.html>
More information about the llvm-dev
mailing list