<div dir="ltr"><br><div class="gmail_extra"><br><div class="gmail_quote">On Sun, Dec 4, 2016 at 2:50 PM, Rui Ueyama <span dir="ltr"><<a href="mailto:ruiu@google.com" target="_blank">ruiu@google.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">Lazy symbols are created only for defined symbols in object files in static archives. LLD doesn't know the complete picture of symbols in that sense. When you pull out a file from an archive, new symbols appear, and you need to resolve them again. Isn't this a sequential process?</div></blockquote><div><br></div><div>Oh yeah, you're right. The problem of determining which objects are pulled from which archives is still a transitive closure type algorithm, even with LLD's current semantics.</div><div>I can think of a couple things:</div><div>1. use a parallel/distributed transitive closure algorithm (probably fairly complicated)</div><div>2. the result of the transitive closure computation (which objects are pulled from the archives) is going to be highly cacheable across runs, so the simple MapReduce approach will still work well most of the time with a simple run-to-run cache. This caching problem is actually very similar to the ThinLTO problem of which files are needed in order for cross-module importing to happen.</div><div><br></div><div><br></div><div>Anyway, this is all just brainstorming.</div><div><br></div><div><span style="font-size:12.8px">-- Sean Silva</span></div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div class="gmail-HOEnZb"><div class="gmail-h5"><div class="gmail_extra"><br><div class="gmail_quote">On Sun, Dec 4, 2016 at 2:03 PM, Sean Silva <span dir="ltr"><<a href="mailto:chisophugis@gmail.com" target="_blank">chisophugis@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><br><div class="gmail_extra"><br><div class="gmail_quote"><div><div class="gmail-m_6645927580703254646h5">On Sun, Dec 4, 2016 at 7:09 AM, Rui Ueyama <span dir="ltr"><<a href="mailto:ruiu@google.com" target="_blank">ruiu@google.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote"><div><div class="gmail-m_6645927580703254646m_-5317772366625192215h5">On Sun, Dec 4, 2016 at 1:55 AM, Sean Silva <span dir="ltr"><<a href="mailto:chisophugis@gmail.com" target="_blank">chisophugis@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><br><div class="gmail_extra"><br><div class="gmail_quote"><div><div class="gmail-m_6645927580703254646m_-5317772366625192215m_-5966883869221760077h5">On Sat, Dec 3, 2016 at 3:35 PM, Rui Ueyama via llvm-commits <span dir="ltr"><<a href="mailto:llvm-commits@lists.llvm.org" target="_blank">llvm-commits@lists.llvm.org</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Author: ruiu<br>

Date: Sat Dec  3 17:35:22 2016<br>

New Revision: 288606<br>

<br>

URL: <a href="http://llvm.org/viewvc/llvm-project?rev=288606&view=rev" rel="noreferrer" target="_blank">http://llvm.org/viewvc/llvm-pr<wbr>oject?rev=288606&view=rev</a><br>

Log:<br>

Add comments about the use of threads in LLD.<br>

<br>

Modified:<br>

    lld/trunk/ELF/Threads.h<br>

<br>

Modified: lld/trunk/ELF/Threads.h<br>

URL: <a href="http://llvm.org/viewvc/llvm-project/lld/trunk/ELF/Threads.h?rev=288606&r1=288605&r2=288606&view=diff" rel="noreferrer" target="_blank">http://llvm.org/viewvc/llvm-pr<wbr>oject/lld/trunk/ELF/Threads.h?<wbr>rev=288606&r1=288605&r2=288606<wbr>&view=diff</a><br>

==============================<wbr>==============================<wbr>==================<br>

--- lld/trunk/ELF/Threads.h (original)<br>

+++ lld/trunk/ELF/Threads.h Sat Dec  3 17:35:22 2016<br>

@@ -6,6 +6,54 @@<br>

 // License. See LICENSE.TXT for details.<br>

 //<br>

 //===------------------------<wbr>------------------------------<wbr>----------------===//<br>

+//<br>

+// LLD supports threads to distribute workloads to multiple cores. Using<br>

+// multicore is most effective when more than one core are idle. At the<br>

+// last step of a build, it is often the case that a linker is the only<br>

+// active process on a computer. So, we are naturally interested in using<br>

+// threads wisely to reduce latency to deliver results to users.<br>

+//<br>

+// That said, we don't want to do "too clever" things using threads.<br>

+// Complex multi-threaded algorithms are sometimes extremely hard to<br>

+// justify the correctness and can easily mess up the entire design.<br>

+//<br>

+// Fortunately, when a linker links large programs (when the link time is<br>

+// most critical), it spends most of the time to work on massive number of<br>

+// small pieces of data of the same kind. Here are examples:<br>

+//<br>

+//  - We have hundreds of thousands of input sections that need to be<br>

+//    copied to a result file at the last step of link. Once we fix a file<br>

+//    layout, each section can be copied to its destination and its<br>

+//    relocations can be applied independently.<br>

+//<br>

+//  - We have tens of millions of small strings when constructing a<br>

+//    mergeable string section.<br>

+//<br>

+// For the cases such as the former, we can just use parallel_for_each<br>

+// instead of std::for_each (or a plain for loop). Because tasks are<br>

+// completely independent from each other, we can run them in parallel<br>

+// without any coordination between them. That's very easy to understand<br>

+// and justify.<br>

+//<br>

+// For the cases such as the latter, we can use parallel algorithms to<br>

+// deal with massive data. We have to write code for a tailored algorithm<br>

+// for each problem, but the complexity of multi-threading is isolated in<br>

+// a single pass and doesn't affect the linker's overall design.<br>

+//<br>

+// The above approach seems to be working fairly well. As an example, when<br>

+// linking Chromium (output size 1.6 GB), using 4 cores reduces latency to<br>

+// 75% compared to single core (from 12.66 seconds to 9.55 seconds) on my<br>

+// machine. Using 40 cores reduces it to 63% (from 12.66 seconds to 7.95<br>

+// seconds). Because of the Amdahl's law, the speedup is not linear, but<br>

+// as you add more cores, it gets faster.<br>

+//<br>

+// On a final note, if you are trying to optimize, keep the axiom "don't<br>

+// guess, measure!" in mind. Some important passes of the linker are not<br>

+// that slow. For example, resolving all symbols is not a very heavy pass,<br>

+// although it would be very hard to parallelize it. You want to first<br>

+// identify a slow pass and then optimize it.<br></blockquote><div><br></div></div></div><div>Actually, LLD's symbol resolution (the approach with Lazy symbols for archives) is a perfect example of a MapReduce type problem, so it is actually quite parallelizable.</div><div>You basically have a huge number of (SymbolName,SymbolValue) pairs and you want to coalesce all values with the same SymbolName into pairs (SymbolName, [SymbolValue1,SymbolValue2,...<wbr>]) which you can then process all the SymbolValueN's to see which is the real definition. This is precisely the problem that MapReduce solves.</div></div></div></div></blockquote><div><br></div></div></div><div>How do you handle static archives?</div></div></div></div></blockquote><div><br></div></div></div><div>LLD's archive semantics insert lazy symbols for all the archive members, so it isn't a problem.</div><span class="gmail-m_6645927580703254646HOEnZb"><font color="#888888"><div><br></div><div>-- Sean Silva</div></font></span><span><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote"><span><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div class="gmail_extra"><div class="gmail_quote"><div><br></div><div>(note: I don't necessarily mean that it needs to be done in a distributed fashion, just that the core problem is really one of coalescing values with the same keys.</div><div>)</div><div><br></div><div>MapReduce's core abstraction is also a good tool for deduplicating strings.</div><div><br></div><div><br></div><div>Richard Smith and I were actually brainstorming at the latest llvm social a distributed linker may be a good fit for the linking problem at Google (but it was just brainstorming; obviously that would be a huge effort and we would need very serious justification before embarking on that effort).<span class="gmail-m_6645927580703254646m_-5317772366625192215m_-5966883869221760077HOEnZb"><font color="#888888"><br></font></span></div><span class="gmail-m_6645927580703254646m_-5317772366625192215m_-5966883869221760077HOEnZb"><font color="#888888"><div><br></div><div>-- Sean Silva</div></font></span><span><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">

+//<br>

+//===------------------------<wbr>------------------------------<wbr>----------------===//<br>

<br>

 #ifndef LLD_ELF_THREADS_H<br>

 #define LLD_ELF_THREADS_H<br>

<br>

<br>

______________________________<wbr>_________________<br>

llvm-commits mailing list<br>

<a href="mailto:llvm-commits@lists.llvm.org" target="_blank">llvm-commits@lists.llvm.org</a><br>

<a href="http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-commits" rel="noreferrer" target="_blank">http://lists.llvm.org/cgi-bin/<wbr>mailman/listinfo/llvm-commits</a><br>

</blockquote></span></div><br></div></div>

</blockquote></span></div><br></div></div>

</blockquote></span></div><br></div></div>

</blockquote></div><br></div>

</div></div></blockquote></div><br></div></div>