<div dir="ltr"><div dir="ltr">On Sun, Nov 7, 2021 at 6:14 AM David Blaikie <<a href="mailto:dblaikie@gmail.com">dblaikie@gmail.com</a>> wrote:<br></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div dir="ltr">On Sat, Nov 6, 2021 at 8:50 AM Sam McCall <<a href="mailto:sammccall@google.com" target="_blank">sammccall@google.com</a>> wrote:<br></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="auto"><div><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Sat, 6 Nov 2021, 03:36 David Blaikie via cfe-dev, <<a href="mailto:cfe-dev@lists.llvm.org" target="_blank">cfe-dev@lists.llvm.org</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">Yeah, FWIW I'd +1 Andrew's comments here - it was sort of one major premise of clang being designed as a reusable library, that C++ is just too complicated to reimplement separately/repeatedly in various tools.</div></blockquote></div></div><div dir="auto">Yes. This is a good argument for reusable implementations, but I'm not sure one is enough.</div><div dir="auto">c.f. clang-format not using clang beyond the lexer, and the success attributable to that.</div><div dir="auto">Ideally we'd share an impl there, in practice its maturity as a product and concrete design choices in its parser combine to make that hard.</div></div></blockquote><div><br>Given the long time scale of these things - any chance of a plan to converge clang-format and this new thing eventually? (so we have 2 rather than 3 versions of C++ understanding in the LLVM project)<br> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="auto"><div dir="auto"></div><div dir="auto"><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">For something's that's going to change significant code - how slow is a clang-based solution? What's the tradeoff being made?</div></blockquote></div></div><div dir="auto">Basically it's the difference between interactive latency and not.</div><div dir="auto">For our internal clangd deployment (because those are the numbers I have) 90%ile is most of a minute to parse headers, and several minutes in the build system to get ready (generated headers, flags...).</div></div></blockquote><div><br></div><div>How much of this work is equivalent/shared/cached by the build system? (eg: if I just did a build, then I wanted to refactor a function - how long are we talking there?)</div></div></div></blockquote><div>The build system stuff is cacheable[1], so once you've done that, a tool might take 30 seconds (per file) each time you run it.</div><div><br></div><div>For single-file operations (think go-to-definition), this is enough to avoid the tool. See the (lack of) popularity of clang-rename :-).</div><div>This can be mitigated with PCH/preamble as in clangd, which still takes 30 seconds to prepare, but now you can perform subsequent operations quickly. This startup delay is the #1 user complaint about clangd. (We have several significant optimizations here that trade off accuracy, and still).</div><div>The PCH is typically hundreds of megabytes per source file, so caching it silently/indefinitely makes people unhappy - ask me how I know! In clangd we retain it while the user has the file open, which works OK for a stateful program.<br></div><div><br></div><div>For codebase-wide operations (find-refs) the parsing time easily gets into hours.</div><div>You can mitigate this by building an index, but *that* takes hours and it's a significant barrier.</div><div><br></div><div><div>Bottom line: users want tools that are predictably fast (<100ms, including the first run).</div></div><div><br></div><div>[1] In practice, because build system caches are mutable user-controlled state, cache sharing isn't transparent. Either tools don't share cache with the 'real' build, or the user is *required* to do a real build to get accurate results - I've seen both</div><div><br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div class="gmail_quote"><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="auto"><div dir="auto">Secondarily, it's the difference between just using the tool and having to "set it up". We do a lot of user support for clangd and I can tell you this is a nontrivial concern. (For people who build with something that's not recent mainline clang/gcc, target weird platforms, don't build on the machine they edit on, use non-cmake build systems, ...)</div></div></blockquote><div><br>The second one I have less concern for, I'll admit. <br></div></div></div>

</blockquote></div></div>