<div dir="ltr"><div>Thanks for the quick reply.</div><div><br></div>> <span style="font-size:12.8px">Are you sure the actual handling of the breakpoint & callback in lldb is what is taking most of the time?</span><div><span style="font-size:12.8px"><br></span></div><div><span style="font-size:12.8px">I'm not positive. I did collect some callgrind profiles to take a look at where most of the time is being spent, but i'm not very familiar with lldb internals so the results were hard to interpret. I did notice that there was a lot of packet/network business when using lldb to profile a program (which I assumed was communication between my program and lldb-server). I was not sure how this effected the performance, so perhaps this is the real bottleneck.</span></div><div><span style="font-size:12.8px"><br></span></div><div><span style="font-size:12.8px">> </span><span style="font-size:12.8px">Greg just switched to using a unix-domain socket for this communication for platforms that support it.  This speeds up the packet traffic side of things.</span></div><div><span style="font-size:12.8px"><br></span></div><div><span style="font-size:12.8px">In what version of lldb was this introduced? I'm running 3.7.1. I'm also on ubuntu 14.04, is that a supported platform?</span></div><div><span style="font-size:12.8px"><br></span></div><div><span style="font-size:12.8px">> </span><span style="font-size:12.8px">One of the original motivations of having lldb-server be based on lldb classes - as opposed to the MacOS X version of debugserver which is an independent construct - was that you could re-use the server code to create an in-process Process plugin, eliminating a lot of this traffic & context switching when you needed maximum speed.</span></div><div><span style="font-size:12.8px"><br></span></div><div><span style="font-size:12.8px">That sounds very interesting. Is there an example of this implementation you could point me to?</span></div><div><span style="font-size:12.8px"><br></span></div><div><span style="font-size:12.8px"><br></span></div><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Aug 16, 2016 at 10:20 AM, Jim Ingham <span dir="ltr"><<a href="mailto:jingham@apple.com" target="_blank">jingham@apple.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Are you sure the actual handling of the breakpoint & callback in lldb is what is taking most of the time?  The last time we looked at this, the majority of the work was in communicating with debugserver to get the stop notification and restart.  Note, besides all the packet code, this involves context switches from process->lldbserver->lldb and back, which is also pretty expensive.<br>

<br>

Greg just switched to using a unix-domain socket for this communication for platforms that support it.  This speeds up the packet traffic side of things.<br>

<br>

One of the original motivations of having lldb-server be based on lldb classes - as opposed to the MacOS X version of debugserver which is an independent construct - was that you could re-use the server code to create an in-process Process plugin, eliminating a lot of this traffic & context switching when you needed maximum speed.  The original Mac OS X lldb port actually had a process plugin wholly in-process with lldb as well as the debugserver based one, but there wasn't enough motivation to justify maintaining the two different implementations of the same code.  I don't know whether the Linux port takes advantage of this possibility, however.  That would be something to look into, however.<br>

<br>

Once we actually figure out about the stop, figuring out the breakpoint and getting to its callback is pretty simple...  I doubt making "lighter weight breakpoints" in particular will recover the performance you need, though if your sampling turns up some inefficient algorithms have crept in, it would be great to fix that.<br>

<br>

Another option we've toyed with on and off is something like the gdb "tracepoints" were you can upload instructions to perform "experiments" when a breakpoint is hit to the lldb-server instance.  The work to perform the experiment and the results would all be kept in the lldb-server instance till a real breakpoint is hit, at which point lldb can download all the results and present them to the user.  This would eliminate some of the context-switches and packet traffic while you were running in the hot parts of your code.  This is a decent chunk of work, however.<br>

<br>

Jim<br>

<div><div class="h5"><br>

<br>

> On Aug 16, 2016, at 9:57 AM, Benjamin Dicken via lldb-dev <<a href="mailto:lldb-dev@lists.llvm.org">lldb-dev@lists.llvm.org</a>> wrote:<br>

><br>

> I recently started using lldb to write a basic instrumentation tool for tracking the values of variables at various code-points in a program. I've been working with lldb for less than two weeks, so I am pretty new. Though, I have used and written llvm passes in the past, so I'm familiar with the clang/llvm/lldb ecosystem.<br>

><br>

> I have a very early prototype of the tool up and running, using the C++ API. The user can specify either an executable to run or an already-running PID to attach to. The user also supplies a file+line_number at which a breakpoint (with a callback) is placed. For testing/prototyping purposes, the breakpoint callback just increments a counter and then immediately returns false. Eventually, more interesting things will happen in this callback.<br>

><br>

> I've noticed that just the action of hitting a breakpoint and invoking the callback is very expensive. I did some instruction-count collection by running this lldb tool on a simple test program, and placing the breakpoint+callback at different points in the program, causing it to get triggered different amounts of times. I used `perf stat -e instructions ...` to gather instruction exec counts for each run. After doing a little math, it appears that I'm incurring 1.0 - 1.1 million instruction execs per breakpoint.<br>

><br>

> This amount of slowdown is prohibitively expensive for my needs, because I want to place callbacks in hot portions of the "inferior" program.<br>

><br>

> Is there a way to make this faster? Is it possible to create "lighter-weight" breakpoints? I really like the lldb API (though the documentation is lacking in some places), but if this performance hit can't be mitigated, it may be unusable for me.<br>

><br>

> For reference, this is the callback function:<br>

><br>

> ```<br>

> static int cb_count = 0;<br>

> bool SimpleCallback (<br>

>     void *baton,<br>

>     lldb::SBProcess &process,<br>

>     lldb::SBThread &thread,<br>

>     lldb::SBBreakpointLocation &location) {<br>

>   //TODO: Eventually do more interesting things...<br>

>   cb_count++;<br>

>   return false;<br>

> }<br>

> ```<br>

><br>

> And here is how I set it up to be called back:<br>

><br>

> ```<br>

> lldb::SBBreakpoint bp1 = debugger_data->target.<wbr>BreakpointCreateByLocation(<wbr>file_name, line_no);<br>

> if (!bp1.IsValid()) std::cerr << "invalid breakpoint";<br>

> bp1.SetCallback(<wbr>SimpleCallback, 0);<br>

> ```<br>

><br>

> -Benjamin<br>

</div></div>> ______________________________<wbr>_________________<br>

> lldb-dev mailing list<br>

> <a href="mailto:lldb-dev@lists.llvm.org">lldb-dev@lists.llvm.org</a><br>

> <a href="http://lists.llvm.org/cgi-bin/mailman/listinfo/lldb-dev" rel="noreferrer" target="_blank">http://lists.llvm.org/cgi-bin/<wbr>mailman/listinfo/lldb-dev</a><br>

<br>

</blockquote></div><br><br clear="all"><div><br></div>-- <br><div class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr">Ben</div></div>

</div></div>