<div dir="ltr"><br><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Jan 19, 2016 at 9:40 AM, Jonas Wagner via llvm-dev <span dir="ltr"><<a href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_quote"><div dir="ltr"><div><p style="margin:1.2em 0px!important">Hi,</p>
<p style="margin:1.2em 0px!important">I’m thinking about using LLVM to implement a limited form of self-modifying code. Before diving into that, I’d like to get some feedback from you all.</p>
<p style="margin:1.2em 0px!important"><strong>The goal:</strong> I’d like to add “optional” code to a program that I can enable at runtime and that has zero (i.e., as close to zero as I can get) overhead when not enabled.</p>
<p style="margin:1.2em 0px!important"><strong>Existing solutions:</strong> Currently, I can guard optional code using a branch, something like <code style="font-size:0.85em;font-family:Consolas,Inconsolata,Courier,monospace;margin:0px 0.15em;padding:0px 0.3em;white-space:pre-wrap;border:1px solid rgb(234,234,234);border-top-left-radius:3px;border-top-right-radius:3px;border-bottom-right-radius:3px;border-bottom-left-radius:3px;display:inline;background-color:rgb(248,248,248)">br i1 %cond, label %optional, label %skip, !prof !0</code>. Branch weights ensure that the branch is predicted correctly. The overhead of this is not as low as I’d like,</p></div></div></div></div></blockquote><div><br></div><div>How low would you like it? What use case is suffering for performance due to the branch?</div><div><br></div><div>Self-modifying code for truly zero-overhead (when not enabled) instrumentation is a real thing (look at e.g. DTrace pid provider) but unless the number of instrumentation point is very large (100's of thousands? millions?) or not known beforehand (both are true for DTrace), the cost of a branch will be negligible.</div><div><br></div><div>AFAIK, the cost of a well-predicted, not-taken branch is the same as a nop on every x86 made in the last many years. See <a href="http://www.agner.org/optimize/instruction_tables.pdf">http://www.agner.org/optimize/instruction_tables.pdf</a></div><div>Generally speaking a correctly-predicted not-taken branch is basically identical to a nop, and a correctly-predicted taken branch is has an extra overhead similar to an "add" or other extremely cheap operation. More concerning is that the condition that is branched on is probably some flag in memory somewhere and will require a memory operation to check it (but of course on a good OoO w/ speculative execution this doesn't hold up anything but the retire queue).</div><div><br></div><div>-- Sean Silva</div><div><br></div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex"><div dir="ltr"><div class="gmail_quote"><div dir="ltr"><div><p style="margin:1.2em 0px!important"> though, because the branch is still present in the code and because computing <code style="font-size:0.85em;font-family:Consolas,Inconsolata,Courier,monospace;margin:0px 0.15em;padding:0px 0.3em;white-space:pre-wrap;border:1px solid rgb(234,234,234);border-top-left-radius:3px;border-top-right-radius:3px;border-bottom-right-radius:3px;border-bottom-left-radius:3px;display:inline;background-color:rgb(248,248,248)">%cond</code> also has some cost.</p>
<p style="margin:1.2em 0px!important"><strong>The idea:</strong> I’d like to have a branch that is the same as the example above, but that gets translated into a <code style="font-size:0.85em;font-family:Consolas,Inconsolata,Courier,monospace;margin:0px 0.15em;padding:0px 0.3em;white-space:pre-wrap;border:1px solid rgb(234,234,234);border-top-left-radius:3px;border-top-right-radius:3px;border-bottom-right-radius:3px;border-bottom-left-radius:3px;display:inline;background-color:rgb(248,248,248)">nop</code> instruction. Preferably some unique <code style="font-size:0.85em;font-family:Consolas,Inconsolata,Courier,monospace;margin:0px 0.15em;padding:0px 0.3em;white-space:pre-wrap;border:1px solid rgb(234,234,234);border-top-left-radius:3px;border-top-right-radius:3px;border-bottom-right-radius:3px;border-bottom-left-radius:3px;display:inline;background-color:rgb(248,248,248)">nop</code> that I can easily recognize in the binary, and that has the same size as an unconditional branch instruction. Then, I could use a framework such as DynInst to replace that <code style="font-size:0.85em;font-family:Consolas,Inconsolata,Courier,monospace;margin:0px 0.15em;padding:0px 0.3em;white-space:pre-wrap;border:1px solid rgb(234,234,234);border-top-left-radius:3px;border-top-right-radius:3px;border-bottom-right-radius:3px;border-bottom-left-radius:3px;display:inline;background-color:rgb(248,248,248)">nop</code> with an unconditional branch instruction at run-time.</p>
<p style="margin:1.2em 0px!important">My questions to the community would be:</p>
<ul style="margin:1.2em 0px;padding-left:2em">
<li style="margin:0.5em 0px">Does the idea make sense, or am I missing a much simpler approach?</li>
<li style="margin:0.5em 0px">What would be the easiest way to obtain the desired binary? Adding a new TerminatorInstruction sounds daunting, is there something simpler?</li>
</ul>
<p style="margin:1.2em 0px!important">I also wonder whether I could even expects speedups from this? Are <code style="font-size:0.85em;font-family:Consolas,Inconsolata,Courier,monospace;margin:0px 0.15em;padding:0px 0.3em;white-space:pre-wrap;border:1px solid rgb(234,234,234);border-top-left-radius:3px;border-top-right-radius:3px;border-bottom-right-radius:3px;border-bottom-left-radius:3px;display:inline;background-color:rgb(248,248,248)">nop</code> instructions actually cheaper than branches? Would modifying the binary at run-time play well enough with caches etc.? These are probably not questions for the LLVM mailing list, but if anybody has good answers they are welcome.</p>
<p style="margin:1.2em 0px!important">Looking forward to hearing your thoughts,<br>Jonas</p>
<div title="MDH:SGksPGRpdj48YnI+PC9kaXY+PGRpdj5JJ20gdGhpbmtpbmcgYWJvdXQgdXNpbmcgTExWTSB0byBp
bXBsZW1lbnQgYSBsaW1pdGVkIGZvcm0gb2Ygc2VsZi1tb2RpZnlpbmcgY29kZS4gQmVmb3JlIGRp
dmluZyBpbnRvIHRoYXQsIEknZCBsaWtlIHRvIGdldCBzb21lIGZlZWRiYWNrIGZyb20geW91IGFs
bC48L2Rpdj48ZGl2Pjxicj48L2Rpdj48ZGl2PioqVGhlIGdvYWw6KiogSSdkIGxpa2UgdG8gYWRk
ICJvcHRpb25hbCIgY29kZSB0byBhIHByb2dyYW0gdGhhdCBJIGNhbiBlbmFibGUgYXQgcnVudGlt
ZSBhbmQgdGhhdCBoYXMgemVybyAoaS5lLiwgYXMgY2xvc2UgdG8gemVybyBhcyBJIGNhbiBnZXQp
IG92ZXJoZWFkIHdoZW4gbm90IGVuYWJsZWQuPC9kaXY+PGRpdj48YnI+PC9kaXY+PGRpdj4qKkV4
aXN0aW5nIHNvbHV0aW9uczoqKiBDdXJyZW50bHksIEkgY2FuIGd1YXJkIG9wdGlvbmFsIGNvZGUg
dXNpbmcgYSBicmFuY2gsIHNvbWV0aGluZyBsaWtlIGBiciBpMSAlY29uZCwgbGFiZWwgJW9wdGlv
bmFsLCBsYWJlbCAlc2tpcCwgIXByb2YgITBgLiBCcmFuY2ggd2VpZ2h0cyBlbnN1cmUgdGhhdCB0
aGUgYnJhbmNoIGlzIHByZWRpY3RlZCBjb3JyZWN0bHkuIFRoZSBvdmVyaGVhZCBvZiB0aGlzIGlz
IG5vdCBhcyBsb3cgYXMgSSdkIGxpa2UsIHRob3VnaCwgYmVjYXVzZSB0aGUgYnJhbmNoIGlzIHN0
aWxsIHByZXNlbnQgaW4gdGhlIGNvZGUgYW5kIGJlY2F1c2UgY29tcHV0aW5nIGAlY29uZGAgYWxz
byBoYXMgc29tZSBjb3N0LjwvZGl2PjxkaXY+PGJyPjwvZGl2PjxkaXY+KipUaGUgaWRlYToqKiBJ
J2QgbGlrZSB0byBoYXZlIGEgYnJhbmNoIHRoYXQgaXMgdGhlIHNhbWUgYXMgdGhlIGV4YW1wbGUg
YWJvdmUsIGJ1dCB0aGF0IGdldHMgdHJhbnNsYXRlZCBpbnRvIGEgYG5vcGAgaW5zdHJ1Y3Rpb24u
IFByZWZlcmFibHkgc29tZSB1bmlxdWUgYG5vcGAgdGhhdCBJIGNhbiBlYXNpbHkgcmVjb2duaXpl
IGluIHRoZSBiaW5hcnksIGFuZCB0aGF0IGhhcyB0aGUgc2FtZSBzaXplIGFzIGFuIHVuY29uZGl0
aW9uYWwgYnJhbmNoIGluc3RydWN0aW9uLiBUaGVuLCBJIGNvdWxkIHVzZSBhIGZyYW1ld29yayBz
dWNoIGFzIER5bkluc3QgdG8gcmVwbGFjZSB0aGF0IGBub3BgIHdpdGggYW4gdW5jb25kaXRpb25h
bCBicmFuY2ggaW5zdHJ1Y3Rpb24gYXQgcnVuLXRpbWUuPC9kaXY+PGRpdj48YnI+PC9kaXY+PGRp
dj5NeSBxdWVzdGlvbnMgdG8gdGhlIGNvbW11bml0eSB3b3VsZCBiZTo8L2Rpdj48ZGl2Pjxicj48
L2Rpdj48ZGl2Pi0gRG9lcyB0aGUgaWRlYSBtYWtlIHNlbnNlLCBvciBhbSBJIG1pc3NpbmcgYSBt
dWNoIHNpbXBsZXIgYXBwcm9hY2g/PC9kaXY+PGRpdj4tIFdoYXQgd291bGQgYmUgdGhlIGVhc2ll
c3Qgd2F5IHRvIG9idGFpbiB0aGUgZGVzaXJlZCBiaW5hcnk/IEFkZGluZyBhIG5ldyBUZXJtaW5h
dG9ySW5zdHJ1Y3Rpb24gc291bmRzIGRhdW50aW5nLCBpcyB0aGVyZSBzb21ldGhpbmcgc2ltcGxl
cj88L2Rpdj48ZGl2Pjxicj48L2Rpdj48ZGl2PkkgYWxzbyB3b25kZXIgd2hldGhlciBJIGNvdWxk
IGV2ZW4gZXhwZWN0cyBzcGVlZHVwcyBmcm9tIHRoaXM/IEFyZSBgbm9wYCBpbnN0cnVjdGlvbnMg
YWN0dWFsbHkgY2hlYXBlciB0aGFuIGJyYW5jaGVzPyBXb3VsZCBtb2RpZnlpbmcgdGhlIGJpbmFy
eSBhdCBydW4tdGltZSBwbGF5IHdlbGwgZW5vdWdoIHdpdGggY2FjaGVzIGV0Yy4/IFRoZXNlIGFy
ZSBwcm9iYWJseSBub3QgcXVlc3Rpb25zIGZvciB0aGUgTExWTSBtYWlsaW5nIGxpc3QsIGJ1dCBp
ZiBhbnlib2R5IGhhcyBnb29kIGFuc3dlcnMgdGhleSBhcmUgd2VsY29tZS48YnI+PC9kaXY+PGRp
dj48YnI+PC9kaXY+PGRpdj5Mb29raW5nIGZvcndhcmQgdG8gaGVhcmluZyB5b3VyIHRob3VnaHRz
LDwvZGl2PjxkaXY+Sm9uYXM8L2Rpdj4=" style="min-height:0px;width:0px;max-height:0px;max-width:0px;overflow:hidden;font-size:0em;padding:0px;margin:0px"></div></div></div></div></div>
<br>_______________________________________________<br>
LLVM Developers mailing list<br>
<a href="mailto:llvm-dev@lists.llvm.org">llvm-dev@lists.llvm.org</a><br>
<a href="http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev" rel="noreferrer" target="_blank">http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev</a><br>
<br></blockquote></div><br></div></div>