<br><br><div class="gmail_quote">On Thu, Nov 24, 2011 at 7:45 PM, Vikram Adve <span dir="ltr"><<a href="mailto:vadve@illinois.edu" target="_blank">vadve@illinois.edu</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Daniel, Kostya,<br>
<br>
We had a meeting with the Clang people (Chris, Doug, Ted) on Thursday before the llvmdev meeting about adding dynamic checking tools into Clang -- IOC for undefined integer behaviors, and SAFECode for memory safety. SAFECode has similar goals to AddressSanitizer, though at least for now it has more checks, </blockquote>
<div><br></div><div>[off topic, could you please educate me on the kinds of bugs that SAFECode finds and asan does not? Maybe in a separate thread. ]</div><div><br></div><div><br></div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
but is slower.<br>
<br>
The main conclusion was that we should have a common run-time API for various tools to report errors. This API can be shared by tools like IOC, SAFECode, and (we hope) Address Sanitizer. The benefits are that users have to learn only one error reporting style; and the code base can use a common set of mechanisms to track and report execution information, so that improvements by any one project will benefit the other projects as well.<br>
<br>
We're happy to keep the API simple for now, and to wrap the SAFECode and IOC run-times to use it. Let us know if you'd like to help define and use this API.<br></blockquote><div><br></div><div>I would like to have some level of sharing between all these tools (add ThreadSanitizer to the mix).</div>
<div>But as I just discussed with John Regehr in a separate thread, the tools (asan and IOC) are quite different and sharing is limited. </div><div><br></div><div>Let me describe what asan (AddressSanitizer) and tsan (ThreadSanitizer) need and also my understanding of IOC's needs. Then we'll see how much sharing is possible between these and SAFECode. </div>
<div><br></div><div>asan: </div><div> -- unwind stack on malloc/free (should be fast, libunwind is too slow, currently uses custom unwinder based on frame pointers)</div><div> -- symbolize stack. Currently uses external script and add2line, which sucks. Maybe could reuse some of lldb code.</div>
<div> -- intercept functions (malloc, pthread, etc)</div><div> -- small set of utility functions (e.g. printf, mmap, strlen) that does not use libc. </div>
<div> -- the functions that are called on error take strictly one parameter and never return (for speed and code size). Another option is to report an error by causing SIGILL using a call-free instruction sequence (5% faster). </div>
<div> -- dies on first error. </div><div> -- reports to stderr. Someone may need to customize it, but we did not have such requests yet. </div><div> -- report style: see example at <a href="http://code.google.com/p/address-sanitizer/wiki/AddressSanitizer#Introduction" target="_blank">http://code.google.com/p/address-sanitizer/wiki/AddressSanitizer#Introduction</a> . The format of the reports is similar to that of tsan, which in turn is similar to that of valgrind. I am reluctant to change the format because quite a few existing users of asan and tsan (and memcheck) rely on it.</div>
<div> -- no suppressions</div><div> -- supports blacklist file (and we also may need something like __attribute__(no-address-sanitizer) </div><div>-- major user-visible controls: unwind depth, redzone size (passed via env. var ASAN_OPTIONS)</div>
<div><br></div><div> tsan: </div><div> -- keep shadow call stack (unwinding is too slow)</div><div> -- symbolize stack (same problem as with asan). gcc-based variant uses libbfd, but I don't like that code. valgrind-based variant uses valgrind's symbolizer. (ditto for PIN-based) </div>
<div> -- intercept functions (malloc, pthread, etc) </div><div> -- small set of utility functions (e.g. printf, mmap, strlen) that does not use libc. </div><div> -- suppressions (syntax is similar to valgrind's)</div>
<div> -- ignore file (aka blacklist of functions/files) </div><div> -- reports to stderr, may change to another stream. </div><div> -- report style: see above. Examples are here: <a href="http://code.google.com/p/data-race-test/wiki/UnderstandingThreadSanitizerReports#The_report" target="_blank">http://code.google.com/p/data-race-test/wiki/UnderstandingThreadSanitizerReports#The_report</a></div>
<div><div> -- major user-visible controls: stack depth, sensitivity options, resource usage options. compiler-based variant gets the options via an env. var. </div></div><div><br></div><div>IOC: </div><div> - does not need unwinding (does not show the stack) </div>
<div> - does not need symbolization (adds all info at compile time to the error reporting call. questionable from performance point of view)</div>
<div> - may need suppression, removal of duplicates, etc</div><div> - (John, what else is important here?) </div><div><br></div><div><br></div><div>Currently, I can see that SAFECode can share these items with asan/tsan: </div>
<div>- unwinding </div><div>- symbolizing </div><div>- function interception mechanism </div><div>- option parsing </div><div>- maybe style of error reports</div><div>- maybe some low-level code (e.g. asan's tiny libc replacement) </div>
<div><br></div><div>--kcc </div><div><br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<br>
--Vikram<br>
Professor, Computer Science<br>
University of Illinois at Urbana-Champaign<br>
<a href="http://llvm.org/~vadve" target="_blank">http://llvm.org/~vadve</a><br>
<br>
<br>
<br>
<br>
<br>
On Nov 24, 2011, at 12:00 PM, <<a href="mailto:llvmdev-request@cs.uiuc.edu" target="_blank">llvmdev-request@cs.uiuc.edu</a>><br>
wrote:<br>
<br>
> Date: Thu, 24 Nov 2011 10:27:35 -0500<br>
> From: Daniel Dunbar <<a href="mailto:daniel@zuster.org" target="_blank">daniel@zuster.org</a>><br>
> Subject: Re: [LLVMdev] AddressSanitizer run-time in<br>
> tools/clang/runtime/compiler-rt<br>
> To: Kostya Serebryany <<a href="mailto:kcc@google.com" target="_blank">kcc@google.com</a>><br>
> Cc: <a href="mailto:llvmdev@cs.uiuc.edu" target="_blank">llvmdev@cs.uiuc.edu</a><br>
> Message-ID:<br>
> <<a href="mailto:CAEU8z697Ey0_mMdLt8swJVUR%2Bx0K9Rvays8NW7a3f1R3RbxQKg@mail.gmail.com" target="_blank">CAEU8z697Ey0_mMdLt8swJVUR+x0K9Rvays8NW7a3f1R3RbxQKg@mail.gmail.com</a>><br>
> Content-Type: text/plain; charset=ISO-8859-1<br>
><br>
> Quick answers, I'm on txgiving break this week and not doing any real<br>
> work, but I will be doing more compiler-rt work when I get back<br>
> (initially focused at getting profile libs to come from compiler-rt on<br>
> Linux et al).<br>
><br>
> On Wed, Nov 16, 2011 at 9:24 PM, Kostya Serebryany <<a href="mailto:kcc@google.com" target="_blank">kcc@google.com</a>> wrote:<br>
>> Hi Daniel,<br>
>> Chris suggested to talk to you about committing the AddressSanitizer (asan)<br>
>> run-time into the llvm tree (llvm-project/compiler-rt).<br>
>> Questions:<br>
>> - What is the preferred name for the directory? (asan? libasan?<br>
>> address_sanitizer? AdressSanitizer?)<br>
><br>
> I don't care. lib/asan seems perfectly reasonable to me, with a README<br>
> explaining what the module is for.<br>
><br>
>> - Should the asan run-time use cmake, or just make, or what? The build is a<br>
>> bit tricky, especially for tests. We currently use make.<br>
><br>
> The only build system I care about for compiler-rt is the one in make.<br>
> It's quite a crazy set up, although there is a reason for it to be as<br>
> complicated as it is. I can help (and/or) do the asan integration into<br>
> the compiler-rt make build.<br>
><br>
>> - How would you suggest to do the code review?<br>
>> ? ?The code of the run-time is ~5<br>
>> KLOC.?<a href="http://code.google.com/p/address-sanitizer/source/browse/#svn%2Ftrunk%2Fasan" target="_blank">http://code.google.com/p/address-sanitizer/source/browse/#svn%2Ftrunk%2Fasan</a><br>
>> ? ?The Apple-specific part may make some Apple experts cry (or maybe not).<br>
>> ? ?The code uses google's coding style, which is similar, but not equivalent<br>
>> to the LLVM's one. We check it using cpplint before commits.<br>
><br>
> I personally would like to see it be in LLVM style, but the rest of<br>
> compiler-rt isn't really that way, so I don't think this is a blocker.<br>
><br>
> Unless anyone objects, I think we should just bring the code in as is<br>
> so that ASAN works, and if people want to do more thorough review that<br>
> can happen post commit.<br>
><br>
>> ? ?LLVM license is used.<br>
>> ? ?The tests are ~2.5 KLOC; most of the tests require the clang compiler<br>
>> with -faddress-sanitizer switch.<br>
>> ? ?If possible, I'd prefer not to review the patch in a usual way (too big),<br>
>> but instead have the comments to the files<br>
>> in?<a href="http://code.google.com/p/address-sanitizer/source/browse/#svn%2Ftrunk%2Fasan" target="_blank">http://code.google.com/p/address-sanitizer/source/browse/#svn%2Ftrunk%2Fasan</a><br>
>> ? ?Or, alternatively, commit everything as is and then iterate over<br>
>> individual files.<br>
>> - The run-time library uses a couple of third_party files (BSD license) to<br>
>> parse process mapping.<br>
>> ? W/o these files asan will work, but will not produce symbolizable traces.<br>
>> ? Can we include these files as as in a separate directory? (They may be<br>
>> useful for other tools as well)<br>
><br>
> Let's try to include the bits with extra licenses in subdirectories of<br>
> lib/asan. If someone wants to reuse them later we can find a different<br>
> way to factor things, I don't see a reason to worry about it now.<br>
><br>
>> ??<a href="http://code.google.com/p/address-sanitizer/source/browse/trunk/asan/sysinfo.cc" target="_blank">http://code.google.com/p/address-sanitizer/source/browse/trunk/asan/sysinfo.cc</a><br>
>> - Yet another piece of third-party code (mit license) is used for<br>
>> Apple-specific work (function overriding). Same question as above apply.<br>
>><br>
>> ?<a href="http://code.google.com/p/address-sanitizer/source/browse/trunk/asan/mach_override.c" target="_blank">http://code.google.com/p/address-sanitizer/source/browse/trunk/asan/mach_override.c</a><br>
>> - test-specific: can I rely on gtest being installed? (fresh version is<br>
>> required).<br>
><br>
> Compiler-rt doesn't currently have a very good test set up. You'll<br>
> probably have to find a way to shoehorn this in for your own testing<br>
> initially, but we can try and work out a way to integrate it more<br>
> properly...<br>
><br>
> - Daniel<br>
><br>
>> Thanks,<br>
>> --kcc<br>
>><br>
>><br>
><br>
<br>
<br>
_______________________________________________<br>
LLVM Developers mailing list<br>
<a href="mailto:LLVMdev@cs.uiuc.edu" target="_blank">LLVMdev@cs.uiuc.edu</a> <a href="http://llvm.cs.uiuc.edu" target="_blank">http://llvm.cs.uiuc.edu</a><br>
<a href="http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev" target="_blank">http://lists.cs.uiuc.edu/mailman/listinfo/llvmdev</a><br>
</blockquote></div><br>