<html><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><br class=""><div><br class=""><blockquote type="cite" class=""><div class="">On Mar 31, 2020, at 3:31 PM, Petr Hosek via cfe-dev &lt;<a href="mailto:cfe-dev@lists.llvm.org" class="">cfe-dev@lists.llvm.org</a>&gt; wrote:</div><br class="Apple-interchange-newline"><div class=""><div dir="ltr" class="">We'd like to implement support for selectively enabling profile instrumentation only for certain files/functions.<div class=""><br class=""></div><div class="">The motivation is collecting code coverage information in presubmit testing, where we want to avoid instrumenting the entire build, which would introduce a lot of overhead. Rather, we only want to instrument files/functions that were modified by the patch by passing this information to Clang.<br class=""></div><div class=""><br class=""></div><div class="">Clang already has -fprofile-filter-files= and -fprofile-exclude-files=, but those flags are currently only supported by GCOV. Furthermore, they only work at the granularity of entire files. I'm also not a fan of using flags, which increase the size of the command line as you keep adding more files.</div><div class=""><br class=""></div><div class="">The solution I'm considering for source-code coverage is to use the special case list (see <a href="https://github.com/llvm/llvm-project/blob/master/llvm/include/llvm/Support/SpecialCaseList.h" class="">https://github.com/llvm/llvm-project/blob/master/llvm/include/llvm/Support/SpecialCaseList.h</a>), which is already used by sanitizers and XRay and seems like a great fit for this case. Concretely, we would add a new flag, e.g. -fprofile-filter=path/to/special/case/list. 
The file would have the following format:</div><div class=""><br class=""></div><div class="">[include]</div><div class="">src:/path/to/source/*</div><div class="">fun:MyFooBar</div><div class="">[exclude]</div><div class="">src:/path/to/source/file.c<br class=""></div><div class=""><br class=""></div><div class="">This is similar to <a href="https://clang.llvm.org/docs/SanitizerSpecialCaseList.html" class="">https://clang.llvm.org/docs/SanitizerSpecialCaseList.html</a> and <a href="https://llvm.org/docs/XRay.html#special-case-file" class="">https://llvm.org/docs/XRay.html#special-case-file</a>.</div><div class=""><br class=""></div><div class="">Does this sound reasonable?</div></div></div></blockquote><div><br class=""></div><div>This sounds like a reasonable plan to me.</div><div><br class=""></div><blockquote type="cite" class=""><div class=""><div dir="ltr" class=""><div class="">A related question is whether this option should apply to both AST and IR-based instrumentation and where to perform the filtering.</div></div></div></blockquote><div><br class=""></div><div>Is there a specific use case for filtering files when applying IR-based instrumentation, or would this just be for the sake of completeness?</div><br class=""><blockquote type="cite" class=""><div class=""><div dir="ltr" class=""><div class=""> We could do the filtering either when inserting the instrumentation instructions, i.e. in PGOInstrumentation.cpp and CodeGenPGO.cpp, or when lowering these instructions in InstrProfiling.cpp (where we would just discard these instructions). 
The advantage of the latter approach is that we only need to implement the filtering once and it'd support both AST and IR-based instrumentation, but it also means that we would unnecessarily insert extra instructions only to drop them later.</div><div class=""><br class=""></div><div class="">Does anyone have any preference or suggestion?</div></div></div></blockquote><div><br class=""></div><div>It is probably cleaner to avoid emitting instrumentation for filtered-out files as early as possible. If the filtering is implemented late, then, e.g. with source-based coverage, unnecessary coverage mappings may be embedded in the binary. With the recent format change this might not necessarily pose a scaling problem, but it doesn’t seem ideal.</div><div><br class=""></div><div>For IR PGO instrumentation, one alternative is to add an attribute to functions that should be instrumented (cf. the “sanitize_address” attribute used by ASan). Then, the filtering can be done very early. Later, the instrumentation pass would skip functions without the attribute. I’m not sure I recall correctly, but in an old review I think it was suggested that this is a prerequisite for removing the Optional&lt;PGOOptions&gt; from PassBuilder.</div><div><br class=""></div><div>best,</div><div>vedant</div><div><br class=""></div><br class=""><blockquote type="cite" class=""><div class="">
</div></blockquote></div><br class=""></body></html>