[cfe-dev] Support for selectively enabling profile instrumentation only for certain files/functions

Vedant Kumar via cfe-dev cfe-dev at lists.llvm.org
Tue Mar 31 17:01:14 PDT 2020



> On Mar 31, 2020, at 3:31 PM, Petr Hosek via cfe-dev <cfe-dev at lists.llvm.org> wrote:
> 
> We'd like to implement support for selectively enabling profile instrumentation only for certain files/functions.
> 
> The motivation is collecting code coverage information in presubmit testing, where we want to avoid instrumenting the entire build which would introduce a lot of overhead. Rather, we only want to instrument files/functions that were modified by the patch by passing this information to Clang.
> 
> Clang already has -fprofile-filter-files= and -fprofile-exclude-files=, but those flags are currently only supported by GCOV. Furthermore, they only work at the granularity of entire files. I'm also not a fan of using flags which increases the size of the command line as you keep adding more files.
> 
> The solution I'm considering for source-code coverage is use the special case list (see https://github.com/llvm/llvm-project/blob/master/llvm/include/llvm/Support/SpecialCaseList.h <https://github.com/llvm/llvm-project/blob/master/llvm/include/llvm/Support/SpecialCaseList.h>) which is already used by sanitizers and XRay and seems like a great fit for this case. Concretely, we would add a new flag, e.g. -fprofile-flter=path/to/special/case/list. The file would have the following format:
> 
> [include]
> src:src:/path/to/source/*
> fun:MyFooBar
> [exclude]
> src:src:/path/to/source/file.c
> 
> This is similar to https://clang.llvm.org/docs/SanitizerSpecialCaseList.html <https://clang.llvm.org/docs/SanitizerSpecialCaseList.html> and https://llvm.org/docs/XRay.html#special-case-file <https://llvm.org/docs/XRay.html#special-case-file>.
> 
> Does this sound reasonable?

This sounds like a reasonable plan to me.

> Related question is whether this option should apply to both AST and IR based instrumentation and where to perform the filtering.

Is there a specific use case for filtering files when applying IR based instrumentation, or would this just be for the sake of completeness?

> We could do the filtering either when inserting the instrumentation instructions, i.e. in PGOInstrumentation.cpp and CodeGenPGO.cpp, or when lowering these instructions InstrProfiling.cpp (where we would just discard these instructions). The advantage of the latter approach is that we only need to implement the filtering once and it'd support both AST and IR based instrumentation, but it also means that we would unnecessarily insert extra instructions only to drop them later.
> 
> Does anyone have any preference or suggestion?

It is probably cleaner to avoid emitting instrumentation for filtered-out files as early as possible. If the filtering is implemented late, then e.g. with source-based coverage unnecessary coverage mappings may be embedded in the binary. With the recent format change this might not necessarily pose a scaling problem, but it doesn’t seem ideal.

For IR PGO instrumentation, one alternative is to add an attribute to functions that should be instrumented (c.f. the “sanitize_address” attribute used by ASan). Then, the filtering can be done very early. Later, the instrumentation pass would skip functions without the attribute. I’m not sure I recall correctly, but in an old review I think it was suggested that this is a prerequisite for removing the Optional<PGOOptions> from PassBuilder.

best,
vedant


> _______________________________________________
> cfe-dev mailing list
> cfe-dev at lists.llvm.org
> https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/cfe-dev/attachments/20200331/a83cf176/attachment.html>


More information about the cfe-dev mailing list