[clang] [clang-tools-extra] [compiler-rt] [PGO] Exposing PGO's Counter Reset and File Dumping APIs (PR #76471)
Teresa Johnson via cfe-commits
cfe-commits at lists.llvm.org
Wed Jan 3 07:16:19 PST 2024
teresajohnson wrote:
> I noticed one problem while testing IRPGO (thanks again for the suggestion @minglotus-6 !).
>
> A function's control flow may change between `-fprofile-generate` and `-fprofile-use` when we make use of definitions in the new header. For example, one may have the following code:
>
> ```c
> #include "profile/instr_prof_interface.h"
>
> int main(void) {
>   ...
>   if (__llvm_profile_dump())
>     return error;
>
>   cleanup();
> }
> ```
>
> During `-fprofile-generate`, `__llvm_profile_dump` is a declared name and main's control flow includes a branch to `return error;`. During `-fprofile-use`, `__llvm_profile_dump()` is replaced by `(0)`, so the frontend eliminates the `if` statement and the branch to `return error;`. Such a control flow change can lead to PGO warnings (hash mismatch).
>
> I think it may be OK to keep the PR this way because the new macros can potentially cause control flow changes directly as well. The documentation is updated (https://github.com/llvm/llvm-project/pull/76471/files#diff-7389be311daf0b9b476c876bef04245fa3c0ad9337ce865682174bd77d53b648R2908) to advise against using these APIs in a program's hot regions and warn about possible impact on control flow.
>
> Do you all think this is reasonable?
That's probably OK, since it doesn't make sense to dump profiles etc. in a hot region of code anyway. An alternative would be to make these real functions that simply return 0, rather than macros #defined to 0. But that may not avoid the issue described above, because early inlining will likely inline the call and simplify the code before IR PGO matching anyway.
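
To make the two alternatives concrete, here is a minimal sketch of what the non-instrumented build would see in each case. All names in it (`sketch_dump_macro`, `sketch_dump_fn`, `finish_with_macro`, `finish_with_fn`) are hypothetical stand-ins invented for illustration; they are not the declarations in `profile/instr_prof_interface.h`.

```c
/* Hypothetical stand-ins for illustration only; NOT the real
 * compiler-rt interface. */

/* Variant A: the call site is textually replaced by the constant 0. */
#define sketch_dump_macro() (0)

/* Variant B: a real function that always returns 0. */
static inline int sketch_dump_fn(void) { return 0; }

/* With variant A the condition is the literal (0), so the branch can be
 * dropped as soon as the source is parsed. */
int finish_with_macro(void) {
  if (sketch_dump_macro())
    return 1; /* error path, unreachable in this build */
  return 0;
}

/* With variant B the branch is still present in the source; it only
 * disappears once the call is inlined and the result folded to 0. */
int finish_with_fn(void) {
  if (sketch_dump_fn())
    return 1; /* error path, removable only after inlining */
  return 0;
}
```

Both functions return 0 at run time; the only difference is at which stage the error branch can be removed. If inlining runs before profile matching, as suggested above, variant B would likely end up with the same simplified control flow as variant A.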
https://github.com/llvm/llvm-project/pull/76471