[PATCH] D86332: [SampleFDO] Enhance profile remapping support for inline instance
Wenlei He via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sun Aug 23 23:49:34 PDT 2020
wenlei added inline comments.
================
Comment at: llvm/lib/ProfileData/SampleProf.cpp:218-222
+ // with equivalent CalleeName using remapping rules. To avoid mixing
+ // profile inline instances at one callsite with inline instances at
+ // other callsites, the CalleeRemapper has to be cleared everytime
+ // before it is used, however the remapping table containing the
+ // remapping rules in the CalleeRemapper will be kept in clearing.
----------------
wmi wrote:
> wenlei wrote:
> > I guess I didn't fully understand the clearing part - if a function (inlinee) is renamed, why do we not want to apply the remapping globally? E.g. for A->B, and C->B inlined profile, if B is renamed to D, we would want to match D to B for both A->D profile and C->D path, right? Then a global remapper including all names should suffice.
> >
> > Could you share a concrete example as to why each function needs its own temp remapper for all its inlinees?
> >
> Right, given two names B and D, ideally we want the remapper to tell us whether they are identical by returning true or false for remapper.equivalent(B, D). However, remapper doesn't work like this. To check whether B and D are identical, remapper needs to insert B into a foldingset first by parsing it, then check whether D has already existed in it by parsing D and trying to insert D (but not truely insert it).
>
> Similarly, if we want to check whether an equivalent name exists in a nameset, remapper need to insert alll the names in the nameset before it can answer the query. So to answer the query whether the callee's name matches an inline instance profile at a specific callsite, it needs to clear names in the remapper inserted at other callsites before it inserts the callee's name at the current callsite into remapper.
>
> Existing remapper doesn't have to be cleared throughout the sample profile loading pass because it insert all the names of outline instance in the sample profile, and the question it needs to answer in the whole pass is whether a function name has a match in the outline instances. So because of the way how remapper works, we need different remappers to answer different types of remapping queries.
Maybe I'm still missing something, but the part that I don't understand is why would we need to have boundary for nameset and the queries? For mangle names, they all live in a single conceptual "namespace", regardless of whether it's name for inlinee or outlined function.
For inlinee's name, it's possible that we've inserted their names in global remapper when initializing with all outlined functions' names. If not, we could also insert it into global remapper right before checking for alternative names? To me the only difference for inlinee's name is that, we might discover those names a bit later as we may not see them when initializing the global remapper with outlined function names, but that doesn't seem to warrant creating boundaries for nameset (between outline vs inlinee, inlinees for one functions vs inlinees for another).
Repository:
rL LLVM
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D86332/new/
https://reviews.llvm.org/D86332
More information about the llvm-commits
mailing list