[PATCH] D111750: [llvm-profgen] Allow unsymbolized profile as perf input
Wenlei He via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Oct 25 17:55:29 PDT 2021
wenlei added inline comments.
================
Comment at: llvm/tools/llvm-profgen/PerfReader.cpp:797
+ }
+ Key->genHashCode();
+ auto Ret =
----------------
wlei wrote:
> wenlei wrote:
> > wlei wrote:
> > > wenlei wrote:
> > > > wlei wrote:
> > > > > wenlei wrote:
> > > > > > Would it better if hash code is lazily generated instead of requiring an explicit call?
> > > > > >
> > > > > > ```
> > > > > > getHashCode() {
> > > > > > if (HashCode == 0)
> > > > > > genHashCode()
> > > > > > ...
> > > > > > }
> > > > > > ```
> > > > > Here to explicitly call `genHashCode` is intentional, the reason is we want to avoid making `genHashCode` a virtual function, i, e, avoid call `genHashCode `after casting to base class. so we separate like:
> > > > > ```
> > > > > derived class: HashCode = genHashCode();
> > > > >
> > > > > base class : getHashCode{return HashCode;}
> > > > > ```
> > > > >
> > > > >
> > > > Not sure if I understand. Why do we want to avoid call genHashCode after casting to base class?
> > > Currently we have two type of key, `StrKey` and `ProbeKey` and they derived from the base `ContextKey`.
> > >
> > > We have a hash map to store them, like unordered_map<ContextKey, ...>, the logic of insert is like
> > > ```
> > > StrKey* key = ..
> > >
> > > hashmap[key] = ...;
> > > ```
> > > In the hashmap, `StrKey*/ProbeKey*` implicitly be cast to a ContextKey* class then call `ContextKey->getHashCode` , so `getHashCode` should be a virtual function which has overhead because `StrKey` and `ProbeKey` have different genHashCode.
> > >
> > > So to avoid this, we can explicitly call StrKey->genHashCode() before being casting to base and store it into a variable `HashCode`
> > >
> > > then base's getHashCode just read the HasCode, no need a virtual function.
> > >
> > >
> > >
> > >
> > >
> > >
> > > so getHashCode should be a virtual function which has overhead because StrKey and ProbeKey have different genHashCode.
> >
> > > So to avoid this
> >
> > Why do we want to avoid this? performance reason? It looks to me that having getHashCode as virtual function is natural and clean.
> >
> > This pattern perhaps is not related to this patch though.
> Yeah, my initial intention is performance reason. I can try it in a separate patch.
Yes, a separate patch is fine. But I feel we might be over optimizing here unless the perf delta is visible.
================
Comment at: llvm/tools/llvm-profgen/PerfReader.h:659
+*/
+class UnsymbolizedProfileReader : public PerfReaderBase {
+public:
----------------
wlei wrote:
> hoy wrote:
> > wlei wrote:
> > > hoy wrote:
> > > > wenlei wrote:
> > > > > hoy wrote:
> > > > > > What is the main reason of making this type hierarchy? It looks like `UnsymbolizedProfileReader` doesn't need most of the interfaces `PerfReaderBase` provides. Conceptually it sounds to me that the two classes function independently, and if we'd like code sharing, `UnsymbolizedProfileReader` can be made the base class of `PerfReaderBase` or make a new base class that simply reads in something and outputs an symbolized profile?
> > > > > > UnsymbolizedProfileReader can be made the base class of PerfReaderBase
> > > > >
> > > > > I don't think this is a good idea. Conceptually, PerfReader is not a special kind of UnsymbolizedProfileReader.
> > > > Right, they are independent of each other. PerfReader really deals with perf input. They only share in the raw output writting. A new base class makes more sense?
> > > It seems to me the hierarchy should be like
> > > ```
> > > PerfReaderBase. ---> PerfScriptReader. ---> LBRPerfReader
> > > ---> HybridPerfReader
> > >
> > > ---> UnsymbolizedProfileReader
> > > ```
> > > This will make the hierarchy deep though.
> > >
> > That's the current hierarchy, IIUC? I was wondering if `UnsymbolizedProfileReader` should inherit from a base class that's not `PerfReaderBase`. If you look at the implementation of `PerfReaderBase`, there isn't much `UnsymbolizedProfileReader` shares. The writing of unsymbolized profile is probably not needed for `UnsymbolizedProfileReader`.
> Yeah, It's mostly similar to your suggestion. I meant just rename current `PerfReaderBase` to `PerfScriptReader`. then we can use `PerfReaderBase` as the new base.
The hierarchy in the latest version looks good to me.
However, I'm wondering if we could move LBRPerfReader::generateRawProfile into PerfScriptReader? That way we could have HybridPerfReader and LBRPerfReader both inherit from PerfScriptReader directly. The LBRPerfReader-> HybridPerfReader hierarchy seems a bit weird, but I understand it was for reusing LBRPerfReader::generateRawProfile. Don't bother if this can cause bigger change.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D111750/new/
https://reviews.llvm.org/D111750
More information about the llvm-commits
mailing list