[PATCH] D110466: [llvm-profgen][CSSPGO] On-demand function size computation for preinliner
Wenlei He via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Sep 27 17:12:22 PDT 2021
wenlei accepted this revision.
wenlei added a comment.
lgtm, thanks.
================
Comment at: llvm/tools/llvm-profgen/ProfiledBinary.cpp:550-551
+ uint64_t EndOffset) {
+ uint32_t Index = getIndexForOffset(StartOffset);
+ if (CodeAddrs[Index] != StartOffset)
+ WithColor::warning() << "Invalid start instruction at "
----------------
wlei wrote:
> wenlei wrote:
> > wlei wrote:
> > > wenlei wrote:
> > > > wlei wrote:
> > > > > wenlei wrote:
> > > > > > Is this the same as `addressIsCode`? or perhaps we need `addrOffsetIsCode`?
> > > > > The input of `addressIsCode` is the `address`. I guess the confusing part is `CodeAddrs` so I changed to `codeAddrOffsets`.
> > > > What I meant is we could have something like this:
> > > > ```
> > > > bool addrOffsetIsCode(uint64_t Offset) const {
> > > > return Offset2LocStackMap.find(Offset) != Offset2LocStackMap.end();
> > > > }
> > > > ```
> > > >
> > > > which is similar to addressIsCode, except that we use offset now.
> > > >
> > > >
> > > > ```
> > > > bool addressIsCode(uint64_t Address) const {
> > > > uint64_t Offset = virtualAddrToOffset(Address);
> > > > return Offset2LocStackMap.find(Offset) != Offset2LocStackMap.end();
> > > > }
> > > > ```
> > > >
> > > > I think it's cleaner to use API instead of dealing with internals like `codeAddrOffsets` directly when possible. Also `Offset2LocStackMap` is unordered_map while `codeAddrOffsets ` is sorted vector. So look up with `Offset2LocStackMap` is faster.
> > > I see. Here I used `uint32_t Index = getIndexForOffset(StartOffset);` because I need to use the `Index` to iterate all the invalid instructions which are stored in `codeAddrOffsets`. `Offset2LocStackMap ` doesn't give the `Index`.
> > > ```
> > > Index = getIndexForOffset(StartOffset);
> > > while(codeAddrOffsets[Index] <= End) {
> > > // use codeAddrOffsets[Index] to do something.
> > > Index++;
> > > }
> > > ```
> > > To avoid using `codeAddrOffsets` directly, previously we use `IP`, but IP only working on `Address` and this introduces many redundancy. i. e. we need to change `offset` --> `address` and then change `address` back --> `offset`.
> > >
> > Ok, I missed the use of that Index later.. However why do we care about invalid instructions? I assume invalid instructions won't be in the middle of a function, then for those from padding at the end, should we ignore them when computing size?
> Yeah, we actually only care about valid instructions here.
> ```
> uint32_t Index = getIndexForOffset(StartOffset);
> uint64_t Offset = CodeAddrOffsets[Index];
> ```
> If `StartOffset` is an invalid offset(give a warning), it will round to next valid `Index`. for other instructions, they're all got from the `CodeAddrOffsets` by increasing the index, so they're also valid.
I see. Though with current callers, StartOffset should always be valid offset, right? The check and warning is more about making the function robust for other use cases.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D110466/new/
https://reviews.llvm.org/D110466
More information about the llvm-commits
mailing list