<div dir="ltr"><div class="gmail_default" style="font-family:monospace;font-size:small;color:#000000">Just a quick note -- IRPGO profile is not deterministic with multi-threaded programs due to contentions (there is of course atomic update mode, but it can be slow). Asynchronous dumping is another reason that the profile is not guaranteed to be repeatable.</div><div class="gmail_default" style="font-family:monospace;font-size:small;color:#000000"><br></div><div class="gmail_default" style="font-family:monospace;font-size:small;color:#000000">David</div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Thu, Oct 7, 2021 at 9:18 AM Wenlei He <<a href="mailto:wenlei@fb.com">wenlei@fb.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div lang="EN-US" style="overflow-wrap: break-word;">
<div class="gmail-m_9192219421710674796WordSection1">
<p class="MsoNormal">Thanks for sharing the progress and details on the binary format. Overall this looks like a clean design that fits current PGO profile format with extensions.<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Some high level comments:<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<ul style="margin-top:0in" type="disc">
<li class="gmail-m_9192219421710674796MsoListParagraph" style="margin-left:0in">Does memprof/PGHO work together with today's IRPGO today, i.e. can we have one instrumented build to collect both PGO and PGHO profile, or we will need separate PGO instrumentation
builds for each, in which case CSPGO + PGHO would need three iterations of training and build, which would be significant operational cost..<u></u><u></u></li><li class="gmail-m_9192219421710674796MsoListParagraph" style="margin-left:0in">I think some of the problems memprof faced when dealing with storing calling context and mapping context to IR is very similar to CSSPGO. I'm wondering if it makes sense to promote
some existing infrastructure to be more general beyond just serving <a href="https://lists.llvm.org/pipermail/llvm-dev/2020-August/144101.html" target="_blank">CSSPGO</a>. One example is the IR mapping you mentioned (quoted below). In CSSPGO, we have the exact same need, and it's handled by `SampleContextTracker` which queries a context trie using an instruction/DILocation.
<u></u><u></u></li></ul>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal"><i> > Because the MIB corresponding to the A->B context is associated with function B in the profile, we do not find it by looking at function A’s profile when we see function A’s malloc call during matching. To address this we
need to keep a correspondence from debug locations to the associated profile information.<u></u><u></u></i></p>
<p class="MsoNormal"><u></u> <u></u></p>
<ul style="margin-top:0in" type="disc">
<li class="gmail-m_9192219421710674796MsoListParagraph" style="margin-left:0in">The serialization of calling context, pruning of calling context are also example of shared problems, and we've put in some effort to have effective solutions (e.g. offline
<a href="https://reviews.llvm.org/D99146" target="_blank">preinliner</a> for most effective pruning, which I think could be adapted to help keep most important allocation context). Perhaps some of the frameworks can be merged, so LLVM has general context aware PGO support
that can be leverage by different kinds of PGO (IRPGO, PGHO, CSSPGO). If you think this is worth pursuing, we’d be happy to help too.<u></u><u></u></li></ul>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">More on the details:<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<ul style="margin-top:0in" type="disc">
<li class="gmail-m_9192219421710674796MsoListParagraph" style="margin-left:0in">I saw that MemInfoBlock contains alloc/dealloc cpuid, does that make memprof profile non-deterministic in the sense that running memprof twice on the exact program and input would
yield bit-wise different memory profile? I think IR PGO profile is deterministic?<u></u><u></u></li></ul>
<p class="MsoNormal"><u></u> <u></u></p>
<ul style="margin-top:0in" type="disc">
<li class="gmail-m_9192219421710674796MsoListParagraph" style="margin-left:0in">Why do we use `file:line:discriminator` instead of `func:line_offset:discriminator `? The later would be more resilient to source change. If function name string is too long, we could
perhaps leverage the MD5 encoding used by sample PGO?<u></u><u></u></li></ul>
<p class="MsoNormal"><u></u> <u></u></p>
<ul style="margin-top:0in" type="disc">
<li class="gmail-m_9192219421710674796MsoListParagraph" style="margin-left:0in">Is the design of mmap section (quoted below) trying to support memprof for multiple binaries in the same process at the same time, or mainly for handling multiple non-consecutive executable
segments for a single binary? <u></u><u></u></li></ul>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal"><i> > The process memory mappings for the executable segment during profiling are stored in this section. This allows symbolization during post processing for binaries which are built with position independent code. For now all
read only, executable mappings are recorded, however in the future, mappings for heap data can also potentially be stored.<u></u><u></u></i></p>
<p class="MsoNormal"><u></u> <u></u></p>
<ul style="margin-top:0in" type="disc">
<li class="gmail-m_9192219421710674796MsoListParagraph" style="margin-left:0in">Do we need each function record to have its own schema, do we expect different functions to use different versions/schemas? The is very flexible, but wondering what’s the use case.
If the schema is for compatibility across versions, perhaps a file level scheme would be enough?<u></u><u></u></li></ul>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal"> <i> > The InstrProfRecord for each function will hold the schema and an array of Memprof info blocks, one for each unique allocation context.<u></u><u></u></i></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<p class="MsoNormal">Thanks,<u></u><u></u></p>
<p class="MsoNormal">Wenlei<u></u><u></u></p>
<p class="MsoNormal"><u></u> <u></u></p>
<div style="border-right:none;border-bottom:none;border-left:none;border-top:1pt solid rgb(181,196,223);padding:3pt 0in 0in">
<p class="MsoNormal" style="margin-bottom:12pt"><b><span style="font-size:12pt;color:black">From:
</span></b><span style="font-size:12pt;color:black">Snehasish Kumar <<a href="mailto:snehasishk@google.com" target="_blank">snehasishk@google.com</a>><br>
<b>Date: </b>Wednesday, September 29, 2021 at 3:17 PM<br>
<b>To: </b>llvm-dev <<a href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a>>, Vedant Kumar <<a href="mailto:vsk@apple.com" target="_blank">vsk@apple.com</a>>, Wenlei He <<a href="mailto:wenlei@fb.com" target="_blank">wenlei@fb.com</a>>, <a href="mailto:andreybokhanko@gmail.com" target="_blank">andreybokhanko@gmail.com</a> <<a href="mailto:andreybokhanko@gmail.com" target="_blank">andreybokhanko@gmail.com</a>>, David Li <<a href="mailto:davidxl@google.com" target="_blank">davidxl@google.com</a>>, Teresa Johnson <<a href="mailto:tejohnson@google.com" target="_blank">tejohnson@google.com</a>><br>
<b>Subject: </b>RFC: A binary serialization format for MemProf<u></u><u></u></span></p>
</div>
<div>
<div>
<p class="MsoNormal"><span style="font-size:10pt;font-family:"Courier New";color:black">This RFC contains the following:</span><span style="font-size:12pt"><u></u><u></u></span></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:10pt;font-family:"Courier New";color:black"><br>
<br>
</span><span style="font-size:12pt"><u></u><u></u></span></p>
</div>
<div>
<div>
<p style="margin:0in"><span style="font-size:7.5pt;font-family:"Courier New";color:black">* Proposal to introduce a new raw binary serialization format for heap allocation profiles</span><span style="font-size:7.5pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:7.5pt;font-family:"Courier New";color:black">* Proposal to extend the
<span class="gmail-m_9192219421710674796gmaildefault">PGO </span>indexed format to hold heap allocation profiles</span><span style="font-size:7.5pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span class="gmail-m_9192219421710674796gmaildefault"><span style="font-size:7.5pt;font-family:"Courier New"">We look forward to your feedback on the proposals.</span></span><u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><span style="font-size:10pt;font-family:"Courier New";color:black">Authors:
<a href="mailto:snehasishk@google.com" target="_blank">snehasishk@google.com</a>, <a href="mailto:davidxl@google.com" target="_blank">davidxl@google.com</a>, <a href="mailto:tejohnson@google.com" target="_blank">tejohnson@google.com</a> </span><u></u><u></u></p>
</div>
<div>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Introduction</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">—-----------</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">The design of a sanitizer-based heap profiler (MemProf) was shared with llvm-dev in Jun 2020 [1]. Since then Teresa (@tejohnson) has added a sanitizer based heap profiler
which can be enabled with -fmemory-profile. Today it writes out the data in a text format which can be inspected by users. We have used this to drive analyses of heap behaviour at Google. This RFC shares details on a binary serialization format for heap profiling
data which can then be reused by the compiler to guide optimizations similar to traditional PGO. </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Similar to the existing instrumentation based PGO, the binary heap profile data for PGHO has two forms. One is the raw data format that is used by the profiler runtime,
and the other is the indexed profile format used by the compiler. The profile data with the indexed profile data format will be generated by llvm-profdata from the raw profile data offline. This allows a single binary profile file to hold the PGO and Memprof
profiling data. Fig 1 below shows the binary format generation and use.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal" style="margin-bottom:12pt"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> ┌──────────────────┐ Raw Profile ┌───────────────┐ Indexed Profile (v8) </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> │ Compiler Runtime ├─────────────► llvm-profdata ├───► with Memprof data ───► -fprofile-use</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> └──────────────────┘ └───────────────┘</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Fig 1: Memprof binary profile lifecycle</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Raw Binary Format</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">—----------------</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">The raw binary format contains 4 sections</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">1. Memprof raw profile header</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">2. Memory mapping layout</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">3. Memory Info Block (MIB) records</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">4. Call stack information</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +----------------------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | Magic |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +----------------------+ H</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | Version | E</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +----------------------+ A</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | Total Size | D </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +----------------------+ E </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+---------| Map Offset | R</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| +----------------------+ </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| +-----| MIB Offset | </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| | +----------------------+ </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| | | Call Stack Offset |---------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| | +----------------------+ |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+-------->| Number of | M |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | | Map Entries | A |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | +----------------------+ P | </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | | | |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | | Map Entry | S |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | +----------------------+ E |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | | ... | C |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | +----------------------+ T |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | | | I |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | | Map Entry | O |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | +----------------------+ N |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | +----------------------+ |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +---->| Number of | M |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | MIB Entries | I |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +----------------------+ B | </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | | |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | MIB Entry | S |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +----------------------+ E |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | ... | C |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +----------------------+ T |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | | I |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | MIB Entry | O |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +----------------------+ N |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +----------------------+ |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | Number of |<--------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | Call Stack Entries | S S</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +----------------------+ T E</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | Call Stack Entry | A C</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +----------------------+ C T</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | ... | K I</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +----------------------+ O</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | Call Stack Entry | N</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +----------------------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Fig 2: Memprof Raw Format</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal" style="margin-bottom:12pt"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">* Memprof raw profile header</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">The header consists of a unique identifier, version number, total size of the profile as well as offset into the profile for each of the following sections - memory mapping
layout, memprof profile entries and call stack information. We do not intend to maintain backwards compatibility for the raw binary format.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">* Memory mapping layout</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">The process memory mappings for the executable segment during profiling are stored in this section. This allows symbolization during post processing for binaries which
are built with position independent code. For now all read only, executable mappings are recorded, however in the future, mappings for heap data can also potentially be stored. For each mapping, we record</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<ul style="margin-top:0in" type="disc">
<li style="color:black;margin-top:0in;margin-bottom:0in;vertical-align:baseline;font-variant-numeric:normal;font-variant-east-asian:normal;white-space:pre-wrap">
<span style="font-size:10pt;font-family:"Courier New"">Start (virtual address)<u></u><u></u></span></li><li style="color:black;margin-top:0in;margin-bottom:0in;vertical-align:baseline;font-variant-numeric:normal;font-variant-east-asian:normal;white-space:pre-wrap">
<span style="font-size:10pt;font-family:"Courier New"">End (virtual address)<u></u><u></u></span></li><li style="color:black;margin-top:0in;margin-bottom:0in;vertical-align:baseline;font-variant-numeric:normal;font-variant-east-asian:normal;white-space:pre-wrap">
<span style="font-size:10pt;font-family:"Courier New"">Offset (from the file it was mmap-ed from)<u></u><u></u></span></li><li style="color:black;margin-top:0in;margin-bottom:0in;vertical-align:baseline;font-variant-numeric:normal;font-variant-east-asian:normal;white-space:pre-wrap">
<span style="font-size:10pt;font-family:"Courier New"">buildId (linker generated hash -Wl,build-id<u></u><u></u></span></li></ul>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">* MIB Records</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">The profile information we collect is currently defined in [2]. The section begins with a 8 byte entry containing the number of profile entries in this section. Each
entry is uniquely identified by the dynamic calling context of the allocation site. For each entry, various metrics such as access counts, allocation sizes, object lifetimes etc are computed via profiling. This section may contain multiple entries identified
by the same callstack id. Subsequent processing to convert and merge multiple raw profiles will deduplicate any such entries.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">* Call stack information<br>
Each memprof profile entry is uniquely identified by its dynamic calling context. In this section we record the identifier and its corresponding call stack trace. We use the sanitizer stack depot provided identifier and serialize the trace for each without
deduplication. The section begins with a 8b entry containing the number of call stack entries. Each call stack entry contains a 8b field which denotes how many contexts are recorded in this entry. Each frame is identified by an 8b program counter address which
holds the call instruction virtual address - 1. Further deduplication is possible though we do not do so at this time.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">* Raw Profile Characteristics</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">To understand the characteristics of raw binary profiles generated by Memprof, we experimented with a clang bootstrap build. We ran 500 invocations of Memprof-instrumented
clang on clang source code. Each invocation produced a raw binary profile and we present some aggregate information about them below:</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+---------------------------------+--------+---------+---------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| | Min | Median | Max |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+---------------------------------+--------+---------+---------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| Unique allocation contexts | 940 | 10661 | 35355 |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+---------------------------------+--------+---------+---------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| MIB records size (bytes) | 101528 | 1151396 | 3818348 |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+---------------------------------+--------+---------+---------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| Call stack section size (bytes) | 31080 | 419048 | 1439144 |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+---------------------------------+--------+---------+---------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| Total Size (bytes) | 133336 | 1571680 | 5258220 |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p><span style="font-size:10pt;font-family:"Courier New";color:black">+---------------------------------+--------+---------+---------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">The size of the header is 48 bytes and the number of read only executable memory maps is usually small (12 in this case) with each map entry consuming 64 bytes. We find
that the raw profiles are highly compressible, on average files are compressed by 81%. The largest raw profile in the dataset is ~5M in size. It is compressed to 975K using zip on default settings. In contrast for the same clang build, the instrumented PGO
raw profile is ~21M in size (zip compressed 73%). Note that the Memprof profile size is proportional to the number of allocation contexts during profiling. </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Since these are profiles from individual invocations, they must be merged before use. This is performed implicitly during the conversion to indexed profile format by
llvm-profdata. MIBs are merged based on their call stack.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Memprof extensions for the indexed profile format</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">—------------------------------------------------</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+----------------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| MAGIC |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+----------------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| VERSION |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+----------------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| HASHTYPE |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+--+----------+--+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">|HASHTAB OFFSET |-------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+--+----------+--+ |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+----------------+ |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| | |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| PROFILE | |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| SUMMARY | |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| DATA | |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+----------------+ |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+----------------+ <----+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| OnDisk |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| Chained |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| HashTable |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+----------------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Fig 3: Existing PGO indexed profile format</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">During the offline processing step (using llvm-profdata), allocation contexts are pruned and merged. The end result is a collection of unique allocation contexts with
access, size and lifetime properties. Contexts are uniquely identified based on the call stack and are stored using a prefix deduplication scheme described in Section “Symbolized Memprof call stack section”.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">To fit into the PGO profile format, we need to index the profile using the function name. The only functions that own Memprof profile data are those direct callers of
the allocator primitive functions. Thus the profile data mapping in the IR must account for potentially missing frames. Implications on matching of the profile data with the IR is touched upon in Section “Profile Data matching in IR” and will be further detailed
in an upcoming RFC. </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">The Memprof profile data for a particular function can then be just an array of MIB entries. One allocation site in the function can have multiple MIB entries each one
of them corresponding to one allocation context.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">The change to the existing PGO indexed format is summarized as:</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<ul style="margin-top:0in" type="disc">
<li style="color:black;margin-top:0in;margin-bottom:0in;vertical-align:baseline;font-variant-numeric:normal;font-variant-east-asian:normal;white-space:pre-wrap">
<span style="font-size:10pt;font-family:"Courier New"">Augment the profile record data structure to include an optional MIB array after the value profile data [3].<u></u><u></u></span></li><li style="color:black;margin-top:0in;margin-bottom:0in;vertical-align:baseline;font-variant-numeric:normal;font-variant-east-asian:normal;white-space:pre-wrap">
<span style="font-size:10pt;font-family:"Courier New"">Add one additional section before the existing OnDiskChainedHashtable to store the allocation stacks (referenced by the MIBs). This section is after the profile summary data section [4].<u></u><u></u></span></li><li style="color:black;margin-top:0in;margin-bottom:0in;vertical-align:baseline;font-variant-numeric:normal;font-variant-east-asian:normal;white-space:pre-wrap">
<span style="font-size:10pt;font-family:"Courier New"">Bump the version number.<u></u><u></u></span></li></ul>
<p class="MsoNormal" style="margin-bottom:12pt"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Memprof portable entry format</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">—----------------------------</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">The raw memprof profile entry format is subject to change with version. However, the indexed profile entry must be backwards compatible to ensure that the PGO profile
as a whole is backwards compatible. We propose a schema based format - per function description of a Memprof profile entry. Each field is identified by a tag. We propose the following schema:</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">struct MIBMeta {</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> enum MIBFieldTag {</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> // The tag ids remain unchanged. </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> Unused = 0, </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> StackID = 1,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> AllocCount = 2,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> AveSize = 3,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> MinSize = 4,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> MaxSize = 5,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> AveAccessCount = 6,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> MinAccessCount = 7,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> MaxAccessCount = 8,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> AveLifetime = 9,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> MinLifetime = 10,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> MaxLifetime = 11,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> NumMigration = 12,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> NumLifetimeOverlaps = 13,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> NumSameAllocCPU = 14,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> NumSameDeallocCPU = 15</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> };</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> enum MIBFieldType {</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> Unused = 0,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> UINT8 = 1,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> UINT16 = 2,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> UINT32 = 3, // Varint encoded</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> UINT64 = 4, // Varint encoded</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> };</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">};</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal" style="margin-bottom:12pt"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">// Mapping of tags to their descriptive names</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">const char *MIBFieldName[] = {</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> "",</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> "StackID", </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> "AllocCount",</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> …</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> "NumSameDeallocCPU"</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">};</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">// Mapping of tags to their types</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">uint8 MIBFieldType [] = {</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> 0, // unused</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> MIBMeta::MIBFieldType::UINT64,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> ….</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">};</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">To make the field tag, field name, and field type declarations always in sync, a shared .inc file will be used. This file will be shared between compiler-rt and llvm/lib/ProfileData
libraries. Dependencies across the compiler-rt project are not recommended for isolation.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">// MIBDef.inc file</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">// define MIBEntryDef(tag, name, type) before inclusion</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">MIBEntryDef(StackID = 1, "StackID",</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> MIBMeta::MIBFieldType::UINT64)</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">MIBEntryDef(AllocCount = 2, "AllocCount", </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> MIBMeta::MIBFieldType::UINT32)</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">...</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">MIBEntryDef(NumSameDeallocCPU=13,"NumSameDeallocCPU",
</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> MIBMeta::MIBFieldType:UINT8)</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal" style="margin-bottom:12pt"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">enum MIBFieldTag {</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> StartTag = 0,</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> #define MIBEntryDef(tag, name, type) tag</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> #include "MIBDef.inc"</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> #undef MIBEntryDef</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">};</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">const char *MIBFieldName {</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> #define MIBEntryDef(tag, name, type) name</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> "",</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> #include "MIBEntryDef.inc"</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> #undef MIBEntryDef</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">};</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">uint8 MIBFieldType {</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> #define MIBEntryDef(tag, name, type) type</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> 0, // not used</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> #include "MIBEntryDef.inc"</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> #undef MIBDefEntry</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">};</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">The InstrProfRecord for each function will hold the schema and an array of Memprof info blocks, one for each unique allocation context.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Symbolized Memprof call stack section </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">—------------------------------------<br>
<br>
<br>
</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">This section holds the symbolized version of the call stack section in the raw profile. This is necessary to enable the compiler to map recorded runtime PC addresses
to source locations during recompilation. For space efficiency, this section is split into three subsections: 1. stack entry table 2. file path table 3. string table.<br>
<br>
<br>
</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Fig 4 shows the relationship between the three tables.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> STACK ENTRY FILE PATH STRING</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> TABLE TABLE TABLE</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> ┌────────┐ ┌──┬──┬──┐ ┌─────────┐</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> │ │ │ │ │ │ │10 abc │</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> │ │ ├──┼──┼──┤ ├─────────┤</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> │ │ │ │ │ │◄┐ │11 def │</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> │ │ ├──┼──┼──┤ │ ├─────────┤</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> ├──┬──┬──┤ │ │ │ │ │ │12 ghi │</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> ┌───┤03│LN│DI│ ├──┼──┼──┤ │ ├─────────┤</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> │ ├──┴──┴──┤ ┌───►│03│13│01├─┘ ┌─►│13 XY.h │</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> │ └────────┘ │ └──┴─┬┴──┘ │ └─────────┘</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> │ │ │ │</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> └────────────────┘ └────────┘</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Fig 4. The Stack Entry, File Path and String Table. </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> LN = Line Number, DI = Discriminator</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">* Stack Entry Table: Each uniquely identified symbolized call stack consists of a variable number of callsites. In the indexed format, each callsite needs to be represented
as file:line:discriminator (as shown in Fig 4). The call stack location is a 64-bit field, but it is split into three subfields: file table index, line number, and the discriminator value. The file table index is a pointer to a leaf node in the prefix encoded
scheme described below.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">* File path table using prefix encoding: Full path filenames have lots of common substrings due to the directory structure so they can be compressed. One simple scheme
is to use the reverse prefix tree representation. In this representation, the name string of a directory at each level (not including prefixes) is represented by a node, and it is linked to its parent node. To summarize, the file path table is represented
as an array of nodes organized as a forest of reversed tree structures. </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">For instance, the strings
<br>
“abc/def/ghi/XY.c”, <br>
“abc/def/ghi/XY.h”, </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">“abc/def/jkl/UV.c” </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">are represented as </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +-----------+<------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | abc | |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+----->+---->+-----------+ |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| | | def +-------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| | +-----------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| +-----+ ghi |<-------+----+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">| +-----------+ | |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">+-+--------->| jkl | | |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | +-----------+ | | </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | | XY.c +--------+ |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | +-----------+ |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +----------+ UV.c | |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +-----------+ |</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> | XY.h +-------------+</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> +-----------+ </span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal" style="margin-bottom:12pt"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Fig 5. Prefix tree representation of paths in the file path table</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Each parent link implies a path separator ‘/’. Furthermore we represent each file or directory string as an integer offset into the string table (see Fig 4). Thus each
node holds an offset into the string table and a pointer to the parent (interior) directory node.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">* String table: To remove redundancies across prefix tree nodes in the file path encoding, we use a string table which stores the mapping of string to a unique id. The
id can be simplified as the implicit offset into the table.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Thus this representation exploits redundancy in the source file path prefix where there are a large number of source files in a small number of deeply nested directories.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Symbolizing the raw PC addresses in post-processing using llvm-profdata requires the binary to also be provided as input. As an alternative, we will also experiment with
incrementally generating the symbolized call stack section as part of the raw profile dump at the cost of increased profiling overhead.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal" style="margin-bottom:12pt"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Profile Data matching in IR</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">—--------------------------</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">When matching the profile data, we may have already early inlined the direct caller of the allocation into its caller(s). This means we need to take some additional steps
to identify the matching MIBs. For example, consider the following partial call graph:</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"><br>
<br>
<br>
</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> ┌────────────────────┐ ┌────────────────────┐</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> A │ Call B; debug loc1 │ C │ Call B; debug loc2 │</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> └───────┬────────────┘ └───────────┬────────┘</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> │ │</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> │ │</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> │ ┌─────────────────────────┐ │</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> └────► Call malloc; debug loc3 ◄─────┘</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> B └─────────────────────────┘</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">There will be 2 MIB entries, one for each context (A->B and C->B). As noted earlier, the MIB profile entries will be owned by the function calling the allocation function.
Therefore, we will keep both MIB entries associated with function B in the profile.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">If early inlining (i.e. before profile matching) inlines B into A but not into C it will look like the following when we try to match the profile:</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal" style="margin-bottom:12pt"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> ┌────────────────────┐</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> ┌────────────────────────┐ C │ Call B; debug loc2 │</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> A │Call malloc; debug loc3 │ └───────────┬────────┘</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> │ ; inlined at │ │</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> │ ; debug loc1 │ ┌──────────────┴───────────┐</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> └────────────────────────┘ B │ Call malloc; debug loc3 │</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black"> └──────────────────────────┘</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">Because the MIB corresponding to the A->B context is associated with function B in the profile, we do not find it by looking at function A’s profile when we see function
A’s malloc call during matching. To address this we need to keep a correspondence from debug locations to the associated profile information. The details of the design will be shared in a separate RFC in the future.</span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><span style="font-family:"Courier New""><u></u> <u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">[1]
<a href="https://lists.llvm.org/pipermail/llvm-dev/2020-June/142744.html" target="_blank">https://lists.llvm.org/pipermail/llvm-dev/2020-June/142744.html</a></span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">[2]
<a href="https://git.io/JzdRa" target="_blank">https://git.io/JzdRa</a></span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">[3]
<a href="https://git.io/JzdRR" target="_blank">https://git.io/JzdRR</a></span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p style="margin:0in"><span style="font-size:10pt;font-family:"Courier New";color:black">[4]
<a href="https://git.io/JzdRN" target="_blank">https://git.io/JzdRN</a></span><span style="font-size:10pt;font-family:"Courier New""><u></u><u></u></span></p>
<p class="MsoNormal"><u></u> <u></u></p>
</div>
</div>
</div>
</div>
</div>
</blockquote></div>