<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="Generator" content="Microsoft Word 12 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:Tahoma;
panose-1:2 11 6 4 3 5 4 4 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:purple;
text-decoration:underline;}
span.EmailStyle17
{mso-style-type:personal-reply;
font-family:"Calibri","sans-serif";
color:#1F497D;}
.MsoChpDefault
{mso-style-type:export-only;}
@page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang="EN-US" link="blue" vlink="purple">
<div class="WordSection1">
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">Currently we parse the CU headers even in a DWP+DWP merge to check for duplicates. So I don’t think it would be a regression.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">-- wolfgang<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<div style="border:none;border-left:solid blue 1.5pt;padding:0in 0in 0in 4.0pt">
<div>
<div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">From:</span></b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif""> David Blaikie [mailto:dblaikie@gmail.com]
<br>
<b>Sent:</b> Tuesday, February 27, 2018 9:39 AM<br>
<b>To:</b> reviews+D42937+public+3fe0cbc66cf01928@reviews.llvm.org<br>
<b>Cc:</b> Pieb, Wolfgang; aprantl@apple.com; jdevlieghere@apple.com; llvm-commits@lists.llvm.org; mgrang@codeaurora.org<br>
<b>Subject:</b> Re: [PATCH] D42937: [DWARF] Make llvm-dwp handle DWARF v5 string offsets tables and indexed strings.<o:p></o:p></span></p>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<p class="MsoNormal">Yeah, I'll just call it - I think the maintenance burden of committing/maintaining support for v4/v5 mixed str_offsets isn't worth it (but is worth it for llvm-dwarfdump - for investigation, etc) for llvm-dwp, etc.<br>
<br>
But I guess it's still a bit complicated - in a DWP+DWP merge, we'd still have to parse at least one unit header to decide which version to look for in str_offsets? (but that's still an improvement compared to having to read all the headers - which is a regression,
right? Currently (without this patch) no headers need to be parsed when doing DWP+DWP, I think? (even though for any DWO file being merged in, we need to do lots of parsing to find its identifier, etc))<o:p></o:p></p>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div>
<p class="MsoNormal">On Mon, Feb 26, 2018 at 5:53 PM Paul Robinson via Phabricator <<a href="mailto:reviews@reviews.llvm.org">reviews@reviews.llvm.org</a>> wrote:<o:p></o:p></p>
</div>
<blockquote style="border:none;border-left:solid #CCCCCC 1.0pt;padding:0in 0in 0in 6.0pt;margin-left:4.8pt;margin-right:0in">
<p class="MsoNormal" style="margin-bottom:12.0pt">probinson added inline comments.<br>
<br>
<br>
================<br>
Comment at: string_offsets_mixed.s:40-46<br>
+# With DWARF v5 we need to match up CUs with their corresponding string offsets table<br>
+# contributions (in order to properly process the contribution headers) and hence need<br>
+# to remap and write them unit-by-unit to the output file. The pre-v5 implementation<br>
+# did not need to worry about this because it could just rewrite the input file's string<br>
+# offsets table in one chunk. The test therefore ensures that llvm-dwp rewrites the<br>
+# string offsets table contributions in the order they appear in the input file section,<br>
+# and not in the order in which their corresponding CUs appear in the index table.<br>
----------------<br>
dblaikie wrote:<br>
> wolfgangp wrote:<br>
> > dblaikie wrote:<br>
> > > Oh... I hadn't realized/understood/thought about this.<br>
> > ><br>
> > > That's kind of awkward - mixing blobs with headers and blobs without headers in the same section (str_offsets) & then having to use the CU/TUs to disambiguate/dictate how to parse those chunks.<br>
> > ><br>
> > > Can we avoid that? Could we just say v4 and v5 are incompatible? Have a flag or something that checks.<br>
> > ><br>
> > > Then we could always walk the str_offsets alone, either expecting header'd sections (which I hope are self descriptive - once you know you're reading a v5 str_offsets, you don't need to consult the CU for anything to do that?) or non-header'd sections
(where you just treat every word as a string offset without consideration for how those are divided into contributions)<br>
> > ><br>
> > ><br>
> > > (Ah, I see you mentioned/highlighted that in the patch description too)<br>
> > > That's kind of awkward - mixing blobs with headers and blobs without headers in the same section (str_offsets) & then having to use the CU/TUs to disambiguate/dictate how to parse those chunks.<br>
> > ><br>
> > > Can we avoid that? Could we just say v4 and v5 are incompatible? Have a flag or something that checks.<br>
> > ><br>
> > That would have been the simpler solution, but llvm-dwarfdump is already handling the same thing, so I thought it would be inconsistent if llvm-dwp didn't handle it too. We also would have to reject any dwp input files with mixed units that were created
by non-llvm tools.<br>
> ><br>
> > It would certainly be easy to reject any mixing of v5 and v4 units.<br>
> > Pity, though, since it's already working...<br>
> I'd be happy to hear some other people's opinions on this (Adrian & Paul?).<br>
><br>
> It seems pretty unfortunate to have this split between versions making a bunch of complexity to support like this.<br>
The sticking point for me is whether there are objects in the wild that use GNU-style .debug_str_offsets, and might get linked with proper v5 .debug_str_offsets. The linker will combine them without a second thought, and it's unreasonable to ask a linker to
verify.<br>
I guess... given that you need to enable this stuff explicitly for gcc with DWARF v4, and the point is to reduce relocations which is an irrelevant consideration for fission, it's maybe not so likely that we have objects like this in the wild. That makes it
reasonable to put our collective foot down and say llvm-dwp won't mix-n-match.<br>
<br>
I do think it's worthwhile for llvm-dwarfdump to handle the mixed case, partly because it's already done and partly because it's a different kind of tool (diagnostic/analysis rather than production).<br>
<br>
<br>
<a href="https://reviews.llvm.org/D42937" target="_blank">https://reviews.llvm.org/D42937</a><br>
<br>
<br>
<o:p></o:p></p>
</blockquote>
</div>
</div>
</div>
</body>
</html>