<html><head><meta http-equiv="Content-Type" content="text/html charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" class=""><br class=""><div><blockquote type="cite" class=""><div class="">On May 11, 2015, at 3:32 PM, David Blaikie <<a href="mailto:dblaikie@gmail.com" class="">dblaikie@gmail.com</a>> wrote:</div><br class="Apple-interchange-newline"><div class=""><div dir="ltr" class=""><br class=""><div class="gmail_extra"><br class=""><div class="gmail_quote">On Mon, May 11, 2015 at 3:25 PM, Adrian Prantl <span dir="ltr" class=""><<a href="mailto:aprantl@apple.com" target="_blank" class="">aprantl@apple.com</a>></span> wrote:<br class=""><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div style="word-wrap:break-word" class=""><br class=""><div class=""><div class=""><div class="h5"><blockquote type="cite" class=""><div class="">On May 11, 2015, at 3:14 PM, David Blaikie <<a href="mailto:dblaikie@gmail.com" target="_blank" class="">dblaikie@gmail.com</a>> wrote:</div><br class=""><div class=""><div dir="ltr" class=""><br class=""><div class="gmail_extra"><br class=""><div class="gmail_quote">On Mon, May 11, 2015 at 3:09 PM, Adrian Prantl <span dir="ltr" class=""><<a href="mailto:aprantl@apple.com" target="_blank" class="">aprantl@apple.com</a>></span> wrote:<br class=""><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class=""><div class=""><br class="">
> On May 11, 2015, at 3:04 PM, Duncan P. N. Exon Smith <<a href="mailto:dexonsmith@apple.com" target="_blank" class="">dexonsmith@apple.com</a>> wrote:<br class="">
><br class="">
><br class="">
>> On 2015 May 11, at 14:51, David Blaikie <<a href="mailto:dblaikie@gmail.com" target="_blank" class="">dblaikie@gmail.com</a>> wrote:<br class="">
>><br class="">
>><br class="">
>><br class="">
>> On Mon, May 11, 2015 at 2:42 PM, Duncan P. N. Exon Smith <<a href="mailto:dexonsmith@apple.com" target="_blank" class="">dexonsmith@apple.com</a>> wrote:<br class="">
>><br class="">
>>> On 2015 May 11, at 09:43, Adrian Prantl <<a href="mailto:aprantl@apple.com" target="_blank" class="">aprantl@apple.com</a>> wrote:<br class="">
>>><br class="">
>>>><br class="">
>>>> On May 9, 2015, at 6:55 AM, Duncan P. N. Exon Smith <<a href="mailto:dexonsmith@apple.com" target="_blank" class="">dexonsmith@apple.com</a>> wrote:<br class="">
>>>><br class="">
>>>><br class="">
>>>>> On 2015 May 8, at 17:38, Adrian Prantl <<a href="mailto:aprantl@apple.com" target="_blank" class="">aprantl@apple.com</a>> wrote:<br class="">
>>>>><br class="">
>>>>>><br class="">
>>>>>> On May 8, 2015, at 1:00 PM, Duncan P. N. Exon Smith <<a href="mailto:dexonsmith@apple.com" target="_blank" class="">dexonsmith@apple.com</a>> wrote:<br class="">
>>>>>><br class="">
>>>>>><br class="">
>>>>>>> On 2015 May 8, at 15:32, David Blaikie <<a href="mailto:dblaikie@gmail.com" target="_blank" class="">dblaikie@gmail.com</a>> wrote:<br class="">
>>>>>>><br class="">
>>>>>>><br class="">
>>>>>>><br class="">
>>>>>>> On Fri, May 8, 2015 at 11:18 AM, Adrian Prantl <<a href="mailto:aprantl@apple.com" target="_blank" class="">aprantl@apple.com</a>> wrote:<br class="">
>>>>>>> Hi dexonsmith, dblaikie, echristo,<br class="">
>>>>>>><br class="">
>>>>>>> This adds a DIExternalTypeRef debug metadata node to the IR that is meant to be emitted by Clang when referring to a type that is defined in a .pcm file. DIExternalTypeRef is a child of DICompositeTypeBase because it shares common traits such as the unique identifier.<br class="">
>>>>>>><br class="">
>>>>>>> Could this be an attribute to the DIClassType/StructureType? (to emit the currently needed type unit references, we still should know the correct class/structure type tag) LLVM already knows to hash the ref identifier (mangled name) for the type unit identifier.<br class="">
>>>>><br class="">
>>>>> Note that DIExternalTypeRef also knows the tag.<br class="">
>>>>>><br class="">
>>>>>> I think they should be separate for now. When we have an<br class="">
>>>>>> `identifier:` field in `DICompositeType` we use the<br class="">
>>>>>> string-based `DITypeRef`s instead of direct pointers. I don't<br class="">
>>>>>> imagine there's any value in adding that indirection for these.<br class="">
>>>>><br class="">
>>>>> It would make lookups slower, but otherwise it would be harmless.<br class="">
>>>>><br class="">
>>>>>> (Although, this makes me wonder about an LTO scenario. Imagine you<br class="">
>>>>>> have two translation units, and they both use a type identified by<br class="">
>>>>>> (e.g.) "my-type". One of them pulled the type in from a module<br class="">
>>>>>> and has a `DIExternalTypeRef`, while the other translation unit<br class="">
>>>>>> didn't use modules and has a `DICompositeType`. What should the<br class="">
>>>>>> debug info look like? Is it okay if we end up with both? If not,<br class="">
>>>>>> which one should win?<br class="">
>>>>><br class="">
>>>>> This is a scenario that can only happen in C++. For maximum space efficiency you would want the external type ref to win or you will end up with two copies of the type (which isn’t otherwise harmful). If you are are using dsymutil, it will still unique the two copies.<br class="">
>>>>><br class="">
>>>>>> Will these new nodes show up in the<br class="">
>>>>>> `retainedType:` list?)<br class="">
>>>>><br class="">
>>>>> They will not show up in the retained types list.<br class="">
>>>><br class="">
>>>> That's a show stopper for sharing with `DICompositeType`, at least<br class="">
>>>> until we get rid of type refs.<br class="">
>>>><br class="">
>>>> There's logic scattered around the compiler that assumes that if a<br class="">
>>>> node is `DICompositeType` and it has a valid identifier field, then<br class="">
>>>> it should be referenced by string. If something is referenced by<br class="">
>>>> a string, then it needs to be in the retained type list, or you<br class="">
>>>> can't convert the string back to a `DICompositeType`.<br class="">
>>>><br class="">
>>>> I think it would be better to keep these classes distinct. If it<br class="">
>>>> makes sense to, we can always merge them later (once we no longer<br class="">
>>>> have type refs).<br class="">
>>><br class="">
>>> There is no hard reason for why the external type references couldn’t be added to the list of retained types. We could add code to the DWARF backend to not emit the “definitions” of external retained types.<br class="">
>><br class="">
>> Okay, sure, but that seems like a hack. To me it looks like all the<br class="">
>> IR-level and backend code that cares that something is a<br class="">
>> DICompositeType will need special handling based on whether it's an<br class="">
>> external type ref.<br class="">
>><br class="">
>> So far as I recall very little code cares about DICompositeType specifically - it should just be one place, essentially, that fills out the attributes/child DIEs of the type DIE. We'd have a special case there that would be about as costly/shoehorned as the case for emitting a declaration. (it seems like it's just a different kind of declaration - one with a DW_AT_signature as well as the DW_AT_declaration and DW_AT_name)<br class="">
>><br class="">
>> Is there some complexity I'm missing here?<br class="">
><br class="">
> Hmm. I just did a `git grep DICompositeType`, and there's less<br class="">
> in the backend than I thought.<br class="">
><br class="">
> I guess the only real problem I have is with the type refs. It<br class="">
> doesn't make sense to me that we'd add these to the retained<br class="">
> type list.<br class="">
<br class="">
</div></div>The fwddecl-style references (with full DeclContext + Name) that David referred to actually make some sense in the retained types list.</blockquote><div class=""><br class="">Not sure I follow how one (fwddecl style references) relate to the other (retained types list)?<br class=""></div></div></div></div></div></blockquote><div class=""><br class=""></div></div></div><div class="">There is really nothing we could emit for an external type ref that is only a signature or UID.</div></div></div></blockquote><div class=""><br class=""></div><div class="">Still missing some steps in this thread. Confused.<br class=""><br class="">If all we have is a signature, then we emit DW_AT_type + DW_FORM_ref_sig8</div></div></div></div></div></blockquote><div><br class=""></div><div>I was thinking in the context of the loop that emits the standalone versions of all retained types:</div><div><div> for (auto *Ty : CUNode->getRetainedTypes())</div><div> CU.getOrCreateTypeDIE(cast<DIType>(resolve(Ty->getRef())));</div><div class=""><br class=""></div><div class="">(Not that it would be hard to ignore signature/UID-only types here.)</div></div><blockquote type="cite" class=""><div class=""><div dir="ltr" class=""><div class="gmail_extra"><div class="gmail_quote"><div class=""> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div style="word-wrap:break-word" class=""><div class=""><span class=""><br class=""><blockquote type="cite" class=""><div class=""><div dir="ltr" class=""><div class="gmail_extra"><div class="gmail_quote"><div class=""> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"> As I recently noted in the thread about DIModule there are good reasons to emit them even on Darwin (they tell us which module a type originated from) </blockquote><div class=""><br class="">Don't follow this either - what does the type reference style (DW_TAG_type + DW_FORM_ref_sig8 compared to DW_TAG_type as a reference to a declaration type DIE) relate to module origin and dsymutil's behavior?<br class=""></div></div></div></div></div></blockquote><div class=""><br class=""></div></span><div class="">I was thinking more about DW_AT_type + DW_FORM_ref_sig8 (not sure if that was a typo)</div></div></div></blockquote><div class=""><br class="">Not a typo - that's one alternative way to reference a type in a type unit if we don't emit a full declaration. (but it presents problems when we want to emit the out-of-line definition of a member function that needs to reference the declaration which we currently put inside the declaration - GCC Produces 3 different ways of referencing type units, I just implemented the most general (the one that allows member function declarations))<br class=""> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div style="word-wrap:break-word" class=""><div class=""><div class=""> versus a DW_AT_TYPE + ref to fwd decl that is a child of the DW_TAG_module or the skeleton CU.</div></div></div></blockquote><div class=""><br class="">I'd rather not reference things in the module or skeleton CU - I'm hoping to keep this usable by a DWARF 5 debugger supporting fission and type units rather than adding more custom/experimental features.<br class=""></div></div></div></div></div></blockquote><div><br class=""></div>That would be something we’d only do on systems that don’t support type units. (i.e. Darwin: partially because Mach-O doesn’t do comdat sections, partially because other dwarf consumers like Instruments and dtrace also don’t support them.) Which is not to say that it will always be different. It would be straightforward to have dsymutil take bag-of-dwarf-style debug info and produce compact output that is easy for these tools to digest.<br class=""><blockquote type="cite" class=""><div class=""><div dir="ltr" class=""><div class="gmail_extra"><div class="gmail_quote"><div class=""><br class="">For anything outside fission+type units, I really want to talk about that sort of thing in detail to understand the motivation, direction, benefits, tradeoffs, alternatives, etc. That's why I like piggybacking on the existing features - it requires far less scrutiny/design as the features are already formalized/implemented/used in practice, etc.<br class=""></div></div></div></div></div></blockquote><div><br class=""></div><div>That’s fair.</div><div><br class=""></div><blockquote type="cite" class=""><div class=""><div dir="ltr" class=""><div class="gmail_extra"><div class="gmail_quote"><div class=""> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div style="word-wrap:break-word" class=""><div class=""><div class="">In the first case the consumer cannot know in which module to look for the type (other than by searching through all modules imported by the CU). In the second case it could use the decl context of the fwd decl (assuming that it would be the root of the DeclContext some way or another) to figure out the module. It also could load the type directly from the module using the info from the decl context.</div></div></div></blockquote><div class=""><br class="">Using the fully qualified name built from the context/name? If that suits you guys, though I think we originally discussed having a type unit hash->module info type identifier table somewhere, but that's for the other review threads I think... which I'll eventually muster the courage to get back to. Sorry for the delay.</div></div></div></div></div></blockquote><div><br class=""></div>thanks,</div><div>adrian<br class=""><blockquote type="cite" class=""><div class=""><div dir="ltr" class=""><div class="gmail_extra"><div class="gmail_quote"><div class=""><br class="">- David<br class=""> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div style="word-wrap:break-word" class=""><div class=""><span class="HOEnZb"><font color="#888888" class=""><div class=""><br class=""></div><div class="">-- adrian</div></font></span><span class=""><br class=""><blockquote type="cite" class=""><div class=""><div dir="ltr" class=""><div class="gmail_extra"><div class="gmail_quote"><div class=""> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">and let dsymutil optimize them away.<br class="">
<span class=""><font color="#888888" class=""><br class="">
-- adrian<br class="">
</font></span><div class=""><div class=""><br class="">
><br class="">
>><br class="">
>> It seems like it's just being shoehorned in.<br class="">
>><br class="">
>> @David, if you still feel like they should be the same class, can you<br class="">
>> explain why?<br class="">
>><br class="">
>>> I’m fine with either variant, with a preference to introducing a separate DIExternalTypeRef over having a very special DICompositeType that is distinguished by a flag.<br class="">
>><br class="">
>><br class="">
><br class="">
<br class="">
</div></div></blockquote></div><br class=""></div></div>
</div></blockquote></span></div><br class=""></div></blockquote></div><br class=""></div></div>
</div></blockquote></div><br class=""></body></html>