[cfe-dev] Catching Temporarily Used Types in Debug Info

Mon Mar 3 13:07:26 PST 2014

On Mon, Mar 3, 2014 at 11:17 AM, David Blaikie <dblaikie at gmail.com> wrote:

>
>
>
> On Mon, Mar 3, 2014 at 10:37 AM, Adrian Prantl <aprantl at apple.com> wrote:
>
>> Hi David!
>>
>> Slightly tangential, but this also reminds me that I eventually was going
>> to send out a proposal for debug info for Modules/PCH. Having access to the
>> serialized AST in its entirety would eliminate problems like this nicely.
>> DWARF is extended by an external AST type DIE that is merely a USR-based
>> index into the corresponding module file. Everything that is not explicitly
>> referenced in the DWARF can still be found in the module/PCH and read via
>> libclang...
>>
>>
>> On Mar 3, 2014, at 8:41 AM, David Blaikie <dblaikie at gmail.com> wrote:
>>
>> > We have a few bugs like ( http://llvm.org/bugs/show_bug.cgi?id=19005 )
>> > in debug info that all stem from the same basic problem:
>> >
>> > A type that's not referenced by another debug entity (such as a
>> > variable, parameter, function, etc) is not emitted in the debug info.
>> >
>> > GCC, while not being wholely consistent at addressing this, does get
>> > it 'more' right. I'd like Clang to be better at this as well, even if
>> > we're not perfect.
>> >
>> > The basic premise to implement this perfectly would be: If we're
>> > emitting code for a function (or global variable initializer, etc) and
>> > within that function a certain type is required to be complete, emit
>> > the type (and include it in the "retained types list").
>> >
>> > Does anyone have nice ideas on how we could realistically implement
>> > that test (yeah, I'm mostly looking at Richard on this) or a rough
>> > approximation that might get the 90% case?
>>
>> The idea would be for the fronted to register a record type as
>> debug-info-retained whenever, e.g., it is calculating the record's memory
>> layout (or similar)?
>>
>> I guess this is really a question for the frontend people :-)
>
>
> Hence this thread :)
>
> What you described is essentially "required complete type" which is a
> thing we already know - but we don't really want to go around emitting
> every type that is required to be complete. Consider this header:
>
> struct foo {
> ...
> };
> inline void func() {
>   foo f;
>   ...
> }
>
> Now any translation unit that includes that header, even if it doesn't use
> 'foo' at all and never calls 'func', would emit foo (since foo is required
> to be complete due to the 'f' variable in the 'func' function). Now
> consider that Sema.h looks exactly like this.
>
> An example use where we wouldn't want to emit 'foo's definition:
>
> bar.h:
> #include "foo.h"
> class bar {
>   foo *f;
> public:
>   bar();
>   int compute_thing_with_foo();
> };
>
> The out of line member function and ctor use the full definition of 'foo',
> but 'bar' does not, neither do clients of 'bar' need to. It'd be nice if we
> didn't emit the full definition of 'foo' here.
>
> Richard and I talked through some of this a bit and considered two
> solutions
>
> 1) essentially what we had before I made the change to power
> -flimit-debug-info/-fstandalone-debug (prior to the vtable optimization
> going in there too) by "requiresCompleteType": callback during Clang's
> IRGen for various AST constructs that we know require types to be complete,
> and emit debug info for those types. This is a bit painful since we
> essentially ad-hoc implement complete type detection again... - but I might
> be OK with this going into it knowing how/why we have to do this and taking
> a somewhat more systematic approach.
>
> 2) Have a callback at every point a type was required to be complete, even
> if it was already complete (currently we just have a callback on the
> instnant the type is first required to be complete) and if necessary,
> record the context in which that type was required to be complete (eg: as a
> member of another type, as a use in a function, etc). Then if/when we emit
> that contextual decl, check if any types were associated with it and emit
> those - recursively expanding (since the associated type might itself be
> the contextual decl for some other type).
>

As a nuance here, we only need the extra cost here if we're in a context
that we don't know for sure that we're going to emit (inside an inline
function definition or a class definition). Inside a normal function
definition, we can just directly register the type to have debug info
emitted, if we've not already done so.

> I'm erring towards the latter, though we know in both cases there might be
> gotchas for all manner of things that we need to blacklist/whitelist to get
> to a good place...
>
> - David
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/cfe-dev/attachments/20140303/e845f634/attachment.html>