[PATCH] AsmWriter/Bitcode: Support specialized debug nodes
Adrian Prantl
aprantl at apple.com
Wed Feb 11 08:25:13 PST 2015
> On Feb 10, 2015, at 6:29 PM, Duncan P. N. Exon Smith <dexonsmith at apple.com> wrote:
>
>>
>> On 2015-Feb-10, at 16:59, Adrian Prantl <aprantl at apple.com> wrote:
>>
>>
>>> On Feb 10, 2015, at 4:48 PM, Duncan P. N. Exon Smith <dexonsmith at apple.com> wrote:
>>>
>>>
>>>> On 2015-Feb-10, at 16:05, Adrian Prantl <aprantl at apple.com> wrote:
>>>>
>>>> A few nitpicky suggestions and questions:
>>>>
>>>> - !MDSubrange(count: 30, lo: 2)
>>>> what about spelling out “low” and “high”?
>>>
>>> It doesn't mean "low" and "high", though, right? It means
>>> "count"/"size" and "index of first element”.
>>
>> You are probably right. I was thinking of DWARF where a subrange may either have a
>> DW_AT_lower_bound and/or a DW_AT_upper_bound
>> or a
>> DW_AT_lower_bound and/or a DW_AT_count
>>
>> We only support the second form so count and lower_bound would probably be most accurate.
>
> Sounds good. In keeping with the field naming convention I've adopted,
> I'll use 'count:' and 'lowerBound:’.
yes, sure!
>
>>>
>>> Or have I misinterpreted it?
>>>
>>> How about "count" and "low"?
>>>
>>>>
>>>> - do we still need the uniqueID on MDLexicalBlock? Could it be a distinct node instead?
>>>
>>> I think this should be distinct instead. I was planning to do this
>>> as part of the upgrade (moving them into place).
>>>
>>>>
>>>> - is it possible to have it print DW_OP-* constants such as in
>>>> !MDExpression(DW_OP_deref, DW_OP_bit_piece, 0, 8)
>>>> ?
>>>
>>> I think this is pretty difficult the way that expressions are stored
>>> right now -- it requires knowing how many arguments each thing takes,
>>> etc. In particular, how would the `AsmWriter` know whether something
>>> should have a symbolic constant (`DW_OP_deref`) vs. being the number 6?
>>
>> It would have to reuse the code from DIExpression::printInternal().
>
> I think this is doable.
>
> On the LLParser side, I've generally allowed raw ints (for `DW_TAG*`,
> `DW_LANG*`, etc.), but for `DW_OP`s we could (at least for now) require
> symbolic constants.
>
>>>
>>> I was thinking of trying to clean this up later, but since we're
>>> talking about it:
>>>
>>> I think instead of `std::vector<uint64_t>` it should be
>>> `std::vector<ExprOperand>`, where:
>>>
>>> struct ExprOperand {
>>> LocationAtom Kind;
>>> unsigned NumArgs;
>>> uint64_t Args[2];
>>> ExprOperand(LocationAtom Kind);
>>> ExprOperand(LocationAtom Kind, uint64_t Arg);
>>> ExprOperand(LocationAtom Kind, uint64_t Arg1, uint64_t Arg2);
>>>
>>> static ExprOperand getDeref() {
>>> return ExprOperand(dwarf::DW_OP_deref);
>>> }
>>> static ExprOperand getPiece(uint64_t A1, uint64_t A2) {
>>> return ExprOperand(dwarf::DW_OP_bit_piece, A1, A2);
>>> }
>>> };
>>>
>>> ExprOperand Ops = {
>>> ExprOperand::getDeref(),
>>> ExprOperand::getPiece(0, 8)
>>> };
>>> auto *Expr = MDExpression::get(Ops);
>>>
>>> Then the above could trivially be pretty-printed/parsed as:
>>>
>>> !MDExpression(DW_OP_deref, DW_OP_bit_piece(0, 8))
>>
>> or even better ... DW_OP_bit_piece(offset: 0, size: 8)
>
> Sure.
>
>> Yeah that looks good. Also we would need to introduce DW_OP_constu and get rid of the pseudo argument that DW_OP_plus currently takes.
>
> Not necessary. You can represent this already with:
>
> DW_OP_plus(7)
>
> where `7` is the pseudo-argument. This is somewhat horrible, but
> so is `DW_OP_plus` taking a pseudo-argument ;).
>
>> This doesn’t need to happen right away, but we should keep it in mind.
>
> I think something like this would simplify the logic a lot, besides
> making it more extensible.
>
>>>
>>> Thoughts?
>>>
>>>
>>>>
>>>> Will it still print the old-style comments that testcases tend to match on, or are they obsoleted by the human-readable syntax?
>>>
>>> They're obsoleted. The old-style comments will get stale and just
>>> add noise. If we've regressed somehow let me know how.
>>
>> See my comment on DW_OP_bit_piece above :-)
>>
>
> Hah, right, right :).
>
> I'll see if I can get something together for you. If not we can keep
> the DW_TAG_expression comments temporarily.
>
> Is it alright to lose the "offset=" and "size=" markers (at least
> temporarily)? I don't see a natural place to put them in the syntax.
> E.g., I was thinking I'd aim for the following for now:
>
> !MDExpression(DW_OP_deref, DW_OP_plus, 8, DW_OP_bit_piece, 0, 8)
>
> (and with no comment). Is this sufficient?
Sufficient as an intermediate step on our way to the above expression being written as
!MDExpression(DW_OP_deref, DW_OP_constu(8), DW_OP_plus, DW_OP_bit_piece(offset: 0, size: 8))
-- adrian
>
>>
>>>
>>>>
>>>> -- adrian
More information about the llvm-commits
mailing list