[LLVMdev] [lldb-dev] MCJIT Mach-O JIT debugging

Keno Fischer kfischer at college.harvard.edu
Mon Jun 2 14:42:46 PDT 2014


Hmm, never mind, it seems to be working just fine now. I'll clean it up and
submit a patch.


On Mon, Jun 2, 2014 at 4:54 PM, Keno Fischer <kfischer at college.harvard.edu>
wrote:

> We do for ELF (ObjectFileELF::RelocateSection), because LLVM doesn't do
> the debug info relocation for us in that case. It currently does for Mach-O,
> so that shouldn't be an issue yet. The only question is whether lldb
> correctly loads the relocated section (which I think it should, since the
> load address is being set correctly) or whether it loads the section
> directly from the object file.
>
>
> On Mon, Jun 2, 2014 at 4:50 PM, Greg Clayton <gclayton at apple.com> wrote:
>
>> We don't currently apply any relocations (that I know of) for debug info
>> in LLDB.
>>
>> > On Jun 2, 2014, at 12:35 PM, Keno Fischer <kfischer at college.harvard.edu>
>> wrote:
>> >
>> > I think I'm getting closer. The debug_info section is being relocated
>> correctly (I think):
>> >
>> > 0x00000000: Compile Unit: length = 0x00000045  version = 0x0003  abbr_offset = 0x00000000  addr_size = 0x08  (next CU at 0x00000049)
>> >
>> > 0x0000000b: TAG_compile_unit [1] *
>> >              AT_producer( "julia" )
>> >              AT_language( DW_LANG_C89 )
>> >              AT_name( "string.jl" )
>> >              AT_stmt_list( 0x00000000 )
>> >              AT_comp_dir( "." )
>> >              AT_APPLE_optimized( 0x01 )
>> >              AT_low_pc( 0x0000000112f5f1c0 )
>> >              AT_high_pc( 0x000006fb )
>> >
>> > 0x0000002b:     TAG_subprogram [2]
>> >                  AT_low_pc( 0x0000000112f5f1c0 )
>> >                  AT_high_pc( 0x0000000112f5f8bb )
>> >                  AT_frame_base( rbp )
>> >                  AT_MIPS_linkage_name( "julia_parseint_nocheck;18749" )
>> >                  AT_name( "parseint_nocheck" )
>> >                  AT_external( 0x01 )
>> >                  AT_accessibility( DW_ACCESS_private )
>> >
>> > 0x00000048:     NULL
>> >
>> > but lldb is still showing it at the original location:
>> >
>> > 0x7ff3afca9280: SymbolVendor
>> > 0x7ff3afcafa20:   Type{0x0000002b} , name = "parseint_nocheck", clang_type = 0x00007ff3ab548df0 void (void)
>> > 0x7ff3afca93e0:   CompileUnit{0x00000000}, language = "Language(language = 0xafca93e0)", file = './string.jl'
>> > 0x7ff3afcafe20:     Function{0x0000002b}, mangled = julia_parseint_nocheck;18749, type = 0x7ff3afcafa20
>> >
>> > even though the section seems to be loaded correctly:
>> >
>> > Sections for 'JIT(0x7fc4230f4e00)(0x00007fc4230f4e00)' (x86_64):
>> >   SectID     Type             Load Address                             File Off.  File Size  Flags      Section Name
>> >   ---------- ---------------- ---------------------------------------  ---------- ---------- ---------- ----------------------------
>> >   0x00000100 container        [0x0000000112efccf8-0x0000000112f5f8fb)* 0x000003b0 0x00000950 0x00000000 JIT(0x7fc4230f4e00).__TEXT
>> >   0x00000001 code             [0x0000000112f5f1c0-0x0000000112f5f8fb)  0x000003b0 0x0000073b 0x80000400 JIT(0x7fc4230f4e00).__TEXT.__text
>> >   0x00000009 eh-frame         [0x0000000112efccf8-0x0000000112efcd68)  0x00000c90 0x00000070 0x6800000b JIT(0x7fc4230f4e00).__TEXT.__eh_frame
>> >   0x00000200 container        [0x0000000000000784-0x0000000112efce75)* 0x00000aeb 0x00000160 0x00000000 JIT(0x7fc4230f4e00).__DWARF
>> >   0x00000002 dwarf-info       [0x0000000112efcd68-0x0000000112efcdb1)  0x00000aeb 0x00000049 0x02000000 JIT(0x7fc4230f4e00).__DWARF.__debug_info
>> >   0x00000003 dwarf-abbrev     [0x00007fc4230f5934-0x00007fc4230f595f)  0x00000b34 0x0000002b 0x02000000 JIT(0x7fc4230f4e00).__DWARF.__debug_abbrev
>> >   0x00000004 dwarf-line       [0x0000000112efcdc9-0x0000000112efce75)  0x00000b5f 0x000000ac 0x02000000 JIT(0x7fc4230f4e00).__DWARF.__debug_line
>> >   0x00000005 dwarf-str        [0x00007fc4230f5a0b-0x00007fc4230f5a4b)  0x00000c0b 0x00000040 0x02000000 JIT(0x7fc4230f4e00).__DWARF.__debug_str
>> >   0x00000006 dwarf-loc                                                 0x00000c4b 0x00000000 0x02000000 JIT(0x7fc4230f4e00).__DWARF.__debug_loc
>> >   0x00000007 dwarf-ranges                                              0x00000c4b 0x00000000 0x02000000 JIT(0x7fc4230f4e00).__DWARF.__debug_ranges
>> >   0x00000300 container        [0x0000000112efce80-0x0000000112efcec0)* 0x00000c50 0x00000040 0x00000000 JIT(0x7fc4230f4e00).__LD
>> >   0x00000008 regular          [0x0000000112efce80-0x0000000112efcec0)  0x00000c50 0x00000040 0x02000000 JIT(0x7fc4230f4e00).__LD.__compact_unwind
>> >
>> > (the relocated address is
>> >
>> > julia> datapointer(filter(s->s.sectname == "__debug_info",sects)[1])
>> > Ptr{Uint8} @0x0000000112efcd68
>> >
>> > )
>> >
>> > so it seems like despite knowing the correct load address for the
>> __debug_info section, it's still somehow picking up on the old addresses.
>> I'll keep looking, but if something springs to mind, please let me know.
>> >
>> >
>> >
>> >
>> >
>> > On Mon, Jun 2, 2014 at 11:47 AM, Keno Fischer <
>> kfischer at college.harvard.edu> wrote:
>> > I didn't get to work on this more last week, but I'll look at
>> incorporating that suggestion.
>> >
>> > The other question of course is how to do this in LLDB. Right now, what
>> I'm doing is going through and adjusting the load address of every leaf in
>> the section tree. That basically works and gets me backtraces with the
>> correct function names and the ability to set breakpoints at functions in
>> JITed modules. What it doesn't get me yet is line numbers. I suspect that
>> is because the DWARF still refers to the old addresses. I thought
>> relocations should take care of that, but apparently they don't, so I'll
>> have to look at whether to solve this in LLDB or in LLVM. Suggestions are
>> most welcome.
>> >
>> >
>> >
>> > On Wed, May 28, 2014 at 12:53 PM, Greg Clayton <gclayton at apple.com>
>> wrote:
>> >
>> > > On May 28, 2014, at 8:57 AM, Keno Fischer <
>> kfischer at college.harvard.edu> wrote:
>> > >
>> > > Hello,
>> > >
>> > > I'm finally getting back to getting JIT debugging to work for MCJIT.
>> This has worked for ELF for a while in LLVM and support in lldb was added
>> in January (for ELF). I'm now trying to add support for Mach-O and would
>> appreciate some feedback (though I'm fighting my way through learning the
>> format, I'm still just a novice).
>> > >
>> > > My current patchset for llvm is here:
>> https://gist.github.com/loladiro/8d909ddd04e6d7e9a5d0 . I have a
>> corresponding patch for lldb and I basically got this working (modulo line
>> table information, though I'm sure I'm doing something stupid in lldb here).
>> > > The basic approach is, when a section gets allocated, to rewrite the
>> section's `addr` and update every symbol's `n_value` correspondingly. This is
>> very much in line with what is done for ELF, but I'm not sure if it's the
>> right approach, so I'd appreciate if somebody who has more experience with
>> Mach-O could look at the above patch and give some feedback. If this
>> approach looks sane in general, I'll finish up and post both the LLVM and
>> the LLDB patch for formal review.
>> >
>> > The one thing you might want to look into is the n_value only needs to
>> be updated "if ((N_TYPE & n_type) == N_SECT)" (the symbol is in a section
>> and therefore has an address value). Other symbols have values that
>> usually don't need to be modified. You might also need to watch out for
>> absolute symbols (if ((N_TYPE & n_type) == N_ABS)) as there are a few that
>> sometimes don't claim to be a symbol that has a valid address, but they
>> actually do point to an address. The symbol named "mach_header" is one such
>> absolute symbol.
>> >
>> > If this is all new code, get it as close as you can and then we can
>> work the kinks out once it is in the codebase.
>> >
>> > Greg
>> >
>> >
>>
>>
>
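
For reference, the n_value update discussed in the quoted thread (only
touching symbols for which (N_TYPE & n_type) == N_SECT) could be sketched
roughly as below. This is not the code from the patch. The struct and
constants are local mirrors of what <mach-o/nlist.h> defines, so that the
snippet builds on any platform, and slideSectionSymbols is a hypothetical
helper name made up for illustration. Absolute (N_ABS) symbols and stab
entries are deliberately left alone here; a real implementation may need to
special-case them, as noted in the thread.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Local mirrors of the <mach-o/nlist.h> values (assumption: declared here
// only so the sketch compiles without the macOS headers).
constexpr std::uint8_t kNType = 0x0e;  // N_TYPE mask
constexpr std::uint8_t kNAbs  = 0x02;  // N_ABS
constexpr std::uint8_t kNSect = 0x0e;  // N_SECT

// Same field order as struct nlist_64.
struct Nlist64 {
  std::uint32_t n_strx;
  std::uint8_t  n_type;
  std::uint8_t  n_sect;   // 1-based section ordinal, 0 means "no section"
  std::uint16_t n_desc;
  std::uint64_t n_value;
};

// Hypothetical helper: move every symbol defined in section `sect_ordinal`
// from the section's original address to the address the JIT allocated.
// Returns the number of symbols that were updated.
std::size_t slideSectionSymbols(std::vector<Nlist64> &symbols,
                                std::uint8_t sect_ordinal,
                                std::uint64_t old_addr,
                                std::uint64_t new_addr) {
  assert(sect_ordinal != 0 && "section ordinals are 1-based");
  // Unsigned subtraction wraps, so this also works when new_addr < old_addr.
  const std::uint64_t slide = new_addr - old_addr;
  std::size_t updated = 0;
  for (Nlist64 &sym : symbols) {
    // Only symbols that live in a section carry an address in n_value.
    if ((sym.n_type & kNType) != kNSect)
      continue;
    if (sym.n_sect != sect_ordinal)
      continue;
    sym.n_value += slide;
    ++updated;
  }
  return updated;
}
```

The caller would invoke this once per allocated section, after rewriting
that section's `addr`, so the symbol table and the section headers stay
consistent with each other.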