[cfe-dev] [RFC] clang-ifso: A new clang-based tool for generating interface libraries.
Jake Ehrlich via cfe-dev
cfe-dev at lists.llvm.org
Tue Apr 9 02:06:33 PDT 2019
I think using .o files is possible from llvm-elfabi. It should also be
possible to support doing this from headers. I think both of those methods
have issues however. In particular I think producing this from .o files
isn't worth it when you can just produce it from the fully linked binary
anyways.
Let's explore some options and some trade offs.
1) Producing from headers: This let's you build the .tbe file and/or stub
before anything is compiled or linked. It however requires you to keep
track of headers which is slightly complicated or verbose depending nonnhow
you do it. It also requires reparsing everything. Further still you have to
make sure you have all the same flags and you also have to make lots.of
assumptions about the linker. It's basically very redundant and requires a
lot of matching of behavior of the compiler and linker. Also keep in mind
that you can at most match the behavior of 1 compiler and 1 linker. They're
similar but not exactly the same and there are 4 major combinations of
these things between GNU and llvm. Also keep in mind that in general you
actually need to use arbitrary source nfiles not just headers files.
2) Producing .tbe files from .o files: This means that all compilation must
be performed first but linking can be done in parallel. That's a pyrrhic
victory however when you consider the IO cost. Additionally while you no
longer have to match the behavior of the compiler you still have to match
linker behavior. The pro is that this probably fits nicely into existing
builds. It also requires no changes to clang.
3) Outputting a single file for each translation unit that can eventually
be linked just like above: this is just a slightly better version if the
above. Keep in mind that .tbe files match the ABI if a DSO so they have
things like soname and dynamic library dependencies so you can't reuse the
same format. This still has most of the draw backs of the above but I cost
is probably slightly less overall but you increase the amount of IO output
by the compiler. This also requires a change.to clang
4) Producing from fully linked DSO. This should be just as easy to
incorporate into a build and uses far less IO than using a bunch of special
files or object files. It also requires no matching of behavior with the
linker. This seems strictly better to me than the above methods. This also
requires no modifications to clang. This is intended to be a supported mode
for llvm-elfabi.
5) keep .tbe files in your source tree and produce stubs from .the files.
This has all the benefits of 4 and then some at the consequence of
requiring users to maintain these files themselves (via llvm-elfabi mind
you) and the consequence of being far more complicated to incorporate into
a build. But the benefits are huge. Your ABI becomes a reviewable part of
your source. Your modules build in 100% since linking one module no longer
depends on linking another module, just building the stub (or potentially
just the .tbe file). This is an intended mode for llvm-elfabi and the
situation I want our personal builds to work towards. I think 4) is still
important to make work however.
>From my perspective, llvm-elfabi already is intended to support the optimal
use cases. Other ways of operating seem strictly less optimal. For
timelines I plan on picking llvm-elfabi back up on April 16th.
Contributions welcome. I can start filling bugs for features if you'd like.
There was some other talk if using yaml2obj or code from llvm-objcopy to
emit ELF files. I think this should be more carefully considered before
proceeding. Neither it these things are quite in ideal fidelity with the
needs at play here. I can talk more about it but if we use llvm-elfabi, or
at least use the same writer from it, I don't think that's really need.
Making llvm-objcopy into a library is a big task and the result is probably
not ideally suited for this but we could speak more about that.
Best,
Jake
On Tue, Apr 9, 2019, 1:58 AM Puyan Lotfi via cfe-dev <cfe-dev at lists.llvm.org>
wrote:
> So, we are actually now thinking of redoing this patch but using a similar
> idea.
>
> Petr, would llvm-elfabi be capable of merging multiple of these tbe files?
>
> I'm looking at implementing something in the Driver that basically
> generates a tbe file for each .o file. The benefit would be that you could
> pretty easily just let loose your standard clang build invocation already
> in your build system with a couple added flags, and that's all you'd need
> to do to generate the ifsos assuming llvm-elfabi could consume all of the
> tbe's and link them together etc.
>
> I think tbe could be a point of collaboration on this. Thoughts?
>
> PL
>
>
> On Fri, Apr 5, 2019 at 2:35 PM Puyan Lotfi <puyan.lotfi.llvm at gmail.com>
> wrote:
>
>> A few of us (Alex, Saleem and I) have discussed ideas for ELF generation.
>> It sounds like aside from yaml2obj, lld, and MC's ELFObjectWriter there is
>> also all of the ELF emission done by llvm-objcopy.
>> Alex tells me he's actually been intending to extract a lot of the ELF
>> emission code from llvm-objcopy (llvm/tools/llvm-objcopy/ELF/Object.cpp/h)
>> into a library somewhere in lib/Object.
>> We think it's probably best to do this refactoring, then eventually have
>> llvm-elfabi, clang-ifso, yaml2obj, etc all do their ELF writing through it
>> in some capacity (either going through one of the text formats, or
>> directly).
>>
>> As far as clang-ifso goes, I think it should be a standalone tool that is
>> capable of generating multiple ifso formats (yaml, tbe, raw elf, etc).
>> At some point soon I will probably put up a phabricator patch that only
>> includes the yaml emission along with some tests; I think that could work
>> for now.
>>
>> Thoughts?
>>
>> PL
>>
>> On Fri, Apr 5, 2019 at 9:47 AM Puyan Lotfi <puyan.lotfi.llvm at gmail.com>
>> wrote:
>>
>>> CC'ing George and Alex:
>>>
>>> I was thinking along these lines, that it would be nice to have some
>>> kind of common code base for things like yaml2obj as well. I spent some
>>> time at first looking at and trying to decouple yaml2elf from the yaml
>>> parts until I realized that it'd be a lot better to just generate the yaml
>>> in memory and pass it on to yaml2elf.
>>>
>>> Is llvm-elfabi planning to interop with yaml2obj any? As far as I can
>>> tell, currently in the monorepo the only places that are creating ELF .dyn*
>>> sections are in yaml2obj and lld. I agree it'd be nice to figure out some
>>> kind of code reuse plan.
>>>
>>> PL
>>>
>>>
>>>
>>> On Thu, Apr 4, 2019 at 6:21 PM Petr Hosek <phosek at chromium.org> wrote:
>>>
>>>> llvm-elfabi is going to generate ELF binaries directly as well (we
>>>> originally considered extending lld to support tbe files directly but we
>>>> later decided not to do that to avoid introducing the additional
>>>> complexity). It'd be great if clang-ifso could emit tbe files and use the
>>>> same implementation for generating ELF binaries to avoid duplication
>>>> between the two tools.
>>>>
>>>> On Thu, Apr 4, 2019 at 5:42 PM Puyan Lotfi <puyan.lotfi.llvm at gmail.com>
>>>> wrote:
>>>>
>>>>> Shoaib, Petr:
>>>>>
>>>>> I don’t yet know the full details of elf-tapi, but the two ways I
>>>>> believe these differs are:
>>>>>
>>>>> 1) clang-ifso is a clang based tool that scans from the source level
>>>>> and uses visibility attributes to put together a set of symbols to expose
>>>>> in the ifso file.
>>>>>
>>>>> 2) clang-ifso primarily aims to generate the ELF binary directly, but
>>>>> also can generate yaml that yaml2obj can generate the ifso from. Currently
>>>>> the aim is to avoid introducing linker changes.
>>>>>
>>>>> It’s possible clang-ifso could also support generating said text stubs
>>>>> (tbe’s) too.
>>>>>
>>>>> PL
>>>>>
>>>>>
>>>>> On Thu, Apr 4, 2019 at 5:23 PM Petr Hosek <phosek at chromium.org> wrote:
>>>>>
>>>>>> To expand on Shoaib's answer, llvm-elfabi
>>>>>> <https://github.com/llvm/llvm-project/tree/master/llvm/tools/llvm-elfabi> is
>>>>>> a very similar tool that we've been working on for the past few months in
>>>>>> upstream. The current implementation is still incomplete, but once
>>>>>> D56569 <https://reviews.llvm.org/D56569>, D55839
>>>>>> <https://reviews.llvm.org/D55839> and D55864
>>>>>> <https://reviews.llvm.org/D55864> land (hopefully within the next
>>>>>> few weeks), it should be possible to use llvm-elfabi to produce text stubs
>>>>>> (called .tbe files) and generate ELF interface libraries out of those.
>>>>>>
>>>>>> On Thu, Apr 4, 2019 at 5:18 PM Shoaib Meenai via cfe-dev <
>>>>>> cfe-dev at lists.llvm.org> wrote:
>>>>>>
>>>>>>> How does this compare to the ELF TAPI implementation in LLVM (
>>>>>>> http://lists.llvm.org/pipermail/llvm-dev/2018-September/126472.html
>>>>>>> and https://reviews.llvm.org/D53051)?
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> *From: *cfe-dev <cfe-dev-bounces at lists.llvm.org> on behalf of
>>>>>>> cfe-dev <cfe-dev at lists.llvm.org>
>>>>>>> *Reply-To: *Puyan Lotfi <puyan.lotfi.llvm at gmail.com>
>>>>>>> *Date: *Thursday, April 4, 2019 at 4:52 PM
>>>>>>> *To: *cfe-dev <cfe-dev at lists.llvm.org>
>>>>>>> *Cc: *Puyan Lotfi <puyan at puyan.org>
>>>>>>> *Subject: *[cfe-dev] [RFC] clang-ifso: A new clang-based tool for
>>>>>>> generating interface libraries.
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> Hi All,
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> The following is our proposal for clang-ifso, we hope to get some
>>>>>>> valuable feedback from the community on this and we hope this work helps
>>>>>>> SDK authors in the future:
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> On platforms such as Darwin and Windows, there exist library
>>>>>>> interface files (TAPI tbd files, Windows Import Library files) that appear
>>>>>>> to the compile-time linker as just another library file (i.e. dylib, dll,
>>>>>>> etc) but are in-fact only empty listings of symbols to interface functions
>>>>>>> that were intended for exposure by the library writer (i.e. their `.text`
>>>>>>> sections are stripped, but they have the API symbols needed for linking).
>>>>>>> These library interfaces can be used to both limit access to internals of a
>>>>>>> library at static compile time, and can be used to speedup link time in the
>>>>>>> case of linking with extremely large dynamic libraries. Aside from
>>>>>>> providing more controlled API exposure and reduced memory usage in linking,
>>>>>>> there is also the benefit of having a much smaller distribution size for
>>>>>>> development SDKs (in the case where you’ve got an SDK with applications
>>>>>>> that are built and linked on a PC but then deployed to run on a totally
>>>>>>> different device). Finally, these interface libraries can in many cases be
>>>>>>> used to break up build dependencies as well if they can be cached or
>>>>>>> generated quickly in some way prior to building all the different libraries
>>>>>>> in a build.
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> clang-ifso is a tool that intends to bring the concept of interface
>>>>>>> libraries to ELF shared objects. We call it ifso as a shorthand for
>>>>>>> InterFace-Shared-Object: as in, we intend to support ELF by producing a .so
>>>>>>> that looks just like a regular .so file to the linker but has most of the
>>>>>>> .text and other contents dropped and has only the intended API interface
>>>>>>> symbols populated. That means a .so file generated by clang-ifso contains
>>>>>>> only a stripped .text section plus the `.dynsym` and `.dynstr` sections
>>>>>>> along with the global symbols populated. Currently it is in a prototype
>>>>>>> state and is capable of generating object-yaml or ELF, and is implemented
>>>>>>> using the ClangTool interface.
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> As mentioned at the beginning, there is prior art in this area with
>>>>>>> things like TBD files on Darwin as well as Microsoft’s import libraries. As
>>>>>>> an aside, there is evidence that ELF versions of ifsos are also deployed in
>>>>>>> production in some settings, so this is not completely new with ELF either
>>>>>>> but the effort to make all of the work upstream is. One other way that
>>>>>>> clang-ifso does differ from a lot of the prior work is that it uses the
>>>>>>> clang parser to glean what visible NamedDecls are present in the library
>>>>>>> headers.
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> Because clang-ifso operates from the clang level we can and do
>>>>>>> enable the programmer to use visibility attributes to expose or hide
>>>>>>> whatever interface libraries they want in their actual code rather than
>>>>>>> relying on a separate file to strip symbols as a post-build step. And since
>>>>>>> all compile-time linking will be done with the ifso rather than the .so,
>>>>>>> all clients of the library will need to completely rely on whatever was
>>>>>>> exposed through visibility attributes.
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> We are eager to to hear feedback and ideas in this space. Code for
>>>>>>> clang-ifso is available at:
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> https://github.com/plotfi/llvm-project/tree/master/clang/tools/clang-ifso
>>>>>>> .
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> PL
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>> _______________________________________________
>>>>>>> cfe-dev mailing list
>>>>>>> cfe-dev at lists.llvm.org
>>>>>>> https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev
>>>>>>>
>>>>>> _______________________________________________
> cfe-dev mailing list
> cfe-dev at lists.llvm.org
> https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/cfe-dev/attachments/20190409/78e4d064/attachment.html>
More information about the cfe-dev
mailing list