[llvm-dev] [llvm-readobj][RFC]Making llvm-readobj GNU command-line compatible
Jordan Rupprecht via llvm-dev
llvm-dev at lists.llvm.org
Tue Nov 6 09:52:22 PST 2018
Hi James,
I also wanted to work on this discrepancy, but I just sent a patch instead
of an RFC: https://reviews.llvm.org/D54124. Thanks for sending the RFC that
I should have started myself :)
On Tue, Nov 6, 2018 at 4:53 AM James Henderson via llvm-dev <
llvm-dev at lists.llvm.org> wrote:
> Hi all,
>
> A broad goal of many of the LLVM binary tools, such as llvm-objcopy and
> llvm-objdump is to provide an alternative to the GNU equivalent, and as
> such, these tools have been developed to be command-line compatible. One
> tool where this hasn’t been the case up to now is llvm-readobj (aka
> llvm-readelf).
>
I don't want to digress too much, but llvm-objdump isn't compatible either.
For instance, "-df" is an llvm-objdump flag that accepts a list of
functions to disassemble, but objdump accepts "-df" as a merged form of "-d
-f" i.e. "--dissassemble --file-headers". So we may want to consider this
as a meta-discussion for other tools like llvm-objdump.
>
> There was some discussion in https://reviews.llvm.org/D33872 about the
> purpose of llvm-readobj, so I’d like to ask the community's opinion. What
> is the purpose of llvm-readobj? Is it purely intended as an aid to testing?
> Should it be aiming to be GNU compatible, like most of the rest of the LLVM
> tools?
>
>From the source:
// This is a tool similar to readelf, except it works on multiple object
file
// formats. The main purpose of this tool is to provide detailed output
suitable
// for FileCheck.
My impression is that llvm-readobj is intended to provide information in
the spirit of readelf, but not with any strong goal of keeping the format
the same. Then, llvm-readelf (as a symlink wrapper) was added recently, to
be more of a drop-in replacement, although still maybe not strict (same
format, but maybe not char-for-char compatible). That's just what I've
inferred from looking at code though, don't take my impression as judgement.
If that's the case, I think llvm-readelf should be relatively easy to make
breaking changes to if it breaks in favor of increasing GNU readelf
compatibility. llvm-readobj on the other hand has been around for a long
time that folks might be relying on its flag parsing. I'd be happy if the
latter were wrong and we could change llvm-readobj more freely though.
>
> The main issue I discovered with GNU compatibility is that llvm-readobj
> has a few incompatible command-line flags with different interpretations
> between the two tools:
>
> * -s means dump symbols in GNU readelf, but dump sections in llvm-readobj
> * -t means dump section details in GNU readelf, but dump symbols in
> llvm-readobj
> * -a means dump all in GNU readelf, but dump arm attributes in llvm-readobj
>
> There are also several missing aliases and some missing features, but we
> can implement those with no negative impact on the users of llvm-readobj,
> so I won't discuss those here.
>
> Also of relevance here are long options preceded with only a single dash.
> My understanding of GNU’s behaviour is that each letter following it is
> treated as a different option, whereas in llvm-readobj, we get one single
> option (e.g. ‘readobj -abc’ would be equivalent to ‘readobj -a -b -c’, but
> ‘llvm-readobj -abc’ is equivalent to ‘llvm-readobj --abc’). This is at
> least partly related to the cl::opt/libOption issues discussed in
> http://lists.llvm.org/pipermail/llvm-dev/2018-October/127328.html).
>
> I'd like to propose that we fix the three switches above such that they
> match GNU readelf's interpretation, and to change short-option handling
> similarly. This would inevitably result in some test churn (there are
> approximately 200 tests between core llvm and lld that would need
> updating), but it is manageable. More of an issue is that any users would
> suddenly find the switches changing on them, if they have started using
> llvm-readobj. On the other hand, I think the benefit for those used to GNU
> readelf outweighs the cost.
>
+1
>
> We could do a few different things to mitigate the impact of changing
> over, roughly in my order of preference, if we decide against just taking
> the plunge and changing the meaning:
>
> 1) For the next release, add a deprecation warning, saying that the
> switches’ meanings will be changed in a following release, and then fix it
> after the next release has been created, along with release notes
> documenting the change.
> 2) Provide a “--gnu-mode” or similar switch that changes the meaning of
> the command-line switches above to match the GNU mode. This again provides
> an opt-in, but also allows downstream ports to enable it by default, should
> they wish.
> 3) Change the meaning of the switches only for llvm-readelf, and not for
> llvm-readobj. This is similar to the behaviour of --elf-output-style: it is
> GNU for llvm-readelf, and LLVM for llvm-readobj, but does have essentially
> the same potential for disrupting users as 1).
> 4) Provide a third user-facing driver (e.g. “llvm-gnu-readelf”) that
> provides a different CLI to the others. This makes it an opt-in feature, by
> using a different executable.
> 5) Just accept this divergence, although I personally would prefer not to,
> as this has the potential to confuse users migrating from GNU tools to LLVM
> tools.
>
> Thoughts?
>
(3) SGTM (that's the approach I went with in my patch)
(2) Sounds like it could get messy to have dependencies between flags, e.g.
"--gnu-mode --help" and "--help" would have to be programmed to print
different things for what "-s" is an alias of.
(1) Means we would need to wait until the next release (March?) to do
anything? I'd rather not be tied down to slow release cycles :( [btw, does
LLVM have a deprecation policy anywhere?]
(4) I could live with this if it came to it, but I think it's assuming that
someone would want llvm-readelf and *not* want readelf compatibility,
enough to outweigh all the people that want llvm-readelf to be like readelf
-- who is that?
(5) I think we should veto this option -- this discussion means we clearly
don't accept divergence :)
>
> James
> _______________________________________________
> LLVM Developers mailing list
> llvm-dev at lists.llvm.org
> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20181106/c28fdb0d/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 4849 bytes
Desc: S/MIME Cryptographic Signature
URL: <http://lists.llvm.org/pipermail/llvm-dev/attachments/20181106/c28fdb0d/attachment.bin>
More information about the llvm-dev
mailing list