[llvm-dev] [RFC] Stop giving a default CPU to the LTO plugin?
Eric Christopher via llvm-dev
llvm-dev at lists.llvm.org
Mon Mar 19 21:59:31 PDT 2018
On Fri, Mar 16, 2018 at 4:09 AM Alex Bradbury via llvm-dev <
llvm-dev at lists.llvm.org> wrote:
> On 16 March 2018 at 10:38, Peter Smith via llvm-dev
> <llvm-dev at lists.llvm.org> wrote:
> > On 15 March 2018 at 19:12, Friedman, Eli <efriedma at codeaurora.org> wrote:
> >> Having ARMv7a instructions in an ARMv4t file shouldn't be a problem: a
> >> function should be allowed to override the CPU attributes to generate
> >> code for a newer target. This is generally done using the "target"
> >> function attribute. If this doesn't work correctly, we should fix it.
> >> It looks like it's currently broken; testcase:
> >>
> >> void g();
> >> __attribute__((target("thumb,arch=cortex-a53")))
> >> void f() { g(); }
> >>
> >
> > Hmmm, allowing that makes life much more complicated. For example I
> > can also write:
> > void g();
> > __attribute__((target("thumb,arch=cortex-m0")))
> > void f() { g(); }
> >
> > void i();
> > __attribute__((target("arm,arch=cortex-a53")))
> > void h() { i(); }
> >
> > with -mcpu=cortex-m0 and get ARM code within an object claiming to be
> > Thumb-only, with no errors or warnings and no chance of a linker
> > detecting the mismatch either.
>
> I think we can all agree that there should be no real problem with
> instruction selection when adding these sorts of target attributes. As
> you point out below, the problems start occurring when it gets to the
> MC layer and object emission.
>
> As the author of the well-intentioned cleanup patch that unmasked this
> issue, I'd like to thank you for putting in the time to delve into
> things. The patches in question were:
> * https://reviews.llvm.org/rL321707
> * https://reviews.llvm.org/rL321692
>
> > I think that part of this is the same problem that is observed in
> > PR36542: the ARMAsmBackend that is responsible for widening the tail
> > call to a Thumb2 branch is created with ARMv4T, which doesn't support
> > Thumb2. There has been a recent change that threads through the
> > existing SubtargetInfo instead of recreating it from the triple alone.
> > It is worth mentioning that the object-level BuildAttributes do not
> > include Thumbv7a, which is misleading to a linker as it will be
> > expecting no ARMv7A in the object.
> >
> > Has there already been a discussion about what per function
> > code-generation with BuildAttributes higher than the base object
> > should mean in the context of capabilities of the ARMAsmBackend and
> > BuildAttributes? My thoughts right now are that if ARMAsmBackend is to
> > operate at an object level, rather than a per-function level then it
> > has to use the capabilities of the highest architecture in the file.
> > This also means giving the object BuildAttributes of the highest
> > architecture in the file, and giving an error if they contradict, for
> > example mixing Thumb Cortex-M0 and ARM Cortex-A53. If the
> > ARMAsmBackend could be made to work on a per-function level then there
> > is a chance that we could only widen the tail call to g() in f(), but
> > not elsewhere. To honestly describe this in the BuildAttributes we
> > would need to use per Section or per Function attributes though.
> >
> > My suggestion to move forward here is:
> > - Recreate the SubtargetInfo based on the merge of all the Targets and
> > CPU information that we have seen, or warn/error if they are
> > incompatible.
> > - Output the Tag_CPU_arch BuildAttributes based on the merge of all the
> > Targets and CPU information that we have seen.
> >
> > It is probably worth moving any discussion of this particular part to
> > PR36542 since it is somewhat Arm specific. I'll add this comment to
> > there.
>
> I'm not so sure this is ARM specific, as other targets might well
> encounter similar issues (even if there is no direct equivalent to
> build attributes, there are cases where information is encoded into
> ELF flags on a per-object basis).
>
>
FWIW I've followed up in the bug with both a high level description of how
these things work (and should work) as well as what to do for things that
are encoded on a per-object basis.
-eric
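
For illustration only, the merge proposed in the quoted suggestion (combine the target and CPU information of every function in the file, and reject combinations that contradict) might be sketched in C roughly as follows. Everything here is hypothetical: the struct names, the integer arch_level ordering and merge_targets() are assumptions made for this sketch and are not LLVM's SubtargetInfo or BuildAttributes APIs. Real ARM architectures also do not form a simple linear order, so treat the "highest level wins" rule as a simplification.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical, simplified model of per-function target information. */
enum profile { PROFILE_NONE, PROFILE_A, PROFILE_M };

struct fn_target {
    int arch_level;        /* assumed ordering: a higher number is a newer arch */
    enum profile profile;  /* PROFILE_NONE for pre-profile arches such as ARMv4T */
    bool uses_arm;         /* function contains ARM (A32) code */
    bool uses_thumb;       /* function contains Thumb code */
};

/* Hypothetical object-level summary, standing in for build attributes. */
struct obj_attrs {
    int arch_level;
    enum profile profile;
    bool uses_arm;
    bool uses_thumb;
};

/*
 * Merge the targets of all functions in one object.
 * Returns true and fills *out on success; returns false, leaving *out
 * untouched, when the functions contradict each other.
 */
bool merge_targets(const struct fn_target *fns, size_t n, struct obj_attrs *out)
{
    assert(out != NULL);
    struct obj_attrs merged = { 0, PROFILE_NONE, false, false };

    for (size_t i = 0; i < n; i++) {
        const struct fn_target *f = &fns[i];

        /* Two different named profiles (for example A and M) cannot be merged. */
        if (f->profile != PROFILE_NONE) {
            if (merged.profile != PROFILE_NONE && merged.profile != f->profile)
                return false;
            merged.profile = f->profile;
        }

        /* The object advertises the highest architecture seen in the file. */
        if (f->arch_level > merged.arch_level)
            merged.arch_level = f->arch_level;

        merged.uses_arm = merged.uses_arm || f->uses_arm;
        merged.uses_thumb = merged.uses_thumb || f->uses_thumb;
    }

    /* M-profile cores are Thumb-only, so ARM code in such an object is an error. */
    if (merged.profile == PROFILE_M && merged.uses_arm)
        return false;

    *out = merged;
    return true;
}
```

Under these assumptions, an ARMv4T ARM function plus a Thumb function targeting a newer A-profile core merges to the newer architecture, while the Thumb Cortex-M0 plus ARM Cortex-A53 mix from the quoted example is rejected.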