<div dir="ltr"><br><br><div class="gmail_quote"><div dir="ltr">On Fri, Mar 16, 2018 at 4:09 AM Alex Bradbury via llvm-dev <<a href="mailto:llvm-dev@lists.llvm.org">llvm-dev@lists.llvm.org</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">On 16 March 2018 at 10:38, Peter Smith via llvm-dev<br>

<<a href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a>> wrote:<br>

> On 15 March 2018 at 19:12, Friedman, Eli <<a href="mailto:efriedma@codeaurora.org" target="_blank">efriedma@codeaurora.org</a>> wrote:<br>

>> Having ARMv7a instructions in an ARMv4t file shouldn't be a problem: a<br>

>> function should be allowed to override the CPU attributes to generate code<br>

>> for a newer target.  This is generally done using the "target" function<br>

>> attribute.  If this doesn't work correctly, we should fix it.  It looks like<br>

>> it's currently broken; testcase:<br>

>><br>

>> void g();<br>

>> __attribute__((target("thumb,arch=cortex-a53")))<br>

>> void f() { g(); }<br>

>><br>

><br>

> Hmmm, allowing that makes life much more complicated. For example I<br>

> can also write:<br>

> void g();<br>

> __attribute__((target("thumb,arch=cortex-m0")))<br>

> void f() { g(); }<br>

><br>

> void i();<br>

> __attribute__((target("arm,arch=cortex-a53")))<br>

> void h() { i(); }<br>

><br>

> With -mcpu=cortex-m0 and get ARM code within an object claiming to be<br>

> Thumb only with no errors or warnings, with no chance of a linker<br>

> detecting a mismatch either.<br>

<br>

I think we can all agree that there should be no real problem with<br>

instruction selection when adding these sorts of target attributes. As<br>

you point out below, the problems start occurring when it gets to the<br>

MC layer and object emission.<br>

<br>

As the author of the well-intentioned cleanup patch that unmasked this<br>

issue, I'd like to thank you for putting in the time to delve into<br>

things. The patches in question were:<br>

* <a href="https://reviews.llvm.org/rL321707" rel="noreferrer" target="_blank">https://reviews.llvm.org/rL321707</a><br>

* <a href="https://reviews.llvm.org/rL321692" rel="noreferrer" target="_blank">https://reviews.llvm.org/rL321692</a><br>

<br>

> I think that part of this is the same problem that is observed in<br>

> PR36542 the ARMAsmBackend that is responsible for widening the tail<br>

> call to a Thumb2 branch is created with ARMv4T which doesn't support<br>

> Thumb1. There has been a recent change that threads through the<br>

> existing SubtargetInfo instead of recreating it from the triple alone.<br>

> It is worth mentioning that the object level BuildAttributes do not<br>

> include Thumbv7a which is misleading to a linker as it will be<br>

> expecting no ARMv7A in the object.<br>

><br>

> Has there already been a discussion about what per function<br>

> code-generation with BuildAttributes higher than the base object<br>

> should mean in the context of capabilities of the ARMAsmBackend and<br>

> BuildAttributes? My thoughts right now are that if ARMAsmBackend is to<br>

> operate at an object level, rather than a per-function level then it<br>

> has to use the capabilities of the highest architecture in the file.<br>

> This also means giving the object BuildAttributes of the highest<br>

> architecture in the file, and giving an error if they contradict, for<br>

> example mixing Thumb Cortex-M0 and ARM Cortex-A53. If the<br>

> ARMAsmBackend could be made to work on a per-function level then there<br>

> is a chance that we could only widen the tail call to g() in f(), but<br>

> not elsewhere. To honestly describe this in the BuildAttributes we<br>

> would need to use per Section or per Function attributes though.<br>

><br>

> My suggestion to move forward here is:<br>

> - Recreate the SubtargetInfo based on the merge of all the Targets and<br>

> CPU information that we have seen, or warn/error if they are<br>

> incompatible.<br>

> - Ouput the Tag_CPU_arch BuildAttributes based on the merge of all the<br>

> Targets and CPU information that we have seen.<br>

><br>

> It is probably worth moving any discussion of this particular part to<br>

> PR36542 since it is somewhat Arm specific. I'll add this comment to<br>

> there.<br>

<br>

I'm not so sure this is ARM specific, as other targets might well<br>

encounter similar issues (even if there is no direct equivalent to<br>

build attributes, there are cases where information is encoded into<br>

ELF flags on a per-object basis).<br><br></blockquote><div><br></div><div>FWIW I've followed up in the bug with both a high level description of how these things work (and should work) as well as what to do for things that are encoded on a per-object basis.</div><div><br></div><div>-eric </div></div></div>