[cfe-dev] [LLVMdev] Odd PPC inline asm constraint
Hal Finkel
hfinkel at anl.gov
Fri May 11 22:47:18 PDT 2012
On Tue, 01 May 2012 21:25:29 -0500
Peter Bergner <bergner at vnet.ibm.com> wrote:
> On Tue, 2012-05-01 at 19:58 -0500, Peter Bergner wrote:
> > On Tue, 2012-05-01 at 17:47 -0500, Hal Finkel wrote:
> > > By default it should build for
> > > whatever the current host is (no special flags required). To
> > > specifically build for something else, use:
> > > -ccc-host-triple powerpc64-unknown-linux-gnu
> > > or
> > > -ccc-host-triple powerpc-unknown-linux-gnu
> >
> > So LLVM isn't biarch capable? Meaning one LLVM compiler cannot
> > generate both 32-bit and 64-bit binaries?
>
> Sorry for replying to my own message, but...
>
> Oh, -ccc-host-triple is a compiler option and not a configure option.
> That does work, though it seems I have to link with gcc, since llvm
> still wants to link against the 64-bit crt*.o and libs. Maybe it is
> easier to just have two separate builds.
>
> That said, my simple dynamically linked hello world executed fine
> (ie, it was able to call into libc.so just fine), as well as an
> old C version of the SPEC97 tomcatv benchmark I have laying around.
> So it seems both 32-bit and 64-bit can call into shared libs.
>
> Not to say I haven't seen some code gen warts (using -O3). :)
>
> From hello.s:
>
> main:
> mflr 0
> stw 31, -4(1)
> stw 0, 4(1)
> stwu 1, -16(1)
> lis 3, .Lstr at ha
> mr 31, 1
> la 3, .Lstr at l(3)
> bl puts
> li 3, 0
> addi 1, 1, 16
> lwz 0, 4(1)
> lwz 31, -4(1)
> mtlr 0
> blr
>
> By the strict letter of the 32-bit ABI, the save and restore of
> r31 at a negative offset of r1 is verboten. The ABI states the
> the stack space below the stack pointer is declared as volatile.
> I actually debugged a similar problem way back in my Blue Gene/L
> days, where gcc had a bug and was doing the same thing. We ended
> up taking a signal between the restore of the stack pointer and
> the restore of the nonvolatile reg and the BGL compute node kernel
> trashed the stack below the stack pointer.
Just to confirm, this is an issue specific to the 32-bit ABI, correct?
gcc (4.4.6) seems to do the same thing for PPC64.
Thanks again,
Hal
>
> The second wart is the dead copy to r31...which leads to the
> unnecessary save and restore of r31.
>
> For tomcatv, we have to basically save/restore the entire set
> of non-volatile integer and fp registers. Looking at how
> llvm does that shows:
>
> ...
> lis 3, 56
> ori 3, 3, 57680
> stwx 16, 31, 3
> lis 3, 56
> ori 3, 3, 57684
> stwx 17, 31, 3
> lis 3, 56
> ori 3, 3, 57688
> stwx 18, 31, 3
> lis 3, 56
> ori 3, 3, 57692
> stwx 19, 31, 3
> lis 3, 56
> ori 3, 3, 57696
> stwx 20, 31, 3
> lis 3, 56
> ori 3, 3, 57700
> stwx 21, 31, 3
> [repeated over and over and ...]
>
> Kind of ugly! :) GCC on the other hand stashes away the old value of
> the stack pointer and then uses small negative offsets (legal at this
> point since we've already decremented the stack pointer) from that for
> all of its saves/restores:
>
> ...
> lis 0,0xffc7
> mr 12,1
> ori 0,0,7728
> stwux 1,1,0
> mflr 0
> stw 0,4(12)
> stfd 14,-144(12)
> stfd 15,-136(12)
> stfd 16,-128(12)
> stfd 17,-120(12)
> stfd 18,-112(12)
> ...
> For things that don't work, do you have a small example program
> that shows what's wrong?
>
> Peter
>
>
>
>
--
Hal Finkel
Postdoctoral Appointee
Leadership Computing Facility
Argonne National Laboratory
More information about the cfe-dev
mailing list