[llvm-dev] Representations of IR in the output of opt
Mehdi AMINI via llvm-dev
llvm-dev at lists.llvm.org
Mon May 27 13:15:40 PDT 2019
On Mon, May 27, 2019 at 1:12 PM Sébastien Michelland <
sebastien.michelland at ens-lyon.fr> wrote:
> Hi Mehdi,
>
> Thank you for mentioning this tool, I was looking for something like
> this. By default the analyzer produces identical output on both files,
> but a complete -dump shows that the storage order of the symbol table is
> different.
>
Thanks for the update!
It may be desirable to sort the table before writing the bitcode out,
adding Peter to the thread for his opinion.
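
For anyone reproducing this, here is a rough sketch of one way to check whether two llvm-bcanalyzer dumps differ only in the order of their lines. The helper name same_lines_any_order is made up for illustration and is not part of LLVM. It assumes a reordered table shows up as the same dump lines in a different order, which may not hold if offsets or abbreviation IDs also change. A mismatch therefore does not prove the contents differ.

```shell
#!/bin/sh
# Hypothetical helper (not an LLVM tool): succeeds when the two text files
# contain the same lines, possibly in a different order.
same_lines_any_order() {
    a=$(mktemp) && b=$(mktemp) || return 2
    sort "$1" > "$a"
    sort "$2" > "$b"
    cmp -s "$a" "$b"      # exit status 0 when the sorted files are identical
    status=$?
    rm -f "$a" "$b"
    return $status
}

# Intended use on the files from this thread (requires the LLVM tools).
# The input is read from stdin so that the file name is less likely to
# appear in the output; that is an assumption about the summary format.
#   llvm-bcanalyzer -dump < stress-1.bc > stress-1.dump
#   llvm-bcanalyzer -dump < stress-2.bc > stress-2.dump
#   same_lines_any_order stress-1.dump stress-2.dump \
#       && echo "same dump lines, different order"
```

If the check succeeds while a plain diff of the two dumps does not, that is at least consistent with a pure reordering.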
--
Mehdi
> This would explain why text files are not affected: the symbols are used
> directly in text form so there is no need for this table.
>
> I suppose that settles the question of where. Out of curiosity, I'd like
> to know if there is a way to order the table in a canonical form. I
> found -preserve-bc-uselistorder which makes more sense (and seems to
> correspond because the table lists all uses of each symbol), but no luck
> yet.
>
> At least now I'm sure that there is no semantic difference between the
> programs so it's a great help. :D
>
> Thanks,
> Sébastien Michelland
>
> On 5/27/19 1:48 PM, Mehdi AMINI wrote:
> > Hi,
> >
> > I would try running llvm-bcanalyzer on these bc files; this may help
> > to understand where the discrepancy is coming from.
> >
> > Best,
> >
> > --
> > Mehdi
> >
> >
> > On Mon, May 27, 2019 at 10:42 AM Sébastien Michelland via llvm-dev
> > <llvm-dev at lists.llvm.org> wrote:
> >
> > Hi Eli,
> >
> > Unfortunately the differences remain, I do not observe a significant
> > change in the output besides the fact that it's random.
> >
> > I noticed that running opt without options on the random file changes
> > the order of references in the predecessors of basic blocks (sample
> > below). Further invocations of opt are idempotent.
> >
> > I don't know if this information is stored in the bytecode file as
> > well.
> >
> > < ; preds = %CF, %CF80, %CF78
> > > ; preds = %CF80, %CF, %CF78
> >
> > FWIW, the conflicting section of the bytecode file is likely not a
> > permutation because the byte patterns don't match (some of the byte
> > values of stress-1.bc are not present in stress-2.bc).
> >
> > Thanks for your help :)
> > Sébastien Michelland
> >
> > On 5/24/19 5:32 PM, Eli Friedman wrote:
> > > Are you passing -preserve-ll-uselistorder when you create the .ll
> > > files? It's off by default because the output tends to be sort of
> > > unreadable, but it could explain some of the differences you're
> > > seeing.
> > >
> > > -Eli
> > >
> > >> -----Original Message-----
> > >> From: llvm-dev <llvm-dev-bounces at lists.llvm.org> On Behalf Of
> > >> Sébastien Michelland via llvm-dev
> > >> Sent: Friday, May 24, 2019 12:53 PM
> > >> To: llvm-dev at lists.llvm.org
> > >> Subject: [EXT] [llvm-dev] Representations of IR in the output of opt
> > >>
> > >> Hi LLVM,
> > >>
> > >> I'm currently setting up some tools to investigate the influence of
> > >> the order of optimization passes on the performance of compiled
> > >> programs - nothing exceptional here.
> > >>
> > >> I noticed something inconvenient with opt, namely that splitting a
> > >> call does not always give the same output:
> > >>
> > >> % llvm-stress > stress.ll
> > >> % opt -dse -verify -dce stress.ll -o stress-1.bc
> > >> % opt -dse stress.ll | opt -dce -o stress-2.bc
> > >> % diff stress-{1,2}.bc
> > >> Binary files stress-1.bc and stress-2.bc differ
> > >>
> > >> The difference seems meaningful; it's ~180 bytes out of ~1400 bytes
> > >> of output in my random case. I can't decode it however, because
> > >> disassembling the bytecode produces identical text files, even with
> > >> annotations. (!)
> > >>
> > >> I made sure that the sequence for [-dse -verify -dce] is the
> > >> concatenation of the individual sequences; this falls in place
> > >> naturally because -dce has no dependencies. The verifier pass helps
> > >> make two function pass managers, just in case.
> > >>
> > >> Now if I do the same thing but staying in text format, I get the
> > >> same IR (up to module name):
> > >>
> > >> % opt -S -dse -verify -dce stress.ll -o stress-1.ll
> > >> % opt -S -dse stress.ll | opt -S -dce -o stress-2.ll
> > >> % diff -y --suppress-common-lines stress-{1,2}.ll
> > >> ; ModuleID = 'stress.ll' | ; ModuleID = '<stdin>'
> > >>
> > >> Is there a specific behavior of opt that could explain this
> > >> situation? What kind of difference could there be in the bytecode
> > >> files that is lost in translation to text format?
> > >>
> > >> Cheers,
> > >> Sébastien Michelland
> > >>
> > >> _______________________________________________
> > >> LLVM Developers mailing list
> > >> llvm-dev at lists.llvm.org
> > >> https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
> >
>