[LLVMdev] Using LLVM to serialize object state -- and performance
Paul J. Lucas
paul at lucasmail.org
Wed Nov 7 07:12:56 PST 2012
On Nov 6, 2012, at 11:49 AM, "Kaylor, Andrew" <andrew.kaylor at intel.com> wrote:
> I think you may have gone beyond what I understand in how the legacy JIT code works. It looks like the call to addGlobalMapping should short-circuit the named function look up that I described ...
Well, I first look for the function by name and, if I didn't find it, then I call addGlobalMapping(). But that's not where the time is going. Here:
https://dl.dropbox.com/u/46791180/callgraph.pdf
is a call graph generated by kcachegrind. I still don't understand all the numbers (and this PDF seems not to include commas where it should), but if you look at the left fork, the bottom two ovals, "Schedule..." is called 16K times and "setHeightToAtLeas..." is called 37K times. On the right fork, RAGreed... is called 35K times.
Those are far too many calls to *anything* for a simple sequence of "call" LLVM instructions. Something seems horribly wrong.
- Paul
More information about the llvm-dev
mailing list