[llvm-dev] [RFC] Clean up the way we store optional Function data

Sanjoy Das via llvm-dev llvm-dev at lists.llvm.org
Wed Oct 21 15:27:07 PDT 2015



David Majnemer wrote:
> I feel OK about this trade-off.  The vast majority of C++ TUs have < 65k
> functions.  Sanjoy?

Most of our "TU"s have at most a handful of (~ 20) functions, so I'm 
fine with this too.

-- Sanjoy

>
> On Wed, Oct 21, 2015 at 2:25 PM, Vedant Kumar <vsk at apple.com
> <mailto:vsk at apple.com>> wrote:
>
>     I've done some measurements on this.
>
>     The test program I have just calls Function::Create(),
>     F->setPersonalityFn(), and then F->eraseFromParent() in a loop 2^20
>     times.
>
>     Results:
>
>          pre-patch  --- min: 1.10s    max: 1.13s    avg: 1.11s
>          post-patch --- min: 1.26s    max: 1.35s    avg: 1.29s
>
>     So we expect to lose 0.2 seconds per 1 million functions (with
>     personality functions) in a project. I've only tested on my machine,
>     and I haven't accounted for performance variations in other parts of
>     the codebase where code was removed.
>
>     I'm happy enough with the patch and think that this is a reasonable
>     tradeoff. Should we go with this?
>
>     http://reviews.llvm.org/D13829
>
>     vedant
>
>
>      > On Oct 16, 2015, at 5:56 PM, Duncan P. N. Exon Smith
>     <dexonsmith at apple.com <mailto:dexonsmith at apple.com>> wrote:
>      >
>      >
>      >> On 2015-Oct-16, at 13:54, Vedant Kumar <vsk at apple.com
>     <mailto:vsk at apple.com>> wrote:
>      >>
>      >> Here is a WIP patch as promised:
>      >>
>      >> http://reviews.llvm.org/D13829
>      >>
>      >> It uses a hungoff uselist to store optional data as needed.
>      >>
>      >> Some early objections from Duncan:
>      >>
>      >>   - An extra one-time malloc() is required to set personality
>     functions.
>      >>   - We get and set personality functions frequently. This patch
>     introduces a level of indirection which slows the common case down.
>      >>
>      >> Is this overhead acceptable?
>      >
>      > If the overhead isn't bad, then this SGTM.  (I haven't looked at the
>      > patch.)
>      >
>      > I guess I was thinking about large applications that use C++
>     exceptions
>      > and build with LTO, but even there it's a per-function overhead.  I
>      > doubt it really matters.
>      >
>      >> If not, maybe we could leave personality function handling
>     untouched and add a "Constant **OptionalData" array to Function.
>      >
>      > The `IndirectUser` approach you outlined initially might be better
>      > than this?
>      >
>      >> vedant
>      >>
>      >>
>      >>> On Oct 14, 2015, at 3:12 PM, Vedant Kumar <vsk at apple.com
>     <mailto:vsk at apple.com>> wrote:
>      >>>
>      >>> I like the idea of using hung off uses.
>      >>>
>      >>> We can keep using SubclassData to indicate whether or not some
>     optional data is present.
>      >>>
>      >>> Benefits: zero space overhead until some optional data is set,
>     we can get rid of the DenseMaps in LLVMContextImpl, and RAUW just
>     works (so no clang changes are needed).
>      >>>
>      >>> I'll have a patch up before the end of the week.
>      >>>
>      >>> thanks
>      >>> vedant
>      >>>
>      >>>
>      >>>> On Oct 12, 2015, at 12:15 PM, Sanjoy Das via llvm-dev
>     <llvm-dev at lists.llvm.org <mailto:llvm-dev at lists.llvm.org>> wrote:
>      >>>>
>      >>>>
>      >>>>
>      >>>> David Majnemer wrote:
>      >>>>>
>      >>>>>
>      >>>>> On Mon, Oct 12, 2015 at 11:00 AM, Duncan P. N. Exon Smith via
>     llvm-dev
>      >>>>> <llvm-dev at lists.llvm.org <mailto:llvm-dev at lists.llvm.org>
>     <mailto:llvm-dev at lists.llvm.org <mailto:llvm-dev at lists.llvm.org>>>
>     wrote:
>      >>>>>
>      >>>>>
>      >>>>>> On 2015-Oct-12, at 10:41, Sanjoy Das
>      >>>>> <sanjoy at playingwithpointers.com
>     <mailto:sanjoy at playingwithpointers.com>
>      >>>>> <mailto:sanjoy at playingwithpointers.com
>     <mailto:sanjoy at playingwithpointers.com>>> wrote:
>      >>>>>>
>      >>>>>>
>      >>>>>>
>      >>>>>> Vedant Kumar wrote:
>      >>>>>>>>> That's a neat idea. To summarize: make Function have 3
>     optional
>      >>>>> operands. (For context -- Function currently has 1 optional
>     operand,
>      >>>>> and my proposal is to move to 0.)
>      >>>>>>>>>
>      >>>>>>>>> Could someone else chime in on what they'd like to see?
>      >>>>>>>> Sanjoy's idea makes sense to me, but only if we never need
>     to add
>      >>>>>>>> prefix/prologue data after functions are created.  Are
>     there any
>      >>>>> places
>      >>>>>>>> where we need/want to add them after the fact?
>      >>>>>>>
>      >>>>>>> I think so. I see:
>      >>>>>>>
>      >>>>>>> LinkModules.cpp:
>      >>>>> Dst.setPrefixData(MapValue(Src.getPrefixData(), ValueMap,
>      >>>>>>> BitcodeReader.cpp:
>      >>>>> FunctionPrologueWorklist.back().first->setPrologueData(C);
>      >>>>>>> InlineFunction.cpp:
>     Caller->setPersonalityFn(CalledPersonality);
>      >>>>>>>
>      >>>>>>> Some of these sites could be refactored so that the
>     Functions are
>      >>>>> created with the prefix/prologue data they need. I don't think
>      >>>>> that's possible for personality functions (see my third example).
>      >>>>>>>
>      >>>>>>> Would we inhibit any future patches which add prefix/prologue
>      >>>>> data to Functions on the fly by taking this approach?
>      >>>>>>
>      >>>>>> You should always be able to create a new `llvm::Function`
>      >>>>> instance (and RAUW it in) if you want to add prefix/prologue
>     data to
>      >>>>> functions after they've been created; just like you have to
>     do today
>      >>>>> for any other `llvm::User`s that do not have hung off uses.
>      >>>>>
>      >>>>> It's possible, but a lot more involved with `Function`s.  Besides
>      >>>>> RAUW, you need to transfer over all the basic blocks.
>      >>>>>
>      >>>>> This seems kind of wrong to me, if we expect it to happen.
>      >>>>>
>      >>>>>> Which brings me to -- can you use hung off uses for this?  These
>      >>>>> use lists can be resized on the fly, so you should be able to add
>      >>>>> and remove prologue data on the fly.  If you're using hung
>     off uses,
>      >>>>> you'll probably still need a descriptor to remember whether /
>     which
>      >>>>> operands are prologue data etc.
>      >>>>>
>      >>>>> Sure, this is another option.  It might be simplest.  I'd be
>      >>>>> tempted to start with a 0/3 choice (if we allocate any hung-off
>      >>>>> uses, allocate enough for all three operands) to simplify the
>      >>>>> logic.  Although I can't remember right now whether that's
>      >>>>> legal (having nullptr operands followed by real ones)...
>      >>>>>
>      >>>>>>>>>>> Personalities are stored as ``optional`` Function operands.
>      >>>>> We actually always
>      >>>>>>>>>>> allocate the space for this ``optional`` operand: there's a
>      >>>>> FIXME in the
>      >>>>>>>>>>> destructor for Function about this.
>      >>>>>
>      >>>>> Makes me wonder, why didn't we use hung off uses to begin with?
>      >>>>> Do functions "usually" have personality functions, for some
>      >>>>> definition of?
>      >>>>>
>      >>>>>
>      >>>>> Depends. In C++? It's pretty common to have objects which have
>      >>>>> non-trivial destructors on the stack which means calling a
>     function will
>      >>>>> be an invoke which will require the function to have a
>     personality. In
>      >>>>> C? It's pretty rare. You'd need something like
>     __attribute__((cleanup))
>      >>>>> to do it, the most common source of this will be something
>      >>>>> like pthread_cleanup_push. If I recall correctly, Julia sets the
>      >>>>> personality on functions regardless of whether or not there
>     are any
>      >>>>> invokes, they need the AsmPrinter to scribble something
>     down.  I can't
>      >>>>> say for other languages (Rust, etc.).  From what I
>     understand, Swift
>      >>>>> doesn't use landingpad for EH so they wouldn't need the
>     personality set.
>      >>>>
>      >>>> Most functions we emit from our Java frontend have personalities.
>      >>>>
>      >>>> -- Sanjoy
>      >>>>
>      >>>>>
>      >>>>> _______________________________________________
>      >>>>> LLVM Developers mailing list
>      >>>>> llvm-dev at lists.llvm.org <mailto:llvm-dev at lists.llvm.org>
>     <mailto:llvm-dev at lists.llvm.org <mailto:llvm-dev at lists.llvm.org>>
>      >>>>> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>      >>>>>
>      >>>>>
>      >>>> _______________________________________________
>      >>>> LLVM Developers mailing list
>      >>>> llvm-dev at lists.llvm.org <mailto:llvm-dev at lists.llvm.org>
>      >>>> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev
>      >>>
>      >>
>      >
>
>


More information about the llvm-dev mailing list