[PATCH] D55045: Add a version of std::function that includes a few optimizations.
Jordan Soyke via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Dec 7 11:41:23 PST 2018
jsoyke marked 3 inline comments as done.
jsoyke added inline comments.
================
Comment at: include/functional:1774
+template <typename _Tp>
+using __fast_forward =
+ typename _VSTD::conditional<_VSTD::is_scalar<_Tp>::value, _Tp, _Tp&&>::type;
----------------
sbenza wrote:
> jsoyke wrote:
> > This optimization wasn't in the last version, sbenza@ mentioned we might want to do it. It helps with the Invoke benchmarks, as much as 10-20%.
> >
> > It's probably worth mentioning that this can hurt performance in cases where the caller is using std::forward, or passing small value types by reference, but these are very uncommon (at least in my experience).
> Can you show how this can hurt?
> Do you have a benchmark that gets better when this is removed?
>
> This will only trigger if the signature of the function itself takes by value.
> `std::function::operator()` will have it by value already, and the user callback will take it by value.
> If we pass by ref we will be forced to put it on the stack on a different address even if the caller passed an lvalue because `operator()`'s arguments forced a copy.
>
> We will need to read that data and pass it in by value anyway. We are only moving that lvalue-to-rvalue decay before the vtable instead of in the vtable.
>
> https://gcc.godbolt.org/z/R7JBq2
Never-mind, my reasoning was flawed. I was basing my experience on a piece of software that did something like fast_forward but internally dispatched to std::function. It had to move register objects to the stack and made the fast_forward code perform worse, but we're fixing that problem here.
Repository:
rCXX libc++
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D55045/new/
https://reviews.llvm.org/D55045
More information about the llvm-commits
mailing list