[libcxx] [PATCH] unordered_map: Avoid unnecessary mallocs when no insert occurs

Duncan P. N. Exon Smith via cfe-commits cfe-commits at lists.llvm.org
Mon Jan 11 16:23:43 PST 2016


ping

> On 2016-Jan-04, at 12:37, Duncan P. N. Exon Smith <dexonsmith at apple.com> wrote:
> 
> ping
> 
>> On 2015-Dec-17, at 13:56, Duncan P. N. Exon Smith <dexonsmith at apple.com> wrote:
>> 
>> 
>>> On 2015-Dec-16, at 14:42, Duncan P. N. Exon Smith <dexonsmith at apple.com> wrote:
>>> 
>>> This is a follow-up to r239666: "Fix PR12999 - unordered_set::insert
>>> calls operator new when no insert occurs".  That fix didn't apply to
>>> `unordered_map` because unordered_map::value_type gets packed inside:
>>> --
>>> union __value_type {
>>> pair<key_type, mapped_type> __nc;       // Only C++11 or higher.
>>> pair<const key_type, mapped_type> __cc; // Always.
>>> // Constructors...
>>> };
>>> --
>>> and the underlying __hash_table only knows about __value_type.
>> 
>> Sorry for the quick ping, but I realized this morning that my approach
>> was still leaving mallocs on the table.
>> 
>> I've attached a new patch that handles more cases.
>> 
>> This patch should avoid unnecessary mallocs whenever the caller passes
>> in a pair<T, U> such that T is trivially convertible to key_type.
>> 
>> Since __hash_table's value_type is really *never* a pair (for
>> unordered_map, it's a union of two pairs) the static dispatch is all in
>> unordered_map.  It's doing this:
>> - If the argument isn't a pair<>, alloc.
>> - If argument.first can be referenced as const key_type&, don't alloc.
>> - If argument.first can be trivially converted to key_type, don't
>>   alloc.
>> - Else alloc.
>> 
>> In the pre-C++11 world the caller has already converted to
>> unordered_map::value_type.  We can always avoid the alloc.
>> 
>> To support all of this:
>> - In C++03, __unordered_map_equal and __unordered_map_hasher need to
>>   handle unordered_map::value_type.
>> - In C++03, __hash_table::__insert_unique_value() now takes its
>>   argument by template.
>> - In C++11, __hash_table::__insert_unique_value() is now a one-liner
>>   that forwards to __insert_unique_key_value() for the real work.
>> - The versions of __hash_table::__construct_node() that take a
>>   pre-computed hash have been renamed to __construct_node_hash(), and
>>   these versions use perfect forwarding.
>> 
>> Most of the following still apply:
>> 
>>> This is one of my first patches for libc++, and I'm not sure of a few
>>> things:
>>> - Did I successfully match the coding style?  (I'm kind of lost
>>>  without clang-format TBH.)
>>> - Should I separate the change to __construct_node_hash() into a
>>>  separate prep commit?  (I would if this were LLVM, but I'm not sure
>>>  if the common practice is different for libc++.)
>>> - Most of the overloads I added to __unordered_map_hasher and
>>>  __unordered_map_equal aren't actually used by
>>>  __hash_table::__insert_unique_value().  Should I omit the unused
>>>  ones?  (Again, for LLVM I would have omitted them.)
>> 
>> (For the updated patch, I went with the LLVM approach of only adding
>> the used API.  It seems more appropriate in this case.)
>> 
>>> After this I'll fix the same performance issue in std::map (and I
>>> assume std::set?).
> 
> <0001-unordered_map-Avoid-unnecessary-mallocs-when-no-i-v2.patch>



More information about the cfe-commits mailing list