[PATCH] Indirect call target profiling compiler-rt changes

Mon Jun 15 21:33:23 PDT 2015

> "Betul Buyukkurt" <betulb at codeaurora.org> writes:
>>> "Betul Buyukkurt" <betulb at codeaurora.org> writes:
>>>>>> Index: lib/profile/InstrProfilingBuffer.c
>>>>>> ===================================================================
>>>>>> --- lib/profile/InstrProfilingBuffer.c
>>>>>> +++ lib/profile/InstrProfilingBuffer.c
>>>>>> @@ -68,11 +68,14 @@
>>>>>>     * Match logic in __llvm_profile_write_file().
>>>>>>     */
>>>>>>
>>>>>> +  __llvm_profile_value_data *ValueDataBegin = NULL;
>>>>>> +
>>>>>>    /* Calculate size of sections. */
>>>>>>    const uint64_t DataSize = DataEnd - DataBegin;
>>>>>>    const uint64_t CountersSize = CountersEnd - CountersBegin;
>>>>>>    const uint64_t NamesSize = NamesEnd - NamesBegin;
>>>>>>    const uint64_t Padding = sizeof(uint64_t) - NamesSize %
>>>>>> sizeof(uint64_t);
>>>>>> +  const uint64_t ValueDataSize = 0;
>>>>>
>>>>> I guess this is supposed to do:
>>>>>
>>>>>   __llvm_profile_gather_value_data(&ValueDataBegin);
>>>>
>>>> This is correct. However, on the buffer access API's the size of the
>>>> buffer is calculated so that the caller of the write API allocates the
>>>> necessary buffer. A pointer to the buffer is passed back to the write
>>>> function for the data to be dumped to this memory. In my
>>>> implementation,
>>> I
>>>> kept the buffer API's unmodified - other than changing the format to
>>> match
>>>> that of version 2's. I'd like to know if a re-alloc on the passed in
>>> (char
>>>> *Buffer) be in order as a change to accommodate value profiling's
>>> dynamic
>>>> memory needs.
>>>
>>> But isn't this completely broken if someone uses value profiling and
>>> the
>>> buffer APIs? What happens in that case?
>>
>> I'd not say it's "broken". The profile header and the version number is
>> updated, so the profile reading/merges should happen cleanly. The only
>> problem is that currently there would not be any value profiling support
>> when the buffer API's are used. To come up with a solution, I've been
>> looking forward to external input from the community on the buffer API
>> usage scenarios w/ these CL's and if realloc is in order with how buffer
>> API usage is intended.
>
> Okay, I guess a failure behaviour of "you just don't get the value data"
> isn't too terrible as a temporary measure. We should at least have an
> idea of where we're going with this though.
>
> I think we want to avoid using realloc here - this API is intended for
> constrained environments that want total control over
> memory/filesystem/etc.
>
>>> Can we teach __llvm_profile_get_size_for_buffer to calculate the
>>> correct
>>> size ahead of time so that a realloc isn't necessary?
>>
>> Perhaps __llvm_profile_get_size_for_buffer can take as input how many
>> values to output per value site. This can be used to pre-calculate and
>> allocate the size of the overall profiling data in advance. Then at each
>> value site, only the top N values would be outputted per value site.
>>
>> The collected value profiles may fail to fill the pre-allocated buffer
>> completely, so the buffer write API should also return the total size of
>> the buffer used.
>
> Of course, we'll need an eviction strategy if we get too many values in
> this case, which we've managed to avoid dealing with so far.
>