[PATCH] D60086: [SampleProfile] Check entry count instead of total count to decide if inlined callsite is hot.
Taewook Oh via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Apr 1 17:06:05 PDT 2019
twoh added a comment.
@wmi Thanks for the reply! I understand that the entry count is not as precise as the total count, but I still don't think the current implementation is the right way to address the issue. As I mentioned in the summary, it compares two different things (an instruction-level counter vs. a function-level counter), opens up the possibility of optimizing the wrong function (e.g. a long, cold function), and makes it hard to find the root cause of a performance issue.
If we can't have a precise entry count, the right way to address the issue would be to stop using the PSI-based heuristic and instead use a heuristic that actually considers the total count of the function.
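To make the mismatch concrete, here is a minimal sketch of the two checks. It is not LLVM's actual code: `FunctionProfile`, `isHotByTotalCount`, `isHotByEntryCount`, and the threshold value are hypothetical names and numbers chosen for illustration, and the threshold is assumed to be a function-level (entry-count based) cutoff.

```cpp
#include <cstdint>

// Hypothetical, simplified stand-in for an inlined callsite's sample profile.
struct FunctionProfile {
  std::uint64_t EntrySamples; // function-level: samples attributed to entry
  std::uint64_t TotalSamples; // instruction-level: samples summed over the body
};

// Compares an instruction-level sum against a function-level threshold.
// A long body can exceed the threshold even if the function is rarely entered.
constexpr bool isHotByTotalCount(FunctionProfile P,
                                 std::uint64_t HotThreshold) {
  return P.TotalSamples >= HotThreshold;
}

// Compares a function-level counter against the same function-level threshold.
constexpr bool isHotByEntryCount(FunctionProfile P,
                                 std::uint64_t HotThreshold) {
  return P.EntrySamples >= HotThreshold;
}
```

With an assumed threshold of 500, a long, cold function with 2 entry samples and 2000 total samples is classified as hot by the total-count check and cold by the entry-count check. A short function with 600 entry samples and 1800 total samples is hot under both.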
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D60086/new/
https://reviews.llvm.org/D60086