[clang] Bfi precision (PR #66285)

Matthias Braun via cfe-commits cfe-commits at lists.llvm.org
Fri Oct 27 10:54:08 PDT 2023


MatzeB wrote:

Some very ad-hoc benchmarking. Of clang compilation speed (measured in instructions as reported by valgrind/callgrind which I think somewhat matches the setup of nikic) compiling `sqlite3` of CTMark:

Old-BFI (this PR reverted), New-BFI (this PR applied), no-cold (cold-callsite-rel-freq set to 0 to disable `InlineCostCallAnalyzer::isColdCallSite` behavior for non-PGO builds:

```
Old-BFI:          size:  59,802,772  (baseline)  insn: 2,812,833,144  (baseline)
New-BFI:          size:  60,247,076  +0.74%      insn: 2,818,639,641  +0.21%
Old-BFI, no-cold: size:  60,773,988  +1.62%      insn: 2,806,521,932  -0.22%
New-BFI, no-cold: size:  60,741,700  +1.57%      insn: 2,803,489,298  -0.33%
```

I could benchmark more with llvm-test-suite and put up a PR to disable cold-callsite-rel-freq by default if people support this direction...

https://github.com/llvm/llvm-project/pull/66285


More information about the cfe-commits mailing list