[llvm] [BOLT][AArch64] Introduce SPE mode in BasicAggregation (PR #120741)
Paschalis Mpeis via llvm-commits
llvm-commits at lists.llvm.org
Tue Jan 21 01:33:05 PST 2025
paschalis-mpeis wrote:
Great, thanks a lot Michael for filling in with details!
Indeed the differences are subtle. Just want to clarify that I've answered a slightly different question:
> What I believe you are asking here is to configure SPE to get us a pair of SRC -> TGT , where SRC and TGT are two taken branches. In other words, make SPE act as an LBR-like buffer with a branch stack depth of 1.
Whether we filter-out the non-taken branches at the HW collection level (with the `inv_event_filter` interface) or in post-processing SW, the information loss still holds:
> Please note that if we filter-out any non-taken branches at this point (ie `perf script`), then we'll exclude information we cannot later infer.
And regardless of filtering, we cannot end up with a pair of two taken branches, which could have left opportunities for inferring any FTs in between them.
https://github.com/llvm/llvm-project/pull/120741
More information about the llvm-commits
mailing list