[llvm] [BOLT][AArch64] Introduce SPE mode in BasicAggregation (PR #120741)
Paschalis Mpeis via llvm-commits
llvm-commits at lists.llvm.org
Fri Jan 17 01:53:54 PST 2025
================
@@ -180,22 +187,29 @@ void DataAggregator::start() {
findPerfExecutable();
- if (opts::BasicAggregation) {
- launchPerfProcess("events without LBR",
- MainEventsPPI,
+ if (opts::ArmSPE) {
+ if (!opts::BasicAggregation) {
+ errs() << "PERF2BOLT-ERROR: Arm SPE mode is combined only with "
+ "BasicAggregation.\n";
+ exit(1);
+ }
+ launchPerfProcess("branch events with SPE", MainEventsPPI,
+ "script -F pid,event,ip,addr --itrace=i1i",
----------------
paschalis-mpeis wrote:
Hey Maks,
We expect:
- branch entries to have non-zero values for both `addr` and `ip`
- non-branch entries to have non-zero value only for `ip`
This [unit test](https://github.com/llvm/llvm-project/pull/120741/files#diff-b16a95c2248de4f899b4bea1591c813c69caed2f5e3702ea53e0a80d75b1a79cR133) illustrates such an output.
> On my system ip is always 0 for the data collected ...
The zeros you are seeing actually refer to `addr` and not `ip`. It is a bit confusing but the printed order is different than the fields order specified in CLI. Can verify using:
```bash
perf script -F pid,event,ip ..
perf script -F pid,event,addr .. # seeing zeros on unpatched perf
```
> The zeros on my side could be b/c we don't have kernel patches listed in [#115333](https://github.com/llvm/llvm-project/issues/115333).
Correct. We believe these patches might make it to the next kernel RC.
In the meantime, I've added some [sample instructions](https://github.com/llvm/llvm-project/issues/115333#issuecomment-2597809642) on the issue to quickly use it locally, w/o requiring to re-flush a kernel.
https://github.com/llvm/llvm-project/pull/120741
More information about the llvm-commits
mailing list