[PATCH] D140271: [NFCI][llvm-exegesis] Benchmark: parallelize codegen (5x ... 8x less wallclock)

Mon Jan 16 07:13:54 PST 2023

lebedev.ri added a comment.

In D140271#4056202 <https://reviews.llvm.org/D140271#4056202>, @courbet wrote:

>> Are you seriously saying that 10x wallclock improvement is negligible?
>
> 10x speed improvement is not negligible. I'm simply questioning whether speed matters in this case.
>
> If the speed improvement came at no cost, that would be a no-brainer. But there is a speed/readability tradeoff, which we need to evaluate.

All the changes here are to a **single** function,
that isn't really going to change further anyway.
It's not like this requires changes to many places.

> In my personal experience, I did not feel that benchmarking speed was ever an issue, but I do feel that this code is more complex to understand. Therefore to me that tradeoff is negative. If other people feel otherwise I'm happy to reconsider.

I would not bother with this if if i didn't find the existing speed to be problematic.
I would not add progress meter either. I would not fix the analysis speed either.
I'm doing these things because i found them to be sub-par, during my usage.

In D140271#4056248 <https://reviews.llvm.org/D140271#4056248>, @RKSimon wrote:

> Has anyone got a recent profile of llvm-exegesis upto --benchmark-phase=assemble-measured-code?

for all-opcode pass,

- `--benchmark-phase=prepare-snippet` is instantenious, takes less than a sedonc
- `--benchmark-phase=prepare-and-assemble-snippet` takes maybe 2..5 seconds
- `--benchmark-phase=assemble-measured-code` takes minutes.

Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D140271/new/

https://reviews.llvm.org/D140271