[llvm] [AArch64] Neoverse V2 FeatureDisableLatencySchedHeuristic (PR #140897)

Igor Kirillov via llvm-commits llvm-commits at lists.llvm.org
Fri Jul 11 03:39:44 PDT 2025


igogo-x86 wrote:

LLVM IR is exactly the same before the SLP patch (#136329) and after my fix (#140897). However, the performance was still worse. So I looked at the assembly and saw that the instructions were identical, but the order was different. I then bisected the issue down to your change.

Here are the `perf` sample data:

| Function                       | Old Good Times | After SLP Patch | My Fix with Your Patch | My Fix without Your Patch |
|-------------------------------|----------------|------------------|----------------------|----------------------|
| `sym.x264_pixel_satd_16x16`   | 11488          | 12236            | 12044                | 11582                |
| `sym.x264_pixel_satd_8x8`     | 7192           | 7884             | 6949                 | 6644                 |

P.S. Something improved `sym.x264_pixel_satd_8x8` in the meantime.

https://github.com/llvm/llvm-project/pull/140897


More information about the llvm-commits mailing list