[PATCH] D19885: [AArch64] Decouple zero store promotion from narrow ld merge. NFC.

Jun Bum Lim via llvm-commits llvm-commits at lists.llvm.org
Tue May 3 13:14:02 PDT 2016


junbuml added a comment.

In our internal tests, we found performance regressions with the narrow load merge in some cases. Initially, this optimization was driven by the +3% performance gain in spec2006/h264ref that has a load intensive hot loop. However, the gain I was targeting in h264ref is now completely covered by  SLP vectorizer.

As this optimization converts two loads into one load with two shift instructions, it could potentially hurt performance if a loop is arithmetic operation intensive.

Through this change I want to let other people run performance test with/without the narrow load merge. If there is no objection I would like to disable the narrow load merge by default in separate patch.


http://reviews.llvm.org/D19885





More information about the llvm-commits mailing list