[llvm] [LV] Pure runtime check for minimum profitable trip count. (PR #115833)
David Sherwood via llvm-commits
llvm-commits at lists.llvm.org
Wed Nov 13 07:01:14 PST 2024
david-arm wrote:
Just for information I tried this patch out with the x264 benchmark on neoverse-v1 and it causes a ~2% performance regression. It looks like in the hot loop in `mc_chroma` we are not entering the tail-folded vector loop as often, and falling back on the scalar tail. I guess that means the min profitable trip count isn't quite right.
https://github.com/llvm/llvm-project/pull/115833
More information about the llvm-commits
mailing list