[llvm] [AArch64] Add MATCH loops to LoopIdiomVectorizePass (PR #101976)
David Sherwood via llvm-commits
llvm-commits at lists.llvm.org
Wed Dec 11 06:24:07 PST 2024
david-arm wrote:
The memory checks for the inner loop only need to be done once I assume, since I'd expect the start value for the needle to be loop invariant? If not, you'll need to calculate the entire needle range for the whole outer loop. I'd expect a minor hit to performance, but given the fantastic improvements you're reporting I'd still expect it to be a nice win. :) For the mismatch case the trip counts were typically quite small and the memory checks reduced the performance from around 8% -> 7% or something like that.
https://github.com/llvm/llvm-project/pull/101976
More information about the llvm-commits
mailing list