[PATCH] Improved udivmodsi4 with support for ARMv4
Renato Golin
renato.golin at linaro.org
Fri Jan 24 04:24:37 PST 2014
Hi Joerg,
I've tested all new routines on an A9, both with the v5 and v6 encoding (CLZ) and here are the numbers:
* divmod:
* v5: on par
* v6: 8% faster
* div:
* v5: 10% slower
* v6: 5% faster
* mod:
* v5: 15% faster
* v6: 4% faster
All results are statistically relevant (differences larger than the standard deviation). I'm not sure what the regression in div-v5 is, but as I told you, the A9 is very aggressive, and it could be anything. On average, though, and where it matters (v6+), it's consistently faster, so I'm happy with the results.
Fixing the typos, the patch looks good to go.
cheers,
--renato
http://llvm-reviews.chandlerc.com/D2595
More information about the llvm-commits
mailing list