[PATCH] Improved udivmodsi4 with support for ARMv4

Renato Golin renato.golin at linaro.org
Fri Jan 24 04:24:37 PST 2014


  Hi Joerg,

  I've tested all new routines on an A9, both with the v5 and v6 encoding (CLZ) and here are the numbers:

  * divmod:
   * v5: on par
   * v6: 8% faster
  * div:
   * v5: 10% slower
   * v6: 5% faster
  * mod:
   * v5: 15% faster
   * v6: 4% faster

  All results are statistically relevant (differences larger than the standard deviation). I'm not sure what the regression in div-v5 is, but as I told you, the A9 is very aggressive, and it could be anything. On average, though, and where it matters (v6+), it's consistently faster, so I'm happy with the results.

  Fixing the typos, the patch looks good to go.

  cheers,
  --renato

http://llvm-reviews.chandlerc.com/D2595



More information about the llvm-commits mailing list