[PATCH] Improved udivmodsi4 with support for ARMv4
    Renato Golin 
    renato.golin at linaro.org
       
    Fri Jan 24 04:24:37 PST 2014
    
    
  
  Hi Joerg,
  I've tested all the new routines on an A9, with both the v5 and the v6 (CLZ) encodings, and here are the numbers:
  * divmod:
   * v5: on par
   * v6: 8% faster
  * div:
   * v5: 10% slower
   * v6: 5% faster
  * mod:
   * v5: 15% faster
   * v6: 4% faster
  All results are statistically significant (the differences are larger than the standard deviation). I'm not sure what causes the regression in div-v5 but, as I told you, the A9 is very aggressive, and it could be anything. On average, though, and where it matters (v6+), the new code is consistently faster, so I'm happy with the results.
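  For reference, the "larger than the standard deviation" criterion can be sketched in C roughly as below. This is only an illustration of the check, not the harness used for the numbers above: the function names (mean, variance, differs_beyond_stddev) are hypothetical, and it assumes both runs have the same number of timing samples (at least two). It compares squared quantities so that no square root is needed.

```c
#include <stddef.h>

/* Arithmetic mean of n timing samples. */
static double mean(const double *x, size_t n) {
    double sum = 0.0;
    for (size_t i = 0; i < n; ++i)
        sum += x[i];
    return sum / (double)n;
}

/* Sample variance (n - 1 denominator); the caller must pass n >= 2. */
static double variance(const double *x, size_t n) {
    double m = mean(x, n);
    double acc = 0.0;
    for (size_t i = 0; i < n; ++i)
        acc += (x[i] - m) * (x[i] - m);
    return acc / (double)(n - 1);
}

/*
 * Returns 1 when the gap between the two means is larger than the larger
 * of the two standard deviations, 0 otherwise.
 * |gap| > stddev is equivalent to gap^2 > variance, so the comparison is
 * done on squared values.
 */
static int differs_beyond_stddev(const double *base, const double *patched,
                                 size_t n) {
    double gap = mean(base, n) - mean(patched, n);
    double vb = variance(base, n);
    double vp = variance(patched, n);
    double vmax = vb > vp ? vb : vp;
    return gap * gap > vmax;
}
```

  Feeding it the per-run timings of the old and the new routine tells you whether a reported speed-up or slow-down clears that bar; a stricter test (a t-test, for instance) would be the next step if the margins were tighter.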
  Once the typos are fixed, the patch looks good to go.
  cheers,
  --renato
http://llvm-reviews.chandlerc.com/D2595
    
    