escha added a comment. Some numbers for this optimization (before and after, on our perf test suite): Before: 76.1k cycles After: 47.5k cycles (standard deviation is +/- a few hundred cycles) Repository: rL LLVM http://reviews.llvm.org/D13220