davide added a comment. Also, maybe we can do this separately, but after discussing with Sanjay I concluded that maybe it's a good idea to emit intrinsics instead of libcalls here as `optimizeSqrt()` does? David, what do you think? http://reviews.llvm.org/D22104