tra added a comment.

OK. The consensus seems to be that -ffp-contract=fast is the way to go, so I'll update the patch. I've just checked Steve's example with nvcc, and it did indeed fuse the mul+add.

http://reviews.llvm.org/D20341
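
For readers who want to see what fusing mul+add changes numerically, here is a minimal host-side C++ sketch. It is not Steve's example from the review, and the function names are hypothetical. It only contrasts a multiply-then-add with two roundings against std::fma, which rounds once. The volatile temporary is an assumption made here to keep the compiler from contracting the first version, whatever -ffp-contract setting is in effect.

```cpp
#include <cmath>

// Hypothetical helper: multiply, round, then add (two roundings).
// The volatile store forces the product to be rounded to double and
// prevents the compiler from contracting the expression into an FMA.
double unfused_mul_add(double a, double b, double c) {
    volatile double product = a * b;
    return product + c;
}

// Hypothetical helper: fused multiply-add (a single rounding at the end),
// which is what a contracted a * b + c computes.
double fused_mul_add(double a, double b, double c) {
    return std::fma(a, b, c);
}
```

With a = 1 + 2^-30, b = 1 - 2^-30 and c = -1, the exact product is 1 - 2^-60. The unfused version rounds that product to 1.0 and returns 0, while the fused version keeps the low bits and returns -2^-60. With -ffp-contract=fast the compiler is allowed to make that substitution itself for a plain a * b + c.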