[PATCH] D20341: [CUDA] Enable fusing FP ops for CUDA by default.

Artem Belevich via cfe-commits cfe-commits at lists.llvm.org
Tue May 17 16:35:02 PDT 2016


tra added a comment.

OK. Consensus seems to be that -ffp-contract=fast is the way to go. I'll update the patch.
I've just checked Steve's example with nvcc and indeed it fused mul+add.


http://reviews.llvm.org/D20341





More information about the cfe-commits mailing list