[PATCH] D20341: [CUDA] Enable fusing FP ops for CUDA by default.

Tue May 17 15:54:26 PDT 2016

jlebar added a comment.

I am not sure we want this.  Although it matches nvcc, it does not match our floating-point behavior for C++ in general: fusing a multiply and an add into an FMA rounds once instead of twice, so it makes us non-IEEE-754-compliant by default.
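
To make the concern concrete, here is a minimal host-side sketch of how a fused multiply-add can differ from a separate multiply and add.  The helper names are hypothetical and not part of the patch, and the inputs are picked by hand so that the exact product is 1 - 2^-60, which rounds to 1.0 in double precision.

```cpp
#include <cmath>

// Hypothetical helpers for illustration only; nothing here is from D20341.

// a*b + c with the product rounded to double before the add.
// The volatile store forces that intermediate rounding, so the compiler
// cannot contract the two operations into one FMA.
double mul_then_add(double a, double b, double c) {
    volatile double product = a * b;
    return product + c;
}

// a*b + c computed as if to infinite precision, then rounded once.
double fused_mul_add(double a, double b, double c) {
    return std::fma(a, b, c);
}

// Difference between the two results for a = 1 + 2^-30, b = 1 - 2^-30,
// c = -1.  The exact product a*b is 1 - 2^-60.
double contraction_gap() {
    const double eps = std::ldexp(1.0, -30);
    const double a = 1.0 + eps;
    const double b = 1.0 - eps;
    const double c = -1.0;
    return mul_then_add(a, b, c) - fused_mul_add(a, b, c);
}
```

With these inputs the unfused version loses the 2^-60 term when the product is rounded and returns 0, while the fused version keeps it and returns -2^-60.  The fused result is the more accurate one here, but it is not the result a strict one-operation-at-a-time evaluation produces, which is the default-behavior mismatch I am worried about.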

That said, I agree that if we don't do this, lots of people will not pass -ffp-contract=fast and, as a result, will think that we're slower than nvcc.  There's no way to win.  :(

http://reviews.llvm.org/D20341