[PATCH] D20341: [CUDA] Enable fusing FP ops for CUDA by default.

Justin Lebar via cfe-commits cfe-commits at lists.llvm.org
Tue May 17 15:54:26 PDT 2016


jlebar added a comment.

I am not sure we want this?  Although it matches nvcc, it does not match our floating-point behavior for C++ in general -- it makes us non-IEEE-whatever compliant by default.

Although I agree that if we don't do this, lots of people are not going to pass -fp-contract=fast and resultantly will think that we're slower than nvcc.  There's no way to win.  :(


http://reviews.llvm.org/D20341





More information about the cfe-commits mailing list