[PATCH] D20341: [CUDA] Enable fusing FP ops for CUDA by default.
Justin Lebar via cfe-commits
cfe-commits at lists.llvm.org
Tue May 17 15:54:26 PDT 2016
jlebar added a comment.
I am not sure we want this? Although it matches nvcc, it does not match our floating-point behavior for C++ in general -- it makes us non-IEEE-whatever compliant by default.
Although I agree that if we don't do this, lots of people are not going to pass -fp-contract=fast and resultantly will think that we're slower than nvcc. There's no way to win. :(
http://reviews.llvm.org/D20341
More information about the cfe-commits
mailing list