[PATCH] D20341: [CUDA] Enable fusing FP ops for CUDA by default.

Steve Canon via cfe-commits cfe-commits at lists.llvm.org
Tue May 17 16:29:05 PDT 2016


scanon added a comment.

`-ffp-contract=on` obeys the semantics of C's FP_CONTRACT pragma.  In particular, it will not fuse:

  float m = x*y;
  float a = m + z;

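The multiply and the add are in separate statements, and FP_CONTRACT only permits contraction within a single expression.  You probably want that to fuse for your purposes, so `-ffp-contract=fast` seems more in line with your needs.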
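
To make the numerical difference concrete, here is a minimal sketch in C. The function names `mul_then_add` and `single_rounding` are hypothetical, and neither comes from the patch. The `volatile` store is only a device to pin the two-statement, unfused behaviour regardless of compiler flags, and the double-precision computation is a stand-in for a hardware FMA rather than a call to one.

```c
/* Two-statement form: the product is rounded to float before the add.
 * The volatile store is an illustrative device (not from the patch) that
 * prevents the compiler from contracting the two operations. */
float mul_then_add(float x, float y, float z) {
    volatile float m = x * y;   /* first rounding happens here */
    float a = m + z;            /* second rounding happens here */
    return a;
}

/* Stand-in for a fused multiply-add. The product of two floats is exact
 * in double, so for the inputs discussed here only the final conversion
 * rounds. In rare cases the double-precision add can round as well, so
 * this is not a general replacement for a real FMA. */
float single_rounding(float x, float y, float z) {
    return (float)((double)x * (double)y + (double)z);
}
```

With `x = y = 1.0f + 0x1p-12f` and `z = -(1.0f + 0x1p-11f)`, the exact product is 1 + 2^-11 + 2^-24, which rounds to 1 + 2^-11 in float. `mul_then_add` therefore returns 0, while `single_rounding` keeps the low bit and returns 2^-24.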


http://reviews.llvm.org/D20341




