[PATCH] D20494: [CUDA] Add section to docs about controlling fp optimizations.

Reid Kleckner via llvm-commits llvm-commits at lists.llvm.org
Wed May 25 16:04:32 PDT 2016


rnk accepted this revision.
rnk added a comment.
This revision is now accepted and ready to land.

lgtm


================
Comment at: docs/CompileCudaWithLLVM.rst:166-168
@@ +165,5 @@
+    and add instructions.
+  * ``on``: fuse multiplies and adds within a single statement, but never
+    across statements (C11 semantics).  Prevent ptxas from fusing other
+    multiplies and adds.
+  * ``fast``: fuse multiplies and adds wherever profitable, even across
----------------
Do we actually know how to honor the C11 semantics?
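
For reference, a minimal sketch of the distinction the quoted text draws,
written as host-side C++ rather than CUDA so it runs anywhere. The helper names
(unfused_mul_add, fused_mul_add, contraction_demo) are hypothetical and not part
of the patch; std::fma stands in for the fusion a compiler or ptxas might
perform. The inputs are picked so that rounding the product first changes the
answer, which is why it matters where fusion is allowed.

```cpp
#include <cmath>

// Hypothetical helper: multiply and add as two separately rounded operations.
// The volatile temporary forces the product to be rounded to float and keeps
// the compiler from contracting the two operations into one FMA.
float unfused_mul_add(float a, float b, float c) {
    volatile float product = a * b;
    return product + c;
}

// Hypothetical helper: a single fused multiply-add, rounded once at the end.
float fused_mul_add(float a, float b, float c) {
    return std::fma(a, b, c);
}

struct Demo {
    float unfused;
    float fused;
};

// a = 1 + 2^-12, so a*a = 1 + 2^-11 + 2^-24 exactly. The 2^-24 term is half
// an ulp at this magnitude, so it is dropped when the product is rounded to
// float. With c = -(1 + 2^-11), the unfused result is 0 and the fused result
// is 2^-24.
Demo contraction_demo() {
    const float a = 1.0f + std::ldexp(1.0f, -12);
    const float c = -(1.0f + std::ldexp(1.0f, -11));
    return {unfused_mul_add(a, a, c), fused_mul_add(a, a, c)};
}
```

In source terms, `d = a * b + c;` is a single statement and so a fusion
candidate under ``on``, while `t = a * b; d = t + c;` spans two statements and,
per the quoted text, would only be fused under ``fast``.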


http://reviews.llvm.org/D20494




