[PATCH] D20494: [CUDA] Add section to docs about controlling fp optimizations.
Reid Kleckner via llvm-commits
llvm-commits at lists.llvm.org
Wed May 25 16:04:32 PDT 2016
rnk accepted this revision.
rnk added a comment.
This revision is now accepted and ready to land.
Comment at: docs/CompileCudaWithLLVM.rst:166-168
@@ +165,5 @@
+ and add instructions.
+ * ``on``: fuse multiplies and adds within a single statement, but never
+ across statements (C11 semantics). Prevent ptxas from fusing other
+ multiplies and adds.
+ * ``fast``: fuse multiplies and adds wherever profitable, even across
Do we actually know how to honor the C11 semantics?
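For context on why the distinction matters, here is a minimal host-side C++ sketch, not taken from the patch, showing that a fused multiply-add and a separate multiply followed by an add can give different results. The function names and inputs are hypothetical. The `volatile` store is only an illustrative way to force the product to be rounded before the add, standing in for a multiply and add written as two statements such as `t = a * b; d = t + c;`.

```cpp
#include <cassert>
#include <cmath>

// Unfused: the volatile store forces a*b to be rounded to double before the
// add, so the two operations cannot be contracted into a single FMA.
double mul_then_add(double a, double b, double c) {
    volatile double product = a * b;  // first rounding happens here
    return product + c;               // second rounding happens here
}

// Fused: std::fma computes a*b + c with a single rounding at the end.
double fused_mul_add(double a, double b, double c) {
    return std::fma(a, b, c);
}

// Hypothetical inputs: the exact product (1 + 2^-27)(1 - 2^-27) = 1 - 2^-54
// is not representable as a double, so rounding the product loses information.
struct Inputs { double a, b, c; };

Inputs example_inputs() {
    const double eps = std::ldexp(1.0, -27);  // 2^-27
    return {1.0 + eps, 1.0 - eps, -1.0};
}

void demo() {
    const Inputs in = example_inputs();
    // The product rounds to exactly 1.0, so the separate add yields 0.
    assert(mul_then_add(in.a, in.b, in.c) == 0.0);
    // The fused form keeps the low bits of the product: the result is -2^-54.
    assert(fused_mul_add(in.a, in.b, in.c) == std::ldexp(-1.0, -54));
}
```

Under the ``on`` wording quoted above, a compiler would be allowed to turn `d = a * b + c;` into the fused form but not the two-statement version. Whether the implementation can actually enforce that boundary is the question raised here.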