[clang] 9f10cc2 - [OPENMP] [DOCS] fix section formatting issues [NFC]
Kelvin Li via cfe-commits
cfe-commits at lists.llvm.org
Wed Nov 6 19:09:17 PST 2019
Author: Kelvin Li
Date: 2019-11-06T22:03:09-05:00
New Revision: 9f10cc2d124c275f2e7a6764e168f6b75527e78f
URL: https://github.com/llvm/llvm-project/commit/9f10cc2d124c275f2e7a6764e168f6b75527e78f
DIFF: https://github.com/llvm/llvm-project/commit/9f10cc2d124c275f2e7a6764e168f6b75527e78f.diff
LOG: [OPENMP] [DOCS] fix section formatting issues [NFC]
Differential Revision: https://reviews.llvm.org/D69909
Added:
Modified:
clang/docs/OpenMPSupport.rst
Removed:
################################################################################
diff --git a/clang/docs/OpenMPSupport.rst b/clang/docs/OpenMPSupport.rst
index b47a2b65d4f4..0ef4acf7631a 100644
--- a/clang/docs/OpenMPSupport.rst
+++ b/clang/docs/OpenMPSupport.rst
@@ -13,9 +13,8 @@
.. contents::
:local:
-==================
OpenMP Support
-==================
+==============
Clang fully supports OpenMP 4.5. Clang supports offloading to X86_64, AArch64,
PPC64[LE] and has `basic support for Cuda devices`_.
@@ -30,7 +29,7 @@ Interface (OMPT) on x86, x86_64, AArch64, and PPC64 on Linux, Windows, and macOS
For the list of supported features from OpenMP 5.0 see `OpenMP implementation details`_.
General improvements
---------------------
+====================
- New collapse clause scheme to avoid expensive remainder operations.
Compute loop index variables after collapsing a loop nest via the
collapse clause by replacing the expensive remainder operation with
@@ -47,6 +46,13 @@ General improvements
- Simplified SPMD code generation for `distribute parallel for` when
the new default schedules are applicable.
+- When using the collapse clause on a loop nest the default behavior
+ is to automatically extend the representation of the loop counter to
+ 64 bits for the cases where the sizes of the collapsed loops are not
+ known at compile time. To prevent this conservative choice and use
+ at most 32 bits, compile your program with the
+ `-fopenmp-optimistic-collapse`.
+
.. _basic support for Cuda devices:
Cuda devices support
@@ -77,15 +83,6 @@ are stored in the global memory. In `Cuda` mode local variables are not shared
between the threads and it is user responsibility to share the required data
between the threads in the parallel regions.
-Collapsed loop nest counter
----------------------------
-
-When using the collapse clause on a loop nest the default behavior is to
-automatically extend the representation of the loop counter to 64 bits for
-the cases where the sizes of the collapsed loops are not known at compile
-time. To prevent this conservative choice and use at most 32 bits,
-compile your program with the `-fopenmp-optimistic-collapse`.
-
Features not supported or with limited support for Cuda devices
---------------------------------------------------------------
@@ -112,7 +109,7 @@ Features not supported or with limited support for Cuda devices
.. _OpenMP implementation details:
OpenMP 5.0 Implementation Details
----------------------------------
+=================================
The following table provides a quick overview over various OpenMP 5.0 features
and their implementation status. Please contact *openmp-dev* at
More information about the cfe-commits
mailing list