[Openmp-commits] [openmp] 5a64794 - [OpenMP][NVPTX] Added the missing -O1 when building NVPTX bitcode libraries
Shilei Tian via Openmp-commits
openmp-commits at lists.llvm.org
Thu Jan 28 05:17:55 PST 2021
Author: Shilei Tian
Date: 2021-01-28T08:13:38-05:00
New Revision: 5a64794bbad4010778406dfee7748e6080258dbf
URL: https://github.com/llvm/llvm-project/commit/5a64794bbad4010778406dfee7748e6080258dbf
DIFF: https://github.com/llvm/llvm-project/commit/5a64794bbad4010778406dfee7748e6080258dbf.diff
LOG: [OpenMP][NVPTX] Added the missing -O1 when building NVPTX bitcode libraries
In the past `-O1` was used when building NVPTX bitcode libraries. After
we switched to OpenMP, `-O1` was missing by mistake, leading to a huge performance
regression.
Reviewed By: JonChesterfield
Differential Revision: https://reviews.llvm.org/D95545
Added:
Modified:
openmp/libomptarget/deviceRTLs/nvptx/CMakeLists.txt
Removed:
################################################################################
diff --git a/openmp/libomptarget/deviceRTLs/nvptx/CMakeLists.txt b/openmp/libomptarget/deviceRTLs/nvptx/CMakeLists.txt
index 23efbba29d66..eeda137ef120 100644
--- a/openmp/libomptarget/deviceRTLs/nvptx/CMakeLists.txt
+++ b/openmp/libomptarget/deviceRTLs/nvptx/CMakeLists.txt
@@ -126,14 +126,14 @@ set(cuda_src_files
)
# Set flags for LLVM Bitcode compilation.
-set(bc_flags -S -x c++
- -target nvptx64
- -Xclang -emit-llvm-bc
- -Xclang -aux-triple -Xclang ${aux_triple}
- -fopenmp -fopenmp-cuda-mode -Xclang -fopenmp-is-device
- -D__CUDACC__
- -I${devicertl_base_directory}
- -I${devicertl_nvptx_directory}/src)
+set(bc_flags -S -x c++ -O1 -std=c++14
+ -target nvptx64
+ -Xclang -emit-llvm-bc
+ -Xclang -aux-triple -Xclang ${aux_triple}
+ -fopenmp -fopenmp-cuda-mode -Xclang -fopenmp-is-device
+ -D__CUDACC__
+ -I${devicertl_base_directory}
+ -I${devicertl_nvptx_directory}/src)
if(${LIBOMPTARGET_NVPTX_DEBUG})
list(APPEND bc_flags -DOMPTARGET_NVPTX_DEBUG=-1)
More information about the Openmp-commits
mailing list