[Openmp-commits] [PATCH] D83591: [OpenMP][CUDA] Fix std::complex in GPU regions

Johannes Doerfert via Phabricator via Openmp-commits openmp-commits at lists.llvm.org
Fri Jul 10 15:05:51 PDT 2020


jdoerfert created this revision.
jdoerfert added reviewers: tra, hfinkel, JonChesterfield.
Herald added subscribers: sstefan1, guansong, bollu, yaxunl.
Herald added a project: clang.

The old approach worked to some degree in C++ mode, but in C mode we
actually tried to introduce variants of macros (e.g., isinf). To make both
modes work reliably, we get rid of those extra variants and use NVIDIA
intrinsics directly in the complex implementation. This will have to be
revisited as we add other GPU targets that want to reuse the code, but it
should be fine for now.
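
To illustrate the idea, here is a minimal sketch of a complex multiply
whose NaN/infinity checks go through a small set of private macros instead
of the standard isinf/isnan macros. It is not the code from the patch. The
names SketchComplex, sketch_mul, and the SKETCH_* macros are hypothetical,
and the device branch assumes that the libdevice functions __nv_isnand,
__nv_isinfd, and __nv_copysign have already been declared by the wrapper
headers. The host fallback exists only so the sketch can be compiled and
tried outside of a GPU region.

```cpp
#include <cmath>

// Hypothetical dispatch macros (illustration only, not the patch's names).
// Device path: call the NVIDIA libdevice functions directly; this assumes
// they were declared earlier by the CUDA/OpenMP wrapper headers.
// Host path: plain <cmath> functions so the sketch runs anywhere.
#if defined(__NVPTX__)
#define SKETCH_ISNAN(x) __nv_isnand(x)
#define SKETCH_ISINF(x) __nv_isinfd(x)
#define SKETCH_COPYSIGN(x, y) __nv_copysign(x, y)
#else
#define SKETCH_ISNAN(x) std::isnan(x)
#define SKETCH_ISINF(x) std::isinf(x)
#define SKETCH_COPYSIGN(x, y) std::copysign(x, y)
#endif

// Hypothetical stand-in for a double _Complex value.
struct SketchComplex {
  double re;
  double im;
};

// Complex multiply with the infinity recovery step described in
// C99 Annex G. Only the SKETCH_* macros are used for classification.
inline SketchComplex sketch_mul(SketchComplex x, SketchComplex y) {
  double a = x.re, b = x.im, c = y.re, d = y.im;
  double ac = a * c, bd = b * d, ad = a * d, bc = b * c;
  SketchComplex z{ac - bd, ad + bc};

  if (SKETCH_ISNAN(z.re) && SKETCH_ISNAN(z.im)) {
    bool recalc = false;
    if (SKETCH_ISINF(a) || SKETCH_ISINF(b)) {
      // x is infinite: reduce it to (+-1 or +-0) and drop NaNs in y.
      a = SKETCH_COPYSIGN(SKETCH_ISINF(a) ? 1.0 : 0.0, a);
      b = SKETCH_COPYSIGN(SKETCH_ISINF(b) ? 1.0 : 0.0, b);
      if (SKETCH_ISNAN(c)) c = SKETCH_COPYSIGN(0.0, c);
      if (SKETCH_ISNAN(d)) d = SKETCH_COPYSIGN(0.0, d);
      recalc = true;
    }
    if (SKETCH_ISINF(c) || SKETCH_ISINF(d)) {
      // y is infinite: same reduction with the roles swapped.
      c = SKETCH_COPYSIGN(SKETCH_ISINF(c) ? 1.0 : 0.0, c);
      d = SKETCH_COPYSIGN(SKETCH_ISINF(d) ? 1.0 : 0.0, d);
      if (SKETCH_ISNAN(a)) a = SKETCH_COPYSIGN(0.0, a);
      if (SKETCH_ISNAN(b)) b = SKETCH_COPYSIGN(0.0, b);
      recalc = true;
    }
    if (!recalc && (SKETCH_ISINF(ac) || SKETCH_ISINF(bd) ||
                    SKETCH_ISINF(ad) || SKETCH_ISINF(bc))) {
      // Overflow in an intermediate product: drop NaNs and recompute.
      if (SKETCH_ISNAN(a)) a = SKETCH_COPYSIGN(0.0, a);
      if (SKETCH_ISNAN(b)) b = SKETCH_COPYSIGN(0.0, b);
      if (SKETCH_ISNAN(c)) c = SKETCH_COPYSIGN(0.0, c);
      if (SKETCH_ISNAN(d)) d = SKETCH_COPYSIGN(0.0, d);
      recalc = true;
    }
    if (recalc) {
      z.re = INFINITY * (a * c - b * d);
      z.im = INFINITY * (a * d + b * c);
    }
  }
  return z;
}
```

Because the classification calls are confined to the three macros, a
different GPU target would, under these assumptions, only need another
branch in the #if block rather than changes to the multiply itself.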


Repository:
  rG LLVM Github Monorepo

https://reviews.llvm.org/D83591

Files:
  clang/lib/Headers/__clang_cuda_complex_builtins.h
  clang/lib/Headers/__clang_cuda_math.h
  clang/test/Headers/nvptx_device_math_complex.c
  clang/test/Headers/nvptx_device_math_complex.cpp

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D83591.277152.patch
Type: text/x-patch
Size: 6424 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/openmp-commits/attachments/20200710/ff0c247a/attachment.bin>

