[PATCH] D47849: [OpenMP][Clang][NVPTX] Enable math functions called in an OpenMP NVPTX target device region to be resolved as device-native function calls
Gheorghe-Teodor Bercea via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Fri Aug 10 06:08:37 PDT 2018
gtbercea added a comment.
> The downside of this approach is that LLVM doesn't recognize these function calls and doesn't perform optimizations to fold libcalls. For example `pow(a, 2)` is transformed into a multiplication but `__nv_pow(a, 2)` is not.
Doesn't CUDA have the same problem?
Repository:
rC Clang
https://reviews.llvm.org/D47849
More information about the cfe-commits
mailing list