[PATCH] D47849: [OpenMP][Clang][NVPTX] Enable math functions called in an OpenMP NVPTX target device region to be resolved as device-native function calls

Gheorghe-Teodor Bercea via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Fri Aug 10 06:08:37 PDT 2018


gtbercea added a comment.



> The downside of this approach is that LLVM doesn't recognize these function calls and doesn't perform optimizations to fold libcalls. For example `pow(a, 2)` is transformed into a multiplication but `__nv_pow(a, 2)` is not.

Doesn't CUDA have the same problem?


Repository:
  rC Clang

https://reviews.llvm.org/D47849





More information about the cfe-commits mailing list