[clang] [llvm] [Clang][NVVM] Support `-f[no-]cuda-prec-sqrt` and propagate precision flag to `NVVMReflect` (PR #134244)
via llvm-commits
llvm-commits at lists.llvm.org
Fri Apr 4 19:46:40 PDT 2025
Lai-YT wrote:
> At first glance, it seems like a good idea to allow people to choose how their square roots are lowered on a per-function level: some code cares about precise square roots, some doesn't, and you should be able to make choices on a case-by-case basis.
>
> But looking at the code structure, maybe cuda doesn't allow that?
Yeah, AFAIK, CUDA—or more precisely, NVCC—only allows this to be specified at the module level. 😣
https://github.com/llvm/llvm-project/pull/134244
More information about the llvm-commits
mailing list