[clang] [llvm] [Clang][NVVM] Support `-f[no-]cuda-prec-sqrt` and propagate precision flag to `NVVMReflect` (PR #134244)
via llvm-commits
llvm-commits at lists.llvm.org
Tue Apr 8 19:52:48 PDT 2025
Lai-YT wrote:
Got it. I did come across `nvptx-prec-sqrtf32`, but since _NVVMReflect_ makes the decision earlier by selecting the intrinsic in _libdevice_, I figured I couldn't rely on it.
I completely agree that unifying the behavior would be ideal moving forward.
Thanks so much for the insights, @Artem-B and @AlexMaclean! 🤗
It sounds like there's not much more I can do with this PR as it stands.
https://github.com/llvm/llvm-project/pull/134244