[clang] [llvm] [Clang][NVVM] Support `-f[no-]cuda-prec-sqrt` and propagate precision flag to `NVVMReflect` (PR #134244)

via llvm-commits llvm-commits at lists.llvm.org
Thu Apr 3 19:14:09 PDT 2025


Lai-YT wrote:

> Why is this a module flag, instead of a function attribute?

The other value handled by the _NVVMReflect_ pass is `"__CUDA_FTZ"`, which Clang currently sets via a module flag. I followed the same approach here for consistency.

Also, since the decision to use precise square root functions seems to apply at the module level, using a module flag feels reasonable.
@efriedma-quic, do you see any pros or cons to using function attributes instead in this case? :raising_hand: 
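
To make the module-flag approach above concrete, here is a minimal toy model in plain C++. It is not LLVM's API or the PR's implementation. `ToyModule` and `reflect` are hypothetical names. `"nvvm-reflect-ftz"` is the existing module flag read for `"__CUDA_FTZ"`, while `"__CUDA_PREC_SQRT"` and `"nvvm-reflect-prec-sqrt"` are assumed names used only for illustration.

```cpp
#include <map>
#include <string>

// Toy stand-in for an IR module: a bag of named integer module flags.
// (Hypothetical type, for illustration only.)
struct ToyModule {
    std::map<std::string, int> flags;
};

// Resolve a reflect query against the module's flags.
// Unknown queries and unset flags resolve to 0 in this sketch.
int reflect(const ToyModule &m, const std::string &query) {
    // Query string -> module flag name. The second entry is an assumed
    // name, not confirmed by the discussion above.
    static const std::map<std::string, std::string> queryToFlag = {
        {"__CUDA_FTZ", "nvvm-reflect-ftz"},
        {"__CUDA_PREC_SQRT", "nvvm-reflect-prec-sqrt"},
    };

    auto q = queryToFlag.find(query);
    if (q == queryToFlag.end())
        return 0;

    auto f = m.flags.find(q->second);
    return f == m.flags.end() ? 0 : f->second;
}
```

Because the lookup is keyed on the module rather than on the calling function, every function in the module sees the same value. A function attribute would instead allow the value to differ per function, which is the trade-off the question above is asking about.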

https://github.com/llvm/llvm-project/pull/134244

