petrhosek wrote: Using `__builtin_elementwise_fma` was suggested in #126025 and in my limited testing does seem to be working but I don't know if there are any downsides to using it over `__builtin_fma`. https://github.com/llvm/llvm-project/pull/126288