[llvm] [NVPTX] Add patterns for fma.relu.{f16|bf16} (PR #114977)

Alex MacLean via llvm-commits llvm-commits at lists.llvm.org
Tue Nov 5 09:28:25 PST 2024


AlexMaclean wrote:

Suppose the `fma` has other uses in addition to the `fmaxnum`. If this optimization kicks in, the original `fma` still has to be computed for those other uses alongside the new `fma.relu`, so it may increase register pressure and won't be a clear win in terms of performance. I'm not sure this will be a problem, but to be conservative it may be better to implement this as a DAG combine and verify that the `fma` has a single use.
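To make the single-use condition concrete, here is a minimal, self-contained sketch of the check on a toy graph. It does not use LLVM's SelectionDAG classes: `Node`, `Opcode`, `addOperand`, and `canFoldToFmaRelu` are hypothetical names invented for illustration. In a real DAG combine the corresponding test would presumably be a `hasOneUse()` query on the `fma` operand.

```cpp
#include <cassert>
#include <vector>

// Toy stand-in for a DAG node. This is NOT LLVM's SDNode; all names here
// are hypothetical and exist only to illustrate the single-use check.
enum class Opcode { Constant, FMA, FMaxNum, Other };

struct Node {
  Opcode op;
  double value;                  // only meaningful for Opcode::Constant
  std::vector<Node *> operands;  // nodes whose results this node consumes
  int numUses = 0;               // number of nodes consuming this node's result

  explicit Node(Opcode o, double v = 0.0) : op(o), value(v) {}
};

// Record that `user` consumes the result of `def`.
inline void addOperand(Node &user, Node &def) {
  assert(&user != &def && "a node cannot use itself");
  user.operands.push_back(&def);
  ++def.numUses;
}

inline bool isZero(const Node *n) {
  return n->op == Opcode::Constant && n->value == 0.0;
}

// Returns true when `max` has the shape fmaxnum(fma(...), 0) (operands in
// either order) AND the fma has no user other than `max`. If the fma had
// additional users it would stay live after the fold, so the fused
// instruction would duplicate work rather than replace it.
inline bool canFoldToFmaRelu(const Node &max) {
  if (max.op != Opcode::FMaxNum || max.operands.size() != 2)
    return false;
  for (int i = 0; i < 2; ++i) {
    const Node *fma = max.operands[i];
    const Node *other = max.operands[1 - i];
    if (fma->op == Opcode::FMA && isZero(other))
      return fma->numUses == 1;
  }
  return false;
}
```

With this model, an `fma` feeding only the `fmaxnum` is foldable, while adding a second user of the same `fma` makes `canFoldToFmaRelu` return false, which is the conservative behavior suggested above.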

https://github.com/llvm/llvm-project/pull/114977

