[llvm] [NVPTX] Add patterns for fma.relu.{f16|bf16} (PR #114977)
Alex MacLean via llvm-commits
llvm-commits at lists.llvm.org
Tue Nov 5 09:28:25 PST 2024
AlexMaclean wrote:
Suppose the `fma` has more uses in addition to the `fmaxnum`, If this optimization kicks in it may increase the register pressure and won't be a clear win in terms of performance. I'm not sure this will be a problem, but to be conservative it may be better to implement this as a DAG combine and verify the `fma` has a single use.
https://github.com/llvm/llvm-project/pull/114977
More information about the llvm-commits
mailing list