[llvm] [NVPTX] Select bfloat16 add/mul/sub as fma on SM80 (PR #121065)

Artem Belevich via llvm-commits llvm-commits at lists.llvm.org
Thu Jan 9 16:14:00 PST 2025


Artem-B wrote:

> Why would PTX >= 7.8 mean that sm_80 doesn't benefit from this patch?

Ugh. Indeed. You're absolutely right. sm_80 still does not have native bf16 mul/add, even in newer PTX versions. I was concentrating on sm_90 and somehow overlooked that sm_80 is still affected.

OK, the patch is still useful. Sorry about the noise.
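
For readers following along, here is a rough sketch of the arithmetic identities behind "select add/mul/sub as fma". It is not code from the patch: `float` stands in for bfloat16, the helper names are made up for illustration, and the exact constants the backend uses are an assumption (in particular, using -0.0 as the addend for mul so that the sign of a zero product is preserved).

```cpp
#include <cassert>
#include <cmath>

// Hypothetical helpers; float is a stand-in for bfloat16.
// a + b == fma(a, 1.0, b)
inline float add_via_fma(float a, float b) { return std::fma(a, 1.0f, b); }

// a - b == fma(b, -1.0, a)
inline float sub_via_fma(float a, float b) { return std::fma(b, -1.0f, a); }

// a * b == fma(a, b, -0.0); adding -0.0 (rather than +0.0) keeps a
// negative-zero product negative under round-to-nearest.
inline float mul_via_fma(float a, float b) { return std::fma(a, b, -0.0f); }

// Small self-check of the identities on exactly representable values.
inline void self_check() {
  assert(add_via_fma(1.5f, 2.25f) == 1.5f + 2.25f);
  assert(sub_via_fma(1.5f, 2.25f) == 1.5f - 2.25f);
  assert(mul_via_fma(1.5f, 2.25f) == 1.5f * 2.25f);
  assert(std::signbit(mul_via_fma(-1.0f, 0.0f)));  // -0.0 stays -0.0
}
```

Since fma rounds only once and the extra operand is an exact identity element, each result should match the plain operation, which is why the rewrite is attractive on targets that have a bf16 fma but no bf16 add/sub/mul.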

https://github.com/llvm/llvm-project/pull/121065