[llvm] [mlir] [NVPTX] Added more MMA intrinsics for F8F6F4 and FP64 types. (PR #156040)
Artem Belevich via llvm-commits
llvm-commits at lists.llvm.org
Tue Sep 2 14:57:00 PDT 2025
================
@@ -90,6 +90,21 @@ def __init__(self, geom, frag, ptx_elt_type, is_mma_sparse=False):
"m16n8k32:b:s8": 2,
----------------
Artem-B wrote:
<!--__GRAPHITE_HTML_TAG_START__--><p class='graphite__hidden'><i>[Re: line +3]</i></p><!--__GRAPHITE_HTML_TAG_END__-->
You should generate those tests, include them into this PR, and make sure they pass with PTXAS testing enabled via `LLVM_PTXAS_EXECUTABLE=/path/to/ptxas` with ptxas from both cuda-13, and a few older versions (e.g. with 12.9, and 11.8) to make sure instructions are properly constrained and are accepted by ptxas.
<!--__GRAPHITE_HTML_TAG_START__--><p class='graphite__hidden'>See this comment inline on <a href="https://app.graphite.dev/github/pr/llvm/llvm-project/156040?utm_source=unchanged-line-comment">Graphite</a>.</p><!--__GRAPHITE_HTML_TAG_END__-->
https://github.com/llvm/llvm-project/pull/156040
More information about the llvm-commits
mailing list