[clang] [llvm] [mlir] [AMDGPU] add clamp immediate operand to WMMA iu8 intrinsic (PR #171069)

Matt Arsenault via llvm-commits llvm-commits at lists.llvm.org
Sun Dec 14 00:21:58 PST 2025


================
@@ -0,0 +1,17 @@
+; RUN: llvm-as < %s | llvm-dis | FileCheck %s
+
+; Verify that the legacy WMMA IU8 intrinsic without the clamp operand is
+; upgraded by appending clamp=false.
+
+define <8 x i32> @wmma_legacy(<8 x i32> %a, <8 x i32> %b, <8 x i32> %c) {
+; CHECK-LABEL: @wmma_legacy(
+; CHECK: call <8 x i32> @llvm.amdgcn.wmma.i32.16x16x64.iu8.v8i32.v8i32(i1 false, <8 x i32> %a, i1 false, <8 x i32> %b, <8 x i32> %c, i1 false, i1 false, i1 false)
+; CHECK: ret <8 x i32>
+  %res = call <8 x i32> @llvm.amdgcn.wmma.i32.16x16x64.iu8.v8i32.v8i32(
+      i1 false, <8 x i32> %a, i1 false, <8 x i32> %b, <8 x i32> %c,
+      i1 false, i1 false)
----------------
arsenm wrote:

Please also test a call that carries a metadata attachment and a call-site attribute, so the upgrade is shown to preserve both.
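
A rough sketch of what such a test could look like, modeled on the function in the hunk above. The function name `@wmma_legacy_md_attr`, the metadata kind `!test.md`, and the string attribute `"test-attr"` are placeholders invented for illustration, not names from the patch. The CHECK line assumes the upgrade keeps both on the rewritten call.

```llvm
; Legacy call (no clamp operand) carrying a call-site attribute and a
; metadata attachment. "test-attr" and !test.md are placeholder names.
define <8 x i32> @wmma_legacy_md_attr(<8 x i32> %a, <8 x i32> %b, <8 x i32> %c) {
; CHECK-LABEL: @wmma_legacy_md_attr(
; CHECK: call <8 x i32> @llvm.amdgcn.wmma.i32.16x16x64.iu8.v8i32.v8i32(i1 false, <8 x i32> %a, i1 false, <8 x i32> %b, <8 x i32> %c, i1 false, i1 false, i1 false) #[[ATTR:[0-9]+]], !test.md ![[MD:[0-9]+]]
; CHECK: ret <8 x i32>
  %res = call <8 x i32> @llvm.amdgcn.wmma.i32.16x16x64.iu8.v8i32.v8i32(
      i1 false, <8 x i32> %a, i1 false, <8 x i32> %b, <8 x i32> %c,
      i1 false, i1 false) #0, !test.md !0
  ret <8 x i32> %res
}

; Legacy declaration, matching the seven-operand call above.
declare <8 x i32> @llvm.amdgcn.wmma.i32.16x16x64.iu8.v8i32.v8i32(i1, <8 x i32>, i1, <8 x i32>, <8 x i32>, i1, i1)

; CHECK: attributes #[[ATTR]] = { "test-attr" }
; CHECK: ![[MD]] = !{!"placeholder"}
attributes #0 = { "test-attr" }
!0 = !{!"placeholder"}
```

The attribute group and metadata node numbers are captured with FileCheck variables because `llvm-dis` may renumber them on output.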

https://github.com/llvm/llvm-project/pull/171069

