[llvm] [NVPTX] Add intrinsics for wgmma.fence PTX instructions (PR #120523)

Durgadoss R via llvm-commits llvm-commits at lists.llvm.org
Thu Dec 19 00:38:29 PST 2024


================
@@ -4805,6 +4805,21 @@ def int_nvvm_redux_sync_or : ClangBuiltin<"__nvvm_redux_sync_or">,
   Intrinsic<[llvm_i32_ty], [llvm_i32_ty, llvm_i32_ty],
             [IntrConvergent, IntrInaccessibleMemOnly, IntrNoCallback]>;
 
+//
+// WGMMA instructions
+//
+// wgmma.fence.sync.aligned;
+def int_nvvm_wgmma_fence_sync_aligned 
+  : Intrinsic<[], [], [IntrConvergent], "llvm.nvvm.wgmma.fence.sync.aligned">;
----------------
durga4github wrote:

we may not need the explicit name string here. "llvm.nvvm.wgmma.fence.sync.aligned"

https://github.com/llvm/llvm-project/pull/120523


More information about the llvm-commits mailing list