[llvm] [NVPTX] Add intrinsics for wgmma.fence PTX instructions (PR #120523)
Durgadoss R via llvm-commits
llvm-commits at lists.llvm.org
Thu Dec 19 00:38:29 PST 2024
================
@@ -4805,6 +4805,21 @@ def int_nvvm_redux_sync_or : ClangBuiltin<"__nvvm_redux_sync_or">,
Intrinsic<[llvm_i32_ty], [llvm_i32_ty, llvm_i32_ty],
[IntrConvergent, IntrInaccessibleMemOnly, IntrNoCallback]>;
+//
+// WGMMA instructions
+//
+// wgmma.fence.sync.aligned;
+def int_nvvm_wgmma_fence_sync_aligned
+ : Intrinsic<[], [], [IntrConvergent], "llvm.nvvm.wgmma.fence.sync.aligned">;
----------------
durga4github wrote:
we may not need the explicit name string here. "llvm.nvvm.wgmma.fence.sync.aligned"
https://github.com/llvm/llvm-project/pull/120523
More information about the llvm-commits
mailing list