[llvm] [LLVM][NVPTX] Add NVPTX codegen support for fence.proxy.tensormap (PR #100748)
Pradeep Kumar via llvm-commits
llvm-commits at lists.llvm.org
Tue Aug 6 03:40:20 PDT 2024
================
@@ -251,6 +251,34 @@ Overview:
The '``@llvm.nvvm.barrier0()``' intrinsic emits a PTX ``bar.sync 0``
instruction, equivalent to the ``__syncthreads()`` call in CUDA.
+Membar/Fences
+-------------
+
+
+'``llvm.nvvm.fence.proxy.tensormap.*``'
----------------
schwarzschild-radius wrote:
Added `generic` explicitly to the intrinsic name to mark the direction of the fencing.
https://github.com/llvm/llvm-project/pull/100748