[llvm] [LLVM][NVPTX] Add NVPTX codegen support for fence.proxy.tensormap (PR #100748)
    Pradeep Kumar via llvm-commits 
    llvm-commits at lists.llvm.org
       
    Tue Aug  6 03:40:20 PDT 2024
    
    
  
================
@@ -251,6 +251,34 @@ Overview:
 The '``@llvm.nvvm.barrier0()``' intrinsic emits a PTX ``bar.sync 0``
 instruction, equivalent to the ``__syncthreads()`` call in CUDA.
 
+Membar/Fences
+-------------
+
+
+'``llvm.nvvm.fence.proxy.tensormap.*``'
----------------
schwarzschild-radius wrote:
Added `generic` explicitly in the intrinsic to mark the direction of the fencing
https://github.com/llvm/llvm-project/pull/100748
    
    
More information about the llvm-commits
mailing list