[llvm] [NVPTX] Add intrinsics for st.bulk instruction (PR #128856)

Alex MacLean via llvm-commits llvm-commits at lists.llvm.org
Thu Feb 27 20:55:57 PST 2025


================
@@ -7816,3 +7816,17 @@ foreach shape = ["16x64b", "16x128b", "16x256b", "32x32b", "16x32bx2"] in {
 }
 
 } // isConvergent
+
+// Bulk store instructions
+                            
+def INT_NVVM_ST_BULK_GENERIC :
+  NVPTXInst<(outs), (ins Int64Regs:$dest_addr, Int64Regs:$size),
+            "st.bulk [$dest_addr], $size, 0;",
+            [(int_nvvm_st_bulk i64:$dest_addr, i64:$size, (i64 0))]>,
+            Requires<[hasSM<100>, hasPTX<86>]>;
+
+def INT_NVVM_ST_BULK_SHARED_CTA:
+  NVPTXInst<(outs), (ins Int64Regs:$dest_addr, Int64Regs:$size),
+            "st.bulk.shared::cta [$dest_addr], $size, 0;",
+            [(int_nvvm_st_bulk_shared_cta i64:$dest_addr, i64:$size, (i64 0))]>,
----------------
AlexMaclean wrote:

Perhaps this would be a good use case for the `ADDR` operand and `addr` pattern added in https://github.com/llvm/llvm-project/pull/129102

https://github.com/llvm/llvm-project/pull/128856


More information about the llvm-commits mailing list