[llvm] [NVPTX] Add intrinsics for st.bulk instruction (PR #128856)
Alex MacLean via llvm-commits
llvm-commits at lists.llvm.org
Thu Feb 27 20:55:57 PST 2025
================
@@ -7816,3 +7816,17 @@ foreach shape = ["16x64b", "16x128b", "16x256b", "32x32b", "16x32bx2"] in {
}
} // isConvergent
+
+// Bulk store instructions
+
+def INT_NVVM_ST_BULK_GENERIC :
+ NVPTXInst<(outs), (ins Int64Regs:$dest_addr, Int64Regs:$size),
+ "st.bulk [$dest_addr], $size, 0;",
+ [(int_nvvm_st_bulk i64:$dest_addr, i64:$size, (i64 0))]>,
+ Requires<[hasSM<100>, hasPTX<86>]>;
+
+def INT_NVVM_ST_BULK_SHARED_CTA:
+ NVPTXInst<(outs), (ins Int64Regs:$dest_addr, Int64Regs:$size),
+ "st.bulk.shared::cta [$dest_addr], $size, 0;",
+ [(int_nvvm_st_bulk_shared_cta i64:$dest_addr, i64:$size, (i64 0))]>,
----------------
AlexMaclean wrote:
Perhaps this would be a good use case for the `ADDR` operand and `addr` pattern added in https://github.com/llvm/llvm-project/pull/129102
https://github.com/llvm/llvm-project/pull/128856
More information about the llvm-commits
mailing list