[llvm] [NVPTX] Add TMA Bulk Copy Intrinsics (PR #138679)

Alex MacLean via llvm-commits llvm-commits at lists.llvm.org
Tue May 6 09:11:44 PDT 2025


================
@@ -516,6 +516,9 @@ class CpAsyncBulkStr<bit mc, bit ch> {
   string S2G = "cp.async.bulk.global.shared::cta.bulk_group"
                # !if(ch, ".L2::cache_hint", "");
 
+  // Shared to Global memory with bytemask
+  string S2G_BM = S2G # ".cp_mask";
+
----------------
AlexMaclean wrote:

Why add this here instead of just appending the suffix when using S2G below. I think adding this level of indirection between the instruction name and it's actual definition is making things a bit more confusing. 

https://github.com/llvm/llvm-project/pull/138679


More information about the llvm-commits mailing list