[llvm] [NVPTX] Add TMA Bulk Copy Intrinsics (PR #138679)
Alex MacLean via llvm-commits
llvm-commits at lists.llvm.org
Tue May 6 09:11:44 PDT 2025
================
@@ -516,6 +516,9 @@ class CpAsyncBulkStr<bit mc, bit ch> {
string S2G = "cp.async.bulk.global.shared::cta.bulk_group"
# !if(ch, ".L2::cache_hint", "");
+ // Shared to Global memory with bytemask
+ string S2G_BM = S2G # ".cp_mask";
+
----------------
AlexMaclean wrote:
Why add this here instead of just appending the suffix when using S2G below. I think adding this level of indirection between the instruction name and it's actual definition is making things a bit more confusing.
https://github.com/llvm/llvm-project/pull/138679
More information about the llvm-commits
mailing list