[llvm] [NVPTX] Add TMA Bulk Copy Intrinsics (PR #138679)
Durgadoss R via llvm-commits
llvm-commits at lists.llvm.org
Tue May 6 09:31:36 PDT 2025
================
@@ -516,6 +516,9 @@ class CpAsyncBulkStr<bit mc, bit ch> {
string S2G = "cp.async.bulk.global.shared::cta.bulk_group"
# !if(ch, ".L2::cache_hint", "");
+ // Shared to Global memory with bytemask
+ string S2G_BM = S2G # ".cp_mask";
+
----------------
durga4github wrote:
Yes, the mask suffix here is rather short ;-)
Otherwise, we generally keep these strings in a class since the final instruction name tends to be rather long.
I think using a bool for the mask and generating a single S2G string would be more concise and more consistent with the other uses here. I will try this out.
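
A rough sketch of the bool-based alternative, for illustration only. It assumes a third template bit named `mask` with a default of 0 (a hypothetical name, not taken from the patch), and the two `def` records exist only to show the resulting strings. The real class has other members that are omitted here.

```
// Sketch only: `mask` is an assumed parameter name, not from the patch.
class CpAsyncBulkStr<bit mc, bit ch, bit mask = 0> {
  // Shared to Global memory; ".cp_mask" is appended only when mask is set.
  // (`mc` is unused in this reduced sketch.)
  string S2G = "cp.async.bulk.global.shared::cta.bulk_group"
               # !if(ch, ".L2::cache_hint", "")
               # !if(mask, ".cp_mask", "");
}

// Hypothetical records showing both variants from the one S2G field.
def ExampleS2G     : CpAsyncBulkStr<0, 1>;     // ...bulk_group.L2::cache_hint
def ExampleS2GMask : CpAsyncBulkStr<0, 1, 1>;  // ...bulk_group.L2::cache_hint.cp_mask
```

With this shape, the separate `S2G_BM` field is no longer needed, since the byte-mask variant is selected by the template argument in the same way as the cache hint.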
https://github.com/llvm/llvm-project/pull/138679