[llvm] [NVPTX] Add Bulk Copy Prefetch Intrinsics (PR #123226)
Durgadoss R via llvm-commits
llvm-commits at lists.llvm.org
Thu Jan 16 11:18:43 PST 2025
================
@@ -547,6 +547,25 @@ multiclass CP_ASYNC_BULK_CTA_TO_CLUSTER<NVPTXRegClass rc> {
defm CP_ASYNC_BULK_CTA_TO_CLUSTER : CP_ASYNC_BULK_CTA_TO_CLUSTER<Int64Regs>;
defm CP_ASYNC_BULK_CTA_TO_CLUSTER_SHARED32 : CP_ASYNC_BULK_CTA_TO_CLUSTER<Int32Regs>;
+//------------------------------
+// Bulk Copy Prefetch Functions
+//------------------------------
+multiclass CP_ASYNC_BULK_PREFETCH_INTR {
+ defvar prefetch = "cp.async.bulk.prefetch.L2.global";
+ def "": NVPTXInst<(outs),
+ (ins Int64Regs:$src, Int32Regs:$size),
+ !strconcat(prefetch," [$src], $size;"),
----------------
durga4github wrote:
Looks like the indentation needs to be fixed (same for the below def also)
https://github.com/llvm/llvm-project/pull/123226
More information about the llvm-commits
mailing list