[PATCH] D107046: [NVPTX] Add NVPTX intrinsics for CUDA PTX 6.5 ldmatrix instructions
Artem Belevich via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Jul 29 13:47:53 PDT 2021
tra accepted this revision.
tra added a comment.
This revision is now accepted and ready to land.
Nice!
BTW, I think it may be a good time for you to ask for LLVM commit access. llvm.org/docs/DeveloperPolicy.html#obtaining-commit-access
================
Comment at: llvm/lib/Target/NVPTX/NVPTXIntrinsics.td:7919
+//
+// ldmatrix.sync.aligned.m8n8[|.trans][|.shared].b16
+//
----------------
Nit: the `|` is redundant here, IMO.
I think `[something]` already reads as "`something` is optional".
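To illustrate the notation point, here is a minimal Python sketch (not part of the patch) that expands the pattern as quoted in the comment, reading each bracketed suffix as optional. The names `TRANS`, `SPACE`, and `expand_variants` are made up for this example, and it only covers the suffixes shown in the comment above.

```python
from itertools import product

# Optional suffixes as written in the .td comment; "" means "suffix omitted".
TRANS = ["", ".trans"]
SPACE = ["", ".shared"]


def expand_variants():
    """Expand ldmatrix.sync.aligned.m8n8[.trans][.shared].b16 into all spellings."""
    return [
        f"ldmatrix.sync.aligned.m8n8{trans}{space}.b16"
        for trans, space in product(TRANS, SPACE)
    ]


variants = expand_variants()
for v in variants:
    print(v)
```

Both `[|.trans]` and `[.trans]` describe the same four spellings, which is why the extra `|` adds nothing.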
================
Comment at: llvm/lib/Target/NVPTX/NVPTXIntrinsics.td:7954-7955
+ } // space
+ } // transposed
+} // defset
----------------
Nit: something is off with the indentation here.
================
Comment at: llvm/test/CodeGen/NVPTX/wmma.py:636
+"""
+ intrinsic_template = "llvm.nvvm.ldmatrix.sync.aligned.${geom}.${x}${trans}.${itype}.${pspace}"
+ instruction_template = "ldmatrix.sync.aligned.${geom}.${x}${trans}${space}.${itype}"
----------------
Nit: rename `x` -> `frag`?
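A rough sketch of how the two templates might read with the placeholder renamed, using `string.Template` from the standard library. The parameter values below (`m8n8`, `x4`, `.trans`, `.shared`, `b16`, `p3`) are assumptions picked for illustration and are not taken from the test script.

```python
from string import Template

# Same templates as in the patch, with ${x} renamed to ${frag}.
intrinsic_template = Template(
    "llvm.nvvm.ldmatrix.sync.aligned.${geom}.${frag}${trans}.${itype}.${pspace}"
)
instruction_template = Template(
    "ldmatrix.sync.aligned.${geom}.${frag}${trans}${space}.${itype}"
)

# Hypothetical values, chosen only to show the substitution.
params = {
    "geom": "m8n8",
    "frag": "x4",
    "trans": ".trans",
    "space": ".shared",
    "itype": "b16",
    "pspace": "p3",
}

# substitute() ignores mapping keys that a given template does not use.
intrinsic = intrinsic_template.substitute(params)
instruction = instruction_template.substitute(params)

print(intrinsic)
print(instruction)
```

With `frag` as the key, the dictionary that feeds the templates documents itself, whereas `x` gives no hint about what the field holds.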
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D107046/new/
https://reviews.llvm.org/D107046