[clang] [llvm] [mlir] [NVPTX] Add support for Shared Cluster Memory address space. (PR #135444)
Durgadoss R via llvm-commits
llvm-commits at lists.llvm.org
Mon Apr 14 06:39:13 PDT 2025
================
@@ -43,7 +43,10 @@ enum NVVMMemorySpace {
/// Tensor memory space identifier.
/// Tensor memory is available only in arch-accelerated
/// variants from sm100 onwards.
- kTensorMemorySpace = 6
+ kTensorMemorySpace = 6,
+ /// Distributed shared memory space identifier.
+ /// Distributed shared memory is available only in sm80+.
----------------
durga4github wrote:
Targets such as sm_86 and sm_87 are "sm80+" but do not support distributed shared memory (dsmem).
So, let us say "sm 90 onwards" here.
https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#memory-hierarchy
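
To illustrate the point about the comment wording, here is a minimal sketch of the availability rule. The helper name `hasDistributedSharedMemory` and its `smVersion` parameter are hypothetical and are not part of this PR or of any LLVM/MLIR API. The sketch only models the "sm 90 onwards" rule as a plain numeric comparison.

```cpp
// Hypothetical helper (an assumption for illustration, not code from the PR).
// smVersion is the numeric part of an sm_XX target, e.g. 86 for sm_86.
constexpr bool hasDistributedSharedMemory(unsigned smVersion) {
  // Distributed shared memory is tied to thread block clusters,
  // which are available from sm_90 onwards.
  return smVersion >= 90;
}

// "sm80+" would wrongly include these targets:
static_assert(!hasDistributedSharedMemory(80), "sm_80 has no dsmem");
static_assert(!hasDistributedSharedMemory(86), "sm_86 has no dsmem");
static_assert(!hasDistributedSharedMemory(87), "sm_87 has no dsmem");

// "sm 90 onwards" matches the actual availability:
static_assert(hasDistributedSharedMemory(90), "sm_90 has dsmem");
static_assert(hasDistributedSharedMemory(100), "sm_100 has dsmem");
```

The compile-time checks show why "sm80+" is too broad: sm_86 and sm_87 satisfy it, yet neither supports dsmem.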
https://github.com/llvm/llvm-project/pull/135444