[all-commits] [llvm/llvm-project] 9a5a8c: [NVPTX] Add intrinsics for st.bulk instruction (#1...

Srinivasa Ravi via All-commits all-commits at lists.llvm.org
Mon Mar 10 20:57:42 PDT 2025


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 9a5a8c9a8072d9af9cea087e506ea213bd89c0f5
      https://github.com/llvm/llvm-project/commit/9a5a8c9a8072d9af9cea087e506ea213bd89c0f5
  Author: Srinivasa Ravi <srinivasar at nvidia.com>
  Date:   2025-03-11 (Tue, 11 Mar 2025)

  Changed paths:
    M llvm/docs/NVPTXUsage.rst
    M llvm/include/llvm/IR/IntrinsicsNVVM.td
    M llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
    A llvm/test/CodeGen/NVPTX/st_bulk.ll

  Log Message:
  -----------
  [NVPTX] Add intrinsics for st.bulk instruction (#128856)

Adds NVVM intrinsics and NVPTX codegen for the `st.bulk` instruction
introduced in ptx8.6 for sm_100. Tests added in
`CodeGen/NVPTX/st_bulk.ll` and verified through ptxas 12.8.0.

PTX Spec Reference:
https://docs.nvidia.com/cuda/parallel-thread-execution/#data-movement-and-conversion-instructions-st-bulk



To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications


More information about the All-commits mailing list