[all-commits] [llvm/llvm-project] bd860f: [NVPTX] Add intrinsics for redux.sync f32 instruct...

Srinivasa Ravi via All-commits all-commits at lists.llvm.org
Thu Feb 13 21:42:06 PST 2025


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: bd860f986406b6630e49b1836b3c208acd721d3e
      https://github.com/llvm/llvm-project/commit/bd860f986406b6630e49b1836b3c208acd721d3e
  Author: Srinivasa Ravi <srinivasar at nvidia.com>
  Date:   2025-02-14 (Fri, 14 Feb 2025)

  Changed paths:
    M clang/include/clang/Basic/BuiltinsNVPTX.td
    A clang/test/CodeGenCUDA/redux-f32-builtins.cu
    M llvm/include/llvm/IR/IntrinsicsNVVM.td
    M llvm/lib/Target/NVPTX/NVPTXIntrinsics.td
    A llvm/test/CodeGen/NVPTX/redux-sync-f32.ll

  Log Message:
  -----------
  [NVPTX] Add intrinsics for redux.sync f32 instructions (#126664)

Adds NVVM intrinsics, NVPTX codegen and Clang builtins for `redux.sync`
f32 instructions introduced in ptx8.6 for sm_100a.
Tests added in `CodeGen/NVPTX/redux-sync.ll` and
`CodeGenCUDA/redux-builtins.cu` and verified through ptxas 12.8.0.

PTX Spec Reference:

https://docs.nvidia.com/cuda/parallel-thread-execution/#parallel-synchronization-and-communication-instructions-redux-sync



To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications


More information about the All-commits mailing list