[all-commits] [llvm/llvm-project] f7a9fc: [flang][cuda] Use PTX instruction for atomicAdd wi...

Valentin Clement (バレンタイン クレメン) via All-commits all-commits at lists.llvm.org
Tue Nov 25 15:45:45 PST 2025


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: f7a9fcad99e155fed32d98a2d41d1e3bfdeebf48
      https://github.com/llvm/llvm-project/commit/f7a9fcad99e155fed32d98a2d41d1e3bfdeebf48
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2025-11-25 (Tue, 25 Nov 2025)

  Changed paths:
    M flang/include/flang/Optimizer/Builder/CUDAIntrinsicCall.h
    M flang/lib/Optimizer/Builder/CUDAIntrinsicCall.cpp
    M flang/test/Lower/CUDA/cuda-atomicadd.cuf

  Log Message:
  -----------
  [flang][cuda] Use PTX instruction for atomicAdd with 4xf32 (#169581)

The implementation is similar to the clang one in
`clang/lib/Headers/__clang_cuda_intrinsics.h`.


