[all-commits] [llvm/llvm-project] fca503: [flang][cuda] Add conversion pass for cuf.allocate...

Valentin Clement (バレンタイン クレメン) via All-commits all-commits at lists.llvm.org
Fri Aug 2 16:19:32 PDT 2024


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: fca5038597af65d2fd1af0815014440aff055630
      https://github.com/llvm/llvm-project/commit/fca5038597af65d2fd1af0815014440aff055630
  Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
  Date:   2024-08-02 (Fri, 02 Aug 2024)

  Changed paths:
    M flang/include/flang/Optimizer/Transforms/Passes.h
    M flang/include/flang/Optimizer/Transforms/Passes.td
    M flang/lib/Optimizer/Transforms/CMakeLists.txt
    A flang/lib/Optimizer/Transforms/CufOpConversion.cpp
    A flang/test/Fir/CUDA/cuda-allocate.fir

  Log Message:
  -----------
  [flang][cuda] Add conversion pass for cuf.allocate and cuf.deallocate (#101563)

An allocator can be specified in the descriptor. For a simple local
allocatable, we can convert `cuf.allocate`/`cuf.deallocate` directly to
their corresponding runtime calls in the standard flang runtime. More
specific cases will require dedicated entry points. Global descriptors
will require synchronization between the host and device copies.

This patch adds a pass to perform this conversion.
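
As a rough illustration of why a generic runtime call can be enough when the
allocator is recorded in the descriptor, here is a toy C++ model. It is not
flang code: `Descriptor`, `Allocator`, `toyAllocate`, `toyDeallocate`, and the
allocator indices are all hypothetical names invented for this sketch, and the
"device" allocator is a plain `malloc` stand-in so the example runs without a
GPU.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <string>
#include <vector>

// Hypothetical toy descriptor: the real flang descriptor is different.
struct Descriptor {
  void *base = nullptr;   // nullptr means "not allocated"
  std::size_t bytes = 0;  // requested size
  int allocatorIdx = 0;   // which registered allocator owns this object
};

// A pair of allocate/free callbacks (assumed shape, for illustration only).
struct Allocator {
  void *(*alloc)(std::size_t);
  void (*release)(void *);
};

// Records which allocator was actually used, so the dispatch is observable.
std::vector<std::string> &callLog() {
  static std::vector<std::string> log;
  return log;
}

void *hostAlloc(std::size_t n) {
  callLog().push_back("host-alloc");
  return std::malloc(n);
}
void hostFree(void *p) {
  callLog().push_back("host-free");
  std::free(p);
}
// Stand-in for a device allocator; a real one would call into the CUDA runtime.
void *fakeDeviceAlloc(std::size_t n) {
  callLog().push_back("device-alloc");
  return std::malloc(n);
}
void fakeDeviceFree(void *p) {
  callLog().push_back("device-free");
  std::free(p);
}

constexpr int kHostAllocator = 0;
constexpr int kFakeDeviceAllocator = 1;

const std::array<Allocator, 2> &registry() {
  static const std::array<Allocator, 2> table{{
      {hostAlloc, hostFree},
      {fakeDeviceAlloc, fakeDeviceFree},
  }};
  return table;
}

// Generic entry point: the caller does not say where the memory lives,
// the descriptor does. Returns 0 on success, nonzero on error.
int toyAllocate(Descriptor &d) {
  assert(d.allocatorIdx >= 0 && d.allocatorIdx < 2);
  if (d.base)
    return 1; // already allocated
  d.base = registry()[d.allocatorIdx].alloc(d.bytes);
  return d.base ? 0 : 2;
}

int toyDeallocate(Descriptor &d) {
  assert(d.allocatorIdx >= 0 && d.allocatorIdx < 2);
  if (!d.base)
    return 1; // nothing to free
  registry()[d.allocatorIdx].release(d.base);
  d.base = nullptr;
  return 0;
}
```

In this model, a caller sets `allocatorIdx` to `kFakeDeviceAllocator` once and
then uses the same `toyAllocate`/`toyDeallocate` calls as for a host object.
That mirrors the simple local case described above. The cases that need
dedicated entry points or host/device synchronization are not modeled here.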


