[all-commits] [llvm/llvm-project] fca503: [flang][cuda] Add conversion pass for cuf.allocate...
Valentin Clement (バレンタイン クレメン) via All-commits
all-commits at lists.llvm.org
Fri Aug 2 16:19:32 PDT 2024
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: fca5038597af65d2fd1af0815014440aff055630
https://github.com/llvm/llvm-project/commit/fca5038597af65d2fd1af0815014440aff055630
Author: Valentin Clement (バレンタイン クレメン) <clementval at gmail.com>
Date: 2024-08-02 (Fri, 02 Aug 2024)
Changed paths:
M flang/include/flang/Optimizer/Transforms/Passes.h
M flang/include/flang/Optimizer/Transforms/Passes.td
M flang/lib/Optimizer/Transforms/CMakeLists.txt
A flang/lib/Optimizer/Transforms/CufOpConversion.cpp
A flang/test/Fir/CUDA/cuda-allocate.fir
Log Message:
-----------
[flang][cuda] Add conversion pass for cuf.allocate and cuf.deallocate (#101563)
Allocator can be specified in the descriptor. For simple local
allocatable, we can simply convert `cuf.allocate`/`cuf.deallocate` to
their corresponding runtime calls in the standard flang runtime. More
specific cases will require dedicated entry points. Global descriptor
will require sync between host and device copy.
This patch adds a pass to perform this conversion.
To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications
More information about the All-commits
mailing list