[all-commits] [llvm/llvm-project] 53bcd1: [libomptarget][nfc] Wrap cuda min() in target_impl
Jon Chesterfield via All-commits
all-commits at lists.llvm.org
Mon Dec 16 17:30:18 PST 2019
Branch: refs/heads/master
Home: https://github.com/llvm/llvm-project
Commit: 53bcd1e1413c878d2d988df80142a430a9abf24a
https://github.com/llvm/llvm-project/commit/53bcd1e1413c878d2d988df80142a430a9abf24a
Author: Jon Chesterfield <jonathanchesterfield at gmail.com>
Date: 2019-12-17 (Tue, 17 Dec 2019)
Changed paths:
M openmp/libomptarget/deviceRTLs/amdgcn/CMakeLists.txt
M openmp/libomptarget/deviceRTLs/amdgcn/src/target_impl.h
M openmp/libomptarget/deviceRTLs/common/src/parallel.cu
M openmp/libomptarget/deviceRTLs/nvptx/src/reduction.cu
M openmp/libomptarget/deviceRTLs/nvptx/src/target_impl.h
Log Message:
-----------
[libomptarget][nfc] Wrap cuda min() in target_impl
Summary:
The nvptx implementation forwards to the CUDA min(); the amdgcn implementation defines it directly.
This is sufficient to build parallel.cu for amdgcn, so that file is added to the amdgcn CMakeLists.txt.
All call sites are homogeneous except one, which passes a uint32_t and an
int32_t. This could be smoothed over by taking two type parameters
and taking some care over the return type, but overall I think an explicit
<uint32_t> at that call site, calling attention to what was an implicit
sign conversion, is cleaner.
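
For illustration only, a minimal host-side C++ sketch of the single-type-parameter approach described above. The names impl_min, clamp_threads and clamp_mixed are hypothetical stand-ins, not the identifiers used in target_impl.h, and the device-side qualifiers are omitted so the sketch compiles as plain C++. It is meant to show why the mixed uint32_t/int32_t call site needs an explicit <uint32_t>, not to reproduce the actual patch.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical wrapper: one type parameter, so both arguments must agree.
// (Assumption: a direct comparison like this is what "implements directly"
// could look like; an nvptx-style variant would forward to CUDA's min().)
template <typename T>
inline T impl_min(T x, T y) {
  return x < y ? x : y;
}

// Homogeneous call site: T is deduced as uint32_t from both arguments.
inline uint32_t clamp_threads(uint32_t requested, uint32_t limit) {
  return impl_min(requested, limit);
}

// Mixed call site: deduction would fail (uint32_t vs int32_t), so the type
// is spelled out. The int32_t argument is converted to uint32_t here, which
// makes the previously implicit sign conversion visible in the source.
inline uint32_t clamp_mixed(uint32_t requested, int32_t limit) {
  return impl_min<uint32_t>(requested, limit);
}

// Small usage example for the two hypothetical call sites.
inline void check_call_sites() {
  assert(clamp_threads(8u, 4u) == 4u);
  assert(clamp_mixed(8u, 4) == 4u);
}
```

Note that with this design a negative int32_t is converted to a large unsigned value before the comparison, so the unsigned argument wins; the explicit <uint32_t> is what signals that behaviour to the reader.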
Reviewers: ABataev, jdoerfert
Reviewed By: jdoerfert
Subscribers: jvesely, mgorny, openmp-commits
Tags: #openmp
Differential Revision: https://reviews.llvm.org/D71580