[all-commits] [llvm/llvm-project] f6685f: [OpenMP][CUDA] Refine the logic to determine grid ...
Shilei Tian via All-commits
all-commits at lists.llvm.org
Thu Feb 10 11:13:45 PST 2022
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: f6685f774697c85d6a352dcea013f46a99f9fe31
https://github.com/llvm/llvm-project/commit/f6685f774697c85d6a352dcea013f46a99f9fe31
Author: Shilei Tian <i at tianshilei.me>
Date: 2022-02-10 (Thu, 10 Feb 2022)
Changed paths:
M openmp/libomptarget/plugins/cuda/src/rtl.cpp
Log Message:
-----------
[OpenMP][CUDA] Refine the logic to determine grid size
This patch refines the logic to determine grid size as previous method
can escape the check of whether `CudaBlocksPerGrid` could be greater than the actual
hardware limit.
Reviewed By: jdoerfert
Differential Revision: https://reviews.llvm.org/D119311
More information about the All-commits
mailing list