[all-commits] [llvm/llvm-project] f6685f: [OpenMP][CUDA] Refine the logic to determine grid ...

Shilei Tian via All-commits all-commits at lists.llvm.org
Thu Feb 10 11:13:45 PST 2022


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: f6685f774697c85d6a352dcea013f46a99f9fe31
      https://github.com/llvm/llvm-project/commit/f6685f774697c85d6a352dcea013f46a99f9fe31
  Author: Shilei Tian <i at tianshilei.me>
  Date:   2022-02-10 (Thu, 10 Feb 2022)

  Changed paths:
    M openmp/libomptarget/plugins/cuda/src/rtl.cpp

  Log Message:
  -----------
  [OpenMP][CUDA] Refine the logic to determine grid size

This patch refines the logic to determine grid size as previous method
can escape the check of whether `CudaBlocksPerGrid` could be greater than the actual
hardware limit.

Reviewed By: jdoerfert

Differential Revision: https://reviews.llvm.org/D119311




More information about the All-commits mailing list