[all-commits] [llvm/llvm-project] c7b683: [PGO][CUDA][HIP] Skip generating profile on the de...

darkbuck via All-commits all-commits at lists.llvm.org
Mon Aug 10 08:02:33 PDT 2020


  Branch: refs/heads/master
  Home:   https://github.com/llvm/llvm-project
  Commit: c7b683c126b849dab5c81e7deecfc1e61f8563a0
      https://github.com/llvm/llvm-project/commit/c7b683c126b849dab5c81e7deecfc1e61f8563a0
  Author: Michael Liao <michael.hliao at gmail.com>
  Date:   2020-08-10 (Mon, 10 Aug 2020)

  Changed paths:
    M clang/lib/CodeGen/CodeGenPGO.cpp
    A clang/test/CodeGenCUDA/profile-coverage-mapping.cu

  Log Message:
  -----------
  [PGO][CUDA][HIP] Skip generating profile on the device stub and wrong-side functions.

- Skip generating profile data on `__global__` functions in the host
  compilation. It is only a host-side stub function and doesn't have
  profile instrumentation generated on the real function body. The extra
  profile data results in malformed instrumentation profile data.
- Skip generating region mapping on wrong-side functions, i.e.,
  + For the device compilation, skip host-only functions; and,
  + For the host compilation, skip device-only functions (including
    `__global__` functions).
- As device-side profiling is not ready yet, only host-side profile
  code generation is checked.
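
The skip rules listed above can be sketched as two small predicates. This is
an illustrative model only, not the code in CodeGenPGO.cpp: the enums
`FnTarget` and `Compilation` and both function names are hypothetical
stand-ins for Clang's internal representation, and the sketch assumes every
function falls into exactly one of the four categories shown.

```cpp
// Hypothetical classification of a CUDA/HIP function by its attributes.
enum class FnTarget {
  Host,       // host-only (no attribute, or __host__)
  Device,     // device-only (__device__)
  HostDevice, // __host__ __device__
  Global      // __global__ kernel
};

// Which side the current compilation targets.
enum class Compilation { Host, Device };

// Profile data is skipped for a __global__ function in the host
// compilation, where only the host-side stub exists.
bool shouldEmitProfileData(FnTarget fn, Compilation side) {
  return !(side == Compilation::Host && fn == FnTarget::Global);
}

// Region mapping is skipped for wrong-side functions: host-only functions
// in the device compilation, and device-only functions (including
// __global__ functions) in the host compilation.
bool shouldEmitRegionMapping(FnTarget fn, Compilation side) {
  switch (fn) {
  case FnTarget::HostDevice:
    return true;
  case FnTarget::Host:
    return side == Compilation::Host;
  case FnTarget::Device:
  case FnTarget::Global:
    return side == Compilation::Device;
  }
  return false; // unreachable for valid enum values
}
```

Under this model a `__host__ __device__` function keeps its region mapping on
both sides, which the log message implies but does not state explicitly.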

Differential Revision: https://reviews.llvm.org/D85276



