[all-commits] [llvm/llvm-project] 18f810: [KernelInfo] Implement new LLVM IR pass for GPU co...

Joel E. Denny via All-commits all-commits at lists.llvm.org
Wed Jan 29 09:40:40 PST 2025


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 18f8106f310ee702046a11f360af47947c030d2e
      https://github.com/llvm/llvm-project/commit/18f8106f310ee702046a11f360af47947c030d2e
  Author: Joel E. Denny <jdenny.ornl at gmail.com>
  Date:   2025-01-29 (Wed, 29 Jan 2025)

  Changed paths:
    A llvm/docs/KernelInfo.rst
    M llvm/docs/Passes.rst
    A llvm/include/llvm/Analysis/KernelInfo.h
    M llvm/include/llvm/Analysis/TargetTransformInfo.h
    M llvm/include/llvm/Analysis/TargetTransformInfoImpl.h
    M llvm/include/llvm/IR/Function.h
    M llvm/include/llvm/Target/TargetMachine.h
    M llvm/lib/Analysis/CMakeLists.txt
    A llvm/lib/Analysis/KernelInfo.cpp
    M llvm/lib/Analysis/TargetTransformInfo.cpp
    M llvm/lib/Passes/PassBuilder.cpp
    M llvm/lib/Passes/PassRegistry.def
    M llvm/lib/Target/AMDGPU/AMDGPUTargetMachine.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUTargetTransformInfo.cpp
    M llvm/lib/Target/AMDGPU/AMDGPUTargetTransformInfo.h
    M llvm/lib/Target/NVPTX/NVPTXTargetMachine.cpp
    M llvm/lib/Target/NVPTX/NVPTXTargetTransformInfo.cpp
    M llvm/lib/Target/NVPTX/NVPTXTargetTransformInfo.h
    M llvm/lib/Target/TargetMachine.cpp
    M llvm/lib/Transforms/IPO/OpenMPOpt.cpp
    A llvm/test/Analysis/KernelInfo/allocas.ll
    A llvm/test/Analysis/KernelInfo/calls.ll
    A llvm/test/Analysis/KernelInfo/enable-kernel-info/Inputs/test.ll
    A llvm/test/Analysis/KernelInfo/enable-kernel-info/amdgpu.test
    A llvm/test/Analysis/KernelInfo/enable-kernel-info/nvptx.test
    A llvm/test/Analysis/KernelInfo/flat-addrspace/Inputs/test.ll
    A llvm/test/Analysis/KernelInfo/flat-addrspace/amdgpu.test
    A llvm/test/Analysis/KernelInfo/flat-addrspace/nvptx.test
    A llvm/test/Analysis/KernelInfo/launch-bounds/amdgpu.ll
    A llvm/test/Analysis/KernelInfo/launch-bounds/nvptx.ll
    A llvm/test/Analysis/KernelInfo/linkage.ll
    A llvm/test/Analysis/KernelInfo/openmp/README.md
    A llvm/test/Analysis/KernelInfo/openmp/amdgpu.ll
    A llvm/test/Analysis/KernelInfo/openmp/nvptx.ll

  Log Message:
  -----------
  [KernelInfo] Implement new LLVM IR pass for GPU code analysis (#102944)

This patch implements an LLVM IR pass, named kernel-info, that reports
various statistics for codes compiled for GPUs. The ultimate goal of
these statistics to help identify bad code patterns and ways to mitigate
them. The pass operates at the LLVM IR level so that it can, in theory,
support any LLVM-based compiler for programming languages supporting
GPUs. It has been tested so far with LLVM IR generated by Clang for
OpenMP offload codes targeting NVIDIA GPUs and AMD GPUs.

By default, the pass runs at the end of LTO, and options like
``-Rpass=kernel-info`` enable its remarks. Example `opt` and `clang`
command lines appear in `llvm/docs/KernelInfo.rst`. Remarks include
summary statistics (e.g., total size of static allocas) and individual
occurrences (e.g., source location of each alloca). Examples of its
output appear in tests in `llvm/test/Analysis/KernelInfo`.



To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications


More information about the All-commits mailing list