[all-commits] [llvm/llvm-project] 9ce74d: [AMDGPU][CodeGen] Improve handling of memcpy for -...

Shilei Tian via All-commits all-commits at lists.llvm.org
Tue Apr 16 06:34:41 PDT 2024


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 9ce74d6d470437cde82e91631395322ec1c36eeb
      https://github.com/llvm/llvm-project/commit/9ce74d6d470437cde82e91631395322ec1c36eeb
  Author: Shilei Tian <i at tianshilei.me>
  Date:   2024-04-16 (Tue, 16 Apr 2024)

  Changed paths:
    M llvm/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
    A llvm/test/CodeGen/AMDGPU/memcpy-libcall.ll

  Log Message:
  -----------
  [AMDGPU][CodeGen] Improve handling of memcpy for -Os/-Oz compilations (#87632)

We had some instances when LLVM would not inline fixed-count memcpy and
ended up
attempting to lower it a a libcall, which would not work on AMDGPU as
the
address space doesn't meet the requirement, causing compiler crash.

The patch relaxes the threshold used for -Os/-Oz compilation so we're
always allowed
to inline memory copy functions.

This patch basically does the same thing as
https://reviews.llvm.org/D158226 for
AMDGPU.

Fix #88497.



To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications


More information about the All-commits mailing list