[all-commits] [llvm/llvm-project] b95a6c: [AMDGPU] Remove special cases in TTI::getMemcpyLoo...

Mon Feb 3 23:18:46 PST 2025

  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: b95a6c750c9e45237071328a9d7fec64a33cb56b
      https://github.com/llvm/llvm-project/commit/b95a6c750c9e45237071328a9d7fec64a33cb56b
  Author: Fabian Ritter <fabian.ritter at amd.com>
  Date:   2025-02-04 (Tue, 04 Feb 2025)

  Changed paths:
    M llvm/lib/Target/AMDGPU/AMDGPUTargetTransformInfo.cpp
    M llvm/test/CodeGen/AMDGPU/lower-mem-intrinsics.ll
    M llvm/test/CodeGen/AMDGPU/memmove-var-size.ll

  Log Message:
  -----------
  [AMDGPU] Remove special cases in TTI::getMemcpyLoop(Residual)LoweringType (#125507)

These special cases limit the width of the memory operations we use for
lowering memcpy/memmove when the pointer arguments are only 2-byte
aligned or point into the LDS/GDS.
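
As a rough illustration of what such special cases look like, here is a
simplified standalone model. It is not the code from
AMDGPUTargetTransformInfo.cpp: the AddrSpace enum, both function names,
and all concrete widths (2, 8, and 16 bytes) are assumptions made for
this sketch, since the log message does not state them.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-ins for the address spaces named in the log message.
enum class AddrSpace { Global, LDS, GDS };

// Assumed default access width in bytes (not stated in the log message).
const unsigned DefaultWidthBytes = 16;

// Model of the behavior BEFORE the change: two special cases cap the width.
inline unsigned loopAccessWidthBefore(AddrSpace Src, AddrSpace Dst,
                                      uint64_t SrcAlign, uint64_t DstAlign) {
  uint64_t MinAlign = SrcAlign < DstAlign ? SrcAlign : DstAlign;
  if (MinAlign == 2)
    return 2; // assumed: 2-aligned pointers get 2-byte accesses
  if (Src != AddrSpace::Global || Dst != AddrSpace::Global)
    return 8; // assumed: LDS/GDS operands get a narrower access
  return DefaultWidthBytes;
}

// Model of the behavior AFTER the change: no special cases remain, so
// alignment and address space no longer narrow the access width.
inline unsigned loopAccessWidthAfter(AddrSpace, AddrSpace, uint64_t, uint64_t) {
  return DefaultWidthBytes;
}

// Small self-check showing how the two models differ on a 2-aligned LDS copy.
inline void checkModel() {
  assert(loopAccessWidthBefore(AddrSpace::LDS, AddrSpace::Global, 2, 2) == 2);
  assert(loopAccessWidthAfter(AddrSpace::LDS, AddrSpace::Global, 2, 2) == 16);
}
```

In this model, removing the two early returns is the whole change: every
memcpy/memmove lowering falls through to the default width.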

I found that performance in microbenchmarks on gfx90a, gfx1030, and
gfx1100 is better without this limitation.
