[all-commits] [llvm/llvm-project] 7eb25d: [AMDGPU] Fix folding clamp into pseudo scalar inst...

Mirko Brkušanin via All-commits all-commits at lists.llvm.org
Sat Aug 10 03:11:18 PDT 2024


  Branch: refs/heads/release/19.x
  Home:   https://github.com/llvm/llvm-project
  Commit: 7eb25ddccc1060776a34410d193bb3d066444b78
      https://github.com/llvm/llvm-project/commit/7eb25ddccc1060776a34410d193bb3d066444b78
  Author: Mirko Brkušanin <Mirko.Brkusanin at amd.com>
  Date:   2024-08-10 (Sat, 10 Aug 2024)

  Changed paths:
    M llvm/lib/Target/AMDGPU/SIFoldOperands.cpp
    A llvm/test/CodeGen/AMDGPU/si-fold-scalar-clamp.mir

  Log Message:
  -----------
  [AMDGPU] Fix folding clamp into pseudo scalar instructions (#100568)

Clamp is canonically a v_max* instruction with a VGPR dst. Folding clamp
into a pseudo scalar instruction can cause issues due to a change in
regbank. We fix this with a copy.

(cherry picked from commit 817cd726454f01e990cd84e5e1d339b120b5ebaa)



To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications


More information about the All-commits mailing list