[all-commits] [llvm/llvm-project] bcdb0d: AMDGPU: Handle rewriting non-tied MFMA to AGPR form

Matt Arsenault via All-commits all-commits at lists.llvm.org
Thu Jul 24 18:15:09 PDT 2025


  Branch: refs/heads/users/arsenm/amdgpu/handle-untied-agpr-mfma-rewrite
  Home:   https://github.com/llvm/llvm-project
  Commit: bcdb0d78fe8c227e7b2c9b539db496950332f66b
      https://github.com/llvm/llvm-project/commit/bcdb0d78fe8c227e7b2c9b539db496950332f66b
  Author: Matt Arsenault <Matthew.Arsenault at amd.com>
  Date:   2025-07-25 (Fri, 25 Jul 2025)

  Changed paths:
    M llvm/lib/Target/AMDGPU/AMDGPURewriteAGPRCopyMFMA.cpp
    M llvm/test/CodeGen/AMDGPU/inflate-reg-class-vgpr-mfma-to-av-with-load-source.mir

  Log Message:
  -----------
  AMDGPU: Handle rewriting non-tied MFMA to AGPR form

If src2 and dst aren't the same register, folding a copy to
AGPR into the instruction also requires reassigning src2 to
an available AGPR. All other uses of src2 must also be
compatible with the AGPR replacement, to avoid inserting
additional copies elsewhere.

Perform this transform after verifying that all other uses are
compatible with AGPRs and that an AGPR is available at all
points (which effectively means rewriting a full chain of
MFMAs and loads/stores at once).
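
The check described above can be sketched in a few lines of
standalone C++. This is only a toy model of the idea, not the code
in AMDGPURewriteAGPRCopyMFMA.cpp: the types RegBank, Use and VirtReg,
the functions allUsesAcceptAGPR and tryRewriteSrc2, and the
freeAGPRs set are all hypothetical names invented for illustration,
and the sketch assumes that "available at all points" has already
been reduced to a set of AGPRs free over the whole live range.

```cpp
#include <cassert>
#include <optional>
#include <set>
#include <vector>

// Toy model only: none of these names come from LLVM.
enum class RegBank { VGPR, AGPR };

// One use of a virtual register by some instruction.
struct Use {
    // True if the using instruction has a form that accepts an AGPR
    // operand in this position (assumption of the model).
    bool acceptsAGPR;
};

struct VirtReg {
    RegBank bank = RegBank::VGPR;
    std::vector<Use> uses;
};

// Every use must tolerate an AGPR; otherwise the rewrite would force
// a copy to be inserted somewhere else.
bool allUsesAcceptAGPR(const VirtReg &reg) {
    for (const Use &u : reg.uses)
        if (!u.acceptsAGPR)
            return false;
    return true;
}

// Try to move src2 into the AGPR bank. freeAGPRs models the physical
// AGPRs assumed to be free across the entire live range of src2.
// Returns the chosen AGPR index, or std::nullopt if the rewrite is
// rejected; on rejection nothing is modified.
std::optional<unsigned> tryRewriteSrc2(VirtReg &src2,
                                       std::set<unsigned> &freeAGPRs) {
    assert(src2.bank == RegBank::VGPR && "expected a VGPR candidate");
    if (!allUsesAcceptAGPR(src2))
        return std::nullopt;
    if (freeAGPRs.empty())
        return std::nullopt;
    unsigned phys = *freeAGPRs.begin(); // lowest-numbered free AGPR
    freeAGPRs.erase(freeAGPRs.begin());
    src2.bank = RegBank::AGPR;
    return phys;
}
```

The sketch rejects the rewrite outright when any single use is
incompatible, which mirrors the all-or-nothing nature of rewriting
a full chain at once.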


