[all-commits] [llvm/llvm-project] 58c9da: [AMDGPU] Deallocate VGPRs before exiting in dynami...

Diana Picus via All-commits all-commits at lists.llvm.org
Tue Mar 18 04:28:16 PDT 2025


  Branch: refs/heads/users/rovka/dvgpr-3
  Home:   https://github.com/llvm/llvm-project
  Commit: 58c9daabc6dbe8bbb21050ee0ead663fbb54fd50
      https://github.com/llvm/llvm-project/commit/58c9daabc6dbe8bbb21050ee0ead663fbb54fd50
  Author: Diana Picus <Diana-Magda.Picus at amd.com>
  Date:   2025-03-18 (Tue, 18 Mar 2025)

  Changed paths:
    M llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
    A llvm/test/CodeGen/AMDGPU/release-vgprs-gfx12.mir

  Log Message:
  -----------
  [AMDGPU] Deallocate VGPRs before exiting in dynamic VGPR mode

In dynamic VGPR mode, waves must deallocate all VGPRs before exiting. If
the shader program does not do this, the hardware inserts `S_ALLOC_VGPR 0`
before `S_ENDPGM`, but this may incur some performance cost. It is
therefore better for the compiler to generate that instruction proactively.

This patch extends `si-insert-waitcnts` to deallocate the VGPRs via
an `S_ALLOC_VGPR 0` before any `S_ENDPGM` when in dynamic VGPR mode.
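
As a rough illustration of the transformation described above, here is a
minimal sketch on a toy instruction list. It is not the code in
SIInsertWaitcnts.cpp and does not use the LLVM MachineInstr API. The
`ToyBlock` type, the `insertVgprDealloc` helper, the `dynamicVgprMode`
flag, and the representation of instructions as plain opcode strings are
all assumptions made for this example.

```cpp
#include <string>
#include <vector>

// Toy stand-in for a machine basic block: one opcode string per instruction.
// Assumption for illustration only; this is NOT the LLVM MachineInstr API.
using ToyBlock = std::vector<std::string>;

// Hypothetical helper: return a copy of `block` with `S_ALLOC_VGPR 0`
// placed immediately before every `S_ENDPGM`. The block is returned
// unchanged when dynamic VGPR mode is off.
ToyBlock insertVgprDealloc(const ToyBlock &block, bool dynamicVgprMode) {
  if (!dynamicVgprMode)
    return block;

  ToyBlock result;
  result.reserve(block.size() + 1);
  for (const std::string &inst : block) {
    // Release the VGPRs explicitly right before the program ends.
    if (inst == "S_ENDPGM")
      result.push_back("S_ALLOC_VGPR 0");
    result.push_back(inst);
  }
  return result;
}
```

With dynamic VGPR mode on, a toy block of `V_MOV_B32` followed by
`S_ENDPGM` gains one `S_ALLOC_VGPR 0` between the two. With the mode off,
the block comes back unmodified.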


  Commit: 6b7d1740c5c794040999cb36d62be565aebb5b91
      https://github.com/llvm/llvm-project/commit/6b7d1740c5c794040999cb36d62be565aebb5b91
  Author: Diana Picus <diana-magda.picus at amd.com>
  Date:   2025-03-18 (Tue, 18 Mar 2025)

  Changed paths:
    M llvm/test/CodeGen/AMDGPU/release-vgprs-gfx12.mir

  Log Message:
  -----------
  Remove useless -O2 from test


Compare: https://github.com/llvm/llvm-project/compare/3d09c944c256...6b7d1740c5c7
