[all-commits] [llvm/llvm-project] 58c9da: [AMDGPU] Deallocate VGPRs before exiting in dynami...
Diana Picus via All-commits
all-commits at lists.llvm.org
Tue Mar 18 04:28:16 PDT 2025
Branch: refs/heads/users/rovka/dvgpr-3
Home: https://github.com/llvm/llvm-project
Commit: 58c9daabc6dbe8bbb21050ee0ead663fbb54fd50
https://github.com/llvm/llvm-project/commit/58c9daabc6dbe8bbb21050ee0ead663fbb54fd50
Author: Diana Picus <Diana-Magda.Picus at amd.com>
Date: 2025-03-18 (Tue, 18 Mar 2025)
Changed paths:
M llvm/lib/Target/AMDGPU/SIInsertWaitcnts.cpp
A llvm/test/CodeGen/AMDGPU/release-vgprs-gfx12.mir
Log Message:
-----------
[AMDGPU] Deallocate VGPRs before exiting in dynamic VGPR mode
In dynamic VGPR mode, waves must deallocate all VGPRs before exiting. If
the shader program does not do this, hardware inserts `S_ALLOC_VGPR 0`
before `S_ENDPGM`, but this may incur some performance cost. Therefore
it's better if the compiler proactively generates that instruction.

This patch extends `si-insert-waitcnts` to deallocate the VGPRs via
an `S_ALLOC_VGPR 0` before any `S_ENDPGM` when in dynamic VGPR mode.
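
The transformation described above can be illustrated with a small standalone sketch. This is not the code from `SIInsertWaitcnts.cpp`: `ToyInst` and `releaseVGPRsBeforeExit` are hypothetical names, and instructions are modeled as plain strings rather than LLVM machine instructions. It only shows the idea of placing `S_ALLOC_VGPR 0` in front of every `S_ENDPGM` when a dynamic-VGPR flag is set.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for a machine instruction; not an LLVM type.
struct ToyInst {
  std::string Opcode;
  int Imm;
};

// Insert "S_ALLOC_VGPR 0" before each "S_ENDPGM" when DynamicVGPR is set.
// Returns true if the instruction list was modified.
bool releaseVGPRsBeforeExit(std::vector<ToyInst> &Block, bool DynamicVGPR) {
  if (!DynamicVGPR)
    return false; // Outside dynamic VGPR mode nothing is inserted.

  bool Changed = false;
  for (std::size_t I = 0; I < Block.size(); ++I) {
    if (Block[I].Opcode != "S_ENDPGM")
      continue;
    // Place the deallocation immediately before the program end.
    Block.insert(Block.begin() + static_cast<std::ptrdiff_t>(I),
                 ToyInst{"S_ALLOC_VGPR", 0});
    ++I; // Step past the S_ENDPGM that was shifted one slot to the right.
    Changed = true;
  }
  return Changed;
}
```

The boolean return value mirrors the common convention of a pass reporting whether it changed anything; whether the real pass is structured this way is an assumption here.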
Commit: 6b7d1740c5c794040999cb36d62be565aebb5b91
https://github.com/llvm/llvm-project/commit/6b7d1740c5c794040999cb36d62be565aebb5b91
Author: Diana Picus <diana-magda.picus at amd.com>
Date: 2025-03-18 (Tue, 18 Mar 2025)
Changed paths:
M llvm/test/CodeGen/AMDGPU/release-vgprs-gfx12.mir
Log Message:
-----------
Remove useless -O2 from test
Compare: https://github.com/llvm/llvm-project/compare/3d09c944c256...6b7d1740c5c7