[PATCH] D89187: [AMDGPU] Minimize number of s_mov generated by copyPhysReg
Carl Ritson via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sun Oct 11 18:24:06 PDT 2020
critson added a comment.
To motivate the peephole.
This pattern affects 2% of graphics shaders on GFX9, and nearly 7% on GFX10.
On average we save ~1.5 instructions per affected shader.
On some VulkanCTS tests the savings are much higher.
Given the relatively low gain, I assumed it was not worth introducing a new peephole pass, and took this approach to address the duplicate s_mov instructions at the point of generation (when they are cheapest to spot).
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D89187/new/
https://reviews.llvm.org/D89187