[PATCH] D28874: [AMDGPU] Add VGPR copies post regalloc fix pass
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Jan 18 15:02:15 PST 2017
rampitec created this revision.
Herald added a reviewer: tstellarAMD.
Herald added subscribers: tony-tye, yaxunl, mgorny, nhaehnle, wdng, kzhuravl, arsenm.
Regalloc creates COPY instructions which do not formally use VALU.
That results in v_mov instructions displaced after exec mask modification.
One pass which do it is SIOptimizeExecMasking, but potentially it can be
done by other passes too.
This patch adds a pass immediately after regalloc to add implicit exec
use operand to all VGPR copy instructions.
Repository:
rL LLVM
https://reviews.llvm.org/D28874
Files:
lib/Target/AMDGPU/AMDGPU.h
lib/Target/AMDGPU/AMDGPUTargetMachine.cpp
lib/Target/AMDGPU/CMakeLists.txt
lib/Target/AMDGPU/SIFixVGPRCopies.cpp
test/CodeGen/AMDGPU/fix-vgpr-copies.mir
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D28874.84892.patch
Type: text/x-patch
Size: 5373 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20170118/adf30935/attachment.bin>
More information about the llvm-commits
mailing list