[PATCH] D76033: [AMDGPU] Simplify nested SI_END_CF

Stanislav Mekhanoshin via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Mar 11 16:00:11 PDT 2020


rampitec created this revision.
rampitec added reviewers: arsenm, sameerds, AlexVlx.
Herald added subscribers: kerbowa, hiraditya, t-tye, tpr, dstuttard, yaxunl, nhaehnle, wdng, jvesely, kzhuravl.
Herald added a project: LLVM.

This is to replace the optimization from the SIOptimizeExecMaskingPreRA.
We have less opportunities in the control flow lowering because many
VGPR copies are still in place and will be removed later, but we know
for sure an instruction is SI_END_CF and not just an arbitrary S_OR_B64
with EXEC.

The subsequent change needs to convert s_and_saveexec into s_and and
address new TODO lines in tests, then code block guarded by the
-amdgpu-remove-redundant-endcf option in the pre-RA exec mask optimizer
will be removed.


https://reviews.llvm.org/D76033

Files:
  llvm/lib/Target/AMDGPU/SILowerControlFlow.cpp
  llvm/test/CodeGen/AMDGPU/collapse-endcf.ll
  llvm/test/CodeGen/AMDGPU/collapse-endcf.mir
  llvm/test/CodeGen/AMDGPU/mul24-pass-ordering.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D76033.249786.patch
Type: text/x-patch
Size: 51665 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20200311/ab9bfba1/attachment.bin>


More information about the llvm-commits mailing list