[PATCH] D54411: [Codegen] Merge tail blocks with no successors after block placement

Jim Lin via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Jun 13 01:23:06 PDT 2019


Jim added inline comments.


================
Comment at: lib/CodeGen/BranchFolding.cpp:1074
+  if (!EnableTailMerge)
+    return MadeChange;
 
----------------
This change is it neccessary?


================
Comment at: test/CodeGen/X86/tail-opts.ll:439
+  %1 = icmp eq i32 undef, 0
+  br i1 %1, label %return, label %bb12
 
----------------
RKSimon wrote:
> dmgreen wrote:
> > Can you explain this change? The comment makes it look like it should be the same test as above.
> > 
> > But it somehow has different fall-through branches with the switches?
> icmp with undef will now constant fold in SelectionDAG - you're probably being affected by that
This testcase tests that two `tail_call_me` can't be merged into one for performance issue.
But bby and bbx are likely the same. This patch makes bby and bbx merge into one. 
And make `tail_call_me` only need one. 
So the two comparison of switch in bby and bbx should be changed difference.

This is my testcase to keep `tail_call_me` tail-duplicated.

```
define void @two_nosize(i32 %x, i32 %y, i32 %z) nounwind {
entry:
  %0 = icmp eq i32 %x, 0
  br i1 %0, label %bbx, label %bby

bby:
  switch i32 %y, label %bb7 [
    i32 0, label %return
  ]

bb7:
  store volatile i32 0, i32* @XYZ
  tail call void @tail_call_me()
  ret void

bbx:
  switch i32 %z, label %bb12 [
    i32 -1, label %return
  ]

bb12:
  store volatile i32 0, i32* @XYZ
  tail call void @tail_call_me()
  ret void

return:
  ret void
}

```
Let two comparison compare with 0 or -1. So that tail call `tail_call_me` can be put on the fall through edge.



CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D54411/new/

https://reviews.llvm.org/D54411





More information about the llvm-commits mailing list