[llvm] r307498 - [PM] Fix a nasty bug in the new PM where we failed to properly

Mon Jul 10 10:39:59 PDT 2017

On Mon, Jul 10, 2017 at 1:28 PM David Blaikie via llvm-commits <
llvm-commits at lists.llvm.org> wrote:

> On Sun, Jul 9, 2017 at 6:45 AM Chandler Carruth via llvm-commits <
> llvm-commits at lists.llvm.org> wrote:
>
>> Author: chandlerc
>> Date: Sun Jul  9 06:45:11 2017
>> New Revision: 307498
>>
>> URL: http://llvm.org/viewvc/llvm-project?rev=307498&view=rev
>> Log:
>> [PM] Fix a nasty bug in the new PM where we failed to properly
>> invalidation of analyses when merging SCCs.
>>
>> While I've added a bunch of testing of this, it takes something much
>> more like the inliner to really trigger this as you need to have
>> partially-analyzed SCCs with updates at just the right time.
>
>
> Why is that ^ not reproducible via a unit test with a dummy/stub
> analysis/transform/etc? Too much boilerplate to write those?
>

Yeah, it's definitely technically possible, just incredibly complicated and
requires huge amounts of boilerplate so it didn't seem like it would remain
an effective test going forward.

>
>> So I've
>> added a direct test for this using the inliner and verifying the
>> domtree. Without the changes here, this test ends up finding a stale
>> dominator tree.
>>
>> However, to handle this properly, we need to invalidate analyses
>> *before* merging the SCCs. After talking to Philip and Sanjoy about this
>> they convinced me this was the right approach. To do this, we need
>> a callback mechanism when merging SCCs so we can observe the cycle that
>> will be merged before the merge happens. This API update ended up being
>> surprisingly easy.
>>
>> With this commit, the new PM passes the test-suite again. It hadn't
>> since MemorySSA was enabled for EarlyCSE as that also will find this bug
>> very quickly.
>>
>> Modified:
>>     llvm/trunk/include/llvm/Analysis/LazyCallGraph.h
>>     llvm/trunk/lib/Analysis/CGSCCPassManager.cpp
>>     llvm/trunk/lib/Analysis/LazyCallGraph.cpp
>>     llvm/trunk/test/Transforms/Inline/cgscc-incremental-invalidate.ll
>>     llvm/trunk/unittests/Analysis/LazyCallGraphTest.cpp
>>
>> Modified: llvm/trunk/include/llvm/Analysis/LazyCallGraph.h
>> URL:
>> http://llvm.org/viewvc/llvm-project/llvm/trunk/include/llvm/Analysis/LazyCallGraph.h?rev=307498&r1=307497&r2=307498&view=diff
>>
>> ==============================================================================
>> --- llvm/trunk/include/llvm/Analysis/LazyCallGraph.h (original)
>> +++ llvm/trunk/include/llvm/Analysis/LazyCallGraph.h Sun Jul  9 06:45:11
>> 2017
>> @@ -652,17 +652,23 @@ public:
>>      /// Make an existing internal ref edge into a call edge.
>>      ///
>>      /// This may form a larger cycle and thus collapse SCCs into
>> TargetN's SCC.
>> -    /// If that happens, the deleted SCC pointers are returned. These
>> SCCs are
>> -    /// not in a valid state any longer but the pointers will remain
>> valid
>> -    /// until destruction of the parent graph instance for the purpose of
>> -    /// clearing cached information.
>> +    /// If that happens, the optional callback \p MergedCB will be
>> invoked (if
>> +    /// provided) on the SCCs being merged away prior to actually
>> performing
>> +    /// the merge. Note that this will never include the target SCC as
>> that
>> +    /// will be the SCC functions are merged into to resolve the cycle.
>> Once
>> +    /// this function returns, these merged SCCs are not in a valid
>> state but
>> +    /// the pointers will remain valid until destruction of the parent
>> graph
>> +    /// instance for the purpose of clearing cached information. This
>> function
>> +    /// also returns 'true' if a cycle was formed and some SCCs merged
>> away as
>> +    /// a convenience.
>>      ///
>>      /// After this operation, both SourceN's SCC and TargetN's SCC may
>> move
>>      /// position within this RefSCC's postorder list. Any SCCs merged are
>>      /// merged into the TargetN's SCC in order to preserve reachability
>> analyses
>>      /// which took place on that SCC.
>> -    SmallVector<SCC *, 1> switchInternalEdgeToCall(Node &SourceN,
>> -                                                   Node &TargetN);
>> +    bool switchInternalEdgeToCall(
>> +        Node &SourceN, Node &TargetN,
>> +        function_ref<void(ArrayRef<SCC *> MergedSCCs)> MergeCB = {});
>>
>>      /// Make an existing internal call edge between separate SCCs into a
>> ref
>>      /// edge.
>>
>> Modified: llvm/trunk/lib/Analysis/CGSCCPassManager.cpp
>> URL:
>> http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/Analysis/CGSCCPassManager.cpp?rev=307498&r1=307497&r2=307498&view=diff
>>
>> ==============================================================================
>> --- llvm/trunk/lib/Analysis/CGSCCPassManager.cpp (original)
>> +++ llvm/trunk/lib/Analysis/CGSCCPassManager.cpp Sun Jul  9 06:45:11 2017
>> @@ -570,25 +570,48 @@ LazyCallGraph::SCC &llvm::updateCGAndAna
>>      // Otherwise we are switching an internal ref edge to a call edge.
>> This
>>      // may merge away some SCCs, and we add those to the UpdateResult.
>> We also
>>      // need to make sure to update the worklist in the event SCCs have
>> moved
>> -    // before the current one in the post-order sequence.
>> +    // before the current one in the post-order sequence
>> +    bool HasFunctionAnalysisProxy = false;
>>      auto InitialSCCIndex = RC->find(*C) - RC->begin();
>> -    auto InvalidatedSCCs = RC->switchInternalEdgeToCall(N, *CallTarget);
>> -    if (!InvalidatedSCCs.empty()) {
>> +    bool FormedCycle = RC->switchInternalEdgeToCall(
>> +        N, *CallTarget, [&](ArrayRef<SCC *> MergedSCCs) {
>> +          for (SCC *MergedC : MergedSCCs) {
>> +            assert(MergedC != &TargetC && "Cannot merge away the target
>> SCC!");
>> +
>> +            HasFunctionAnalysisProxy |=
>> +                AM.getCachedResult<FunctionAnalysisManagerCGSCCProxy>(
>> +                    *MergedC) != nullptr;
>> +
>> +            // Mark that this SCC will no longer be valid.
>> +            UR.InvalidatedSCCs.insert(MergedC);
>> +
>> +            // FIXME: We should really do a 'clear' here to forcibly
>> release
>> +            // memory, but we don't have a good way of doing that and
>> +            // preserving the function analyses.
>> +            auto PA =
>> PreservedAnalyses::allInSet<AllAnalysesOn<Function>>();
>> +            PA.preserve<FunctionAnalysisManagerCGSCCProxy>();
>> +            AM.invalidate(*MergedC, PA);
>> +          }
>> +        });
>> +
>> +    // If we formed a cycle by creating this call, we need to update
>> more data
>> +    // structures.
>> +    if (FormedCycle) {
>>        C = &TargetC;
>>        assert(G.lookupSCC(N) == C && "Failed to update current SCC!");
>>
>> -      // Any analyses cached for this SCC are no longer precise as the
>> shape
>> -      // has changed by introducing this cycle.
>> -      AM.invalidate(*C, PreservedAnalyses::none());
>> +      // If one of the invalidated SCCs had a cached proxy to a function
>> +      // analysis manager, we need to create a proxy in the new current
>> SCC as
>> +      // the invaliadted SCCs had their functions moved.
>> +      if (HasFunctionAnalysisProxy)
>> +        AM.getResult<FunctionAnalysisManagerCGSCCProxy>(*C, G);
>>
>> -      for (SCC *InvalidatedC : InvalidatedSCCs) {
>> -        assert(InvalidatedC != C && "Cannot invalidate the current
>> SCC!");
>> -        UR.InvalidatedSCCs.insert(InvalidatedC);
>> -
>> -        // Also clear any cached analyses for the SCCs that are dead.
>> This
>> -        // isn't really necessary for correctness but can release memory.
>> -        AM.clear(*InvalidatedC);
>> -      }
>> +      // Any analyses cached for this SCC are no longer precise as the
>> shape
>> +      // has changed by introducing this cycle. However, we have taken
>> care to
>> +      // update the proxies so it remains valide.
>> +      auto PA = PreservedAnalyses::allInSet<AllAnalysesOn<Function>>();
>> +      PA.preserve<FunctionAnalysisManagerCGSCCProxy>();
>> +      AM.invalidate(*C, PA);
>>      }
>>      auto NewSCCIndex = RC->find(*C) - RC->begin();
>>      if (InitialSCCIndex < NewSCCIndex) {
>>
>> Modified: llvm/trunk/lib/Analysis/LazyCallGraph.cpp
>> URL:
>> http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/Analysis/LazyCallGraph.cpp?rev=307498&r1=307497&r2=307498&view=diff
>>
>> ==============================================================================
>> --- llvm/trunk/lib/Analysis/LazyCallGraph.cpp (original)
>> +++ llvm/trunk/lib/Analysis/LazyCallGraph.cpp Sun Jul  9 06:45:11 2017
>> @@ -456,8 +456,10 @@ updatePostorderSequenceForEdgeInsertion(
>>    return make_range(SCCs.begin() + SourceIdx, SCCs.begin() + TargetIdx);
>>  }
>>
>> -SmallVector<LazyCallGraph::SCC *, 1>
>> -LazyCallGraph::RefSCC::switchInternalEdgeToCall(Node &SourceN, Node
>> &TargetN) {
>> +bool
>> +LazyCallGraph::RefSCC::switchInternalEdgeToCall(
>> +    Node &SourceN, Node &TargetN,
>> +    function_ref<void(ArrayRef<SCC *> MergeSCCs)> MergeCB) {
>>    assert(!(*SourceN)[TargetN].isCall() && "Must start with a ref edge!");
>>    SmallVector<SCC *, 1> DeletedSCCs;
>>
>> @@ -475,7 +477,7 @@ LazyCallGraph::RefSCC::switchInternalEdg
>>    // we've just added more connectivity.
>>    if (&SourceSCC == &TargetSCC) {
>>      SourceN->setEdgeKind(TargetN, Edge::Call);
>> -    return DeletedSCCs;
>> +    return false; // No new cycle.
>>    }
>>
>>    // At this point we leverage the postorder list of SCCs to detect when
>> the
>> @@ -488,7 +490,7 @@ LazyCallGraph::RefSCC::switchInternalEdg
>>    int TargetIdx = SCCIndices[&TargetSCC];
>>    if (TargetIdx < SourceIdx) {
>>      SourceN->setEdgeKind(TargetN, Edge::Call);
>> -    return DeletedSCCs;
>> +    return false; // No new cycle.
>>    }
>>
>>    // Compute the SCCs which (transitively) reach the source.
>> @@ -555,12 +557,16 @@ LazyCallGraph::RefSCC::switchInternalEdg
>>        SourceSCC, TargetSCC, SCCs, SCCIndices, ComputeSourceConnectedSet,
>>        ComputeTargetConnectedSet);
>>
>> +  // Run the user's callback on the merged SCCs before we actually merge
>> them.
>> +  if (MergeCB)
>> +    MergeCB(makeArrayRef(MergeRange.begin(), MergeRange.end()));
>> +
>>    // If the merge range is empty, then adding the edge didn't actually
>> form any
>>    // new cycles. We're done.
>>    if (MergeRange.begin() == MergeRange.end()) {
>>      // Now that the SCC structure is finalized, flip the kind to call.
>>      SourceN->setEdgeKind(TargetN, Edge::Call);
>> -    return DeletedSCCs;
>> +    return false; // No new cycle.
>>    }
>>
>>  #ifndef NDEBUG
>> @@ -596,8 +602,8 @@ LazyCallGraph::RefSCC::switchInternalEdg
>>    // Now that the SCC structure is finalized, flip the kind to call.
>>    SourceN->setEdgeKind(TargetN, Edge::Call);
>>
>> -  // And we're done!
>> -  return DeletedSCCs;
>> +  // And we're done, but we did form a new cycle.
>> +  return true;
>>  }
>>
>>  void LazyCallGraph::RefSCC::switchTrivialInternalEdgeToRef(Node &SourceN,
>>
>> Modified:
>> llvm/trunk/test/Transforms/Inline/cgscc-incremental-invalidate.ll
>> URL:
>> http://llvm.org/viewvc/llvm-project/llvm/trunk/test/Transforms/Inline/cgscc-incremental-invalidate.ll?rev=307498&r1=307497&r2=307498&view=diff
>>
>> ==============================================================================
>> --- llvm/trunk/test/Transforms/Inline/cgscc-incremental-invalidate.ll
>> (original)
>> +++ llvm/trunk/test/Transforms/Inline/cgscc-incremental-invalidate.ll Sun
>> Jul  9 06:45:11 2017
>> @@ -127,3 +127,80 @@ entry:
>>    ret void
>>  ; CHECK: ret void
>>  }
>> +
>> +; The 'test2_' prefixed code works to carefully trigger forming an SCC
>> with
>> +; a dominator tree for one of the functions but not the other and
>> without even
>> +; a function analysis manager proxy for the SCC that things get merged
>> into.
>> +; Without proper handling when updating the call graph this will find a
>> stale
>> +; dominator tree.
>> +
>> + at test2_global = external global i32, align 4
>> +
>> +define void @test2_hoge(i1 (i32*)* %arg) {
>> +; CHECK-LABEL: define void @test2_hoge(
>> +bb:
>> +  %tmp2 = call zeroext i1 %arg(i32* @test2_global)
>> +; CHECK: call zeroext i1 %arg(
>> +  br label %bb3
>> +
>> +bb3:
>> +  %tmp5 = call zeroext i1 %arg(i32* @test2_global)
>> +; CHECK: call zeroext i1 %arg(
>> +  br i1 %tmp5, label %bb3, label %bb6
>> +
>> +bb6:
>> +  ret void
>> +}
>> +
>> +define zeroext i1 @test2_widget(i32* %arg) {
>> +; CHECK-LABEL: define zeroext i1 @test2_widget(
>> +bb:
>> +  %tmp1 = alloca i8, align 1
>> +  %tmp2 = alloca i32, align 4
>> +  call void @test2_quux()
>> +; CHECK-NOT:     call
>> +;
>> +; CHECK:         call zeroext i1 @test2_widget(i32* @test2_global)
>> +; CHECK-NEXT:    br label %[[NEW_BB:.*]]
>> +;
>> +; CHECK:       [[NEW_BB]]:
>> +; CHECK-NEXT:    call zeroext i1 @test2_widget(i32* @test2_global)
>> +;
>> +; CHECK:       {{.*}}:
>> +
>> +  call void @test2_hoge.1(i32* %arg)
>> +; CHECK-NEXT:    call void @test2_hoge.1(
>> +
>> +  %tmp4 = call zeroext i1 @test2_barney(i32* %tmp2)
>> +  %tmp5 = zext i1 %tmp4 to i32
>> +  store i32 %tmp5, i32* %tmp2, align 4
>> +  %tmp6 = call zeroext i1 @test2_barney(i32* null)
>> +  call void @test2_ham(i8* %tmp1)
>> +; CHECK:         call void @test2_ham(
>> +
>> +  call void @test2_quux()
>> +; CHECK-NOT:     call
>> +;
>> +; CHECK:         call zeroext i1 @test2_widget(i32* @test2_global)
>> +; CHECK-NEXT:    br label %[[NEW_BB:.*]]
>> +;
>> +; CHECK:       [[NEW_BB]]:
>> +; CHECK-NEXT:    call zeroext i1 @test2_widget(i32* @test2_global)
>> +;
>> +; CHECK:       {{.*}}:
>> +  ret i1 true
>> +; CHECK-NEXT:    ret i1 true
>> +}
>> +
>> +define internal void @test2_quux() {
>> +; CHECK-NOT: @test2_quux
>> +bb:
>> +  call void @test2_hoge(i1 (i32*)* @test2_widget)
>> +  ret void
>> +}
>> +
>> +declare void @test2_hoge.1(i32*)
>> +
>> +declare zeroext i1 @test2_barney(i32*)
>> +
>> +declare void @test2_ham(i8*)
>>
>> Modified: llvm/trunk/unittests/Analysis/LazyCallGraphTest.cpp
>> URL:
>> http://llvm.org/viewvc/llvm-project/llvm/trunk/unittests/Analysis/LazyCallGraphTest.cpp?rev=307498&r1=307497&r2=307498&view=diff
>>
>> ==============================================================================
>> --- llvm/trunk/unittests/Analysis/LazyCallGraphTest.cpp (original)
>> +++ llvm/trunk/unittests/Analysis/LazyCallGraphTest.cpp Sun Jul  9
>> 06:45:11 2017
>> @@ -1277,9 +1277,10 @@ TEST(LazyCallGraphTest, InternalEdgeMuta
>>    // be invalidated.
>>    LazyCallGraph::SCC &AC = *CG.lookupSCC(A);
>>    LazyCallGraph::SCC &CC = *CG.lookupSCC(C);
>> -  auto InvalidatedSCCs = RC.switchInternalEdgeToCall(A, C);
>> -  ASSERT_EQ(1u, InvalidatedSCCs.size());
>> -  EXPECT_EQ(&AC, InvalidatedSCCs[0]);
>> +  EXPECT_TRUE(RC.switchInternalEdgeToCall(A, C,
>> [&](ArrayRef<LazyCallGraph::SCC *> MergedCs) {
>> +    ASSERT_EQ(1u, MergedCs.size());
>> +    EXPECT_EQ(&AC, MergedCs[0]);
>> +  }));
>>    EXPECT_EQ(2, CC.size());
>>    EXPECT_EQ(&CC, CG.lookupSCC(A));
>>    EXPECT_EQ(&CC, CG.lookupSCC(C));
>> @@ -1586,8 +1587,7 @@ TEST(LazyCallGraphTest, InternalRefEdgeT
>>
>>    // Switch the ref edge from A -> D to a call edge. This should have no
>>    // effect as it is already in postorder and no new cycles are formed.
>> -  auto MergedCs = RC.switchInternalEdgeToCall(A, D);
>> -  EXPECT_EQ(0u, MergedCs.size());
>> +  EXPECT_FALSE(RC.switchInternalEdgeToCall(A, D));
>>    ASSERT_EQ(4, RC.size());
>>    EXPECT_EQ(&DC, &RC[0]);
>>    EXPECT_EQ(&BC, &RC[1]);
>> @@ -1596,8 +1596,7 @@ TEST(LazyCallGraphTest, InternalRefEdgeT
>>
>>    // Switch B -> C to a call edge. This doesn't form any new cycles but
>> does
>>    // require reordering the SCCs.
>> -  MergedCs = RC.switchInternalEdgeToCall(B, C);
>> -  EXPECT_EQ(0u, MergedCs.size());
>> +  EXPECT_FALSE(RC.switchInternalEdgeToCall(B, C));
>>    ASSERT_EQ(4, RC.size());
>>    EXPECT_EQ(&DC, &RC[0]);
>>    EXPECT_EQ(&CC, &RC[1]);
>> @@ -1605,9 +1604,10 @@ TEST(LazyCallGraphTest, InternalRefEdgeT
>>    EXPECT_EQ(&AC, &RC[3]);
>>
>>    // Switch C -> B to a call edge. This forms a cycle and forces merging
>> SCCs.
>> -  MergedCs = RC.switchInternalEdgeToCall(C, B);
>> -  ASSERT_EQ(1u, MergedCs.size());
>> -  EXPECT_EQ(&CC, MergedCs[0]);
>> +  EXPECT_TRUE(RC.switchInternalEdgeToCall(C, B,
>> [&](ArrayRef<LazyCallGraph::SCC *> MergedCs) {
>> +    ASSERT_EQ(1u, MergedCs.size());
>> +    EXPECT_EQ(&CC, MergedCs[0]);
>> +  }));
>>    ASSERT_EQ(3, RC.size());
>>    EXPECT_EQ(&DC, &RC[0]);
>>    EXPECT_EQ(&BC, &RC[1]);
>> @@ -1720,8 +1720,7 @@ TEST(LazyCallGraphTest, InternalRefEdgeT
>>    // Switch C3 -> B1 to a call edge. This doesn't form any new cycles
>> but does
>>    // require reordering the SCCs in the face of tricky internal node
>>    // structures.
>> -  auto MergedCs = RC.switchInternalEdgeToCall(C3, B1);
>> -  EXPECT_EQ(0u, MergedCs.size());
>> +  EXPECT_FALSE(RC.switchInternalEdgeToCall(C3, B1));
>>    ASSERT_EQ(8, RC.size());
>>    EXPECT_EQ(&DC, &RC[0]);
>>    EXPECT_EQ(&B3C, &RC[1]);
>> @@ -1852,10 +1851,12 @@ TEST(LazyCallGraphTest, InternalRefEdgeT
>>    // C   F      C   |  |
>>    //  \ /        \ /   |
>>    //   G          G    |
>> -  auto MergedCs = RC.switchInternalEdgeToCall(F, B);
>> -  ASSERT_EQ(2u, MergedCs.size());
>> -  EXPECT_EQ(&FC, MergedCs[0]);
>> -  EXPECT_EQ(&DC, MergedCs[1]);
>> +  EXPECT_TRUE(RC.switchInternalEdgeToCall(
>> +      F, B, [&](ArrayRef<LazyCallGraph::SCC *> MergedCs) {
>> +        ASSERT_EQ(2u, MergedCs.size());
>> +        EXPECT_EQ(&FC, MergedCs[0]);
>> +        EXPECT_EQ(&DC, MergedCs[1]);
>> +      }));
>>    EXPECT_EQ(3, BC.size());
>>
>>    // And make sure the postorder was updated.
>>
>>
>> _______________________________________________
>> llvm-commits mailing list
>> llvm-commits at lists.llvm.org
>> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-commits
>>
> _______________________________________________
> llvm-commits mailing list
> llvm-commits at lists.llvm.org
> http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-commits
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20170710/f2d9883b/attachment.html>