[llvm] [MemProf] Support cloning through recursive cycles (PR #127429)
Snehasish Kumar via llvm-commits
llvm-commits at lists.llvm.org
Tue Feb 18 15:03:38 PST 2025
================
@@ -3546,13 +3625,99 @@ void CallsiteContextGraph<DerivedCCG, FuncTy, CallTy>::identifyClones(
//
// Then check if by cloning node at least one of the callee edges will be
// disambiguated by splitting out different context ids.
+ //
+ // However, always do the cloning if this is a backedge, in which case we
+ // have not yet cloned along this caller edge.
assert(CallerEdge->AllocTypes != (uint8_t)AllocationType::None);
assert(Node->AllocTypes != (uint8_t)AllocationType::None);
- if (allocTypeToUse(CallerAllocTypeForAlloc) ==
+ if (!CallerEdge->IsBackedge &&
+ allocTypeToUse(CallerAllocTypeForAlloc) ==
allocTypeToUse(Node->AllocTypes) &&
allocTypesMatch<DerivedCCG, FuncTy, CallTy>(
- CalleeEdgeAllocTypesForCallerEdge, Node->CalleeEdges))
+ CalleeEdgeAllocTypesForCallerEdge, Node->CalleeEdges)) {
continue;
+ }
+
+ if (CallerEdge->IsBackedge) {
+ // We should only mark these if cloning recursive contexts, where we
+ // need to do this deferral.
+ assert(CloneRecursiveContexts);
+ DeferredBackedges++;
+ }
+
+ // If this is a backedge, we now do recursive cloning starting from its
+ // caller since we may have moved unambiguous caller contexts to a clone
+ // of this Node in a previous iteration of the current loop, giving more
+ // opportunity for cloning through the backedge. Because we sorted the
+ // caller edges earlier so that cold caller edges are first, we would have
+ // visited and cloned this node for any unamibiguously cold non-recursive
+ // callers before any ambiguous backedge callers. Note that we don't do this
+ // if the caller is already cloned or visited during cloning (e.g. via a
+ // different context path from the allocation).
+ // TODO: Can we do better in the case where the caller was already visited?
+ if (CallerEdge->IsBackedge && !CallerEdge->Caller->CloneOf &&
+ !Visited.count(CallerEdge->Caller)) {
+ auto OrigIdCount = CallerEdge->getContextIds().size();
+ // Now do the recursive cloning of this backedge's caller, which was
+ // deferred earlier.
+ identifyClones(CallerEdge->Caller, Visited, CallerEdgeContextsForAlloc);
+ removeNoneTypeCalleeEdges(CallerEdge->Caller);
+ // See if the recursive call to identifyClones moved the context ids to a
+ // new edge from this node to a clone of caller, and switch to looking at
+ // that new edge so that we clone Node for the new caller clone.
+ bool UpdatedEdge = false;
+ if (OrigIdCount > CallerEdge->getContextIds().size()) {
+ for (auto E : Node->CallerEdges) {
+ // Only interested in clones of the current edges caller.
+ if (E->Caller->CloneOf != CallerEdge->Caller)
+ continue;
+ // See if this edge contains any of the context ids originally on the
+ // current caller edge.
+ auto CallerEdgeContextsForAllocNew =
+ set_intersection(CallerEdgeContextsForAlloc, E->getContextIds());
+ if (CallerEdgeContextsForAllocNew.empty())
+ continue;
+ // Make sure we don't pick a previously existing caller edge of this
+ // Node, which would be processed on a different iteration of the
+ // outer loop over the saved CallerEdges.
+ if (std::find(CallerEdges.begin(), CallerEdges.end(), E) !=
+ CallerEdges.end())
+ continue;
+ // The CallerAllocTypeForAlloc and CalleeEdgeAllocTypesForCallerEdge
+ // are updated further below for all cases where we just invoked
+ // identifyClones recursively.
+ CallerEdgeContextsForAlloc.swap(CallerEdgeContextsForAllocNew);
+ CallerEdge = E;
+ UpdatedEdge = true;
+ break;
+ }
+ }
+ // If cloning removed this edge (and we didn't update it to a new edge
+ // above), we're done with this edge. It's possible we moved all of the
+ // context ids to an existing clone, in which case there's no need to do
+ // further processing for them.
+ if (CallerEdge->isRemoved())
+ continue;
+
+ // Now we need to update the information used for the cloning decisions
+ // further below, as we may have modified edges and their context ids.
+
+ // Note if we changed the CallerEdge above we would have already updated
+ // the context ids.
+ if (!UpdatedEdge) {
+ CallerEdgeContextsForAlloc = set_intersection(
+ CallerEdgeContextsForAlloc, CallerEdge->getContextIds());
+ if (CallerEdgeContextsForAlloc.empty())
+ continue;
+ }
+ // Update the other information that depends on the edges and on the now
+ // updated CallerEdgeContextsForAlloc.
+ CallerAllocTypeForAlloc = computeAllocType(CallerEdgeContextsForAlloc);
+ CalleeEdgeAllocTypesForCallerEdge.clear();
+ for (auto &CalleeEdge : Node->CalleeEdges)
----------------
snehasish wrote:
nit: braces for multiple lines?
https://github.com/llvm/llvm-project/pull/127429
More information about the llvm-commits
mailing list