
Finding the KT partition of a weighted graph in near-linear time

Simon Apers¹, Paweł Gawrychowski², Troy Lee³

¹ IRIF, CNRS, Paris. Email: smgapers@gmail.com
² Institute of Computer Science, University of Wrocław, Poland. Email: gawry@cs.uni.wroc.pl. Partially supported by the Bekker programme of the Polish National Agency for Academic Exchange (PPN/BEK/2020/1/00444).
³ Centre for Quantum Software and Information, University of Technology Sydney. Email: troyjlee@gmail.com. Supported in part by the Australian Research Council Grant No: DP200100950.
Abstract

In a breakthrough work, Kawarabayashi and Thorup (J. ACM'19) gave a near-linear time deterministic algorithm to compute the weight of a minimum cut in a simple graph $G=(V,E)$. A key component of this algorithm is finding the $(1+\varepsilon)$-KT partition of $G$, the coarsest partition $\{P_1,\ldots,P_k\}$ of $V$ such that for every non-trivial $(1+\varepsilon)$-near minimum cut with sides $\{S,\bar S\}$ it holds that $P_i$ is contained in either $S$ or $\bar S$, for $i=1,\ldots,k$. In this work we give a near-linear time randomized algorithm to find the $(1+\varepsilon)$-KT partition of a weighted graph. Our algorithm is quite different from that of Kawarabayashi and Thorup and builds on Karger's framework of tree-respecting cuts (J. ACM'00).

We describe a number of applications of the algorithm. (i) The algorithm makes progress towards a more efficient algorithm for constructing the polygon representation of the set of near-minimum cuts in a graph. This is a generalization of the cactus representation, and was initially described by Benczúr (FOCS'95). (ii) We improve the time complexity of a recent quantum algorithm for minimum cut in a simple graph in the adjacency list model from $\widetilde{\mathcal{O}}(n^{3/2})$ to $\widetilde{\mathcal{O}}(\sqrt{mn})$, when the graph has $n$ vertices and $m$ edges. (iii) We describe a new type of randomized algorithm for minimum cut in simple graphs with complexity $\mathcal{O}(m+n\log^6 n)$. For graphs that are not too sparse, this matches the complexity of the current best $\mathcal{O}(m+n\log^2 n)$ algorithm, which uses a different approach based on random contractions.

The key technical contribution of our work is the following. Given a weighted graph $G$ with $m$ edges and a spanning tree $T$ of $G$, consider the graph $H$ whose nodes are the edges of $T$, and where there is an edge between two nodes of $H$ iff the corresponding 2-respecting cut of $T$ is a non-trivial near-minimum cut of $G$. We give an $\mathcal{O}(m\log^4 n)$ time deterministic algorithm to compute a spanning forest of $H$.

1 Introduction

Given a weighted and undirected graph $G$ with $n$ vertices and $m$ edges,¹ the minimum cut problem is to find the minimum weight $\lambda(G)$ of a set of edges whose removal disconnects $G$. When $G$ is unweighted, this is simply the minimum number of edges whose removal disconnects $G$, also known as the edge connectivity of $G$. The minimum cut problem is a fundamental problem in theoretical computer science whose study goes back to at least the 1960s, when the first polynomial time algorithm computing edge connectivity was given by Gomory and Hu [GH61]. In the current state of the art, there are near-linear time randomized algorithms for the minimum cut problem in weighted graphs [Kar00, GMW20, MN20] and near-linear time deterministic algorithms in the case of simple graphs² [KT19, HRW20]. Very recently, Li [Li21] has given an almost-linear time (i.e. time $\mathcal{O}(m^{1+o(1)})$) deterministic algorithm for weighted graphs as well.

¹ Throughout this paper we will use $n$ and $m$ to denote the number of vertices and edges of the input graph.
² A simple graph is an unweighted graph with no self loops and at most one edge between any pair of vertices.

The best known algorithms for weighted graphs all rely on a framework developed by Karger [Kar00] which, for an input graph $G$, relies on finding $\mathcal{O}(\log n)$ spanning trees of $G$ such that with high probability one of these spanning trees will contain at most 2 edges from a minimum cut of $G$. In this case the cut is said to 2-respect the tree. A key insight of Karger is that, given a spanning tree $T$ of $G$, the problem of finding a 2-respecting cut of $T$ that has minimum weight in $G$ can be solved deterministically in near-linear time, specifically time $\mathcal{O}(m\log^2 n)$. After standing for 20 years, the bound for this minimum-weight 2-respecting cut problem was recently improved by Gawrychowski, Mozes, and Weimann [GMW20], who gave a deterministic $\mathcal{O}(m\log n)$ time algorithm, and independently by Mukhopadhyay and Nanongkai [MN20], who gave a randomized algorithm with complexity $\mathcal{O}(m\log n+n\log^4 n)$.

The best algorithms in the case of a simple graph $G$ rely on a quite different approach, pioneered by Kawarabayashi and Thorup [KT19]. This approach begins by finding the minimum degree $d$ of a vertex in $G$. Then the question becomes whether there is a non-trivial cut, i.e. a cut where both sides of the corresponding bipartition have cardinality at least 2, whose weight is less than $d$. This problem is solved by finding what we call the $(1+\varepsilon)$-KT partition of the graph. Let $\mathcal{B}_\varepsilon^{nt}(G)$ be the set of all bipartitions $\{S,\bar S\}$ of the vertex set corresponding to non-trivial cuts whose weight is at most $(1+\varepsilon)\lambda(G)$. The $(1+\varepsilon)$-KT partition of $G$ is the coarsest partition $\{P_1,\ldots,P_k\}$ of the vertex set such that for any $\{S,\bar S\}\in\mathcal{B}_\varepsilon^{nt}(G)$ it holds that $P_i$ is contained in either $S$ or $\bar S$, for each $i=1,\ldots,k$. If one considers the multigraph $G'$ formed from $G$ by identifying vertices in the same set $P_i$, then $G'$ preserves all non-trivial $(1+\varepsilon)$-near minimum cuts of $G$. Kawarabayashi and Thorup further show that for any $\varepsilon<1$ the graph $G'$ only has $\widetilde{\mathcal{O}}(n)$ edges. This bound crucially uses that the original graph is simple. The edge connectivity of $G$ is thus the minimum of $d$ and the edge connectivity of $G'$. One can use Gabow's deterministic $\mathcal{O}(\lambda m'\log n)$ edge connectivity algorithm [Gab95] for a multigraph with $m'$ edges and edge connectivity $\lambda$ to check in time $\widetilde{\mathcal{O}}(nd\log n)=\widetilde{\mathcal{O}}(m)$ whether the edge connectivity of $G'$ is less than $d$ and, if so, compute it.
In the most technical part of their work, Kawarabayashi and Thorup give a deterministic algorithm to find the $(1+\varepsilon)$-KT partition of a simple graph $G$ in time $\widetilde{\mathcal{O}}(m)$, giving an $\widetilde{\mathcal{O}}(m)$ time deterministic algorithm overall for edge connectivity. The key tool in their algorithm is the PageRank algorithm, which they use for finding low conductance cuts in the graph.

The KT partition has proven to be a very useful concept. Rubinstein, Schramm, and Weinberg [RSW18] also go through the $(1+\varepsilon)$-KT partition to give a near-optimal $\widetilde{\mathcal{O}}(n)$ randomized query algorithm determining the edge connectivity of a simple graph in the cut query model. In the cut query model one can query a subset $S$ of the vertices and receive in return the number of edges with exactly one endpoint in $S$. En route to their result, [RSW18] also improved the bound on the number of inter-component edges in the $(1+\varepsilon)$-KT partition of a simple graph to $\mathcal{O}(n)$, for any $\varepsilon<1$. In the case $\varepsilon=0$ this was independently done by Lo, Schmidt, and Thorup [LST20]. The KT partition approach is also used in the current best randomized algorithm for edge connectivity, which runs in time $\mathcal{O}(\min\{m+n\log^2 n,\, m\log n\})$ [GNT20].³

³ The bound quoted in [GNT20] is $\mathcal{O}(m+n\log^3 n)$, but the improvement to Karger's algorithm by [GMW20] reduces this to $\mathcal{O}(m+n\log^2 n)$.

1.1 Our results

In this work we give the first near-linear time randomized algorithm to find the $(1+\varepsilon)$-KT partition of a weighted graph, for $0\leq\varepsilon\leq 1/16$. An interesting aspect of our algorithm is that it uses Karger's 2-respecting cut framework to find the $(1+\varepsilon)$-KT partition, thereby combining the aforementioned major lines of work on the minimum cut problem. This makes progress on a number of problems.

  1.

    The polygon representation is a compact representation of the set of near-minimum cuts of a weighted graph, originally described by Benczúr [Ben95, Ben97] and Benczúr and Goemans [BG08]. It extends the cactus representation [DKL76], which only works for the set of exact minimum cuts, and has played a key role in recent breakthroughs on the traveling salesperson problem [GSS11, KKG21]. For a general weighted graph the polygon representation has size $\mathcal{O}(n^2)$, and Benczúr has given a randomized algorithm to construct a polygon representation of the $(1+\varepsilon)$-near mincuts of a graph in time $\mathcal{O}(n^{2(1+\varepsilon)})$ [Ben97, Section 6.3] by building on the Karger-Stein algorithm. It is an open question whether we can construct a polygon representation in time $\widetilde{\mathcal{O}}(n^2)$ for $\varepsilon>0$. In his thesis [Ben97, pg. 126], Benczúr says, "It already seems hard to directly identify the system of atoms within the $\widetilde{\mathcal{O}}(n^2)$ time bound," where the system of atoms is defined analogously to the $(1+\varepsilon)$-KT partition but for the set of all $(1+\varepsilon)$-near minimum cuts, not just the non-trivial ones. One can easily construct the set of atoms from a $(1+\varepsilon)$-KT partition, thus our KT partition algorithm gives an $\widetilde{\mathcal{O}}(m)$ time algorithm for this task as well, making progress on this open question.

  2.

    The $(1+\varepsilon)$-KT partition of a weighted graph is exactly what is needed to give an optimal quantum algorithm for minimum cut: Apers and Lee [AL21] showed that the quantum query and time complexity of minimum cut in the adjacency matrix model is $\widetilde{\Theta}(n^{3/2}\sqrt{\tau})$ for a weighted graph where the ratio of the largest to smallest edge weights is $\tau$, with the algorithm proceeding by finding a $(1+\varepsilon)$-KT partition.

    In the case where the graph is instead represented as an adjacency list, they gave an algorithm with query complexity $\widetilde{\mathcal{O}}(\sqrt{mn\tau})$ but whose running time is larger at $\widetilde{\mathcal{O}}(\sqrt{mn\tau}+n^{3/2})$. The bottleneck in the time complexity is the time taken to find a $(1+\varepsilon)$-KT partition of a weighted graph with $\widetilde{\mathcal{O}}(n)$ edges. Using the near-linear time randomized algorithm we give here to find a $(1+\varepsilon)$-KT partition improves the time complexity of this algorithm to $\widetilde{\mathcal{O}}(\sqrt{mn\tau})$, matching the query complexity. We detail the full algorithm in Section 6.1.

    Both quantum algorithms also used the following observation [AL21, Lemma 2]: if in a weighted graph $G$ the ratio of the largest edge weight to the smallest is $\tau$, then the total weight of inter-component edges in a $(1+\varepsilon)$-KT partition of $G$ for $\varepsilon<1$ is $\mathcal{O}(\tau n)$, which can be tight.

  3.

    The best randomized algorithm to compute the edge connectivity of a simple graph is the 2-out contraction approach of Ghaffari, Nowicki, and Thorup [GNT20], which has running time $\mathcal{O}(\min\{m+n\log^2 n,\, m\log n\})$. Using our algorithm to find a $(1+\varepsilon)$-KT partition in a weighted graph, we can follow Karger's 2-respecting tree approach to compute the edge connectivity of a simple graph in time $\mathcal{O}(m+n\log^6 n)$, thus also achieving the optimal bound on graphs that are not too sparse. We postpone details to Section 6.2.

Apart from these examples, we are hopeful that our near-optimal randomized algorithm for finding the KT partition of a weighted graph will find further applications.

In order to find a $(1+\varepsilon)$-KT partition in near-linear time, Apers and Lee [AL21] show that it suffices to solve the following problem in near-linear time. Let $G$ be a connected weighted graph and $T$ a spanning tree of $G$. Consider a graph $H$ whose nodes are the edges of $T$, and where two nodes $e,f$ of $H$ are connected by an edge iff the 2-respecting cut defined by $e,f$ is a non-trivial $(1+\varepsilon)$-near minimum cut of $G$. Then the problem is to find a spanning forest of $H$. Our main technical contribution is to give an $\mathcal{O}(m\log^4 n)$ time deterministic algorithm to solve this problem, where $m$ is the number of edges of the original graph $G$.

It is interesting to compare the problem of finding a spanning forest of $H$ with the original problem solved by Karger of finding a minimum-weight 2-respecting cut of $T$. To find a spanning forest of $H$ we potentially have to find $\Omega(n)$ many $(1+\varepsilon)$-near minimum cuts, which we accomplish with only an additional logarithmic overhead in the running time. The first insight into how this might be possible is to note that Karger's original algorithm to find the minimum-weight 2-respecting cut actually does something stronger than needed. Let $\mathrm{cost}(e,f)$ be the weight of the 2-respecting cut of $T$ defined by $\{e,f\}$. For every edge $e$ of $T$, Karger's algorithm attempts to find an $f^*\in\operatorname*{arg\,min}_f \mathrm{cost}(e,f)$. It does not always succeed in this task, but if the candidate $f'$ returned for edge $e$ is not such a minimizer, then for $f^*\in\operatorname*{arg\,min}_f \mathrm{cost}(e,f)$ it must be the case that the candidate $g$ returned for $f^*$ satisfies $\mathrm{cost}(f^*,g)\leq\mathrm{cost}(e,f^*)$. In this way, the algorithm still succeeds in finding a minimum-weight 2-respecting cut in the end.

In contrast, we give an algorithm that for every edge $e$ of $T$ actually finds

$$f^*\in\operatorname*{arg\,min}_f\,\{\mathrm{cost}(e,f) : \{e,f\}\text{ defines a non-trivial cut}\}.$$

We then show that this suffices to implement a round of Borůvka's spanning forest algorithm [NMN01] on $H$ in near-linear time. Borůvka's spanning forest algorithm consists of $\log n$ rounds and maintains the invariant of having a partition $\{S_1,\ldots,S_k\}$ of the vertex set and a spanning tree for each set $S_i$. The algorithm terminates when there is no outgoing edge from any set of the partition, at which point the collection of spanning trees for the sets of the partition is a spanning forest of $H$. The sets of the partition are initialized to be individual nodes of $H$.
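For concreteness, the round structure of Borůvka's algorithm can be sketched as follows. This is a minimal Python sketch over an explicit edge list; in our setting the edges of $H$ are never materialized, and each round instead discovers an outgoing edge per component via queries, but the round structure is the same.

```python
def boruvka_spanning_forest(num_nodes, edges):
    """Generic Boruvka: edges is a list of (u, v) pairs; returns a spanning forest.

    Each round picks one outgoing edge per component (H is unweighted here,
    so any outgoing edge will do) and merges components; the number of
    components at least halves per round, giving O(log n) rounds.
    """
    parent = list(range(num_nodes))

    def find(x):  # union-find with path halving
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    forest = []
    while True:
        # For each component root, remember one outgoing edge (if any).
        outgoing = {}
        for (u, v) in edges:
            ru, rv = find(u), find(v)
            if ru != rv:
                outgoing.setdefault(ru, (u, v))
                outgoing.setdefault(rv, (u, v))
        if not outgoing:
            break  # no outgoing edges: the forest is complete
        for (u, v) in outgoing.values():
            ru, rv = find(u), find(v)
            if ru != rv:  # re-check to avoid creating a cycle
                parent[ru] = rv
                forest.append((u, v))
    return forest
```
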

In each round of Borůvka's algorithm the goal is to find an outgoing edge from each set $S_i$ of the partition which has one. Consider a node $e$ of $H$ with $e\in S_i$. We can find the best partner $f$ for $e$ and check whether $\{e,f\}$ indeed gives rise to a non-trivial $(1+\varepsilon)$-near minimum cut and so is an edge of $H$. The problem is that $f$ could also be in $S_i$, in which case the edge $\{e,f\}$ is not an outgoing edge of $S_i$ as desired. To handle this, we maintain a data structure that allows us to find both the best partner $f$ for $e$, and also the best partner $f'$ for $e$ that lies in a different set of the partition from $f$. We call this operation a categorical top two query. If there actually is an edge of $H$ with one endpoint $e$ and the other endpoint outside of $S_i$, then either $\{e,f\}$ or $\{e,f'\}$ will be such an edge. Following the approach of [GMW20] to the minimum-weight 2-respecting cut problem, combined with an efficient data structure for handling categorical top two queries, we are able to do this for all nodes $e$ of $H$ in near-linear time, which allows us to implement a round of Borůvka's algorithm in near-linear time.

1.2 Technical overview

We now give a more detailed description of our main result. Let $G=(V,E,w)$ be a weighted graph, where $E$ is the set of edges and $w:E\rightarrow\mathbb{R}_+$ assigns a positive weight to each edge. For a set $S\subseteq V$ let $\Delta_G(S)$ be the set of all edges of $G$ with exactly one endpoint in $S$. A cut of $G$ is a set of edges of the form $\Delta_G(S)$ for some $\emptyset\neq S\subsetneq V$. We call $S$ and $\bar S$ the shores of the cut. Let $w(\Delta_G(S))=\sum_{e\in\Delta_G(S)}w(e)$. We use $\lambda(G)=\min_{\emptyset\neq S\subsetneq V}w(\Delta_G(S))$ for the minimum weight of a cut in $G$.

We will be interested in partitions of $V$ and the partial order on partitions induced by refinement. For two partitions $\mathcal{X},\mathcal{Y}$ of $V$ we say that $\mathcal{X}\preceq\mathcal{Y}$ iff for every $X\in\mathcal{X}$ there is a $Y\in\mathcal{Y}$ with $X\subseteq Y$. In this case we say $\mathcal{X}$ is a refinement of $\mathcal{Y}$. The meet of two partitions $\mathcal{X}$ and $\mathcal{Y}$, denoted $\mathcal{X}\wedge\mathcal{Y}$, is the partition $\mathcal{Z}$ such that $\mathcal{Z}\preceq\mathcal{X}$, $\mathcal{Z}\preceq\mathcal{Y}$, and for any other partition $\mathcal{W}$ satisfying these two conditions $\mathcal{W}\preceq\mathcal{Z}$. In other words, $\mathcal{X}\wedge\mathcal{Y}$ is the greatest lower bound of $\mathcal{X}$ and $\mathcal{Y}$ under $\preceq$. For a set of partitions $\mathcal{D}=\{\mathcal{D}_1,\ldots,\mathcal{D}_K\}$ we write $\bigwedge\mathcal{D}=\mathcal{D}_1\wedge\cdots\wedge\mathcal{D}_K$.
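As a concrete illustration of the meet, the following Python sketch computes $\bigwedge\mathcal{D}$ by labelling each element with the tuple of blocks containing it, one index per partition: two elements land in the same block of the meet iff they are together in every partition. This is the straightforward method, not the faster one needed later in the paper.

```python
from collections import defaultdict

def meet(partitions, universe):
    """Meet of a list of partitions of `universe` (each partition a list of
    blocks).  Runs in O(K * n) dictionary operations for K partitions."""
    # label[x] collects, for each partition, the index of the block containing x
    label = {x: [] for x in universe}
    for part in partitions:
        for i, block in enumerate(part):
            for x in block:
                label[x].append(i)
    # elements with identical label tuples form the blocks of the meet
    groups = defaultdict(set)
    for x, sig in label.items():
        groups[tuple(sig)].add(x)
    return list(groups.values())
```

For instance, the meet of $\{\{1,2,3\},\{4\}\}$ and $\{\{1,2\},\{3,4\}\}$ is $\{\{1,2\},\{3\},\{4\}\}$.
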

For our applications we need to consider not only minimum cuts, but also near-minimum cuts. For $\varepsilon\geq 0$, let $\mathcal{B}_\varepsilon(G)=\{\{S,\bar S\} : w(\Delta_G(S))\leq(1+\varepsilon)\lambda(G)\}$ be the set of all bipartitions of $V$ corresponding to $(1+\varepsilon)$-near minimum cuts. Let $\mathcal{B}_\varepsilon^{nt}(G)\subseteq\mathcal{B}_\varepsilon(G)$ be the set of all the non-trivial cuts in $\mathcal{B}_\varepsilon(G)$. The $(1+\varepsilon)$-KT partition of $G$ is exactly $\bigwedge\mathcal{B}_\varepsilon^{nt}(G)$.

Both $\bigwedge\mathcal{B}_\varepsilon(G)$ and $\bigwedge\mathcal{B}_\varepsilon^{nt}(G)$ are important sets for understanding the structure of (near-)minimum cuts in a graph. Consider first $\bigwedge\mathcal{B}_0(G)$, the meet of the set of all bipartitions corresponding to minimum cuts. This set arises in the cactus decomposition of $G$ [DKL76], a compact representation of all minimum cuts of $G$. A cactus is a connected multigraph where every edge appears in exactly one cycle. The edge connectivity of a cactus is 2, and the minimum cuts are obtained by removing any two edges from the same cycle. A cactus decomposition of a graph $G$ is a cactus $H$ on $\mathcal{O}(n)$ vertices and a mapping $\phi:V(G)\rightarrow V(H)$ such that $\Delta_G(\phi^{-1}(S))$ is a mincut of $G$ iff $\Delta_H(S)$ is a mincut of $H$. The mapping $\phi$ does not have to be injective, so multiple vertices of $G$ can map to the same vertex of $H$. In this case, however, the cactus decomposition property means that all vertices in $\phi^{-1}(\{v\})$ must be on the same side of every minimum cut of $G$, for every $v\in V(H)$. Thus as $v$ ranges over $V(H)$ the sets $\phi^{-1}(\{v\})$ give the elements of $\bigwedge\mathcal{B}_0(G)$ (note that $\phi^{-1}(\{v\})$ can also be empty). A cactus decomposition of a weighted graph can be constructed by a randomized algorithm in near-linear time [KP09], thus this also gives a near-linear time randomized algorithm to compute $\bigwedge\mathcal{B}_0(G)$.

Lo, Schmidt, and Thorup [LST20] give a version of the cactus decomposition that only represents the non-trivial minimum cuts. In fact, they give a deterministic $\mathcal{O}(n)$ time algorithm that converts a standard cactus into one representing the non-trivial minimum cuts. Combined with the near-linear time algorithm to compute a cactus decomposition, this gives a near-linear time randomized algorithm to compute $\bigwedge\mathcal{B}_0^{nt}(G)$ as well.

The situation changes once we go to near-minimum cuts, which can no longer be represented by a cactus, but require the deformable polygon representation from [Ben95, Ben97, BG08]. This construction is fairly intricate, and the best known randomized algorithm to construct a deformable polygon representation of the $(1+\varepsilon)$-near mincuts of a graph builds on the Karger-Stein algorithm and takes time $\mathcal{O}(n^{2(1+\varepsilon)})$ [Ben97, Section 6.3]. A prerequisite to constructing a deformable polygon representation is being able to compute $\bigwedge\mathcal{B}_\varepsilon(G)$ as, analogously to the case of a cactus, these sets will be the "atoms" that label regions of the polygons.

Our main result in this work is a randomized algorithm to compute $\bigwedge\mathcal{B}_\varepsilon(G)$ and $\bigwedge\mathcal{B}_\varepsilon^{nt}(G)$ in time $\mathcal{O}(m\log^5 n)$.

Theorem 1.

Let $G=(V,E,w)$ be a graph with $n$ vertices and $m$ edges. For $0\leq\varepsilon\leq 1/16$ let $\mathcal{B}_\varepsilon=\{\{S,\bar S\} : w(\Delta(S))\leq(1+\varepsilon)\lambda(G)\}$ and let $\mathcal{B}_\varepsilon^{nt}\subseteq\mathcal{B}_\varepsilon$ be the subset of $\mathcal{B}_\varepsilon$ containing only non-trivial cuts. Both $\bigwedge\mathcal{B}_\varepsilon$ and $\bigwedge\mathcal{B}_\varepsilon^{nt}$ can be computed with high probability by a randomized algorithm with running time $\mathcal{O}(m\log^5 n)$.

In the rest of this introduction we focus on computing $\bigwedge\mathcal{B}_\varepsilon^{nt}$. It is easy to construct $\bigwedge\mathcal{B}_\varepsilon$ from $\bigwedge\mathcal{B}_\varepsilon^{nt}$ deterministically in $\mathcal{O}(n)$ time.

The first obstacle we face in designing a near-linear time algorithm to compute the meet of $\mathcal{B}_\varepsilon^{nt}$ is that the number of near-minimum cuts in $G$ can be $\Omega(n^2)$, so we cannot afford to consider all of them. An idea to get around this is to try the following:

  1.

    Efficiently find a "small" subset $\mathcal{B}'\subseteq\mathcal{B}_\varepsilon^{nt}$ such that $\bigwedge\mathcal{B}'=\bigwedge\mathcal{B}_\varepsilon^{nt}$. We call such a subset a generating set.

A greedy argument shows that such a subset $\mathcal{B}'$ of size at most $n-1$ exists. We initialize $\mathcal{B}'=\{\{S,\bar S\}\}$ for some element $\{S,\bar S\}$ of $\mathcal{B}_\varepsilon^{nt}$. We then iterate through the elements $\{T,\bar T\}$ of $\mathcal{B}_\varepsilon^{nt}$, adding $\{T,\bar T\}$ to $\mathcal{B}'$ iff $\bigwedge(\mathcal{B}'\cup\{\{T,\bar T\}\})\neq\bigwedge\mathcal{B}'$. Each bipartition added to $\mathcal{B}'$ increases the number of elements of $\bigwedge\mathcal{B}'$ by at least 1. As $\bigwedge\mathcal{B}'$ begins with 2 elements and can have at most $n$, the total number of bipartitions at termination is at most $n-1$. This shows that a small generating set exists, but there still remains the problem of finding such a generating set efficiently.
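The greedy argument can be made concrete with a short Python sketch. The helper names `meet2` and `greedy_generating_set` are ours, purely for illustration; this captures the existence argument, not an efficient algorithm.

```python
def meet2(p1, p2):
    """Meet of two partitions, each given as a list of frozensets."""
    return [a & b for a in p1 for b in p2 if a & b]

def greedy_generating_set(bipartitions, universe):
    """Keep a bipartition (S, Sbar) only if it strictly refines the running
    meet.  Each kept bipartition adds at least one block; the meet starts
    with 1 block and ends with at most n, so at most n - 1 are kept."""
    current = [frozenset(universe)]
    kept = []
    for (S, Sbar) in bipartitions:
        candidate = meet2(current, [frozenset(S), frozenset(Sbar)])
        if len(candidate) > len(current):
            kept.append((S, Sbar))
            current = candidate
    return kept, current
```

Redundant bipartitions, such as a repeat of one already processed, are discarded because they do not increase the number of blocks.
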

Assuming we get past the first obstacle, there remains a second one. The most straightforward algorithm to compute the meet of $k$ partitions of a set of size $n$ takes time $\Theta(kn\log n)$, which is again too slow if $k=\Theta(n)$. Thus we will also need to

  2.

    Exploit the structure of $\mathcal{B}'$ to compute $\bigwedge\mathcal{B}'$ efficiently.

Apers and Lee [AL21] give an approach to accomplish (1) and (2) following Karger's framework of tree-respecting cuts. Karger shows that in near-linear time one can compute a set of $K\in\mathcal{O}(\log n)$ spanning trees $T_1,\ldots,T_K$ of $G$ such that every $(1+\varepsilon)$-near minimum cut of $G$ 2-respects at least one of these trees. Let $\mathcal{B}_i\subseteq\mathcal{B}_\varepsilon^{nt}$ be the bipartitions corresponding to non-trivial near-minimum cuts that 2-respect $T_i$. To compute $\bigwedge\mathcal{B}_\varepsilon^{nt}$ it suffices to compute $\mathcal{C}_i=\bigwedge\mathcal{B}_i$ for each $i=1,\ldots,K$ and then compute $\bigwedge_{i=1}^K\mathcal{C}_i$. The latter can be done in time $\mathcal{O}(n\log^2 n)$ by the aforementioned algorithm. This leaves the problem of computing $\bigwedge\mathcal{B}_i$.

A key observation from [AL21] gives a generating set $\mathcal{B}_i'$ for $\mathcal{B}_i$ of size $\mathcal{O}(n)$. One initializes $\mathcal{B}_i'$ to be empty and then adds to $\mathcal{B}_i'$ the bipartitions in $\mathcal{B}_i$ that 1-respect $T_i$. This is a set of size $\mathcal{O}(n)$, and Karger has shown that all near-minimum cuts that 1-respect a tree can be found in time $\mathcal{O}(m)$.

Now we focus on the cuts that strictly 2-respect $T_i$. To handle these, one creates a graph $H$ whose nodes are the edges of $T_i$ and where there is an edge between nodes $e$ and $f$ iff the 2-respecting cut of $T_i$ defined by $\{e,f\}$ is a near-minimum cut in $\mathcal{B}_i$. One then adds to $\mathcal{B}_i'$ the bipartitions corresponding to a set of 2-respecting cuts that form a spanning forest of $H$. The resulting set $\mathcal{B}_i'$ has size $\mathcal{O}(n)$ and can be shown to be a generating set for $\mathcal{B}_i$.
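Ignoring efficiency, the role of the spanning forest of $H$ can be illustrated by a naive quadratic Python sketch. The predicate `is_H_edge` is a hypothetical oracle for whether $\{e,f\}$ defines a non-trivial near-minimum cut; avoiding the enumeration of all pairs, which this sketch does not, is precisely the technical challenge addressed in this paper.

```python
def spanning_forest_of_H(tree_edges, is_H_edge):
    """Naive O(n^2) construction of a spanning forest of H.

    tree_edges: edge ids of T (the nodes of H).
    is_H_edge(e, f): oracle telling whether {e, f} is an edge of H.
    Uses union-find to keep only pairs that join distinct components."""
    parent = {e: e for e in tree_edges}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    forest = []
    for i, e in enumerate(tree_edges):
        for f in tree_edges[i + 1:]:
            if find(e) != find(f) and is_H_edge(e, f):
                parent[find(e)] = find(f)
                forest.append((e, f))
    return forest
```
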

Apers and Lee give a quantum algorithm to find a spanning forest of $H$ with running time $\widetilde{\mathcal{O}}(n^{3/2})$. They then give a randomized algorithm to compute $\bigwedge\mathcal{B}_i'$ in time $\widetilde{\mathcal{O}}(n)$. As our main technical contribution, we give a deterministic algorithm to find a spanning forest of $H$ in time $\mathcal{O}(m\log^4 n)$. We also replace the randomization used in the algorithm to compute $\bigwedge\mathcal{B}_i'$ with an appropriate data structure, giving an $\widetilde{\mathcal{O}}(n)$ deterministic algorithm to compute the meet.

2 Preliminaries

For a natural number $n$ we use $[n]=\{1,\ldots,n\}$.

Graph notation

For a set $S$ we let $S^{(2)}$ denote the set of unordered pairs of elements of $S$. We represent an undirected edge-weighted graph as a triple $G=(V,E,w)$, where $E\subseteq V^{(2)}$ and $w:E\rightarrow\mathbb{R}_+$ gives the weight of an edge $e\in E$. We will also use $V(G)$ to denote the vertex set of $G$ and $E(G)$ to denote the set of edges. We always use $n$ for the number of vertices in $G$ and $m$ for the number of edges. We overload the function $w$ to let $w(F)=\sum_{e\in F}w(e)$ for a set of edges $F$, and for two disjoint sets $S,T\subseteq V$ we use $w(S,T)$ to denote $\sum_{e\in E : |e\cap S|=|e\cap T|=1}w(e)$, that is, the sum of the weights of edges with one endpoint in $S$ and one endpoint in $T$. For a subset $\emptyset\neq S\subsetneq V$ we let $\Delta(S)$ be the set of edges with exactly one endpoint in $S$. This is the cut defined by $S$. We let $\lambda(G)$ denote the weight of a minimum cut in $G$, i.e., $\lambda(G)=\min_{\emptyset\neq S\subsetneq V}w(\Delta(S))$.

Heavy path decomposition

We use the standard notion of heavy path decomposition of $T$ [ST83, HT84], which is a partition of the edges of $T$ into heavy paths. We define this partition recursively: first, find the heavy path starting at the root by repeatedly descending to the child of the current node with the largest subtree. This creates the topmost heavy path, starting at the root (called its head) and terminating at a leaf (called its tail). Second, remove the topmost heavy path from $T$ and repeat the reasoning on each of the obtained smaller trees. The crucial property is that, for any node $u$, the path from $u$ to the root in $T$ intersects at most $\log n$ heavy paths.
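A standard way to compute this decomposition in linear time is to record, for every node, the head of its heavy path. The following iterative Python sketch, with `children` as an adjacency map, is one way to do it; the function name and representation are ours.

```python
def heavy_path_decomposition(children, root):
    """Return head[u] = head of the heavy path containing the edge into u
    (head[root] = root).  children: dict node -> list of children."""
    # Compute subtree sizes in reverse traversal order (children before parents).
    order = []
    stack = [root]
    while stack:
        u = stack.pop()
        order.append(u)
        stack.extend(children.get(u, []))
    size = {}
    for u in reversed(order):
        size[u] = 1 + sum(size[c] for c in children.get(u, []))
    # Top-down: the heaviest child continues its parent's heavy path,
    # every other child starts a new heavy path (with itself as head).
    head = {root: root}
    for u in order:
        kids = children.get(u, [])
        if not kids:
            continue
        heavy = max(kids, key=lambda c: size[c])
        for c in kids:
            head[c] = head[u] if c == heavy else c
    return head
```

On a path graph every node ends up on the single topmost heavy path, while on a star only one child continues the root's path.
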

Algorithmic preliminaries

We collect here a few theorems from previous work that we will need. The first is Karger's result [Kar00] about finding $\mathcal{O}(\log n)$ many spanning trees of a graph $G$ such that every minimum cut of $G$ will 2-respect at least one of these trees. We will need the easy extension of this result to near-minimum cuts, which has been explicitly stated by Apers and Lee.

Theorem 2 ([Kar00, Theorem 4.1], [AL21, Theorem 24]).

Let $G$ be a weighted graph with $n$ vertices and $m$ edges. There is a randomized algorithm that in time $\mathcal{O}(m\log^2 n+n\log^4 n)$ constructs a set of $\mathcal{O}(\log n)$ spanning trees such that every $(1+1/16)$-near minimum cut of $G$ 2-respects $1/4$ of them with high probability.

We will also need the fact that for a weighted graph $G=(V,E,w)$ the values in $G$ of all 1-respecting cuts of a tree $T$ can be computed quickly. For a rooted spanning tree $T$ of $G$ and an edge $e\in E(T)$, let $T_e$ be the set of vertices in the component not containing the root when $e$ is removed from $T$. We use the shorthand $\mathrm{cost}(e)=w(\Delta_G(T_e))$.

Lemma 3 ([Kar00, Lemma 5.1]).

Let $G$ be a weighted graph with $n$ vertices and $m$ edges, and $T$ a spanning tree of $G$. There is a deterministic algorithm that computes $\mathrm{cost}(e)$ for every $e\in E(T)$ in time $\mathcal{O}(m+n)$.
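One way to realize the spirit of Lemma 3 is the classic deposit-and-sum trick: every graph edge $(x,y)$ of weight $w$ crosses exactly the tree edges on the tree path between $x$ and $y$, so depositing $+w$ at $x$ and $y$ and $-2w$ at their lowest common ancestor makes $\mathrm{cost}(e)$ equal a subtree sum of deposits. The Python sketch below uses a naive depth-climbing LCA for brevity, so it is not $\mathcal{O}(m+n)$; the linear-time version replaces it with a constant-time LCA structure.

```python
def one_respecting_costs(parent, graph_edges):
    """cost(e) for every tree edge e = (parent[u], u), returned as a map
    u -> cost of the edge above u.  parent[root] is None.
    graph_edges: list of (x, y, w) triples (may include the tree edges)."""
    depth = {}
    def get_depth(u):
        if u not in depth:
            depth[u] = 0 if parent[u] is None else get_depth(parent[u]) + 1
        return depth[u]
    def lca(x, y):  # naive: climb to equal depth, then climb together
        while get_depth(x) > get_depth(y):
            x = parent[x]
        while get_depth(y) > get_depth(x):
            y = parent[y]
        while x != y:
            x, y = parent[x], parent[y]
        return x
    deposit = {u: 0 for u in parent}
    for (x, y, w) in graph_edges:
        deposit[x] += w
        deposit[y] += w
        deposit[lca(x, y)] -= 2 * w
    # Subtree sums: process nodes deepest first so children are done first.
    cost = dict(deposit)
    for u in sorted(parent, key=get_depth, reverse=True):
        if parent[u] is not None:
            cost[parent[u]] += cost[u]
    return {u: cost[u] for u in parent if parent[u] is not None}
```
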

We will also make use of the improvement by Gawrychowski, Mozes, and Weimann of Karger's mincut algorithm.

Lemma 4 ([GMW20, Theorem 7]).

Let $G$ be a weighted graph with $n$ vertices and $m$ edges, and $T$ a spanning tree of $G$. A cut of minimum weight in $G$ that 2-respects $T$ can be found deterministically in $\mathcal{O}(m\log n)$ time. Using this, there is a randomized algorithm that finds a minimum cut in $G$ with high probability in time $\mathcal{O}(m\log^2 n)$.

Finally we give the formal statement of the result from [AL21] that underlies our algorithm to construct a KT partition.

Lemma 5 ([AL21, Lemma 29]).

Let $T=(V,E)$ be a tree and $\mathcal{M}\subseteq 2^V$ a family of subsets of $V$ such that $|\Delta_T(S)|=2$ for each $S\in\mathcal{M}$. Let $Q=\{\Delta_T(S) : S\in\mathcal{M}\}$ be a set of pairs of edges in $E$. Suppose $F$ is a spanning forest of the graph $L=(E,Q)$. Then the set of shores of the 2-respecting cuts defined by the edges in $E(F)$ is a generating set for $\bigwedge\mathcal{M}$.

3 Data structures

In this section we develop the data structure we will need for a fast implementation of our spanning forest algorithm. We want to maintain a tree $T$ with root $r$, in which each edge has a score and a color, so that we can support the following queries and updates. For an edge $e$ of the tree, let $T_e$ be the set of edges in the component not containing $r$ when $e$ is removed from the tree. On query of an edge $e$ we want to find the edge $f\in T_e$ with the smallest score, and the edge $f'\in T_e$ with the smallest score among edges whose color is different from that of $f$. We call such a query a categorical top two query. We want to answer these queries while allowing the addition of $\Delta$ to the score of every edge on the path between two nodes. We could use link-cut trees [ST83] to accomplish this with $\mathcal{O}(\log n)$ update and query time, using the fact that link-cut trees can be modified to support any semigroup operation under path updates. However, in our case the tree is static, and this allows for a simple and self-contained solution that requires only a well-known binary tree data structure coupled with the standard heavy path decomposition of a tree. This comes at the expense of implementing updates in $\mathcal{O}(\log^2 n)$ time instead of $\mathcal{O}(\log n)$ time. The construction can be seen as folklore and has been explicitly stated by Bhardwaj, Lovett, and Sandlund [BLS20] for the case when each edge maintains its score and there are no colors. We provide a detailed description of such an approach in Appendix A. We note that the increased update time does not affect the overall time complexity of our algorithm.

Lemma 6.

Let A[1],,A[n]A[1],\ldots,A[n] be an array where each element has two fields, a color A[i].colorA[i].\mathrm{color} and a score A[i].scoreA[i].\mathrm{score}. In 𝒪(n)\mathcal{O}(n) time we can create a data structure using 𝒪(n)\mathcal{O}(n) space and supporting the following operations in 𝒪(logn)\mathcal{O}(\log n) time per operation.

  1. 1.

    Add(Δ,i,j)\textsc{Add}(\Delta,i,j): for all ikji\leq k\leq j do A[k].scoreA[k].score+ΔA[k].\mathrm{score}\leftarrow A[k].\mathrm{score}+\Delta,

  2. 2.

    CatTopTwo(i,j)\textsc{CatTopTwo}(i,j): return (k1,k2)(k_{1},k_{2}) where k1=argmin{A[k].score:ikj}k_{1}=\operatorname*{arg\,min}\{A[k].\mathrm{score}:i\leq k\leq j\} and k2=NULLk_{2}=\mathrm{NULL} if A[k].color=A[k1].colorA[k].\mathrm{color}=A[k_{1}].\mathrm{color} for all ikji\leq k\leq j and k2=argmin{A[k].score:ikj,A[k].colorA[k1].color}k_{2}=\operatorname*{arg\,min}\{A[k].\mathrm{score}:i\leq k\leq j,A[k].\mathrm{color}\neq A[k_{1}].\mathrm{color}\} otherwise.
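As an illustration, the operations of Lemma 6 can be sketched with a standard segment tree with lazy range-addition, where each node keeps the minimum-score entry of its range together with the cheapest entry of a different color. This is a minimal sketch; the class and method names (CatTopTwoTree, add, cat_top_two) are ours, not from the paper.

```python
# Illustrative sketch: segment tree with lazy range-add. Each node stores the
# minimum (score, index) of its range and the minimum (score, index) among
# entries whose color differs from that minimum.
class CatTopTwoTree:
    def __init__(self, colors, scores):
        self.n = len(scores)
        self.colors = list(colors)
        self.best = [None] * (4 * self.n)   # (score, index): range minimum
        self.alt = [None] * (4 * self.n)    # range minimum with a different color
        self.lazy = [0] * (4 * self.n)
        self._build(1, 0, self.n - 1, scores)

    def _combine(self, cands):
        cands = [c for c in cands if c is not None]
        best = min(cands)
        diff = [c for c in cands if self.colors[c[1]] != self.colors[best[1]]]
        return best, (min(diff) if diff else None)

    def _build(self, v, lo, hi, scores):
        if lo == hi:
            self.best[v] = (scores[lo], lo)
            return
        mid = (lo + hi) // 2
        self._build(2 * v, lo, mid, scores)
        self._build(2 * v + 1, mid + 1, hi, scores)
        self.best[v], self.alt[v] = self._combine(
            [self.best[2 * v], self.alt[2 * v],
             self.best[2 * v + 1], self.alt[2 * v + 1]])

    def _apply(self, v, d):                 # add d to every score in node v's range
        self.lazy[v] += d
        if self.best[v]: self.best[v] = (self.best[v][0] + d, self.best[v][1])
        if self.alt[v]: self.alt[v] = (self.alt[v][0] + d, self.alt[v][1])

    def _push(self, v):
        if self.lazy[v]:
            self._apply(2 * v, self.lazy[v])
            self._apply(2 * v + 1, self.lazy[v])
            self.lazy[v] = 0

    def add(self, d, i, j, v=1, lo=0, hi=None):          # Add(d, i, j)
        if hi is None: hi = self.n - 1
        if j < lo or hi < i: return
        if i <= lo and hi <= j:
            self._apply(v, d)
            return
        self._push(v)
        mid = (lo + hi) // 2
        self.add(d, i, j, 2 * v, lo, mid)
        self.add(d, i, j, 2 * v + 1, mid + 1, hi)
        self.best[v], self.alt[v] = self._combine(
            [self.best[2 * v], self.alt[2 * v],
             self.best[2 * v + 1], self.alt[2 * v + 1]])

    def cat_top_two(self, i, j, v=1, lo=0, hi=None):     # CatTopTwo(i, j)
        if hi is None: hi = self.n - 1
        if j < lo or hi < i: return (None, None)
        if i <= lo and hi <= j: return (self.best[v], self.alt[v])
        self._push(v)
        mid = (lo + hi) // 2
        left = self.cat_top_two(i, j, 2 * v, lo, mid)
        right = self.cat_top_two(i, j, 2 * v + 1, mid + 1, hi)
        return self._combine([*left, *right])
```

Keeping only two candidates per node suffices because the cheapest entry of a color different from the range minimum is always among the children's stored pairs, so the summary composes; both operations then run in 𝒪(logn)\mathcal{O}(\log n) time.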

Lemma 7.

Let TT be a tree on nn nodes, with each edge eTe\in T having its color and score. In 𝒪(n)\mathcal{O}(n) time we can create a data structure using 𝒪(n)\mathcal{O}(n) space and supporting the following operations.

  1. 1.

    AddPath(Δ,p)\textsc{AddPath}(\Delta,p): add Δ\Delta to the score of every edge on a path pp in TT in 𝒪(log2n)\mathcal{O}(\log^{2}n) time.

  2. 2.

    CatTopTwo(e)\textsc{CatTopTwo}(e): categorical top-two query in TeT_{e} in 𝒪(logn)\mathcal{O}(\log n) time.

4 Spanning tree of near-minimum 2-respecting cuts in near-linear time

Let G=(V,E,w)G=(V,E,w) be a weighted undirected graph. We will assume throughout that GG is connected, and in particular that mn1m\geq n-1, as the KT partition of a disconnected graph can be easily determined from its connected components. Let TT be a spanning tree of GG. We will choose an rVr\in V with degree 1 in TT to be the root of TT. We view TT as a directed graph with all edges directed away from rr. With some abuse of notation, we will also use TT to refer to this directed version. If we remove any edge eE(T)e\in E(T) from TT then TT becomes disconnected into two components. We use eVe^{\downarrow}\subseteq V to denote the set of vertices in the component not containing the root, and TeE(T)T_{e}\subseteq E(T) to denote the set of edges in the subtree rooted at the head of ee, i.e. the edges in the subgraph of TT induced by ee^{\downarrow}. We further use the shorthand cost(e)=w(Δ(e))\mathrm{cost}(e)=w(\Delta(e^{\downarrow})) for the weight of the cut with shore ee^{\downarrow}.

Two edges e,fE(T)e,f\in E(T) define a unique cut in GG which we denote by cutT(e,f)\mathrm{cut}_{T}(e,f) (or cut(e,f)\mathrm{cut}(e,f) if it is clear from the context which TT we are referring to). The cut depends on the relationship between ee and ff. If eTfe\in T_{f} or fTef\in T_{e} then we say that ee and ff are descendant edges. Without loss of generality, say that fTef\in T_{e}. Then the cut defined by ee and ff is cut(e,f)=Δ(ef)\mathrm{cut}(e,f)=\Delta(e^{\downarrow}\setminus f^{\downarrow}). If ee and ff are not descendant edges, then we say they are independent. For independent edges we see that cut(e,f)=Δ(ef)\mathrm{cut}(e,f)=\Delta(e^{\downarrow}\cup f^{\downarrow}). In both cases we use cost(e,f)\mathrm{cost}(e,f) to denote the weight of the corresponding cut.
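For concreteness, both cases of cutT(e,f)\mathrm{cut}_{T}(e,f) can be checked by brute force on a small hypothetical example; the 5-vertex graph, its weights and the helper names below are ours, not taken from the paper.

```python
# Brute-force check of cut_T(e, f) on a hypothetical 5-vertex example.
# Tree edges are named by their lower endpoint (the head when edges point
# away from the root 0).
parent = {1: 0, 2: 1, 3: 1, 4: 3}        # spanning tree: 0->1, 1->2, 1->3, 3->4
W = {frozenset(p): w for p, w in
     [((0, 1), 1), ((1, 2), 2), ((1, 3), 3), ((3, 4), 1), ((2, 4), 5), ((0, 3), 2)]}

def below(v):                            # vertex set e-down for the edge entering v
    s = {v}
    for c, p in parent.items():
        if p == v:
            s |= below(c)
    return s

def cut_weight(S):                       # weight of the cut with shore S
    return sum(w for uv, w in W.items() if len(uv & S) == 1)

def cost(e, f):                          # weight of cut_T(e, f)
    se, sf = below(e), below(f)
    if sf < se:                          # descendant case: f in T_e
        shore = se - sf
    elif se < sf:                        # descendant case: e in T_f
        shore = sf - se
    else:                                # independent edges
        shore = se | sf
    return cut_weight(shore)
```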

In a KT partition we are only interested in non-trivial cuts. We first prove the following simple claim that characterizes when cut(e,f)\mathrm{cut}(e,f) is trivial.

Proposition 8.

Let G=(V,E,w)G=(V,E,w) be a connected graph with nn vertices and let TT be a spanning tree of GG with root rr. For e,fE(T)e,f\in E(T) if cut(e,f)\mathrm{cut}(e,f) is trivial then

  1. 1.

    If e,fe,f are independent then they must be the unique edges incident to the root.

  2. 2.

    If e,fe,f are descendant then there is a vertex vVv\in V such that ee is the edge incoming to vv and ff is the unique edge outgoing from vv, or vice versa.

Proof.

First suppose that e,fe,f are independent. Then a shore of cut(e,f)\mathrm{cut}(e,f) is S=efS=e^{\downarrow}\cup f^{\downarrow}. We have that |g|1|g^{\downarrow}|\geq 1 for any gE(T)g\in E(T), thus |S|2|S|\geq 2. Hence for cut(e,f)\mathrm{cut}(e,f) to be trivial we must have |S¯|=1|\bar{S}|=1. The root rr is not contained in gg^{\downarrow} for any gE(T)g\in E(T), thus it must be the case that S¯={r}\bar{S}=\{r\}. For this to happen, e,fe,f must be incident to rr, and rr cannot have any other outgoing edges besides ee and ff.

Now consider the case that e,fe,f are descendant and suppose without loss of generality that fTef\in T_{e}. Let S=efS=e^{\downarrow}\setminus f^{\downarrow}. In this case we have |S|<n1|S|<n-1 as ee^{\downarrow} does not contain the root and |f|1|f^{\downarrow}|\geq 1. Let us understand when |S|=1|S|=1. As all vertices on the path from the head of ee to and including the tail of ff are in SS it must be the case that the head of ee is the tail of ff. Call this vertex vv and note vSv\in S. If vv has any other child uu besides the head of ff then we would have uSu\in S as well, thus ff must be the unique outgoing edge from vv. ∎

By choosing a root rr for TT that has degree 1 we avoid the case of item 1 of Proposition 8. Thus we only have to worry about trivial cuts when e,fe,f are descendant.
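Proposition 8 can also be verified exhaustively on a small hypothetical example: with a degree-1 root, the only trivial 2-respecting cuts are those where ff is the unique edge leaving the head of ee. The tree and names below are ours.

```python
# Brute-force check of Proposition 8 on a hypothetical 5-vertex tree whose
# root has degree 1; edges are named by their lower endpoint.
from itertools import combinations

parent = {1: 0, 2: 1, 3: 1, 4: 3}        # tree rooted at 0; root has degree 1
n = 5

def below(v):
    s = {v}
    for c, p in parent.items():
        if p == v:
            s |= below(c)
    return s

def outdeg(v):
    return sum(1 for p in parent.values() if p == v)

trivial_pairs = []
for e, f in combinations(parent, 2):
    se, sf = below(e), below(f)
    if sf < se:                          # f in T_e
        shore = se - sf
    elif se < sf:                        # e in T_f: swap so that f in T_e
        shore = sf - se
        e, f = f, e
    else:                                # independent edges
        shore = se | sf
    if len(shore) in (1, n - 1):         # one side of the cut is a single vertex
        trivial_pairs.append((e, f))
        assert parent[f] == e and outdeg(e) == 1   # item 2 of Proposition 8
```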

With that out of the way, we now turn to the main theorem of this section. As outlined in Section 1.2 this theorem is the key routine in our (1+ε)(1+\varepsilon)-KT partition algorithm, which we fully describe in Section 5.

Theorem 9.

Let G=(V,E,w)G=(V,E,w) be a connected graph with nn vertices and mm edges and let TT be a spanning tree of GG. For a given parameter β\beta, define the graph HH, with V(H)=E(T)V(H)=E(T) and E(H)={{e,f}E(T)(2):cost(e,f)β and cut(e,f) non-trivial}E(H)=\{\{e,f\}\in E(T)^{(2)}:\mathrm{cost}(e,f)\leq\beta\text{ and }\mathrm{cut}(e,f)\text{ non-trivial}\}. There is a deterministic algorithm that given adjacency list access to GG and TT outputs a spanning forest of HH in 𝒪(mlog4n)\mathcal{O}(m\log^{4}n) time.

At a high level, we prove Theorem 9 by following Borůvka’s algorithm to find a spanning forest of HH. Throughout the algorithm we maintain a subgraph FF of HH that is a forest, initialized to be the empty graph on vertex set E(T)E(T). At the end of the algorithm, FF will be a spanning forest of HH. The algorithm proceeds in rounds. In each round, for every tree in the forest, we find an edge connecting it to another tree in the forest, if such an edge exists. If HH has kk connected components, then in each round the number of trees in FF minus kk goes down by at least a factor of 22, and so the algorithm terminates in 𝒪(logn)\mathcal{O}(\log n) rounds.

The main work is implementing a round of Borůvka’s algorithm. We will think of the nodes of FF as having colors, where nodes in the same tree of the forest have the same color, and nodes in distinct trees have distinct colors. The goal of a single round is to find, for each color cc, a pair of edges e,fTe,f\in T such that c=color(e)color(f)c=\mathrm{color}(e)\neq\mathrm{color}(f) and {e,f}E(H)\{e,f\}\in E(H), or detect that there is no such pair with these properties, in which case the nodes colored cc in FF already form a connected component of HH. As we need to refer to such pairs often we make the following definition.

Definition 10 (partner).

Let TT and HH be as in Theorem 9. Given an assignment of colors to the edges of TT we say that ff is a partner for ee if {e,f}E(H)\{e,f\}\in E(H) and color(e)color(f)\mathrm{color}(e)\neq\mathrm{color}(f).

We will actually do something stronger than what is required to implement a round of Borůvka’s algorithm, which we encapsulate in the following code header.

Algorithm 1 𝖱𝗈𝗎𝗇𝖽𝖤𝖽𝗀𝖾𝗌\mathsf{RoundEdges}

Input: Adjacency list access to GG, a spanning tree TT of GG, a parameter β\beta, and an assignment of colors to each eE(T)e\in E(T).
      Output: For every eE(T)e\in E(T) output a partner fE(T)f\in E(T), or report that no partner for ee exists.

The implementation of 𝖱𝗈𝗎𝗇𝖽𝖤𝖽𝗀𝖾𝗌\mathsf{RoundEdges} is our main technical contribution. Let us first see how to use 𝖱𝗈𝗎𝗇𝖽𝖤𝖽𝗀𝖾𝗌\mathsf{RoundEdges} to find a spanning forest of HH.

Lemma 11.

Let GG, TT and HH be as in Theorem 9. There is a deterministic algorithm that makes 𝒪(logn)\mathcal{O}(\log n) calls to 𝖱𝗈𝗎𝗇𝖽𝖤𝖽𝗀𝖾𝗌\mathsf{RoundEdges} and in 𝒪(nlogn)\mathcal{O}(n\log n) additional time outputs a spanning forest of HH.

Proof.

We construct a spanning forest of HH by maintaining a collection of trees FF that will be updated in rounds by Borůvka’s algorithm until it becomes a spanning forest. We initialize F=(E(T),)F=(E(T),\emptyset) and give all eE(T)e\in E(T) distinct colors. We maintain the invariants that FF is a forest and that nodes in the same tree have the same color and those in different trees have distinct colors.

Consider a generic round where FF contains qq trees. We call 𝖱𝗈𝗎𝗇𝖽𝖤𝖽𝗀𝖾𝗌\mathsf{RoundEdges} with the current color assignment. For every ee that has one, we obtain a partner ff such that {e,f}E(H)\{e,f\}\in E(H) and color(e)color(f)\mathrm{color}(e)\neq\mathrm{color}(f). For each color class cc we select one ee with color(e)=c\mathrm{color}(e)=c that has a returned partner (if any exists) and let XX be the set of selected edges. We then find a maximal subset of edges XXX^{\prime}\subseteq X that do not create a cycle among the color classes by computing a spanning forest of the graph whose supervertices are given by the color classes and edges given by XX. We add the edges in XX^{\prime} to E(F)E(F). Finally we merge the color classes of the connected components in FF by appropriately updating the color assignments, and we pass the updated forest and color assignments to the next round of the algorithm. Each of the steps in a single round can be executed in 𝒪(n)\mathcal{O}(n) time.

We have that |X|(qcc(H))/2|X^{\prime}|\geq(q-\mathrm{cc}(H))/2 where cc(H)\mathrm{cc}(H) is the number of connected components of HH. Each edge from XX^{\prime} added to FF decreases the number of trees in FF by one. Thus the number of trees in FF minus cc(H)\mathrm{cc}(H) decreases by at least a factor of 2 in each round and the algorithm terminates after 𝒪(logn)\mathcal{O}(\log n) rounds. The time spent outside of the calls to 𝖱𝗈𝗎𝗇𝖽𝖤𝖽𝗀𝖾𝗌\mathsf{RoundEdges} is 𝒪(n)\mathcal{O}(n) for each of the 𝒪(logn)\mathcal{O}(\log n) rounds. This is 𝒪(nlogn)\mathcal{O}(n\log n) overall. ∎
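The round structure of Lemma 11 can be sketched as follows. This is an illustrative stand-in in which a brute-force scan of an explicitly given edge list of HH plays the role of 𝖱𝗈𝗎𝗇𝖽𝖤𝖽𝗀𝖾𝗌\mathsf{RoundEdges}; all names are ours.

```python
# Illustrative sketch: Boruvka rounds on an explicit graph H, with a
# brute-force scan standing in for the RoundEdges oracle.
def spanning_forest_by_rounds(nodes, h_edges):
    color = {v: v for v in nodes}         # initially one color class per node

    def find(v):                          # union-find with path compression
        while color[v] != v:
            color[v] = color[color[v]]
            v = color[v]
        return v

    forest = []
    while True:
        # One round: for each color class, record some edge leaving it, if any.
        partner = {}
        for (e, f) in h_edges:
            ce, cf = find(e), find(f)
            if ce != cf:
                partner.setdefault(ce, (e, f))
                partner.setdefault(cf, (e, f))
        if not partner:                   # every class is a component of H
            break
        for (e, f) in partner.values():
            ce, cf = find(e), find(f)
            if ce != cf:                  # skip edges that would close a cycle
                forest.append((e, f))
                color[ce] = cf            # merge the two color classes
    return forest
```

Selecting one leaving edge per class and discarding those that would close a cycle mirrors the sets XX and XX^{\prime} in the proof; the number of classes minus the number of components of HH at least halves each round.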

If a node ee has a partner ff, then {e,f}\{e,f\} can either be a pair of descendant or independent edges. To implement 𝖱𝗈𝗎𝗇𝖽𝖤𝖽𝗀𝖾𝗌\mathsf{RoundEdges} we will separately handle these cases, as described in the next two subsections.

4.1 Descendant edges

We follow the approach from [GMW20] originally designed to find a single pair {e,f}\{e,f\} of descendant edges that minimizes cost(e,f)\mathrm{cost}(e,f) over all e,fE(T)e,f\in E(T) in 𝒪(mlogn)\mathcal{O}(m\log n) time. Their approach actually does something stronger (as does Karger’s original algorithm): for every eE(T)e\in E(T) it finds the best match in the subtree TeT_{e}, i.e., it returns an edge fargmin{cost(e,f)fTe}f^{*}\in\operatorname*{arg\,min}\{\mathrm{cost}(e,f)\mid f\in T_{e}\}. In order to implement the descendant edge part of 𝖱𝗈𝗎𝗇𝖽𝖤𝖽𝗀𝖾𝗌\mathsf{RoundEdges} we have three additional complications to handle:

  1. 1.

    The edge ff^{*} might have the same color as ee.

  2. 2.

    The resulting cut(e,f)\mathrm{cut}(e,f^{*}) might be a trivial cut.

  3. 3.

    Edge ee may have no partner in TeT_{e} but still have a partner ff such that eTfe\in T_{f}. This partnership may not be discovered when we are looking for partners of ff if there is another gTfg\in T_{f} with cost(f,g)cost(e,f)\mathrm{cost}(f,g)\leq\mathrm{cost}(e,f).

Item 1 can be easily solved by, in addition to finding ff^{*}, also finding gargmin{cost(e,f)fE(Te),color(f)color(f)}g^{*}\in\operatorname*{arg\,min}\{\mathrm{cost}(e,f)\mid f\in E(T_{e}),\,\mathrm{color}(f)\neq\mathrm{color}(f^{*})\}. Phrasing things in this way, rather than simply looking for the edge hh with color different from ee which minimizes cost(e,h)\mathrm{cost}(e,h), helps to limit the dependence of the query on ee and thus reduce the query time. If there is an fTef\in T_{e} with color(f)color(e)\mathrm{color}(f)\neq\mathrm{color}(e) and cost(e,f)β\mathrm{cost}(e,f)\leq\beta then at least one of f,gf^{*},g^{*} will satisfy this too.

For item 2, we use the result of Proposition 8 that descendant edges that give rise to trivial cuts have a very constrained structure. This allows us to avoid trivial cuts when looking for a partner of ee.

Item 3 is relatively subtle and does not arise in the minimum weight 2-respecting cut problem. To explain the issue we have to first say something about the high level structure of our implementation of 𝖱𝗈𝗎𝗇𝖽𝖤𝖽𝗀𝖾𝗌\mathsf{RoundEdges}. We will perform an Euler tour of TT and, when the tour visits edge ee for the first time, we will look for a partner ff for ee in TeT_{e}. The issue is the following, which we explain in the context of the very first round of Borůvka’s algorithm so we do not have to worry about nodes having different colors. Suppose that in the graph HH the only edge incident to node ee joins it to a node ff with eTfe\in T_{f}. Thus in the execution of 𝖱𝗈𝗎𝗇𝖽𝖤𝖽𝗀𝖾𝗌\mathsf{RoundEdges} we want to find ff as a partner of ee. When the Euler tour is at ee we will not find any suitable partner for ee, as there is none in TeT_{e}. We would like to identify ff as a partner for ee when the Euler tour visits ff for the first time. However, if there is a gTfg\in T_{f} with cost(f,g)<cost(f,e)\mathrm{cost}(f,g)<\mathrm{cost}(f,e) then the algorithm will return gg as a partner of ff rather than ee. To handle this we will actually make two passes over TT. In the first pass, when we visit edge ee for the first time we look for a partner ff in TeT_{e}. In the second pass, we handle the case where the partner of ee might be an ancestor of ee. To do this we need to de-activate nodes. When the Euler tour visits ff for the first time, we first find the lowest cost partner for ff in TfT_{f}. We then de-activate this node, and again find the best active partner for ff in TfT_{f}. Repeating this process, we will eventually find ee if {e,f}\{e,f\} is indeed an edge of HH and e,fe,f have different colors.

Now we turn to more specific implementation details. A key idea in [GMW20] is that we can do an Euler tour of TT while maintaining a data structure such that when we first visit an edge ee we can easily look up cost(e,f)\mathrm{cost}(e,f) for any fTef\in T_{e}. The way this is maintained can be best understood by noting that for fTef\in T_{e}:

cost(e,f)\displaystyle\mathrm{cost}(e,f) =w(Δ(ef))\displaystyle=w(\Delta(e^{\downarrow}\setminus f^{\downarrow}))
=w(ef,(e)c)+w(ef,f)\displaystyle=w(e^{\downarrow}\setminus f^{\downarrow},(e^{\downarrow})^{c})+w(e^{\downarrow}\setminus f^{\downarrow},f^{\downarrow})
=cost(e)+cost(f)2w(f,(e)c)scoree(f),\displaystyle=\mathrm{cost}(e)+\underbrace{\mathrm{cost}(f)-2w(f^{\downarrow},(e^{\downarrow})^{c})}_{\mathrm{score}_{e}(f)}\enspace, (1)

where for convenience we defined scoree(f)=cost(f)2w(f,(e)c)\mathrm{score}_{e}(f)=\mathrm{cost}(f)-2w(f^{\downarrow},(e^{\downarrow})^{c}), where the superscript cc denotes taking the complement.
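The identity (1) can be sanity-checked numerically on a small hypothetical example; the graph, weights and names below are ours, not from the paper.

```python
# Numeric check of cost(e,f) = cost(e) + cost(f) - 2 w(f-down, (e-down)^c)
# for a descendant pair f in T_e, on a hypothetical 5-vertex example.
parent = {1: 0, 2: 1, 3: 1, 4: 3}        # tree rooted at 0; edge = lower endpoint
W = {frozenset(p): w for p, w in
     [((0, 1), 1), ((1, 2), 2), ((1, 3), 3), ((3, 4), 1), ((2, 4), 5), ((0, 3), 2)]}
V = set(range(5))

def below(v):                            # vertex set e-down for the edge entering v
    s = {v}
    for c, p in parent.items():
        if p == v:
            s |= below(c)
    return s

def w_between(A, B):                     # total weight of edges between A and B
    return sum(w for uv, w in W.items()
               if len(uv & A) == 1 and len(uv & B) == 1)

def cut_w(S):
    return w_between(S, V - S)

e, f = 3, 4                              # f lies in T_e
se, sf = below(e), below(f)
lhs = cut_w(se - sf)                                     # cost(e, f)
rhs = cut_w(se) + cut_w(sf) - 2 * w_between(sf, V - se)  # cost(e) + score_e(f)
assert lhs == rhs
```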

We begin the algorithm by computing cost(e)\mathrm{cost}(e) for every eE(T)e\in E(T), which can be done in 𝒪(m)\mathcal{O}(m) time by Lemma 3. We then do an Euler tour of TT while maintaining a data structure from Lemma 7 such that, when we are considering eE(T)e\in E(T), for every fTef\in T_{e} the value of the data structure at location ff is scoree(f)\mathrm{score}_{e}(f), so that cost(e,f)\mathrm{cost}(e,f) can be read off by adding cost(e)\mathrm{cost}(e). For fTef\not\in T_{e} this will not in general be the case.

As can be seen from (1), the key to maintaining this data structure is how to update the values w(f,(e)c)w(f^{\downarrow},(e^{\downarrow})^{c}) when we descend edge ee. Consider the case where we are currently at edge e=(z,x)e^{\prime}=(z,x) and move to a descendant edge e=(x,y)e=(x,y). For two vertices u,vu,v let p(u,v)p(u,v) be the set of edges on the path from uu to vv in TT, and let lca(u,v)\mathrm{lca}(u,v) be their least common ancestor in TT. For fTef\in T_{e} we see that

w(f,(e)c)=w(f,(e)c)+{u,v}Efp(u,v),lca(u,v)=xw({u,v}).w(f^{\downarrow},(e^{\downarrow})^{c})=w(f^{\downarrow},(e^{\prime\downarrow})^{c})+\sum_{\{u,v\}\in E\atop f\in p(u,v),\mathrm{lca}(u,v)=x}w(\{u,v\})\enspace. (2)

By its definition in (1) we can compute scoree(f)\mathrm{score}_{e}(f) from scoree(f)\mathrm{score}_{e^{\prime}}(f) by subtracting 2w({u,v})2w(\{u,v\}) from it for every {u,v}E\{u,v\}\in E such that fp(u,v)f\in p(u,v) and lca(u,v)=x\mathrm{lca}(u,v)=x. The for-loop on line 2 of Algorithm 2 implements this step for all ff by looping over all {u,v}E\{u,v\}\in E with lca(u,v)=x\mathrm{lca}(u,v)=x. After this update we have that cost(e,f)=cost(e)+score(f)\mathrm{cost}(e,f)=\mathrm{cost}(e)+\mathrm{score}(f) for every fTef\in T_{e}. This shows how to descend down TT while keeping the invariant. The full tree is then explored by taking an Euler tour through TT, and whenever we go back up in the tree we revert the score updates (for-loop on line 9 of Algorithm 2). This allows us to find a candidate fTef\in T_{e} for every eE(T)e\in E(T). To bound the number of updates, note that each of the mm edges has a unique lca, and we only do an update corresponding to an edge when its lca is visited by the Euler tour. Since the Euler tour visits every vertex at most twice, the number of updates is at most 2m2m. In addition, the number of categorical top two queries is n1n-1. The data structure from Lemma 7 then yields 𝒪(mlog2n)\mathcal{O}(m\log^{2}n) time overall.

The algorithm is formalized in Algorithm 2, whose correctness we prove in the following lemma.

Lemma 12 (cf. [GMW20, Lemma 8]).

Assume that we first initialize e.scorecost(e)e.\mathrm{score}\leftarrow\mathrm{cost}(e) for every eE(T)e\in E(T), and then run Algorithm 2 (doing nothing in line 6). Then whenever an edge e=(x,y)e=(x,y) is followed on line 5 in the call to Traverse(x)(x) it holds that cost(e,f)=cost(e)+f.score\mathrm{cost}(e,f)=\mathrm{cost}(e)+f.\mathrm{score} for all fTef\in T_{e}.

Proof.

We will prove this by induction on the depth of xx. Consider the case where xx is the root rr. Before the call to Traverse(r)(r) we initialized all scores to e.scorecost(e)e.\mathrm{score}\leftarrow\mathrm{cost}(e). Then, on line 3 of Traverse(r)(r), for each {u,v}E(G)\{u,v\}\in E(G) with lca(u,v)=r\mathrm{lca}(u,v)=r we subtract 2w({u,v})2w(\{u,v\}) from the score of every edge on the uu to vv path in TT. Let us refer to scores at this point in time as “at time zero.” We first claim that at time zero for any outgoing edge e=(r,y)e=(r,y) from the root this makes cost(e,f)=cost(e)+f.score\mathrm{cost}(e,f)=\mathrm{cost}(e)+f.\mathrm{score} for all fTef\in T_{e}.

Let p(u,v)p(u,v) be the set of edges on the path from uu to vv in TT. By (1) we have cost(e,f)=cost(e)+cost(f)2w(f,(e)c)\mathrm{cost}(e,f)=\mathrm{cost}(e)+\mathrm{cost}(f)-2w(f^{\downarrow},(e^{\downarrow})^{c}) thus it suffices to show that for any fTef\in T_{e}

w(f,(e)c)={u,v}E(G)fp(u,v),lca(u,v)=rw({u,v}).w(f^{\downarrow},(e^{\downarrow})^{c})=\sum_{\{u,v\}\in E(G)\atop f\in p(u,v),\mathrm{lca}(u,v)=r}w(\{u,v\})\enspace.

This holds because by definition hE(f,(e)c)h\in E(f^{\downarrow},(e^{\downarrow})^{c}) iff one endpoint is in ff^{\downarrow} and the other endpoint is in (e)c(e^{\downarrow})^{c}, which in turn happens iff the least common ancestor of the endpoints is rr and ff lies on the path between the endpoints.

To finish the base case, we claim that at each iteration of the for loop all scores are the same as at time zero. This is because any update to the scores on line 3 performed inside a recursive call is exactly canceled out by the reverse update on line 10 when that recursive call exits.

For the inductive step, let us suppose that when an edge e=(x,y)e=(x,y) is followed on line 5 in the call to Traverse(x)(x) it holds that cost(e,f)=cost(e)+f.score\mathrm{cost}(e,f)=\mathrm{cost}(e)+f.\mathrm{score} for all fTef\in T_{e}. Let us now refer to scores at this point in time as “at time zero.” We then want to show that on line 5 in the call to Traverse(y)(y), for an outgoing edge e=(y,z)e^{\prime}=(y,z), it holds that cost(e,f)=cost(e)+f.score\mathrm{cost}(e^{\prime},f)=\mathrm{cost}(e^{\prime})+f.\mathrm{score} for all fTef\in T_{e^{\prime}}. The change in the scores from time zero to the execution of the for loop in the call to Traverse(y)(y) occurs in the update on line 3. Let us refer to scores at this point in time as “at time one.” We first show that at time one for any outgoing edge e=(y,z)e^{\prime}=(y,z) of yy it holds that cost(e,f)=cost(e)+f.score\mathrm{cost}(e^{\prime},f)=\mathrm{cost}(e^{\prime})+f.\mathrm{score} for all fTef\in T_{e^{\prime}}. The key to this is to consider the difference between cost(e,f)\mathrm{cost}(e,f) and cost(e,f)\mathrm{cost}(e^{\prime},f) for an fTef\in T_{e^{\prime}}. By (1) we have cost(e,f)=cost(e)+cost(f)2w(f,(e)c)\mathrm{cost}(e^{\prime},f)=\mathrm{cost}(e^{\prime})+\mathrm{cost}(f)-2w(f^{\downarrow},(e^{\prime\downarrow})^{c}), and by the inductive hypothesis at time zero f.score=cost(f)2w(f,(e)c)f.\mathrm{score}=\mathrm{cost}(f)-2w(f^{\downarrow},(e^{\downarrow})^{c}). Thus, to ensure cost(e,f)=cost(e)+f.score\mathrm{cost}(e^{\prime},f)=\mathrm{cost}(e^{\prime})+f.\mathrm{score}, we need to change f.scoref.\mathrm{score} by

2(w(f,(e)c)w(f,(e)c))=2{u,v}E(G)fp(u,v),lca(u,v)=yw({u,v}).2\left(w(f^{\downarrow},(e^{\downarrow})^{c})-w(f^{\downarrow},(e^{\prime\downarrow})^{c})\right)=-2\sum_{\{u,v\}\in E(G)\atop f\in p(u,v),\mathrm{lca}(u,v)=y}w(\{u,v\})\enspace. (3)

To see this, first note that E(f,(e)c)E(f,(e)c)E(f^{\downarrow},(e^{\downarrow})^{c})\subseteq E(f^{\downarrow},(e^{\prime\downarrow})^{c}). This confirms that we should subtract something to perform this update. An edge {u,v}\{u,v\} is in E(f,(e)c)E(f^{\downarrow},(e^{\prime\downarrow})^{c}) but not E(f,(e)c)E(f^{\downarrow},(e^{\downarrow})^{c}) iff one endpoint, say uu, is in ff^{\downarrow} and the other endpoint vv is in (e)c(e)c(e^{\prime\downarrow})^{c}\setminus(e^{\downarrow})^{c}. This means that vev\in e^{\downarrow} but not vev\in e^{\prime\downarrow}, and so y=lca(u,v)y=\mathrm{lca}(u,v). The condition ufu\in f^{\downarrow} is then equivalent to having ff on the path between uu and vv. This confirms that Eq. 3 performs the correct update.

To finish the proof, we claim that cost(e,f)=cost(e)+f.score\mathrm{cost}(e^{\prime},f)=\mathrm{cost}(e^{\prime})+f.\mathrm{score} not just at time one, but at the time when the for loop with ee^{\prime} is executed. This is again because the changes to the scores on line 3 that are made in a recursive call are reversed when the recursive call exits on line 10, thus every time the for loop is executed the scores are the same as the scores at time one. ∎

Algorithm 2 Euler tour maintaining cost(e,f)\mathrm{cost}(e,f) for fTef\in T_{e}
1:function Traverse(xx)
2:     for all {u,v}E(G)\{u,v\}\in E(G) such that lca(u,v)=x\mathrm{lca}(u,v)=x do
3:         AddPath(2w({u,v}),u-to-v-2w(\{u,v\}),u\text{-to-}v)
4:     end for
5:     for all yy such that e=(x,y)E(T)e=(x,y)\in E(T) do
6:         Process ee. \triangleright “Process” depends on context.
7:         Traverse(yy)
8:     end for
9:     for all {u,v}E(G)\{u,v\}\in E(G) such that lca(u,v)=x\mathrm{lca}(u,v)=x do
10:         AddPath(2w({u,v}),u-to-v2w(\{u,v\}),u\text{-to-}v)
11:     end for
12:end function
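To make Lemma 12 concrete, the following naive re-implementation of Traverse on a hypothetical 5-vertex example keeps the scores in a plain dict (standing in for the structure of Lemma 7) and asserts the invariant each time an edge is followed; the example and all names are ours.

```python
# Naive check of the Lemma 12 invariant: whenever edge e is followed,
# cost(e, f) = cost(e) + score[f] for every f in T_e.
parent = {1: 0, 2: 1, 3: 1, 4: 3}        # tree rooted at 0; edge = lower endpoint
W = {frozenset(p): w for p, w in
     [((0, 1), 1), ((1, 2), 2), ((1, 3), 3), ((3, 4), 1), ((2, 4), 5), ((0, 3), 2)]}

def depth(v):
    return 0 if v == 0 else 1 + depth(parent[v])

def lca(u, v):
    while depth(u) > depth(v): u = parent[u]
    while depth(v) > depth(u): v = parent[v]
    while u != v: u, v = parent[u], parent[v]
    return u

def path_edges(u, v):                    # tree edges on the u-to-v path
    a, out = lca(u, v), []
    for x in (u, v):
        while x != a:
            out.append(x)
            x = parent[x]
    return out

def below(v):
    s = {v}
    for c, p in parent.items():
        if p == v: s |= below(c)
    return s

def cut_w(S):
    return sum(w for uv, w in W.items() if len(uv & S) == 1)

score = {e: cut_w(below(e)) for e in parent}   # initialize score[e] = cost(e)
by_lca = {}
for uv, w in W.items():
    u, v = tuple(uv)
    by_lca.setdefault(lca(u, v), []).append((u, v, w))

def traverse(x):
    for u, v, w in by_lca.get(x, []):          # lines 2-4: subtract along paths
        for g in path_edges(u, v): score[g] -= 2 * w
    for e in [c for c, p in parent.items() if p == x]:
        # Lemma 12: cost(e, f) = cost(e) + score[f] for every f in T_e
        for f in below(e) - {e}:
            assert cut_w(below(e) - below(f)) == cut_w(below(e)) + score[f]
        traverse(e)
    for u, v, w in by_lca.get(x, []):          # lines 9-11: revert the updates
        for g in path_edges(u, v): score[g] += 2 * w

traverse(0)
```

After traverse(0) returns, every update has been reverted and each score again equals the corresponding cut cost, matching the observation used later in the proof of Theorem 13.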

Given Lemma 12 to maintain cost(e,f)\mathrm{cost}(e,f) for fTef\in T_{e} during an Euler tour of the tree, and with the data structure of Lemma 7 to handle categorical top two queries, it is now straightforward to design an algorithm to find for every edge ee a partner for ee that is a descendant or ancestor, if such a partner exists.

Algorithm 3 The descendant edge portion of 𝖱𝗈𝗎𝗇𝖽𝖤𝖽𝗀𝖾𝗌\mathsf{RoundEdges}.
1:XX\leftarrow\emptyset \triangleright XX will hold all edges found during the round
2:for eE(T)e\in E(T) do e.scorecost(e)e.\mathrm{score}\leftarrow\mathrm{cost}(e) \triangleright scores are maintained with Lemma 7
3:end for
4:Run Traverse(rr) where “Process ee” means running Below(ee).
5:Run Traverse(rr) where “Process ee” means running Above(ee).
1:function Below(ee) \triangleright Find a partner for ee in TeT_{e}.
2:     if the head of ee has outdegree 1 then
3:         Let hh be the outgoing edge of the head of ee. \triangleright In this case cut(e,h)\mathrm{cut}(e,h) is trivial
4:         h.score+=β+1h.\mathrm{score}\mathrel{+}=\beta+1
5:     end if
6:     (f,g)=(f,g)= CatTopTwo(ee)
7:     if color(f)color(e)&cost(e)+score(f)β\mathrm{color}(f)\neq\mathrm{color}(e)\And\mathrm{cost}(e)+\mathrm{score}(f)\leq\beta then
8:         XX{e,f}X\leftarrow X\cup\{e,f\}
9:     else if color(g)color(e)&cost(e)+score(g)β\mathrm{color}(g)\neq\mathrm{color}(e)\And\mathrm{cost}(e)+\mathrm{score}(g)\leq\beta then
10:         XX{e,g}X\leftarrow X\cup\{e,g\}
11:     end if
12:     if the head of ee has outdegree 1 then
13:         h.score-=β+1h.\mathrm{score}\mathrel{-}=\beta+1
14:     end if
15:end function
1:function Above(ee) \triangleright Find all ff such that fTef\in T_{e} and ee is a partner of ff.
2:     if the head of ee has outdegree 1 then
3:         Let hh be the outgoing edge of the head of ee. \triangleright In this case cut(e,h)\mathrm{cut}(e,h) is trivial
4:         h.score+=β+1h.\mathrm{score}\mathrel{+}=\beta+1
5:     end if
6:     noMore = false
7:     repeat
8:         (f,g)=(f,g)= CatTopTwo(ee)
9:         if color(f)color(e)&cost(e)+score(f)β\mathrm{color}(f)\neq\mathrm{color}(e)\And\mathrm{cost}(e)+\mathrm{score}(f)\leq\beta then
10:              XX{e,f}X\leftarrow X\cup\{e,f\}
11:              f.score+=β+1f.\mathrm{score}\mathrel{+}=\beta+1
12:         else if color(g)color(e)&cost(e)+score(g)β\mathrm{color}(g)\neq\mathrm{color}(e)\And\mathrm{cost}(e)+\mathrm{score}(g)\leq\beta then
13:              XX{e,g}X\leftarrow X\cup\{e,g\}
14:              g.score+=β+1g.\mathrm{score}\mathrel{+}=\beta+1
15:         else
16:              noMore = True
17:         end if
18:     until noMore
19:     if the head of ee has outdegree 1 then
20:         h.score-=β+1h.\mathrm{score}\mathrel{-}=\beta+1
21:     end if
22:end function
Theorem 13.

Given an assignment e.colore.\mathrm{color} for each eE(T)e\in E(T), there is a deterministic algorithm that runs in time 𝒪(mlog2n)\mathcal{O}(m\log^{2}n) and for each ee finds an ff such that

  1. 1.

    {e,f}H\{e,f\}\in H

  2. 2.

    eTfe\in T_{f} or fTef\in T_{e}

  3. 3.

    e.colorf.colore.\mathrm{color}\neq f.\mathrm{color}

if such an ff exists.

Proof.

The algorithm is given by Algorithm 3. Suppose that an edge ee has a partner ff satisfying the 3 conditions of the theorem. Then either fTef\in T_{e} or eTfe\in T_{f}. We claim that if fTef\in T_{e} then we will find a partner of ee in the call to Traverse(rr) using Below(ee) to process edge ee, and if eTfe\in T_{f} then we will find a partner of ee in the call to Traverse(rr) using Above(ee) to process edge ee.

Let us show these statements separately, starting with the case fTef\in T_{e}. Consider the time when ee is considered in the for loop on line 5 in a recursive call from Traverse(rr) using Below(ee) to process edge ee. In the call to Below(ee) we first check if the head of ee has a single outgoing edge hh. If this is the case then cut(e,h)\mathrm{cut}(e,h) is a trivial cut and thus we do not want to find hh as a partner for ee. We thus add β+1\beta+1 to the score of hh ensuring that it will never be a valid partner for ee. By Proposition 8 this is the only situation where cut(e,f)\mathrm{cut}(e,f) is trivial for fTef\in T_{e}. By Lemma 12 for all other gTeg\in T_{e} it holds that cost(e,g)=cost(e)+g.score\mathrm{cost}(e,g)=\mathrm{cost}(e)+g.\mathrm{score}. Thus the call to CatTopTwo(ee) will perform correctly, and one of the returned edges must be a valid partner for ee. We then reset the score of hh, if it was changed, to maintain the property given by Lemma 12.

Now consider the case where ee has a partner ff with eTfe\in T_{f}. Let ff be the first such partner that is encountered in an Euler tour of TT. We claim that the edge {e,f}\{e,f\} will be added to XX in the call to Traverse(rr) using Above(ee) to process edge ee. First note that after the previous call to Traverse(rr) terminates it holds that g.score=cost(g)g.\mathrm{score}=\mathrm{cost}(g) for every gg, as all changes to the scores in the recursive calls are reverted after the call returns. Thus we are again in position to apply Lemma 12, although we have to be slightly more careful this time as scores are modified within the body of the for loop on line 5 of Algorithm 2 when we run Above(ee) to process ee. We again handle the possibility of trivial cuts as in the “below” case. We also only modify the score of an edge after a partner for it has been found, at which point our job for that edge is done and we no longer need to use its score. As by assumption ff is the first potential partner for ee encountered in the Euler tour, the score of ee has not been modified at this point. Thus by Lemma 12 it holds that cost(e,f)=cost(f)+e.score\mathrm{cost}(e,f)=\mathrm{cost}(f)+e.\mathrm{score}. This means that ee will eventually be found in the repeat loop on line 7 of the function Above(ff).

Let us now analyze the running time. Computing cost(e)\mathrm{cost}(e) for each edge ee can be done in time 𝒪(m+n)\mathcal{O}(m+n) by Lemma 3. Before proceeding with the traversal, we gather, for each node xx, all edges {u,v}E(G)\{u,v\}\in E(G) such that lca(u,v)=x\mathrm{lca}(u,v)=x. This can be done in 𝒪(m)\mathcal{O}(m) time by constructing in 𝒪(n)\mathcal{O}(n) time a constant-time LCA structure [BF00], and iterating over the edges. Next consider a call to the Below function. Here we make a single CatTopTwo query which takes time 𝒪(logn)\mathcal{O}(\log n) by Lemma 7. Thus over the entire Euler tour these queries contribute 𝒪(nlogn)\mathcal{O}(n\log n) to the running time. In the Above function all CatTopTwo queries in the repeat loop except for the last one (when noMore becomes true) will result in de-activating an edge. Thus again the total query time over the Euler tour is 𝒪(nlogn)\mathcal{O}(n\log n).

Finally consider the cost of updating the scores in the Euler tour. As discussed earlier, over the course of the Euler tour this requires doing 2 calls to AddPath for every edge of GG. Each AddPath call can be done in time 𝒪(log2n)\mathcal{O}(\log^{2}n) by Lemma 7, thus the overall time for this is 𝒪(mlog2n)\mathcal{O}(m\log^{2}n), which dominates the complexity of the algorithm. ∎

4.2 Independent edges

The goal now is to find, for every edge eE(T)e\in E(T), a partner fE(T)f\in E(T) such that e,fe,f are independent, or decide that there is no such ff. As we chose the root of TT to have degree 1, by Proposition 8 we do not have to worry about trivial cuts in the independent edge case. Instead of considering all edges eE(T)e\in E(T) one-by-one, we first find a heavy path decomposition of TT and then iterate over all pairs of heavy paths h,hh,h^{\prime} to look for a partner fhf\in h^{\prime} for every ehe\in h. We cannot literally carry out this plan as the number of pairs of heavy paths can be Ω(n2)\Omega(n^{2}) and so we cannot explicitly consider every pair. We show next that many pairs h,hh,h^{\prime} result in a trivial case and that all these trivial pairs can be solved together in one batch. We then bound the number of non-trivial pairs and show that in near-linear time we can explicitly process all of them. The idea of processing pairs of heavy paths, and explicitly considering only the non-trivial ones, was introduced in the context of 2-respecting cuts by Mukhopadhyay and Nanongkai [MN20] (see also [GMW20]).

Consider two distinct heavy paths h,hh,h^{\prime}, where hh is the path u1u2uqu_{1}-u_{2}-\cdots-u_{q} and hh^{\prime} is the path v1v2vqv_{1}-v_{2}-\cdots-v_{q^{\prime}}. We let ei=(ui,ui+1)e_{i}=(u_{i},u_{i+1}) for i=1,,q1i=1,\ldots,q-1 and fi=(vi,vi+1)f_{i}=(v_{i},v_{i+1}) for i=1,,q1i=1,\ldots,q^{\prime}-1. It can be that not all pairs ei,fje_{i},f_{j} are independent, see Fig. 1. However, we can easily identify the subpaths of h,hh,h^{\prime} containing pairwise independent edges in constant time by computing the least common ancestor vv of the tails of h,hh,h^{\prime}. If v=vpv=v_{p^{\prime}} lies on hh^{\prime} then ei,fje_{i},f_{j} will be independent for 1i<q1\leq i<q and pj<qp^{\prime}\leq j<q^{\prime}, and similarly if vv lies on hh. In general we assume that p,pp,p^{\prime} have been determined so that ei,fje_{i},f_{j} are independent for all pi<qp\leq i<q and pj<qp^{\prime}\leq j<q^{\prime}, and that these pairs comprise all of the independent pairs on h,hh,h^{\prime}. We can associate to h,hh,h^{\prime} a (q1)(q-1)-by-(q1)(q^{\prime}-1) matrix M(h,h)M^{(h,h^{\prime})} where for pi<qp\leq i<q and pj<qp^{\prime}\leq j<q^{\prime}

M(h,h)[i,j]\displaystyle M^{(h,h^{\prime})}[i,j] =cost(ei,fj)\displaystyle=\mathrm{cost}(e_{i},f_{j})
=cost(ei)+cost(fj)2w(ei,fj),\displaystyle=\mathrm{cost}(e_{i})+\mathrm{cost}(f_{j})-2w(e_{i}^{\downarrow},f_{j}^{\downarrow})\enspace, (4)

and M(h,h)M^{(h,h^{\prime})} is undefined otherwise. (We could restrict M(h,h)M^{(h,h^{\prime})} to the submatrix on which it is defined, but find it notationally easier for the i,ji,j indices in M(h,h)M^{(h,h^{\prime})} to match the edge labels.) By Lemma 3, all values of cost(e)\mathrm{cost}(e) can be computed in 𝒪(m)\mathcal{O}(m) total time. To efficiently evaluate M(h,h)M^{(h,h^{\prime})}, we will prepare a list L(h,h)L(h,h^{\prime}) of all edges that contribute to w(e,f)w(e^{\downarrow},f^{\downarrow}) for independent e,fe,f with eh,fhe\in h,f\in h^{\prime}. For many h,hh,h^{\prime} the list L(h,h)L(h,h^{\prime}) will be empty, leading to the trivial case mentioned above. The following lemma bounds the size of all the non-empty lists and shows they can be constructed efficiently.
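To make the cost identity in (4) concrete, here is a small self-contained sanity check in Python on a toy weighted graph. The tree, the weights, and all function names are our own illustration; the cut weights are computed by brute force.

```python
# Sanity check of (4): cost(e, f) = cost(e) + cost(f) - 2 w(e_down, f_down)
# for independent tree edges e, f. Vertex 0 is the root and has tree degree 1.
tree = [(0, 1), (1, 2), (2, 3), (1, 4), (4, 5)]
weights = {frozenset(e): 1 for e in tree}   # graph = tree edges (weight 1) ...
weights[frozenset({3, 5})] = 2              # ... plus two non-tree edges
weights[frozenset({2, 4})] = 1
children = {0: [1], 1: [2, 4], 2: [3], 3: [], 4: [5], 5: []}

def subtree(edge):
    # Vertex set below tree edge (parent, child): e_down in the text's notation.
    stack, seen = [edge[1]], set()
    while stack:
        v = stack.pop()
        seen.add(v)
        stack.extend(children[v])
    return seen

def cut_weight(shore):
    # Total weight of graph edges with exactly one endpoint in `shore`.
    return sum(w for e, w in weights.items() if len(e & shore) == 1)

def cross_weight(a, b):
    # w(a, b): total weight of graph edges with one endpoint in a and one in b.
    return sum(w for e, w in weights.items()
               if len(e & a) == 1 and len(e & b) == 1)

e, f = (1, 2), (1, 4)           # independent: neither subtree contains the other
ed, fd = subtree(e), subtree(f)
lhs = cut_weight(ed | fd)        # cost(e, f): the shore of cut(e, f) is e_down ∪ f_down
rhs = cut_weight(ed) + cut_weight(fd) - 2 * cross_weight(ed, fd)
```

On this example both sides evaluate to the same value, as the identity predicts.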

Lemma 14.

The total length of all lists L(h,h)L(h,h^{\prime}) is 𝒪(mlog2n)\mathcal{O}(m\log^{2}n) and all non-empty lists L(h,h)L(h,h^{\prime}) can be constructed deterministically in time 𝒪(mlog2n)\mathcal{O}(m\log^{2}n).

Proof.

Observe that an edge {u,v}E\{u,v\}\in E can contribute to w(e,f)w(e^{\downarrow},f^{\downarrow}) for independent e,fe,f with eh,fhe\in h,f\in h^{\prime} only if uu is in the subtree rooted at the head of hh and vv is in the subtree rooted at the head of hh^{\prime}. There are at most logn\log n heavy paths intersecting the path from uu to the root and from vv to the root, and we can iterate over all such heavy paths in time proportional to their number (for example, by storing, for each edge of TT, a pointer to the head of the heavy path that contains it). Thus, for each edge {u,v}\{u,v\} we can iterate over all relevant pairs h,hh,h^{\prime} in 𝒪(log2n)\mathcal{O}(\log^{2}n) time, adding a triple (h,h,{u,v})(h,h^{\prime},\{u,v\}) to an auxiliary list in which the heavy paths are identified by their heads. The total size of the auxiliary list is now 𝒪(mlog2n)\mathcal{O}(m\log^{2}n) and it can be lexicographically sorted in the same time with radix sort. After sorting, each non-empty list L(h,h)L(h,h^{\prime}) constitutes a contiguous fragment of the auxiliary list. ∎
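The triple-generation step in this proof can be sketched as follows. This Python fragment hard-codes a heavy path decomposition of a small tree (the `head` and `parent` maps, and all other names, are our own illustration) and, for each graph edge, enumerates the heavy paths above both endpoints; a plain sort stands in for the radix sort of the lemma.

```python
# Heavy paths are identified by their head vertex; head[v] is the head of the
# heavy path containing v, and parent maps each vertex to its tree parent.
parent = {0: None, 1: 0, 2: 1, 3: 2, 4: 1, 5: 4}
head = {0: 0, 1: 0, 2: 0, 3: 0, 4: 4, 5: 4}   # heavy path 0-1-2-3, side path 4-5

def paths_above(v):
    # All heavy paths intersecting the path from v to the root;
    # O(log n) of them in a genuine heavy path decomposition.
    out = []
    while v is not None:
        h = head[v]
        out.append(h)
        v = parent[h]
    return out

def triples(graph_edges):
    out = []
    for u, v in graph_edges:
        for h in paths_above(u):
            for hp in paths_above(v):
                if h != hp:          # only distinct path pairs are relevant
                    out.append((h, hp, (u, v)))
    out.sort()                        # stand-in for the radix sort
    return out

L = triples([(3, 5), (2, 4)])
```

After sorting, the triples for a fixed pair of heavy paths are contiguous, which is exactly how the non-empty lists L(h,h') are read off.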

We can now describe how to find a partner ff for every ee such that e,fe,f are independent. The algorithm first solves together in one batch the case where the partner of ehe\in h is in a heavy path hh^{\prime} where L(h,h)L(h,h^{\prime}) is empty. After that we explicitly consider all h,hh,h^{\prime} with L(h,h)L(h,h^{\prime}) non-empty. We consider these two cases in the next two subsections.

4.2.1 Empty lists

Lemma 15.

There is a deterministic algorithm that in time 𝒪(m+n)\mathcal{O}(m+n) finds a partner for every edge eE(T)e\in E(T) that has a partner ff such that e,fe,f are independent and eh,fhe\in h,f\in h^{\prime} with L(h,h)L(h,h^{\prime}) empty.

Proof.

The key observation is that if L(h,h)L(h,h^{\prime}) is empty then cost(e,f)=cost(e)+cost(f)\mathrm{cost}(e,f)=\mathrm{cost}(e)+\mathrm{cost}(f) by Eq. (4). As can be seen from the cost formulas for descendant and for independent edge pairs, for any edge ff^{\prime} it always holds that cost(e,f)cost(e)+cost(f)\mathrm{cost}(e,f^{\prime})\leq\mathrm{cost}(e)+\mathrm{cost}(f^{\prime}), whether e,fe,f^{\prime} are in a descendant relationship or independent. Thus in this case it suffices to find any ff^{\prime} of color different from ee such that cost(e)+cost(f)β\mathrm{cost}(e)+\mathrm{cost}(f^{\prime})\leq\beta and cut(e,f)\mathrm{cut}(e,f^{\prime}) is non-trivial, as this ensures cost(e,f)β\mathrm{cost}(e,f^{\prime})\leq\beta. We are guaranteed that such an ff^{\prime} exists as ff itself satisfies these conditions.

By Lemma 3 we can compute cost(f)\mathrm{cost}(f^{\prime}) for every fE(T)f^{\prime}\in E(T) in time 𝒪(m)\mathcal{O}(m). Then in time 𝒪(n)\mathcal{O}(n) with one pass over E(T)E(T) we compute the edge f1f_{1} of lowest cost and the edge f2f_{2} of lowest cost that is of color different to f1f_{1}. We then repeat this categorical top two query twice more, each time excluding all previously found edges. At the end we obtain edges f1,,f6f_{1},\ldots,f_{6}. We claim that for every ee, at least one of these must be a valid partner.

Consider any particular ee. The first categorical top two query can only fail to find a valid partner for ee if one of f1,f2f_{1},f_{2} creates a trivial cut with ee. In this case, the second categorical top two query can only fail if one of f3,f4f_{3},f_{4} creates a trivial cut with ee as well. By Proposition 8, however, there are at most two possible edges that can create a trivial cut with ee, thus in this case the third categorical top two query must succeed and we find a valid partner for ee. ∎
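The three rounds of categorical top-two queries can be sketched as follows. Here `cat_top_two` is our own naive stand-in for the data-structure query (a real implementation would answer it in logarithmic time), and the edge names, colors, and costs are purely illustrative.

```python
def cat_top_two(edges, excluded):
    # Naive categorical top-two: the cheapest edge, plus the cheapest edge of
    # a different color, among edges not yet excluded.
    pool = sorted((e for e in edges if e["name"] not in excluded),
                  key=lambda e: e["cost"])
    if not pool:
        return []
    first = pool[0]
    second = next((e for e in pool if e["color"] != first["color"]), None)
    return [first] if second is None else [first, second]

def six_candidates(edges):
    # Three rounds, each excluding everything found so far. Since at most two
    # edges can form a trivial cut with a fixed e, some round must succeed.
    found, excluded = [], set()
    for _ in range(3):
        batch = cat_top_two(edges, excluded)
        found += batch
        excluded |= {e["name"] for e in batch}
    return found

edges = [{"name": i, "color": c, "cost": w}
         for i, (c, w) in enumerate([("r", 3), ("b", 1), ("r", 2),
                                     ("b", 5), ("r", 4), ("b", 6), ("r", 7)])]
cands = six_candidates(edges)
```

On this toy input the three rounds return the edges named 1, 2, 0, 3, 4, 5, in order of increasing cost within each round.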

4.2.2 Non-empty lists

The more difficult case is to find partners among pairs h,hh,h^{\prime} with L(h,h)L(h,h^{\prime}) non-empty. To solve this case we will use the special structure of M(h,h)M^{(h,h^{\prime})}. As above, say that hh is the path u1u2uqu_{1}-u_{2}-\cdots-u_{q} and hh^{\prime} is the path v1v2vqv_{1}-v_{2}-\cdots-v_{q^{\prime}}, and let ei=(ui,ui+1)e_{i}=(u_{i},u_{i+1}) for i=1,,q1i=1,\ldots,q-1 and fi=(vi,vi+1)f_{i}=(v_{i},v_{i+1}) for i=1,,q1i=1,\ldots,q^{\prime}-1. Further suppose ei,fje_{i},f_{j} are independent for all pi<q,pj<qp\leq i<q,p^{\prime}\leq j<q^{\prime}. We have that M(h,h)[i,j]=cost(ei)+cost(fj)2w(ei,fj)M^{(h,h^{\prime})}[i,j]=\mathrm{cost}(e_{i})+\mathrm{cost}(f_{j})-2w(e_{i}^{\downarrow},f_{j}^{\downarrow}) for pi<q,pj<qp\leq i<q,p^{\prime}\leq j<q^{\prime}. Recall that L(h,h)L(h,h^{\prime}) is defined precisely as the list of edges that contribute to w(e,f)w(e^{\downarrow},f^{\prime\downarrow}) for independent eh,fhe\in h,f^{\prime}\in h^{\prime}. The contribution of a specific edge {u,v}L(h,h)\{u,v\}\in L(h,h^{\prime}) can be understood as follows: let uiu_{i} be the lowest common ancestor of uu and uqu_{q}, and vjv_{j} be the lowest common ancestor of vv and vqv_{q^{\prime}}. Then the weight of {u,v}\{u,v\} contributes to M(h,h)[a,b]M^{(h,h^{\prime})}[a,b] for every paip\leq a\leq i, pbjp^{\prime}\leq b\leq j. This is depicted in Fig. 1. We will compute these indices ii and jj for every {u,v}L(h,h)\{u,v\}\in L(h,h^{\prime}). This takes constant time per edge using an appropriate LCA structure [BF00], and so total time 𝒪(|L(h,h)|)\mathcal{O}(|L(h,h^{\prime})|). Let (h,h)={(i,j){u,v}L(h,h),ui=lca(u,uq),vj=lca(v,vq)}\mathcal{L}(h,h^{\prime})=\{(i,j)\mid\{u,v\}\in L(h,h^{\prime}),u_{i}=\mathrm{lca}(u,u_{q}),v_{j}=\mathrm{lca}(v,v_{q^{\prime}})\} denote the resulting list of index pairs, each of which has an associated weight.

Refer to caption
Figure 1: Contribution of an edge {u,v}L(h,h)\{u,v\}\in L(h,h^{\prime}) (denoted in green on the left) to M(h,h)[,]M^{(h,h^{\prime})}[\cdot,\cdot] (denoted in grey on the right).
Lemma 16.

Let ={eh,h,f:eh,fh,e,f are partners and L(h,h) non-empty}\mathcal{F}=\{e\mid\exists h,h^{\prime},f:e\in h,f\in h^{\prime},e,f\text{ are partners and }L(h,h^{\prime})\text{ non-empty}\}. There is a deterministic algorithm to find a partner for every ee\in\mathcal{F} in time 𝒪(mlog3n)\mathcal{O}(m\log^{3}n).

Proof.

The algorithm is given in Algorithm 4. We describe the algorithm here and analyze its correctness and running time.

For every heavy path hh let AhA_{h} be an array with Ah[e].score=cost(e)A_{h}[e].\mathrm{score}=\mathrm{cost}(e) and Ah[e].color=color(e)A_{h}[e].\mathrm{color}=\mathrm{color}(e) for every ehe\in h. Via Lemma 6 there is a data structure that supports path updates and CatTopTwo queries to AhA_{h} in 𝒪(logn)\mathcal{O}(\log n) time. The total time for this initialization step is 𝒪(n)\mathcal{O}(n).

Let LL be an ordered list of pairs that contains (h,h)(h,h^{\prime}) and (h,h)(h^{\prime},h) for every h,hh,h^{\prime} with L(h,h)L(h,h^{\prime}) non-empty. We sort LL by the name of the first path with radix sort in 𝒪(mlog2n)\mathcal{O}(m\log^{2}n) time. We will follow LL to iterate over all h,hh,h^{\prime} with L(h,h)L(h,h^{\prime}) non-empty.

Let us describe what the algorithm does when considering pairs h,hh,h^{\prime} where hh consists of the edges e1,,eq1e_{1},\ldots,e_{q-1} and hh^{\prime} consists of the edges f1,,fq1f_{1},\ldots,f_{q^{\prime}-1}, where edges ei,fje_{i},f_{j} are independent for pi<q,pj<qp\leq i<q,p^{\prime}\leq j<q^{\prime}, and these comprise all the independent pairs in h,hh,h^{\prime}. We iterate over the columns of M(h,h)M^{(h,h^{\prime})}, starting from q1q^{\prime}-1 and going until pp^{\prime}, and maintain the invariant that, when considering column jj, it holds that Ah[i].score=M(h,h)[i,j]cost(fj)A_{h}[i].\mathrm{score}=M^{(h,h^{\prime})}[i,j]-\mathrm{cost}(f_{j}) for every active edge with index pi<qp\leq i<q (the notion of active is defined below). We postpone describing how to maintain this invariant for the moment. Then we do a CatTopTwo query on AhA_{h} which returns potential candidates ea,ebe_{a},e_{b}. If there is an edge ehe\in h for which fjf_{j} is a valid partner then fjf_{j} must be a partner for either eae_{a} or ebe_{b}. This can be checked in constant time. If fjf_{j} is not a partner for either then we move on to column j1j-1; if it is a partner for, say, eae_{a}, then we add β+1\beta+1 to Ah[a].scoreA_{h}[a].\mathrm{score} to “de-activate” eae_{a} and repeat the process by doing a CatTopTwo query again on AhA_{h} until no valid partner is returned.

The basic algorithm we have described considers every column of M(h,h)M^{(h,h^{\prime})} from q1q^{\prime}-1 to pp^{\prime}. We now show how to accelerate this algorithm by restricting our attention to a subset of the columns of M(h,h)M^{(h,h^{\prime})} in this interval. Let Kh,h=|L(h,h)|K_{h,h^{\prime}}=|L(h,h^{\prime})|. We sort the pairs in (h,h)\mathcal{L}(h,h^{\prime}) by the second coordinate in time 𝒪(Kh,hlogKh,h)=𝒪(Kh,hlogn)\mathcal{O}(K_{h,h^{\prime}}\log K_{h,h^{\prime}})=\mathcal{O}(K_{h,h^{\prime}}\log n). Let J1<<JtJ_{1}<\cdots<J_{t} be the distinct values of the second coordinate that appear in this sorted list, where tKh,ht\leq K_{h,h^{\prime}}. Set J0=p1J_{0}=p^{\prime}-1 and Jt+1=q1J_{t+1}=q^{\prime}-1. For Jk<jJk+1J_{k}<j\leq J_{k+1} we have that w(ei,fj)w(e_{i}^{\downarrow},f_{j}^{\downarrow}) is constant over jj for every pi<qp\leq i<q by the definition of (h,h)\mathcal{L}(h,h^{\prime}). We call such an interval a void interval. Thus the minimum of M(h,h)[i,j]M^{(h,h^{\prime})}[i,j] over Jk<jJk+1J_{k}<j\leq J_{k+1} necessarily occurs at j=argminJk<jJk+1cost(fj)j^{*}=\operatorname*{arg\,min}_{J_{k}<j\leq J_{k+1}}\mathrm{cost}(f_{j}). This means that if edge eihe_{i}\in h has a partner fjf_{j} with Jk<jJk+1J_{k}<j\leq J_{k+1} then one of the two edges returned by a CatTopTwo(Jk+1,Jk+1)(J_{k}+1,J_{k+1}) to AhA_{h^{\prime}} must be a partner for eie_{i}.
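The partition of the column range into void intervals can be sketched as follows; this is our own illustrative Python, taking only the second coordinates of the pairs in L(h,h') together with the boundary indices p' and q'.

```python
def void_intervals(second_coords, p_prime, q_prime):
    # second_coords: the second coordinates j of the pairs in L(h, h').
    # Returns the intervals (J_k, J_{k+1}] covering [p', q'-1]; within each,
    # w(e_i_down, f_j_down) is constant in j, so only one column per interval
    # needs to be inspected.
    Js = sorted(set(second_coords))
    bounds = [p_prime - 1] + Js           # J_0 = p' - 1
    if not Js or Js[-1] != q_prime - 1:
        bounds.append(q_prime - 1)        # J_{t+1} = q' - 1
    return [(bounds[k], bounds[k + 1]) for k in range(len(bounds) - 1)]

ivals = void_intervals([3, 5, 3], p_prime=2, q_prime=8)   # duplicates collapse
```

The number of intervals is at most |L(h,h')| + 1, which is what makes the accelerated sweep fast.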

We can thus amend the algorithm to the following. For J=Jt+1,Jt,,J1J=J_{t+1},J_{t},\ldots,J_{1} we iterate over the endpoints of the void intervals. When J=JkJ=J_{k} we maintain the invariant that Ah[i].score=M(h,h)[i,Jk]cost(fJk)A_{h}[i].\mathrm{score}=M^{(h,h^{\prime})}[i,J_{k}]-\mathrm{cost}(f_{J_{k}}). Thus for all Jk1<jJkJ_{k-1}<j\leq J_{k} it holds that M(h,h)[i,j]=Ah[i].score+cost(fj)M^{(h,h^{\prime})}[i,j]=A_{h}[i].\mathrm{score}+\mathrm{cost}(f_{j}). We then do a CatTopTwo query on Ah[p:q1]A_{h}[p:q-1] and a CatTopTwo query on AhA_{h^{\prime}} with the interval (Jk1,Jk](J_{k-1},J_{k}]. If any ehe\in h has a partner fjf_{j} with Jk1<jJkJ_{k-1}<j\leq J_{k} then at least one of the four pairs formed from the returned edges must be a pair of partners. We de-activate any ehe\in h which finds a partner by adding β+1\beta+1 to its score and repeat the process until no valid partners are found, at which point we move on to the next void interval. If PP partners are found then the total time spent in this void interval will be 𝒪((P+1)logn)\mathcal{O}((P+1)\log n) for the CatTopTwo queries and updates to de-activate edges.

It remains to describe how to maintain the invariant Ah[i].score=M(h,h)[i,Jk]cost(fJk)A_{h}[i].\mathrm{score}=M^{(h,h^{\prime})}[i,J_{k}]-\mathrm{cost}(f_{J_{k}}) for all pi<qp\leq i<q when J=JkJ=J_{k}. To do this, for every pair (i,j)(i,j) in (h,h)\mathcal{L}(h,h^{\prime}) with j=Jkj=J_{k} and associated weight ww we subtract 2w2w from AhA_{h} in the interval [p,i][p,i]. Each such interval update can be done in time 𝒪(logn)\mathcal{O}(\log n) by Lemma 6, so the total time for all updates is 𝒪(Kh,hlogn)\mathcal{O}(K_{h,h^{\prime}}\log n).
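As a sanity check of this column-sweep invariant, the following toy Python fragment builds a small matrix M directly from a hypothetical list of index pairs and verifies, while sweeping columns right to left and applying the range updates, that A_h[i] equals M[i,j] - cost(f_j) at every column. All names and numbers here are our own; a real implementation would perform the range updates with the data structure of Lemma 6 rather than a loop.

```python
# Rows are p..q-1 and columns p'..q'-1 in the text's notation.
p, q, pp, qq = 1, 4, 1, 4
coste = {1: 5, 2: 7, 3: 6}          # cost(e_i)
costf = {1: 4, 2: 8, 3: 3}          # cost(f_j)
pairs = {(2, 2): 1, (3, 1): 2}      # index pairs of L(h, h') with weights

def M(i, j):
    # Pair (i0, j0) contributes its weight to every cell (a, b)
    # with a <= i0 and b <= j0, matching Eq. (4).
    w = sum(wt for (i0, j0), wt in pairs.items() if i0 >= i and j0 >= j)
    return coste[i] + costf[j] - 2 * w

A = dict(coste)                      # A_h[i].score, initially cost(e_i)
ok = True
for j in range(qq - 1, pp - 1, -1):  # sweep columns from right to left
    for (i0, j0), wt in pairs.items():
        if j0 == j:                  # range update over rows [p, i0]
            for a in range(p, i0 + 1):
                A[a] -= 2 * wt
    # invariant: A[i] == M[i, j] - cost(f_j) for every row i
    ok = ok and all(A[i] == M(i, j) - costf[j] for i in range(p, q))
```

Since updates only accumulate as j decreases, reversing them at the end (as the proof does) restores A to the initial cost values.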

Once we finish processing hh^{\prime}, we reverse all of the interval updates (but not the edge de-activations) so that we again have Ah[i].score=cost(ei)A_{h}[i].\mathrm{score}=\mathrm{cost}(e_{i}) for all active edges eie_{i}. This again can be done in time 𝒪(Kh,hlogn)\mathcal{O}(K_{h,h^{\prime}}\log n). Once we finish processing all pairs hh^{\prime} associated with hh we subtract β+1\beta+1 from Ah[e].scoreA_{h}[e].\mathrm{score} for all edges ee that were de-activated to make them active again.

The total number of edge de-activations is at most nn thus this contributes 𝒪(nlogn)\mathcal{O}(n\log n) to the running time and is low order. Over all h,hh,h^{\prime} the total time spent is 𝒪(mlog3n)\mathcal{O}(m\log^{3}n). ∎

Algorithm 4 Find partners among non-empty lists
1:for all heavy paths hh do
2:     Initialize data structure AhA_{h} with scores cost(e)\mathrm{cost}(e) and colors e.colore.\mathrm{color} for all ehe\in h.
3:end for
4:Compile a list LL of all ordered pairs (h,h)(h,h^{\prime}) and (h,h)(h^{\prime},h) with L(h,h)L(h,h^{\prime}) non-empty. Sort by the name of the first path.
5:for all hh that appears as a first path in LL do
6:     for all hh^{\prime} paired with hh in LL do
7:         Compute the index list (h,h)\mathcal{L}(h,h^{\prime}) with associated weights.
8:         Partition the interval [p,q1][p^{\prime},q^{\prime}-1] into void intervals (Jk,Jk+1](J_{k},J_{k+1}] for k=0,,tk=0,\ldots,t.
9:         Set Found=\mathrm{Found}=\emptyset.
10:         for all k=tk=t to 0 do
11:              For all (i,Jk)(h,h)(i^{\prime},J_{k})\in\mathcal{L}(h,h^{\prime}), subtract twice its weight from Ah[p:i].scoreA_{h}[p:i^{\prime}].\mathrm{score}.
12:              Call CatTopTwo(Jk+1,Jk+1)(J_{k}+1,J_{k+1}) on AhA_{h^{\prime}} to obtain edges fc,fdf_{c},f_{d}.
13:              while true  do
14:                  Call CatTopTwo(p,q1)(p,q-1) on AhA_{h} to obtain edges ea,ebe_{a},e_{b}.
15:                  if fcf_{c} or fdf_{d} is a partner for eae_{a} then
16:                       Save this pair, add eae_{a} to Found\mathrm{Found}, and do Ah[ea].score+=β+1A_{h}[e_{a}].\mathrm{score}\mathrel{+}=\beta+1.
17:                  else if fcf_{c} or fdf_{d} is a partner for ebe_{b} then
18:                       Save this pair, add ebe_{b} to Found\mathrm{Found} and do Ah[eb].score+=β+1A_{h}[e_{b}].\mathrm{score}\mathrel{+}=\beta+1.
19:                  else
20:                       break
21:                  end if
22:              end while
23:         end for
24:         for all k=tk=t to 0 do
25:              For all (i,Jk)(h,h)(i^{\prime},J_{k})\in\mathcal{L}(h,h^{\prime}), add twice its weight to Ah[p:i].scoreA_{h}[p:i^{\prime}].\mathrm{score}.
26:         end for
27:     end for
28:     Subtract β+1\beta+1 from the score of all edges in Found\mathrm{Found}.
29:end for

4.3 Spanning tree algorithm

We now have all components of the spanning tree algorithm, which we can combine together to implement 𝖱𝗈𝗎𝗇𝖽𝖤𝖽𝗀𝖾𝗌\mathsf{RoundEdges}.

Lemma 17.

There is a deterministic algorithm to implement 𝖱𝗈𝗎𝗇𝖽𝖤𝖽𝗀𝖾𝗌\mathsf{RoundEdges} which runs in time 𝒪(mlog3n)\mathcal{O}(m\log^{3}n).

Proof.

Given an assignment of colors to the edges of TT, our task is to find a partner for every eE(T)e\in E(T) which has one. If ee has a partner ff such that e,fe,f are in a descendant relationship then a partner for ee can be found in time 𝒪(mlog2n)\mathcal{O}(m\log^{2}n) by Theorem 13. The other case is that ee has a partner ff such that e,fe,f are independent. This divides into two subcases. If the heavy paths h,hh,h^{\prime} containing e,fe,f respectively are such that L(h,h)L(h,h^{\prime}) is empty then we will find a partner for ee via Lemma 15 in time 𝒪(m)\mathcal{O}(m). The bottleneck of the algorithm is where L(h,h)L(h,h^{\prime}) is non-empty, in which case we use Lemma 16 to find a partner in time 𝒪(mlog3n)\mathcal{O}(m\log^{3}n). ∎

We can now prove the main theorem of this section, Theorem 9, that we can find a spanning tree of HH in time 𝒪(mlog4n)\mathcal{O}(m\log^{4}n).

Proof of Theorem 9.

Follows from Lemma 11 and Lemma 17. ∎

5 KT partition algorithm

For completeness we state here the full KT partition algorithm, including the reductions from [AL21]. At a high level, we follow Karger’s algorithm to find 𝒪(logn)\mathcal{O}(\log n) spanning trees so that with high probability every (1+ε)(1+\varepsilon)-minimum cut 2-respects at least one of them. We then use our algorithm from Theorem 9 to, for each tree TT, find a generating set for the meet of all (1+ε)(1+\varepsilon)-minimum cuts that 2-respect TT. We are then left with two problems. The first is that we still have to find the meet of the partitions in the generating set. A near-linear time randomized algorithm was given to do this in [AL21]. Here we give a deterministic algorithm to do this. Then we need to take the meet of 𝒪(logn)\mathcal{O}(\log n) partitions, one for each tree. This is simple to do and we handle this first.

Lemma 18.

Let {𝒮1,,𝒮K}\{\mathcal{S}_{1},\ldots,\mathcal{S}_{K}\} be a set of KK partitions of [n][n]. There is a deterministic algorithm to compute i=1K𝒮i\bigwedge_{i=1}^{K}\mathcal{S}_{i} in time 𝒪(Knlogn)\mathcal{O}(Kn\log n).

Proof.

In 𝒪(Knlogn)\mathcal{O}(Kn\log n) time we can assign each j[n]j\in[n] a 𝒪(Klogn)\mathcal{O}(K\log n) bit key indicating which set contains jj in each 𝒮i\mathcal{S}_{i}. Collecting together elements with the same key gives i=1K𝒮i\bigwedge_{i=1}^{K}\mathcal{S}_{i}. ∎
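A sketch of this grouping step, assuming each partition is given as a map from elements to block labels (our own representation; the keys play the role of the bit strings in the proof):

```python
from collections import defaultdict

def meet(partitions, n):
    # Each partition of [n] is a dict mapping element -> block label.
    # Elements sharing the whole K-tuple of labels form one block of the meet.
    groups = defaultdict(list)
    for j in range(n):
        groups[tuple(p[j] for p in partitions)].append(j)
    return sorted(groups.values())

P1 = {0: 0, 1: 0, 2: 0, 3: 1, 4: 1, 5: 1}   # blocks {0,1,2} | {3,4,5}
P2 = {0: 0, 1: 0, 2: 1, 3: 1, 4: 2, 5: 2}   # blocks {0,1} | {2,3} | {4,5}
M = meet([P1, P2], 6)                        # → [[0, 1], [2], [3], [4, 5]]
```

Here the keys are built in O(K) time per element; in the lemma the same grouping is done on O(K log n)-bit keys via sorting.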

Next we see how to explicitly construct the meet of all (1+ε)(1+\varepsilon)-minimum cuts that 2-respect a tree from a generating set. We follow the idea of the proof in [AL21] but make it deterministic by replacing random hashing with an appropriate data structure.

Lemma 19 (Mehlhorn, Sundar, and Uhrig [MSU97]).

A dynamic family of persistent sequences, each of length at most nn, can be maintained under the following updates. A length-1 sequence is created in constant time, and a new sequence can be obtained by joining and splitting existing sequences in 𝒪(logn(logUlogU+logn))\mathcal{O}(\log n(\log U\log^{*}U+\log n)) time, where UU is the number of updates executed so far. Each sequence ss has an associated signature sig(s)[U3]\mathrm{sig}(s)\in[U^{3}] with the property that s=ss=s^{\prime} iff sig(s)=sig(s)\mathrm{sig}(s)=\mathrm{sig}(s^{\prime}).

For the proof of the lemma it will be useful to use the following definition.

Definition 20 (separate).

Let VV be a finite set and XVX\subseteq V. For u,vVu,v\in V we say that XX separates u,vu,v if exactly one of them is in XX.

Lemma 21 (cf. [AL21, Lemma 31]).

Consider as input a tree TT on a vertex set VV of size nn, and sets of edge singletons 𝒞1E(T)\mathcal{C}_{1}\subseteq E(T) and edge pairs 𝒞2E(T)(2)\mathcal{C}_{2}\subseteq E(T)^{(2)}. These sets define sets of 1-respecting and 2-respecting cuts, respectively. There is an algorithm that in time 𝒪((n+|𝒞1|+|𝒞2|)log2nlogn)\mathcal{O}((n+|\mathcal{C}_{1}|+|\mathcal{C}_{2}|)\log^{2}n\log^{*}n) returns the meet of the bipartitions induced by these cuts.

Proof.

We root the tree at an arbitrary vertex rVr\in V. When we speak of the shore of a cut we always refer to the shore not containing rr. Arrange all elements of 𝒞1\mathcal{C}_{1} and 𝒞2\mathcal{C}_{2} in an arbitrary order to obtain a sequence of N=|𝒞1|+|𝒞2|N=|\mathcal{C}_{1}|+|\mathcal{C}_{2}| cuts. Our goal is to construct, for each node vVv\in V, a string s(v){0,1}Ns(v)\in\{0,1\}^{N} where the ii-th bit of s(v)s(v) is 11 iff the shore of the ii-th cut contains vv. Assuming that we can indeed efficiently construct such strings, the meet is obtained by grouping together nodes vv with the same string s(v)s(v). However, the difficulty is that we cannot afford to construct s(v)s(v) for all vv explicitly as this would require nNnN bits. Instead, we will use Lemma 19 for representing a collection of strings of length NN.

Consider the preorder traversal of TT starting from the root rr. By definition s(r)=0Ns(r)=0^{N}, which we create in the data structure by NN joins of 0. We then create s(v)s(v) from the string s(parent(v))s(\mathrm{parent}(v)) during the preorder traversal, where parent(v)\mathrm{parent}(v) is the parent of vv. To do this we set s(v)s(parent(v))s(v)\leftarrow s(\mathrm{parent}(v)) and then flip the bits of s(v)s(v) corresponding to cuts whose shore contains vv but not parent(v)\mathrm{parent}(v) or vice versa. Thus we need to understand when the shore of a 1- or 2-respecting cut separates vv from parent(v)\mathrm{parent}(v). The shore of a 1-respecting cut defined by edge ee is ee^{\downarrow}, and hence separates vv and parent(v)\mathrm{parent}(v) iff e={v,parent(v)}e=\{v,\mathrm{parent}(v)\}. A 2-respecting cut defined by edges {e,f}\{e,f\} separates two vertices uu and vv iff exactly one of e,fe,f is on the path from uu to vv in TT. Thus a 2-respecting cut will separate vv and parent(v)\mathrm{parent}(v) iff either e={v,parent(v)}e=\{v,\mathrm{parent}(v)\} or f={v,parent(v)}f=\{v,\mathrm{parent}(v)\}. Hence there will be at most 2N2N bit flips in total and in 𝒪(N)\mathcal{O}(N) time we can annotate the tree with which bits should be flipped at each node.

A bit flip can be implemented in the data structure by a constant number of splits, joins and the creation of a length-1 sequence. As there are 𝒪(n+N)\mathcal{O}(n+N) total operations on the data structure, the total time for all updates is 𝒪((n+N)log2nlogn)\mathcal{O}((n+N)\log^{2}n\log^{*}n) by Lemma 19.

Having obtained all the strings s(v)s(v), we can group together nodes vv with the same string s(v)s(v) by sorting their signatures sig(s(v))\mathrm{sig}(s(v)). Because each signature is a positive integer bounded by 𝒪(N3)\mathcal{O}(N^{3}) by Lemma 19, this can be implemented with radix sort in 𝒪(N)\mathcal{O}(N) time. This gives the lemma. ∎
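For intuition, here is a toy Python version of this construction in which the persistent-sequence signatures of Lemma 19 are replaced by explicit frozensets of flipped cut indices; this loses the claimed time bound but reproduces the grouping. All identifiers are our own, and a BFS order stands in for the preorder traversal (any top-down order works, since s(v) only depends on s(parent(v))).

```python
from collections import defaultdict

def meet_of_respecting_cuts(n, tree_edges, cuts, root=0):
    # cuts: each cut is a set of 1 or 2 tree edges, edges given as frozensets.
    # sig[v] plays the role of s(v): the set of cut indices whose shore
    # contains v. It differs from sig[parent(v)] exactly on the cuts that
    # contain the tree edge {v, parent(v)}.
    adj = defaultdict(list)
    for a, b in tree_edges:
        adj[a].append(b)
        adj[b].append(a)
    parent, order, seen = {root: None}, [root], {root}
    for v in order:                       # BFS gives a valid top-down order
        for w in adj[v]:
            if w not in seen:
                seen.add(w)
                parent[w] = v
                order.append(w)
    sig = {root: frozenset()}             # s(r) = 0^N
    for v in order[1:]:
        e = frozenset({v, parent[v]})
        flips = {i for i, c in enumerate(cuts) if e in c}
        sig[v] = sig[parent[v]] ^ flips   # flip the affected bits
    groups = defaultdict(list)
    for v in range(n):
        groups[sig[v]].append(v)
    return sorted(groups.values())

tree = [(0, 1), (1, 2), (1, 3)]
cuts = [{frozenset({1, 2})},                       # 1-respecting, shore {2}
        {frozenset({0, 1}), frozenset({1, 3})}]    # 2-respecting, shore {1, 2}
blocks = meet_of_respecting_cuts(4, tree, cuts)    # → [[0, 3], [1], [2]]
```

The data structure of Lemma 19 makes each such "copy and flip" update polylogarithmic instead of linear in N.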

Algorithm 5 (1+ε)(1+\varepsilon)-KT partition

Input: A weighted graph G=(V,E,w)G=(V,E,w) and a parameter 0ε1/160\leq\varepsilon\leq 1/16
      Output: (1+ε)(1+\varepsilon)-KT partition

1:Use Karger’s tree packing algorithm to construct a set of K𝒪(logn)K\in\mathcal{O}(\log n) spanning trees 𝒯={T1,,TK}\mathcal{T}=\{T_{1},\ldots,T_{K}\} so that with high probability every (1+ε)(1+\varepsilon)-minimum cut 2-respects at least one of them (Theorem 2).
2:Compute the weight of a minimum 2-respecting cut for each tree in 𝒯\mathcal{T} by Lemma 4, and let λ\lambda be the minimum value found.
3:for i=1,2,,Ki=1,2,\dots,K do
4:     Find the set 𝒜i={eE(Ti)cost(e)(1+ε)λ}\mathcal{A}_{i}=\{e\in E(T_{i})\mid\mathrm{cost}(e)\leq(1+\varepsilon)\lambda\} indexing the 1-respecting near-minimum cuts of TiT_{i} by Lemma 3.
5:     Use Theorem 9 with tree TiT_{i} to find a spanning forest THiT_{H_{i}} of the graph HiH_{i} with edge set {{e,f}:e,fE(Ti),cost(e,f)β,cut(e,f) non-trivial}\{\{e,f\}:e,f\in E(T_{i}),\mathrm{cost}(e,f)\leq\beta,\mathrm{cut}(e,f)\text{ non-trivial}\}. Set i={{e,f}E(THi)}\mathcal{B}_{i}=\{\{e,f\}\in E(T_{H_{i}})\}.
6:     Use Lemma 21 to construct the partition 𝒮i\mathcal{S}_{i} induced by the cuts indexed by 𝒜i\mathcal{A}_{i} and i\mathcal{B}_{i}.
7:end for
8:Output the meet 𝒮=i=1K𝒮i\mathcal{S}=\bigwedge_{i=1}^{K}\mathcal{S}_{i} by Lemma 18.

We are now ready to prove our main theorem, Theorem 1.

Proof.

We first prove the theorem for εnt\bigwedge\mathcal{B}_{\varepsilon}^{nt}. The algorithm for computing εnt\bigwedge\mathcal{B}_{\varepsilon}^{nt} is given in Algorithm 5. Let us first argue the correctness. Step 1 succeeds with high probability by Theorem 2, and the rest of the algorithm is deterministic. Thus if we show that the algorithm is correct assuming that Step 1 succeeds, then the algorithm will be correct with high probability.

Let us now assume that Step 1 succeeds. Then λ=λ(G)\lambda=\lambda(G) in Step 2. Let 𝒯i\mathcal{T}_{i} be the set of bipartitions of all non-trivial (1+ε)(1+\varepsilon)-minimum cuts of GG that 2-respect TiT_{i}, for i=1,,Ki=1,\ldots,K. We have that i𝒯i=εnt\cup_{i}\mathcal{T}_{i}=\mathcal{B}_{\varepsilon}^{nt}. Therefore

εnt=i=1K𝒯i.\bigwedge\mathcal{B}_{\varepsilon}^{nt}=\bigwedge_{i=1}^{K}\bigwedge\mathcal{T}_{i}\enspace.

For each ii we have (𝒜ii)=𝒯i\bigwedge(\mathcal{A}_{i}\cup\mathcal{B}_{i})=\bigwedge\mathcal{T}_{i} by the correctness of our main algorithm Theorem 9 and Lemma 5. We compute (𝒜ii)\bigwedge(\mathcal{A}_{i}\cup\mathcal{B}_{i}) via Lemma 21. Finally, we compute i=1K𝒯i\bigwedge_{i=1}^{K}\bigwedge\mathcal{T}_{i} in Step 8 by Lemma 18.

Now let us go over the time complexity. Step 1 runs in time 𝒪(mlog2(n)+nlog4(n))\mathcal{O}(m\log^{2}(n)+n\log^{4}(n)) by Theorem 2. Step 2 takes time 𝒪(mlog2n)\mathcal{O}(m\log^{2}n) by Lemma 4. In the for loop, Step 4 takes time 𝒪(m)\mathcal{O}(m) by Lemma 3; Step 5 takes time 𝒪(mlog4n)\mathcal{O}(m\log^{4}n) by Theorem 9; Step 6 takes time 𝒪(nlog2nlogn)\mathcal{O}(n\log^{2}n\log^{*}n) by Lemma 21. Thus the time in the for loop is dominated by Step 5, and the total time taken over the K=𝒪(logn)K=\mathcal{O}(\log n) iterations is 𝒪(mlog5n)\mathcal{O}(m\log^{5}n). The last step takes time 𝒪(nlog2(n))\mathcal{O}(n\log^{2}(n)). Thus the complexity overall is 𝒪(mlog5n)\mathcal{O}(m\log^{5}n).

To finish the proof of the theorem let us handle the case of ε\bigwedge\mathcal{B}_{\varepsilon}. We claim that given the value of λ(G)\lambda(G) we can compute ε\bigwedge\mathcal{B}_{\varepsilon} from εnt\bigwedge\mathcal{B}_{\varepsilon}^{nt} deterministically in 𝒪(m)\mathcal{O}(m) time. In 𝒪(m)\mathcal{O}(m) time we can identify the set Z={vV:ΔG({v})(1+ε)λ(G)}Z=\{v\in V:\Delta_{G}(\{v\})\leq(1+\varepsilon)\lambda(G)\}. Let 𝒟ε={{v,Vv}:vZ}\mathcal{D}_{\varepsilon}=\{\{v,V\setminus v\}:v\in Z\} be the corresponding set of bipartitions and note that ε=(εnt)(𝒟ε)\bigwedge\mathcal{B}_{\varepsilon}=\left(\bigwedge\mathcal{B}_{\varepsilon}^{nt}\right)\wedge\left(\bigwedge\mathcal{D}_{\varepsilon}\right). The meet 𝒟ε\bigwedge\mathcal{D}_{\varepsilon} is simply the partition consisting of the sets {v}\{v\} for vZv\in Z and VZV\setminus Z. To take the meet of this partition with 𝒫=εnt\mathcal{P}=\bigwedge\mathcal{B}_{\varepsilon}^{nt} we simply cycle through each S𝒫S\in\mathcal{P} and split SS into the sets {v}\{v\} for vSZv\in S\cap Z and SZS\setminus Z, which can be done in time 𝒪(n)\mathcal{O}(n). Thus the total time of computing ε\bigwedge\mathcal{B}_{\varepsilon} is dominated by the computation of εnt\bigwedge\mathcal{B}_{\varepsilon}^{nt}, and can be done asymptotically in the same time. ∎

6 Applications

In this section we give two applications of our main result: an improved quantum algorithm for minimum cut in weighted graphs in the adjacency list model, and a new randomized algorithm with running time 𝒪(m+nlog6n)\mathcal{O}(m+n\log^{6}n) to compute the edge connectivity of a simple graph.

6.1 Quantum algorithm for minimum cut in weighted graphs

In a recent work by Apers and Lee [AL21] the quantum complexity of the minimum cut problem was studied. They distinguish two models for querying a weighted graph as an input. In the adjacency matrix model a query is a pair of vertices i,jVi,j\in V and the answer to the query reveals whether {i,j}E\{i,j\}\in E, and if so, also returns the weight w({i,j})w(\{i,j\}). In the adjacency list model a query is a vertex iVi\in V and an integer k[n]k\in[n], and the answer to the query is the kk-th neighbor jj of vertex ii (if it exists) and the corresponding weight w({i,j})w(\{i,j\}). The main results from [AL21] depend on the edge-weight ratio τ\tau, defined as the ratio of the maximum edge weight over the minimum edge weight. These results can be summarized as follows:

  • In the adjacency matrix model, finding a minimum cut of a weighted graph with edge-weight ratio τ\tau has quantum query and time complexity Θ~(n3/2τ)\widetilde{\Theta}(n^{3/2}\sqrt{\tau}). This compares to the Θ(n2)\Theta(n^{2}) query complexity of any classical algorithm for minimum cut in this model [DHHM06].

  • In the adjacency list model, a minimum cut of a weighted graph with edge-weight ratio τ\tau can be found with quantum query complexity 𝒪~(mnτ)\widetilde{\mathcal{O}}(\sqrt{mn\tau}) and quantum time complexity 𝒪~(mnτ+n3/2)\widetilde{\mathcal{O}}(\sqrt{mn\tau}+n^{3/2}). There are also lower bounds of Ω(n3/2)\Omega(n^{3/2}) for τ>1\tau>1 and Ω(τn)\Omega(\tau n) for 1τn1\leq\tau\leq n. This compares to the Θ(m)\Theta(m) query complexity of any classical algorithm for minimum cut in this model [BGMP21].

While this fully resolves the quantum complexity of minimum cut in the adjacency matrix model, there are two apparent gaps in the adjacency list model. On the one hand there is a gap between the upper and lower bounds on the quantum query complexity. On the other hand there is a gap between the upper bounds on the quantum query complexity and the quantum time complexity. Using our new result (Theorem 1) we can close this second gap.

Let κ(n)\kappa(n) denote the (quantum) time complexity for finding a (1+ε)(1+\varepsilon)-KT partition of a weighted graph with nn vertices and 𝒪~(n)\widetilde{\mathcal{O}}(n) edges. The following lemma is proven in [AL21].

Lemma 22 ([AL21, Lemma 22]).

Let GG be a weighted graph with nn vertices, mm edges, and edge-weight ratio τ\tau. There is a quantum algorithm to compute the weight and shores of a minimum cut of GG with time complexity κ(n)+𝒪~(mnτ)\kappa(n)+\widetilde{\mathcal{O}}(\sqrt{mn\tau}) in the adjacency list model.

In [AL21] a quantum algorithm was proposed for finding the KT partition of a weighted graph with mm edges in time 𝒪~(m+n3/2)\widetilde{\mathcal{O}}(m+n^{3/2}), giving an upper bound κ(n)𝒪~(n3/2)\kappa(n)\in\widetilde{\mathcal{O}}(n^{3/2}) and an upper bound 𝒪~(mnτ+n3/2)\widetilde{\mathcal{O}}(\sqrt{mn\tau}+n^{3/2}) on the quantum time complexity. Our main result gives a classical algorithm that improves this upper bound to κ(n)𝒪~(n)\kappa(n)\in\widetilde{\mathcal{O}}(n), and hence this yields a quantum algorithm for minimum cut with time complexity 𝒪~(mnτ)\widetilde{\mathcal{O}}(\sqrt{mn\tau}).

Corollary 23.

Let GG be a weighted graph with nn vertices, mm edges, and edge-weight ratio τ\tau. There is a quantum algorithm to compute the weight and shores of a minimum cut of GG with time complexity 𝒪~(mnτ)\widetilde{\mathcal{O}}(\sqrt{mn\tau}) in the adjacency list model.

6.2 Randomized algorithm for edge connectivity

We can use our algorithm for finding the KT partition of a weighted graph to give a randomized algorithm that computes the edge connectivity of a simple graph GG with high probability in time 𝒪(m+nlog6n)\mathcal{O}(m+n\log^{6}n). For graphs that are not too sparse this equals the best known 𝒪(m+nlog2n)\mathcal{O}(m+n\log^{2}n) complexity of the random contraction based algorithm by Ghaffari, Nowicki and Thorup [GNT20].

Our new algorithm uses the key idea from Kawarabayashi and Thorup [KT19]: (i) find the KT partition of the graph and contract the components of the partition, and (ii) find a minimum cut in the contracted graph. By definition of the KT partition, this contraction will preserve the set of non-trivial minimum cuts, and so it suffices to find a minimum cut in the contracted graph and the minimum degree of a vertex. Moreover, the contracted graph has only 𝒪(n)\mathcal{O}(n) edges and so we can find a minimum cut in this graph quickly.

Our algorithm follows the same blueprint, except that in order to obtain 𝒪(m)\mathcal{O}(m) leading complexity we first find an ε\varepsilon-cut sparsifier FF of the input graph, for a small constant ε\varepsilon. For this step we can use the 𝒪(m)\mathcal{O}(m) sparsification algorithm from Fung, Hariharan, Harvey and Panigrahi [FHHP19, Theorem 1.22]. Provided that the sparsification step is successful, any minimum cut of the original simple graph GG will be a γ\gamma-near minimum cut of FF for γ=(1+ε)/(1ε)1+3ε\gamma=(1+\varepsilon)/(1-\varepsilon)\leq 1+3\varepsilon. Thus if we find a (1+3ε)(1+3\varepsilon)-KT partition of FF and contract the sets of the resulting partition in GG we obtain a multigraph GG^{\prime} which preserves all non-trivial minimum cuts of GG. In this way we only need to find the KT partition of FF, which has 𝒪(nlogn)\mathcal{O}(n\log n) edges rather than mm edges. On the other hand, the sparsifier FF will in general be weighted, and hence we cannot run the near-linear time algorithm from [KT19] to find its KT partition. This is a prime example where finding the KT partition of a weighted graph is very useful.

The next theorem fleshes out this algorithm. For this, we need the fact that for a simple graph there are only 𝒪(n)\mathcal{O}(n) inter-component edges in a KT partition. We use the version of this fact from [AL21], which gives an explicit constant in the bound.

Lemma 24 ([RSW18, Lemma 2.6],[AL21, Lemma 2]).

Let G=(V,E)G=(V,E) be a simple graph with |V|=n|V|=n. Let d=minuVdeg(u)d=\min_{u\in V}\deg(u). For a nonnegative ε<1\varepsilon<1, let 𝒯={X:|X|,|X¯|2 and |ΔG(X)|λ(G)+εd}\mathcal{T}=\{X:|X|,|\overline{X}|\geq 2\mbox{ and }|\Delta_{G}(X)|\leq\lambda(G)+\varepsilon d\} and let GG^{\prime} be the multigraph formed from GG by contracting the sets in 𝒯\bigwedge\mathcal{T}. Then

|E(G)|68n(1ε)2.|E(G^{\prime})|\leq\frac{68n}{(1-\varepsilon)^{2}}\enspace.
Algorithm 6 Randomized algorithm for edge connectivity

Input: Adjacency list access to a simple graph GG.
      Output: A minimum cut of GG.

1:Find a vertex vv with minimum degree dmind_{\min}. \triangleright 𝒪(m)\mathcal{O}(m) time.
2:Construct a 1/1001/100-cut sparsifier FF of GG with 𝒪(nlogn)\mathcal{O}(n\log n) edges. \triangleright 𝒪(m)\mathcal{O}(m) time by [FHHP19].
3:Find the (101/99)(101/99)-KT partition 𝒮={S1,,Sk}\mathcal{S}=\{S_{1},\dots,S_{k}\} of FF using Theorem 1. \triangleright 𝒪(nlog6n)\mathcal{O}(n\log^{6}n) time.
4:Contract the components S1,,SkS_{1},\dots,S_{k} and let GG^{\prime} be the resulting multigraph. If GG^{\prime} has at most 100n100n edges find a minimum cut CC of GG^{\prime}, otherwise abort. \triangleright Time 𝒪(m+nlog2n)\mathcal{O}(m+n\log^{2}n) using the minimum cut algorithm of [GMW20] from Lemma 4.
5:If dmin|C|d_{\min}\leq|C|, return the outgoing edges from vv. Otherwise, return CC.
Theorem 25.

Let GG be a simple graph with mm edges. There is a classical randomized algorithm that runs in time 𝒪(m+nlog6n)\mathcal{O}(m+n\log^{6}n) and with high probability outputs the edge connectivity of GG and a cut realizing this value.

Proof.

The algorithm is given in Algorithm 6. The time complexity of each step is given in the comments. Let us prove correctness.

The algorithm either outputs a trivial cut or a cut from a contraction GG^{\prime} of GG. As contraction cannot decrease the edge connectivity, if the edge connectivity of GG is realized by a trivial cut the algorithm will be correct. Let us now assume that the edge connectivity λ(G)\lambda(G) is realized by a non-trivial cut C=ΔG(S)C^{*}=\Delta_{G}(S). In step 2 we use the sparsification algorithm of Fung, Hariharan, Harvey and Panigrahi [FHHP19, Theorem 1.22] to find a 1/1001/100-cut sparsifier F=(V,EF,wF)F=(V,E_{F},w_{F}) of GG, which succeeds with high probability. Thus with high probability wF(ΔF(S))(1+1/100)λ(G)w_{F}(\Delta_{F}(S))\leq(1+1/100)\lambda(G). Also with high probability the weight of a minimum cut of FF is at least (11/100)λ(G)(1-1/100)\lambda(G), in which case ΔF(S)\Delta_{F}(S) will be a 101/99101/99-near minimum cut of FF. Hence with high probability the (101/99)(101/99)-KT partition of FF will be a refinement of {S,S¯}\{S,\bar{S}\}, and in the contraction GG^{\prime} it will hold that |ΔG(S)|=λ(G)|\Delta_{G^{\prime}}(S)|=\lambda(G), and so the edge connectivity of GG^{\prime} is λ(G)\lambda(G). Further, if FF is a valid 1/1001/100-cut sparsifier of GG then GG^{\prime} has at most 100n100n edges by Lemma 24, and so we can find a minimum cut CC of GG^{\prime} in time 𝒪(nlog2n)\mathcal{O}(n\log^{2}n) using the minimum cut algorithm of [GMW20] given in Lemma 4. Thus in this case with high probability CC will be a cut realizing the edge connectivity of GG and the algorithm is correct. ∎

7 Discussion

We find the (1+ε)(1+\varepsilon)-KT partition of a weighted graph in near-linear time for any 0ε1/160\leq\varepsilon\leq 1/16. The near-linear time deterministic algorithm of Kawarabayashi and Thorup [KT19] to find a KT partition of a simple graph differs from ours in an interesting way with respect to the parameters. Recall that we defined εnt(G)\mathcal{B}_{\varepsilon}^{nt}(G) to be the set of all bipartitions {S,S¯}\{S,\bar{S}\} of the vertex set corresponding to non-trivial cuts whose weight is at most (1+ε)λ(G)(1+\varepsilon)\lambda(G), and a (1+ε)(1+\varepsilon)-KT partition to be εnt\bigwedge\mathcal{B}_{\varepsilon}^{nt}. Kawarabayashi and Thorup consider the larger set of bipartitions 𝒦εnt(G)\mathcal{K}_{\varepsilon}^{nt}(G) corresponding to non-trivial cuts of weight at most λ(G)+εd\lambda(G)+\varepsilon d, where dd is the minimum degree of GG. When GG is simple they can compute 𝒦εnt(G)\bigwedge\mathcal{K}_{\varepsilon}^{nt}(G) for any ε<1\varepsilon<1 in near-linear time. Thus their result is stronger than ours with respect to the parameters in two ways: it allows any ε<1\varepsilon<1 and also lets ε\varepsilon multiply the minimum degree rather than λ(G)\lambda(G).

There is an inherent barrier to extending the 2-respecting cut framework we employ here to this parameter regime. The reason is that Karger’s tree packing lemma [Kar00, Lemma 2.3] only shows that a cut of weight <3λ(G)/2<3\lambda(G)/2 will 2-respect a positive fraction of the trees from a maximum tree packing. To handle cuts of weight at least 3λ(G)/23\lambda(G)/2 one would instead have to consider 3-respecting cuts, which seems to add a good deal of complexity. Thus while we have not tried to optimize the constant 1/161/16, there is a natural barrier to extending our methods to ε1/2\varepsilon\geq 1/2. Pushing to larger ε\varepsilon and allowing ε\varepsilon to multiply the minimum (weighted) degree rather than λ(G)\lambda(G) both seem to require new techniques, and we leave this as an open question.

References

  • [AL21] Simon Apers and Troy Lee. Quantum complexity of minimum cut. In Proceedings of the 36th Computational Complexity Conference (CCC ’21), pages 28:1–28:3. LIPIcs, 2021.
  • [Ben95] András A. Benczúr. A representation of cuts within 6/5 times the edge connectivity with applications. In Proceedings of 36th Annual Symposium on Foundations of Computer Science (FOCS ’95), pages 92–102. IEEE Computer Society, 1995.
  • [Ben97] András Benczúr. Cut structures and randomized algorithms in edge-connectivity problems. PhD thesis, MIT, 1997.
  • [BF00] Michael A. Bender and Martin Farach-Colton. The LCA problem revisited. In Proceedings of 4th Latin American Symposium on Theoretical Informatics (LATIN ’00), pages 88–94. Springer, 2000.
  • [BG08] András A. Benczúr and Michel X. Goemans. Deformable Polygon Representation and Near-Mincuts, pages 103–135. Springer Berlin Heidelberg, Berlin, Heidelberg, 2008.
  • [BGMP21] Arijit Bishnu, Arijit Ghosh, Gopinath Mishra, and Manaswi Paraashar. Query complexity of global minimum cut. In Proceedings of the 24th international conference on Approximation Algorithms for Combinatorial Optimization Problems (APPROX ’21), 2021.
  • [BLS20] Nalin Bhardwaj, Antonio Molina Lovett, and Bryce Sandlund. A simple algorithm for minimum cuts in near-linear time. In 17th Scandinavian Symposium and Workshops on Algorithm Theory (SWAT ’20). Schloss Dagstuhl–Leibniz-Zentrum für Informatik, 2020.
  • [DHHM06] Christoph Dürr, Mark Heiligman, Peter Høyer, and Mehdi Mhalla. Quantum query complexity of some graph problems. SIAM Journal on Computing, 35(6):1310–1328, 2006.
  • [DKL76] Efim A. Dinitz, Alexander V. Karzanov, and Michael V. Lomonosov. On the structure of the system of minimum edge cuts in a graph. Issledovaniya po Diskretnoi Optimizatsii (Studies in Discrete Optimization), pages 290–306, 1976. Appeared in Russian.
  • [FHHP19] Wai-Shing Fung, Ramesh Hariharan, Nicholas J. A. Harvey, and Debmalya Panigrahi. A general framework for graph sparsification. SIAM Journal on Computing, 48(4):1196–1223, 2019.
  • [Gab95] Harold N. Gabow. A matroid approach to finding edge connectivity and packing arborescences. Journal of Computer and System Sciences, 50(2):259–273, 1995.
  • [GH61] Ralph E. Gomory and Te C. Hu. Multi-terminal network flows. Journal of the Society for Industrial and Applied Mathematics, 9(4):551–570, 1961.
  • [GMW20] Paweł Gawrychowski, Shay Mozes, and Oren Weimann. Minimum cut in O(mlog2n)O(m\log^{2}n) time. In Proceedings of the 47th International Colloquium on Automata, Languages, and Programming (ICALP ’20), volume 168 of LIPIcs, pages 57:1–57:15. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2020.
  • [GNT20] Mohsen Ghaffari, Krzysztof Nowicki, and Mikkel Thorup. Faster algorithms for edge connectivity via random 2-out contractions. In Proceedings of the ACM-SIAM Symposium on Discrete Algorithms (SODA ’20), pages 1260–1279. SIAM, 2020.
  • [GSS11] Shayan Oveis Gharan, Amin Saberi, and Mohit Singh. A randomized rounding approach to the traveling salesman problem. In 52nd Annual IEEE Symposium on Foundations of Computer Science (FOCS ’11), pages 550–559. IEEE, 2011.
  • [HRW20] Monika Henzinger, Satish Rao, and Di Wang. Local flow partitioning for faster edge connectivity. SIAM Journal on Computing, 49(1):1–36, 2020.
  • [HT84] Dov Harel and Robert Endre Tarjan. Fast algorithms for finding nearest common ancestors. SIAM Journal on Computing, 13(2):338–355, 1984.
  • [Kar00] David R. Karger. Minimum cuts in near-linear time. Journal of the ACM, 47(1):46–76, 2000. Announced at STOC 1996.
  • [KKG21] Anna R. Karlin, Nathan Klein, and Shayan Oveis Gharan. A (slightly) improved approximation algorithm for metric TSP. In Proceedings of the 53rd Annual ACM-SIGACT Symposium on Theory of Computing (STOC ’21), pages 32–45, 2021.
  • [KP09] David R. Karger and Debmalya Panigrahi. A near-linear time algorithm for constructing a cactus representation of minimum cuts. In Proceedings of the 20th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA ’09), pages 246–255. SIAM, 2009.
  • [KT19] Ken-ichi Kawarabayashi and Mikkel Thorup. Deterministic edge connectivity in near-linear time. Journal of the ACM, 66(1):4:1–4:50, 2019. Announced at STOC 2015.
  • [Li21] Jason Li. Deterministic mincut in almost-linear time. In Proceedings of the 53rd Annual ACM Symposium on Theory of Computing (STOC ’21), pages 384–395. ACM, 2021.
  • [LST20] On-Hei S. Lo, Jens M. Schmidt, and Mikkel Thorup. Compact cactus representations of all non-trivial min-cuts. Discrete Applied Mathematics, 303:296–304, 2020.
  • [MN20] Sagnik Mukhopadhyay and Danupon Nanongkai. Weighted min-cut: sequential, cut-query, and streaming algorithms. In Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing (STOC ’20), pages 496–509. ACM, 2020.
  • [MSU97] Kurt Mehlhorn, R. Sundar, and Christian Uhrig. Maintaining dynamic sequences under equality tests in polylogarithmic time. Algorithmica, 17(2):183–198, 1997.
  • [NMN01] Jaroslav Nešetřil, Eva Milková, and Helena Nešetřilová. Otakar Borůvka on the minimum spanning tree problem: Translation of both the 1926 papers, comments, history. Discrete Mathematics, 233(1-3):3–36, 2001.
  • [RSW18] Aviad Rubinstein, Tselil Schramm, and S. Matthew Weinberg. Computing exact minimum cuts without knowing the graph. In Proceedings of the 9th Innovations in Theoretical Computer Science Conference (ITCS ’18), pages 39:1–39:16. LIPIcs, 2018.
  • [ST83] Daniel D. Sleator and Robert Endre Tarjan. A data structure for dynamic trees. Journal of Computer and System Sciences, 26(3):362–391, 1983.

Appendix A Data structures

We first show how to implement categorical top two queries on an array while allowing updates to add Δ\Delta to the scores in an interval. This can be accomplished using a well-known binary tree data structure. We will then port this construction to a tree TT by means of the heavy path decomposition of TT [ST83, HT84].

The key to the binary tree data structure is the following simple fact. For a node uu of a tree let int(u)\mathrm{int}(u) be the set of labels of leaves that are descendants of uu.

Fact 26.

Let nn be a power of 2 and TT a complete binary tree with nn leaves labeled by 1,,n1,\ldots,n. For any interval [i,j][i,j] there are 𝒪(logn)\mathcal{O}(\log n) many nodes u1,,utu_{1},\ldots,u_{t} such that [i,j]=int(u1)int(ut)[i,j]=\mathrm{int}(u_{1})\sqcup\cdots\sqcup\mathrm{int}(u_{t}). Moreover u1,,utu_{1},\ldots,u_{t} can be found in 𝒪(logn)\mathcal{O}(\log n) time, and the total number of ancestors of u1,,utu_{1},\ldots,u_{t} is 𝒪(logn)\mathcal{O}(\log n).
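Fact 26 is the canonical-decomposition property underlying segment trees. A minimal sketch (our own illustration, not from the paper, using heap-style node numbering: root 1, children of v are 2v and 2v+1, leaf k is node n+k):

```python
def canonical_nodes(n, i, j):
    """Return O(log n) nodes of a complete binary tree with n leaves
    (n a power of 2) whose leaf intervals partition [i, j] (0-based,
    inclusive), listed left to right."""
    left, right = [], []
    lo, hi = i + n, j + n + 1          # half-open range of leaf numbers
    while lo < hi:
        if lo & 1:                     # lo is a right child: take it whole
            left.append(lo)
            lo += 1
        if hi & 1:                     # hi - 1 is a left child: take it whole
            hi -= 1
            right.append(hi)
        lo //= 2
        hi //= 2
    return left + right[::-1]

def leaf_interval(n, v):
    """The interval int(v) of leaf labels below node v."""
    lo = hi = v
    while lo < n:                      # descend to the extreme leaves
        lo, hi = 2 * lo, 2 * hi + 1
    return lo - n, hi - n
```

For n = 8 and [i, j] = [1, 6] this yields four nodes whose intervals are [1,1], [2,3], [4,5], [6,6]; each level of the loop contributes at most two nodes, giving the 𝒪(log n) bound of Fact 26.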

See Lemma 6.

Proof.

By padding the array with scores of infinity and an arbitrary color we may assume that nn is a power of 22. The data structure will be a complete binary tree BB with nn leaves labeled 1,,n\ell_{1},\ldots,\ell_{n}. Each leaf stores a 3-tuple (score,index,color)(\mathrm{score},\mathrm{index},\mathrm{color}), and at leaf ii this 3-tuple is initialized to (A[i].score,i,A[i].color)(A[i].\mathrm{score},i,A[i].\mathrm{color}). Every internal node uu will store a pair of such 3-tuples. The data structure will maintain the invariant (Invariant 1) that at every internal node uu this pair of 3-tuples is the answer to the categorical top two query for the interval int(u)\mathrm{int}(u). The answer to a categorical top two query for the interval int(u)\mathrm{int}(u) can be computed in constant time from the answers to this query at the children of uu. Thus in time 𝒪(n)\mathcal{O}(n) we can propagate the answers to the categorical top two queries from the leaves to the root so that Invariant 1 holds.

Each node uu will also store an update\mathrm{update} value u.updateu.\mathrm{update}. We initialize the leaves to have i.update=A[i].score\ell_{i}.\mathrm{update}=A[i].\mathrm{score} and set the update value of all internal nodes of the tree to zero. Thus we have the property (Invariant 2) that the sum of the update values on the path from i\ell_{i} to the root is A[i].scoreA[i].\mathrm{score}, which will be maintained under the Add(Δ,i,j)\textsc{Add}(\Delta,i,j) updates. This completes the pre-processing step and the total pre-processing time is 𝒪(n)\mathcal{O}(n).

We now show that after an update we can adjust the tree to maintain Invariant 1 and Invariant 2 in 𝒪(logn)\mathcal{O}(\log n) time. If the invariants hold, then we can answer a categorical top two query for the interval [i,j][i,j] in time 𝒪(logn)\mathcal{O}(\log n). This is done by first using Fact 26 to find in 𝒪(logn)\mathcal{O}(\log n) time nodes u1,,utu_{1},\ldots,u_{t} such that int(u1),,int(ut)\mathrm{int}(u_{1}),\ldots,\mathrm{int}(u_{t}) form a partition of [i,j][i,j]. Then by building a binary tree on top of u1,,utu_{1},\ldots,u_{t} and propagating the categorical top two query answers up this tree we can answer the categorical top two query for [i,j][i,j] in time 𝒪(logn)\mathcal{O}(\log n).

To restore the invariants after Add(Δ,i,j)\textsc{Add}(\Delta,i,j), we use Fact 26 to find in 𝒪(logn)\mathcal{O}(\log n) time nodes u1,,utu_{1},\ldots,u_{t} such that int(u1),,int(ut)\mathrm{int}(u_{1}),\ldots,\mathrm{int}(u_{t}) form a partition of [i,j][i,j]. Then for each i=1,,ti=1,\ldots,t we set ui.updateui.update+Δu_{i}.\mathrm{update}\leftarrow u_{i}.\mathrm{update}+\Delta. This restores Invariant 2 under the update. To restore Invariant 1, we recompute the answers to the categorical top two query at all ancestors of u1,,utu_{1},\ldots,u_{t}. By Fact 26 there are only 𝒪(logn)\mathcal{O}(\log n) many such ancestors, thus we can perform this computation in 𝒪(logn)\mathcal{O}(\log n) time as well. ∎
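The proof above can be made concrete with a lazy segment tree. The sketch below is our own Python illustration (not the paper's code); consistent with the infinite-score padding, "top two" here means the smallest score together with the smallest score of a different color. Since a range add shifts all scores in a node's interval equally, it preserves the identity of the stored answers, which is what makes the lazy updates sound.

```python
import math

class CatTopTwoTree:
    """Maintains an array of (score, color) entries under Add(delta, i, j)
    (add delta to all scores in A[i..j]) and CatTopTwo(i, j) (the smallest
    score in A[i..j] plus the smallest score of a different color), both in
    O(log n) time.  Indices are 0-based and inclusive."""

    def __init__(self, items):
        self.size = 1
        while self.size < len(items):
            self.size *= 2
        # pad to a power of two with +infinity scores, as in the proof
        padded = list(items) + [(math.inf, None)] * (self.size - len(items))
        self.best = [None] * (2 * self.size)  # up to two (score, idx, color)
        self.lazy = [0] * (2 * self.size)     # pending adds below this node
        for k, (s, c) in enumerate(padded):
            self.best[self.size + k] = [(s, k, c)]
        for v in range(self.size - 1, 0, -1):
            self.best[v] = self._merge(self.best[2 * v], self.best[2 * v + 1])

    @staticmethod
    def _merge(a, b):
        cand = sorted(a + b)                  # ascending by score
        if not cand:
            return []
        out = [cand[0]]
        for t in cand[1:]:
            if t[2] != out[0][2]:             # first entry of another color
                out.append(t)
                break
        return out

    def _apply(self, v, delta):               # add delta to v's whole range
        self.best[v] = [(s + delta, i, c) for (s, i, c) in self.best[v]]
        self.lazy[v] += delta

    def _push(self, v):                       # push pending adds to children
        if self.lazy[v]:
            self._apply(2 * v, self.lazy[v])
            self._apply(2 * v + 1, self.lazy[v])
            self.lazy[v] = 0

    def add(self, delta, i, j, v=1, lo=0, hi=None):
        hi = self.size - 1 if hi is None else hi
        if j < lo or hi < i:
            return
        if i <= lo and hi <= j:
            self._apply(v, delta)
            return
        self._push(v)
        mid = (lo + hi) // 2
        self.add(delta, i, j, 2 * v, lo, mid)
        self.add(delta, i, j, 2 * v + 1, mid + 1, hi)
        self.best[v] = self._merge(self.best[2 * v], self.best[2 * v + 1])

    def cat_top_two(self, i, j, v=1, lo=0, hi=None):
        hi = self.size - 1 if hi is None else hi
        if j < lo or hi < i:
            return []
        if i <= lo and hi <= j:
            return list(self.best[v])
        self._push(v)
        mid = (lo + hi) // 2
        return self._merge(self.cat_top_two(i, j, 2 * v, lo, mid),
                           self.cat_top_two(i, j, 2 * v + 1, mid + 1, hi))
```

The merge at an internal node keeps only two entries, which suffices: the overall minimum over a union of intervals is one child's minimum, and the best entry of a different color is among the (at most four) entries stored at the two children.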

In order to extend this structure to a general tree TT, we first construct its heavy path decomposition. Next, we concatenate the heavy paths to form a list of all edges of TT with the property that any subtree TeT_{e} is described by a contiguous range of edges (but potentially containing many heavy paths). This is done recursively as follows. Let the topmost heavy path be h=u1u2ukh=u_{1}-u_{2}-\ldots-u_{k}. We first write down its edges (u1,u2),(u2,u3),,(uk1,uk)(u_{1},u_{2}),(u_{2},u_{3}),\ldots,(u_{k-1},u_{k}). Then, we remove them from the tree. We recurse on the trees consisting of more than one node rooted at uk,uk1,,u1u_{k},u_{k-1},\ldots,u_{1} (note that the tree rooted at uku_{k} always consists of a single node), in this order. This guarantees that, for any ehe\in h, TeT_{e} indeed consists of a contiguous range of edges, while for other edges this is guaranteed recursively.
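This recursive ordering is exactly what a DFS that always descends into the heavy (largest) child first produces. A small self-contained sketch (our own illustration; the tree is given by child lists of a rooted tree):

```python
def heavy_first_edge_order(children, root=0):
    """Concatenate the heavy paths of a rooted tree into one edge list so
    that (a) the edges of every heavy path are consecutive, and (b) the
    edges of every subtree occupy a contiguous range.  Returns (edges, span)
    where span[v] = (lo, hi) is the inclusive range of edge indices lying in
    the subtree rooted at v (lo > hi for a leaf)."""
    n = len(children)
    size = [1] * n
    order, stack = [], [root]
    while stack:                       # top-down order for size computation
        v = stack.pop()
        order.append(v)
        stack.extend(children[v])
    for v in reversed(order):          # subtree sizes, bottom-up
        for c in children[v]:
            size[v] += size[c]
    edges, span = [], [None] * n
    def dfs(v):
        lo = len(edges)
        for c in sorted(children[v], key=lambda c: -size[c]):
            edges.append((v, c))       # heavy child first
            dfs(c)
        span[v] = (lo, len(edges) - 1)
    dfs(root)
    return edges, span
```

On the tree with child lists `[[1, 2], [3, 4], [], [], [5], []]` the heavy path 0–1–4–5 occupies the first three positions of the order, and the subtree of vertex 1 occupies the contiguous range [1, 3], matching the guarantee above.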

See Lemma 7.

Proof.

Consider a heavy path decomposition of TT, and construct the edge array A[1..(n1)]A[1..(n-1)] by concatenating the heavy paths as described above. We will use the data structure from Lemma 6 on A[1..(n1)]A[1..(n-1)]. Any path pp can be decomposed into 𝒪(logn)\mathcal{O}(\log n) infixes of heavy paths (in fact, at most one proper infix and a number of prefixes), and hence it corresponds to 𝒪(logn)\mathcal{O}(\log n) contiguous ranges of A[1..(n1)]A[1..(n-1)]. Hence we implement the first operation by making 𝒪(logn)\mathcal{O}(\log n) calls to Add(Δ,i,j)\textsc{Add}(\Delta,i,j); by Lemma 6 this takes 𝒪(log2n)\mathcal{O}(\log^{2}n) time in total. Finally, since TeT_{e} is described by a single contiguous range A[i..j]A[i..j], a categorical top-two query in TeT_{e} corresponds to the operation CatTopTwo(i,j)\textsc{CatTopTwo}(i,j), which takes time 𝒪(logn)\mathcal{O}(\log n). ∎
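The decomposition of a path into 𝒪(log n) heavy-path infixes can be sketched as follows (our own illustration, not the paper's code; `head[x]` denotes the top of x's heavy path and `pos[x]` the position of the edge (parent[x], x) in the concatenated order, so positions along a heavy path are consecutive):

```python
def build_hld(children, root=0):
    """Heavy path decomposition of a rooted tree (child lists).  Returns
    (parent, depth, head, pos): head[v] is the top vertex of v's heavy path
    and pos[v] the index of edge (parent[v], v) in the concatenated edge
    order, in which every heavy path occupies consecutive positions."""
    n = len(children)
    parent, depth, size = [-1] * n, [0] * n, [1] * n
    order, stack = [], [root]
    while stack:
        v = stack.pop()
        order.append(v)
        for c in children[v]:
            parent[c], depth[c] = v, depth[v] + 1
            stack.append(c)
    for v in reversed(order):          # subtree sizes, bottom-up
        if parent[v] != -1:
            size[parent[v]] += size[v]
    head, pos, nxt = list(range(n)), [-1] * n, 0
    stack = [root]
    while stack:                       # preorder, heavy child popped first
        v = stack.pop()
        if parent[v] != -1:
            pos[v] = nxt
            nxt += 1
        if children[v]:
            hc = max(children[v], key=lambda c: size[c])
            for c in children[v]:
                if c != hc:
                    stack.append(c)
            stack.append(hc)           # heavy child on top of the stack
            head[hc] = head[v]
    return parent, depth, head, pos

def path_ranges(parent, depth, head, pos, u, v):
    """Positions of the edges on the tree path u..v, reported as O(log n)
    contiguous ranges, each an infix of a heavy path."""
    ranges = []
    while head[u] != head[v]:
        if depth[head[u]] < depth[head[v]]:
            u, v = v, u                # jump up from the deeper heavy path
        ranges.append((pos[head[u]], pos[u]))
        u = parent[head[u]]
    if u != v:                         # final infix on a common heavy path
        if depth[u] > depth[v]:
            u, v = v, u
        ranges.append((pos[v] - (depth[v] - depth[u]) + 1, pos[v]))
    return ranges
```

Each iteration of the loop moves to a strictly higher heavy path, and any root-to-vertex path meets 𝒪(log n) heavy paths, which bounds the number of ranges.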