This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Folklore Sampling is Optimal for Exact Hopsets: Confirming the n\sqrt{n} Barrier111This work was supported by NSF:AF 2153680.

Greg Bodwin Gary Hoppenworth University of Michigan University of Michigan bodwin@umich.edu garytho@umich.edu
Abstract

For a graph GG, a DD-diameter-reducing exact hopset is a small set of additional edges HH that, when added to GG, maintains its graph metric but guarantees that all node pairs have a shortest path in GHG\cup H using at most DD edges. A shortcut set is the analogous concept for reachability rather than distances. These objects have been studied since the early ’90s, due to applications in parallel, distributed, dynamic, and streaming graph algorithms.

For most of their history, the state-of-the-art construction for either object was a simple folklore algorithm, based on randomly sampling nodes to hit long paths in the graph. However, recent breakthroughs of Kogan and Parter [SODA ’22] and Bernstein and Wein [SODA ’23] have finally improved over the folklore algorithm for shortcut sets and for (1+ε)(1+\varepsilon)-approximate hopsets. For either object, it is now known that one can use O(n)O(n) hop-edges to reduce diameter to O~(n1/3)\widetilde{O}(n^{1/3}), improving over the folklore diameter bound of O~(n1/2)\widetilde{O}(n^{1/2}). The only setting in which folklore sampling remains unimproved is for exact hopsets. Can these improvements be continued?

We settle this question negatively by constructing graphs on which any exact hopset of O(n)O(n) edges has diameter Ω~(n1/2)\widetilde{\Omega}(n^{1/2}). This improves on the previous lower bound of Ω(n1/3)\Omega(n^{1/3}) by Kogan and Parter [FOCS ’22]. Using similar ideas, we also polynomially improve the current lower bounds for shortcut sets, constructing graphs on which any shortcut set of O(n)O(n) edges reduces diameter to Ω~(n1/4)\widetilde{\Omega}(n^{1/4}). This improves on the previous lower bound of Ω(n1/6)\Omega(n^{1/6}) by Huang and Pettie [SIAM J. Disc. Math. ’18]. We also extend our constructions to provide lower bounds against O(p)O(p)-size exact hopsets and shortcut sets for other values of pp; in particular, we show that folklore sampling is near-optimal for exact hopsets in the entire range of parameters p[1,n2]p\in[1,n^{2}].

1 Introduction

In graph algorithms, many basic problems ask to compute information about the shortest path distances or reachability relation among node pairs in an input graph. In parallel, distributed, dynamic, or streaming settings, algorithm complexity often scales with the diameter of the graph, i.e., the smallest integer dd such that every connected node pair has a path of at most dd edges. Therefore, a popular strategy to optimize these algorithms is to add a few edges to the input graph in preprocessing, with the goal to reduce diameter while leaving the relevant distance or reachability information unchanged. In the context of reachability, this set of additional edges is called a shortcut set.

Definition 1 (Shortcut Sets).

For a directed graph GG, a DD-diameter reducing shortcut set is a set of additional edges HH such that:

  • Every edge (u,v)H(u,v)\in H is in the transitive closure of GG; that is, there exists a uvu\leadsto v path in GG.

  • For every pair of nodes (s,t)(s,t) in the transitive closure of GG, there exists an sts\leadsto t path in GHG\cup H using at most DD edges.

Shortcut sets were introduced by Thorup [33], after they were used implicitly in prior work. Many algorithmic applications of shortcut sets and their relatives were discovered in the following years [34, 26, 19, 18, 20, 16, 24, 15, 17, 6, 10, 25, 2], but actual constructions of shortcut sets were elusive. For most of their history, essentially the only known construction was the following simple algorithm: randomly sample a set SS of |S|=n1/2|S|=n^{1/2} vertices, and add a shortcut edge between each pair of sampled nodes that lie in the transitive closure of the input graph. To argue correctness: for any nodes s,ts,t in the graph where the shortest path π(s,t)\pi(s,t) has length Ω~(n1/2)\gg\widetilde{\Omega}(n^{1/2}), with high probability we sample nodes u,vu,v in SS that respectively hit a prefix and suffix of π(s,t)\pi(s,t) of length O~(n1/2)\widetilde{O}(n^{1/2}). Using the added shortcut edge (u,v)(u,v), we obtain an sts\leadsto t path of length O~(n1/2)\widetilde{O}(n^{1/2}). This analysis gives:

Theorem 1 (Folklore, [34]).

Every nn-node graph has a O~(n1/2)\widetilde{O}(n^{1/2})-diameter-reducing shortcut set on O(n)O(n) edges.

Remarkably, despite its simplicity, the diameter bound of O~(n1/2)\widetilde{O}(n^{1/2}) achieved by the folklore sampling algorithm remained nearly unimproved for 30 years (log factors were removed in [5], improving the diameter bound to O(n1/2)O(n^{1/2})). This led researchers to wonder if the bound could be improved in the exponent at all. This was finally answered affirmatively in a recent breakthrough of Kogan and Parter [29]:

Theorem 2 [29] ).

The folklore algorithm is polynomially suboptimal for shortcut sets. In particular, every nn-node graph has a O~(n1/3)\widetilde{O}(n^{1/3})-diameter-reducing shortcut set on O(n)O(n) edges.

Kogan and Parter proved this theorem via an elegant construction based on sampling vertices and sampling from a set of carefully-chosen paths from the input graph. Following this, there are two clear avenues for further progress. First, the new diameter bound of O~(n1/3)\widetilde{O}(n^{1/3}) is still not necessarily tight. It was still conceivable to improve diameter as far as O(n1/6)O(n^{1/6}), at which point we encounter a lower bound construction of Huang and Pettie [22] (improving on a classic construction of Hesse [21]). Second, many algorithms aim to compute exact or approximate shortest paths of an input graph, rather than any path as in the case of shortcut sets/reachability. These algorithms benefit from shortcut-set-like structures that more strongly reduce the number of edges along (near-)shortest paths in the input graph. Such a structure is called a hopset:

Definition 2 (Hopsets).

For a graph GG and ε0\varepsilon\geq 0, a DD-diameter reducing (1+ε)(1+\varepsilon) hopset is a set of additional edges HH such that:

  • Every edge (u,v)H(u,v)\in H has weight w(u,v):=distG(u,v)w(u,v):=\text{dist}_{G}(u,v).

  • For every pair of nodes (s,t)(s,t) in the transitive closure of GG, there exists an sts\leadsto t path π(s,t)\pi(s,t) in GHG\cup H that uses at most DD edges, and which satisfies w(π(s,t))(1+ε)distG(s,t)w(\pi(s,t))\leq(1+\varepsilon)\cdot\text{dist}_{G}(s,t).

When ε=0\varepsilon=0, the path π(s,t)\pi(s,t) is required to be an exact shortest path in GHG\cup H, so we call HH an exact hopset.

A nice feature of the folklore sampling algorithm is that it extends immediately to hopsets with no real changes. This yields:

Theorem 3 (Folklore).

Every nn-node graph has a O~(n1/2)\widetilde{O}(n^{1/2})-diameter-reducing (exact or (1+ε)(1+\varepsilon)) hopset on O(n)O(n) edges.

Thus, the hunt is back on for a hopset construction algorithm that beats folklore sampling. Kogan and Parter partially achieved this goal: they extended their shortcut set construction to also show a new diameter bound of O~(n2/5)\widetilde{O}(n^{2/5}) for (1+ε)(1+\varepsilon) hopsets [29]. Bernstein and Wein [7] then developed a clever extension of the Kogan-Parter construction, further improving the bound for (1+ε)(1+\varepsilon) hopsets to match the one achieved for shortcut sets:

Theorem 4 ([29, 7] ).

The folklore algorithm is polynomially suboptimal for (1+ε)(1+\varepsilon) hopsets. In particular, for all fixed ε>0\varepsilon>0, every (possibly directed and weighted) nn-node graph has a O~(n1/3)\widetilde{O}(n^{1/3})-diameter-reducing-shortcut set on O(n)O(n) edges.

Still, both of these improvements required ε>0\varepsilon>0, and so neither extended to exact hopsets, which still remained as the last holdout where the folklore algorithm had not been improved. The only progress for exact hopsets came on the lower bounds side, where a separate work of Kogan and Parter [29] showed a diameter lower bound of Ω(n1/3)\Omega(n^{1/3}) (see also [9]). Was it possible to translate the recent progress on shortcut sets and (1+ε)(1+\varepsilon) hopsets to exact hopsets, and finally move past folklore?

1.1 Our Results

folklore[7][29][28][22][22]upper boundsnew lower boundsold lower boundsExact Hopsets(1+ε)(1+\varepsilon) HopsetsShortcut SetsO(n)O(n)-size Edge Set vs. Weighted Input GraphDiametern1/2n^{1/2}n1/3n^{1/3}n1/6n^{1/6}
Figure 1: State-of-the-art bounds for O(n)O(n)-size hopsets and shortcut sets, before and after this paper

Our main results are polynomial improvements on the lower bounds for O(n)O(n)-size shortcut sets and hopsets, which we depict in Figure 1. These lower bounds confirm that the folklore algorithm for exact hopsets is essentially the right one, showing that its diameter bound is optimal up to logn\log n factors:

Theorem 5 (New).

The folklore algorithm is near-optimal for exact hopsets. In particular, there are nn-node graphs on which any exact hopset on O(n)O(n) edges reduces diameter to Ω~(n1/2)\widetilde{\Omega}(n^{1/2}).

This provides a strong separation between exact and (1+ε)(1+\varepsilon) hopsets. Our lower bound holds for both directed and undirected input graphs, but it critically uses edge weights and thus does not extend to unweighted graphs as well. Our lower bound in the body of the paper (Theorem 7) is actually a bit more general than the one stated in Theorem 5 above. Although it is popular to focus on hopsets of size O(n)O(n), one can also ask about hopsets of size O(p)O(p) for any parameter p[1,n2]p\in[1,n^{2}]. The folklore sampling algorithm extends to any pp, by adjusting the size of the sampled vertex set to p1/2p^{1/2}. Our generalized lower bound establishes that the diameter bound from folklore sampling is near-optimal in the entire range of parameters.

We now turn to shortcut sets. Our new lower bound is the following:

Theorem 6 (New).

There are nn-node directed input graphs on which any shortcut set on O(n)O(n) edges reduces diameter to Ω~(n1/4)\widetilde{\Omega}(n^{1/4}).

This improves over the previous lower bound of Ω(n1/6)\Omega(n^{1/6}) by Huang and Pettie [22], but a polynomial gap to the upper bound of O~(n1/3)\widetilde{O}(n^{1/3}) still remains [29]. It is an interesting open question is to narrow this gap further. We note that every (1+ε)(1+\varepsilon) hopset is also a shortcut set, and so this lower bound extends to (1+ε)(1+\varepsilon) hopsets as well.222Note that, since the shortcut set lower bound is only for directed graphs, the lower bound only extends to (1+ε)(1+\varepsilon) hopsets for directed input graphs. For (1+ε)(1+\varepsilon) hopsets in undirected (but possibly weighted) input graphs, Elkin, Gitlitz, and Neiman proved that much better diameter can be achieved [13]. In the body of the paper (Theorem 8) we prove a more general theorem giving improved lower bounds against O(p)O(p) size hopsets, but this time the parameter range in which our extended theorem is nontrivial is only p[1,n5/4]p\in[1,n^{5/4}].

1.2 Other Related Work and Open Questions

This work is only concerned with the existential bounds that can be achieved for shortcut sets and hopsets. Some prior work in the area has also focused on constructions that are efficient in the appropriate computational model. This was perhaps most famously achieved by Fineman [15], whose breakthrough algorithm for parallel reachability was centered around a new shortcut set construction. His construction reduced diameter to O~(n2/3)\widetilde{O}(n^{2/3}) using O(n)O(n) edges; this is a worse diameter bound than the one achieved by folklore sampling, but crucially its work-efficiency was much better than folklore. This was later improved by Jambulapati, Liu, and Sidford [24], who achieved roughly the diameter bound from folklore sampling with work-efficiency comparable to Fineman [15]. Relatedly, another work of Kogan and Parter [27] gave a construction improving the (centralized) construction time of their shortcut set algorithm.

This work focuses on hopsets for weighted graphs, but hopsets can be studied for unweighted graphs as well. Specifically for (1+ε)(1+\varepsilon) hopsets in undirected unweighted graphs, it is known that far better diameter bounds are achievable. In particular, following preliminary constructions in [26, 32, 11], constructions of Huang and Pettie [23] and Elkin and Neiman [14] showed that one can reduce diameter to no(1)n^{o(1)} using O(n)O(n) hop edges. These papers actually provide a more fine-grained tradeoff between hopset size, ε\varepsilon, and diameter bound, which is shown to be essentially tight in [1]. Hopsets for undirected unweighted graphs with larger stretch values were studied in [4], and a unification of the various hopset constructions in this setting was developed in [31].

This work focuses on shortcut/hopset bounds in the setting where the edge budget is O(n)O(n), or more generally some function of nn. These objects are also sometimes studied in a related setting where the edge budget is O(m)O(m), where mm is the number of edges in the input graph. Lower bounds in this setting were achieved in [21, 22, 30]; most recently, Lu, Williams, Wein, and Xu showed graphs where any O(m)O(m)-size shortcut set reduces diameter to Ω(n1/8)\Omega(n^{1/8}). On the upper bounds side, it is easy to get upper bounds as functions of both mm and nn – for example, folklore sampling with |S|=m1/2|S|=m^{1/2} sampled nodes yields a diameter bound of O~(n/m1/2)\widetilde{O}(n/m^{1/2}), and the construction by Kogan and Parter [29] implies a bound of O~(n2/3/m1/3)\widetilde{O}(n^{2/3}/m^{1/3}) for O(m)O(m)-size shortcut sets. However, it is an interesting open problem to construct O(m)O(m)-size shortcut/hopsets with nontrivial diameter upper bounds that depend only on nn. By “nontrivial,” we mean that one can always assume without loss of generality that the input graph is connected, and thus m=Ω(n)m=\Omega(n), and so a construction of O(n)O(n)-edge shortcut/hopsets is always valid in the O(m)O(m)-setting. A nontrivial construction is one that beats that bound.

Open Question 1.

Prove that, for any mm, every nn-node, mm-edge directed graph has an O(m)O(m)-edge shortcut/hopset that reduces diameter to O(nc)O(n^{c}), where cc is a constant strictly less than the one that can currently be achieved for O(n)O(n)-edge shortcut/hopsets.

Finally, we remark that the setting of exact hopsets in unweighted graphs seems to be unexplored. Although unpublished to our knowledge, one can obtain a lower bound by applying a standard analysis in [22, 28, 9] to the nn-node undirected unweighted distance preserver lower bound graphs constructed by Coppersmith and Elkin [12]. This would imply that any O(n)O(n)-size exact hopset on the Coppersmith-Elkin graphs would reduce diameter to Ω(n1/5)\Omega(n^{1/5}). Our new hopset lower bound can also be interpreted as an improved lower bound for this setting:333This corollary is not immediate from the discussion so far: since our shortcut set lower bound is directed, it is not clear that it would imply a lower bound against undirected unweighted exact hopsets. This holds specifically because our shortcut set lower bound construction happens to be layered.

Corollary 1.

There are nn-node undirected unweighted input graphs on which any exact hopset on O(n)O(n) edges reduces diameter to Ω~(n1/4)\widetilde{\Omega}(n^{1/4}).

But on the upper bounds side, folklore sampling remains the best known algorithm, and it only reduces diameter to O~(n1/2)\widetilde{O}(n^{1/2}). It would be interesting to narrow this gap, and in particular to confirm or refute whether folklore sampling is near-optimal.

Open Question 2.

Is the folklore algorithm near-optimal for exact hopsets in unweighted graphs? Or, alternately, does every nn-node unweighted graph have an O(n)O(n)-size exact hopset that reduces diameter to O(n1/2c)O(n^{1/2-c}), for some c>0c>0?

2 Technical Overview

2.1 Recap of Prior Work

In order to explain the strategy used for our improved lower bounds, it will be helpful to first recall the construction of Huang and Pettie [22] for shortcut set lower bounds, and the construction of Kogan and Parter [28] for hopset lower bounds. The Huang-Pettie shortcut set lower bound is a construction of a directed graph GG and a set of paths Π\Pi with the following properties:

  1. 1.

    Each path in Π\Pi is the unique path in GG between its endpoints

  2. 2.

    The paths in Π\Pi are pairwise edge-disjoint

  3. 3.

    There are |Π|=cn|\Pi|=cn paths, where cc is a constant that can be selected as large as we want

  4. 4.

    Subject to the above constraints, we want to make the paths in Π\Pi as long as possible. In particular, in [22], every path in Π\Pi has the same length =Θ(n1/6)\ell=\Theta(n^{1/6}).

Let us see why these properties imply a lower bound. Each time we add an edge (u,v)(u,v) to our hopset HH, by uniqueness and edge-disjointness of paths in Π\Pi, there can be at most one path πΠ\pi\in\Pi where the distance between its endpoints decreases due to (u,v)(u,v). Thus, if we build a shortcut set of size |H|=|Π|1|H|=|\Pi|-1, then for at least one path πΠ\pi\in\Pi the distance between its endpoints is the same in GG as in GHG\cup H. Thus, the final diameter of the graph is at least =Θ(n1/6)\ell=\Theta(n^{1/6}), giving the lower bound.

The Kogan-Parter exact hopset lower bound is similar, except that each path πΠ\pi\in\Pi is only required to be a unique shortest path between its endpoints in the weighted graph GG. This is a more relaxed constraint than requiring π\pi to be the unique path of any kind, and this additional freedom in the construction lets us improve the path lengths to =Θ(n1/3)\ell=\Theta(n^{1/3}). Besides that, the argument is identical.

2.2 Allowing Paths to Overlap

The change in our construction is a relaxation of item (2); that is, the paths in our constructions are not pairwise edge-disjoint. This has appeared in prior work only in a rather weak form: all of the lower bounds against O(m)O(m)-size shortcut sets use paths that may intersect pairwise on a single edge [21, 22, 30]. These constructions begin with a system of paths as above, and then apply a tool called the alternation product which introduces path overlap. However, the alternation product is not a useful tool for our purposes, and it does not appear in this paper at all. The alternation product harms the path lengths of the construction (relative to its number of nodes), in exchange for also reducing the number of edges mm relative to the number of paths in the construction. This is useful for constructing lower bounds against O(m)O(m)-size shortcut/hopsets, but is not helpful for our goal of constructing lower bounds against O(n)O(n)-size shortcut/hopsets.

In our construction, paths that may intersect pairwise on polynomially many edges. This property arises from an entirely different technique, and for an entirely different reason: our goal is to use this overlap to get improved path lengths. Let us first observe why we can tolerate some path overlap while maintaining correctness of the lower bound. Suppose our shortcut set has a budget of cncn edges, and we construct a graph GG and a set of |Π|=2cn|\Pi|=2cn paths, where each path has length \ell and each path is the unique path between its endpoints. Let PP be the set of node pairs that are the endpoints of paths in Π\Pi, and consider the following potential function over shortcut sets HH, which simply sums distances over critical pairs:

ϕ(H):=(s,t)PdistGH(s,t)\phi\left(H\right):=\sum\limits_{(s,t)\in P}\text{dist}_{G\cup H}(s,t)

Initially, we have ϕ()=|Π|=2cn\phi(\emptyset)=|\Pi|\cdot\ell=2cn\ell. Then we add edges to our shortcut set one at a time, gradually reducing ϕ\phi. How much could any given shortcut edge (u,v)H(u,v)\in H reduce ϕ\phi? Clearly it could reduce by 1\ell-1, in the case where (u,v)(u,v) are a pair in PP, since this reduces dist(u,v)\text{dist}(u,v) from \ell to 11. This is acceptable: if all edges reduce ϕ\phi by at most 1\ell-1, then the final potential would be

ϕ(H)ϕ(0)(1)cn=2cn(1)cn=(+1)cn.\phi(H)\geq\phi(0)-(\ell-1)cn=2\ell cn-(\ell-1)cn=(\ell+1)cn.

Thus, over the |P|=2cn|P|=2cn critical pairs, the average distance in GHG\cup H is Θ()\Theta(\ell), and so the lower bound still holds.

So the only overlap constraint we need to enforce in our lower bound is that no shortcut edge can reduce the potential ϕ\phi by more than \ell. This is a much more forgiving constraint than edge-disjoint paths. For example, for two internal nodes u,vu,v with distG(u,v)=/2\text{dist}_{G}(u,v)=\ell/2, we could allow two different paths to coincide on a uvu\leadsto v subpath: adding (u,v)(u,v) to the shortcut set would then reduce ϕ\phi by only 2(/21)=22\cdot(\ell/2-1)=\ell-2. In general, for two nodes at distance /x\ell/x, we can safely allow xx paths to coincide on the subpath between these nodes while maintaining correctness of the lower bound.

2.3 Constructing Overlapping Paths

The previous part explains why we are allowed overlapping paths, but it is still not clear how to leverage this freedom into an improved lower bound construction. This is where our technical contribution lies. It is again a bit easier to explain the new idea in the context of shortcut sets, but the intuition is essentially the same in the context of hopsets.

Let us return to the previous lower bound constructions. For the shortcut lower bounds of [22], one constructs an (+1)(\ell+1)-layered directed graph where the nodes in each layer are a copy of a grid within 2\mathbb{Z}^{2}. The next step is to construct a set of convex vectors CC. A key perspective shift in this paper is that we think of the vectors in CC as playing two independent roles in this previous construction:

  • They play the role of edge vectors: we include an edge from a node uu in layer ii to a node vv in layer i+1i+1 iff the difference between the grid points u,vu,v is a vector vu=cCv-u=\vec{c}\in C.

  • They also play the role of (objective) direction vectors. The paths πΠ\pi\in\Pi are indexed by a node uu in the first layer and a vector cC\vec{c}\in C, and we generate πu,c\pi_{u,\vec{c}} by using uu as its first node, and then iteratively selecting its node in the next layer by adding c\vec{c} to the node in the previous layer. (A technical detail is that only choices of (u,c)(u,\vec{c}) are allowed that reach the last layer without the path falling off the side of the grid.) Notice the argument for path uniqueness: using c\vec{c} as an objective direction, due to the convexity of the vectors in CC, the edge vector c\vec{c} itself is the one that maximizes progress in the direction c\vec{c}. Thus, no alternate path beginning at uu can reach the grid point u+cu+\ell\vec{c} within \ell steps, since it necessarily makes less progress in the direction of c\vec{c}, implying path uniqueness.

Our constructions disentangle these two uses of the vector set CC: we depart from prior work by explicitly using a separate edge vector set CC and direction vector set DD. These vector sets crucially do not have the same size: instead we will have |D||C||D|\gg|C|, and this difference allows for a technical optimization of parameters leading to improved lower bounds. Roughly, we can choose DD large enough to have |Π|=Θ(n)|\Pi|=\Theta(n), while also allowing |C|n|C|\ll n. This smaller size |C||C| can be achieved using shorter convex vectors, which in turn lets us pack more layers into the construction without worrying about paths falling off the sides of the grid before reaching the final layer.

We use the following generalized process for iteratively generating critical paths. Each path πΠ\pi\in\Pi is indexed by a node uu in the first layer and a direction vector dD\vec{d}\in D, and at each layer, we greedily select the edge vector cC\vec{c}\in C that maximizes progress in the objective direction d\vec{d}. Since |D||C||D|\gg|C|, by necessity many different objective directions will all select the same edge vector at each layer. This leads to overlapping paths discussed above, but more technical ingredients are still needed to ensure that paths don’t overlap too much. We explain these next.

2.4 Symmetry Breaking

There is an important problem with the construction sketched so far. Consider two nearby direction vectors d1,d2D\vec{d_{1}},\vec{d_{2}}\in D, which have the same optimizing edge vector c\vec{c}. Then for any start node uu, the paths πu,d1,πu,d2\pi_{u,\vec{d_{1}}},\pi_{u,\vec{d_{2}}} will simply select the same edge vector c\vec{c} at each layer, and these two paths will entirely coincide. In other words, the “extra paths” bought by using |D||C||D|\gg|C| are actually just identical copies of a much smaller set of paths, which is not interesting or useful.

We therefore need to somehow break the symmetry between paths that use nearby direction vectors, getting them to eventually choose different edge vectors at some layer to split apart. This is where our lower bound constructions diverge; we will need to use two different symmetry-breaking strategies for shortcut and hopset lower bounds.

Hopset Lower Bounds and ε\varepsilon-Shifting.

In our hopset lower bound construction GG, like [28], our vertices can be interpreted as points in 2\mathbb{R}^{2}. More specifically, they initially form a square grid within the integer lattice 2\mathbb{Z}^{2}, and the columns of this grid act as layers of GG. Our edges initially have the form e=(v,v+c)e=(v,v+\vec{c}) for edge vectors cC\vec{c}\in C; edge vectors always have first coordinate 11, so that they go from one layer to the next. Initially, the weight of an edge is the Euclidean distance between its endpoints.

Our symmetry-breaking step is a random operation where for each layer ii we choose a random variable εi\varepsilon_{i} sampled uniformly from the interval (0,1)(0,1), and we shift the ithi^{th} layer upwards so that its nodes are offset εi\varepsilon_{i} higher than the nodes in the previous layer. The shifts therefore compound across the layers. See Figure 2 for a picture.

ε1\varepsilon_{1}ε2\varepsilon_{2}ε3\varepsilon_{3}ε4\varepsilon_{4}ε5\varepsilon_{5}
Figure 2: Vertex set of the graph GG used for our lower bounds against exact hopsets. Each parameter εi\varepsilon_{i} is the amount the ithi^{th} column is shifted upwards in the plane, relative to the previous column; the {εi}\{\varepsilon_{i}\} values are chosen uniformly and independently from the interval (0,1)(0,1).

Our ε\varepsilon-shifting strategy does not affect the edge set of GG, nor does it affect the set of direction vectors in any way, but it does affect the Euclidean distance between nodes in adjacent layers, and hence it changes the edge weights. It achieves symmetry-breaking for roughly the following reason. In our greedy generation of paths, a path with direction vector d\vec{d} will use the closest edge vector c\vec{c} at each level. If two paths π1\pi_{1} and π2\pi_{2} with direction vectors dj,dkD\vec{d}_{j},\vec{d}_{k}\in D intersect at a node vv in the iith layer of GG, then there will be an interval (a,b)(0,1)(a,b)\subseteq(0,1) such that if εi+1\varepsilon_{i+1} lands in (a,b)(a,b), then π1,π2\pi_{1},\pi_{2} have different closest edge vectors after shifting. Thus, in this event, the paths π1\pi_{1} and π2\pi_{2} split apart at vv and never reconverge (this is formalized in Lemma 3). The size of the interval (a,b)(a,b), and hence the probability that it gets hit by εi+1\varepsilon_{i+1}, is proportional to the distance between dj\vec{d}_{j} and dk\vec{d}_{k}. The effect is that paths generated by nearby direction vectors tend to intersect on long subpaths, while paths generated by far apart direction vectors intersect on shorter subpaths or perhaps just a single node, but with high probability all pairs of paths split apart eventually. Paths generated by the same direction vector remain parallel, and do not intersect at all.

There is a technical detail remaining: we still need to prove that each critical path is a unique shortest path between its endpoints. In [28], the critical paths correspond to lines in Euclidean space, and since edge weights correspond to Euclidean distances, the analogous unique shortest paths property follows instantly from the geometry of 2\mathbb{R}^{2}. Since our critical paths are generated by a more involved process, it is much more technical to prove that they are unique shortest paths. Proposition 3 contains the optimization lemma that needs to hold for our process to generate unique shortest paths, and to push it through, it turns out that we essentially need the derivative of edge weights to be proportional to Euclidean distances. We therefore differ again from [28] by squaring all of our edge weights, meaning that our graph metric is ultimately quite different from 2\mathbb{R}^{2}. See Section 4.4 for additional details.

Shortcut Set Lower Bounds and Edge Vector Subsampling.

Shortcut set lower bounds are unweighted, and this makes the technique of ε\varepsilon-sampling essentially useless in this setting, since it only affects edge weights in the construction and it does not change the edge set. For shortcut sets, we need an entirely different symmetry-breaking strategy that actually changes the edge set from layer to layer.

Our starting graph is similar to the one used by Huang and Pettie [22], mentioned earlier. Each layer of the graph is an independent copy of a square grid in 2\mathbb{Z}^{2}. We generate a large convex set W2W\subseteq\mathbb{R}^{2}; initially, WW plays the role of both edge vectors CC and direction vectors DD. However, for the sake of symmetry-breaking, we do not put edges between all nodes in adjacent layers whose difference is a vector in WW. Instead, at each layer ii we randomly sample exactly two adjacent vectors cλi,cλi+1W\vec{c}_{\lambda_{i}},\vec{c}_{\lambda_{i}+1}\in W, and we use only these two edge vectors to generate edges to the next layer. This is depicted in Figure 3.

Figure 3: Our symmetry-breaking strategy for shortcut set lower bounds starts with a large set of convex vectors, but independently subsamples adjacent pairs of convex vectors to generate the edges between adjacent layers. In this picture, there are 44 edge vectors and 44 layers, but only two of the edge vectors (in blue) are sampled and available between any given pair of adjacent layers. For clarity, we have only drawn the edges leaving one particular node in each layer.

The fact that the sampled vectors are adjacent, and hence typically close together, allows for a key optimization in the construction. The rate at which paths drift apart from each other is much slower than in [22], even when they are generated by very different direction vectors. This allows us to apply a carefully-chosen translation of the grid from layer to layer, in order to keep all of the paths contained in the grid. This in turn lets us pack many more layers into the construction while still ensuring that all of our paths stay within the confines of the grid.

As before, paths πΠ\pi\in\Pi are generated greedily: for direction vector dD\vec{d}\in D, an associated path π\pi will traverse the sampled edge vector in each layer that maximizes progress in the objective direction d\vec{d}. For two paths π1,π2\pi_{1},\pi_{2} with direction vectors dj,dkD\vec{d}_{j},\vec{d}_{k}\in D and vπ1π2v\in\pi_{1}\cap\pi_{2}, these paths split at vv in the event that the sampled edge vectors cλi,cλi+1W\vec{c}_{\lambda_{i}},\vec{c}_{\lambda_{i}+1}\in W lie between dj\vec{d}_{j} and dk\vec{d}_{k} in DD. This again leads to behavior where paths generated by nearby direction vectors tend to coincide on long subpaths, while paths generated by far apart direction vectors have smaller intersections, but with high probability all pairs of paths split apart eventually. Paths with the same direction vector again remain parallel. This is formalized in Lemma 10.

3 Preliminaries

We use the following notations:

  • For a path π\pi, we use |π||\pi| to denote the number of nodes in π\pi. This is 11 different from the (unweighted) length of π\pi. In weighted graphs, we write w(π)w(\pi) for the sum of edge weights in π\pi.

  • We write distG(s,t)\text{dist}_{G}(s,t) for the shortest path distance from node ss to node tt in graph GG (counting edge weights, if GG is weighted). We write hopdistG(s,t)\texttt{hopdist}_{G}(s,t) for the least number of edges contained in any sts\leadsto t shortest path.

  • We use ,\langle\cdot,\cdot\rangle to denote the standard Euclidean inner product.

4 Exact Hopsets

In this section we will prove the following theorem.

Theorem 7.

For any parameter p[1,n2]p\in[1,n^{2}], there exists an nn-node weighted undirected graph G=(V,E,w)G=(V,E,w) such that for any exact hopset HH of size |H|p|H|\leq p where p[1,n2]p\in[1,n^{2}], the graph GHG\cup H must have hopbound Ω(np1/2log1/2n)\Omega\left(\frac{n}{p^{1/2}\log^{1/2}n}\right).

We will prove this via a construction of the following type:

Lemma 1.

For any p[1,n2]p\in[1,n^{2}], there is an infinite family of nn-node undirected weighted graphs G=(V,E,w)G=(V,E,w) and sets Π\Pi of |Π|=p|\Pi|=p paths in GG with the following properties:

  • GG has =Θ(np1/2log1/2n)\ell=\Theta\left(\frac{n}{p^{1/2}\log^{1/2}n}\right) layers, and each path in Π\Pi starts in the first layer, ends in the last layer, and contains exactly one node in each layer.

  • Each path in Π\Pi is the unique shortest path between its endpoints in GG.

  • For any two nodes u,vVu,v\in V, there are at most hopdistG(u,v)\frac{\ell}{\texttt{hopdist}_{G}(u,v)} paths in Π\Pi that contain both uu and vv.

  • Each node vVv\in V lies on at most O(pn)O\left(\frac{\ell p}{n}\right) paths in Π\Pi.

Next we show how Lemma 1 implies Theorem 7.

4.1 Proving Theorem 7 using Lemma 1

Fix an nn and p[1,n2]p\in[1,n^{2}]. Let G=(V,E,w)G=(V,E,w) be the graph in Lemma 1 with associated set Π\Pi of |Π|=2p|\Pi|=2p paths in GG. Let HH be an exact hopset of size |H|p|H|\leq p. Let PV×VP\subseteq V\times V be the set of node pairs that are the endpoints of paths in Π\Pi. We define the following potential function over hopsets HH, which simply sums hopdistances over critical pairs:

ϕ(H):=(s,t)PhopdistGH(s,t)\phi\left(H\right):=\sum\limits_{(s,t)\in P}\texttt{hopdist}_{G\cup H}(s,t)

Observe that by Lemma 1, we have ϕ()=(s,t)PhopdistG(s,t)=|Π|(1)=2p(1)\phi(\emptyset)=\sum_{(s,t)\in P}\texttt{hopdist}_{G}(s,t)=|\Pi|\cdot(\ell-1)=2p(\ell-1). Now fix a pair of nodes (x,y)V×V(x,y)\in V\times V, and let ΠΠ\Pi^{\prime}\subseteq\Pi be the set of paths πΠ\pi\in\Pi such that x,yπx,y\in\pi. We make the following observations.

  • For all (s,t)P(s,t)\in P, if the unique shortest sts\leadsto t-path πΠ\pi\in\Pi in GG is not in Π\Pi^{\prime}, then

    hopdistG{(x,y)}(s,t)=hopdistG(s,t).\texttt{hopdist}_{G\cup\{(x,y)\}}(s,t)=\texttt{hopdist}_{G}(s,t).
  • For all (s,t)P(s,t)\in P, if the unique shortest sts\leadsto t-path πΠ\pi\in\Pi in GG is in Π\Pi^{\prime}, then

    hopdistG(s,t)hopdistG{(x,y)}(s,t)hopdistG(x,y).\texttt{hopdist}_{G}(s,t)-\texttt{hopdist}_{G\cup\{(x,y)\}}(s,t)\leq\texttt{hopdist}_{G}(x,y).

Then by Lemma 1, ϕ()ϕ({(x,y)})|Π|hopdistG(x,y)\phi(\emptyset)-\phi(\{(x,y)\})\leq|\Pi^{\prime}|\cdot\texttt{hopdist}_{G}(x,y)\leq\ell. We obtain the following sequence of inequalities:

ϕ()ϕ(H)(x,y)H(ϕ()ϕ({(x,y)}))|H|p.\phi(\emptyset)-\phi(H)\leq\sum_{(x,y)\in H}\left(\phi(\emptyset)-\phi(\{(x,y)\})\right)\leq\ell\cdot|H|\leq\ell p.

Rearranging, we find that

ϕ(H)ϕ()p2p(1)p=(2)p.\phi(H)\geq\phi(\emptyset)-\ell p\geq 2p(\ell-1)-\ell p=(\ell-2)p.

Thus, over the |P|=2p|P|=2p pairs of path endpoints in PP, the average hopdistance in GHG\cup H is Θ()\Theta(\ell), and so there must be a pair (s,t)P(s,t)\in P such that hopdistGH(s,t)=Θ()\texttt{hopdist}_{G\cup H}(s,t)=\Theta(\ell) by the pigeonhole principle.

4.2 Constructing GG

Our goal is now to prove Lemma 1. Let nn be a sufficiently large positive integer, and let p[n,n2]p\in[n,n^{2}].444We will handle the case where p[1,n]p\in[1,n] later. For simplicity of presentation, we will frequently ignore issues related to non-integrality of expressions that arise in our construction; these issues affect our bounds only by lower-order terms. Initially, all edges (u,v)(u,v) in GG will be directed from uu to vv; we will convert GG into an undirected graph in the final step of our construction.

Vertex Set VV.

  • Let \ell be a positive integer parameter of the construction to be specified later. Our graph GG will have \ell layers L1,,LL_{1},\dots,L_{\ell}, and each layer will have n/n/\ell nodes, ordered from 11 to n/n/\ell. Initially, we will label the jjth node in layer LiL_{i} with tuple (i,j)(i,j). We will interpret the node labeled (i,j)(i,j) as a point in 2\mathbb{R}^{2} with integer coordinates. These nn nodes arranged in \ell layers will be the node set VV of graph GG.

  • We now perform the following random operation on the node labels of VV. For each layer LiL_{i}, i[1,]i\in[1,\ell], uniformly sample a random real number in the interval (0,1)(0,1) and call it εi\varepsilon_{i}. Now for each node in layer LiL_{i} of GG labeled (i,j)(i,j), relabel this node with the label

    (i,j+k=1jεk).\left(i,j+\sum_{k=1}^{j}\varepsilon_{k}\right).

    Again, we interpret the resulting labels for nodes in VV as points in 2\mathbb{R}^{2}. In a slight abuse of notation, we will treat vVv\in V as either a node in GG or a point in 2\mathbb{R}^{2}, depending on the context. Less formally: for each layer ii, this step shifts the nodes in layer ii vertically upwards to be εi\varepsilon_{i} higher than the previous layer (and thus, these vertical shifts compound across the layers). See Section 2 for intuition on this design choice.

Edge Set EE.

  • All the edges in EE will be between consecutive layers Li,Li+1L_{i},L_{i+1} of GG. We will let EiE_{i} denote the set of edges in GG between layers LiL_{i} and Li+1L_{i+1}.

  • Just as our nodes in VV correspond to points in 2\mathbb{R}^{2}, we can interpret the edges EE in GG as vectors in 2\mathbb{R}^{2}. In particular, for every edge e=(v1,v2)Ee=(v_{1},v_{2})\in E, we identify ee with the corresponding vector ue:=v2v1\vec{u}_{e}:=v_{2}-v_{1}. Note that since all edges in EE are between adjacent layers LiL_{i} and Li+1L_{i+1}, the first coordinate of ue\vec{u}_{e} is 11 for all eEe\in E. We will use ueu_{e} to denote the 2nd coordinate of ue\vec{u}_{e}, i.e., for all eEe\in E, we write ue=(1,ue)\vec{u}_{e}=(1,u_{e}).

  • We begin our construction of EE by defining the following set CC of vectors:

    C:={(1,x)x[0,n42]}.C:=\left\{(1,x)\mid x\in\left[0,\frac{n}{4\ell^{2}}\right]\right\}.

    We will refer to the vectors in CC as edge vectors.

  • For each i[1,1]i\in[1,\ell-1], let

    Ci:={c+(0,εi+1)}cC.C_{i}:=\{\vec{c}+(0,\varepsilon_{i+1})\}_{\vec{c}\in C}.

    Intuitively: we want the edge vectors in CC to point between nodes in adjacent layers, and due to the random vertical shifts between layers applied to the nodes, we need to apply a similar shift to CC at each layer to adjust for this.

  • For each vLiv\in L_{i} and edge vector cCi\vec{c}\in C_{i}, if v+cVv+\vec{c}\in V, then add edge (v,v+c)(v,v+\vec{c}) to EiE_{i}. After adding these edges to EiE_{i}, we will have that

    Ci={ueeEi}.C_{i}=\{\vec{u}_{e}\mid e\in E_{i}\}.

    Additionally, note that the case v+cVv+\vec{c}\not\in V only occurs if v+c=(i+1,j)v+\vec{c}=(i+1,j) for some jj that is higher than any point in the (i+1)(i+1)st layer; that is, j>n+k=1i+1εkj>\frac{n}{\ell}+\sum_{k=1}^{i+1}\varepsilon_{k}.

  • For each eEe\in E, if ue=(1,ue)\vec{u}_{e}=(1,u_{e}), then we assign edge ee the weight w(e):=ue2w(e):=u_{e}^{2}.

This completes the construction of our graph G=(V,E,w)G=(V,E,w).

4.3 Direction Vectors, Critical Pairs, and Critical Paths

Our next step is to generate a set of critical pairs PV×VP\subseteq V\times V, as well as a set of critical paths Π\Pi. Specifically, there will be one critical path πs,tΠ\pi_{s,t}\in\Pi going between each critical pair (s,t)P(s,t)\in P, and we will show that πs,t\pi_{s,t} is the unique shortest (weighted) sts\leadsto t path in GG. We will identify our critical pairs and paths by first constructing a set of vectors D2D\subseteq\mathbb{R}^{2} that we call direction vectors, which we define next.

Direction Vectors DD.

  • Let q0q\in\mathbb{Z}_{\geq 0} be a sufficiently large integer parameter to be specified later. The size of qq will roughly correspond to the maximum number of edges shared between any two critical paths in GG.

  • We choose our set of direction vectors DD to be 555Note that if (1,x)D(1,x)\in D, then x[1,n42]x\in\left[1,\frac{n}{4\ell^{2}}\right]. However, if (1,x)C(1,x)\in C, then x[0,n42]x\in\left[0,\frac{n}{4\ell^{2}}\right]. This +1+1 gap between CC and DD is needed to accommodate the ε\varepsilon-shifting operation used to obtain CiC_{i}, and is relevant in the proof of Lemma 3.

    D:={(1, x+yq)|x[1, n421] and y[0,q]}.D:=\left\{\left(1,\text{ }x+\frac{y}{q}\right)\bigl{|}x\in\left[1,\text{ }\frac{n}{4\ell^{2}}-1\right]\text{ and }y\in[0,q]\right\}.

    Note that there are q+1q+1 direction vectors between adjacent vectors (1,x),(1,x+1)C(1,x),(1,x+1)\in C for x0x\neq 0. Additionally, adjacent direction vectors in DD differ only by 1/q1/q in their second coordinate.

Proposition 1.

With probability 11, for every i[1,1]i\in[1,\ell-1] and every direction vector d=(1,d)D\vec{d}=(1,d)\in D, there is a unique vector (1,c)Ci(1,c)\in C_{i} that minimizes |cd||c-d| over all choices of (1,c)Ci(1,c)\in C_{i}.

Proof.

There are only finitely many choices of εi+1(0,1)\varepsilon_{i+1}\in(0,1) that result in there being two distinct vectors (1,c1),(1,c2)Ci(1,c_{1}),(1,c_{2})\in C_{i} such that |c1d|=|c2d||c_{1}-d|=|c_{2}-d|. We conclude that the claim holds with probability 11. ∎

In the following we assume that this event holds, i.e., there is a unique minimizing vector in CiC_{i} for all dD\vec{d}\in D. Each of our critical paths π\pi in Π\Pi will have an associated direction vector dD\vec{d}\in D, and for all i[1,1]i\in[1,\ell-1], path π\pi will take an edge vector in CiC_{i} that is closest to d\vec{d} in the sense of Proposition 1 (see Section 2 for more intuition).

Critical Pairs PP and Critical Paths Π\Pi.

  • We first define a set SL1S\subseteq L_{1} containing half of the nodes in the first layer L1L_{1} of GG:

    S:={(1,j+ε1)L1j[1,n2]}.S:=\left\{(1,j+\varepsilon_{1})\in L_{1}\mid j\in\left[1,\frac{n}{2\ell}\right]\right\}.

    We will choose our set of demand pairs PP so that PS×LP\subseteq S\times L_{\ell}. For every node sSs\in S and direction vector dD\vec{d}\in D, we will create a critical pair (s,t)S×L(s,t)\in S\times L_{\ell} and a corresponding critical path πs,t\pi_{s,t} to add to PP and Π\Pi.

  • Let v1Sv_{1}\in S, and let d=(1,d)D\vec{d}=(1,d)\in D. The associated path π\pi has start node v1v_{1}. We iteratively grow π\pi, layer-by-layer, as follows. Suppose that currently π=(v1,,vi)\pi=(v_{1},\dots,v_{i}), for i<i<\ell, with each viLiv_{i}\in L_{i}. To determine the next node vi+1Li+1v_{i+1}\in L_{i+1}, let EiviEiE_{i}^{v_{i}}\subseteq E_{i} be the edges in EiE_{i} incident to viv_{i}, and let

    ei:=argmineEivi(|ued|).e_{i}:=\text{argmin}_{e\in E_{i}^{v_{i}}}(|u_{e}-d|).

    By definition, eie_{i} is an edge whose first node is viv_{i}; we define vi+1Li+1v_{i+1}\in L_{i+1} to be the other node in eie_{i}, and we append vi+1v_{i+1} to π\pi.

  • This completes our construction of PP and Π\Pi. Note that

    • we will show that the paths generated in this way have distinct endpoints (with high probability), and therefore |P|=|S||D|n2q163|P|=|S||D|\geq\frac{n^{2}q}{16\ell^{3}}, and

    • every path πs,tΠ\pi_{s,t}\in\Pi contains one node in each layer, and therefore its number of nodes is |πs,t|=|\pi_{s,t}|=\ell.

An important feature for correctness of our construction is that, when we iteratively generate paths, we never reach a point viv_{i} such that vi+CiLi+1v_{i}+C_{i}\not\subseteq L_{i+1} (i.e. vi+cLi+1v_{i}+\vec{c}\not\in L_{i+1} for some cCi\vec{c}\in C_{i}). This follows by straightforward counting, based on the maximum second coordinate used in our edge vectors CC and also on our choice of start nodes SS as only the “lower half” of the nodes in the first layer. The following proposition expresses this correctness in a particular way, pointing out that for any node vv lying on a generated path π\pi, none of the edges from vv to the following layer are omitted from the graph due to falling off the top of the grid with a too-high second coordinate.

Proposition 2.

Let vLiπv\in L_{i}\cap\pi for some i[1,1]i\in[1,\ell-1] and πΠ\pi\in\Pi. Then {ueeEiv}=Ci\{\vec{u}_{e}\mid e\in E_{i}^{v}\}=C_{i}.

Proof.

Let v=:(i,j)v=:(i,j), and let (s,t)P(s,t)\in P be the endpoints of π\pi. Since sSs\in S, we have s=(1,s2)2s=(1,s_{2})\in\mathbb{R}^{2}, where

s2n2+ε1.s_{2}\leq\frac{n}{2\ell}+\varepsilon_{1}.

Moreover, since for all eEie\in E_{i} the corresponding vector ue=(1,ue)\vec{u}_{e}=(1,u_{e}) satisfies εi+1uen42+εi+1\varepsilon_{i+1}\leq u_{e}\leq\frac{n}{4\ell^{2}}+\varepsilon_{i+1}, we have

jn2+(i1)n42+k=1iεk3n4+k=1iεk.j\leq\frac{n}{2\ell}+(i-1)\cdot\frac{n}{4\ell^{2}}+\sum_{k=1}^{i}\varepsilon_{k}\leq\frac{3n}{4\ell}+\sum_{k=1}^{i}\varepsilon_{k}.

Then observe that for all c=(1,c)Ci\vec{c}=(1,c)\in C_{i}, we have that v+c=(i+1,j+c)v+\vec{c}=(i+1,j+c), where

j+c(3n4+k=1iεk)+(n42+εi+1)n+k=1i+1εk.j+c\leq\left(\frac{3n}{4\ell}+\sum_{k=1}^{i}\varepsilon_{k}\right)+\left(\frac{n}{4\ell^{2}}+\varepsilon_{i+1}\right)\leq\frac{n}{\ell}+\sum_{k=1}^{i+1}\varepsilon_{k}.

Thus (i+1,j+c)Li+1(i+1,j+c)\in L_{i+1}, and so we have (v,v+c)Eiv(v,v+\vec{c})\in E_{i}^{v} for all cCi\vec{c}\in C_{i}. It follows that {ueeEiv}=Ci\{\vec{u}_{e}\mid e\in E_{i}^{v}\}=C_{i}. ∎

4.4 Critical paths are unique shortest paths

We now verify that graph GG and paths Π\Pi have the unique shortest path property as stated in Lemma 1.

Lemma 2 (Unique shortest paths).

With probability 11, for every (s,t)P(s,t)\in P, path πs,tΠ\pi_{s,t}\in\Pi is a unique shortest (weighted) sts\leadsto t-path in GG.

We begin with a technical proposition:

Proposition 3.

Let b,x1,,xkb,x_{1},\dots,x_{k}\in\mathbb{R}. Now consider x^1,,x^k\hat{x}_{1},\dots,\hat{x}_{k} such that

  • |x^ib||xib||\hat{x}_{i}-b|\leq|x_{i}-b| for all i[1,k]i\in[1,k], and

  • i=1kxi=i=1kx^i\sum_{i=1}^{k}x_{i}=\sum_{i=1}^{k}\hat{x}_{i}.

Then

i=1kxi2i=1kx^i2,\sum_{i=1}^{k}x_{i}^{2}\geq\sum_{i=1}^{k}\hat{x}_{i}^{2},

with equality only if |x^ib|=|xib||\hat{x}_{i}-b|=|x_{i}-b| for all i[1,k]i\in[1,k].

Proof.

We will prove the equivalent statement i=1k(x^i2xi2)0\sum_{i=1}^{k}(\hat{x}_{i}^{2}-x_{i}^{2})\leq 0. Fix an i[1,k]i\in[1,k]. First we will show that

x^i2xi22b(x^ixi).\hat{x}_{i}^{2}-x_{i}^{2}\leq 2b(\hat{x}_{i}-x_{i}).

We split our analysis into four cases:

  • Case 1: bx^ixib\leq\hat{x}_{i}\leq x_{i}. In this case, x^i2xi2=(x^i+xi)(x^ixi)2b(x^ixi)\hat{x}_{i}^{2}-x_{i}^{2}=(\hat{x}_{i}+x_{i})(\hat{x}_{i}-x_{i})\leq 2b(\hat{x}_{i}-x_{i}).

  • Case 2: xix^ibx_{i}\leq\hat{x}_{i}\leq b. In this case, x^i2xi2=(x^i+xi)(x^ixi)2b(x^ixi)\hat{x}_{i}^{2}-x_{i}^{2}=(\hat{x}_{i}+x_{i})(\hat{x}_{i}-x_{i})\leq 2b(\hat{x}_{i}-x_{i}).

  • Case 3: x^ibxi\hat{x}_{i}\leq b\leq x_{i}. In this case, x^i2xi2=(x^i+xi)(x^ixi)2b(x^ixi)\hat{x}_{i}^{2}-x_{i}^{2}=(\hat{x}_{i}+x_{i})(\hat{x}_{i}-x_{i})\leq 2b(\hat{x}_{i}-x_{i}), since bx^ixibb-\hat{x}_{i}\leq x_{i}-b.

  • Case 4: xibx^ix_{i}\leq b\leq\hat{x}_{i}. In this case, x^i2xi2=(x^i+xi)(x^ixi)2b(x^ixi)\hat{x}_{i}^{2}-x_{i}^{2}=(\hat{x}_{i}+x_{i})(\hat{x}_{i}-x_{i})\leq 2b(\hat{x}_{i}-x_{i}), since x^ibbxi\hat{x}_{i}-b\leq b-x_{i}.

Then

i=1k(x^i2xi2)\displaystyle\sum_{i=1}^{k}(\hat{x}_{i}^{2}-x_{i}^{2}) 2bi=1k(x^ixi)=0.\displaystyle\leq 2b\sum_{i=1}^{k}(\hat{x}_{i}-x_{i})=0.

This inequality is strict if |x^ib|<|xib||\hat{x}_{i}-b|<|x_{i}-b| for some i[1,k]i\in[1,k]. ∎

Using Proposition 3, we can now prove Lemma 2.

Proof of Lemma 2.

Fix an (s,t)P(s,t)\in P, and let (1,x)D(1,x)\in D be the direction vector associated with πs,t\pi_{s,t}. Let x^1,,x^1\hat{x}_{1},\dots,\hat{x}_{\ell-1}\in\mathbb{R} be real numbers such that the iith edge of πs,t\pi_{s,t} has the corresponding vector (1,x^i)Ci(1,\hat{x}_{i})\in C_{i} for i[1,1]i\in[1,\ell-1]. Now consider an arbitrary sts\leadsto t-path π\pi in GG, where ππs,t\pi\neq\pi_{s,t}. Since all edges in GG are directed from LiL_{i} to Li+1L_{i+1}, it follows that π\pi has 1\ell-1 edges and the iith edge of π\pi is in EiE_{i}. Let x1,,x1x_{1},\dots,x_{\ell-1}\in\mathbb{R} be real numbers such that the iith edge of π\pi has the corresponding vector (1,xi)Ci(1,x_{i})\in C_{i} for i[1,1]i\in[1,\ell-1]. Now observe that since π\pi and πs,t\pi_{s,t} are both sts\leadsto t-paths, it follows that

i=11x^i=i=11xi.\sum_{i=1}^{\ell-1}\hat{x}_{i}=\sum_{i=1}^{\ell-1}x_{i}.

Additionally, by our construction of πs,t\pi_{s,t}, it follows that

|x^ix||xix||\hat{x}_{i}-x|\leq|x_{i}-x|

for all i[1,1]i\in[1,\ell-1]. In particular, since ππs,t\pi\neq\pi_{s,t}, there must be some j[1,1]j\in[1,\ell-1] such that x^jxj\hat{x}_{j}\neq x_{j}, and so by Proposition 1, |x^jx|<|xjx||\hat{x}_{j}-x|<|x_{j}-x| with probability 1. Then by Proposition 3,

w(πs,t)=eπs,tw(e)=i=11x^i2<i=11xi2=eπw(e)=w(π).w(\pi_{s,t})=\sum_{e\in\pi_{s,t}}w(e)=\sum_{i=1}^{\ell-1}\hat{x}_{i}^{2}<\sum_{i=1}^{\ell-1}x_{i}^{2}=\sum_{e\in\pi}w(e)=w(\pi).

Path πs,t\pi_{s,t} is a unique shortest sts\leadsto t-path in GG, as desired. ∎

4.5 Critical Paths Intersection Properties

Before finishing our proof of Lemma 1, we will need to establish several properties of the critical paths in Π\Pi.

Proposition 4.

Let π1,π2Π\pi_{1},\pi_{2}\in\Pi be two critical paths with the same corresponding direction vector dD\vec{d}\in D. Then π1π2=\pi_{1}\cap\pi_{2}=\emptyset.

Proof.

Let vijLiv_{i}^{j}\in L_{i} denote the iith node of πj\pi_{j}, where j{1,2}j\in\{1,2\}. Note that since π1\pi_{1} and π2\pi_{2} share the same direction vector d\vec{d}, the edges (vi1,vi+11)(v_{i}^{1},v_{i+1}^{1}) and (vi2,vi+12)(v_{i}^{2},v_{i+1}^{2}) have the same corresponding vector uiCi\vec{u}_{i}\in C_{i} for all i[1,1]i\in[1,\ell-1] by Proposition 2. By our construction of Π\Pi, for each node in the first layer vL1v\in L_{1}, vv belongs to at most one path πΠ\pi\in\Pi with direction vector d\vec{d}, so v11v12v_{1}^{1}\neq v_{1}^{2}. Then for all i[1,]i\in[1,\ell],

vi1=v11+i=1i1uiv12+i=1i1ui=vi2.v_{i}^{1}=v_{1}^{1}+\sum_{i=1}^{i-1}\vec{u}_{i}\neq v_{1}^{2}+\sum_{i=1}^{i-1}\vec{u}_{i}=v_{i}^{2}.\qed

Let π1,π2Π\pi_{1},\pi_{2}\in\Pi be two critical paths, and let vVv\in V be a node in GG. We say that paths π1\pi_{1} and π2\pi_{2} split at vv if vπ1π2v\in\pi_{1}\cap\pi_{2} and the node following vv in π1\pi_{1} is distinct from the node following vv in π2\pi_{2}, and we simply say that π1\pi_{1} and π2\pi_{2} split if there exists some vVv\in V such that they split at vv. Note that since π1,π2Π\pi_{1},\pi_{2}\in\Pi are unique shortest paths in GG, paths π1\pi_{1} and π2\pi_{2} can split at most once.

Lemma 3.

Fix a node vLiv\in L_{i}, where i[1,1]i\in[1,\ell-1], and let π1,π2Π\pi_{1},\pi_{2}\in\Pi be critical paths with direction vectors (1,d1),(1,d2)D(1,d_{1}),(1,d_{2})\in D such that vπ1v\in\pi_{1} and vπ2v\in\pi_{2}. Then paths π1\pi_{1} and π2\pi_{2} split at vv with probability at least min{|d1d2|,1}\min\left\{|d_{1}-d_{2}|,1\right\}.666For the sake of completeness, let us be more precise here about the probability claim being made in this lemma. Consider any two paths π1,π2\pi_{1},\pi_{2}, indexed by two start nodes and two direction vectors, and consider a node vLiv\in L_{i}. The event that we generate π1,π2\pi_{1},\pi_{2} in such a way that vπ1π2v\in\pi_{1}\cap\pi_{2} depends only on the random choices of ε1,,εi\varepsilon_{1},\dots,\varepsilon_{i}. If vπ1π2v\in\pi_{1}\cap\pi_{2}, then the event that π1,π2\pi_{1},\pi_{2} split at vv depends only on the random choice of εi+1\varepsilon_{i+1}. The claim is: conditional on the event that {ε1,,εi}\{\varepsilon_{1},\dots,\varepsilon_{i}\} are selected in such a way that vπ1π2v\in\pi_{1}\cap\pi_{2}, the probability that εi+1\varepsilon_{i+1} is selected such that π1,π2\pi_{1},\pi_{2} split at vv is at least min{|d1d2|,1}\min\left\{|d_{1}-d_{2}|,1\right\}.

Proof.

By Proposition 4, d1d2d_{1}\neq d_{2}, and assume wlog that d1<d2d_{1}<d_{2}. Let FF be the event that the random variable εi+1\varepsilon_{i+1} was sampled so that

(+εi+1+1/2)(d1,d2),(\mathbb{Z}+\varepsilon_{i+1}+1/2)\cap(d_{1},d_{2})\neq\emptyset,

(where (d1,d2)(d_{1},d_{2})\subseteq\mathbb{R} is the open interval with endpoints d1d_{1} and d2d_{2}). Our proof strategy is to show that FF implies that π1,π2\pi_{1},\pi_{2} split at vv, and then to show that FF occurs with the claimed probability.

FF implies that π1,π2\pi_{1},\pi_{2} split at vv.

Assume that FF occurs. By construction there is a nonnegative integer cc\in\mathbb{Z} such that c+εi+1+1/2c+\varepsilon_{i+1}+1/2 is in the interval (d1,d2)(d_{1},d_{2}). Since (d1,d2)(1,n42)(d_{1},d_{2})\subseteq\left(1,\frac{n}{4\ell^{2}}\right), it follows that vectors (1,c+εi+1),(1,c+εi+1+1)2(1,c+\varepsilon_{i+1}),(1,c+\varepsilon_{i+1}+1)\in\mathbb{R}^{2} are in CiC_{i}, because 0cn4210\leq c\leq\frac{n}{4\ell^{2}}-1. More generally, by our choice of sets CC and DD there are vectors (1,c1),(1,c2)Ci(1,c_{1}),(1,c_{2})\in C_{i} such that

c1d1c1+1 and c2d2c2+1.c_{1}\leq d_{1}\leq c_{1}+1\text{ \qquad and \qquad}c_{2}\leq d_{2}\leq c_{2}+1.

Now we claim that

argmin(1,x)Ci|xd1|argmin(1,x)Ci|xd2|.\text{argmin}_{(1,x)\in C_{i}}|x-d_{1}|\neq\text{argmin}_{(1,x)\in C_{i}}|x-d_{2}|.

To see this, suppose for the sake of contradiction that there is a vector (1,y)Ci(1,y)\in C_{i} such that

(1,y)=argmin(1,x)Ci|xd1|=argmin(1,x)Ci|xd2|.(1,y)=\text{argmin}_{(1,x)\in C_{i}}|x-d_{1}|=\text{argmin}_{(1,x)\in C_{i}}|x-d_{2}|.

Then using our assumption that d1<c+εi+1+1/2<d2d_{1}<c+\varepsilon_{i+1}+1/2<d_{2}, we obtain

yd1|yd1|min{|c1d1|,|(c1+1)d1|}1/2<(c+εi+1+1)d1y-d_{1}\leq|y-d_{1}|\leq\min\{|c_{1}-d_{1}|,|(c_{1}+1)-d_{1}|\}\leq 1/2<(c+\varepsilon_{i+1}+1)-d_{1}

and

d2y|yd2|min{|c2d2|,|(c2+1)d2|}1/2<d2(c+εi+1).d_{2}-y\leq|y-d_{2}|\leq\min\{|c_{2}-d_{2}|,|(c_{2}+1)-d_{2}|\}\leq 1/2<d_{2}-(c+\varepsilon_{i+1}).

Together, these two sequences of inequalities imply that c+εi+1<y<c+εi+1+1c+\varepsilon_{i+1}<y<c+\varepsilon_{i+1}+1. But this contradicts our assumption that (1,y)Ci(1,y)\in C_{i}, so we conclude that

argmin(1,x)Ci|xd1|argmin(1,x)Ci|xd2|.\text{argmin}_{(1,x)\in C_{i}}|x-d_{1}|\neq\text{argmin}_{(1,x)\in C_{i}}|x-d_{2}|.

By Proposition 2, {ueeEiv}=Ci\{\vec{u}_{e}\mid e\in E_{i}^{v}\}=C_{i}, so we have also shown that

argmin{ueeEiv}|ued1|argmin{ueeEiv}|ued2|.\text{argmin}_{\{\vec{u}_{e}\mid e\in E_{i}^{v}\}}|u_{e}-d_{1}|\neq\text{argmin}_{\{\vec{u}_{e}\mid e\in E_{i}^{v}\}}|u_{e}-d_{2}|.

Then π1\pi_{1} and π2\pi_{2} must split at vv by our construction of the critical paths in Π\Pi.

FF happens with good probability.

Since εi+1\varepsilon_{i+1} is sampled uniformly at random from the interval (0,1)(0,1), it follows that (+εi+1+1/2)(d1,d2)(\mathbb{Z}+\varepsilon_{i+1}+1/2)\cap(d_{1},d_{2})\neq\emptyset with probability at least min{d2d1,1}\min\{d_{2}-d_{1},1\}. ∎

We will use Lemma 3 to prove the following two lemmas, which capture the key properties of our graph GG.

Lemma 4 (Low path overlap).

Let π1,π2Π\pi_{1},\pi_{2}\in\Pi be critical paths with distinct associated direction vectors (1,d1),(1,d2)D(1,d_{1}),(1,d_{2})\in D. Then:

  • If |d1d2|<1|d_{1}-d_{2}|<1, then with probability at least 1n81-n^{-8}, we have777Formally, we consider any two paths π1,π2\pi_{1},\pi_{2} indexed by two start nodes and direction vectors. When we iteratively generate these paths, the number of nodes in their intersection (possibly 0) depends only on the random choices of ε1,,ε\varepsilon_{1},\dots,\varepsilon_{\ell}. The probability claim in this lemma is with respect to these random choices. |π1π2|8logn|d1d2|.|\pi_{1}\cap\pi_{2}|\leq\frac{8\log n}{|d_{1}-d_{2}|}.

  • If |d1d2|1|d_{1}-d_{2}|\geq 1, then |π1π2|1|\pi_{1}\cap\pi_{2}|\leq 1 (deterministically).

Proof.

We begin with the first point; suppose |d1d2|<1|d_{1}-d_{2}|<1. Suppose we iteratively generate π1,π2\pi_{1},\pi_{2} one layer at a time. Each time we choose a node vv that lies in both π1\pi_{1} and π2\pi_{2}, by Lemma 3, π1\pi_{1} and π2\pi_{2} split at vv with probability at least |d1d2||d_{1}-d_{2}| (over the random choice of εi+1\varepsilon_{i+1}). Moreover, since π1\pi_{1} and π2\pi_{2} are unique shortest paths in GG and GG is acyclic, it follows that π1π2\pi_{1}\cap\pi_{2} is a contiguous subpath of π1\pi_{1} and π2\pi_{2}; thus, once they split, they can no longer intersect in later layers. The number of nodes in the intersection |π1π2||\pi_{1}\cap\pi_{2}| is 11 more than the number of consecutive nodes at which π1,π2\pi_{1},\pi_{2} intersect but do not split. So by the above discussion, we have

Pr[|π1π2|>8logn|d1d2|]\displaystyle\Pr\left[|\pi_{1}\cap\pi_{2}|>\frac{8\log n}{|d_{1}-d_{2}|}\right] (1|d1d2|)8logn|d1d2|\displaystyle\leq(1-|d_{1}-d_{2}|)^{\frac{8\log n}{|d_{1}-d_{2}|}}
e|d1d2|8logn|d1d2|\displaystyle\leq e^{-|d_{1}-d_{2}|\cdot\frac{8\log n}{|d_{1}-d_{2}|}}
e8logn\displaystyle\leq e^{-8\log n}
=n8\displaystyle=n^{-8}

For the second point of the lemma: if |d1,d2|1|d_{1},d_{2}|\geq 1, then by Lemma 3, if there is a node vπ1π2v\in\pi_{1}\cap\pi_{2}, then π1\pi_{1} and π2\pi_{2} split at vv with probability 11, and then they can no longer intersect in later layers. So we have |π1π2|1|\pi_{1}\cap\pi_{2}|\leq 1. ∎

Since |Π|=pn2|\Pi|=p\leq n^{2}, we can argue by a union bound that Lemma 4 holds for all π1,π2Π\pi_{1},\pi_{2}\in\Pi simultaneously with probability at least 1n41-n^{-4}. From now on, we will assume that this property holds for our constructed graph GG.

Once we specify our construction parameters \ell and qq, the following lemma will immediately imply the third property of GG as stated in Lemma 1.

Lemma 5.

Let x,yVx,y\in V be distinct nodes in GG, and let z=hopdistG(x,y)z=\texttt{hopdist}_{G}(x,y). Let

Π:={πΠx,yπ}.\Pi^{\prime}:=\{\pi\in\Pi\mid x,y\in\pi\}.

Then |Π|max{16qlognz,1}|\Pi^{\prime}|\leq\max\left\{\frac{16q\log n}{z},1\right\}.

Proof.

Let Π={π1,,πk}\Pi^{\prime}=\{\pi_{1},\dots,\pi_{k}\} and let (1,di)D(1,d_{i})\in D be the direction vector associated with πi\pi_{i} for i[1,k]i\in[1,k]. Since the paths in Π\Pi^{\prime} all intersect, by Proposition 4 we must have didjd_{i}\neq d_{j} for iji\neq j. Let a=mini[1,k]dia=\min_{i\in[1,k]}d_{i} and let b=maxi[1,k]dib=\max_{i\in[1,k]}d_{i}. Then

dbdak1q,d_{b}-d_{a}\geq\frac{k-1}{q},

since |didj|1/q|d_{i}-d_{j}|\geq 1/q for all (1,di),(1,dj)D(1,d_{i}),(1,d_{j})\in D such that iji\neq j. Thus, by Lemma 4 we must have k1q<1\frac{k-1}{q}<1, since we have at least two nodes x,yπaπbx,y\in\pi_{a}\cap\pi_{b}. So by Lemma 4,

|πaπb|8logndbda8qlognk1.|\pi_{a}\cap\pi_{b}|\leq\frac{8\log n}{d_{b}-d_{a}}\leq\frac{8q\log n}{k-1}.

Since x,yπaπbx,y\in\pi_{a}\cap\pi_{b} and πa\pi_{a} and πb\pi_{b} are unique shortest paths in GG, it follows that they coincide on their subpaths πa[x,y]=πb[x,y]\pi_{a}[x,y]=\pi_{b}[x,y]. Moreover, since the hopdistance from xx to yy in GG is zz, it follows that |πa[x,y]|=|πb[x,y]|=z+1|\pi_{a}[x,y]|=|\pi_{b}[x,y]|=z+1. Then taken together we have

z|πaπb|8qlognk1.z\leq|\pi_{a}\cap\pi_{b}|\leq\frac{8q\log n}{k-1}.

Rearranging, we get

k8qlognz+1.k\leq\frac{8q\log n}{z}+1.

If 8qlognz1\frac{8q\log n}{z}\geq 1, then this implies k16qlognzk\leq\frac{16q\log n}{z}. Otherwise, if 8qlognz<1\frac{8q\log n}{z}<1, then this implies that k1k\leq 1 since kk\in\mathbb{Z}. ∎

4.6 Finishing the proof of Lemma 1

We note that Theorem 7 is trivial in the parameter regime p=Ω(n2/logn)p=\Omega(n^{2}/\log n), since its lower bound on hopbound is Ω(1)\Omega(1). So we will assume p=O(n2/logn)p=O(n^{2}/\log n) in the following, with as small of an implicit constant as needed. Let

=n210p1/2log1/2n and q=210logn.\ell=\frac{n}{2^{10}p^{1/2}\log^{1/2}n}\text{\qquad and \qquad}q=\frac{\ell}{2^{10}\log n}.

We now quickly verify that graph GG and associated critical paths Π\Pi satisfy the properties of Lemma 1:

  • By construction, GG has =Θ(np1/2log1/2n)\ell=\Theta\left(\frac{n}{p^{1/2}\log^{1/2}n}\right) layers, and each path in Π\Pi travels from the first layer to the last layer.

  • Proposition 4 implies that each vertex has at most |D|=O(qn/2)|D|=O\left(qn/\ell^{2}\right) paths passing through it. By our choice of construction parameters \ell and qq, we conclude that |D|=O(p/n)|D|=O(\ell\cdot p/n).

  • Each path πΠ\pi\in\Pi is a unique shortest path between its endpoints in GG by Lemma 2.

  • Since 16qlogn16q\log n\leq\ell and hopdistG(u,v)\texttt{hopdist}_{G}(u,v)\leq\ell, Lemma 5 immediately implies that for all u,vVu,v\in V, there are at most

    max{16qlognhopdistG(u,v),1}hopdistG(u,v)\max\left\{\frac{16q\log n}{\texttt{hopdist}_{G}(u,v)},1\right\}\leq\frac{\ell}{\texttt{hopdist}_{G}(u,v)}

    paths in Π\Pi that contain both uu and vv.

  • For all critical pairs (s,t)P(s,t)\in P, the hopdistance from ss to tt in GG is 116qlogn\ell-1\gg 16q\log n. Then by Lemma 5, each of the |Π||\Pi| paths constructed in Section 4.3 have distinct endpoints and thus are distinct. Then

    |Π|=|S||D|n2nq82n2q163p.|\Pi|=|S||D|\geq\frac{n}{2\ell}\cdot\frac{nq}{8\ell^{2}}\geq\frac{n^{2}q}{16\ell^{3}}\geq p.

We have shown that our directed graph GG satisfies the properties of Lemma 1 in the regime of p[n,n2]p\in[n,n^{2}]. Moreover, our construction still goes through even in the extended regime of p[n/c,n2]p\in[n/c,n^{2}] for any constant c>0c>0. All that remains is to extend our construction to the entire regime of p[1,n]p\in[1,n] and make GG undirected.

Extending the construction to p[1,n]p\in[1,n].

We can extend our construction to the regime of p[1,n]p\in[1,n] with a simple modification to GG that was previously used in the prior work of [29]. We will sketch the modification here and defer the proof of correctness to Lemma 6 in Appendix A.

Let G(n,p)G(n,p) denote an instance of our originally constructed graph GG with input parameters nn and p[n,n2]p\in[n,n^{2}]. Let nn be a sufficiently large integer and let p[1,n]p\in[1,n]. Let G1:=G(p1,p)G_{1}:=G(p_{1},p) where p1=Θ(p)p_{1}=\Theta(p) and p1p_{1} divides nn. Now for each node vv in G1G_{1}, replace vv with a directed path πv=(u1v,,ukv)\pi_{v}=(u_{1}^{v},\dots,u_{k}^{v}) with k=n/p1k=n/p_{1} nodes. For all vVv\in V, assign weight 0 to all edges in πv\pi_{v}. For each edge (v1,v2)(v_{1},v_{2}) originally in G1G_{1}, add edge (ukv1,u1v2)(u_{k}^{v_{1}},u_{1}^{v_{2}}) to the graph. Let G2G_{2} be the resulting graph, and let Π2\Pi_{2} be the updated set of critical paths. This completes the modification.

Lemma 6.

The nn-node graph G2G_{2} and the set Π2\Pi_{2} of |Π2|p|\Pi_{2}|\geq p paths satisfy the properties of Lemma 1.

Proof.

We defer the proof to Appendix A, as it largely follows our earlier analysis. ∎

Making GG undirected.

To make GG undirected, we use the following standard simple blackbox reduction. Let WW be the sum of all edge weights in GG, i.e., W=eEw(e)W=\sum_{e\in E}w(e). For every edge eEe\in E, add +W+W to the weight w(e)w(e) of ee, and treat ee as an undirected edge. Call the resulting graph GG^{\prime}.

We now argue correctness: in particular, we need to argue that for all s,tVs,t\in V such that tt is reachable from ss in GG, π\pi is a shortest weighted (directed) sts\leadsto t-path in GG if and only if π\pi is a shortest weighted (undirected) sts\leadsto t-path in GG^{\prime}.

  • First, note that for all sts\leadsto t-paths π\pi^{\prime} in GG^{\prime}, the number of nodes in π\pi^{\prime} satisfies |π||π||\pi^{\prime}|\geq|\pi| by the construction of GG and GG^{\prime}.

  • Moreover, if |π|>|π||\pi^{\prime}|>|\pi|, then π\pi^{\prime} has one more edge than π\pi. Thus, its weighted length in GG^{\prime} satisfies

    w(π)\displaystyle w(\pi^{\prime}) >(|π|+1)W\displaystyle>(|\pi|+1)\cdot W
    =W|π|+eE(G)w(e)\displaystyle=W|\pi|+\sum\limits_{e\in E(G)}w(e)
    >w(π)\displaystyle>w(\pi)

    and so π\pi^{\prime} is not a shortest path. We conclude that if π\pi^{\prime} is a shortest sts\leadsto t-path in GG^{\prime}, then |π|=|π||\pi^{\prime}|=|\pi|.

  • Any sts\leadsto t-path π\pi^{\prime} in GG^{\prime} with |π||\pi| edges must use exactly one node in each layer, and thus it respects the original edge directions in GG. We conclude that π\pi^{\prime} is a shortest weighted sts\leadsto t-path in GG^{\prime} if and only if π\pi^{\prime} is a shortest weighted (directed) sts\leadsto t-path in GG.

Lemma 1 is immediate from the above discussion.

5 Shortcut Sets

In this section we will prove the following theorem.

Theorem 8.

For any parameter p[1,n5/4]p\in[1,n^{5/4}], there exists an nn-node unweighted directed graph G=(V,E)G=(V,E) such that for any shortcut set HH of size |H|p|H|\leq p, the graph GHG\cup H must have diameter DD, where

D={Ω(np3/4logn)for p[1,n/(αlogn)], where α>0 is a sufficiently large constantΩ(n5/4plog9/4n)for p[n/(αlogn),n5/4].D=\begin{cases}\Omega\left(\frac{n}{p^{3/4}\log n}\right)&\text{for }p\in[1,n/(\alpha\log n)],\text{ where $\alpha>0$ is a sufficiently large constant}\\ \Omega\left(\frac{n^{5/4}}{p\log^{9/4}n}\right)&\text{for }p\in[n/(\alpha\log n),n^{5/4}].\\ \end{cases}

In particular, when p=O(n)p=O(n), GHG\cup H must have diameter D=Ω(n1/4log9/4n)D=\Omega\left(\frac{n^{1/4}}{\log^{9/4}n}\right).

We will prove this via a construction of the following type:

Lemma 7.

For any p[1,n/(αlogn)]p\in[1,n/(\alpha\log n)], where α>0\alpha>0 is a sufficiently large constant, there is an infinite family of nn-node directed unweighted graphs G=(V,E,w)G=(V,E,w) and sets Π\Pi of |Π|=p|\Pi|=p paths in GG with the following properties:

  • GG has =Θ(np3/4logn)\ell=\Theta\left(\frac{n}{p^{3/4}\log n}\right) layers. Each path in Π\Pi starts in the first layer, ends in the last layer, and contains exactly one node in each layer.

  • Each path in Π\Pi is the unique path between its endpoints in GG.

  • For any two nodes u,vVu,v\in V, there are at most hopdistG(u,v)\frac{\ell}{\texttt{hopdist}_{G}(u,v)} paths in Π\Pi that contain both uu and vv.

We quickly verify that Lemma 7 implies Theorem 8.

Proof of Theorem 8 using Lemma 7.

Fix an nn and p[1,n/(αlogn)]p\in[1,n/(\alpha\log n)]. Let G=(V,E,w)G=(V,E,w) be the graph in Lemma 7 with associated set Π\Pi of |Π|=2p|\Pi|=2p paths in GG. Let HH be a shortcut set of size |H|p|H|\leq p. Let PV×VP\subseteq V\times V be the set of node pairs that are the endpoints of paths in Π\Pi. Since all paths in Π\Pi are unique paths between their endpoints in GG, it follows that

diameter(GH)max(s,t)PdistGH(s,t)=max(s,t)PhopdistGH(s,t).\texttt{diameter}(G\cup H)\geq\max_{(s,t)\in P}\text{dist}_{G\cup H}(s,t)=\max_{(s,t)\in P}\texttt{hopdist}_{G\cup H}(s,t).

Then when p[1,n/(αlogn)]p\in[1,n/(\alpha\log n)], we can achieve the bounds in Theorem 8 using the same potential function argument as in Section 4.1.

In order to extend our shortcut set lower bound to p[n/(αlogn),n5/4]p\in[n/(\alpha\log n),n^{5/4}], we will appeal to a more general property of the extremal functions of exact hopsets and shortcut sets, which we defer to Lemma 13 in Appendix B. ∎

5.1 Constructing the strongly convex vector set W(q)W(q)

In our construction of the graph GG, we will implicitly use the following lemma from [3, 8] that establishes the existence of a dense set of vectors that each extend the farthest in their own direction.888This is a slightly stronger property than convexity, and hence is sometimes referred to as “strong convexity” in the area.

Lemma 8 (Theorem 1 of [3]; Lemma 7 of [8]).

For sufficiently large r0r\in\mathbb{Z}_{\geq 0}, there exists a strongly convex set of integer vectors W(r)2W(r)\subseteq\mathbb{Z}^{2} of size |W(r)|=Θ(r2/3)|W(r)|=\Theta(r^{2/3}), such that

  • for all wW(r)\vec{w}\in W(r), wr\|\vec{w}\|\leq r,

  • every wW(r)\vec{w}\in W(r) lies in the first quadrant, i.e., both coordinates are positive, and

  • for all distinct w1,w2W(r)\vec{w}_{1},\vec{w}_{2}\in W(r), w1,w2<w1,w1\langle\vec{w}_{1},\vec{w}_{2}\rangle<\langle\vec{w}_{1},\vec{w}_{1}\rangle.

In our construction of GG, we will use a vector set WW from this lemma to help generate edge and direction vectors. We will make use of the following technical property of the vectors in WW:

Proposition 5.

Let W={w1,,wq}W=\{\vec{w}_{1},\dots,\vec{w}_{q}\} be a set of vectors as described in Lemma 8, with its vectors ordered counterclockwise. For all wi,wj,wkW\vec{w}_{i},\vec{w}_{j},\vec{w}_{k}\in W with i<j<ki<j<k, the following inequalities hold:

wi,wk<wj,wk<wk,wk and wi,wk<wi,wj<wi,wi.\left\langle\vec{w}_{i},\vec{w}_{k}\right\rangle<\left\langle\vec{w}_{j},\vec{w}_{k}\right\rangle<\left\langle\vec{w}_{k},\vec{w}_{k}\right\rangle\text{\qquad and \qquad}\left\langle\vec{w}_{i},\vec{w}_{k}\right\rangle<\left\langle\vec{w}_{i},\vec{w}_{j}\right\rangle<\left\langle\vec{w}_{i},\vec{w}_{i}\right\rangle.
Proof.

We will only prove here that wi,wk<wj,wk<wk,wk\langle\vec{w}_{i},\vec{w}_{k}\rangle<\langle\vec{w}_{j},\vec{w}_{k}\rangle<\langle\vec{w}_{k},\vec{w}_{k}\rangle; the other set of inequalities follows from an identical argument. By Lemma 8 we already have wj,wk<wk,wk\langle\vec{w}_{j},\vec{w}_{k}\rangle<\langle\vec{w}_{k},\vec{w}_{k}\rangle, so it remains only to show that wi,wk<wj,wk\langle\vec{w}_{i},\vec{w}_{k}\rangle<\langle\vec{w}_{j},\vec{w}_{k}\rangle.

Let ψ1\psi_{1} be the inner angle between wi\vec{w}_{i} and wj\vec{w}_{j} and let ψ2\psi_{2} be the inner angle between wj\vec{w}_{j} and wk\vec{w}_{k}; thus the inner angle between wi\vec{w}_{i} and wk\vec{w}_{k} is ψ1+ψ2\psi_{1}+\psi_{2}. We first establish a useful inequality:

wi,wj\displaystyle\langle\vec{w}_{i},\vec{w}_{j}\rangle <wj,wj\displaystyle<\langle\vec{w}_{j},\vec{w}_{j}\rangle by Lemma 8
wiwjcosψ1\displaystyle\|\vec{w}_{i}\|\|\vec{w}_{j}\|\cos\psi_{1} <wj2\displaystyle<\|\vec{w}_{j}\|^{2} cosine formula
wiwkcosψ1cosψ2\displaystyle\|\vec{w}_{i}\|\|\vec{w}_{k}\|\cos\psi_{1}\cos\psi_{2} <wjwkcosψ2.\displaystyle<\|\vec{w}_{j}\|\|\vec{w}_{k}\|\cos\psi_{2}. follows algebraically from previous line

We will next use the trigonometric identity

cosψ1cosψ2\displaystyle\cos\psi_{1}\cos\psi_{2} =sinψ1sinψ2+cos(ψ1+ψ2)\displaystyle=\sin\psi_{1}\sin\psi_{2}+\cos(\psi_{1}+\psi_{2})
>cos(ψ1+ψ2)\displaystyle>\cos(\psi_{1}+\psi_{2}) since ψ1,ψ2(0,π/2)\psi_{1},\psi_{2}\in(0,\pi/2) by Lemma 8.

We are now ready to show:

wi,wk\displaystyle\langle\vec{w}_{i},\vec{w}_{k}\rangle =wiwkcos(ψ1+ψ2)\displaystyle=\|\vec{w}_{i}\|\|\vec{w}_{k}\|\cos(\psi_{1}+\psi_{2}) cosine formula
<wiwkcosψ1cosψ2\displaystyle<\|\vec{w}_{i}\|\|\vec{w}_{k}\|\cos\psi_{1}\cos\psi_{2} second inequality
<wjwkcosψ2\displaystyle<\|\vec{w}_{j}\|\|\vec{w}_{k}\|\cos\psi_{2} first inequality
<wj,wk.\displaystyle<\langle\vec{w}_{j},\vec{w}_{k}\rangle. cosine formula ∎

5.2 Constructing GG

We next construct the graph that will be used for Lemma 7. Let nn be a sufficiently large positive integer, and let p=n/(α0logn)p=n/(\alpha_{0}\log n) for a sufficiently large constant α0>0\alpha_{0}>0 to be chosen later (we will extend our construction to other choices of p[1,n5/4]p\in[1,n^{5/4}] later). For simplicity of presentation, we will frequently ignore issues related to non-integrality of expressions that arise in our construction; these issues affect our bounds only by lower-order terms. All edges (u,v)(u,v) in GG will be directed from uu to vv.

Vertex Set VV.

  • Let rr be a positive integer construction parameter to be specified later. Our graph GG will have :=n1/3r2/3\ell:=\frac{n^{1/3}}{r^{2/3}} layers L1,,LL_{1},\dots,L_{\ell}, and each layer will have n/=n2/3r2/3n/\ell=n^{2/3}r^{2/3} nodes.

  • We will label each node in layer LiL_{i}, i[1,]i\in[1,\ell], with a distinct triple in {i}×[1,n1/3r1/3]×[1,n1/3r1/3]\{i\}\times\left[1,n^{1/3}r^{1/3}\right]\times\left[1,n^{1/3}r^{1/3}\right]. We will interpret the node in LiL_{i} labeled (i,j,k)(i,j,k) as an integer point (i,j,k)3(i,j,k)\in\mathbb{Z}^{3}.

  • These nn nodes arranged in \ell layers will compose our node set

    V=[1,]×[1,n1/3r1/3]×[1,n1/3r1/3]3V=[1,\ell]\times[1,n^{1/3}r^{1/3}]\times[1,n^{1/3}r^{1/3}]\subseteq\mathbb{Z}^{3}

    of graph GG. In a slight abuse of notation, we will treat vVv\in V as either a node in GG or a point in 3\mathbb{Z}^{3}, depending on the context.

Edge Set EE.

  • All the edges in EE will be between consecutive layers Li,Li+1L_{i},L_{i+1} of GG. We will let EiE_{i} denote the set of edges in GG between layers LiL_{i} and Li+1L_{i+1}.

  • Just as our nodes in VV correspond to points in 3\mathbb{R}^{3}, we can interpret the edges EE in GG as vectors in 3\mathbb{R}^{3}. In particular, for every edge e=(v1,v2)Ee=(v_{1},v_{2})\in E, we identify ee with the corresponding vector ue:=v2v1\vec{u}_{e}:=v_{2}-v_{1}. Note that since all edges in EE are between adjacent layers LiL_{i} and Li+1L_{i+1}, the first coordinate of ue\vec{u}_{e} is 11 for all eEe\in E. We will use ueiu_{e}^{i} to denote the iith coordinate of ue\vec{u}_{e} for i{2,3}i\in\{2,3\} (i.e. for all eEe\in E, we write ue=(1,ue2,ue3)\vec{u}_{e}=(1,u_{e}^{2},u_{e}^{3})).

  • We begin our construction of EE by defining the set of vectors W:=W(r3)2W:=W(r^{3})\subseteq\mathbb{Z}^{2}, where W(r3)W(r^{3}) is the strongly convex set of vectors defined in Lemma 8. Let W={w1,,wq}W=\{\vec{w}_{1},\dots,\vec{w}_{q}\}, where vectors wi\vec{w}_{i} are ordered counterclockwise and q:=|W|=Θ(r2)q:=|W|=\Theta(r^{2}). We may assume wlog that qr2q\leq r^{2} (e.g. by removing vectors from WW until this is true).

  • Using WW, we define our set of edge vectors C2C\subseteq\mathbb{Z}^{2} as:

    C:={wi+1wii[1,q1]}.C:=\left\{\vec{w}_{i+1}-\vec{w}_{i}\mid i\in[1,q-1]\right\}.

    Let ci:=wi+1wi\vec{c}_{i}:=\vec{w}_{i+1}-\vec{w}_{i} denote the iith vector of CC.

  • For each layer LiL_{i}, i[1,1]i\in[1,\ell-1], sample a random integer from [1,q1][1,q-1] and call it λi\lambda_{i}. We define the set Ci2C_{i}\subseteq\mathbb{Z}^{2} as

    Ci:={(0,0),cλi},C_{i}:=\{(0,0),\vec{c}_{\lambda_{i}}\},

    where cλi\vec{c}_{\lambda_{i}} is the λi\lambda_{i}th vector of CC. Note that CiC_{i} contains exactly two vectors, the zero vector and a randomly chosen vector cλi\vec{c}_{\lambda_{i}} from CC. Intuitively: for each layer LiL_{i}, we are sampling two adjacent vectors wλi\vec{w}_{\lambda_{i}} and wλi+1\vec{w}_{\lambda_{i}+1} from CC and adding wλi-\vec{w}_{\lambda_{i}} to each of them to obtain CiC_{i}. The purpose of adding the normalizing vector wλi-\vec{w}_{\lambda_{i}} to wλi\vec{w}_{\lambda_{i}} and wλi+1\vec{w}_{\lambda_{i}+1} is to reduce the magnitude of the vectors in CiC_{i}, as we will formalize in Proposition 6.

  • The vectors in set CiC_{i} will define the edges in EiE_{i}. Specifically, for all (i,v1,v2)Li(i,v_{1},v_{2})\in L_{i} and for all (c1,c2)Ci(c_{1},c_{2})\in C_{i} such that

    (i+1,v1+c1,v2+c2)Li+1,(i+1,v_{1}+c_{1},v_{2}+c_{2})\in L_{i+1},

    we add the edge ((i,v1,v2),(i+1,v1+c1,v2+c2))((i,v_{1},v_{2}),(i+1,v_{1}+c_{1},v_{2}+c_{2})) to EiE_{i}.

This completes the construction of our graph G=(V,E)G=(V,E). We now verify that the vectors in CiC_{i} have small magnitude in expectation.

Proposition 6.

For all i[1,1]i\in[1,\ell-1], 𝔼[cλi]24r3q=Θ(r)\mathbb{E}[\|\vec{c}_{\lambda_{i}}\|]\leq\frac{2^{4}r^{3}}{q}=\Theta(r).

Proof.

Note that the vectors in CC correspond to sides of a convex polygon whose vertices are the vectors in WW. Since this polygon is contained in a ball of radius r3r^{3} in >02\mathbb{R}^{2}_{>0} by Lemma 8, it follows that cCc2πr3\sum_{\vec{c}\in C}\|\vec{c}\|\leq 2\pi r^{3}. Note that |C|=q1q/2|C|=q-1\geq q/2 for sufficiently large qq. Then

𝔼[cλi]=cC1|C|c2πr3|C|24r3q.\mathbb{E}[\|\vec{c}_{\lambda_{i}}\|]=\sum_{\vec{c}\in C}\frac{1}{|C|}\cdot\|\vec{c}\|\leq\frac{2\pi r^{3}}{|C|}\leq\frac{2^{4}r^{3}}{q}.\qed

The vectors wiW\vec{w}_{i}\in W have magnitude roughly r3r^{3}, whereas the vectors in CiC_{i} have expected magnitude at most Θ(r)\Theta(r) by Proposition 6. Since each edge in our critical path corresponds to a vector in CiC_{i}, ensuring that the vectors in CiC_{i} have small magnitude (at least in expectation) will be essential for guaranteeing that the paths πΠ\pi\in\Pi are long.

Let us comment here on a discrepancy between this construction and the technical overview. In the technical overview, we stated that we would use the convex vectors WW to generate the edge vectors CC. Instead, we are using the difference between adjacent convex vectors WW to generate CC. This is an optimization: our plan is to sample one edge vector from CC, and use it together with the zero vector (0,0)(0,0) as the two available vectors between pairs of adjacent layers. This is equivalent to sampling two adjacent edge vectors from WW, as advertised in the technical overview, and then applying an appropriate translation of the next layer in space. Our strategy lets us use vectors of length Θ(r)\Theta(r), instead of Θ(r3)\Theta(r^{3}), and these shorter vectors ultimately lead to a stronger lower bound.

5.3 Direction Vectors, Critical Pairs, and Critical Paths

Our next step is to generate a set of critical pairs PV×VP\subseteq V\times V, as well as a set of critical paths Π\Pi. Specifically, there will be one critical path πs,tΠ\pi_{s,t}\in\Pi going between each critical pair (s,t)P(s,t)\in P, and we will show that πs,t\pi_{s,t} is the unique sts\leadsto t path in GG.

Direction Vectors DD.

We choose our set of direction vectors DD to be D:=WD:=W, where WW is our strongly convex set of q=Θ(r2)q=\Theta(r^{2}) vectors. We will let D={d1,,dq}D=\{\vec{d}_{1},\dots,\vec{d}_{q}\} be our list of direction vectors, and we will let the iith vector di\vec{d}_{i} of DD correspond to the iith vector wi\vec{w}_{i} of WW, i.e. di:=wi\vec{d}_{i}:=\vec{w}_{i}, for i[1,q]i\in[1,q]. We will simply use the name di\vec{d}_{i} when we wish to emphasize the role of these vectors as direction vectors.

Note that since D=WD=W, Proposition 5 also holds for DD. That is, if i,j,k[1,q1]i,j,k\in[1,q-1] and i<j<ki<j<k, then

di,dk<dj,dk<dk,dk and di,dk<di,dj<di,di.\langle\vec{d}_{i},\vec{d}_{k}\rangle<\langle\vec{d}_{j},\vec{d}_{k}\rangle<\langle\vec{d}_{k},\vec{d}_{k}\rangle\qquad\text{ and }\qquad\langle\vec{d}_{i},\vec{d}_{k}\rangle<\langle\vec{d}_{i},\vec{d}_{j}\rangle<\langle\vec{d}_{i},\vec{d}_{i}\rangle.

Critical Pairs PP and Critical Paths Π\Pi.

  • We first define a set SL1S\subseteq L_{1}, containing a subset of the nodes in the first layer L1L_{1} of GG:

    S:={(1,j,k)|(j,k)[13n1/3r1/3,23n1/3r1/3]×[13n1/3r1/3,23n1/3r1/3]}.S:=\left\{(1,j,k)\bigm{|}(j,k)\in\left[\frac{1}{3}n^{1/3}r^{1/3},\hskip 5.69054pt\frac{2}{3}n^{1/3}r^{1/3}\right]\times\left[\frac{1}{3}n^{1/3}r^{1/3},\hskip 5.69054pt\frac{2}{3}n^{1/3}r^{1/3}\right]\right\}.

    Informally, SS is a middle square patch of the nodes in L1L_{1}. The key property of SS is that all nodes in SS are of distance at least 13n1/3r2/3\frac{1}{3}n^{1/3}r^{2/3} from the sides of the square grid corresponding to layer L1L_{1}.

    We will choose our set of demand pairs PP so that PS×VP\subseteq S\times V. For every node sSs\in S and direction vector dD\vec{d}\in D, we will choose a critical pair (s,t)S×V(s,t)\in S\times V and a corresponding critical path πs,t\pi_{s,t} to add to PP and Π\Pi.

  • Let v1Sv_{1}\in S, and let d=(1,d)D\vec{d}=(1,d)\in D. The associated path π\pi has start node v1v_{1}. We iteratively grow π\pi, layer-by-layer, as follows. Suppose that currently π=(v1,,vi)\pi=(v_{1},\dots,v_{i}) with each viLiv_{i}\in L_{i}. To determine the next node vi+1Li+1v_{i+1}\in L_{i+1}, let EiviEiE_{i}^{v_{i}}\subseteq E_{i} be the edges in EiE_{i} incident to viv_{i}, and let ui=(ui1,ui2)Ci\vec{u}_{i}=(u_{i}^{1},u_{i}^{2})\in C_{i} be

    ui:=argmaxcCic,d.\vec{u}_{i}:=\text{argmax}_{\vec{c}\in C_{i}}\langle\vec{c},\vec{d}\rangle.

    If (1,ui1,ui2)=uei(1,u_{i}^{1},u_{i}^{2})=\vec{u}_{e_{i}} for some eiEivie_{i}\in E_{i}^{v_{i}}, then by definition, eie_{i} is an edge whose first node is viv_{i}; we define vi+1Li+1v_{i+1}\in L_{i+1} to be the other node in eie_{i}, and we append vi+1v_{i+1} to π\pi. Otherwise, if there is no such edge eie_{i} in EiviE_{i}^{v_{i}}, then we terminate our construction of path π\pi (i.e. viv_{i} will be the final node in π\pi).

    This completes our construction of PP and Π\Pi. We will show that the paths generated in this way have distinct endpoints (with high probability), and therefore |P|=|S||D|n2/3r2/3q25|P|=|S||D|\geq\frac{n^{2/3}r^{2/3}q}{2^{5}}, where q=Θ(r2)q=\Theta(r^{2}).

An important feature for correctness of our construction is that, when we iteratively generate paths, if we reach a point viv_{i} such that vi+CiLi+1v_{i}+C_{i}\not\subseteq L_{i+1} (i.e. vi+cLi+1v_{i}+\vec{c}\not\in L_{i+1} for some cCi\vec{c}\in C_{i}), then we end our path at viv_{i}. As a consequence, our critical paths in Π\Pi may not travel through all \ell layers of GG. However, with nonzero probability, paths in Π\Pi travel through a constant fraction of layers, as we prove in the following proposition.

Proposition 7.

Let ^:=q27r2=Θ()\hat{\ell}:=\ell\cdot\frac{q}{2^{7}r^{2}}=\Theta(\ell). With probability at least 1/21/2, for all πΠ\pi\in\Pi, |π|^|\pi|\geq\hat{\ell}.

Proof.

Each critical path starts at a node ss in SS Each edge of the path corresponds to a vector (1,c1,c2)3(1,c_{1},c_{2})\in\mathbb{Z}^{3} such that (c1,c2)Ci(c_{1},c_{2})\in C_{i} for some i[1,1]i\in[1,\ell-1]. Our path ends when we reach the boundary of our vertex set V=[1,]×[1,n1/3r1/3]×[1,n1/3r1/3]3V=[1,\ell]\times[1,n^{1/3}r^{1/3}]\times[1,n^{1/3}r^{1/3}]\subseteq\mathbb{Z}^{3}. We must show that before any such path travels through ^\hat{\ell} nodes before reaching the boundary. Note that ^=q27r2\hat{\ell}=\ell\cdot\frac{q}{2^{7}r^{2}}\leq\ell, since qr2q\leq r^{2}.

Let xx be the random variable defined as x:=i=1^cλix:=\sum_{i=1}^{\hat{\ell}}\vec{c}_{\lambda_{i}}. Then by Proposition 6 and the linearity of expectation,

𝔼[x]^24r3q=r23=123n1/3r1/3,\mathbb{E}[x]\leq\hat{\ell}\cdot\frac{2^{4}r^{3}}{q}=\ell\cdot\frac{r}{2^{3}}=\frac{1}{2^{3}}n^{1/3}r^{1/3},

where the final equality follows from the fact that =n1/3r2/3\ell=n^{1/3}r^{-2/3}. Then by Markov’s inequality,

Pr[x14n1/3r1/3]1/2.\Pr\left[x\leq\frac{1}{4}n^{1/3}r^{1/3}\right]\geq 1/2.

Now we claim that if x14n1/3r1/3x\leq\frac{1}{4}n^{1/3}r^{1/3}, then for all πΠ\pi\in\Pi, |π|^|\pi|\geq\hat{\ell}. Let πs,tΠ\pi_{s,t}\in\Pi be a critical path for critical pair (s,t)P(s,t)\in P. Let s=(s1,s2,s3)3s=(s_{1},s_{2},s_{3})\in\mathbb{Z}^{3} and let t=(t1,t2,t3)3t=(t_{1},t_{2},t_{3})\in\mathbb{Z}^{3}. By our construction of critical paths Π\Pi, either t1=t_{1}=\ell or t+Ct1Vt+C_{t_{1}}\not\in V. In the first case, |π|=^|\pi|=\ell\geq\hat{\ell}, since path π\pi traveled through all \ell layers LiL_{i}. In the second case, we must have that (t2,t3)(s2,s3)13n1/3r1/3\|(t_{2},t_{3})-(s_{2},s_{3})\|\geq\frac{1}{3}n^{1/3}r^{1/3} by our choice of set SS. But since x14n1/3r1/3x\leq\frac{1}{4}n^{1/3}r^{1/3}, we conclude that |π|^|\pi|\geq\hat{\ell}. The claim follows. ∎

We have shown that with nonzero probability, all our paths in Π\Pi travel through the first ^\hat{\ell} layers of GG. However, we cannot guarantee that paths in Π\Pi travel to layers LiL_{i} with i>^i>\hat{\ell}. Because of this, we choose to remove all layers LiL_{i}, i>^i>\hat{\ell}, from GG. We replace all critical paths πΠ\pi\in\Pi with the truncated subpath of π\pi containing only the first ^\hat{\ell} nodes of π\pi, and we update our critical pairs PV×VP\subseteq V\times V to be the set of all pairs of endpoints of the updated paths in Π\Pi.

5.4 Critical paths are unique paths

We now verify that graph GG and paths Π\Pi have the unique path property as stated in Lemma 7. This will follow straightforwardly from the properties of our set of direction vectors DD, particularly Proposition 5.

Lemma 9 (Unique paths).

For every (s,t)P(s,t)\in P, path πs,t\pi_{s,t} is a unique sts\leadsto t-path in GG.

Proof.

Fix a direction vector dj\vec{d}_{j} in DD. We claim that for all i[1,1]i\in[1,\ell-1], there is a unique vector cCi\vec{c}\in C_{i} such that maximizes c,dj\langle\vec{c},\vec{d}_{j}\rangle. Recall that Ci2C_{i}\subseteq\mathbb{Z}^{2} contains exactly two vectors: the zero vector (0,0)(0,0) and the vector cλi=wλi+1wλi\vec{c}_{\lambda_{i}}=\vec{w}_{\lambda_{i}+1}-\vec{w}_{\lambda_{i}}.

Now assume that jλi+1j\geq\lambda_{i}+1 and observe the following sequence of equivalent inequalities:

wλi,wj\displaystyle\langle\vec{w}_{\lambda_{i}},\vec{w}_{j}\rangle <wλi+1,wj\displaystyle<\langle\vec{w}_{\lambda_{i}+1},\vec{w}_{j}\rangle   by Lemma 8 and Proposition 5
0,wj\displaystyle\langle\vec{0},\vec{w}_{j}\rangle <wλi+1wλi,wj\displaystyle<\langle\vec{w}_{\lambda_{i}+1}-\vec{w}_{\lambda_{i}},\vec{w}_{j}\rangle
0,dj\displaystyle\langle\vec{0},\vec{d}_{j}\rangle <cλi+1,dj.\displaystyle<\langle\vec{c}_{\lambda_{i}+1},\vec{d}_{j}\rangle.

When j<λi+1j<\lambda_{i}+1, an identical argument shows that 0,dj>cλi+1,dj\langle\vec{0},\vec{d}_{j}\rangle>\langle\vec{c}_{\lambda_{i}+1},\vec{d}_{j}\rangle. In either case, there is a unique vector cCi\vec{c}\in C_{i} maximizing c,dj\langle\vec{c},\vec{d}_{j}\rangle for all i[1,1]i\in[1,\ell-1].

Now fix a critical pair (s,t)P(s,t)\in P that has djD\vec{d}_{j}\in D as its associated direction vector and πs,t=(v1,,vk)\pi_{s,t}=(v_{1},\dots,v_{k}) as its critical path (where s=v1s=v_{1} and t=vkt=v_{k}). Let f:32f:\mathbb{R}^{3}\mapsto\mathbb{R}^{2} be the function that projects each point in 3\mathbb{R}^{3} onto the subspace corresponding to the last two coordinates of 3\mathbb{R}^{3}, i.e. f(x,y,z)=(y,z)f(x,y,z)=(y,z) for all (x,y,z)3(x,y,z)\in\mathbb{R}^{3}.

Let π\pi be an arbitrary sts\leadsto t-path, and note that |π|=|πs,t||\pi|=|\pi_{s,t}| since GG is a layered directed graph. Let π=(v1,,vk)\pi=(v_{1}^{\prime},\dots,v_{k}^{\prime}), where s=v1s=v_{1}^{\prime} and t=vkt=v_{k}^{\prime}. By our construction of path πs,t\pi_{s,t}, we must have that for all i[1,k1]i\in[1,k-1],

f(vi+1vi),djf(vi+1vi),dj.\langle f(v^{\prime}_{i+1}-v^{\prime}_{i}),\vec{d}_{j}\rangle\leq\langle f(v_{i+1}-v_{i}),\vec{d}_{j}\rangle.

Now suppose for the sake of contradiction that ππs,t\pi\neq\pi_{s,t}. Then vi+1vivi+1viv^{\prime}_{i+1}-v^{\prime}_{i}\neq v_{i+1}-v_{i} for some i[1,k1]i\in[1,k-1]. Then f(vi+1vi),dj<f(vi+1vi),dj\langle f(v^{\prime}_{i+1}-v^{\prime}_{i}),\vec{d}_{j}\hskip 2.84526pt\rangle<\langle f(v_{i+1}-v_{i}),\vec{d}_{j}\hskip 2.84526pt\rangle by the above discussion. But then since π\pi and πs,t\pi_{s,t} are both sts\leadsto t-paths,

f(ts),dj=i=1k1f(vi+1vi),dj<i=1k1f(vi+1vi),dj=f(ts),dj.\displaystyle\langle f(t-s),\vec{d}_{j}\rangle=\sum_{i=1}^{k-1}\langle f(v^{\prime}_{i+1}-v^{\prime}_{i}),\vec{d}_{j}\rangle<\sum_{i=1}^{k-1}\langle f(v_{i+1}-v_{i}),\vec{d}_{j}\rangle=\langle f(t-s),\vec{d}_{j}\rangle.

This is a contradiction, so we conclude that π=πs,t\pi=\pi_{s,t}. Then path πs,t\pi_{s,t} is a unique sts\leadsto t-path in GG. ∎

5.5 Critical Paths Intersection Properties

Before finishing our proof of Lemma 7, we will need to establish several properties of the critical paths in Π\Pi.

Proposition 8.

Let π1,π2Π\pi_{1},\pi_{2}\in\Pi be two critical paths with the same corresponding direction vector dD\vec{d}\in D. Then π1π2=\pi_{1}\cap\pi_{2}=\emptyset.

Proof.

Let k=min{|π1|,|π2|}k=\min\{|\pi_{1}|,|\pi_{2}|\}. Let vijLiv_{i}^{j}\in L_{i} denote the iith node of πj\pi_{j}, where j{1,2}j\in\{1,2\} and i[1,k]i\in[1,k]. Note that since π1\pi_{1} and π2\pi_{2} share the same direction vector d\vec{d}, edges (vi1,vi+11)(v_{i}^{1},v_{i+1}^{1}) and (vi2,vi+12)(v_{i}^{2},v_{i+1}^{2}) have the same corresponding vector uiCi\vec{u}_{i}\in C_{i} for all i[1,k1]i\in[1,k-1] by our construction of π1\pi_{1} and π2\pi_{2}. Also, for each node vL1v\in L_{1}, vv belongs to at most one path πΠ\pi\in\Pi with direction vector d\vec{d}, so v11v12v_{1}^{1}\neq v_{1}^{2}. Then for all i[1,k]i\in[1,k],

vi1=v11+i=1i1uiv12+i=1i1ui=vi2.v_{i}^{1}=v_{1}^{1}+\sum_{i=1}^{i-1}\vec{u}_{i}\neq v_{1}^{2}+\sum_{i=1}^{i-1}\vec{u}_{i}=v_{i}^{2}.\qed

Let π1,π2Π\pi_{1},\pi_{2}\in\Pi be two critical paths, and let vVv\in V be a node in GG. We say that paths π1\pi_{1} and π2\pi_{2} split at vv if vπ1π2v\in\pi_{1}\cap\pi_{2} and the node following vv in π1\pi_{1} is distinct from the node following vv in π2\pi_{2}, and we simply say that π1\pi_{1} and π2\pi_{2} split if there exists some vVv\in V such that π1\pi_{1} and π2\pi_{2} split at vv. Note that since π1,π2Π\pi_{1},\pi_{2}\in\Pi are unique paths in GG, paths π1\pi_{1} and π2\pi_{2} can split at most once.

Lemma 10.

Fix a node vLiv\in L_{i}, where i[1,1]i\in[1,\ell-1], and let π1,π2Π\pi_{1},\pi_{2}\in\Pi be critical paths with direction vectors dj\vec{d}_{j} and dkD\vec{d}_{k}\in D, j,k[1,q]j,k\in[1,q], such that vπ1v\in\pi_{1} and vπ2v\in\pi_{2}. Then paths π1\pi_{1} and π2\pi_{2} split at vv with probability at least |jk|q\frac{|j-k|}{q}.

Proof.

Fix a node vLiv\in L_{i}, where i[1,1]i\in[1,\ell-1], and let π1,π2Π\pi_{1},\pi_{2}\in\Pi be critical paths with direction vectors dj\vec{d}_{j} and dkD\vec{d}_{k}\in D, j,k[1,q]j,k\in[1,q], such that vπ1v\in\pi_{1} and vπ2v\in\pi_{2}. By Proposition 8, jkj\neq k, and assume wlog that j<kj<k. Let FF be the event that the random variable λi\lambda_{i} was sampled so that

jλi<k.j\leq\lambda_{i}<k.

Our proof strategy is to show that FF implies that π1,π2\pi_{1},\pi_{2} split at vv, and then to show that FF occurs with the claimed probability.

FF implies that π1,π2\pi_{1},\pi_{2} split at vv.

Assume that FF occurs. Then jλi<λi+1kj\leq\lambda_{i}<\lambda_{i}+1\leq k. Now observe the following sequence of equivalent inequalities:

wj,wλi\displaystyle\langle\vec{w}_{j},\vec{w}_{\lambda_{i}}\rangle >wj,wλi+1\displaystyle>\langle\vec{w}_{j},\vec{w}_{\lambda_{i}+1}\rangle by Lemma 8 and Proposition 5
wj,0\displaystyle\langle\vec{w}_{j},\vec{0}\rangle >wj,wλi+1wλi\displaystyle>\langle\vec{w}_{j},\vec{w}_{\lambda_{i}+1}-\vec{w}_{\lambda_{i}}\rangle
dj,0\displaystyle\langle\vec{d}_{j},\vec{0}\rangle >dj,cλi.\displaystyle>\langle\vec{d}_{j},\vec{c}_{\lambda_{i}}\rangle.

Since Ci={0,cλi}C_{i}=\{\vec{0},\vec{c}_{\lambda_{i}}\}, by our construction of the critical paths in Π\Pi, the above inequality dj,0>dj,cλi\langle\vec{d}_{j},\vec{0}\rangle>\langle\vec{d}_{j},\vec{c}_{\lambda_{i}}\rangle implies that path π1\pi_{1} takes an edge in EiE_{i} corresponding to vector 0\vec{0}. An identical argument will show that dk,0<dk,cλi\langle\vec{d}_{k},\vec{0}\rangle<\langle\vec{d}_{k},\vec{c}_{\lambda_{i}}\rangle, so path π2\pi_{2} takes an edge in EiE_{i} corresponding to vector cλi\vec{c}_{\lambda_{i}}. Since paths π1\pi_{1} and π2\pi_{2} take different edges in EiE_{i}, they must split at vv.

FF happens with good probability.

Random variable λi\lambda_{i} is sampled uniformly from [1,q1][1,q-1]. Then the event FF occurs with probability |jk|q1|jk|q\frac{|j-k|}{q-1}\geq\frac{|j-k|}{q}. ∎

We will use Lemma 11 to prove the following two lemmas, which capture key properties of our graph GG.

Lemma 11.

Let π1,π2Π\pi_{1},\pi_{2}\in\Pi be critical paths with associated direction vectors dj,dkD\vec{d}_{j},\vec{d}_{k}\in D. Then |π1π2|8qlogn|jk||\pi_{1}\cap\pi_{2}|\leq\frac{8q\log n}{|j-k|} with probability at least 1n81-n^{-8}.

Proof.

If π1π2=\pi_{1}\cap\pi_{2}=\emptyset, then the claim is immediate, so assume there is a node vπ1π2v\in\pi_{1}\cap\pi_{2}. Suppose vLiv\in L_{i}. By Lemma 10, π1\pi_{1} and π2\pi_{2} split at vv with probability at least |jk|q\frac{|j-k|}{q}. Moreover, conditioning on vπ1π2v\in\pi_{1}\cap\pi_{2}, the event that π1\pi_{1} and π2\pi_{2} split at vv given that vπ1v\in\pi_{1} and vπ2v\in\pi_{2} depends only on our choice of λi\lambda_{i} and is independent of λj\lambda_{j} for jij\neq i.

Since π1\pi_{1} and π2\pi_{2} are unique paths in GG, it follows that π1π2\pi_{1}\cap\pi_{2} is a contiguous subpath of π1\pi_{1} and π2\pi_{2}. The number of nodes in the intersection |π1π2||\pi_{1}\cap\pi_{2}| is 11 more than the number of consecutive nodes at which π1,π2\pi_{1},\pi_{2} intersect but do not split. So by the above discussion, we have

Pr[|π1π2|>8qlogn|jk|]\displaystyle\Pr\left[|\pi_{1}\cap\pi_{2}|>\frac{8q\log n}{|j-k|}\right] (1|jk|q)8qlogn|jk|\displaystyle\leq\left(1-\frac{|j-k|}{q}\right)^{\frac{8q\log n}{|j-k|}}
e|jk|q8qlogn|jk|\displaystyle\leq e^{-\frac{|j-k|}{q}\cdot\frac{8q\log n}{|j-k|}}
e8logn\displaystyle\leq e^{-8\log n}
=n8.\displaystyle=n^{-8}.

Since |Π|=pn2|\Pi|=p\leq n^{2}, we can argue by a union bound that Proposition 7 holds and Lemma 4 holds for all π1,π2Π\pi_{1},\pi_{2}\in\Pi simultaneously with probability at least 11/2n4>01-1/2-n^{-4}>0. From now on, we will assume that this property holds for our constructed graph GG.

Once we specify our construction parameters \ell and qq, the following lemma will immediately imply the third property of GG as stated in Lemma 7.

Lemma 12.

Let x,yVx,y\in V be nodes in GG such that the unweighted distance from xx to yy in GG is zz, where 0<z<0<z<\infty. Let ΠΠ\Pi^{\prime}\subseteq\Pi be the following set of critical paths:

Π:={πΠx,yπ}.\Pi^{\prime}:=\{\pi\in\Pi\mid x,y\in\pi\}.

Then |Π|max{16qlognz,1}|\Pi^{\prime}|\leq\max\left\{\frac{16q\log n}{z},1\right\}.

Proof.

Let Π={π1,,πk}\Pi^{\prime}=\{\pi_{1},\dots,\pi_{k}\} and let dσiD\vec{d}_{\sigma_{i}}\in D be the direction vector associated with πi\pi_{i} for i[1,k]i\in[1,k]. By Proposition 4, σiσj\sigma_{i}\neq\sigma_{j} for iji\neq j. Let a=mini[1,k]σia=\min_{i\in[1,k]}\sigma_{i} and let b=maxi[1,k]σib=\max_{i\in[1,k]}\sigma_{i}. Then bak1b-a\geq k-1, so by Lemma 11,

|πaπb|8qlogn|ba|8qlognk1.|\pi_{a}\cap\pi_{b}|\leq\frac{8q\log n}{|b-a|}\leq\frac{8q\log n}{k-1}.

Additionally, since x,yπaπbx,y\in\pi_{a}\cap\pi_{b} and πa\pi_{a} and πb\pi_{b} are unique paths, it follows that πa[x,y]=πb[x,y]\pi_{a}[x,y]=\pi_{b}[x,y]. Moreover, since the unweighted distance from xx to yy in GG is zz, it follows that |πaπb||πa[x,y]|=|πb[x,y]|=z+1|\pi_{a}\cap\pi_{b}|\geq|\pi_{a}[x,y]|=|\pi_{b}[x,y]|=z+1. Then taken together we have

z|πaπb|8qlognk1.z\leq|\pi_{a}\cap\pi_{b}|\leq\frac{8q\log n}{k-1}.

Rearranging, we get that

k8qlognz+1max{16qlognz,1}.k\leq\frac{8q\log n}{z}+1\leq\max\left\{\frac{16q\log n}{z},1\right\}.

(We obtain the final inequality from the following observation: if 8qlognz1\frac{8q\log n}{z}\geq 1, then this implies k16qlognzk\leq\frac{16q\log n}{z}. Otherwise, if 8qlognz<1\frac{8q\log n}{z}<1, then this implies that k1k\leq 1 since kk\in\mathbb{Z}.) ∎

5.6 Finishing the proof of Lemma 7

Let

r=n1/826log3/8n.r=\frac{n^{1/8}}{2^{6}\log^{3/8}n}.

Now recall that q=Θ(r2)q=\Theta(r^{2}), and let α1>0\alpha_{1}>0 be a constant such that α1r2q2α1r2\alpha_{1}r^{2}\leq q\leq 2\alpha_{1}r^{2}. Then

=n1/3r2/3n1/4log1/4n,^=q27r2α127,andq2α1r2^16logn.\ell=\frac{n^{1/3}}{r^{2/3}}\geq n^{1/4}\log^{1/4}n,\qquad\hat{\ell}=\frac{q}{2^{7}r^{2}}\cdot\ell\geq\frac{\alpha_{1}}{2^{7}}\cdot\ell,\qquad\text{and}\qquad q\leq 2\alpha_{1}r^{2}\leq\frac{\hat{\ell}}{16\log n}.

We now quickly verify that graph GG and associated critical paths Π\Pi satisfy the properties of Lemma 7:

  • By construction, GG has ^=Θ(n1/4log1/4n)\hat{\ell}=\Theta\left(n^{1/4}\log^{1/4}n\right) layers, and each path in Π\Pi travels from the first layer to the ^\hat{\ell}th layer.

  • Each path πΠ\pi\in\Pi is a unique path between its endpoints in GG by Lemma 9.

  • Since 16qlogn16q\log n\leq\ell and hopdistG(u,v)\texttt{hopdist}_{G}(u,v)\leq\ell, Lemma 12 immediately implies that for all u,vVu,v\in V, there are at most

    max{16qlognhopdistG(u,v),1}hopdistG(u,v)\max\left\{\frac{16q\log n}{\texttt{hopdist}_{G}(u,v)},1\right\}\leq\frac{\ell}{\texttt{hopdist}_{G}(u,v)}

    paths in Π\Pi that contain both uu and vv.

  • For all critical pairs (s,t)P(s,t)\in P, the unweighted distance from ss to tt in GG is 116qlogn\ell-1\gg 16q\log n. Then by Lemma 12, each of the |Π||\Pi| paths constructed in Section 5.3 have distinct endpoints and thus are distinct. Then

    |Π|=|S||D|n2/3r2/3q25α1n2/3r8/325α1n221logn.|\Pi|=|S||D|\geq\frac{n^{2/3}r^{2/3}q}{2^{5}}\geq\frac{\alpha_{1}n^{2/3}r^{8/3}}{2^{5}}\geq\frac{\alpha_{1}n}{2^{21}\log n}.

    Recall that we initially let p=n/(α0logn)p=n/(\alpha_{0}\log n), for some unspecified constant α0>0\alpha_{0}>0. Choose α0\alpha_{0} so that p|Π|p\leq|\Pi|.

We have shown that our directed graph GG satisfies the properties of Lemma 7 when p=n/(α0logn)p=n/(\alpha_{0}\log n). All that remains is to extend our construction to the regime p[1,n/(α0logn)]p\in[1,n/(\alpha_{0}\log n)].

Extending the construction to p[1,n/(α0logn)]p\in[1,n/(\alpha_{0}\log n)].

We can extend our construction to the regime of p[1,n/(α0logn)]p\in[1,n/(\alpha_{0}\log n)] using the same modification to GG that we performed on our exact hopset construction and that was previously used in the prior work of [29]. We will sketch the modification here. The proof of correctness follows from an argument identical to the proof of Lemma 6 in Appendix A.

We use G(n,p)G(n,p) denote an instance of our originally constructed graph GG with input parameters nn and p=n/(α0logn)p=n/(\alpha_{0}\log n). Let nn be a sufficiently large integer and let p[1,n]p\in[1,n]. Let G1:=G(p1,p1/(α0logp1))G_{1}:=G(p_{1},p_{1}/(\alpha_{0}\log p_{1})) where p=Θ(p1/logp1)p=\Theta(p_{1}/\log p_{1}) and p1p_{1} divides nn. Now for each node vv in G1G_{1}, replace vv with a directed path πv=(u1v,,ukv)\pi_{v}=(u_{1}^{v},\dots,u_{k}^{v}) with k=n/p1k=n/p_{1} nodes. For all vVv\in V, assign weight 0 to all edges in πv\pi_{v}. For each edge (v1,v2)(v_{1},v_{2}) originally in G1G_{1}, add edge (ukv1,u1v2)(u_{k}^{v_{1}},u_{1}^{v_{2}}) to the graph. Let G2G_{2} be the resulting graph, and let Π2\Pi_{2} be the updated set of critical paths. This completes the modification.

References

  • [1] Amir Abboud, Greg Bodwin, and Seth Pettie. A hierarchy of lower bounds for sublinear additive spanners. In Proceedings of the 28th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 568–576. Society for Industrial and Applied Mathematics, 2017.
  • [2] Alexandr Andoni, Clifford Stein, and Peilin Zhong. Parallel approximate undirected shortest paths via low hop emulators. In Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, pages 322–335, 2020.
  • [3] Imre Bárány and David G Larman. The convex hull of the integer points in a large ball. Mathematische Annalen, 312(1):167–181, 1998.
  • [4] Uri Ben-Levy and Merav Parter. New (α\alpha, β\beta) spanners and hopsets. In Proceedings of the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms, pages 1695–1714. SIAM, 2020.
  • [5] Piotr Berman, Sofya Raskhodnikova, and Ge Ruan. Finding sparser directed spanners. In IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science (FSTTCS 2010). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik, 2010.
  • [6] Aaron Bernstein, Maximilian Probst Gutenberg, and Christian Wulff-Nilsen. Near-optimal decremental sssp in dense weighted digraphs. In 2020 IEEE 61st Annual Symposium on Foundations of Computer Science (FOCS), pages 1112–1122. IEEE, 2020.
  • [7] Aaron Bernstein and Nicole Wein. Closing the gap between directed hopsets and shortcut sets. In Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 163–182. SIAM, 2023.
  • [8] Greg Bodwin and Gary Hoppenworth. New additive spanner lower bounds by an unlayered obstacle product. 2022.
  • [9] Greg Bodwin, Gary Hoppenworth, and Ohad Trabelsi. Bridge girth: A unifying notion in network design. arXiv preprint arXiv:2212.11944, 2022.
  • [10] Nairen Cao, Jeremy T Fineman, and Katina Russell. Efficient construction of directed hopsets and parallel approximate shortest paths. In Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing, pages 336–349, 2020.
  • [11] Edith Cohen. Polylog-time and near-linear work approximation scheme for undirected shortest paths. Journal of the ACM (JACM), 47(1):132–166, 2000.
  • [12] Don Coppersmith and Michael Elkin. Sparse sourcewise and pairwise distance preservers. SIAM Journal on Discrete Mathematics, 20(2):463–501, 2006.
  • [13] Michael Elkin, Yuval Gitlitz, and Ofer Neiman. Almost shortest paths with near-additive error in weighted graphs. arXiv preprint arXiv:1907.11422, 2019.
  • [14] Michael Elkin and Ofer Neiman. Linear-size hopsets with small hopbound, and constant-hopbound hopsets in rnc. In The 31st ACM Symposium on Parallelism in Algorithms and Architectures, pages 333–341, 2019.
  • [15] Jeremy T Fineman. Nearly work-efficient parallel algorithm for digraph reachability. SIAM Journal on Computing, 49(5):STOC18–500, 2019.
  • [16] Sebastian Forster and Danupon Nanongkai. A faster distributed single-source shortest paths algorithm. In 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS), pages 686–697. IEEE, 2018.
  • [17] Maximilian Probst Gutenberg and Christian Wulff-Nilsen. Decremental sssp in weighted digraphs: Faster and against an adaptive adversary. In Proceedings of the Fourteenth Annual ACM-SIAM Symposium on Discrete Algorithms, pages 2542–2561. SIAM, 2020.
  • [18] Monika Henzinger, Sebastian Krinninger, and Danupon Nanongkai. Decremental single-source shortest paths on undirected graphs in near-linear total update time. In Foundations of Computer Science (FOCS), 2014 IEEE 55th Annual Symposium on, pages 146–155. IEEE, 2014.
  • [19] Monika Henzinger, Sebastian Krinninger, and Danupon Nanongkai. Sublinear-time decremental algorithms for single-source reachability and shortest paths on directed graphs. In Proceedings of the forty-sixth annual ACM symposium on Theory of computing, pages 674–683, 2014.
  • [20] Monika Henzinger, Sebastian Krinninger, and Danupon Nanongkai. Improved algorithms for decremental single-source reachability on directed graphs. In Automata, Languages, and Programming: 42nd International Colloquium, ICALP 2015, Kyoto, Japan, July 6-10, 2015, Proceedings, Part I 42, pages 725–736. Springer, 2015.
  • [21] William Hesse. Directed graphs requiring large numbers of shortcuts. In Proceedings of the fourteenth annual ACM-SIAM symposium on Discrete algorithms, pages 665–669. Society for Industrial and Applied Mathematics, 2003.
  • [22] Shang-En Huang and Seth Pettie. Lower Bounds on Sparse Spanners, Emulators, and Diameter-reducing shortcuts. In David Eppstein, editor, 16th Scandinavian Symposium and Workshops on Algorithm Theory (SWAT 2018), volume 101 of Leibniz International Proceedings in Informatics (LIPIcs), pages 26:1–26:12, Dagstuhl, Germany, 2018. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik.
  • [23] Shang-En Huang and Seth Pettie. Thorup–zwick emulators are universally optimal hopsets. Information Processing Letters, 142:9–13, 2019.
  • [24] Arun Jambulapati, Yang P Liu, and Aaron Sidford. Parallel reachability in almost linear work and square root depth. In 2019 IEEE 60th Annual Symposium on Foundations of Computer Science (FOCS), pages 1664–1686. IEEE, 2019.
  • [25] Adam Karczmarz and Piotr Sankowski. A deterministic parallel apsp algorithm and its applications. In Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 255–272. SIAM, 2021.
  • [26] Philip N Klein and Sairam Subramanian. A randomized parallel algorithm for single-source shortest paths. Journal of Algorithms, 25(2):205–220, 1997.
  • [27] Shimon Kogan and Merav Parter. Beating Matrix Multiplication for n1/3n^{1/3}-Directed Shortcuts. In Mikołaj Bojańczyk, Emanuela Merelli, and David P. Woodruff, editors, 49th International Colloquium on Automata, Languages, and Programming (ICALP 2022), volume 229 of Leibniz International Proceedings in Informatics (LIPIcs), pages 82:1–82:20, Dagstuhl, Germany, 2022. Schloss Dagstuhl – Leibniz-Zentrum für Informatik.
  • [28] Shimon Kogan and Merav Parter. Having hope in hops: New spanners, preservers and lower bounds for hopsets. In 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS), pages 766–777. IEEE, 2022.
  • [29] Shimon Kogan and Merav Parter. New diameter-reducing shortcuts and directed hopsets: Breaking the barrier. In Proceedings of the 2022 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1326–1341. SIAM, 2022.
  • [30] Kevin Lu, Virginia Vassilevska Williams, Nicole Wein, and Zixuan Xu. Better lower bounds for shortcut sets and additive spanners via an improved alternation product. In Proceedings of the 2022 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 3311–3331. SIAM, 2022.
  • [31] Ofer Neiman and Idan Shabat. A unified framework for hopsets. In 30th Annual European Symposium on Algorithms (ESA 2022). Schloss Dagstuhl-Leibniz-Zentrum für Informatik, 2022.
  • [32] Hanmao Shi and Thomas H Spencer. Time–work tradeoffs of the single-source shortest paths problem. Journal of algorithms, 30(1):19–32, 1999.
  • [33] Mikkel Thorup. On shortcutting digraphs. In International Workshop on Graph-Theoretic Concepts in Computer Science, pages 205–211. Springer, 1992.
  • [34] Jeffrey D Ullman and Mihalis Yannakakis. High-probability parallel transitive-closure algorithms. SIAM Journal on Computing, 20(1):100–125, 1991.

Appendix A Proof of Lemma 6

Let nn be a sufficiently large integer, and let p[1,n]p\in[1,n] such that pp divides nn. Let p1:=230pp_{1}:=2^{30}p. Recall that to obtain G2G_{2}, we first construct graph G1:=G(p1,p)G_{1}:=G(p_{1},p), where G(p1,p)G(p_{1},p) denotes our initial construction of the graph G1G_{1} in Lemma 1 on p1p_{1} nodes with an associated set Π1\Pi_{1} of |Π1|=p|\Pi_{1}|=p paths. We let V1V_{1} denote the nodes and E1E_{1} denote the edges of G1G_{1}. Let P1P_{1} be the set of critical pairs associated with G1G_{1}, and let Π1\Pi_{1} be the corresponding canonical paths. Let \ell and qq be the construction parameters used to construct G(p1,p1)G(p_{1},p_{1}). Then

26p|P1|210p, =p1210p1/2log1/2p1, and q=p1220p1/2log3/2p1.2^{6}p\leq|P_{1}|\leq 2^{10}p,\text{ \qquad}\ell=\frac{p_{1}}{2^{10}p^{1/2}\log^{1/2}p_{1}},\text{ \qquad and \qquad}q=\frac{p_{1}}{2^{20}p^{1/2}\log^{3/2}p_{1}}.

Note that for all πΠ1\pi\in\Pi_{1}, we have that |π|=|\pi|=\ell. We then modified G1G_{1} by replacing each node vV1v\in V_{1} with a path πv=(u1v,,ukv)\pi_{v}=(u_{1}^{v},\dots,u_{k}^{v}) with k=n/p1k=n/p_{1} nodes. If an edge (v1,v2)E1(v_{1},v_{2})\in E_{1} was originally in G1G_{1}, we replaced it with an edge (ukv1,u1v2)(u_{k}^{v_{1}},u_{1}^{v_{2}}). This gave us our final nn-node graph G2=(V2,E2)G_{2}=(V_{2},E_{2}).

Let P2:={(u1s,ukt)(s,t)P1}P_{2}:=\{(u_{1}^{s},u_{k}^{t})\mid(s,t)\in P_{1}\}, and for all (u1s,ukt)P2(u_{1}^{s},u_{k}^{t})\in P_{2}, let πu1s,ukt\pi_{u_{1}^{s},u_{k}^{t}} be the path obtained by taking πs,tΠ1\pi_{s,t}\in\Pi_{1} and replacing each node vπs,tv\in\pi_{s,t} with the path πv\pi_{v}. Let Π2:={πs,t(s,t)P2}\Pi_{2}:=\{\pi_{s,t}\mid(s,t)\in P_{2}\}. Then it is clear from our construction of G2G_{2} that for all (s,t)P2(s,t)\in P_{2}, the path πs,tΠ2\pi_{s,t}\in\Pi_{2} is a unique shortest sts\leadsto t-path in G2G_{2}. Additionally, for all (s,t)P2(s,t)\in P_{2}, the number of nodes in path πs,tΠ2\pi_{s,t}\in\Pi_{2} is at least |πs,t|2|\pi_{s,t}|\geq\ell_{2}, where

2:=np1=n210p1/2log1/2p1=Θ(np1/2log1/2p).\ell_{2}:=\ell\cdot\frac{n}{p_{1}}=\frac{n}{2^{10}p^{1/2}\log^{1/2}p_{1}}=\Theta\left(\frac{n}{p^{1/2}\log^{1/2}p}\right).

We now quickly verify that graph G2G_{2} and associated critical paths Π2\Pi_{2} satisfy the properties of Lemma 1:

  • By construction, G2G_{2} has 2=Θ(np1/2log1/2n)\ell_{2}=\Theta\left(\frac{n}{p^{1/2}\log^{1/2}n}\right) layers, and each path in Π\Pi travels from the first layer to the last layer.

  • Each path πΠ\pi\in\Pi is a unique shortest path between its endpoints in G2G_{2}. This follows from Lemma 2 and the observation that our path replacement step cannot increase the number of paths between pairs of nodes in G2G_{2}.

  • What remains is to show that for any two nodes u,vV2u,v\in V_{2}, there are at most 2hopdistG2(u,v)\frac{\ell_{2}}{\texttt{hopdist}_{G_{2}}(u,v)} paths in Π2\Pi_{2} that contain both uu and vv. We prove this in two cases:

    • Case 1: u,vπwu,v\in\pi_{w} for some wV1w\in V_{1}. Note that hopdistG(u,v)n/p1\texttt{hopdist}_{G}(u,v)\leq n/p_{1}. By Proposition 4, the number of paths πΠ1\pi\in\Pi_{1} such that wπw\in\pi is at most the number of direction vectors

      |D|\displaystyle|D| p1q42p1/24log1/2p1p1/24log1/2p1n/p1hopdistG2(u,v)\displaystyle\leq\frac{p_{1}q}{4\ell^{2}}\leq\frac{p^{1/2}}{4\log^{1/2}p_{1}}\leq\frac{p^{1/2}}{4\log^{1/2}p_{1}}\cdot\frac{n/p_{1}}{\texttt{hopdist}_{G_{2}}(u,v)}
      n230p1/2log1/2p1hopdistG2(u,v)2hopdistG2(u,v).\displaystyle\leq\frac{n}{2^{30}p^{1/2}\log^{1/2}p}\cdot\frac{1}{\texttt{hopdist}_{G_{2}}(u,v)}\leq\frac{\ell_{2}}{\texttt{hopdist}_{G_{2}}(u,v)}.

      Then by our construction of G2G_{2}, the number of paths πΠ2\pi\in\Pi_{2} such that u,vπu,v\in\pi is at most |D|2hopdistG2(u,v)|D|\leq\frac{\ell_{2}}{\texttt{hopdist}_{G_{2}}(u,v)}.

    • Case 2: uπw1u\in\pi_{w_{1}} and vπw2v\in\pi_{w_{2}} for distinct w1,w2V1w_{1},w_{2}\in V_{1}. By Lemma 5, the number of paths πΠ1\pi\in\Pi_{1} such that w1,w2πw_{1},w_{2}\in\pi is at most

      max{16qlognhopdistG1(w1,w2),1}\displaystyle\max\left\{\frac{16q\log n}{\texttt{hopdist}_{G_{1}}(w_{1},w_{2})},1\right\} max{hopdistG1(w1,w2),1}max{2hopdistG2(u,v),1}\displaystyle\leq\max\left\{\frac{\ell}{\texttt{hopdist}_{G_{1}}(w_{1},w_{2})},1\right\}\leq\max\left\{\frac{\ell_{2}}{\texttt{hopdist}_{G_{2}}(u,v)},1\right\}
      2hopdistG2(u,v).\displaystyle\leq\frac{\ell_{2}}{\texttt{hopdist}_{G_{2}}(u,v)}.

      (The final inequality follows from the fact that hopdistG2(u,v)2\texttt{hopdist}_{G_{2}}(u,v)\leq\ell_{2}.)

Appendix B Extending our shortcut set lower bound

To extend our shortcut set lower bound so that it holds in the regime of p>n/(αlogn)p>n/(\alpha\log n), we will prove a more general statement about the behavior of the extremal function of shortcut sets. We write S(n,p)\texttt{S}(n,p) for the smallest integer DD such that every nn-node graph GG has a shortcut set HH of size |H|p|H|\leq p such that GHG\cup H has diameter at most DD.

Lemma 13.

For all positive integers nn and x[1,n]x\in[1,n],

S(n,p/x)xS(n,p)16logn.\texttt{S}\left(n,\hskip 2.84526ptp/x\right)\leq x\cdot\texttt{S}(n,\hskip 2.84526ptp)\cdot 16\log n.

Lemma 13 essentially states that if we decrease the number of shortcuts allowed in our shortcut set, the extremal function controlling the worst-case size of shortcut sets won’t increase by too much. We will use this lemma in the opposite direction to argue that our lower bound of S(n,n/(αlogn))=Ω(n1/4log1/4n)\texttt{S}(n,n/(\alpha\log n))=\Omega\left(\frac{n^{1/4}}{\log^{1/4}n}\right) (where α>0\alpha>0 is a sufficiently large constant) that we obtained from our shortcut set construction in Lemma 7 implies lower bounds for shortcut sets with greater than n/(αlogn)n/(\alpha\log n) shortcuts.

Let p[nαlogn,n5/4]p\in\left[\frac{n}{\alpha\log n},n^{5/4}\right], and let x=pn/(αlogn)x=\frac{p}{n/(\alpha\log n)}. Then by applying our lower bound from Lemma 7 to Lemma 13 we find that

Ω(n1/4log1/4n)S(n,nαlogn)=S(n,px)xS(n,p)16logn.\Omega\left(\frac{n^{1/4}}{\log^{1/4}n}\right)\leq\texttt{S}\left(n,\frac{n}{\alpha\log n}\right)=\texttt{S}\left(n,\frac{p}{x}\right)\leq x\cdot\texttt{S}(n,p)\cdot 16\log n.

Rearranging, we find that

S(n,p)=Ω(n5/4plog9/4n),\texttt{S}(n,p)=\Omega\left(\frac{n^{5/4}}{p\log^{9/4}n}\right),

as claimed in Theorem 8.

We now prove Lemma 13, which will follow from a simple path subsampling argument.

Proof of Lemma 13.

Let nn be a positive integer, p[1,n2]p\in[1,n^{2}], and x[1,n]x\in[1,n]. Let G=(V,E)G=(V,E) be a graph on nn nodes. We subsample nodes of GG to construct a smaller graph G1=(V1,E1)G_{1}=(V_{1},E_{1}) as follows.

  • Independently sample each node VV into set V1V_{1} with probability 12x\frac{1}{2x}. Then 𝔼[|V1|]=n2x\mathbb{E}[|V_{1}|]=\frac{n}{2x}.

  • For all pairs of nodes u,vVu,v\in V such that distG(u,v)8xlogn\text{dist}_{G}(u,v)\leq 8x\log n, add directed edge (u,v)(u,v) to E1E_{1}. This completes the construction of G1=(V1,E1)G_{1}=(V_{1},E_{1}).

By Markov’s inequality, |V1|n/x|V_{1}|\leq n/x with probability at least 12\frac{1}{2}. Assume for now that this does indeed hold, and we have |V1|n/x|V_{1}|\leq n/x. Then using the fact that S(c1n,c1p)S(c2n,c2p)\texttt{S}(c_{1}n,c_{1}p)\leq\texttt{S}(c_{2}n,c_{2}p) if c1c2c_{1}\leq c_{2}, we find that there exists a shortcut set H1H_{1} of size |H1|p/x|H_{1}|\leq p/x such that the diameter of G1H1G_{1}\cup H_{1} is at most

diameter(G1H1)S(|V1|,|H1|)S(n/x,p/x)S(n,p).\texttt{diameter}(G_{1}\cup H_{1})\leq\texttt{S}(|V_{1}|,|H_{1}|)\leq\texttt{S}(n/x,p/x)\leq\texttt{S}(n,p).

Now we claim that this implies that diameter(GH1)xS(n,p)16logn\texttt{diameter}(G\cup H_{1})\leq x\cdot\texttt{S}(n,p)\cdot 16\log n. For every pair of nodes u,vVu,v\in V such that vv is reachable from uu in GG, fix a shortest uvu\leadsto v-path πu,v\pi_{u,v} in GG. Then the following statement holds with high probability: for all pairs of nodes u,vVu,v\in V such that distG(u,v)4xlogn\text{dist}_{G}(u,v)\geq 4x\log n, path πu,v\pi_{u,v} contains a node that was sampled into V2V_{2}, i.e. πu,vV2\pi_{u,v}\cap V_{2}\neq\emptyset. From now on, we will assume this property holds for our sampled set V2V_{2}.

Consider a pair of nodes u,vVu,v\in V such that 4xlogndistG(u,v)4x\log n\leq\text{dist}_{G}(u,v) and vv is reachable from uu in GG, and let π\pi be an uvu\leadsto v-path in GG. Let uu^{\prime} be be the node in πV1\pi\cap V_{1} closest to uu, and let vv^{\prime} be the node in πV1\pi\cap V_{1} closest to vv. By our construction of G1G_{1} and the above discussion, there must be a uvu^{\prime}\leadsto v^{\prime}-path in G1G_{1}. Moreover, distG1H1(u,v)diameter(G1H1)S(n,p)\text{dist}_{G_{1}\cup H_{1}}(u^{\prime},v^{\prime})\leq\texttt{diameter}(G_{1}\cup H_{1})\leq\texttt{S}(n,p). Then since distG(x,y)8xlogn\text{dist}_{G}(x,y)\leq 8x\log n for all edges (x,y)E1(x,y)\in E_{1} in G1G_{1}, it follows that

distGH1(u,v)distG1H1(u,v)8xlognS(n,p)8xlogn.\text{dist}_{G\cup H_{1}}(u^{\prime},v^{\prime})\leq\text{dist}_{G_{1}\cup H_{1}}(u^{\prime},v^{\prime})\cdot 8x\log n\leq\texttt{S}(n,p)\cdot 8x\log n.

Then putting it all together,

distGH1(u,v)\displaystyle\text{dist}_{G\cup H_{1}}(u,v) distGH1(u,u)+distGH1(u,v)+distGH1(v,v)\displaystyle\leq\text{dist}_{G\cup H_{1}}(u,u^{\prime})+\text{dist}_{G\cup H_{1}}(u^{\prime},v^{\prime})+\text{dist}_{G\cup H_{1}}(v^{\prime},v)
4xlogn+distGH1(u,v)+4xlogn\displaystyle\leq 4x\log n+\text{dist}_{G\cup H_{1}}(u^{\prime},v^{\prime})+4x\log n
distG1H1(u,v)8xlogn+8xlogn\displaystyle\leq\text{dist}_{G_{1}\cup H_{1}}(u^{\prime},v^{\prime})\cdot 8x\log n+8x\log n
S(n,p)8xlogn+8xlogn\displaystyle\leq\texttt{S}(n,p)\cdot 8x\log n+8x\log n
xS(n,p)16logn.\displaystyle\leq x\cdot\texttt{S}(n,p)\cdot 16\log n.

Finally, to conclude the analysis, we made two assumptions: (1) that |V1|n/x|V_{1}|\leq n/x, which occurs with probability 1/21/2, and (2) that for all u,vu,v that are sufficiently far apart, we sampled a node on a uvu\leadsto v path, which happens with high probability, i.e., 11/poly(n)1-1/\text{poly}(n). By an intersection bound, there is positive probability that both events happen at the same time. So a graph G1G_{1} exists as described, completing the proof. ∎