This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

LP-Based Approximation Algorithms for Traveling Salesman Path Problems

Hyung-Chan An anhc@cs.cornell.edu. Dept. of Computer Science, Cornell University, Ithaca, NY 14853. Research supported in part by NSF under grants no. CCF-1017688 and CCF-0729102, and the Korea Foundation for Advanced Studies. Part of this research was conducted while the author was a visiting student at CSAIL, MIT.    David B. Shmoys shmoys@cs.cornell.edu. School of ORIE and Dept. of Computer Science, Cornell University, Ithaca, NY 14853. Research supported in part by NSF under grants no. CCF-0832782 and CCF-1017688. Part of this research was conducted while the author was a visiting professor at Sloan School of Management, MIT.
Abstract

We present a (53ϵ)(\frac{5}{3}-\epsilon)-approximation algorithm for some constant ϵ>0\epsilon>0 for the traveling salesman path problem under the unit-weight graphical metric, and prove an upper bound on the integrality gap of the path-variant Held-Karp relaxation both under this metric and the general metric. Given a complete graph with the metric cost and two designated endpoints in the graph, the traveling salesman path problem is to find a minimum Hamiltonian path between these two endpoints. The best previously known performance guarantee for this problem was 5/35/3 and was due to Hoogeveen. We give the first constant upper bound on the integrality gap of the path-variant Held-Karp relaxation, showing it to be at most 5/35/3 by providing a new analysis of Hoogeveen’s algorithm. This analysis exhibits a well-characterized critical case, and we show that the recent result of Oveis Gharan, Saberi and Singh on the traveling salesman circuit problem under the unit-weight graphical metric can be modified for the path case to complement Hoogeveen’s algorithm in the critical case, providing an improved performance guarantee of (53ϵ)(\frac{5}{3}-\epsilon). This also proves the matching integrality gap upper bound of (53ϵ)(\frac{5}{3}-\epsilon).

1 Introduction

The traveling salesman problem (TSP) has been widely studied in a variety of settings [10, 5, 13, 7, 17, 14, 12, 4, 2, 1], but we shall focus on the approximation results for the traveling salesman path problem with two prespecified endpoints. In this variant, two vertices in the graph are specified as a part of the input, and we need to find a Hamiltonian path (in contrast to Hamiltonian circuit found in other variants) between these two vertices. We will assume a symmetric metric cost: costs are defined over the complete graph; the costs between two vertices in opposite directions do not differ; and the costs satisfy the triangle inequality. The triangle inequality can be assumed without loss of generality by allowing vertices to be visited more than once; furthermore, it is directly satisfied by many cost functions of interest. This assumption is necessary, in that without the triangle inequality the problem cannot be approximated within any polynomial approximation factor unless P=NP\mathrm{P}=\mathrm{NP}.

For the traveling salesman circuit problem, a 3/23/2-approximation algorithm due to Christofides [5] attains the best performance guarantee known; the matching upper bound on the integrality gap of the Held-Karp relaxation [10], a standard LP relaxation of the problem, is obtained by an analysis of this algorithm [19, 18]. In contrast, for the traveling salesman path problem with two prespecified endpoints, the best approximation ratio known is only 5/35/3, as achieved by Hoogeveen’s algorithm [11]; no upper bound on the integrality gap of the (accordingly modified) Held-Karp relaxation is known. Recently, Oveis Gharan, Saberi and Singh [16] studied a special case of the traveling salesman circuit problem where the cost function is a shortest-path metric defined by an unweighted undirected graph, and gave a (32ϵ0)(\frac{3}{2}-\epsilon_{0})-approximation algorithm; its analysis constructively proves that the Held-Karp relaxation under this special case has an integrality gap of at most 32ϵ0\frac{3}{2}-\epsilon_{0}. In this paper, we first present an LP-based new analysis of Hoogeveen’s algorithm that establishes an upper bound of 5/35/3 on the integrality gap of the path-variant Held-Karp relaxation under the general metric. We then modify Oveis Gharan, Saberi and Singh’s algorithm in order to apply it to the path problem; our new analysis of Hoogeveen’s algorithm reveals that the combination of these two algorithms yields a (53ϵ)(\frac{5}{3}-\epsilon)-approximation algorithm for some constant ϵ>0\epsilon>0. This analysis would also constructively prove the matching integrality gap upper bound. We note that recent independent work due to Mömke and Svensson [15] yields a better performance guarantee for the same special case of both the circuit- and path-variants, but shows the matching integrality gap upper bound only for the circuit-variant Held-Karp relaxation; it instead shows for the path-variant case that, for any ϵ>0\epsilon^{\prime}>0, there exists τ(ϵ)\tau(\epsilon^{\prime}) that depends on ϵ\epsilon^{\prime} such that every graph with more than τ(ϵ)\tau(\epsilon^{\prime}) vertices has an integral optimal solution of value no more than 32+ϵ3-\sqrt{2}+\epsilon^{\prime} times the Held-Karp optimum.

Proposed by Held and Karp [10] originally for the symmetric traveling salesman circuit problem, the Held-Karp relaxation is a linear program relaxation of the traveling salesman polytope. It consists of the degree-2 constraints, and the edge connectivity constraints stating that the graph needs to be 2-edge-connected; the asymmetric Held-Karp relaxation is similarly formulated. These relaxations have proven to be useful in designing approximation algorithms in many settings (see, e.g., [2, 1, 16, 15]). In the LP-based design of an approximation algorithm, one important measure of the strength of a particular LP relaxation is its integrality gap, i.e., the worst-case ratio between the integral and fractional optimal values. There is a significant gap between currently known lower and upper bounds on the integrality gap of the Held-Karp relaxation. For the asymmetric circuit case, the best lower bound known, due to Charikar, Goemans and Karloff [3], is 22, whereas the upper bound, constructively proven by the best known approximation algorithm of Asadpour, Goemans, Mądry, Oveis Gharan and Saberi [2], is O(logn/loglogn)O(\log n/\log\log n). For the symmetric circuit case, the best lower bound known is 4/34/3, achieved by the family of graphs depicted in Figure 1(a) under the shortest-path metric [8]; yet, the best upper bound known, also constructively proven [19, 18] based on Christofides’ algorithm [5], is 3/23/2. Christofides’ algorithm does not directly compute the LP optimum, but the analyses due to Wolsey [19] and Shmoys & Williamson [18] establish that the output solution value can be bounded using the LP optimum.

Refer to caption
Figure 1: Examples establishing the integrality gap lower bounds for the circuit- and path-variant Held-Karp relaxations

For the path problem, Hoogeveen [11] presents Christofides-like approximation algorithms for three cases, depending on the number of prespecified endpoints. When both endpoints are prespecified, Hoogeveen [11] gives a 5/35/3-approximation algorithm. However, the analysis compares the output solution value to the optimal (integral) solution, and therefore it is unclear whether the algorithm yields an integrality gap bound for the Held-Karp relaxation analogously formulated for the path problem. We observe that the family of graphs in Figure 1(b) demonstrates the integrality gap lower bound of 3/23/2. Note that this lower bound is strictly greater than the known upper bound on the integrality gap of the circuit-variant Held-Karp relaxation under the shortest-path metric defined by an unweighted undirected graph; this suggests that the lack of a performance guarantee known matching the 3/23/2 for all other TSP variants has a true structural cause.

We present in this paper a new analysis of Hoogeveen’s algorithm, bounding the output solution value in terms of the LP optimum. In particular, we produce two different bounds that both depend on the cost of the direct edge between the two endpoints but with opposite signs; taking the minimum of these two bounds will establish the integrality gap upper bound of 5/35/3.

These two bounds coincide when the edge cost is exactly 1/31/3 times the LP optimum; this in fact is the critical case that determines the integrality gap bound proven, and the analysis provides a better ratio on all other cases. Thus, the question whether this critical case can be improved, possibly by using a different algorithm only in the critical case or formulating a third bound that does not coincide with the other two at the critical point, could naturally follow. We will illustrate the former approach in a special case where the cost function is a shortest-path metric defined by an unweighted undirected graph: we will show how Oveis Gharan, Saberi and Singh’s algorithm can be modified for the path case and prove that its performance guarantee near the critical point is slightly better than Hoogeveen’s algorithm. This analysis would also serve as a constructive integrality gap proof for the special case.

Section 2 of this paper introduces some definitions and notation; Section 3 presents the new analysis yielding a constructive integrality gap proof for the general metric. Section 4 gives the improved approximation algorithm for the special case of the shortest-path metric defined by an unweighted undirected graph, proving the matching integrality gap upper bound.

2 Preliminaries

In this section, we introduce some definitions and notation to be used throughout this paper.

Let G=(V,E)G=(V,E) be the input complete graph with cost function c:E+c:E\to\mathbb{R}_{+}. Let ss and tt be the two prespecificed endpoints; we call the other vertices internal points. For UVU\subset V, δ(U)\delta(U) denotes the set of cut edges: i.e., δ(U)={{u,v}E:|{u,v}U|=1}\delta(U)=\left\{\{u,v\}\in E:|\{u,v\}\cap U|=1\right\}. Let G(U)G(U) denote the subgraph induced by UVU\subset V.

For FEF\subset E and xEx\in\mathbb{R}^{E}, x(F)x(F) is a shorthand for fFxf\sum_{f\in F}x_{f}; c(x)c(x) is for eExec(e)\sum_{e\in E}x_{e}c(e).

Definition 1 (Held and Karp [10]).

The circuit-variant Held-Karp relaxation is the following:

minimizec(x)subject tox(δ(S))2,SV,S;x(δ({v}))=2,vV;x0.\begin{array}[]{lll}\textrm{minimize}&c(x)&\\ \textrm{subject to}&x(\delta(S))\geq 2,&\forall S\subsetneq V,S\neq\emptyset;\\ &x(\delta(\{v\}))=2,&\forall v\in V;\\ &x\geq 0.&\end{array} (1)
Definition 2.

The path-variant Held-Karp relaxation is analogously defined as follows:

minimizec(x)subject tox(δ(S))1,SV,|{s,t}S|=1;x(δ(S))2,SV,|{s,t}S|1,S;x(δ({s}))=x(δ({t}))=1;x(δ({v}))=2,vV{s,t};x0.\begin{array}[]{lll}\textrm{minimize}&c(x)&\\ \textrm{subject to}&x(\delta(S))\geq 1,&\forall S\subsetneq V,|\{s,t\}\cap S|=1;\\ &x(\delta(S))\geq 2,&\forall S\subsetneq V,|\{s,t\}\cap S|\neq 1,S\neq\emptyset;\\ &x(\delta(\{s\}))=x(\delta(\{t\}))=1;&\\ &x(\delta(\{v\}))=2,&\forall v\in V\setminus\{s,t\};\\ &x\geq 0.&\end{array} (2)

Both linear programs can be solved in polynomial time via the ellipsoid method using a min-cut algorithm to solve the separation problem [9]. The following observation gives an equivalent formulation of (2).

Observation 1.

Following is an equivalent formulation of (2):

minimizec(x)subject tox(E(S))|S|1,SV,{s,t}S,S;x(E(S))|S|2,SV,{s,t}S;x(δ({s}))=x(δ({t}))=1;x(δ({v}))=2,vV{s,t};x0.\begin{array}[]{lll}\textrm{minimize}&c(x)&\\ \textrm{subject to}&x(E(S))\leq|S|-1,&\forall S\subsetneq V,\{s,t\}\not\subseteq S,S\neq\emptyset;\\ &x(E(S))\leq|S|-2,&\forall S\subsetneq V,\{s,t\}\subseteq S;\\ &x(\delta(\{s\}))=x(\delta(\{t\}))=1;&\\ &x(\delta(\{v\}))=2,&\forall v\in V\setminus\{s,t\};\\ &x\geq 0.&\end{array} (3)
Definition 3.

For TVT\subset V and JEJ\subset E, JJ is a TT-join if the set of odd-degree vertices in G=(V,J)G^{\prime}=(V,J) is TT.

The following theorem gives a nice polyhedral representation of TT-joins.

Theorem 1 (Edmonds and Johnson [6]).

Let PT(G)P_{T}(G) be the convex hull of the incidence vectors of the TT-joins on G=(V,E)G=(V,E). PT(G)++EP_{T}(G)+\mathbb{R}_{+}^{E} can be exactly characterized by

{y(δ+(S))1,SV,|ST| odd;y+E.\begin{cases}y(\delta^{+}(S))\geq 1,&\forall S\subset V,|S\cap T|\textnormal{ odd};\\ y\in\mathbb{R}_{+}^{E}.&\end{cases} (4)

Following are from Oveis Gharan, Saberi and Singh [16]. In these, the parameters ϵ1,ϵ2,γ,δ\epsilon_{1},\epsilon_{2},\gamma,\delta and ρ\rho can be chosen as follows: ϵ1=1.8751012\epsilon_{1}=1.875\cdot 10^{-12}, ϵ2=5102\epsilon_{2}=5\cdot 10^{-2}, γ=107\gamma=10^{-7}, δ=6.251016\delta=6.25\cdot 10^{-16}, ρ=1.51024\rho=1.5\cdot 10^{-24}, and nn denotes |V||V|.

Definition 4 (Nearly integral edges).

An edge ee is nearly integral with respect to xEx\in\mathbb{R}^{E} if xe1γx_{e}\geq 1-\gamma.

Definition 5.

For some constant ν15\nu\leq\frac{1}{5} and k2k\geq 2, a maximum entropy distribution over spanning trees with approximate marginal xEx\in\mathbb{R}^{E} is a probability distribution μ\mu defined by λE\lambda\in\mathbb{R}^{E} such that μ(T)eTλe\mu(T)\propto\prod_{e\in T}\lambda_{e} for every spanning tree TT and the marginal probability of every edge ee is no greater than (1+νnk)xe(1+\frac{\nu}{n^{k}})x_{e}.

Definition 6 (Good edges).

A cut is (1+δ)(1+\delta)-near-minimum if its weight is at most (1+δ)(1+\delta) times the minimum cut weight. An edge ee is even with respect to FEF\subset E if every (1+δ)(1+\delta)-near-minimum cut containing ee has even number of edges intersecting with FF.

For a circuit-variant Held-Karp feasible solution xcircuitx^{*}_{\mathrm{circuit}}, consider xcircuitx^{*}_{\mathrm{circuit}} as the edge weight and let FF be a spanning tree sampled from a maximum entropy distribution with approximate marginal (11n)xcircuit(1-\frac{1}{n})x^{*}_{\mathrm{circuit}}. We say an edge ee is good with respect to xcircuitx^{*}_{\mathrm{circuit}} if the probability that ee is even with respect to FF is at least ρ\rho.

Theorem 2 (Structure Theorem).

Let xcircuitx^{*}_{\mathrm{circuit}} be a feasible solution to the circuit-variant Held-Karp relaxation, and let μ\mu be a maximum entropy distribution over spanning trees with approximate marginal (11n)xcircuit(1-\frac{1}{n})x^{*}_{\mathrm{circuit}}. There exist small constants ϵ1,ϵ2>0\epsilon_{1},\epsilon_{2}>0 such that at least one of the following is true:

  • 1.

    there exists a set EEE^{*}\subset E such that x(E)ϵ1nx(E^{*})\geq\epsilon_{1}n and every edge in EE^{*} is good with respect to xcircuitx^{*}_{\mathrm{circuit}};

  • 2.

    there exist at least (1ϵ2)n(1-\epsilon_{2})n edges that are nearly integral with respect to xcircuitx^{*}_{\mathrm{circuit}}.

Lemma 1.

Suppose that Case 1 of Theorem 2 holds and FF is sampled from μ\mu. Let TT be the set of odd-degree vertices in FF, then a minimum TT-join JJ satisfies

E[c(J)]c(xcircuit)(12ϵ1δρ4(1+δ)).\mathrm{E}[c(J)]\leq c(x^{*}_{\mathrm{circuit}})(\frac{1}{2}-\frac{\epsilon_{1}\delta\rho}{4(1+\delta)}).

3 An LP-based new analysis of Hoogeveen’s 5/35/3-approximation algorithm

In this section, we present a new analysis of Hoogeveen’s algorithm for the traveling salesman path problem with two prespecified endpoints. This analysis compares the output solution value to the LP optimum of the path-variant Held-Karp relaxation, although the LP optimum is never computed by the algorithm.

Hoogeveen’s algorithm, when both endpoints are prespecified, is shown in Algorithm 1.

Algorithm 1 Hoogeveen’s algorithm [11]
0  Complete graph G=(V,E)G=(V,E) with cost function c:E+c:E\to\mathbb{R}_{+}; endpoints s,tVs,t\in V.
0  Hamiltonian path between ss and tt
1  Find a minimum spanning tree HH of GG.
2  Let TVT\subset V be the set of the vertices with the ‘wrong’ parity of degree in HH:i.e., TT is the set of odd-degree internal points and even-degree endpoints in HH.
3  Find a minimum perfect matching MM on G(T)G(T).
4  Shortcut an Eulerian path of the multigraph HMH\cup M to obtain a Hamiltonian path; output it.

Let xEx^{*}\in\mathbb{R}^{E} be the LP optimum of the path-variant Held-Karp relaxation, and 𝖠𝖫𝖦\mathsf{ALG} be the output of the Algorithm.

Lemma 2.

c(H)c(x)c(H)\leq c(x^{*}).

Proof.

As can be seen from (3), the path-variant Held-Karp polytope is contained in the spanning tree polytope. The lemma follows from this observation, since HH is a minimum spanning tree. ∎

Lemmas 3 and 4 give two different bounds on the cost of MM.

Lemma 3.

c(M)12{c(x)+c(s,t)}c(M)\leq\frac{1}{2}\left\{c(x^{*})+c(s,t)\right\}.

Proof.

Let xcircuit:=x+𝐞(s,t)x^{*}_{\mathrm{circuit}}:=x^{*}+\mathbf{e}_{(s,t)}: i.e., xcircuitx^{*}_{\mathrm{circuit}} is obtained by ‘adding’ the edge (s,t)(s,t) to xx^{*}. Then xcircuitx^{*}_{\mathrm{circuit}} is a feasible solution to the circuit-variant Held-Karp relaxation (see (1) and (2)). Let HKcircuit\mathrm{HK_{circuit}} be the optimal value of the circuit-variant Held-Karp relaxation and we have

c(M)\displaystyle c(M) \displaystyle\leq 12HKcircuit\displaystyle\frac{1}{2}\mathrm{HK_{circuit}}
\displaystyle\leq 12c(xcircuit)\displaystyle\frac{1}{2}c(x^{*}_{\mathrm{circuit}})
=\displaystyle= 12{c(x)+c(s,t)},\displaystyle\frac{1}{2}\left\{c(x^{*})+c(s,t)\right\},

where the first inequality follows from [19, 18]. ∎

Lemma 4.

c(M)c(x)c(s,t)c(M)\leq c(x^{*})-c(s,t).

Proof.

Let PstHP_{st}^{H} be the path between ss and tt on HH. Consider an edge set M:=HPstHM^{\prime}:=H\setminus P_{st}^{H}. Note that MM^{\prime} is a TT-join: vVv\in V has even degree in PstHP_{st}^{H} if and only if vv is internal; thus, vv has even degree in the multigraph HM=(HH)PstHH\cup M^{\prime}=(H\cup H)\setminus P_{st}^{H} if and only if vv is an internal point, and this shows that vv has odd degree in MM^{\prime} if and only if vTv\in T.

We have

c(M)\displaystyle c(M) \displaystyle\leq c(M)\displaystyle c(M^{\prime})
=\displaystyle= c(H)c(PstH)\displaystyle c(H)-c(P_{st}^{H})
\displaystyle\leq c(x)c(s,t).\displaystyle c(x^{*})-c(s,t).

The last inequality follows from Lemma 2 and the triangle inequality. ∎

Theorem 3.

c(𝖠𝖫𝖦)53c(x)c(\mathsf{ALG})\leq\frac{5}{3}c(x^{*}); therefore, Algorithm 1 is a 5/35/3-approximation algorithm, and the integrality gap of the path-variant Held-Karp relaxation is at most 5/35/3.

Proof.

We have

c(𝖠𝖫𝖦)\displaystyle c(\mathsf{ALG}) \displaystyle\leq c(H)+c(M)\displaystyle c(H)+c(M) (5)
\displaystyle\leq c(x)+min[12{c(x)+c(s,t)},c(x)c(s,t)]\displaystyle c(x^{*})+\min\left[\frac{1}{2}\left\{c(x^{*})+c(s,t)\right\},c(x^{*})-c(s,t)\right]
=\displaystyle= 53c(x)+min[12{13c(x)+c(s,t)},13c(x)c(s,t)]\displaystyle\frac{5}{3}c(x^{*})+\min\left[\frac{1}{2}\left\{-\frac{1}{3}c(x^{*})+c(s,t)\right\},\frac{1}{3}c(x^{*})-c(s,t)\right]
\displaystyle\leq 53c(x),\displaystyle\frac{5}{3}c(x^{*}),

where the second inequality follows from Lemmas 2, 3 and 4. ∎

Open Question.

We can observe that the equality of (5) is achieved when c(s,t)=13c(x)c(s,t)=\frac{1}{3}c(x^{*}), and this is the critical case of this analysis that determines the integrality gap bound proven. Hence, if we can improve the performance guarantee near this critical case, such an improvement would lead to a better guarantee, and possibly a tighter integrality gap bound. For example, even if an algorithm does not outperform Hoogeveen’s algorithm in general, if it yields a better ratio near the critical case, i.e., when c(s,t)c(s,t) is close to 13c(x)\frac{1}{3}c(x^{*}), this algorithm can be combined with Hoogeveen’s to provide a better overall performance guarantee. Another possibility could be to devise a third upper bound on c(M)c(M): if such a bound is better than the other two near the critical case, this would lead to a tighter integrality gap bound. It remains an open question whether the analysis for this critical case can be so improved.

4 Unit-weight graphical metrics

We illustrate in this section how the critical case discussed in Section 3 can be improved by introducing another algorithm that bears a better performance guarantee near the critical case. In particular, we show how the present results combine with the recent work due to Oveis Gharan, Saberi and Singh [16] on a special case of the traveling salesman circuit problem, to yield an improved constructive integrality gap proof of the path-variant Held-Karp relaxation under a similar special case.

Oveis Gharan et al. consider the case where the metric is a shortest path metric defined by an unweighted undirected graph, and give a (32ϵ0)\left(\frac{3}{2}-\epsilon_{0}\right)-approximation algorithm for some constant ϵ0>0\epsilon_{0}>0. We will consider the traveling salesman path problem with two specified endpoints under the same class of metric, and show how to modify Oveis Gharan, Saberi and Singh’s algorithm for the path case and that, when c(s,t)c(s,t) is close to 13c(x)\frac{1}{3}c(x^{*}), this modified algorithm carries a performance guarantee that is slightly better than 5/35/3.

Algorithm 2 Algorithm for the shortest-path metric defined by an unweighted undirected graph
0  Complete graph G=(V,E)G=(V,E) with cost function c:E+c:E\to\mathbb{R}_{+}; endpoints s,tVs,t\in V.
0  Hamiltonian path between ss and tt
1 xx^{*}\leftarrowoptimal solution to the path-variant Held-Karp relaxation
2 if c(s,t)=(13+α)c(x)c(s,t)=(\frac{1}{3}+\alpha)c(x^{*}) for α[σl,σu]\alpha\in[-\sigma_{l},\sigma_{u}] then
3   if at least (1ϵ2)(n1)(1-\epsilon^{\prime}_{2})(n-1) edges are nearly integral with respect to xx^{*} then {Case A1}
4    Find a minimum spanning subgraph FF^{\prime} containing all the nearly integral edges
5    Find a minimum spanning tree FF of FF^{\prime}
6    Let TT be the set of odd-degree internal points and even-degree endpoints in FF
7    Compute a minimum TT-join JJ; LFJL\leftarrow F\cup J
8   else {Case A2}
9    xcircuit:=x+𝐞(s,t)x^{*}_{\mathrm{circuit}}:=x^{*}+\mathbf{\mathbf{e}}_{(s,t)}
10    Sample spanning tree FF from max-entropy distrib. with approx. marginal (11n)xcircuit(1-\frac{1}{n})x^{*}_{\mathrm{circuit}}
11    Let TT be the set of odd-degree vertices in FF
12    Compute a minimum TT-join JJ; L0FJL_{0}\leftarrow F\cup J
13    if (s,t)L0(s,t)\in L_{0} then LL0{(s,t)}L\leftarrow L_{0}\setminus\{(s,t)\} else LL0{(s,t)}L\leftarrow L_{0}\cup\{(s,t)\} end if
14   end if
15 else {Case B, Hoogeveen’s algorithm}
16   Find a minimum spanning tree HH of GG.
17   Let TT be the set of odd-degree internal points and even-degree endpoints in HH
18   Compute a minimum TT-join JJ; LFJL\leftarrow F\cup J
19 end if
20  Shortcut an Eulerian path of the multigraph LL to obtain a Hamiltonian path; output it.

Algorithm 2 gives the entire algorithm for the traveling salesman path problem under the special case. It first computes the LP optimum xx^{*}. If c(s,t)c(s,t) is close to 13c(x)\frac{1}{3}c(x^{*}), we run a modified version of Oveis Gharan, Saberi and Singh’s algorithm (Cases A1 and A2); otherwise, we invoke Hoogeveen’s algorithm (Case B). Let 𝖠𝖫𝖦\mathsf{ALG} be the output of the algorithm. Parameters σl,σu\sigma_{l},\sigma_{u} and ϵ2\epsilon^{\prime}_{2} are to be chosen later.

First we show that we can have a Structure Theorem analogous to Theorem 2 by adjusting ϵ2\epsilon_{2} and replacing nn with (n1)(n-1) in Case 2. The following corollary states that either there are good edges of significant weight with respect to xcircuitx^{*}_{\mathrm{circuit}} or there are many nearly integral edges with respect to xx^{*}.

Corollary 1.

Let xx^{*} be a feasible solution to the path-variant Held-Karp relaxation and xcircuit:=x+𝐞(s,t)x^{*}_{\mathrm{circuit}}:=x^{*}+\mathbf{\mathbf{e}}_{(s,t)}. Let μ\mu be a maximum entropy distribution over spanning trees with approximate marginal (11n)xcircuit(1-\frac{1}{n})x^{*}_{\mathrm{circuit}}. There exist small constants ϵ1,ϵ2>0\epsilon_{1},\epsilon^{\prime}_{2}>0 such that at least one of the following is true:

  • 1.

    there exists a set EEE^{*}\subset E such that x(E)ϵ1nx(E^{*})\geq\epsilon_{1}n and every edge in EE^{*} is good with respect to xcircuitx^{*}_{\mathrm{circuit}};

  • 2.

    there exist at least (1ϵ2)(n1)(1-\epsilon^{\prime}_{2})(n-1) edges that are nearly integral with respect to xx^{*}.

Proof.

By Theorem 2, at least one of the two cases of Theorem 2 holds. Case 1 of Theorem 2 and Case 1 of this corollary are identical, so consider when Case 2 of Theorem 2 holds.

Recall that ϵ2\epsilon_{2} was chosen as 51025\cdot 10^{-2}; we choose ϵ2=6102\epsilon^{\prime}_{2}=6\cdot 10^{-2}.

Suppose n19n\leq 19. xcircuitx^{*}_{\mathrm{circuit}} has at least (1ϵ2)n(1-\epsilon_{2})n nearly integral edges; thus, xx^{*} has at least (1ϵ2)n1=n1(1ϵ2)(n1)\lceil(1-\epsilon_{2})n\rceil-1=n-1\geq(1-\epsilon^{\prime}_{2})(n-1) nearly integral edges.

Suppose n20n\geq 20. xx^{*} has at least

(1ϵ2)n1\displaystyle(1-\epsilon_{2})n-1 =\displaystyle= (1ϵ2)(n1)ϵ2\displaystyle(1-\epsilon_{2})(n-1)-\epsilon_{2}
\displaystyle\geq (12019ϵ2)(n1)\displaystyle(1-\frac{20}{19}\epsilon_{2})(n-1)
\displaystyle\geq (1ϵ2)(n1)\displaystyle(1-\epsilon^{\prime}_{2})(n-1)

nearly integral edges. ∎

Lemma 5.

In Case A1, c(𝖠𝖫𝖦)(53CA1)c(x)c(\mathsf{ALG})\leq(\frac{5}{3}-C_{A1})c(x^{*}) for some cA1>0c_{A1}>0.

Proof.

The following proof is adapted from [16] and modified for the path case.

Let SS^{\prime} be the set of nearly integral edges. Since the metric is defined by an unweighted connected graph, c(F)=c(S)+|FS|c(x(S))1γ+|FS|c(F^{\prime})=c(S^{\prime})+|F^{\prime}\setminus S^{\prime}|\leq\frac{c(x(S^{\prime}))}{1-\gamma}+|F^{\prime}\setminus S^{\prime}|. From γ<13\gamma<\frac{1}{3}, we know that SS^{\prime} is a union of disjoint cycles and paths and the lengths of cycles are at least 1γ\frac{1}{\gamma}. Thus, |FS|(n1)(1ϵ2)(1γ)|F\cap S^{\prime}|\geq(n-1)(1-\epsilon^{\prime}_{2})(1-\gamma) and |FS|(n1)(ϵ2+γ)c(x)(ϵ2+γ)|F\setminus S^{\prime}|\leq(n-1)(\epsilon^{\prime}_{2}+\gamma)\leq c(x^{*})(\epsilon^{\prime}_{2}+\gamma). Let S=SFS=S^{\prime}\cap F.

We construct a feasible solution yy to (4) as follows. Note that Theorem 1 implies c(J)c(y)c(J)\leq c(y).

ye={1if eFSxeif eEFxe2(1γ)if eSy_{e}=\begin{cases}1&\textrm{if }e\in F\setminus S\\ x_{e}^{*}&\textrm{if }e\in E\setminus F\\ \frac{x_{e}^{*}}{2(1-\gamma)}&\textrm{if }e\in S\end{cases}

Let (U,U¯)(U,\bar{U}) be any cut that has an odd number of vertices in TT on one side. If there exists an edge e(FS)δ(U)e\in(F\setminus S)\cap\delta(U), then y(δ(U))ye=1y(\delta(U))\geq y_{e}=1. So suppose from now on that δ(U)FS\delta(U)\cap F\subset S. Then δ(U)S=δ(U)F\delta(U)\cap S=\delta(U)\cap F.

If ss and tt lie on the same side of the cut, UU contains odd number of odd-degree vertices, and thus |δ(U)F||\delta(U)\cap F| is odd. We have x(δ(U))2x^{*}(\delta(U))\geq 2 from the Held-Karp formulation and thus

{y(δ(U))x(δ(U)F)1if |δ(U)F|=1y(δ(U))y(δ(U)S)31γ2(1γ)>1if |δ(U)S|3.\begin{cases}y(\delta(U))\geq x^{*}(\delta(U)\setminus F)\geq 1&\textrm{if }|\delta(U)\cap F|=1\\ y(\delta(U))\geq y(\delta(U)\cap S)\geq 3\frac{1-\gamma}{2(1-\gamma)}>1&\textrm{if }|\delta(U)\cap S|\geq 3.\end{cases}

If (U,U¯)(U,\bar{U}) separates ss and tt, then UU contains even number of odd-degree vertices, and thus |δ(U)F||\delta(U)\cap F| is even. We have (δ(U)F)(\delta(U)\cap F)\neq\emptyset since FF is connected and

y(δ(U))y(δ(U)S)21γ2(1γ)=1.y(\delta(U))\geq y(\delta(U)\cap S)\geq 2\frac{1-\gamma}{2(1-\gamma)}=1.

Thus yy is a feasible solution to LPTjoin\mathrm{LP_{T-join}}. Now,

c(𝖠𝖫𝖦)\displaystyle c(\mathsf{ALG}) \displaystyle\leq c(F)+c(y)\displaystyle c(F)+c(y)
\displaystyle\leq c(x(S))1γ+c(FS)+c(FS)+c(x(EF))+c(x(S))2(1γ)\displaystyle\frac{c(x^{*}(S))}{1-\gamma}+c(F\setminus S)+c(F\setminus S)+c(x^{*}(E\setminus F))+\frac{c(x^{*}(S))}{2(1-\gamma)}
\displaystyle\leq 3c(x(S))2(1γ)+2c(x)(ϵ2+γ)+c(x(ES))\displaystyle\frac{3c(x^{*}(S))}{2(1-\gamma)}+2c(x^{*})(\epsilon^{\prime}_{2}+\gamma)+c(x^{*}(E\setminus S))
\displaystyle\leq c(x)(32(1γ)+2ϵ2+2γ)\displaystyle c(x^{*})(\frac{3}{2(1-\gamma)}+2\epsilon^{\prime}_{2}+2\gamma)
\displaystyle\leq c(x)(53CA1)\displaystyle c(x^{*})(\frac{5}{3}-C_{A1})

for some CA1>0C_{A1}>0. For example, we can choose cA1=4102c_{A1}=4\cdot 10^{-2}. ∎

Lemma 6.

In Case A2, E[c(𝖠𝖫𝖦)](53CA2)c(x)\mathrm{E}[c(\mathsf{ALG})]\leq(\frac{5}{3}-C_{A2})c(x^{*}) for some CA2>0C_{A2}>0.

Proof.

First we have

E[c(F)]\displaystyle\mathrm{E}[c(F)] \displaystyle\leq c((1+νnk)(11n)xcircuit)\displaystyle c\left((1+\frac{\nu}{n^{k}})(1-\frac{1}{n})x^{*}_{\mathrm{circuit}}\right)
\displaystyle\leq (1+15n2)(11n)(43+α)c(x)\displaystyle(1+\frac{1}{5n^{2}})(1-\frac{1}{n})(\frac{4}{3}+\alpha)c(x^{*})
\displaystyle\leq (145n)(43+α)c(x).\displaystyle(1-\frac{4}{5n})(\frac{4}{3}+\alpha)c(x^{*}).

From Lemma 1,

E[c(J)](43+α)c(x)(12ϵ1δρ4(1+δ)).\mathrm{E}[c(J)]\leq(\frac{4}{3}+\alpha)c(x^{*})(\frac{1}{2}-\frac{\epsilon_{1}\delta\rho}{4(1+\delta)}).

We have

Pr[(s,t)L0]\displaystyle\Pr[(s,t)\in L_{0}] \displaystyle\geq Pr[(s,t)F]\displaystyle\Pr[(s,t)\in F]
=\displaystyle= n1E[|F(s,t)|]\displaystyle n-1-\mathrm{E}[|F\setminus(s,t)|]
\displaystyle\geq n1(n2+1n)(1+νnk)\displaystyle n-1-(n-2+\frac{1}{n})(1+\frac{\nu}{n^{k}})
\displaystyle\geq n1(n2+1n)(1+15n2)\displaystyle n-1-(n-2+\frac{1}{n})(1+\frac{1}{5n^{2}})
\displaystyle\geq 175n\displaystyle 1-\frac{7}{5n}

and hence

E[c(𝖠𝖫𝖦)]\displaystyle\mathrm{E}[c(\mathsf{ALG})] \displaystyle\leq E[c(F)]+E[c(J)](175n)c(s,t)+75nc(s,t)\displaystyle\mathrm{E}[c(F)]+\mathrm{E}[c(J)]-(1-\frac{7}{5n})c(s,t)+\frac{7}{5n}c(s,t)
\displaystyle\leq c(x){(145n)(43+α)+(43+α)(12ϵ1δρ4(1+δ))(175n)(13+α)+75n(13+α)}\displaystyle c(x^{*})\left\{(1-\frac{4}{5n})(\frac{4}{3}+\alpha)+(\frac{4}{3}+\alpha)(\frac{1}{2}-\frac{\epsilon_{1}\delta\rho}{4(1+\delta)})-(1-\frac{7}{5n})(\frac{1}{3}+\alpha)+\frac{7}{5n}(\frac{1}{3}+\alpha)\right\}
=\displaystyle= c(x){(53ϵ1δρ3(1+δ))+α(12ϵ1δρ4(1+δ))1n(2152α)}\displaystyle c(x^{*})\left\{(\frac{5}{3}-\frac{\epsilon_{1}\delta\rho}{3(1+\delta)})+\alpha(\frac{1}{2}-\frac{\epsilon_{1}\delta\rho}{4(1+\delta)})-\frac{1}{n}(\frac{2}{15}-2\alpha)\right\}
\displaystyle\leq c(x)(53CA2)\displaystyle c(x^{*})(\frac{5}{3}-C_{A2})

for some CA2>0C_{A2}>0 by choosing sufficiently small σl,σu>0\sigma_{l},\sigma_{u}>0. For example, we can choose σl=7.81052\sigma_{l}=7.8\cdot 10^{-52}, σu=3.91052\sigma_{u}=3.9\cdot 10^{-52} and CA2=3.91052C_{A2}=3.9\cdot 10^{-52}. ∎

Lemma 7.

In Case B, c(𝖠𝖫𝖦)(53CB)c(x)c(\mathsf{ALG})\leq(\frac{5}{3}-C_{B})c(x^{*}) for some CB>0C_{B}>0.

Proof.

Suppose that c(s,t)<(13σl)c(x)c(s,t)<(\frac{1}{3}-\sigma_{l})c(x^{*}). From Lemmas 2 and 3, it follows that

c(𝖠𝖫𝖦)\displaystyle c(\mathsf{ALG}) \displaystyle\leq c(F)+c(J)\displaystyle c(F)+c(J)
<\displaystyle< c(x)+12{c(x)+(13σl)c(x)}\displaystyle c(x^{*})+\frac{1}{2}\left\{c(x^{*})+(\frac{1}{3}-\sigma_{l})c(x^{*})\right\}
=\displaystyle= (53σl2)c(x).\displaystyle\left(\frac{5}{3}-\frac{\sigma_{l}}{2}\right)c(x^{*}).

Suppose c(s,t)>(13+σu)c(x)c(s,t)>(\frac{1}{3}+\sigma_{u})c(x^{*}). From Lemmas 2 and 4,

c(𝖠𝖫𝖦)\displaystyle c(\mathsf{ALG}) \displaystyle\leq c(F)+c(J)\displaystyle c(F)+c(J)
<\displaystyle< c(x)+{c(x)(13+σu)c(x)}\displaystyle c(x^{*})+\left\{c(x^{*})-(\frac{1}{3}+\sigma_{u})c(x^{*})\right\}
=\displaystyle= (53σu)c(x).\displaystyle\left(\frac{5}{3}-\sigma_{u}\right)c(x^{*}).

Now choose CB:=min(σl2,σu)C_{B}:=\min(\frac{\sigma_{l}}{2},\sigma_{u}). ∎

Lemmas 5, 6 and 7 yield the following theorem.

Theorem 4.

For some ϵ>0\epsilon>0, Algorithm 2 is a (53ϵ)(\frac{5}{3}-\epsilon)-approximation algorithm for the traveling salesman path problem with two prespecified endpoints under the shortest-path metric defined by an unweighted undirected graph.

Proof.

In Cases A1 and B, the multigraph LL is the union of a spanning tree and a TT-join where TT is the set of the vertices with the wrong parity of degree. Thus, LL has an Eulerian path between the two endpoints.

In Case A2, L0L_{0} is Eulerian and hence 2-edge-connected; LL0{(s,t)}L\supset L_{0}\setminus\{(s,t)\} is therefore connected and LL has an Eulerian path between the two endpoints.

By choosing ϵ=min{CA1,CA2,CB}\epsilon=\min\{C_{A1},C_{A2},C_{B}\}, ϵ=3.91052\epsilon=3.9\cdot 10^{-52} for example, we have E[c(𝖠𝖫𝖦)](53ϵ)c(x)\mathrm{E}[c(\mathsf{ALG})]\leq(\frac{5}{3}-\epsilon)c(x^{*}) from Lemmas 5, 6 and 7. Thus, Algorithm 2 is a (53ϵ)(\frac{5}{3}-\epsilon)-approximation algorithm. ∎

Note that this proof also establishes the matching integrality gap upper bound.

Acknowledgments.

The authors thank Bobby Kleinberg for very helpful discussions.

References

  • [1] H.-C. An, R. D. Kleinberg, and D. B. Shmoys. Approximation algorithms for the bottleneck asymmetric traveling salesman problem. In APPROX-RANDOM, pages 1–11, 2010.
  • [2] A. Asadpour, M. X. Goemans, A. Mądry, S. Oveis Gharan, and A. Saberi. An O(logn/loglogn)O(\log n/\log\log n)-approximation algorithm for the asymmetric traveling salesman problem. In SODA ’10: Proceedings of the 21st Annual ACM-SIAM Symposium on Discrete Algorithms, pages 379–389, 2010.
  • [3] M. Charikar, M. X. Goemans, and H. Karloff. On the integrality ratio for asymmetric TSP. In Proceedings of the 45th Annual IEEE Symposium on Foundations of Computer Science, pages 101–107. IEEE Computer Society, 2004.
  • [4] C. Chekuri and M. Pál. An O(logn)O(\log n) approximation ratio for the asymmetric traveling salesman path problem. Theory of Computing, 3(1):197–209, 2007.
  • [5] N. Christofides. Worst-case analysis of a new heuristic for the travelling salesman problem. Technical Report 388, Graduate School of Industrial Administration, CMU, 1976.
  • [6] J. Edmonds and E. Johnson. Matching: A well-solved class of integer linear programs. In M. Junger, G. Reinelt, and G. Rinaldi, editors, Combinatorial Optimization — Eureka, You Shrink!, volume 2570 of Lecture Notes in Computer Science, pages 27–30. Springer Berlin / Heidelberg, 2003.
  • [7] A. M. Frieze, G. Galbiati, and F. Maffioli. On the worst-case performance of some algorithms for the asymmetric traveling salesman problem. Networks, 12:23–39, 1982.
  • [8] M. X. Goemans. Worst-case comparison of valid inequalities for the TSP. Mathematical Programming, 69:335–349, 1995.
  • [9] M. Grötschel, L. Lovász, and A. Schrijver. The ellipsoid method and its consequences in combinatorial optimization. Combinatorica, 1(2):169–197, 1981.
  • [10] M. Held and R. M. Karp. The traveling-salesman problem and minimum spanning trees. Operations Research, 18(6):1138–1162, 1970.
  • [11] J. A. Hoogeveen. Analysis of Christofides’ heuristic: some paths are more difficult than cycles. Operations Research Letters, 10(5):291 – 295, 1991.
  • [12] J. Kleinberg and D. P. Williamson. Unpublished manuscript, 1998. Also see http://legacy.orie.cornell.edu/˜dpw/cornell.ps, pages 124–126.
  • [13] H. T. Lau. Finding EPS-Graphs. Monatshefte für Mathematik, 92(1):37–40, Mar 1981.
  • [14] E. L. Lawler, J. K. Lenstra, A. H. G. Rinnooy Kan, and D. B. Shmoys, editors. The Traveling Salesman Problem: A Guided Tour of Combinatorial Optimization. Wiley, 1985.
  • [15] T. Mömke and O. Svensson. Approximating graphic TSP by matchings. CoRR, abs/1104.3090, 2011. http://arxiv.org/abs/1104.3090.
  • [16] S. Oveis Gharan, A. Saberi, and M. Singh. A randomized rounding approach to the traveling salesman problem. http://www.stanford.edu/˜shayan/Publications_files/tsp.pdf.
  • [17] R. G. Parker and R. L. Rardin. Guaranteed performance heuristics for the bottleneck traveling salesman problem. Operations Research Letters, 2(6):269 – 272, 1984.
  • [18] D. B. Shmoys and D. P. Williamson. Analyzing the Held-Karp TSP bound: a monotonicity property with application. Information Processing Letters, 35(6):281–285, 1990.
  • [19] L. A. Wolsey. Heuristic analysis, linear programming and branch and bound. In Combinatorial Optimization II, volume 13 of Mathematical Programming Studies, pages 121–134. Springer Berlin Heidelberg, 1980.