This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Fixed-Parameter Tractability of the (1+1) Evolutionary Algorithm on Random Planted Vertex Covers

Jack Kearney Algorithmic Evolution Lab
Department of Computer Science
University of Minnesota Duluth
 
Frank Neumann Optimisation and Logistics Group
School of Computer Science
University of Adelaide
Andrew M. Sutton Algorithmic Evolution Lab
Department of Computer Science
University of Minnesota Duluth
 
Abstract

We present the first parameterized analysis of a standard (1+1) Evolutionary Algorithm on a distribution of vertex cover problems. We show that if the planted cover is at most logarithmic, restarting the (1+1) EA every O(nlogn)O(n\log n) steps will find a cover at least as small as the planted cover in polynomial time for sufficiently dense random graphs p>0.71p>0.71. For superlogarithmic planted covers, we prove that the (1+1) EA finds a solution in fixed-parameter tractable time in expectation.

We complement these theoretical investigations with a number of computational experiments that highlight the interplay between planted cover size, graph density and runtime.

1 Introduction

Combinatorial problems with planted solutions have been an important subject of study on a wide range of settings. In this scenario, a fixed solution is hidden within a large random structure such as a graph. The canonical example of this is the planted clique problem where a fixed complete subgraph of size kk is placed within a large Erdős-Rényi random graph on nkn\gg k vertices. The task is to either recover the hidden solution [AKS98] or one of size at least kk [Jer92]. These problems have important applications in cryptography [JP00] for example. In the context of randomized search heuristics, Storch [Sto07] investigated the planted clique problem for random local search (RLS) and the (1+1) EA. More recently, Doerr et al. [DNS17] considered randomly generated propositional satisfiability problems with planted assignments and proved that the (1+1) EA requires at most O(nlogn)O(n\log n) time to solve this problem provided that the constraint density is high enough.

Planted vertex covers have recently been studied in the context of systematically incomplete data [BK18] in networks. In this view, true node interactions can only be observed among some core set CC, whereas a potentially much larger set of fringe nodes lies outside this sphere of observability. This may occur, for example, in social networks and communication data sets [RUK19] where a company only knows about links within the company and between an employee and the outside world, but not about links between external entities. This translates to a planted vertex cover problem on a graph G=(V,E)G=(V,E). An adversary knows of a subset CVC\subseteq V which is a vertex cover, and the task is to identify a set as close to CC as possible.

In the 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} model, a graph G=(V,E)G=(V,E) is constructed on a set VV of nn vertices by taking a size-kk subset CVC\subseteq V to be the core. An edge appears in GG with probability pp unless it connects two vertices in VCV\setminus C, in which it occurs with probability zero. Therefore, GG is guaranteed to have a kk-vertex cover. Note that a graph can be constructed from this model by drawing a standard Erdős-Rényi graph and subsequently deleting all edges that connect fringe vertices.

This model is a special case of the stochastic block model of random graphs from network theory [HLL83] in which the vertex set is partitioned into rr disjoint communities and edge probabilities are specified by a symmetric r×rr\times r matrix PP where a vertex in community ii is connected to a vertex in community jj with probability PijP_{ij}. The stochastic block model allows for the generation of graphs from which the community subgraphs might be recovered partially or in full from the graph data [AS15]. This models the detection of community structure in networks, which is a fundamental problem in computer science. The 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} model we study in this work is a stochastic block model with r=2r=2 and probability matrix

P=[ppp0].P=\left[\begin{array}[]{cc}p&p\\ p&0\\ \end{array}\right].

In this paper, we are interested in the performance of simple randomized search heuristics on planted vertex cover problems in the context of parameterized complexity. We prove that, for sufficiently “dense” graphs (i.e., large enough pp), the (1+1) EA is with high probability a fixed-parameter tractable heuristic for the kk-vertex cover problem where kk is the size of the planted solution. More precisely, if kk is at most logarithmic, we prove there is a threshold on pp such that above this threshold the (1+1) EA is very likely to find a kk-cover in almost linear time. For larger values of kk, we show that the (1+1) EA runs in O(f(k,p)nlogn))O(f(k,p)n\log n)) time where ff is a function of kk and pp (but not nn).

The first parameterized result on vertex cover is due to Kratsch and Neumann [KN12] who demonstrated that Global SEMO using instance-specific mutation operators has expected optimization time O(OPTn4+n2OPT2+OPT)O(OPT\cdot n^{4}+n\cdot 2^{OPT^{2}+OPT}) on any graph GG where OPTOPT is the size of the optimal vertex cover of GG. This result can be tightened to O(n2logn+OPTn2+4OPTn)O(n^{2}\log n+OPT\cdot n^{2}+4^{OPT}n) by incorporating the cost of an optimal fractional vertex cover provided by an LP solver into the fitness function. A recent study by Baguley et al. [BFN+23] extended these multi-objective approaches to the W-separator problem. Using a special focused jump-and-repair mechanism, Branson and Sutton [BS21] showed that evolutionary algorithms can solve the vertex cover problem in expected time O(2OPTn2logn)O(2^{OPT}n^{2}\log n) by probabilistically simulating an iterative compression routine.

The above results hold for all graphs GG with vertex cover size OPTOPT. In this paper, we sacrifice the generality of the problem slightly in order to investigate a more general algorithm, i.e., the (1+1) EA. To our knowledge, we present here the first parameterized complexity result on vertex cover problems for a standard evolutionary algorithm that does not rely on any special mutation operators.

Our results. For random planted graph models with nn vertices, edge density pp and planted cover size kk, we show that if klnnk\leq\ln n, then if p>1lnδ2p>\sqrt{\frac{1-\ln\delta}{2}} for any constant δ(1/e,1)\delta\in(1/e,1), a restart framework for the (1+1) EA finds a kk-cover in nc+1lognn^{c+1}\log n, where cc is a constant. If k>lnnk>\ln n, then we show for any 0<p<10<p<1, the expected time of the (1+1) EA is O(k4k(1+1p)nlogn)O{\left(k^{4k\left(1+\frac{1}{p}\right)}n\log n\right)}, i.e., the (1+1) EA runs in FPT time parameterized by kk and pp.

We also provide the results of computational experiments that investigate regimes that our theorem does not cover, for example when both pp and kk are small. These results elucidate the relationship between kk and pp and the runtime of the (1+1) EA, and hint at new interesting directions for future theoretical study.

2 Preliminaries

Given a graph G=(V,E)G=(V,E) on nn vertices, we encode subsets of VV as elements of {0,1}n\{0,1\}^{n} in the usual way. For x{0,1}nx\in\{0,1\}^{n}, denote as |x||x| as the number of bits set to 11 in xx (i.e., the cardinality of the set to which it corresponds). The fitness function typically employed by evolutionary algorithms on the minimum vertex cover problem first penalizes infeasible sets (sets that do not cover all edges in EE), then penalizes larger feasible covers:

f(x)=|x|+n|{(u,v)E:x[u]=x[v]=0}|.f(x)=|x|+n\cdot\Big{\lvert}\Big{\{}(u,v)\in E\colon x[u]=x[v]=0\Big{\}}\Big{\rvert}. (1)

This fitness function is quite natural for searching for a minimal cover, and was originally designed by Khuri and Bäck [KB94]. It has been studied extensively both empirically and theoretically [KB94, OHY09, FHH+10].

We point out that this is a so-called vertex-based representation for which there are currently no bounds on the approximation ratio for the (1+1) EA. It is possible to obtain a guaranteed 2-approximation with the (1+1) EA by using edge-based representations instead [JOZ13]. This is rather notable, as minimum vertex cover is likely hard to approximate below a (2ϵ)(2-\epsilon) factor [KR08].

Input: A fitness function f:{0,1}nf\colon\{0,1\}^{n}\to\mathbb{R}
1
2Choose xx uniformly at random from {0,1}n\{0,1\}^{n};
3 while termination criteria not met do
4  Create yy by flipping each bit of xx with probability 1/n1/n;
5 if f(y)f(x)f(y)\leq f(x) then xyx\leftarrow y;
6 
7return xx;
Algorithm 1 (1+1) EA

Many of our theoretical results make use of multiplicative drift with tail bounds, which we state in the following theorem for reference.

Theorem 1 (Multiplicative Drift [DG10, KK19]).

Let (Xt)t(X_{t})_{t\in\mathbb{N}} be a stochastic process over \mathbb{R}, xmin>0x_{\min}>0 and let Tmin{t:Xt<xmin}T\coloneqq\min\{t:X_{t}<x_{\min}\}. Suppose that X0xminX_{0}\geq x_{\min} and, for all tTt\leq T, it holds that Xt0X_{t}\geq 0, and there exists some δ>0\delta>0 such that, for all t<Tt<T, E[XtXt+1X0,,Xt]δXt\operatorname{E}[X_{t}-X_{t+1}\mid X_{0},\ldots,X_{t}]\geq\delta X_{t}, then,

  1. 1.

    E[TX0]ln(X0/xmin)+1δ\operatorname{E}[T\mid X_{0}]\leq\frac{\ln(X_{0}/x_{\min})+1}{\delta}, and

  2. 2.

    Pr(Tln(X0/xmin)+rδ)er\Pr\left(T\geq\frac{\ln(X_{0}/x_{\min})+r}{\delta}\right)\leq e^{-r}

The fitness function in Equation (1) ensures that Algorithm 1 quickly finds a feasible cover, which is captured in Theorem 2, which was proved asymptotically in [FHH+10, Theorem 1]. We restate this result here with a simple upper bound with leading constants using drift.

Theorem 2.

The expected time until the (1+1) EA finds a feasible cover for any graph on nn vertices is at most 12(enlnn+en)\frac{1}{2}(en\ln n+en).

Proof.

Let (Xt)t(X_{t})_{t\in\mathbb{N}} be the stochastic process that counts the number of edges uncovered by the candidate solution in iteration tt of the (1+1) EA. For any vertex uu, denote as dt(u)d_{t}(u) the count of uncovered edges incident to uu in iteration tt. Since any vertex uu is flipped with probability (11/n)n1(1/n)(en)1(1-1/n)^{n-1}(1/n)\geq(en)^{-1}, and an increase in uncovered edges is never accepted, we may bound the drift of (Xt)(X_{t}) as

E[XtXt+1Xt]udt(u)en=2Xten\displaystyle\operatorname{E}[X_{t}-X_{t+1}\mid X_{t}]\geq\sum_{u}\frac{d_{t}(u)}{en}=\frac{2X_{t}}{en}

since each of the XtX_{t} uncovered edges is counted twice in the sum over dtd_{t}. The claim follows by Theorem 1. ∎

Definition 1.

Let n,kn,k\in\mathbb{N} and p(0,1)p\in(0,1). The 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} model of random planted graphs is a distribution of random graphs on nn vertices defined by construction as follows.

Let VV be a set of nn (labeled) vertices. Choose a kk-subset CVC\subset V uniformly at random, and for each u,vVu,v\in V, if {u,v}C\{u,v\}\cap C\neq\emptyset, add edge uvuv to EE with probability pp.

In the resulting graph G=(V,E)G=(V,E), we refer to CC as the core, and each vCv\in C as a core vertex. We refer to vertices in VCV\setminus C as fringe vertices.

3 Small kk

In this section we consider 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} where klnnk\leq\ln n. Our results rely heavily on the following property of planted vertex cover graphs, which we call δ\delta-heaviness.

Definition 2.

Let G=(V,E)G=(V,E) be a graph drawn from the 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} model. For a constant 0<δ<10<\delta<1, we say GG is δ\delta-heavy if for every subset SVCS\subset V\setminus C where |S|=δ|VC||S|=\delta|V\setminus C|, every core vertex in CC is adjacent to at least lnn\ln n vertices in SS.

Lemma 1.

Let G=(V,E)G=(V,E) be a graph drawn from the 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} model. Let δ,p(0,1)\delta,p\in(0,1) be constants. If p>1lnδ2p>\sqrt{\frac{1-\ln\delta}{2}}, then GG is δ\delta-heavy with probability 1eΩ(n)1-e^{-\Omega(n)}.

Proof.

Fix an arbitrary vCv\in C and an arbitrary δ(nk)\delta(n-k)-sized subset SVCS\subset V\setminus C. We first bound the probability that vv is adjacent to no more than lnn\ln n vertices in SS. Let XX be the random variable that counts the edges between vv and vertices in SS. Each edge from vv to a vertex in SS appears independently with probability pp, so XX is the sum of |S||S| independent Bernoulli random variables, each with success probability pp so E[X]=p|S|\operatorname{E}[X]=p|S|. By Hoeffding’s inequality [Hoe63], for any t>0t>0, Pr(XE[X]t)<e2t2/|S|\Pr(X\leq\operatorname{E}[X]-t)<e^{-2t^{2}/|S|}, thus the probability that vv is adjacent to at most lnn\ln n vertices in SS can be estimated by

Pr(Xlnn)\displaystyle\Pr(X\leq\ln n) =Pr(XE[X](E[X]lnn))\displaystyle=\Pr(X\leq\operatorname{E}[X]-(\operatorname{E}[X]-\ln n))
<e2(p|S|lnn)2/|S|\displaystyle<e^{-2(p|S|-\ln n)^{2}/|S|}
=exp(2(p2|S|+ln2n|S|2plnn))\displaystyle=\exp\left(-2\left(p^{2}|S|+\frac{\ln^{2}n}{|S|}-2p\ln n\right)\right)
exp(2δp2(nk)+4plnn).\displaystyle\leq\exp\left(-2\delta p^{2}(n-k)+4p\ln n\right).

We have assumed klnnk\leq\ln n, so this probability is at most

exp(2δp2(nlnn)+4plnn)<exp(2δp2n+6plnn).\exp\left(-2\delta p^{2}(n-\ln n)+4p\ln n\right)<\exp\left(-2\delta p^{2}n+6p\ln n\right).

Note that we have used here the fact that δ<1\delta<1 and p2<pp^{2}<p. Taking a union bound over all kk vertices vCv\in C, the probability that any core vertex is adjacent to fewer than lnn\ln n vertices in SS is at most

exp(2δp2n+6plnn+lnk).\exp\left(-2\delta p^{2}n+6p\ln n+\ln k\right).

A final union bound over all subsets SS of size δ|VC|=δ(nk)\delta|V\setminus C|=\delta(n-k) shows the probability that GG is not δ\delta-heavy is at most

(nδn)\displaystyle\binom{n}{\delta n} exp(2δp2n+6plnn+lnk)\displaystyle\exp\left(-2\delta p^{2}n+6p\ln n+\ln k\right)
eδnnδn(δn)δnexp(2δp2n+6plnn+lnk)\displaystyle\leq\frac{e^{\delta n}n^{\delta n}}{(\delta n)^{\delta n}}\exp\left(-2\delta p^{2}n+6p\ln n+\ln k\right)
=exp(2δp2n+6plnn+lnk+δnln(e/δ))\displaystyle=\exp\left(-2\delta p^{2}n+6p\ln n+\ln k+\delta n\ln(e/\delta)\right)
exp(δn(2p2ln(e/δ))+(6p+1)lnn).\displaystyle\leq\exp\left(-\delta n(2p^{2}-\ln(e/\delta))+(6p+1)\ln n\right).

Since p>1lnδ2p>\sqrt{\frac{1-\ln\delta}{2}}, and pp and δ\delta are taken to be positive constants, we have 2p2ln(e/δ)=Ω(1)2p^{2}-\ln(e/\delta)=\Omega(1), and the probability that GG is not δ\delta-heavy is eΩ(n)e^{-\Omega(n)}, which completes the proof. ∎

Theorem 3.

Consider the 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} model with klnnk\leq\ln n and p>1lnδ2p>\sqrt{\frac{1-\ln\delta}{2}} for some constant δ(1/e,1)\delta\in(1/e,1). Then for all but an exponentially-fast vanishing fraction of all graphs GG sampled from 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)}, if TT is the runtime for the (1+1) EA to find a kk-cover on GG, we have

Pr(T2enlnn+en(1δ))=Ω(n(e(1δ)ln(2e)+ln2)).\Pr\left(T\leq 2en\ln n+\lfloor en(1-\delta)\rfloor\right)=\Omega(n^{-(e(1-\delta)\ln(2e)+\ln 2)}).
Proof.

Since pp is sufficiently large, by Lemma 1, all but an eΩ(n)e^{-\Omega(n)}-fraction of graphs drawn from 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} are δ\delta-heavy. Thus, we assume for the remainder of the proof that GG is δ\delta-heavy.

Let \mathcal{E} be the event that after exactly en(1δ)\lfloor en(1-\delta)\rfloor iterations of the (1+1) EA, the following conditions hold:

  1. 1.

    The core vertices CC belong to the current solution of the (1+1) EA,

  2. 2.

    There are at least δn\delta n fringe vertices that are not part of the current solution of the (1+1) EA.

This is a rather fortunate event for the (1+1) EA, because such a candidate solution is already a feasible vertex cover (as all vertices in CC are present), so after this point no infeasible covers would be accepted. Moreover, since GG is δ\delta-heavy, every core vertex is adjacent to at least lnn\ln n uncovered edges (by condition (2) above). Thus in order to remove a core vertex vv from the cover, a single mutation operation would need to change at least lnn\ln n neighbors of vv to remain feasible. In contrast, it is always possible to remove any fringe vertex from the current cover. Thus if there are ii fringe vertices in the current solution, the probability to improve the fitness is at least i/(en)i/(en). Furthermore, the probability of flipping at least lnn\ln n vertices in a single mutation is nω(1)n^{-\omega(1)}.

Let {Xt}t\{X_{t}\}_{t\in\mathbb{N}} denote the stochastic process that tracks the number of fringe vertices in the cover at time tt. The drift of {Xt}\{X_{t}\} conditioned on \mathcal{E} and starting at iteration en(1δ)\lfloor en(1-\delta)\rfloor is at least Xt/ennω(1)=Ω(Xt/n)X_{t}/en-n^{-\omega(1)}=\Omega(X_{t}/n). By Theorem 1,

Pr(T<2enlnn+en(1δ))=1o(1)\Pr\left(T<2en\ln n+\lfloor en(1-\delta)\rfloor\mid\mathcal{E}\right)=1-o(1)

It remains to bound the probability of \mathcal{E}. Let 1\mathcal{E}_{1} be the event that the initial solution to the (1+1) EA contains every vertex in CC and let 2\mathcal{E}_{2} be the event that the core vertices in CC are not mutated during the first en(1δ)\lfloor en(1-\delta)\rfloor iterations of the (1+1) EA. Conditioning on 12\mathcal{E}_{1}\cap\mathcal{E}_{2}, the (1+1) EA already starts with a feasible solution and does not remove any core vertices during the first en(1δ)\lfloor en(1-\delta)\rfloor steps.

Let T1T_{1} be the random variable that measures the number of iterations until the first time the number of fringe vertices in the cover drops below a δ\delta-fraction. Again applying tail bounds on multiplicative drift, and noting that 1+ln(11δ)1δ1+\ln\left(\frac{1}{1-\delta}\right)\geq 1-\delta for constant 0<δ<10<\delta<1, under the condition 12\mathcal{E}_{1}\cap\mathcal{E}_{2}, the (1+1) EA has reduced the number of fringe vertices in the cover from at most nkn-k to at most δ(nk)\delta(n-k) with probability at least 11/e1-1/e. Applying the law of total probability we have

Pr()\displaystyle\Pr(\mathcal{E}) Pr(12)Pr(12)\displaystyle\geq\Pr(\mathcal{E}\mid\mathcal{E}_{1}\cap\mathcal{E}_{2})\Pr(\mathcal{E}_{1}\cap\mathcal{E}_{2})
=Pr(12)Pr(21)Pr(1)\displaystyle=\Pr(\mathcal{E}\mid\mathcal{E}_{1}\cap\mathcal{E}_{2})\Pr(\mathcal{E}_{2}\mid\mathcal{E}_{1})\Pr(\mathcal{E}_{1})
(11e)[(11n)k]en(1δ)(1/2)k\displaystyle\geq\left(1-\frac{1}{e}\right)\cdot\left[\left(1-\frac{1}{n}\right)^{k}\right]^{\lfloor en(1-\delta)\rfloor}(1/2)^{k}
(11/e)(2e)ek(1δ)2k\displaystyle\geq\left(1-1/e\right)\cdot(2e)^{-ek(1-\delta)}\cdot 2^{-k}
(11/e)n(e(1δ)ln(2e)+ln2),\displaystyle\geq\left(1-1/e\right)\cdot n^{-(e(1-\delta)\ln(2e)+\ln 2)},

where we have used klnnk\leq\ln n in the final inequality. ∎

Input: A fitness function f:{0,1}nf\colon\{0,1\}^{n}\to\mathbb{R} and a run length \ell
1 t0t\leftarrow 0;
2 while termination criteria not met do
3 if t=0t=0 then
4     Choose xx uniformly at random from {0,1}n\{0,1\}^{n};
5    
6  Create yy by flipping each bit of xx with probability 1/n1/n;
7 if f(y)f(x)f(y)\leq f(x) then xyx\leftarrow y;
8 t(t+1)modt\leftarrow(t+1)\bmod\ell;
9 
10return xx;
Algorithm 2 (1+1) EA with cold restarts

Theorem 3 provides a lower bound on the probability that a run of length at least 2enlnn+en(1δ)2en\ln n+\lfloor en(1-\delta)\rfloor finds a kk-cover of a random graph with sufficient density. This bound vanishes with nn, but slowly enough that a simple cold-restart strategy (periodically starting over from a randomly generated cover) is guaranteed to be efficient. This is captured by the following corollary.

Corollary 1 (to Theorem 3).

Consider the 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} model with klnnk\leq\ln n and 0.71p10.71\leq p\leq 1. Running the (1+1) EA with cold restarts (Algorithm 2) with =3enlnn\ell=3en\ln n finds a kk-cover on all but an exponentially-fast vanishing fraction of graphs in O(nc+1logn)O(n^{c+1}\log n) fitness evaluations where 0.73<ce(1+ln2)1<3.610.73<c\leq e(1+\ln 2)-1<3.61 is a constant depending on pp.

Proof.

Let δ=e12p2\delta=e^{1-2p^{2}}. Since p>0.71p>0.71, we have δ(1/e,1)\delta\in(1/e,1). Thus the conditions for Theorem 3 are satisfied, and the success probability for an independent run of length 3enlnn3en\ln n of the (1+1) EA is Ω(n(e(1δ)ln(2e)+ln2)\Omega(n^{-(e(1-\delta)\ln(2e)+\ln 2)}. Under this condition, the number of independent runs until a success is geometrically distributed with expectation ne(1δ)ln(2e)+ln2=ne(1e12p2)(1+ln2)+ln2n^{e(1-\delta)\ln(2e)+\ln 2}=n^{e(1-e^{1-2p^{2}})(1+\ln 2)+\ln 2}, and cc can be chosen appropriately. ∎

4 Large kk

We now consider 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} in which k>lnnk>\ln n. We will make use of the following probabilistic bound on the size of independent sets in the core.

Lemma 2.

Suppose GG is drawn from the 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} model with k=ω(1)k=\omega(1). Then with probability 1o(1)1-o(1), the largest independent set in CC has size at most (1+2/p)lnk+1(1+2/p)\ln k+1.

Proof.

Set (1+2/p)lnk+1\ell\coloneqq\lceil(1+2/p)\ln k+1\rceil. There are (k)\binom{k}{\ell} size-\ell vertex sets in CC. We label these sets from 11 to (k)\binom{k}{\ell} and consider a sequence X1,,X(k)X_{1},\ldots,X_{\binom{k}{\ell}} of indicator random variables over 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} where

Xi={1if the i-th size- subset of C is an independent set in G,0otherwise.X_{i}=\begin{cases}1&\text{if the $i$-th size-$\ell$ subset of $C$ is an independent set in $G$,}\\ 0&\text{otherwise.}\end{cases}

Consider the sum X=X1++X(k)X=X_{1}+\cdots+X_{\binom{k}{\ell}} and note that X=0X=0 if and only if there are no independent sets of size \ell or larger in GG. By Markov’s inequality,

Pr(X1)\displaystyle\Pr(X\geq 1) E[X]=(k)(1p)(2)k((1p)(1)/2)\displaystyle\leq\operatorname{E}[X]=\binom{k}{\ell}(1-p)^{\binom{\ell}{2}}\leq k^{\ell}\left((1-p)^{(\ell-1)/2}\right)^{\ell}
(exp(lnkp(1)/2)),since 1pep,\displaystyle\leq\left(\exp\left(\ln k-p(\ell-1)/2\right)\right)^{\ell},\,\text{since $1-p\leq e^{-p}$,}
=exp([(1+p2)lnk+p2]lnk)\displaystyle=\exp\left(-\left[\left(1+\frac{p}{2}\right)\ln k+\frac{p}{2}\right]\ln k\right)
eln2k,\displaystyle\leq e^{-\ln^{2}k},

since p0p\geq 0. ∎

Theorem 4.

Consider a graph GG drawn from the 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} model with k>lnnk>\ln n. Then with probability 1o(1)1-o(1) (taken over the model), the expected runtime of the (1+1) EA to find a cover of size at most kk on GG is O(k4k(1+1p)nlogn)O{\left(k^{4k\left(1+\frac{1}{p}\right)}n\log n\right)}.

Proof.

By Theorem 2, the (1+1) EA takes at most 12(enlnn+en)\frac{1}{2}(en\ln n+en) steps in expectation to find a feasible solution, after which the (1+1) EA never accepts an infeasible solution.

Consider the potential function ϕ(x)=max{0,f(x)k}\phi(x)=\max\{0,f(x)-k\} and note that when ϕ(x)=0\phi(x)=0, xx is a feasible cover of size at most kk. Moreover, ϕ\phi cannot increase during the run of the (1+1) EA.

By Lemma 2, the largest independent set in the core of GG contains at most (1+2p)lnk+1(1+\frac{2}{p})\ln k+1 vertices with probability 1o(1)1-o(1), and we condition on this event for the remainder of the proof. Consider the stochastic process (Xt)t(X_{t})_{t\in\mathbb{N}}, which corresponds to the potential in the tt-th iteration.

We seek to bound the drift of (Xt)(X_{t}) after finding a feasible solution. Assume that the (1+1) EA has already found a feasible solution, and let CC be the core vertices of GG. Let xx be the current solution. We make the following case distinction on xx.

Case 1:

C{i:x[i]=0}=C\cap\{i:x[i]=0\}=\emptyset. In this case, all of the vertices in CC are in the cover described by xx. Thus, any fringe vertex can be removed from the current cover and the resulting set is still a cover. A particular vertex is removed from the cover with probability (1/n)(11/n)n1(1/n)(1-1/n)^{n-1} and there are f(x)kf(x)-k fringe vertices, so the drift in this case is

E[XtXt+1Xt]f(x)kn(11n)n1Xten.\operatorname{E}[X_{t}-X_{t+1}\mid X_{t}]\geq\frac{f(x)-k}{n}\left(1-\frac{1}{n}\right)^{n-1}\geq\frac{X_{t}}{en}.
Case 2:

C{i:x[i]=0}C\cap\{i:x[i]=0\}\neq\emptyset. In this case, some of the core vertices are not in the cover described by xx. Let ZC{x[i]=0}Z\coloneqq C\cap\{x[i]=0\} be the set of core vertices that are not in the current cover. Note that since xx is feasible ZZ must be an independent set in CC (otherwise there would be an uncovered edge in CC).

Let ZZ^{\prime} be an arbitrary set of exactly |Z||Z| fringe vertices that belong to the current solution xx, i.e., Z{i:x[i]=1}(VC)Z^{\prime}\subseteq\{i:x[i]=1\}\cap(V\setminus C) with |Z|=|Z||Z^{\prime}|=|Z|. Such a ZZ^{\prime} must exist, otherwise we would have f(x)<kf(x)<k. Let \mathcal{E} denote the event that mutation changes all of the zero-bits corresponding to ZZ into one-bits, and all of the of one-bits corresponding to ZZ^{\prime} to zero. Since each bit is mutated independently, we may invoke the principle of deferred decisions [MU05] and assume that the choices are first made for the bits in ZZ and ZZ^{\prime} to produce a partially mutated offspring. Hence, we assume that \mathcal{E} has occurred, and consider the random choices on the remaining bits corresponding to V(ZZ)V\setminus(Z\cap Z^{\prime}). There are f(x)(k|Z|)=f(x)k+|Z|f(x)-(k-|Z|)=f(x)-k+|Z| fringe vertices in xx, and after removing |Z|=|Z||Z^{\prime}|=|Z| fringe vertices, there are still f(x)k=Xtf(x)-k=X_{t} fringe vertices that have not yet been considered for mutation, so we may assume that we are in Case 1, now with exactly f(x)k=Xtf(x)-k=X_{t} fringe vertices remaining in the cover. Since XtXt+10X_{t}-X_{t+1}\geq 0, by the law of total expectation, we can bound the drift from below as follows.

E[XtXt+1Xt]\displaystyle\operatorname{E}[X_{t}-X_{t+1}\mid X_{t}] E[XtXt+1Xt]Pr()\displaystyle\geq\operatorname{E}[X_{t}-X_{t+1}\mid X_{t}\cap\mathcal{E}]\Pr(\mathcal{E})
n2|Z|Xten,\displaystyle\geq n^{-2|Z|}\frac{X_{t}}{en},

since Pr()=n(|Z|+|Z|)=n2|Z|\Pr(\mathcal{E})=n^{-(|Z|+|Z^{\prime}|)}=n^{-2|Z|}.

In either case, the drift is at least n2|Z|Xtenn^{-2|Z|}\frac{X_{t}}{en}, but we have assumed via Lemma 2 that |Z|(1+2p)lnk+1<2(1+1/p)lnk|Z|\leq(1+\frac{2}{p})\ln k+1<2(1+1/p)\ln k for sufficiently large nn (and hence kk, as klnnk\geq\ln n). Therefore, by the multiplicative drift theorem, the expected time until a kk-cover is found is at most

O(n4(1+1/p)lnknlogn)\displaystyle O(n^{4(1+1/p)\ln k}n\log n) =O(k4(1+1/p)lnnnlogn)\displaystyle=O(k^{4(1+1/p)\ln n}n\log n)
=O(k4k(1+1p)nlogn),\displaystyle=O\left(k^{4k\left(1+\frac{1}{p}\right)}n\log n\right),

since lnn<k\ln n<k. ∎

5 Computational Experiments

To fill in the gaps left open by the previous sections, we report here on a number of experiments that investigate the relationship between the parameters of the planted vertex cover problem. For each experiment, we sample from the 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} model by constructing a random graph on nn vertices choosing each edge with probability pp as long as at least one incident vertex is in the set {1,,k}\{1,\ldots,k\}. After this, we run the standard (1+1) EA (Algorithm 1) until f(x)kf(x)\leq k. For each setting of nn, kk, pp, we run the algorithm for 100 trials (but sample a new graph from 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} each time.

To better understand how the runtime depends on nn on dense graphs in which kk is a small function of nn, we plot the average runtime, varying n=100,,1000n=100,\ldots,1000 and fixing p=0.5p=0.5. This is plotted in Figure 1(a), where we observe a stable runtime varying almost linearly with nn. In Figure 1(b), we show the same data for runs where pp is also varied with nn, i.e., p=1/np=1/n. This corresponds to much sparser graphs, and we see that the runtime has much higher variability, especially for slower growing kk.

Refer to caption
(a) Dense regime (p=0.5p=0.5).
Refer to caption
(b) Sparse regime (p=1/np=1/n).
Figure 1: Runtime dependence on nn for k=lnnk=\ln n and k=nk=\sqrt{n}. Error bars denote standard deviation.

This scaling behavior is not so surprising, as we expect that random planted graphs are particularly easy for the (1+1) EA. Similar to the case of random planted satisfiability [DNS17], the relatively uniform structure of the problem is likely to provide a good fitness signal for hill-climbing type algorithms.

Random distributions of problems often undergo a so-called phase transition as various system parameters are varied. Very often, problems sampled near a critical density tend to be (empirically) harder to solve by different algorithms. For example, empirical evidence suggests critically-constrained planted propositional satisfiability formulas are difficult for the (1+1) EA when they are sampled near a critical density [DNS17]. To study the performance of the (1+1) EA on 𝒢(n,k,p)\mathcal{G}{\left({n},{k},{p}\right)} as a function of graph density, we plot the dependence of the average runtime on pp in Figures 2(a) and 2(b), holding nn fixed and averaging over all values of kk. We also see in this case a dependence on graph density in which the (1+1) EA performs worse in a band of not-too-sparse but not-too-dense graphs.

Refer to caption
(a) n=1000n=1000
Refer to caption
(b) n=200n=200
Figure 2: Runtime dependence on pp for fixed nn varying k=10,,100k=10,\ldots,100. Error bars denote standard deviation.

The dependence of runtime on kk, however, is more uniform as we can see in Figure 3. Here we have aggregated over all pp values, which likely explains the large variance, especially in the larger n=1000n=1000 problems.

Refer to caption
Figure 3: Runtime dependence on kk (pp aggregated). Error bars denote standard deviation.

A more detailed picture is provided by Figures 4(a) and 4(b), where we display two-dimensional color plots showing the runtime dependence on both kk and pp simultaneously. On these plots one can see how the density and the cover size influences the efficiency of the (1+1) EA. We conjecture that there is a critical value (or range) of pp at which the (1+1) EA struggles to find a kk-cover.

Refer to caption
(a) n=200n=200
Refer to caption
(b) n=1000n=1000
Figure 4: Runtime dependence on both kk and pp for fixed nn.

The (1+1) EA completes execution as soon as it finds a kk-cover. However, this is not necessarily guaranteed to be the kk-cover that was planted in the graph. Indeed, for smaller densities, we would expect many other kk-covers in the graph. To investigate this, in Figure 5(a) we plot the proportion of runs in which the planted kk-core was recovered (as opposed to some different kk-cover) as a function of pp. The dependence of this characteristic as a function of kk is plotted in Figure 5(b), and Figures 5(c) and 5(d) display this in a color plot for both kk and pp simultaneously.

Refer to caption
(a) kk-core recovered as a a function of pp.
Refer to caption
(b) kk-core recovered as a function of kk.
Refer to caption
(c) kk-core recovered as a function of kk and pp (n=200n=200).
Refer to caption
(d) kk-core recovered as a function of kk and pp (n=1000n=1000).
Figure 5: Proportion of runs in which the planted kk-core was recovered.

When the graph is relatively sparse, we would also expect the (1+1) EA to “overshoot” kk by finding an even smaller cover before finding a kk-cover. To understand better how this depends on kk and pp, we plot the average difference between kk and the best fitness found as a function of pp on sparse (p=1/np=1/n) instances where nn is varied in Figure 6(a), and on fixed-nn instances in Figures 6(b) and 6(c).

Refer to caption
(a) n=100,,1000n=100,\ldots,1000
Refer to caption
(b) n=1000n=1000
Refer to caption
(c) n=200n=200
Figure 6: Average difference between kk and best fitness found as a function of pp.

6 Conclusion

In this paper we have presented a parameterized analysis the (1+1) EA on problems drawn from the 𝒢(n,p,k)\mathcal{G}{\left({n},{p},{k}\right)} random planted vertex cover model. We showed that for dense graphs (p>0.71)(p>0.71) and small kk, there is sufficient signal in enough of the space so that the (1+1) EA has a relatively good chance of finding a kk-cover in a polynomial-length run. When kk is large, we showed that a feasible cover cannot leave too much of the planted core uncovered, and therefore the (1+1) EA does not require a large effort to make progress. In the end, this translates to a fixed-parameter tractable runtime for the (1+1) EA with high probability over 𝒢(n,p,k)\mathcal{G}{\left({n},{p},{k}\right)}.

To fill in the picture, we also reported a number of computational experiments that measure the runtime on graphs drawn from 𝒢(n,p,k)\mathcal{G}{\left({n},{p},{k}\right)}. These experiments point to a critical value for pp at which the (1+1) EA requires more time to find any kk-cover, which suggest an interesting direction for future theoretical work to understand this phenomenon better.

Acknowledgements

This work was supported by the National Science Foundation under grant 2144080 and by the Australian Research Council under grant FT200100536.

References

  • [AKS98] Noga Alon, Michael Krivelevich, and Benny Sudakov. Finding a large hidden clique in a random graph. Random Structures & Algorithms, 13(3–4):457–466, 1998.
  • [AS15] Emmanuel Abbe and Colin Sandon. Community detection in general stochastic block models: Fundamental limits and efficient algorithms for recovery. In 2015 IEEE 56th Annual Symposium on Foundations of Computer Science (FOCS), pages 670–688, 2015.
  • [BFN+23] Samuel Baguley, Tobias Friedrich, Aneta Neumann, Frank Neumann, Marcus Pappik, and Ziena Zeif. Fixed parameter multi-objective evolutionary algorithms for the w-separator problem. In Genetic and Evolutionary Computation Conference (GECCO ’23), 2023.
  • [BK18] Austin R. Benson and Jon M. Kleinberg. Found graph data and planted vertex covers. In Samy Bengio, Hanna M. Wallach, Hugo Larochelle, Kristen Grauman, Nicolò Cesa-Bianchi, and Roman Garnett, editors, Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, NeurIPS 2018, December 3-8, 2018, Montréal, Canada, pages 1363–1374, 2018.
  • [BS21] Luke Branson and Andrew M. Sutton. Focused jump-and-repair constraint handling for fixed-parameter tractable graph problems. In Proceedings of the 16th ACM/SIGEVO Conference on Foundations of Genetic Algorithms. Association for Computing Machinery, New York, NY, USA, 2021.
  • [DG10] Benjamin Doerr and Leslie Ann Goldberg. Drift analysis with tail bounds. In Proceedings of the Eleventh International Conference on Parallel Problem Solving from Nature (PPSN XI), volume 6238 of Lecture Notes in Computer Science, pages 174–183. Springer, 2010.
  • [DNS17] Benjamin Doerr, Frank Neumann, and Andrew M. Sutton. Time complexity analysis of evolutionary algorithms on random satisfiable k-CNF formulas. Algorithmica, 78(2):561–586, June 2017.
  • [FHH+10] Tobias Friedrich, Jun He, Nils Hebbinghaus, Frank Neumann, and Carsten Witt. Approximating covering problems by randomized search heuristics using multi-objective models. Evolutionary Computation, 18(4):617–633, June 2010.
  • [HLL83] Paul W. Holland, Kathryn Blackmond Laskey, and Samuel Leinhardt. Stochastic blockmodels: First steps. Social Networks, 5(2):109–137, 1983.
  • [Hoe63] Wassily Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301):13–30, 1963.
  • [Jer92] Mark Jerrum. Large cliques elude the metropolis process. Random Structures & Algorithms, 3(4):347–359, 1992.
  • [JOZ13] Thomas Jansen, Pietro S. Oliveto, and Christine Zarges. Approximating vertex cover using edge-based representations. In Frank Neumann and Kenneth A. De Jong, editors, Proceedings of the Twelfth Workshop on Foundations of Genetic Algorithms (FOGA XII), Adelaide, SA, Australia, January 16-20, 2013, pages 87–96. ACM, 2013.
  • [JP00] Ari Juels and Marcus Peinado. Hiding cliques for cryptographic security. Designs, Codes and Cryptograrphy, 20(3):269–280, 2000.
  • [KB94] Sami Khuri and Thomas Bäck. An evolutionary heuristic for the minimum vertex cover problem. In J. Kunze and H. Stoyan, editors, Workshops of the Eighteenth Annual German Conference on Artificial Intelligence (KI-94), Saarbrücken, Germany, pages 86–90, 1994.
  • [KK19] Timo Kötzing and Martin S. Krejca. First-hitting times under drift. Theoretical Computer Science, 796:51–69, 2019.
  • [KN12] Stefan Kratsch and Frank Neumann. Fixed-Parameter Evolutionary Algorithms and the Vertex Cover Problem. Algorithmica, 65(4):754–771, May 2012.
  • [KR08] Subhash Khot and Oded Regev. Vertex cover might be hard to approximate to within (2ϵ)(2-\epsilon). Journal of Computer and System Sciences, 74(3):335–349, 2008. Computational Complexity 2003.
  • [MU05] Michael Mitzenmacher and Eli Upfal. Probability and Computing : Randomized Algorithms and Probabilistic Analysis. Cambridge University Press, 2005.
  • [OHY09] Pietro S. Oliveto, Jun He, and Xin Yao. Analysis of the (1+1) EA for finding approximate solutions to vertex cover problems. IEEE Transactions on Evolutionary Computation, 13(5):1006–1029, 2009.
  • [RUK19] Daniel M. Romero, Brian Uzzi, and Jon M. Kleinberg. Social networks under stress: Specialized team roles and their communication structure. ACM Trans. Web, 13(1):6:1–6:24, 2019.
  • [Sto07] Tobias Storch. Finding large cliques in sparse semi-random graphs by simple randomized search heuristics. Theoretical Computer Science, 386:114–131, 2007.