
Department of Applied Mathematics, Faculty of Mathematics and Physics,
Charles University in Prague, Czech Republic
kolman@kam.mff.cuni.cz, koutecky@kam.mff.cuni.cz

Extended Formulation for CSP that is Compact for Instances of Bounded Treewidth
This research was partially supported by the project 14-10003S of GA ČR.

Petr Kolman    Martin Koutecký
Abstract

In this paper we provide an extended formulation for the class of constraint satisfaction problems and prove that its size is polynomial for instances whose constraint graph has bounded treewidth. This implies new upper bounds on extension complexity of several important NP-hard problems on graphs of bounded treewidth.

1 Introduction

Many important combinatorial optimization problems belong to the class of constraint satisfaction problems (CSP). Naturally, a lot of effort has been devoted to designing efficient approximation algorithms for CSP, to proving complexity lower bounds for CSP, and to identifying tractable instances of CSP (e.g., from the point of view of parameterized complexity). In particular, it has been shown that CSP is solvable in polynomial time for instances whose constraint graph has bounded treewidth [7].

In recent years, a lot of attention has been devoted to the study of extension complexity [5]: what is the minimum number of inequalities describing a polytope whose (suitably chosen) linear projection coincides with the convex hull $H$ of all integral solutions of a given problem instance $Q$? Such a polytope is called an extended formulation of $H$. Note that membership of a problem in the class P of polynomially solvable problems does not necessarily imply the existence of an extended formulation of polynomial size [16]. In this work, we present an extended formulation for CSP and show that its size is polynomial for instances of CSP whose constraint graph has bounded treewidth.

1.1 Notation and Terminology

An instance $Q=(V,\mathcal{D},\mathcal{H},\mathcal{C})$ of CSP consists of

  • a set of variables $z_v$, one for each $v\in V$; without loss of generality we assume that $V=\{1,\ldots,n\}$,

  • a set $\mathcal{D}$ of finite domains $D_v\subseteq\mathbb{R}$ (also denoted $D(v)$), one for each $v\in V$,

  • a set of hard constraints $\mathcal{H}\subseteq\{C_U \mid U\subseteq V\}$, where each hard constraint $C_U\in\mathcal{H}$ with $U=\{i_1,i_2,\dots,i_k\}$ and $i_1<i_2<\cdots<i_k$ is a $|U|$-ary relation $C_U\subseteq D_{i_1}\times D_{i_2}\times\cdots\times D_{i_k}$,

  • a set of soft constraints $\mathcal{C}\subseteq\{C_U \mid U\subseteq V\}$, where each soft constraint $C_U\in\mathcal{C}$ with $U=\{i_1,i_2,\dots,i_k\}$ and $i_1<i_2<\cdots<i_k$ is a $|U|$-ary relation $C_U\subseteq D_{i_1}\times D_{i_2}\times\cdots\times D_{i_k}$.

The constraint graph of $Q$ is defined as $G=(V,E)$ where $E=\{\{u,v\} \mid \exists C_U\in\mathcal{C}\cup\mathcal{H} \text{ s.t. } \{u,v\}\subseteq U\}$. We say that a CSP instance $Q$ has bounded treewidth if the constraint graph of $Q$ has bounded treewidth. In binary CSP, every hard and soft relation is a unary or binary relation, and in boolean CSP, the domain of every variable is $\{0,1\}$. We use $D$ to denote the maximum size of the domains, that is, $D=\max_{u\in V}|D_u|$.

For a vector $z=(z_1,z_2,\ldots,z_n)$ and $U=\{i_1,i_2,\dots,i_k\}\subseteq V$ with $i_1<i_2<\cdots<i_k$, we define the projection of $z$ on $U$ as $z|_U=(z_{i_1},z_{i_2},\ldots,z_{i_k})$. A vector $z\in\mathbb{R}^n$ satisfies the constraint $C_U\in\mathcal{C}\cup\mathcal{H}$ if and only if $z|_U\in C_U$. We say that a vector $z^\star=(z^\star_1,\ldots,z^\star_n)$ is a feasible assignment for $Q$ if $z^\star\in D_1\times D_2\times\cdots\times D_n$ and $z^\star$ satisfies every hard constraint $C\in\mathcal{H}$. For a given feasible assignment $z^\star$ we define an extended feasible assignment $\mathrm{ex}(z^\star)=(z^\star,h^\star)\in\mathbb{R}^{n+|\mathcal{C}|}$ as follows: the coordinates of $h^\star$ are indexed by the soft constraints from $\mathcal{C}$ (to be more precise: by the subsets $U$ of $V$ used as lower indices of the soft constraints) and for each $C_U\in\mathcal{C}$, we have $h^\star_U=1$ if and only if $z^\star|_U\in C_U$, and $h^\star_U=0$ otherwise. We denote by $\mathcal{F}(Q)$ the set of all feasible assignments for $Q$, and by $\mathcal{F}^{ex}(Q)=\{\mathrm{ex}(z^\star) \mid z^\star\in\mathcal{F}(Q)\}$ the set of all extended feasible assignments for $Q$. For every instance $Q$ we define two polytopes: $CSP(Q)$ is the convex hull of $\mathcal{F}^{ex}(Q)$ and $CSP'(Q)$ is the convex hull of $\mathcal{F}(Q)$. We also define three trivial linear projections:

  • $\mathrm{proj}_V(z,h)=z$,   $\mathrm{proj}_E(z,h)=h$,   $\mathrm{proj}_{id}(z,h)=(z,h)$

where $z\in\mathbb{R}^n$ and $h\in\mathbb{R}^{|\mathcal{C}|}$, and observe that $\mathrm{proj}_V(CSP(Q))=CSP'(Q)$.
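As a toy illustration of these definitions (our own example, not from the paper): take $n=2$, $D_1=D_2=\{0,1\}$, $\mathcal{H}=\emptyset$ and a single soft constraint $C_{\{1,2\}}=\{(0,1),(1,0)\}$. Then $z^\star=(0,1)$ is a feasible assignment with $\mathrm{ex}(z^\star)=(0,1,1)$, while $z^\star=(0,0)$ gives $\mathrm{ex}(z^\star)=(0,0,0)$; the polytope $CSP(Q)$ is the convex hull of the four points $(0,0,0),(1,1,0),(0,1,1),(1,0,1)$ in $\mathbb{R}^3$, and $\mathrm{proj}_V$ maps it onto the unit square $CSP'(Q)=[0,1]^2$.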

In the decision version of CSP, the set $\mathcal{C}$ of soft constraints is empty and the task is to decide whether there exists a feasible assignment. In the maximization (minimization, resp.) version of the problem, the task is to find a feasible assignment that maximizes (minimizes, resp.) the number of satisfied (unsatisfied, resp.) soft constraints. Note that there is no difference between the maximization and minimization versions of the problem with respect to optimal solutions, but the two versions differ significantly from an approximation perspective.

In the weighted version of CSP we are also given a weight function $w:\mathcal{C}\rightarrow\mathbb{R}$ that specifies for each soft constraint $C\in\mathcal{C}$ its weight $w(C)$. The goal is to find a feasible assignment that maximizes (minimizes, resp.) the total weight of satisfied (unsatisfied, resp.) constraints. The unweighted version of CSP is equivalent to the weighted version with $w(C)=1$ for all $C\in\mathcal{C}$.

Even more generally, the relations in the soft constraints can be replaced by bounded real-valued payoff functions: a soft constraint $C_U\in\mathcal{C}$ with $U=\{i_1,i_2,\dots,i_k\}$ is not a $|U|$-ary relation but a function $w:D_{i_1}\times D_{i_2}\times\cdots\times D_{i_k}\rightarrow\mathbb{R}$, and the payoff of the soft constraint $C_U$ for a feasible assignment $z^\star$ is $w(z^\star|_U)$; the objective is to maximize (minimize, resp.) the total payoff. For the sake of simplicity of the presentation we do not consider the problem in this generality, although the techniques used in this paper apply in the general setting as well.

For notions related to the treewidth of a graph, we stick to the standard terminology as given in the book by Kloks [10].

1.2 Related Work

CSP for graphs of bounded treewidth.

As CSP captures many NP-hard problems, it is natural to identify tractable special cases of CSP. Freuder [7] showed that CSP instances with treewidth bounded by $\tau$ can be solved in time $O(D^\tau n)$. Later, Grohe et al. [8] proved that, assuming $FPT\neq W[1]$, this is essentially the only nontrivial class of graphs for which CSP is solvable in polynomial time (cf. Marx [12]). A sketch of the underlying dynamic program is given below.
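To make the algorithmic background concrete, the following minimal Python sketch illustrates the kind of dynamic program behind this result for the decision version: process a rooted tree decomposition bottom-up and keep, for every bag, those bag assignments that extend to a feasible assignment of the whole subtree below it. The input format and all names are our own illustrative choices, not taken from [7] or from this paper.

```python
from itertools import product

def feasible_bag_assignments(bag, domains, hard):
    """All assignments of the bag's variables consistent with every hard
    constraint whose scope lies entirely inside the bag."""
    bag = tuple(sorted(bag))
    for values in product(*(domains[v] for v in bag)):
        a = dict(zip(bag, values))
        if all(tuple(a[v] for v in U) in rel
               for U, rel in hard.items() if set(U) <= set(bag)):
            yield frozenset(a.items())

def csp_decision(bags, children, root, domains, hard):
    """Bottom-up DP over a rooted tree decomposition (hedged sketch).

    bags:     node -> iterable of CSP variables (the bag B(a))
    children: node -> list of child nodes
    domains:  variable -> list of admissible values
    hard:     scope (sorted tuple of variables) -> set of allowed tuples
    Assumes, as a tree decomposition of the constraint graph guarantees,
    that the scope of every hard constraint is contained in some bag.
    """
    def consistent(a, b):              # two bag assignments agree on shared variables
        db = dict(b)
        return all(v not in db or db[v] == x for v, x in a)

    def solve(node):                   # bag assignments extendable to the whole subtree
        cands = list(feasible_bag_assignments(bags[node], domains, hard))
        for child in children.get(node, []):
            child_ok = solve(child)
            cands = [a for a in cands if any(consistent(a, b) for b in child_ok)]
        return cands

    return len(solve(root)) > 0        # roughly D^(tau+1) candidates per bag, n bags
```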

Describing the polytope of CSP solutions by the means of linear programming, for instances of bounded treewidth, is not a new idea. In 2007, Sellmann et al. published a paper [18] in which they described a linear program that was supposed to define the convex hull of all feasible solutions of a binary CSP when the constraint graph is a tree. They also provided a procedure to convert a given CSP instance with bounded treewidth into one whose constraint graph is a tree, at the cost of blowing up the number of variables and constraints by a function of the treewidth. Unfortunately, there was a substantial bug in their proof and one of the main theorems in the paper does not even hold [17].

The paper [18] also implicitly includes the following folklore result: if the constraint graph has treewidth at most $\tau$, then CSP can be solved by $\tau$ levels of the Sherali-Adams hierarchy. The resulting formulation is of size $\mathcal{O}(n^\tau)$, while our approach yields size $\mathcal{O}(D^\tau n)$.

CSP for general graphs.

Chan et al. [4] study the extent to which linear programming relaxations can be used to approximate CSPs. They show that polynomial-sized LPs are exactly as powerful as LPs obtained from a constant number of rounds of the Sherali-Adams hierarchy. They also prove integrality gaps for polynomial-sized LPs for some CSPs.

Raghavendra [13] shows that under the Unique Games Conjecture, a certain simple SDP relaxation achieves the best approximation ratio for every CSP. In a follow-up paper, Raghavendra and Steurer [14] describe an efficient rounding scheme that achieves the integrality gap of the simple SDP relaxation, and, in another paper [15], they show unconditionally that the integrality gap of this SDP relaxation cannot be reduced by Sherali-Adams hierarchies.

Other related results.

Buchanan and Butenko [3] provide an extended formulation for the independent set problem, a special case of CSP, that has size $O(2^\tau n)$ where $\tau$ denotes the treewidth of the given graph. Our results can be viewed as a generalization of this result: the size of our formulation, when applied to the independent set problem, is also $O(2^\tau n)$.

In a recent work, Bienstock and Munoz [2] define a class of so-called general binary optimization problems, which are essentially weighted boolean CSP problems, and for instances of treewidth $\tau$ they provide an LP formulation of size $O(2^\tau n)$. Again, this is a special case of our result in this paper. It is worth mentioning at this point that every CSP instance can be transformed into a boolean CSP instance; however, the standard transformation results in a substantial increase (in some cases even $\Omega(D)$) of the treewidth of the constraint graph.

1.3 New Results

Our main result is summarized as the following theorem.

Theorem 1.1

For every instance $Q=(V,\mathcal{D},\mathcal{H},\mathcal{C})$ of CSP, there exists an extended formulation $P(Q)$ of $CSP(Q)$ and $CSP'(Q)$ of size $\mathcal{O}(D^\tau n)$, where $\tau$ is the treewidth of $Q$; moreover, the corresponding LP can be constructed in time $\mathcal{O}(D^\tau n)$.

As a corollary we obtain upper bounds on the extension complexity of several NP-hard problems on the class of graphs of bounded treewidth; as far as we know, these bounds were not previously known.

2 CSP Polytope

2.1 Integer Linear Programming Formulation

We start by introducing the terms and notation that we use throughout this section. We assume that $Q=(V,\mathcal{D},\mathcal{H},\mathcal{C})$ is a given instance of CSP. For every subset $W\subseteq V$ we define the set of all configurations of $W$ as

$\mathcal{K}(W)=\{(\alpha_1,\dots,\alpha_n) \mid \alpha_i\in D_i \text{ for } i\in W,\ \alpha_i=\lambda \text{ for } i\notin W,\ \text{and } \forall C_U\in\mathcal{H}\ (U\subseteq W \rightarrow \alpha|_U\in C_U)\}$

where $\lambda$ is a symbol not appearing in any of the domains $D_u$, $u\in V$. For a configuration $K\in\mathcal{K}(U)$ and $v\in V$, we use the notation $K(v)$ to refer to the $v$-th element of $K$. Also, for a configuration $K\in\mathcal{K}(U)$, $v\in V\setminus U$ and $\alpha\in D_v$, we use the notation $K[v\leftarrow\alpha]$ to denote the configuration $K'\in\mathcal{K}(U\cup\{v\})$ such that $K'(v)=\alpha$ and $K'(u)=K(u)$ for every $u\neq v$.

For an $n$-dimensional vector $K=(\alpha_1,\dots,\alpha_n)$ and a subset of variables $U\subseteq V$, we denote by $K{\upharpoonright_U}$ the restriction of $K$ to $U$, defined as the $n$-dimensional vector with $K{\upharpoonright_U}(i)=K(i)$ for $i\in U$ and $K{\upharpoonright_U}(i)=\lambda$ for $i\notin U$ (i.e., we set to $\lambda$ all coordinates of $K$ outside of $U$). We denote by $\Lambda$ the configuration $(\lambda,\dots,\lambda)\in\mathcal{K}(\emptyset)$; note that for $\alpha\in D_v$, $\Lambda[v\leftarrow\alpha]$ is the configuration from $\mathcal{K}(\{v\})$ with exactly one non-$\lambda$ element, namely the $v$-th element, equal to $\alpha$.
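To fix the notation, here is a tiny Python sketch (purely illustrative; $\lambda$ is represented by None and $V=\{1,\ldots,n\}$) of configurations and of the operations $K(v)$, $K[v\leftarrow\alpha]$ and $K{\upharpoonright_U}$ used below.

```python
LAMBDA = None  # stands for the symbol λ, a value outside every domain

def blank(n):
    """The configuration Λ = (λ, …, λ) ∈ K(∅), indexed by v = 1, …, n."""
    return (LAMBDA,) * n

def value(K, v):
    """K(v): the v-th element of the configuration K (1-based)."""
    return K[v - 1]

def extend(K, v, alpha):
    """K[v ← α]: set the v-th coordinate of K to α and keep the rest."""
    return K[:v - 1] + (alpha,) + K[v:]

def restrict(K, U):
    """K ↾_U: keep the coordinates in U, set all the others to λ."""
    return tuple(x if v in U else LAMBDA for v, x in enumerate(K, start=1))

# e.g. with n = 3:  Λ[2 ← 5] = (λ, 5, λ)  and  (1, 5, 7) ↾_{1,3} = (1, λ, 7)
assert extend(blank(3), 2, 5) == (LAMBDA, 5, LAMBDA)
assert restrict((1, 5, 7), {1, 3}) == (1, LAMBDA, 7)
```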

In our linear program, for every index $v\in V$ and every $i\in D_v$, we introduce a binary variable $y_v^i$. The task of the variable $y_v^i$ is to encode the value of the CSP-variable $z_v$: the variable $y_v^i$ is set to one if and only if $z_v=i$. Since in every solution each variable assumes a unique value, we enforce the constraint $\sum_{i\in D(v)} y_v^i=1$ for each $v\in V$.

For every configuration $K\in\bigcup_{U:C_U\in\mathcal{C}\cup\mathcal{H}}\mathcal{K}(U)$ we introduce a binary variable $g(K)$. The intended meaning of the variable $g(K)$, for $K\in\mathcal{K}(U)$ and $U\subseteq V$, is to provide information about the values of the CSP-variables $z_u$ for $u\in U$ in the following way: $g(K)=1$ if and only if for every $u\in U$, $z_u=K(u)$. To ensure consistency between the $y$ and $g$ variables, for every $C_U\in\mathcal{C}\cup\mathcal{H}$, every $v\in U$ and every $i\in D(v)$, we enforce the constraint $\sum_{K\in\mathcal{K}(U):K(v)=i} g(K)=y_v^i$. Note that for binary CSP, the $g$ variables capture the values of the CSP-variables $z$ for pairs of elements of $V$ that correspond to edges of the constraint graph.

Relaxing the integrality constraints, we obtain the following initial LP relaxation of the CSP problem $Q=(V,\mathcal{D},\mathcal{H},\mathcal{C})$:

$\sum_{i\in D(v)} y_v^i = 1 \qquad \forall v\in V$   (1)

$\sum_{K\in\mathcal{K}(U):\,K(v)=i} g(K) = y_v^i \qquad \forall C_U\in\mathcal{C}\cup\mathcal{H},\ \forall v\in U,\ \forall i\in D(v)$   (2)

$0\le\bm{y},\bm{g}\le 1$   (3)

Note that there is a one-to-one correspondence between the (extended) feasible assignments of $Q$ and the integral solutions of (1)–(3); from now on we denote by $\mathrm{proj}_1$ the linear projection of the convex hull of the integral solutions of (1)–(3) to $CSP(Q)$. Also observe that the total weight of the CSP-constraints satisfied by an integral vector $(\bm{y},\bm{g})$ satisfying (1)–(3) is

$\sum_{C_U\in\mathcal{C}} w(C_U) \sum_{K\in\mathcal{K}(U):\,K|_U\in C_U} g(K)$.

(In the case of general payoff functions, the total weight is given by $\sum_{C_U\in\mathcal{C}}\sum_{K\in\mathcal{K}(U):\,K|_U\in C_U} w(K|_U)\,g(K)$.)
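For concreteness, the following Python sketch assembles the variables and the equality constraints (1)–(2) as plain data that could be fed to any LP solver. Everything about the encoding is our own simplification: a configuration of $U$ is stored just as the tuple of its non-$\lambda$ values, and the filtering of configurations by hard constraints inside $U$ is omitted.

```python
from itertools import product

def build_initial_lp(n, domains, scopes):
    """Variables and equality constraints (1)-(2) of the initial relaxation.

    domains: v -> list of admissible values D(v), for v = 1, …, n
    scopes:  list of scopes U (sorted tuples of variables) of all soft and
             hard constraints in C ∪ H
    Returns (y, g, eqs), where each equation is (coefficients, right-hand side).
    """
    y = {(v, i): f"y_{v}_{i}" for v in range(1, n + 1) for i in domains[v]}

    def configs(U):                      # simplified stand-in for K(U)
        return product(*(domains[v] for v in U))

    g = {(U, K): f"g_{U}_{K}" for U in scopes for K in configs(U)}

    eqs = []
    for v in range(1, n + 1):            # (1): sum_i y_v^i = 1
        eqs.append(({y[v, i]: 1 for i in domains[v]}, 1))
    for U in scopes:                     # (2): sum_{K: K(v)=i} g(K) - y_v^i = 0
        for pos, v in enumerate(U):
            for i in domains[v]:
                row = {g[U, K]: 1 for K in configs(U) if K[pos] == i}
                row[y[v, i]] = -1
                eqs.append((row, 0))
    return y, g, eqs                     # bounds (3): every variable lies in [0, 1]
```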

Unfortunately, even for CSP problems whose constraint graph is series-parallel, the polytope given by the LP (1)–(3) is not integral (consider, e.g., the instance of CSP corresponding to the independent set problem on $K_3$). The weakness of the formulation is that no global consistency among the $\bm{y}$ variables is guaranteed. To strengthen the relaxation, we introduce new variables and constraints derived from a tree decomposition of the constraint graph of $Q$.

2.2 Extended Formulation

Here we describe, for every CSP instance $Q=(V,\mathcal{D},\mathcal{H},\mathcal{C})$, a polytope $P(Q)$, and in the next subsection we prove that $P(Q)$ is an extended formulation of $CSP(Q)$ and $CSP'(Q)$. The set of variables in the given LP description of $P(Q)$ is substantially different from the set of variables used in the LP (1)–(3), and the set of new constraints is completely different from the set of constraints in the LP (1)–(3). Whereas in the previous subsection there is (roughly) a variable $g(K)$ for every feasible assignment of every subset of CSP-variables corresponding to a soft or hard constraint, here we have a variable for every feasible assignment of every subset of CSP-variables corresponding to a bag in a given tree decomposition of the constraint graph. Nevertheless, as we show after defining $P(Q)$, there exists a simple linear projection of $P(Q)$ to the convex hull of all integral points in the polytope given by the LP (1)–(3).

Let $T=(V_T,E_T)$ be a fixed nice tree decomposition [10] of the constraint graph of $Q$ and, for every node $a\in V_T$, let $B(a)\subseteq V$ denote the corresponding bag. Let $\mathcal{B}=\{B(a) \mid a\in V_T\}$ denote the set of all bags of $T$. Let $\mathcal{K}_{\mathcal{B}}=\bigcup_{B\in\mathcal{B}}\mathcal{K}(B)$ be the set of all configurations of all bags in $T$. We use $V_I\subseteq V_T$ to denote the subset of all introduce nodes in $T$ and $V_F\subseteq V_T$ to denote the subset of all forget nodes in $T$.

For every configuration $K\in\mathcal{K}_{\mathcal{B}}$ we introduce a binary variable $f(K)$. As in the previous subsection, the intended meaning of the variable $f(K)$, for $K\in\mathcal{K}(B)$ and $B\in\mathcal{B}$, is to provide information about the values of the CSP-variables $z_u$ for $u\in B$ in the following way: $f(K)=1$ if and only if for every $u\in B$, $z_u=K(u)$. To ensure consistency among variables indexed by the configurations of the same bag, namely to ensure that for every $B\in\mathcal{B}$ there exists exactly one configuration $K\in\mathcal{K}(B)$ with $f(K)=1$, we introduce for every $B\in\mathcal{B}$ the LP constraint $\sum_{K\in\mathcal{K}(B)} f(K)=1$.

For every introduce node $c\in V_T$ with a child $b\in V_T$ and for every configuration $K\in\mathcal{K}(B(b))$ we have the constraint $\sum_{K'\in\mathcal{K}(B(c)):\,K'{\upharpoonright_{B(b)}}=K} f(K')=f(K)$, and symmetrically, for every forget node $c\in V_T$ with a child $b\in V_T$ and for every configuration $K\in\mathcal{K}(B(c))$ we have the constraint $\sum_{K'\in\mathcal{K}(B(b)):\,K'{\upharpoonright_{B(c)}}=K} f(K')=f(K)$.

Relaxing the integrality constraints and putting all these additional constraints together, we obtain:

$\sum_{K\in\mathcal{K}(B)} f(K) = 1 \qquad \forall B\in\mathcal{B}$   (4)

$\sum_{K'\in\mathcal{K}(B(c)):\,K'{\upharpoonright_{B(b)}}=K} f(K') = f(K) \qquad \forall c\in V_I,\ \forall K\in\mathcal{K}(B(b))$, where $b$ is the only child of $c$   (5)

$\sum_{K'\in\mathcal{K}(B(b)):\,K'{\upharpoonright_{B(c)}}=K} f(K') = f(K) \qquad \forall c\in V_F,\ \forall K\in\mathcal{K}(B(c))$, where $b$ is the only child of $c$   (6)

$0\le\bm{f}\le 1$   (7)

For the given CSP instance $Q$, we denote the polytope associated with the LP (4)–(7) by $P(Q)$.
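The LP (4)–(7) can be assembled mechanically from a nice tree decomposition; the sketch below does so in the same plain-data style as before. The input format (node kinds, a `child` map for introduce and forget nodes, configurations encoded as frozensets of (variable, value) pairs) is our own illustrative choice.

```python
from itertools import product

def build_extended_lp(nodes, bags, kind, child, domains, hard):
    """Constraints (4)-(6) of P(Q) from a nice tree decomposition (sketch).

    nodes: list of decomposition nodes        bags: node -> sorted tuple B(a)
    kind:  node -> "leaf" / "introduce" / "forget" / "join"
    child: node -> its only child (for introduce and forget nodes)
    hard:  scope (sorted tuple) -> set of allowed tuples
    """
    def configs(B):              # K(B): values inside B, hard constraints inside B enforced
        for vals in product(*(domains[v] for v in B)):
            a = dict(zip(B, vals))
            if all(tuple(a[v] for v in U) in rel
                   for U, rel in hard.items() if set(U) <= set(B)):
                yield frozenset(a.items())

    def restrict(K, B):          # K ↾_B, in the sparse encoding
        return frozenset((v, x) for v, x in K if v in B)

    f = {(a, K): f"f_{a}_{sorted(K)}" for a in nodes for K in configs(bags[a])}

    eqs = []
    for a in nodes:              # (4): the configurations of one bag sum to 1
        eqs.append(({f[a, K]: 1 for K in configs(bags[a])}, 1))
    for c in nodes:
        if kind[c] in ("introduce", "forget"):
            b = child[c]
            big, small = (c, b) if kind[c] == "introduce" else (b, c)
            for K in configs(bags[small]):   # (5) and (6): consistency along tree edges
                row = {f[big, Kp]: 1 for Kp in configs(bags[big])
                       if restrict(Kp, bags[small]) == K}
                row[f[small, K]] = -1
                eqs.append((row, 0))
    return f, eqs                # bounds (7): 0 <= f <= 1
```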

Consider now a vector $\bm{f}\in P(Q)$ and the following set of linear equations:

$y_v^i = \sum_{K\in\mathcal{K}(B):\,K(v)=i} f(K) \qquad \forall B\in\mathcal{B},\ \forall v\in B,\ \forall i\in D_v$   (8)

$g(K) = \sum_{K'\in\mathcal{K}(B):\,K'{\upharpoonright_U}=K} f(K') \qquad \forall B\in\mathcal{B},\ \forall C_U\in\mathcal{C}\cup\mathcal{H} \text{ s.t. } U\subseteq B,\ \forall K\in\mathcal{K}(U)$   (9)

It is just a technical exercise to check that for a given $\bm{f}\in P(Q)$, there always exists a unique solution $(\bm{y},\bm{g})$ of this system of equations, and that this unique $(\bm{y},\bm{g})$ is a linear projection of $\bm{f}$. Moreover, such a vector $(\bm{y},\bm{g})$ also satisfies the LP constraints (1)–(3). The point is that there exists a linear projection of $P(Q)$ into the polytope defined by the LP (1)–(3); moreover, an integral point of $P(Q)$ is mapped to an integral point. From now on we denote this projection by $\mathrm{proj}_2$; a sketch of how it can be evaluated follows.
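A hedged sketch of how $\mathrm{proj}_2$ can be evaluated, on top of the data layout used in the previous sketch: since the system (8)–(9) has a unique solution, every $y_v^i$ and every $g(K)$ can be read off from one (arbitrary) bag containing $v$, respectively $U$.

```python
def project_to_initial_lp(f_values, nodes, bags, scopes, domains):
    """proj_2: recover (y, g) from a point of P(Q) via (8) and (9).

    f_values: (node, configuration) -> value of the corresponding f-variable,
              with configurations encoded as frozensets of (variable, value) pairs
    scopes:   list of the scopes U of all soft and hard constraints
    """
    def bag_values(a):                   # the f-values of the configurations of bag a
        return {K: val for (node, K), val in f_values.items() if node == a}

    y = {}
    for a in nodes:
        for v in bags[a]:
            for i in domains[v]:         # (8); any bag containing v gives the same value
                y[v, i] = sum(val for K, val in bag_values(a).items() if (v, i) in K)

    g = {}
    for U in scopes:
        a = next(a for a in nodes if set(U) <= set(bags[a]))   # some bag with U ⊆ B(a)
        for K, val in bag_values(a).items():
            KU = frozenset((v, x) for v, x in K if v in U)     # K' ↾_U
            g[U, KU] = g.get((U, KU), 0) + val                 # (9)
    return y, g
```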

2.3 Proof of Theorem 1.1

As in the previous subsections, we assume that $Q=(V,\mathcal{D},\mathcal{H},\mathcal{C})$ is a given instance of CSP, $G=(V,E)$ is the constraint graph of $Q$, and $T=(V_T,E_T)$ is a fixed nice tree decomposition of $G$. We start by introducing several notions that will help us deal with tree decompositions and our linear program.

For a node $a\in V_T$, let $T(a)=(V_a,E_a)$ be the subtree of $T$ rooted at $a$; the configurations relevant to $T(a)$ are those in the set $\mathcal{R}(a)=\bigcup_{b\in V_a}\mathcal{K}(B(b))$, and the variables relevant to $T(a)$ are those $f(K)$ for which $K\in\mathcal{R}(a)$. For succinctness of notation, we denote the projection $\bm{f}|_{\mathcal{R}(a)}$ of the vector $\bm{f}$ on the set of variables relevant to $T(a)$ also by $\bm{f}|_a$. The constraints relevant to $T(a)$ are those containing only the variables relevant to $T(a)$. We say that a vector $I\in\{0,1\}^{\mathcal{R}(a)}$ agrees with the configuration $K\in\mathcal{R}(a)$ if $I(K)=1$.

Let $\bm{f}$ be a fixed solution of the LP (4)–(7) that corresponds to a vertex of the polytope $P(Q)$. Our main tool is the following lemma.

Lemma 1

For every node $b\in V_T$, there exist a positive integer $M$ and binary vectors $I_1,I_2,\dots,I_M\in\{0,1\}^{\mathcal{R}(b)}$, some possibly identical, such that

  • ($\spadesuit$) every $I_i$ satisfies the constraints relevant to $T(b)$,

  • ($\clubsuit$) $\bm{f}|_b=\frac{1}{M}\sum_{i=1}^{M} I_i$.

Proof

By induction. We start in the leaves of $T$ and proceed in a bottom-up fashion.

Base case.

Assume that $b\in V_T$ is a leaf of the nice decomposition tree $T$. By the definition of a nice tree decomposition, the bag $B(b)$ consists of a single vertex, say a vertex $v\in V$. The only variables relevant to $T(b)$ are $f(K)$ for $K\in\mathcal{K}(B(b))=\{\Lambda[v\leftarrow j] \mid j\in D(v)\}$, and the only relevant constraints are those of type (4) and (7).

Let MM^{\prime}\in\mathbb{N} be such that an MM^{\prime}-multiple of every relevant variable is integral; as 𝒇\bm{f} is a solution corresponding to a vertex of the polytope P(Q)P(Q), all the variables are rational which guarantees that such an MM^{\prime} exists. For every jDvj\in D_{v} we define an integral vector IjI_{j} such that Ij(Λ[vj])=1I_{j}(\Lambda[v\leftarrow j])=1 and Ij(Λ[vi])=0I_{j}(\Lambda[v\leftarrow i])=0 for every iji\neq j.

The vector $I_j$ will appear with multiplicity $M'\cdot f(\Lambda[v\leftarrow j])$ among the integral vectors $I_1,\ldots,I_{M'}$. Then, obviously, both properties $\spadesuit$ and $\clubsuit$ are satisfied.
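For illustration (our own toy numbers): suppose $D_v=\{1,2\}$, $f(\Lambda[v\leftarrow 1])=\frac{1}{3}$ and $f(\Lambda[v\leftarrow 2])=\frac{2}{3}$. Taking $M'=3$, the list $I_1,I_2,I_3$ consists of one copy of the vector with a one at $\Lambda[v\leftarrow 1]$ and two copies of the vector with a one at $\Lambda[v\leftarrow 2]$; then $\frac{1}{3}\sum_{i=1}^{3} I_i=\bm{f}|_b$, so $\clubsuit$ holds, and each $I_i$ clearly satisfies (4) and (7), so $\spadesuit$ holds as well.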

Inductive step.

Consider an internal node $c\in V_T$ of the nice decomposition tree $T$. We distinguish three cases: $c$ is a join node, $c$ is an introduce node, and $c$ is a forget node.

Join node. Assume that the two children of the join node $c$ are $a$ and $b$. Recall that $B(a)=B(b)=B(c)$. By the inductive assumption, there exist integers $M$ and $M'$ and integral vectors $I_1,\ldots,I_M\in\{0,1\}^{\mathcal{R}(a)}$, each of them satisfying the constraints relevant to $T(a)$ and such that $\bm{f}|_a=\frac{1}{M}\sum_{i=1}^{M} I_i$, and integral vectors $J_1,\ldots,J_{M'}\in\{0,1\}^{\mathcal{R}(b)}$, each of them satisfying the constraints relevant to $T(b)$ and such that $\bm{f}|_b=\frac{1}{M'}\sum_{i=1}^{M'} J_i$.

Two vectors $I_i$ and $J_j$ that agree with a given configuration $K\in\mathcal{K}(B(c))$ can easily be merged into an integral vector $L\in\{0,1\}^{\mathcal{R}(c)}$ that satisfies $L|_a=I_i$ and $L|_b=J_j$; as the set of all constraints relevant to $T(c)$ is the union of the constraints relevant to $T(a)$ and the constraints relevant to $T(b)$, the vector $L$ also satisfies all the constraints relevant to $T(c)$.

For simplicity we assume, without loss of generality, that $M=M'$. Then, by property $\clubsuit$ and since $B(a)=B(b)=B(c)$, for every configuration $K\in\mathcal{K}(B(c))$, the number of vectors $I_i$ that agree with $K$ is equal to the number of vectors $J_j$ that agree with $K$, namely $M\cdot f(K)$. Thus, it is possible to match the vectors $I_i$ and $J_j$ one to one in such a way that both vectors in each pair agree with the same configuration; let $L_1,L_2,\ldots,L_M$ denote the result of their merging as described above. Then the vectors $L_i$ satisfy property $\spadesuit$ as explained in the previous paragraph, and by construction they also satisfy property $\clubsuit$.

Introduce node. Assume that the only child of the introduce node $c$ is a node $b$ and $B(c)=B(b)\cup\{v\}$. By the inductive assumption, there exist an integer $M$ and integral vectors $I_1,\ldots,I_M\in\{0,1\}^{\mathcal{R}(b)}$, each of them satisfying the constraints relevant to $T(b)$ and such that $\bm{f}|_b=\frac{1}{M}\sum_{i=1}^{M} I_i$. Without loss of generality we assume that for every variable relevant to $T(c)$, its $M$-multiple is integral. We partition the vectors $I_1,\ldots,I_M$ into several groups indexed by the configurations from $\mathcal{K}(B(b))$: the group $Z_K$, for $K\in\mathcal{K}(B(b))$, consists exactly of those vectors $I_i$ that agree with $K$.

Consider a fixed configuration $K\in\mathcal{K}(B(b))$ and the corresponding group $Z_K$. Note that the size of this group is $M\cdot f(K)$. We further partition the group $Z_K$ into at most $|D_v|$ subgroups $Z_{K'}$, where $K'=K[v\leftarrow j]$, for every $j\in D_v$ satisfying $K[v\leftarrow j]\in\mathcal{K}(B(c))$, in such a way that $Z_{K'}$ contains exactly $M\cdot f(K')$ vectors (it does not matter which ones); the LP constraint (5) makes this possible. Then, for every $j\in D_v$, we create from every vector $I\in Z_{K[v\leftarrow j]}$ a new integral vector $J_I$ in the following way:

  • for every $\bar K\in\mathcal{R}(b)$, $J_I(\bar K)=I(\bar K)$; this guarantees $J_I|_b=I$,

  • $J_I(K[v\leftarrow j])=1$,

  • $J_I(K[v\leftarrow i])=0$ for every $i\in D_v$, $i\neq j$, and $J_I(K')=0$ for every $K'\in\mathcal{K}(B(c))$ with $K'{\upharpoonright_{B(b)}}\neq K$ (so that, on the new coordinates, $J_I$ is nonzero only at $K[v\leftarrow j]$).

Obviously, the new vectors $J_I$ satisfy all constraints relevant to $T(b)$, and it is easy to check that they satisfy all constraints relevant to $T(c)$ as well, given the definitions above. Moreover, the definitions above imply that the vectors $J_I$ satisfy property $\clubsuit$.

Forget node. Assume that the only child of the forget node $c$ is a node $b$ and $B(c)=B(b)\setminus\{v\}$. This case is symmetric to the previous one in that instead of splitting the groups $Z_K$ into smaller groups $Z_{K'}$, we merge them into bigger groups $Z_{K'}$.

By the inductive assumption, there exist an integer $M$ and integral vectors $I_1,\ldots,I_M\in\{0,1\}^{\mathcal{R}(b)}$, each of them satisfying the constraints relevant to $T(b)$ and such that $\bm{f}|_b=\frac{1}{M}\sum_{i=1}^{M} I_i$. Without loss of generality we assume that for every variable relevant to $T(c)$, its $M$-multiple is integral. We partition the vectors $I_1,\ldots,I_M$ into several groups indexed by the configurations from $\mathcal{K}(B(b))$: the group $Z_K$, for $K\in\mathcal{K}(B(b))$, consists exactly of those vectors $I_i$ that agree with $K$. Note that the size of $Z_K$ is $M\cdot f(K)$.

For every $K'\in\mathcal{K}(B(c))$ we create a bigger group $Z_{K'}$ by merging $|D_v|$ of the groups $Z_K$, namely those satisfying $K{\upharpoonright_{B(c)}}=K'$. By the LP constraint (6), the new group $Z_{K'}$ contains exactly $M\cdot f(K')$ vectors. For every $K'\in\mathcal{K}(B(c))$, we create from every vector $I\in Z_{K'}$ a new integral vector $J_I$ in the following way:

  • for every $\bar K\in\mathcal{R}(b)$, $J_I(\bar K)=I(\bar K)$.

If $\mathcal{K}(B(c))\subseteq\mathcal{R}(b)$, there is nothing more to do. Otherwise we further define

  • $J_I(K')=1$, and for every $\hat K\in\mathcal{K}(B(c))$, $\hat K\neq K'$, $J_I(\hat K)=0$.

We have to check that the vectors $J_I$ satisfy all constraints relevant to $T(c)$. The only possibly new constraints are those using the variables $f(K')$ for $K'\in\mathcal{K}(B(c))$, and it is easily seen that they are satisfied, given the definitions above. Also, the definitions above imply that the vectors $J_I$ satisfy property $\clubsuit$. ∎

By applying Lemma 1 to the whole tree $T$, that is, to the subtree rooted at the root of $T$, we obtain that $\bm{f}$ is a convex combination of integral points of $P(Q)$; since $\bm{f}$ corresponds to a vertex of $P(Q)$, all the vectors $I_i$ must coincide with $\bm{f}$, and hence $\bm{f}$ is an integral vector and the corresponding vertex of $P(Q)$ is integral. As this holds for every vertex of $P(Q)$, we conclude that $P(Q)$ is an integral polytope.

Considering the notes at the ends of the previous two subsections, we also conclude that $CSP(Q)=\mathrm{proj}_1(\mathrm{proj}_2(P(Q)))$ and $CSP'(Q)=\mathrm{proj}_V(CSP(Q))$.

To complete the proof of Theorem 1.1, we observe that the number of variables and constraints in the LP (4)–(7) is $\mathcal{O}(D^\tau n)$. ∎

3 Applications

The purpose of this section is to make explicit the extension complexity upper bounds given by Theorem 1.1 for several well-known graph problems. We find it interesting that the attained extension complexity upper bounds meet the best possible (assuming the Strong ETH) time complexity lower bounds given by Lokshtanov et al. [11]; the only exception is the Multiway Cut problem. To state our results, we use for each problem the following template:

 

Problem name [reference]   |   Projection   |   Extension complexity   |   Time complexity

Instance: …

Solution: …

CSP formulation: $V$, $\mathcal{D}$, $\mathcal{H}$, $\mathcal{C}$.   CSP version: Decision / Max / Min

where Projection is the name of the linear projection that yields the natural polytope of the problem $Q$ from the $CSP(Q)$ polytope (or from the $P(Q)$ polytope, in the case of the OCT problem). We use the notation $[n]=\{1,\dots,n\}$.

 

Coloring / Chromatic Number [1]   |   $\mathrm{proj}_V$   |   $\mathcal{O}(q^\tau n)$   |   $\Theta(q^\tau n)$

Instance: Graph $G=(V,E)$, set of colors $[q]$

Solution: A coloring of $G$ with $q$ colors with no monochromatic edges.

CSP formulation: $V=[n]$, $D_v=[q]$ for every $v\in V$, $H_{uv}=\{(i,j) \mid i\in D_u, j\in D_v, i\neq j\}$ for every $uv\in E$, $\mathcal{C}=\emptyset$. Decision

Comment: Note that the chromatic number $\chi(G)$ of $G$ is always upper bounded by $\tau+1$, since graphs of treewidth $\tau$ are $\tau$-degenerate and thus $(\tau+1)$-colorable. Thus, if the goal is to determine $\chi(G)$, it suffices to find the smallest $q$ such that $CSP(Q)$ is non-empty. A small illustration of this CSP encoding is sketched below.
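As a small illustration (the dictionary-based encoding is ours, matching the earlier sketches, not part of the paper), the Coloring formulation from the table can be built as follows; the resulting instance could, for example, be passed to the earlier decision sketch.

```python
def coloring_as_csp(n, edges, q):
    """Build the CSP instance (V, D, H, C) for Coloring with q colors."""
    V = range(1, n + 1)
    domains = {v: list(range(1, q + 1)) for v in V}        # D_v = [q]
    hard = {tuple(sorted((u, v))):                          # H_uv: endpoints get different colors
                {(i, j) for i in range(1, q + 1) for j in range(1, q + 1) if i != j}
            for u, v in edges}
    soft = {}                                               # C = ∅ (decision version)
    return domains, hard, soft

# e.g. a triangle: colorable with q = 3 but not with q = 2
domains, hard, soft = coloring_as_csp(3, [(1, 2), (2, 3), (1, 3)], 3)
```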

 

List-$H$-Coloring / List Homomorphism [6]   |   $\mathrm{proj}_V$   |   $\mathcal{O}(L^\tau n)$   |   $\Theta(L^\tau n)$

Instance: Graph $G=(V,E)$, graph $H=(V_H,E_H)$ possibly containing loops, and for every vertex $v\in V$ a set $L(v)\subseteq V_H$. (We denote $L=\max_{v\in V}|L(v)|$.)

Solution: A mapping $f:V\rightarrow V_H$ such that $f(u)f(v)\in E_H$ for every $uv\in E$ and $f(v)\in L(v)$ for every $v\in V$.

CSP formulation: $V=[n]$, $D_v=L(v)$ for every $v\in V$, $H_{uv}=\{(i,j) \mid i\in D_u, j\in D_v, ij\in E_H\}$ for every $uv\in E$, $\mathcal{C}=\emptyset$. Decision

Comment: Note that the problems List Coloring, Precoloring Extension and $H$-Coloring (or Graph Homomorphism) are all special cases of this problem. The lower bound given by Lokshtanov et al. [11] applies to all of them since Coloring is a special case of each of them.

 

Unique Games [9]   |   $\mathrm{proj}_{id}$   |   $\mathcal{O}(t^\tau n)$

Instance: Graph $G=(V,E)$, an integer $t\in\mathbb{N}$, and a permutation $\pi_e$ of $[t]$ for every edge $e\in E$.

Solution: A mapping $\ell:V\rightarrow[t]$ such that the number of edges $uv\in E$ with $\pi_{uv}(\ell(u))=\ell(v)$ is maximized.

CSP formulation: $V=[n]$, $D_v=[t]$ for every $v\in V$, $\mathcal{H}=\emptyset$, $C_{uv}=\{(i,\pi_{uv}(i)) \mid i\in D_u\}$ for every edge $uv\in E$. Max

Comment: The decision variant of this problem is not interesting as it is trivially solvable in polynomial time.

 

Multiway Cut [1]   |   $\mathrm{proj}_E$   |   $\mathcal{O}(t^\tau n)$   |   $\mathcal{O}(t^\tau n)$

Instance: Graph $G=(V,E)$, an integer $t\in\mathbb{N}$ and $t$ vertices $s_1,\dots,s_t\in V$

Solution: A partition of $V$ into sets $V_1,\dots,V_t$ such that $s_i\in V_i$ for every $i$ and the total number of edges between $V_i$ and $V_j$ for $i\neq j$ is minimized.

CSP formulation: $V=[n]$, $D_{s_i}=\{i\}$ for every $i\in[t]$ and $D_v=[t]$ for every other $v\in V$, $\mathcal{H}=\emptyset$, $C_{uv}=\{(i,i) \mid i\in[t]\}$ for every edge $uv\in E$. Min

Comment: Setting $z_v=i$ models vertex $v$ belonging to the set $V_i$. Not satisfying the constraint $C_{uv}$ means that the edge $uv$ belongs to the multiway cut.

 

Max Cut [1]   |   $\mathrm{proj}_E$   |   $\mathcal{O}(2^\tau n)$   |   $\Theta(2^\tau n)$

Instance: Graph $G=(V,E)$

Solution: A partition of the vertices into two sets $V_1,V_2$ such that the number of edges between $V_1$ and $V_2$ is maximized.

CSP formulation: $V=[n]$, $D_v=\{0,1\}$ for every $v\in V$, $\mathcal{H}=\emptyset$, $C_{uv}=\{(1,0),(0,1)\}$ for every edge $uv\in E$. Max

Comment: The values $0,1$ model the vertex belonging to the set $V_1$ or $V_2$. If we replace maximization by minimization, the problem becomes the Edge Bipartization (aka Edge OCT) problem, which is a parametric dual of Max Cut.

 

Vertex Cover [1]   |   $\mathrm{proj}_V$   |   $\mathcal{O}(2^\tau n)$   |   $\Theta(2^\tau n)$

Instance: Graph $G=(V,E)$

Solution: A set of vertices $C\subseteq V$ of minimum size such that every edge has at least one of its endpoints in $C$.

CSP formulation: $V=[n]$, $D_v=\{0,1\}$ for every $v\in V$, $H_{uv}=\{(0,0),(0,1),(1,0)\}$ for every edge $uv\in E$, $C_v=\{1\}$ for every $v\in V$. Min

Comment: The values $0,1$ model the vertex belonging to $C$ or $V\setminus C$. If we replace minimization by maximization, the problem becomes the Independent Set problem, which is a parametric dual of Vertex Cover.

 

Odd Cycle Transversal [11]   |   $\mathrm{proj}_{OCT}\circ\mathrm{proj}_2$   |   $\mathcal{O}(3^\tau n)$   |   $\Theta(3^\tau n)$

Instance: Graph $G=(V,E)$

Solution: A subset of vertices $W\subseteq V$ of minimum size such that $G[V\setminus W]$ is a bipartite graph.

CSP formulation: $V=[n]$, $D_v=\{0,1,2\}$ for every $v\in V$, $H_{uv}=\{0,1,2\}^2\setminus\{(0,0),(1,1)\}$ for every edge $uv\in E$, $C_v=\{0,1\}$ for every $v\in V$. Min

Comment: The values $0,1,2$ model the vertex belonging to either the first or the second part of the bipartition, or to the deletion set $W$. Satisfying the constraint $C_v$ corresponds to not putting $v$ into the deletion set $W$. The problem is also known as Vertex Bipartization. The projection $\mathrm{proj}_{OCT}$ is defined on the $(\bm{y},\bm{g})$-space by $\mathrm{proj}_{OCT}(y_1^0,y_1^1,y_1^2,y_2^0,y_2^1,y_2^2,\ldots,y_n^0,y_n^1,y_n^2,\bm{g})=(y_1^2,y_2^2,\ldots,y_n^2)$.

4 Open problems

A natural research direction is to examine more closely the extension complexity of CSP and of the specific graph problems above on graphs of bounded treewidth; in particular, what are the best possible upper bounds?

Acknowledgments.

The authors thank Hans Raj Tiwary and Jiří Sgall for stimulating discussions.

References

  • [1] G. Ausiello, P. Crescenzi, G. Gambosi, V. Kann, A. Marchetti-Spaccamela, and M. Protasi. Complexity and Approximation: Combinatorial Optimization Problems and Their Approximability Properties. Springer, 1999.
  • [2] D. Bienstock and G. Munoz. LP approximations to mixed-integer polynomial optimization problems. ArXiv e-prints, Jan. 2015.
  • [3] A. Buchanan and S. Butenko. Tight extended formulations for independent set, 2014. Available on Optimization Online.
  • [4] S. O. Chan, J. R. Lee, P. Raghavendra, and D. Steurer. Approximate constraint satisfaction requires large LP relaxations. In Proc. of the 54th Annual IEEE Symposium on Foundations of Computer Science, (FOCS), pages 350–359, 2013.
  • [5] M. Conforti, G. Cornuéjols, and G. Zambelli. Extended formulations in combinatorial optimization. Annals OR, 204(1):97–143, 2013.
  • [6] T. Feder and P. Hell. List homomorphisms to reflexive graphs. J. Comb. Theory, Ser. B, 72(2):236–250, 1998.
  • [7] E. C. Freuder. Complexity of $K$-tree structured constraint satisfaction problems. In Proc. of the 8th National Conference on Artificial Intelligence, pages 4–9, 1990.
  • [8] M. Grohe, T. Schwentick, and L. Segoufin. When is the evaluation of conjunctive queries tractable? In Proc. of the 33rd Annual ACM Symposium on Theory of Computing (STOC), pages 657–666, 2001.
  • [9] S. Khot. On the power of unique 2-Prover 1-Round games. In Proc. of the 34th Annual ACM Symposium on Theory of Computing (STOC), pages 767–775, 2002.
  • [10] T. Kloks. Treewidth: Computations and Approximations, volume 842 of Lecture Notes in Computer Science. Springer, 1994.
  • [11] D. Lokshtanov, D. Marx, and S. Saurabh. Known algorithms on graphs on bounded treewidth are probably optimal. In Proc. of the 22nd Annual ACM-SIAM Symposium on Discrete Algorithms, (SODA), pages 777–789, 2011.
  • [12] D. Marx. Can you beat treewidth? Theory of Computing, 6(1):85–112, 2010.
  • [13] P. Raghavendra. Optimal algorithms and inapproximability results for every CSP? In Proc. of the 40th Annual ACM Symposium on Theory of Computing (STOC), pages 245–254, 2008.
  • [14] P. Raghavendra and D. Steurer. How to round any CSP. In Proc. of the 50th Annual IEEE Symposium on Foundations of Computer Science, (FOCS), pages 586–594, 2009.
  • [15] P. Raghavendra and D. Steurer. Integrality gaps for strong SDP relaxations of unique games. In Proc. of the 50th Annual IEEE Symposium on Foundations of Computer Science, (FOCS), pages 575–585, 2009.
  • [16] T. Rothvoß. The matching polytope has exponential extension complexity. In Proc. of the 46th ACM Symposium on Theory of Computing, (STOC), pages 263–272, 2014.
  • [17] M. Sellmann. The polytope of tree-structured binary constraint satisfaction problems. In Proc. of Integration of AI and OR Techniques in Constraint Programming for Combinatorial Optimization Problems (CPAIOR), volume 5015 of Lecture Notes in Computer Science, pages 367–371. Springer, 2008.
  • [18] M. Sellmann, L. Mercier, and D. H. Leventhal. The linear programming polytope of binary constraint problems with bounded tree-width. In Proc. of Integration of AI and OR Techniques in Constraint Programming for Combinatorial Optimization Problems (CPAIOR), volume 4510 of Lecture Notes in Computer Science, pages 275–287. Springer, 2007.