¹¹institutetext: CNRS, UMR 8188, Centre de Recherche en Informatique de Lens (CRIL), Lens, F-62300, France
Univ. Artois, UMR 8188, Lens, F-62300, France

Characterizing Tseitin-formulas with short regular resolution refutations^†^†thanks: This work has been partly supported by the PING/ACK project of the French National Agency for Research (ANR-18-CE40-0011).

Alexis de Colnet Stefan Mengel

Abstract

Tseitin-formulas are systems of parity constraints whose structure is described by a graph. These formulas have been studied extensively in proof complexity as hard instances in many proof systems. In this paper, we prove that a class of unsatisfiable Tseitin-formulas of bounded degree has regular resolution refutations of polynomial length if and only if the treewidth of all underlying graphs $G$ for that class is in $O(\log|V(G)|)$ . To do so, we show that any regular resolution refutation of an unsatisfiable Tseitin-formula with graph $G$ of bounded degree has length $2^{\Omega(tw(G))}/|V(G)|$ , thus essentially matching the known $2^{O(tw(G))}\textup{poly}(|V(G)|)$ upper bound up. Our proof first connects the length of regular resolution refutations of unsatisfiable Tseitin-formulas to the size of representations of satisfiable Tseitin-formulas in decomposable negation normal form (DNNF). Then we prove that for every graph $G$ of bounded degree, every DNNF-representation of every satisfiable Tseitin-formula with graph $G$ must have size $2^{\Omega(tw(G))}$ which yields our lower bound for regular resolution.

Keywords:

proof complexity, regular resolution, DNNF, treewidth

1 Introduction

Resolution is one of the most studied propositional proof systems in proof complexity due to its naturality and it connections to practical SAT solving [21, 9]. A refutation of a CNF-formula in this system (a resolution refutation) relies uniquely on clausal resolution: in a refutation, clauses are iteratively derived by resolutions on clauses from the formula or previously inferred clauses, until reaching the empty clause indicating unsatisfiability. In this paper, we consider regular resolution which is the restriction of resolution to proofs in which, intuitively, variables which have been resolved away from a clause cannot be reintroduced later on by additional resolution steps. This fragment of resolution is known to generally require exponentially longer refutations than general resolution [16, 1, 25, 27] but is still interesting since it corresponds to DPLL-style algorithms [12, 13]. Consequently, there is quite some work on regular resolution, see e.g. [3, 24, 5, 4] for a very small sample.

Tseitin-formulas are encodings of certain systems of linear equations whose structure is given by a graph [23]. They have been studied extensively in proof complexity essentially since the creation of the field because they are hard instances in many settings, see e.g. [24, 6, 18, 19, 4]. It is known that different properties of the underlying graph characterize different parameters of their resolution refutations [14, 2, 18]. Extending this line of work, we here show that treewidth determines the length of regular resolution refutations of Tseitin-formulas: classes of Tseitin-formulas of bounded degree have polynomial length regular resolution refutations if and only if the treewidth of the underlying graphs is bounded logarithmically in their size. The upper bound for this result was already known from [2] where it is shown that, for every graph $G$ , unsatisfiable Tseitin-formulas with the underlying graph $G$ have regular resolution refutations of length at most $2^{O(tw(G))}|V(G)|^{c}$ where $c$ is a constant. We provide a matching lower bound:

Theorem 1.1

Let $T(G,c)$ be an unsatisfiable Tseitin-formula where $G$ is a connected graph with maximum degree at most $\Delta$ . The length of the smallest regular resolution refutation of $T(G,c)$ is at least $2^{\Omega(tw(G)/\Delta)}|V(G)|^{-1}$ .

There were already known lower bounds for the length of resolution refutations of Tseitin-formulas based on treewidth before. For general resolution, a $2^{\Omega(tw(G)^{2})/|V(G)|}$ lower bound can be inferred width the classical width-length relation of [6] and width bounds of [14]. This gives a tight $2^{\Omega(tw(G))}$ bound when the treewidth of $G$ is linear in its number of vertices. For smaller treewidth, better bounds of $2^{\Omega(tw(G))/\log|V(G)|}$ that almost match the upper bound where shown in [19] for regular resolution refutations. Building on [19], we eliminate the division by $\log|V(G)|$ in the exponent and thus give a tight $2^{\Theta(tw(G))}$ dependence.

As in [19], our proof strategy follows two steps. First, we show that the problem of bounding the length of regular resolution refutations of an unsatisfiable Tseitin-formula can be reduced to lower bounding the size of certain representations of a satisfiable Tseitin-formula. Itsykson et al. in [19] used a similar reduction of lower bounds for regular resolution refutations to bounds on read-once branching programs (1-BP) for satisfiable Tseitin-formulas, using the classical connection between regular resolution and the search problem which, given an unsatisfiable CNF-formula and a truth assignment, returns a clause of the formula it falsifies [20]. Itsykson et al. showed that there is a transformation of a 1-BP solving the search problem for an unsatisfiable Tseitin-formula into a 1-BP of pseudopolynomial size computing a satisfiable Tseitin-formula with the same underlying graph. This yields lower bounds for regular resolution from lower bounds for 1-BP computing satisfiable Tseitin-formulas which [19] also shows. Our crucial insight here is that when more succinct representations are used to present the satisfiable formula, the transformation from the unsatisfiable instance can be changed to have only a polynomial instead of pseudopolynomial size increase. Concretely, the representations we use are so-called decomposable negation normal forms (DNNF) which are very prominent in the field of knowledge compilation [10] and generalize 1-BP. We show that every refutation of an unsatisfiable Tseitin-formula can be transformed into a DNNF-representation of a satisfiable Tseitin-formula with the same underlying graph with only polynomial overhead.

In a second step, we then show for every satisfiable Tseitin-formula with an underlying graph $G$ a lower bound of $2^{\Omega(tw(G))}$ on the size of DNNF computing the formula. To this end, we adapt techniques developed in [8] to a parameterized setting. [8] uses rectangle covers of a function, a common tool from communication complexity, to lower bound the size of any DNNF computing the function. Our refinement takes the form of a two-player game in which the first player tries to cover the models of a function with few rectangles while the second player hinders this construction by adversarially choosing the variable partitions respected by the rectangles from a certain set of partitions. We show that this game gives lower bounds for DNNF, and consequently the aim is to show that the adversarial player can always force $2^{\Omega(tw(G))}$ rectangles in the game when playing on a Tseitin-formula with graph $G$ . This is done by proving that any rectangle for a carefully chosen variable partition splits parity constraints of the formula in a way that bounds by a function of $tw(G)$ the number of models that can be covered. We show that, depending on the treewidth of $G$ , the adversarial player can choose a partition to limit the number of models of every rectangle constructed in the game to the point that at least $2^{\Omega(tw(G))}$ of them will be needed to cover all models of the Tseitin-formula. As a consequence, we get the desired lower bound of $2^{\Omega(tw(G))}|V(G)|^{-1}$ for regular resolution refutations of Tseitin-formulas.

2 Preliminaries

Notions on Graphs.

We assume the reader is familiar with the fundamentals of graph theory. For a graph $G$ , we denote by $V(G)$ its vertices and by $E(G)$ its edges. For $v\in V(G)$ , $E(v)$ denotes the edges incident to $v$ and $N(v)$ its neighbors ( $v$ is not in $N(v)$ ). For a subset $V^{\prime}$ of $V(G)$ we denote by $G[V^{\prime}]$ the sub-graph of $G$ induced by $V^{\prime}$ .

A binary tree whose leaves are in bijection with the edges of $G$ is called a branch decomposition¹¹1We remark that often branch decompositions are defined as unrooted trees. However, it is easy to see that our definition is equivalent, so we use it here since it is more convenient in our setting.. Each edge $e$ of a branch decomposition $T$ induces a partition of $E(G)$ into two parts as the edge sets that appear in the two connected components of $T$ after deletion of $e$ . The number of vertices of $G$ that are incident to edges in both parts of this partition is the order of $e$ , denoted by $order(e,T)$ . The branchwidth of $G$ , denoted by $bw(G)$ , is defined as $bw(G)=\min_{T}\max_{e\in E(T)}order(e,T)$ , where $\min_{T}$ is over all branch decompositions of $G$ .

While it is convenient to work with branchwidth in our proofs, we state our main result with the more well-known treewidth $tw(G)$ of a graph $G$ . This is justified by the following well-known connection between the two measures.

Lemma 1

[17, Lemma 12] If $bw(G)\geq 2$ , then $bw(G)-1\leq tw(G)\leq\frac{3}{2}bw(G)$ .

A separator $S$ in a connected graph $G$ is defined to be a vertex set such that $G\setminus S$ is non-empty and not connected. A graph $G$ is called $3$ -connected if and only if it has at least $4$ vertices and, for every $S\subseteq V(G)$ , $|S|\leq 2$ , the graph $G\setminus S$ is connected.

Variables, assignments, v-trees.

Boolean variables can have value 0 ( $false$ ) or 1 ( $true$ ). The notation $\ell_{x}$ refers to a literal for a variable $x$ , that is, $x$ or its negation $\overline{x}$ . Given a set $X$ of Boolean variables, $lit(X)$ denotes its set of literals. A truth assignment to $X$ is a mapping $a:X\rightarrow\{0,1\}$ . If $a_{X}$ and $a_{Y}$ are assignments to disjoint sets of variables $X$ and $Y$ , then $a_{X}\cup a_{Y}$ denotes the combined assignment to $X\cup Y$ . The set of assignments to $X$ is denoted by $\{0,1\}^{X}$ . Let $f$ be a Boolean function, we denote by $var(f)$ its variables and by $sat(f)$ its set of models, i.e., assignments to $var(f)$ on which $f$ evaluates to $1$ . A v-tree of $X$ is a binary tree $T$ whose leaves are labeled bijectively with the variables in $X$ . A v-tree $T$ of $X$ induces a set of partitions $(X_{1},X_{2})$ of $X$ as follows: choose a vertex $v$ of $T$ , setting $X_{1}$ to contain exactly the variables in $T$ that appear below $v$ and $X_{2}:=X\setminus X_{1}$ .

Tseitin-Formulas.

Tseitin formulas are systems of parity constraints whose structure is determined by a graph. Let $G=(V,E)$ be a graph and let $c:V\rightarrow\{0,1\}$ be a labeling of its vertices called a charge function. The Tseitin-formula $T(G,c)$ has for each edge $e\in E$ a Boolean variable $x_{e}$ and for each vertex $v\in V$ a constraint $\chi_{v}:\sum_{e\in E(v)}x_{e}=c(v)\mod 2$ . The Tseitin-formula $T(G,c)$ is then defined as $T(G,c):=\bigwedge_{v\in V}\chi_{v}$ , i.e., the conjunction of the parity constraints for all $v\in V$ . By $\overline{\chi_{v}}$ we denote the negation of $\chi_{v}$ , i.e., the parity constraint on $(x_{e})_{e\in E(v)}$ with charge $1-c(v)$ .

Proposition 1

[24, Lemma 4.1] The Tseitin-formula $T(G,c)$ is satisfiable if and only if for every connected component $U$ of $G$ we have $\sum_{v\in U}c(v)=0\mod 2$ .

Proposition 2

[15, Lemma 2] Let $G$ be a graph with $K$ connected components. If the Tseitin-formula $T(G,c)$ is satisfiable, then it has $2^{|E(G)|-|V(G)|+K}$ models.

When conditioning the formula $T(G,c)$ on a literal $\ell_{e}\in\{x_{e},\overline{x_{e}}\}$ for $e=ab$ in $E(G)$ , the resulting function is another Tseitin formula $T(G,c)|\ell_{e}=T(G^{\prime},c^{\prime})$ where $G^{\prime}$ is the graph $G$ without the edge $e$ (so $G^{\prime}=G-e$ ) and $c^{\prime}$ depends on $\ell_{e}$ . If $\ell_{e}=\overline{x_{e}}$ then $c^{\prime}$ equals $c$ . If $\ell_{e}=x_{e}$ then $c^{\prime}=c+1_{a}+1_{b}\mod 2$ , where $1_{v}$ denotes the charge function that assigns $1$ to $v$ and $0$ to all other variables.

Since we consider Tseitin-formulas in the setting of proof systems for CNF-formulas, we will assume in the following that they are encoded as CNF-formulas. In this encoding, every individual parity constraint $\chi_{v}$ is expressed as a CNF-formula $F_{v}$ and $T(G,c):=\bigwedge_{v\in V}F_{v}$ . Since it takes $2^{|E(v)|-1}$ clauses to write the parity constraint $\chi_{v}$ , each clause containing $E(v)$ literals, we make the standard assumption that $E(v)$ is bounded, i.e., there is a constant upper bound $\Delta$ on the degree of all vertices in $G$ .

DNNF.

A circuit over $X$ in negation normal form (NNF) is a directed acyclic graph whose leaves are labeled with literals in $lit(X)$ or 0/1-constants, and whose internal nodes are labeled by $\lor$ -gates or $\land$ -gates. We use the usual semantics for the function computed by (gates of) Boolean circuits. Every NNF can be turned into an equivalent NNF whose nodes have at most two successors in polynomial time. So we assume that NNF in this paper have only binary gates and thus define the size $|D|$ as the number of gates, which is then at most half the number of wires. Given a gate $g$ , we denote by $var(g)$ the variables for the literals appearing under $g$ . When $g$ is a literal input $\ell_{x}$ , we have $var(g)=\{x\}$ , and when it is a 0/1-input, we define $var(g)=\emptyset$ . A gate with two children $g_{l}$ and $g_{r}$ is called decomposable when $var(g_{l})\cap var(g_{r})=\emptyset$ , and it is called complete (or smooth) when $var(g_{l})=var(g_{r})$ . An NNF whose $\land$ -gates are all decomposable is called a decomposable NNF (DNNF). We call a DNNF complete when all its $\lor$ -gates are complete. Every DNNF can be made complete in polynomial time. For every Boolean function $f$ on finitely many variables, there exists a DNNF computing $f$ .

When representing Tseitin-formulas by DNNF, we will use the following:

Lemma 2

Let $G$ be a graph and let $c$ and $c^{\prime}$ be two charge functions such that $T(G,c)$ and $T(G,c^{\prime})$ are satisfiable Tseitin-formulas. Then $T(G,c)$ can be computed by a DNNF of size $s$ if and only if this is true for $T(G,c^{\prime})$ .

Proof (sketch)

$T(G,c)$ can be transformed into $T(G,c^{\prime})$ by substituting some variables by their negations, see [19, Proposition 26]. So every DNNF for $T(G,c)$ can be transformed into one for $T(G,c^{\prime})$ by making the same substitutions. ∎

Proof trees of a DNNF $D$ are tree-like sub-circuits of $D$ constructed iteratively as follows: we start from the root gate and add it to the proof tree. Whenever an $\land$ -gate is met, both its child gates are added to the proof tree. Whenever a $\lor$ -gate is met, exactly one child is is added to the proof tree. Each proof tree of $D$ computes a conjunction of literals. By distributivity, the disjunction of the conjunctions computed by all proof trees of $D$ computes the same function as $D$ . When $D$ is complete, every variable appears exactly once per proof tree, so every proof tree of a complete DNNF encodes a single model.

Branching programs.

A branching program (BP) $B$ is a directed acyclic graph with a single source, sinks that uniquely correspond to the values of a finite set $Y$ , and whose inner nodes, called decision nodes are each labeled by a Boolean variable $x\in X$ and have exactly two output wires called the 0- and 1-wire pointing to two nodes respectively called its 0- and the 1-child. The variable $x$ appears on a path in $B$ if there is a decision node $v$ labeled by $x$ on that path. A truth assignment $a$ to $X$ induces a path in $B$ which starts at the source and, when encountering a decision node for a variable $x$ , follows the 0-wire (resp. the 1-wire) if $a(x)=0$ (resp. $a(x)=1$ ). The BP $B$ is defined to compute the value $y\in Y$ on an assignment $a$ if and only if the path of $a$ leads to the sink labeled with $y$ . We denote this value $y$ as $B(a)$ . Let $f:X\rightarrow Y$ be a function where $X$ is a finite set of Boolean variables and $Y$ any finite set. Then we say that $B$ computes $f$ if for every assignment $a\in\{0,1\}^{X}$ we have $B(a)=f(a)$ . We say that a node $v$ in $B$ computes a function $g$ if the BP we get from $B$ by deleting all nodes that are not reachable from $v$ computes $g$ .

Let $R\subseteq\{0,1\}^{X}\times Y$ be a relation where $Y$ is again finite. Then we say that a BP $B$ computes $R$ if for every assignment $a$ we have that $(a,B(a))\in R$ . Let $T(G,c)$ be an unsatisfiable Tseitin-formula for a graph $G=(V,E)$ . Then we define the two following relations: $\textup{Search}_{T(G,c)}$ consists of the pairs $(a,C)$ such that $a$ is an assignment to $T(G,c)$ that does not satisfy the clause $C$ of $T(G,c)$ . The relation $\textup{SearchVertex}(G,c)$ consists of the pairs $(a,v)$ such that $a$ does not satisfy the parity constraint $\chi_{v}$ of a vertex $v\in V$ . Note that $\textup{Search}_{T(G,c)}$ and $\textup{SearchVertex}(G,c)$ both give a reason why an assignment $a$ does not satisfy $T(G,c)$ but the latter is more coarse: $\textup{SearchVertex}(G,c)$ only gives a constraint that is violated while $\textup{Search}_{T(G,c)}$ gives an exact clause that is not satisfied.

Regular Resolution.

We only introduce some minimal notions of proof complexity here; for more details and references the reader is referred to the recent survey [9]. Let $C_{1}=x\lor D_{1}$ and $C_{2}=\overline{x}\lor D_{2}$ be two clauses such that $D_{1},D_{2}$ contain neither $x$ nor $\overline{x}$ . Then the clause $D_{1}\lor D_{2}$ is inferred by resolution of $C_{1}$ and $C_{2}$ on $x$ . A resolution refutation of length $s$ of a CNF-formula $F$ is defined to be a sequence $C_{1},\ldots,C_{s}$ such that $C_{s}$ is the empty clause and for every $i\in[s]$ we have that $C_{i}$ is a clause of $F$ or it is inferred by resolution of two clauses $C_{j},C_{\ell}$ such that $j,\ell<i$ . It is well-known that $F$ has a resolution refutation if and only if $F$ is unsatisfiable.

To every resolution refutation $C_{1},\ldots,C_{s}$ we assign a directed acyclic graph $G$ as follows: the vertices of $G$ are the clauses $\{C_{i}\mid i\in[s]\}$ . Moreover, there is an edge $C_{j}C_{i}$ in $G$ if and only if $C_{i}$ is inferred by resolution of $C_{j}$ and some other clause $C_{\ell}$ on a variable $x$ in the refutation. We also label the edge $C_{j}C_{i}$ with the variable $x$ . Note that there might be two pairs of clauses $C_{j},C_{\ell}$ and $C_{j^{\prime}},C_{\ell^{\prime}}$ such that resolution on both pairs leads to the same clause $C_{i}$ . If this is the case, we simply choose one of them to make sure that all vertices in $G$ have indegree at most $2$ . A resolution refutation is called regular if on every directed path in $G$ every variable $x$ appears at most once as a label of an edge. It is known that there is a resolution refutation of $F$ if and only if a regular resolution refutation of $F$ exists [13], but the latter are in general longer [1, 25].

In this paper, we will not directly deal with regular resolution proofs thanks to the following well-known result.

Theorem 2.1

[20] For every unsatisfiable CNF-formula $F$ , the length of the shortest regular resolution refutation of $F$ is the size of the smallest $1$ -BP computing $\textup{Search}_{F}$ .

Since in our setting, from an unsatisfied clause we can directly inferred an unsatisfied parity constraint, we can use the following simple consequence.

Corollary 1

For every unsatisfiable Tseitin-formula $T(G,c)$ , the length of the shortest regular resolution refutation of $T(G,c)$ is at least the size of the smallest $1$ -BP computing $\textup{SearchVertex}(G,c)$ .

3 Reduction From Unsatisfiable to Satisfiable Formulas

To show our main result, we give a reduction from unsatisfiable to satisfiable Tseitin-formulas as in [19]. There it was shown that, given a $1$ -BP $B$ computing $\textup{SearchVertex}(G,c)$ for an unsatisfiable Tseitin-formula $T(G,c)$ , one can construct a $1$ -BP $B^{\prime}$ computing the function of a satisfiable Tseitin-formula $T(G,c^{*})$ such that $|B^{\prime}|$ is quasipolynomial in $|B|$ . Then good lower bounds on the size of $B^{\prime}$ yield lower bounds for regular refutation by Corollary 1. To give tighter results, we give a version of the reduction from unsatisfiable to satisfiable Tseitin-formulas where the target representation for $T(G,c^{*})$ is not $1$ -BP but the more succinct DNNF. This lets us decrease the size of the representation from pseudopolynomial to polynomial which, with tight lower bounds in the later parts of the paper, will yield Theorem 1.1.

Theorem 3.1

Let $T(G,c)$ be an unsatisfiable Tseitin-formula where $G$ is connected and let $S$ be the length of its smallest resolution refutation. Then there exists for every satisfiable Tseitin-formula $T(G,c^{*})$ a DNNF of size $O(S\times|V(G)|)$ computing it.

In the proof of Theorem 3.1, we heavily rely on results from [19] in particular the notion of well-structuredness that we present in Section 3.1. In Section 3.2 we will then prove Theorem 3.1.

3.1 Well-structured branching programs for $\textup{SearchVertex}(G,c)$

In a well-structured 1-BP computing $\textup{SearchVertex}(G,c)$ , every decision node $u_{k}$ for a variable $x_{e}$ will compute $\textup{SearchVertex}(G_{k},c_{k})$ where $G_{k}$ is a connected sub-graph of $G$ containing the edge $e:=ab$ , and $c_{k}$ is a charge function such that $T(G_{k},c_{k})$ is unsatisfiable. Since $u_{k}$ deals with $T(G_{k},c_{k})$ , its 0- and 1-successors $u_{k_{0}}$ and $u_{k_{1}}$ will work on $T(G_{k},c_{k})|\ell_{e}$ for $\ell_{e}=\overline{x_{e}}$ and $\ell_{e}=x_{e}$ , respectively. $T(G_{k},c_{k})|\ell_{e}$ is a Tseitin-formula whose underlying graph is $G_{k}-e$ and whose charge function is $c_{k}$ or $c_{k}+1_{a}+1_{b}\mod 2$ depending on $\ell_{e}$ . For convenience, we introduce the notation $\gamma_{k}(x_{e})=c_{k}+1_{a}+1_{b}\mod 2$ and $\gamma_{k}(\overline{x_{e}})=c_{k}$ . Since $G_{k}$ is connected, $G_{k}-e$ has at most two connected components. Let $G^{a}_{k}$ and $G^{b}_{k}$ denote the components of $G_{k}-e$ containing $a$ and $b$ , respectively. Note that $G^{a}_{k}=G^{b}_{k}$ when $e$ is not a bridge of $G_{k}$ . Let $\gamma^{a}_{k}(\ell_{e})$ and $\gamma^{b}_{k}(\ell_{e})$ denote the restriction of $\gamma_{k}(\ell_{e})$ to the vertices of $G^{a}_{k}$ and $G^{b}_{k}$ , respectively. While the graph for $T(G_{k},c_{k})|\ell_{e}$ has at most two connected components, exactly one of them holds an odd total charge, so only the Tseitin-formula corresponding to that component is unsatisfiable. Well-structuredness states that $u_{k_{0}}$ and $u_{k_{1}}$ each deal with that unique connected component.

Figure 1: The graphs of Example 1. On the left the graph

G_{k}

, in the middle the result after assigning

0

x_{e}

, on the right after assigning

1

x_{e}

Example 1

Consider the graph $G_{k}$ shown on the left in Figure 1. Black nodes have charge $0$ and white nodes have charge $1$ . The corresponding Tseitin-formula $T(G_{k},c_{k})$ is unsatisfiable because there is an odd number of white nodes. Let $e:=ab$ . Then $T(G_{k},c_{k})|\overline{x_{e}}$ is the Tseitin-formula for the graph $G_{k}-e$ with charges as shown in the middle of Figure 1. Note that $T(G_{k},c_{k})|\overline{x_{e}}$ is unsatisfiable because of the charges in the triangle component $G_{k}^{b}$ . The repartition of charges for $T(G_{k},c_{k})|x_{e}$ illustrated on the right of Figure 1 shows that $T(G_{k},c_{k})|x_{e}$ is unsatisfiable because of the charges in the rombus component $G_{k}^{a}$ . Well-structuredness will ensure that, if $u_{k}$ computes $\textup{SearchVertex}(G_{k},c_{k})$ and decides $x_{e}$ , then $u_{k_{0}}$ computes $\textup{SearchVertex}(G_{k}^{b},\gamma^{b}_{k}(\overline{x_{e}}))$ and $u_{k_{1}}$ computes $\textup{SearchVertex}(G_{k}^{a},\gamma^{a}_{k}(x_{e}))$ .

Definition 1

Let $T(G,c)$ be an unsatisfiable Tseitin-formula where $G$ is a connected graph. A branching program $B$ computing $\textup{SearchVertex}(G,c)$ is well-structured when, for all nodes $u_{k}$ of $B$ , there exists a connected subgraph $G_{k}$ of $G$ and a charge function $c_{k}$ such that $T(G_{k},c_{k})$ is unsatisfiable, $u_{k}$ computes $\textup{SearchVertex}(G_{k},c_{k})$ and

1.

if $u_{k}$ is the source, then $G_{k}=G$ and $c_{k}=c$ ,
2.

if $u_{k}$ is a sink corresponding to $v\in V(G)$ , then $G_{k}=(\{v\},\emptyset)$ and $c_{k}=1_{v}$ ,
3.

if $u_{k}$ is a decision node for $x_{ab}$ with 0- and 1- successors $u_{k_{0}}$ and $u_{k_{1}}$ , set $\ell_{0}=\overline{x_{ab}}$ and $\ell_{1}=x_{ab}$ , then for all $i\in\{0,1\}$ , $(G_{k_{i}},c_{k_{i}})=(G^{a}_{k},\gamma^{a}_{k}(\ell_{i}))$ if $T(G^{a}_{k},\gamma^{a}_{k}(\ell_{i}))$ is unsatisfiable, otherwise $(G_{k_{i}},c_{k_{i}})=(G^{b}_{k},\gamma^{b}_{k}(\ell_{i}))$ .

We remark that our definition is a slight simplification of that given by Itsykson et al. [19]. It can easily be seen that ours is implied by theirs (see Definition 11 and Proposition 16 in [19]).

Lemma 3

[19, Lemma 17] Let $T(G,c)$ be an unsatisfiable Tseitin-formula where $G$ is connected and let $B$ be a 1-BP of minimal size²²2[19, Lemma 17] is for locally minimal 1-BP, which encompass minimal size 1-BP. computing the relation $\textup{SearchVertex}(G,c)$ . Then $B$ is well-structured.

3.2 Constructing DNNF from Well-structured branching programs

Similarly to Theorem 14 in [19], we give a reduction from a well-structured 1-BP for $\textup{SearchVertex}(G,c)$ to a DNNF computing a satisfiable formula $T(G,c^{*})$ .

Lemma 4

Let $G$ be a connected graph. Let $T(G,c^{*})$ and $T(G,c)$ be Tseitin-formulas where $T(G,c^{*})$ is satisfiable and $T(G,c)$ unsatisfiable. For every well-structured 1-BP $B$ computing $\textup{SearchVertex}(G,c)$ there exists a DNNF of size $O(|B|\times|V(G)|)$ computing $T(G,c^{*})$ .

Proof

Let $S=|B|$ and denote by $u_{1},\dots,u_{S}$ the nodes of $B$ such that if $u_{j}$ is a successor of $u_{i}$ , then $j<i$ (thus $u_{S}$ is the source of $B$ ). For every $i\in[S]$ , the node $u_{i}$ computes $\textup{SearchVertex}(G_{i},c_{i})$ . We will show how to iteratively construct DNNF $D_{1},\dots,D_{S}$ such that, $D_{1}\subseteq D_{2}\subseteq\dots\subseteq D_{S}$ and, for every $i\in[S]$ ,

for all $v\in V(G_{i})$ , there is a gate $g_{v}$ in $D_{i}$ computing $T(G_{i},c_{i}+1_{v})$ . $(\ast)$

Observe that, since $T(G_{i},c_{i})$ is unsatisfiable, $T(G_{i},c_{i}+1_{v})$ is satisfiable for any $v\in V(G_{i})$ . We show by induction on $i$ how to construct $D_{i}$ by extending $D_{i-1}$ while respecting $(\ast)$ .

For the base case, $u_{1}$ is a sink of $B$ , so it computes $\textup{SearchVertex}(G_{v},1_{v})$ where $G_{v}:=(\{v\},\emptyset)$ for a vertex $v\in V(G)$ . Thus we define $D_{1}$ as a single constant-1-node which indeed computes $T(G_{v},1_{v}+1_{v})=T(G_{v},0)$ . So $D_{1}$ is a DNNF respecting $(\ast)$ .

Now for the inductive case, suppose we have the DNNF $D_{k-1}$ satisfying $(\ast)$ . Consider the node $u_{k}$ of $B$ . If $u_{k}$ is a sink of $B$ , then we argue as for $D_{1}$ but since we already have the constant-1-node in $D_{k-1}$ we define $D_{k}:=D_{k-1}$ .

Now assume that $u_{k}$ is a decision node for the variable $x_{e}$ with 0- and 1-successors $u_{k_{0}}$ and $u_{k_{1}}$ . Recall that $u_{k}$ computes $\textup{SearchVertex}(G_{k},c_{k})$ and let $e=ab$ . There are two cases. If $e$ is not a bridge in $G_{k}$ then $G^{a}_{k}=G^{b}_{k}=G_{k}-e$ and, by well-structuredness,

•

$u_{k_{0}}$ computes $\textup{SearchVertex}(G_{k}-e,c_{k})$
•

$u_{k_{1}}$ computes $\textup{SearchVertex}(G_{k}-e,c_{k}+1_{a}+1_{b})$

For every $v\in V(G_{k})$ , since $k_{0},k_{1}<k$ , by induction there is a gate $g^{0}_{v}$ in $D_{k_{0}}$ computing $T(G_{k}-e,c_{k}+1_{v})$ and a gate $g^{1}_{v}$ in $D_{k_{1}}$ computing $T(G_{k}-e,c_{k}+1_{a}+1_{b}+1_{v})$ . So for every $v\in V(G_{k})$ we add to $D_{k-1}$ an $\lor$ -gate $g_{v}$ whose left input is $\overline{x_{e}}\land g^{0}_{v}$ and whose right input is $x_{e}\land g^{1}_{v}$ . By construction, $g_{v}$ computes $T(G_{k},c_{k}+1_{v})$ and the new $\land$ -gates are decomposable since $e$ is not an edge of $G_{k}-e$ and therefore $x_{e}$ and $\overline{x_{e}}$ do not appear in $D_{k_{0}}$ and $D_{k_{1}}$ .

Now if $e=ab$ is a bridge in $G_{k}$ , by well-structuredness, there exist $i\in\{0,1\}$ and $\ell_{e}\in\{\overline{x_{e}},x_{e}\}$ such that

•

$u_{k_{i}}$ computes $\textup{SearchVertex}(G^{a}_{k},\gamma^{a}_{k}(\ell_{e}))$
•

$u_{k_{1-i}}$ computes $\textup{SearchVertex}(G^{b}_{k},\gamma^{b}_{k}(\overline{\ell_{e}}))$

We construct a gate $g_{v}$ computing $T(G_{k},c_{k}+1_{v})$ for each $v\in V(G_{k})$ . Assume, without loss of generality, that $v\in V(G^{a}_{k})$ , then

•

$T(G_{k},c_{k}+1_{v})|\overline{\ell_{e}}\equiv T(G^{a}_{k},\gamma^{a}_{k}(\overline{\ell_{e}})+1_{v})\land T(G^{b}_{k},\gamma^{b}_{k}(\overline{\ell_{e}}))\equiv 0$
(because of the second conjunct which is known to be unsatisfiable), and
•

$T(G_{k},c_{k}+1_{v})|\ell_{e}\equiv T(G^{a}_{k},\gamma^{a}_{k}(\ell_{e})+1_{v})\land T(G^{b}_{k},\gamma^{b}_{k}(\ell_{e}))$

For the second item, since $k_{0},k_{1}<k$ , by induction there is a gate $g^{i}_{v}$ in $D_{k_{i}}$ computing $T(G^{a}_{k},\gamma^{a}_{k}(\ell_{e})+1_{v})$ and there is a gate $g^{1-i}_{b}$ in $D_{k_{1-i}}$ computing $T(G^{b}_{k},\gamma^{b}_{k}(\overline{\ell_{e}})+1_{b})$ . But $\gamma_{k}(\ell_{e})=\gamma_{k}(\overline{\ell_{e}})+1_{a}+1_{b}\mod 2$ , so $\gamma^{b}_{k}(\ell_{e})=\gamma^{b}_{k}(\overline{\ell_{e}})+1_{b}\mod 2$ , therefore $g^{i-1}_{b}$ computes the formula $T(G^{b}_{k},\gamma^{b}_{k}(\ell_{e}))$ . So we add an $\land$ -gate $g_{v}$ whose left input is $\ell_{e}$ and whose right input is $s^{i}_{v}\land s^{1-i}_{b}$ and add it to $D_{k-1}$ . Note that $\land$ -gates are decomposable since $G^{a}_{k}$ and $G^{b}_{k}$ share no edge and therefore $D_{k_{0}}$ and $D_{k_{1}}$ are on disjoint sets of variables.

Let $D_{k}$ be the circuit after all $g_{v}$ have been added to $D_{k-1}$ . It is a DNNF satisfying both $D_{k-1}\subseteq D_{k}$ and $(\ast)$ .

It only remains to bound $|D_{S}|$ . To this end, observe that when constructing $D_{k}$ from $D_{k-1}$ we add at most $3\times|V_{k}|$ gates, so $|D_{S}|$ is at most $3(|V_{1}|+\dots+|V_{S}|)=O(S\times|V(G)|)$ . Finally, take any root of $D_{S}$ and delete all gates not reached from it, the resulting circuit is a DNNF $D$ computing a satisfiable Tseitin formula $T(G,c^{\prime})$ . We get a DNNF computing $T(G,c^{*})$ using Lemma 2.∎

Combining Corollary 1, Lemma 3 and Lemma 4 yields Theorem 3.1.

4 Adversarial Rectangle Bounds

In this section, we introduce the game we will use to show DNNF lower bounds for Tseitin formulas. It is based on combinatorial rectangles, a basic object of study from communication complexity.

Definition 2

A (combinatorial) rectangle for a variable partition $(X_{1},X_{2})$ of a variables set $X$ is defined to be a set of assignments of the form $R=A\times B$ where $A\subseteq\{0,1\}^{X_{1}}$ and $B\subseteq\{0,1\}^{X_{2}}$ . The rectangle is called balanced when $\frac{|X|}{3}\leq|X_{1}|,|X_{2}|\leq\frac{2|X|}{3}$ .

A rectangle on variables $X$ may be seen as a function whose satisfying assignments are exactly the $a\cup b$ for $a\in A$ and $b\in B$ , so we sometimes interpret rectangles as Boolean functions whenever it is convenient.

Definition 3

Let $f$ be a Boolean function. A balanced rectangle cover of $f$ is a collection $\mathcal{R}=\{R_{1},\dots,R_{K}\}$ of balanced rectangles on $var(f)$ , possibly for different partitions of $var(f)$ , such that $f$ is equivalent to $\bigvee_{i=1}^{K}R_{i}$ . The minimum number of rectangles in a balanced cover of $f$ is denoted by $R(f)$ .

Theorem 4.1

[8] Let $D$ be a DNNF computing a function $f$ , then $R(f)\leq|D|$ .

When trying to show parameterized lower bounds with Theorem 4.1, one often runs into the problem that it is somewhat inflexible: the partitions of the rectangles in covers have to be balanced, but in parameterized applications this is often undesirable. Instead, to show good lower bounds, one wants to be able to partition in places that allow to cut in complicated subparts of the problem. This is e.g. the underlying technique in [22]. To make this part of the lower bound proofs more explicit and the technique more reusable, we here introduce a refinement of Theorem 4.1.

We define the adversarial multi-partition rectangle cover game for a function $f$ on variables $X$ and a set $S\subseteq sat(f)$ to be played as follows: two players, the cover player Charlotte and her adversary Adam, construct in several rounds a set $\mathcal{R}$ of combinatorial rectangles that cover the set $S$ respecting $f$ (that is, rectangles in $\mathcal{R}$ contain only models of $f$ ). The game starts with $\mathcal{R}$ as the empty set. Charlotte starts a round by choosing an input $a\in S$ and a v-tree $T$ of $X$ . Now Adam chooses a partition $(X_{1},X_{2})$ of $X$ induced by $T$ . Charlotte ends the round by adding to $\mathcal{R}$ a combinatorial rectangle for this partition and respecting $f$ that covers $a$ . The game is over when $S$ is covered by $\mathcal{R}$ . The adversarial multi-partition rectangle complexity of $f$ and $S$ , denoted by $aR(f,S)$ is the minimum number of rounds in which Charlotte can finish the game, whatever the choices of Adam are. The following theorem gives the core technique for showing lower bounds later on.

Theorem 4.2

Let $D$ be a complete DNNF computing a function $f$ and let $S\subseteq sat(f)$ . Then $aR(f,S)\leq|D|$ .

Proof

Let $X=var(D)$ . We iteratively delete vertices from $D$ and construct rectangles. The approach is as follows: Charlotte chooses an assignment $a\in S$ not yet in any rectangle she constructed before and a proof tree $T$ accepting $a$ in $D$ . By completeness of $D$ , all variables of $X$ appear exactly once in $T$ . Charlotte constructs a v-tree of $X$ from $T$ by deleting negations on the leaves, contracting away nodes with a single child and forgetting the labels of all operation gates. Now Adam chooses a partition induced by $T$ given by a subtree of $T$ with root $v$ . Note that $v$ is a gate of $C$ . Let $sat(D,v)\subseteq sat(f)$ be the assignments to $X$ accepted by a proof tree of $C$ passing through $v$ , and observe that $sat(D,v)$ is a combinatorial rectangle $A\times B$ with $A\subseteq\{0,1\}^{var(v)}$ and $B\subseteq\{0,1\}^{X\setminus var(v)}$ . Charlotte chooses the rectangle $sat(D,v)$ , deletes it from $S$ and the game continues.

Note that the vertex $v$ in the above construction is different for every iteration of the game: by construction, Charlotte never chooses an assignment $a$ that is in any set $sat(D,v)$ for a vertex $v$ that has appeared before. Thus, no such $v$ can appear in the proof tree of the chosen $a$ . Consequently, a new vertex $v$ is chosen for each assignment $a$ that Charlotte chooses and thus the game will never last more than $|D|$ rounds. ∎

5 Splitting Parity Constraints

In this section, we will see that rectangles split parity constraints in a certain sense and show how this is reflected in in the underlying graph of Tseitin-formulas. This will be crucial in proving the DNNF lower bound in the next section with the adversarial multi-partition rectangle cover game.

5.1 Rectangles Induce Sub-Constraints for Tseitin-Formulas

Let $R$ be a rectangle for the partition $(E_{1},E_{2})$ of $E(G)$ such that $R\subseteq sat(T(G,c))$ . Assume that there is a vertex $v$ of $G$ incident to edges in $E_{1}$ and to edges in $E_{2}$ , i.e., $E(v)=E_{1}(v)\cup E_{2}(v)$ where neither $E_{1}(v)$ not $E_{2}(v)$ is empty. We will show that $R$ does not only respect $\chi_{v}$ , but it also respects a sub-constraint of $\chi_{v}$ .

Definition 4

Let $\chi_{v}$ be a parity constraint on $(x_{e})_{e\in E(v)}$ . A sub-constraint of $\chi_{v}$ is a parity constraint $\chi^{\prime}_{v}$ on a non-empty proper subset of the variables of $\chi_{v}$ .

Lemma 5

Let $T(G,c)$ be a satisfiable Tseitin-formula and let $R$ be a rectangle for the partition $(E_{1},E_{2})$ of $E(G)$ such that $R\subseteq sat(T(G,c))$ . If $v\in V(G)$ is incident to edges in $E_{1}$ and to edges in $E_{2}$ , then there exists a sub-constraint $\chi^{\prime}_{v}$ of $\chi_{v}$ such that $R\subseteq sat(T(G,c)\land\chi^{\prime}_{v})$ .

Proof

Let $a_{1}\cup a_{2}\in R$ where $a_{1}$ is an assignment to $E_{1}$ and $a_{2}$ an assignment to $E_{2}$ . Let $a_{1}(v)$ and $a_{2}(v)$ denote the restriction of $a_{1}$ and $a_{2}$ to $E_{1}(v)$ and $E_{2}(v)$ , respectively. We claim that for all $a^{\prime}_{1}\cup a^{\prime}_{2}\in R$ , we have that $a^{\prime}_{1}(v)$ and $a_{1}(v)$ have the same parity, that is, $a_{1}(v)$ assigns an odd number of variables of $E_{1}(v)$ to 1 if and only if it is also the case for $a^{\prime}_{1}(v)$ . Indeed if $a_{1}(v)$ and $a^{\prime}_{1}(v)$ have different parities, then so do $a_{1}(v)\cup a_{2}(v)$ and $a^{\prime}_{1}(v)\cup a_{2}(v)$ . So either $a_{1}\cup a_{2}$ or $a^{\prime}_{1}\cup a_{2}$ falsifies $\chi_{v}$ , but both assignments are in $R$ , so $a_{1}(v)$ and $a^{\prime}_{1}(v)$ cannot have different parities as this contradicts $R\subseteq sat(T(G,c))$ . Let $c_{1}$ be the parity of $a_{1}(v)$ , then we have that assignments in $R$ must satisfy $\chi^{\prime}_{v}:\sum_{e\in E_{1}(v)}x_{e}=c_{1}\mod 2$ , so $R\subseteq sat(T(G,c)\land\chi^{\prime}_{v})$ . ∎

Renaming $\chi^{\prime}_{v}$ as $\chi^{1}_{v}$ and adopting notations from the proof, one sees that $\chi^{1}_{v}\land\chi_{v}\equiv\chi^{1}_{v}\land\chi^{2}_{v}$ where $\chi^{2}_{v}:\sum_{e\in E_{2}(v)}x_{e}=c(v)+c_{1}\mod 2$ . So $R$ respects the formula $(T(G,c)-\chi_{v})\land\chi^{1}_{v}\land\chi^{2}_{v}$ where $(T(G,c)-\chi_{v})$ is the formula obtained by removing all clauses of $\chi_{v}$ from $T(G,c)$ . In this sense, the rectangle is splitting the constraint $\chi_{v}$ into two subconstraints in disjoint variables. Since $\chi_{v}\equiv(\chi^{1}_{v}\land\chi^{2}_{v})\lor(\overline{\chi}^{1}_{v}\land\overline{\chi}^{2}_{v})$ it is plausible that potentially many models of $\chi_{v}$ are not in $R$ . We show that this is true in the next section.

5.2 Vertex Splitting and Sub-constraints for Tseitin-Formulas

Let $v\in V(G)$ and let $(N_{1},N_{2})$ be a proper partition of $N(v)$ , that is, neither $N_{1}$ nor $N_{2}$ is empty. The graph $G^{\prime}$ we get by splitting $v$ along $(N_{1},N_{2})$ is defined as the graph we get by deleting $v$ , adding two vertices $v^{1}$ and $v^{2}$ , and connecting $v^{1}$ to all vertices in $N_{1}$ and $v^{2}$ to all vertices in $N_{2}$ . We now show that splitting a vertex $v$ in a graph $G$ has the same effect as adding a sub-constraint of $\chi_{v}$ .

Lemma 6

Let $T(G,c)$ be a Tseitin-formula. Let $v\in V(G)$ and let $(N_{1},N_{2})$ be a proper partition of $N(v)$ . Let $c_{1}$ and $c_{2}$ be such that $c_{1}+c_{2}=c(v)\mod 2$ and let $\chi^{i}_{v}:\sum_{u\in N_{i}}x_{uv}=c_{i}\mod 2$ for $i\in\{1,2\}$ be sub-constraints of $\chi_{v}$ . Call $G^{\prime}$ the result of splitting $v$ along $(N_{1},N_{2})$ and set

\displaystyle c^{\prime}(u):=\begin{cases}c(u),&\text{ if }u\in V(G)\setminus\{v\}\\ c_{i},&\text{ if }u=v^{i},i\in\{1,2\}\end{cases}

There is a bijection $\rho:var(T(G,c))\rightarrow var(T(G^{\prime},c^{\prime}))$ acting as a renaming of the variables such that $T(G^{\prime},c^{\prime})\equiv(T(G,c)\land\chi^{1}_{v})\circ\rho$ .

Proof

Denote by $T(G,c)-\chi_{v}$ the formula equivalent to the conjunction of all $\chi_{u}$ for $u\in V(G)\setminus\{v\}$ . Then $T(G,c)\land\chi^{1}_{v}\equiv(T(G,c)-\chi_{v})\land\chi^{1}_{v}\land\chi^{2}_{v}$ . The constraints $\chi_{u}$ for $u\in V(G)\setminus\{v\}$ appear in both $T(G^{\prime},c^{\prime})$ and in $T(G,c)-\chi_{v}$ and the sub-constraints $\chi^{1}_{v}$ and $\chi^{2}_{v}$ are exactly the constraints for $v^{1}$ and $v^{2}$ in $T(G^{\prime},c^{\prime})$ modulo the variable renaming $\rho$ defined by $\rho(x_{uv})=x_{uv^{1}}$ when $u\in N_{1}$ , $\rho(x_{uv})=x_{uv^{2}}$ when $u\in N_{2}$ , and $\rho(x_{e})=x_{e}$ when $v$ is not incident to $e$ . ∎

Intuitely, Lemma 6 says that splitting a vertex in $G$ and adding sub-constraint are essentially the same operation. This allows us to compute the number of models of a Tseitin-formula to which a sub-constraint was added.

Lemma 7

Let $T(G,c)$ be a satisfiable Tseitin-formula where $G$ is connected. Define $T(G^{\prime},c^{\prime})$ as in Lemma 6. If $G^{\prime}$ is connected then $T(G^{\prime},c^{\prime})$ has $2^{|E(G)|-|V(G)|}$ models.

Proof

By Proposition 1, $T(G^{\prime},c^{\prime})$ is satisfiable since $T(G,c)$ is satisfiable and $\sum_{u\in V(G^{\prime})}c^{\prime}(u)=\sum_{u\in V(G)}c(u)=0\mod 2$ . Using Proposition 2 yields that $T(G^{\prime},c^{\prime})$ has $2^{|E(G^{\prime})|-|V(G^{\prime})|+1}=2^{|E(G)|-|V(G)|}$ models. ∎

Lemma 8

Let $T(G,c)$ be a satisfiable Tseitin-formula where $G$ is connected. Let $\{v_{1},\ldots,v_{k}\}$ be an independent set in $G$ . For all $i\in[k]$ let $(N_{1}^{i},N_{2}^{i})$ be a proper partition of $N(v_{i})$ and let $\chi^{\prime}_{v_{i}}:\sum_{u\in N^{i}_{1}}x_{uv_{i}}=c_{i}\mod 2$ . If the graph obtained by splitting all $v_{i}$ along $(N_{1}^{i},N_{2}^{i})$ is connected, then the formula $T(G,c)\land\chi^{\prime}_{v_{1}}\land\dots\land\chi^{\prime}_{v_{k}}$ has $2^{|E(G)|-|V(G)|-k+1}$ models.

Proof

An easy induction based on Lemma 6 and Lemma 7. The induction works since, $\{v_{1},\ldots,v_{k}\}$ being an independant set, the edges to modify by splitting $v_{i}$ are still in the graph where $v_{1},\dots,v_{i-1}$ have been split. ∎

5.3 Vertex Splitting in 3-Connected Graphs

When we want to apply the results of the last sections to bound the size of rectangles, we require that the graph $G$ remains connected after splitting vertices. This is obviously not true for all choices of vertex splits, but here we will see that if $G$ is sufficiently connected, then we can always chose a large subset of any set of potential splits such that, after applying the split for this subset, $G$ remains connected.

Lemma 9

Let $G$ be a $3$ -connected graph of and let $\{v_{1},\ldots,v_{k}\}$ be an independent set in $G$ . For every $i\in[k]$ let $(N_{1}^{i},N_{2}^{i})$ be a proper partition of $N(v_{i})$ . Then there is a subset $S$ of $\{v_{1},\ldots,v_{k}\}$ of size at least $k/3$ such that the graph resulting from splitting all $v_{i}\in S$ along the corresponding $(N_{1}^{i},N_{2}^{i})$ is connected.

Proof

Let $C_{1},\ldots,C_{r}$ be the connected components of the graph $G_{1}$ that we get by splitting all $v_{i}$ . If $G_{1}$ is connected, then we can set $S=\{v_{1},\ldots,v_{k}\}$ and we are done. So assume that $r>1$ in the following. Now add for every $i\in[k]$ the edge $(v^{1}_{i},v_{i}^{2})$ . Call this edge set $L$ (for links) and the resulting graph $G_{2}$ . Note that $G_{2}$ is connected and for every edge set $E^{\prime}\subseteq L$ we have that $G_{2}\setminus E^{\prime}$ is connected if and only if $G$ is connected after splitting the vertices corresponding to the edges in $E^{\prime}$ . Denote by $L_{in}$ the edges in $L$ whose end points both lie in some component $C_{j}$ and let $L_{out}:=L\setminus L_{in}$ .

We claim that for every $C_{j}$ , at least three edges in $L_{out}$ are incident to a vertex in $C_{j}$ . Since $G_{2}$ is connected but the set $C_{j}$ is a connected component of $G_{2}\setminus L=G_{1}$ , there must be at least one edge in $L$ incident to a vertex in $C_{j}$ . That vertex is by construction one of $v_{1},\ldots,v_{k}$ , say it is $v_{i}$ . Since $N_{1}^{i}\neq\emptyset$ and $N_{2}^{i}\neq\emptyset$ , we have that $v_{i}$ has a neighbor $w$ in $C_{j}$ and, $w\not\in\{v_{1},\ldots,v_{k}\}$ since it is an independent set. Now let $L^{j}_{out}$ be the edges in $L_{out}$ that have an end point in $C_{j}$ . Note that if we delete the vertices $S^{j}\subseteq\{v_{1},\ldots,v_{k}\}$ for which the edges in $L^{j}_{out}$ were introduced in the construction of $G_{2}$ , then a subset of $C_{j}$ becomes disconnected from the rest of the graph (which is non-empty because there is at least one component different from $C_{j}$ in $G_{2}$ which also contains a vertex not in $\{v_{1},\ldots,v_{k}\}$ by the same reasoning as before). But then, because $G$ is $3$ -connected, there must be at least three edges in $L^{j}_{out}$ . Let $k^{\prime}:=|L_{out}|$ , then by the handshaking lemma,

\displaystyle r\leq\frac{2}{3}k^{\prime}.

Now contract all components $C_{i}$ in $G_{2}$ and call the resulting graph $G_{3}$ . Note that $G_{3}$ is connected and that $E(G_{3})=L_{out}$ . Moreover, whenever $G_{3}\setminus E^{*}$ is connected for some $E^{*}\subseteq L_{out}$ , then $G$ is connected after splitting the corresponding vertices. Choose any spanning tree $T$ of $G_{3}$ . Then $|E(T)|=r-1$ and deleting $E^{*}:=L_{out}\setminus E(T)$ leaves $G_{3}$ connected. Thus the graph $G^{*}$ we get from $G$ after splitting the vertices corresponding to $E^{*}$ is connected. We have

\displaystyle|E^{*}|=|L_{out}|-|E(T)|=k^{\prime}-(r-1)>\frac{k^{\prime}}{3}.

Now observe that in $G$ we can safely split all $k-k^{\prime}$ vertices $v_{i}$ that correspond to edges $v_{i}^{1}v_{i}^{2}$ such that $v_{i}^{1}$ and $v_{i}^{2}$ lie in the same component of $G_{1}$ without disconnecting the graph. Thus, overall we can split a set of size

\displaystyle k-k^{\prime}+|E^{*}|>k-k^{\prime}+\frac{k^{\prime}}{3}\geq\frac{k}{3}

in $G$ such that the resulting graph remains connected. ∎

6 DNNF Lower Bounds for Tseitin-Formulas

In this section, we use the results of the previous sections to show our lower bounds for DNNF computing Tseitin-formulas. To this end, we first show that we can restrict ourselves to the case of $3$ -connected graphs.

6.1 Reduction from Connected to 3-Connected Graphs

In [7], Bodlaender and Koster study how separators can be used in the context of treewidth. They call a separator $S$ safe for treewidth if there exists a connected component of $G\setminus S$ whose vertex set $V^{\prime}$ is such that $tw(G[S\cup V^{\prime}]+clique(S))=tw(G)$ , where $G[S\cup V^{\prime}]+clique(S)$ is the graph induced on $S\cup V^{\prime}$ with additional edges that pairwise connect all vertices in $S$ .

Lemma 10

[7, Corollary 15] Every separator of size 1 is safe for treewdith. When $G$ has no separator of size 1, every separator of size 2 is safe for treewidth.

Remember that a topological minor $H$ of a $G$ is a graph that can be constructed from $G$ by iteratively applying the following operations:

$-$

edge deletion,
$-$

deletion of isolated vertices, or
$-$

subdivision elimination: if $\deg(v)=2$ delete $v$ and connect its two neighbors.

Lemma 11

Let $H$ be a topological minor of $G$ . If the satisfiable Tseitin-formula $T(G,0)$ has a DNNF of size $s$ , then so does $T(H,0)$ .

Proof

Edge deletion corresponds to conditioning the variable by $0$ so it cannot increase the size of a DNNF. Deletion of an isolated vertex does not change the Tseitin-formula. Finally, let $e_{1},e_{2}$ be the edges incident to a vertex of degree $2$ . Since we assume that all charges $c(v)$ are $0$ , in every satisfying assignment, $x_{e_{1}}$ and $x_{e_{2}}$ take the same value. Thus we can simply forget the variable of $x_{e_{2}}$ which does not increase the size of a DNNF [11]. ∎

Lemma 12

Let $G$ be a graph with treewidth at least $3$ . Then $G$ has a $3$ -connected topological minor $H$ with $tw(H)=tw(G)$ .

Proof

First we construct a topological minor of $G$ with no separator of size $1$ that preserves treewidth. Let $S=\{v\}$ be a separator of size 1 of $G$ , then $G\setminus S$ has a connected component $V^{\prime}$ such that $G[S\cup V^{\prime}]+clique(S)=G[S\cup V^{\prime}]$ has treewidth $tw(G)$ . Let $G^{\prime}=G[S\cup V^{\prime}]$ , then $tw(G^{\prime})=tw(G)$ . Observe that $G^{\prime}$ is a topological minor (remove all edges not in $G[S\cup V^{\prime}]$ thus isolating all vertices not in $S\cup V^{\prime}$ , which are then deleted) where $S$ is no longer a separator. Repeat the construction until $G^{\prime}$ has no separator of size 1.

Now assume $S=\{u,v\}$ is a separator of $G^{\prime}$ . If $V^{\prime}$ are the vertices of a connected component of $G^{\prime}\setminus S$ , then there is a path from $u$ to $v$ in $G[S\cup V^{\prime}]$ since otherwise either $\{u\}$ or $\{v\}$ is a separator of size $1$ of $G^{\prime}$ . Lemma 10 ensures that there is a connected component $H^{\prime}$ in $G^{\prime}\setminus S$ such that $H:=(V(H^{\prime})\cup S,E(H^{\prime})\cup\{uv\})$ has treewidth $tw(H)=tw(G^{\prime})=tw(G)$ . Let us prove that $H$ is topological minor of $G^{\prime}$ . Consider a connected component of $G^{\prime}\setminus S$ distinct from $H^{\prime}$ with vertices $V^{\prime}$ and let $P$ be a path connecting $u$ to $v$ in $G[S\cup V^{\prime}]$ . Delete all edges of $G[S\cup V^{\prime}]$ not in $P$ , then delete all isolated vertices in $V^{\prime}$ so that only $P$ remains, finally use subdivision elimination to reduce $P$ to a single edge $uv$ . Repeat the procedure for all connected components of $G^{\prime}\setminus S$ distinct from $H^{\prime}$ , the resulting topological minor is $G[V(H^{\prime})\cup S]$ with the (additional) edge $uv$ , so $H$ .

Repeat the construction until there are no separators of size $1$ or size $2$ left. Note that this process eventually terminates since the number of vertices decreases after every separator elimination. The resulting graph $H$ is a topological minor of $G$ of treewidth $tw(G)$ without separators of size $1$ or $2$ . Since $tw(H)=tw(G)\geq 3$ , we have that $H$ has at least $4$ vertices, so $H$ is $3$ -connected. ∎

6.2 Proof of the DNNF Lower Bound and of the Main Result

Lemma 13

Let $T(G,c)$ be a satisfiable Tseitin-formula where $G$ is a connected graph with maximum degree at most $\Delta$ . Any complete DNNF computing $T(G,c)$ has size at least $2^{\Omega(tw(G)/\Delta)}$ .

Proof

By Lemma 2 we can set $c=0$ . By Lemmas 11 and 12 we can assume that $G$ is 3-connected. We show that the adversarial multi-partition rectangle complexity is lower-bounded by $2^{k}$ for $k:=\frac{2tw(G)}{9\Delta}$ . To this end, we will show that the rectangles that Charlotte can construct after Adam’s answer are never bigger than $2^{|E(G)|-|V(G)|-k+1}$ . Since $T(G,c)$ has $2^{|E(G)|-|V(G)|+1}$ models, the claim then follows.

So let Charlotte choose an assignment $a$ and a v-tree $T$ . Note that since the variables of $T(G,0)$ are the edges of $G$ , the v-tree $T$ is also a branch decomposition of $G$ . Now by the definition of branchwidth, Adam can choose a cut of $T$ inducing a partition $(E_{1},E_{2})$ of $E(G)$ for which there exists a set $V^{\prime}\in V(G)$ of at least $bw(G)\geq\frac{2}{3}tw(G)$ vertices incident to edges in $E_{1}$ and to edges in $E_{2}$ .

$G$ has maximum degree $\Delta$ so there is an independent set $V^{\prime\prime}\subset V^{\prime}$ of size at least $\frac{|V^{\prime}|}{\Delta}$ . Since $G$ is $3$ -connected, by Lemma 9 there is a subset $V^{*}\subseteq V^{\prime\prime}$ of size at least $\frac{|V^{\prime\prime}|}{3}\geq\frac{2tw(G)}{9\Delta}=k$ such that $G$ remains connected after splitting of the nodes in $V^{*}$ along the partition of their neighbors induced by the edges partition $(E_{1},E_{2})$ . Using Lemma 5, we find that any rectangle $R$ for the partition $(E_{1},E_{2})$ respects a sub-constraint $\chi^{\prime}_{v}$ for each $v\in V^{*}$ . So $R$ respects $T(G,0)\land\bigwedge_{v\in V^{*}}\chi^{\prime}_{v}$ . Finally, Lemma 8 shows that $|R|\leq 2^{|E(G)|-|V(G)|-k+1}$ , as required. ∎

Theorem 1.1 is now a direct consequence of Theorem 3.1, Lemma 13 and Lemma 2

7 Conclusion

We have shown that the unsatisfiable Tseitin-formulas with polynomial length of regular resolution refutations are completely determined by the treewidth of the underlying graphs. We did this by giving a connection between lower bounds for regular resolution refutations and size bounds of DNNF representations of Tseitin-formulas. Moreover, we introduced a new two-player game that allowed us to show DNNF lower bounds.

Let us discuss some questions that we think are worth exploring in the future. First, it would be interesting to see if a $2^{\Omega(tw(G))}$ lower bound for the refutation of Tseitin-formulas can also be shown for general resolution. In that case the length of resolution refutations would essentially be the same as that regular resolution refutations for Tseitin formulas. Note that this is somewhat plausible since other measures like space and width are known to be the same for the two proof systems for these formulas [14].

Another question is the relation between knowledge compilation and proof complexity. As far as we are aware, our Theorem 3.1 is the first result that connects bounds on DNNF to such in proof complexity. It would be interesting to see if this connection can be strenghtened to other classes of instances, other proof systems, representations from knowledge compilation and measures on proofs and representations, respectively.

References

[1] Alekhnovich, M., Johannsen, J., Pitassi, T., Urquhart, A.: An exponential separation between regular and general resolution. Theory Comput. 3(1), 81–102 (2007). https://doi.org/10.4086/toc.2007.v003a005, https://doi.org/10.4086/toc.2007.v003a005
[2] Alekhnovich, M., Razborov, A.A.: Satisfiability, branch-width and tseitin tautologies. Comput. Complex. 20(4), 649–678 (2011). https://doi.org/10.1007/s00037-011-0033-1, https://doi.org/10.1007/s00037-011-0033-1
[3] Atserias, A., Bonacina, I., de Rezende, S.F., Lauria, M., Nordström, J., Razborov, A.A.: Clique is hard on average for regular resolution. In: Diakonikolas, I., Kempe, D., Henzinger, M. (eds.) Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2018, Los Angeles, CA, USA, June 25-29, 2018. pp. 866–877. ACM (2018). https://doi.org/10.1145/3188745.3188856, https://doi.org/10.1145/3188745.3188856
[4] Beame, P., Beck, C., Impagliazzo, R.: Time-space tradeoffs in resolution: superpolynomial lower bounds for superlinear space. In: Karloff, H.J., Pitassi, T. (eds.) Proceedings of the 44th Symposium on Theory of Computing Conference, STOC 2012, New York, NY, USA, May 19 - 22, 2012. pp. 213–232. ACM (2012). https://doi.org/10.1145/2213977.2213999, https://doi.org/10.1145/2213977.2213999
[5] Beck, C., Impagliazzo, R.: Strong ETH holds for regular resolution. In: Boneh, D., Roughgarden, T., Feigenbaum, J. (eds.) Symposium on Theory of Computing Conference, STOC’13, Palo Alto, CA, USA, June 1-4, 2013. pp. 487–494. ACM (2013). https://doi.org/10.1145/2488608.2488669, https://doi.org/10.1145/2488608.2488669
[6] Ben-Sasson, E.: Hard examples for the bounded depth frege proof system. Comput. Complex. 11(3-4), 109–136 (2002). https://doi.org/10.1007/s00037-002-0172-5, https://doi.org/10.1007/s00037-002-0172-5
[7] Bodlaender, H.L., Koster, A.M.C.A.: Safe separators for treewidth. Discret. Math. 306(3), 337–350 (2006). https://doi.org/10.1016/j.disc.2005.12.017, https://doi.org/10.1016/j.disc.2005.12.017
[8] Bova, S., Capelli, F., Mengel, S., Slivovsky, F.: Knowledge compilation meets communication complexity. In: Kambhampati, S. (ed.) Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, IJCAI 2016, New York, NY, USA, 9-15 July 2016. pp. 1008–1014. IJCAI/AAAI Press (2016), http://www.ijcai.org/Abstract/16/147
[9] Buss, S., Nordström, J.: Proof complexity and sat solving. Chapter to appear in the 2nd edition of Handbook of Satisfiability, Draft version available at https://www. math. ucsd. edu/ sbuss/ResearchWeb/ProofComplexitySAT (2019)
[10] Darwiche, A.: Decomposable negation normal form. J. ACM 48(4), 608–647 (2001). https://doi.org/10.1145/502090.502091, https://doi.org/10.1145/502090.502091
[11] Darwiche, A., Marquis, P.: A knowledge compilation map. J. Artif. Intell. Res. 17, 229–264 (2002). https://doi.org/10.1613/jair.989, https://doi.org/10.1613/jair.989
[12] Davis, M., Logemann, G., Loveland, D.W.: A machine program for theorem-proving. Commun. ACM 5(7), 394–397 (1962). https://doi.org/10.1145/368273.368557, https://doi.org/10.1145/368273.368557
[13] Davis, M., Putnam, H.: A computing procedure for quantification theory. J. ACM 7(3), 201–215 (1960). https://doi.org/10.1145/321033.321034, http://doi.acm.org/10.1145/321033.321034
[14] Galesi, N., Talebanfard, N., Torán, J.: Cops-robber games and the resolution of tseitin formulas. ACM Trans. Comput. Theory 12(2), 9:1–9:22 (2020). https://doi.org/10.1145/3378667, https://doi.org/10.1145/3378667
[15] Glinskih, L., Itsykson, D.: Satisfiable tseitin formulas are hard for nondeterministic read-once branching programs. In: Larsen, K.G., Bodlaender, H.L., Raskin, J. (eds.) 42nd International Symposium on Mathematical Foundations of Computer Science, MFCS 2017, August 21-25, 2017 - Aalborg, Denmark. LIPIcs, vol. 83, pp. 26:1–26:12. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2017). https://doi.org/10.4230/LIPIcs.MFCS.2017.26, https://doi.org/10.4230/LIPIcs.MFCS.2017.26
[16] Goerdt, A.: Regular resolution versus unrestricted resolution. SIAM J. Comput. 22(4), 661–683 (1993). https://doi.org/10.1137/0222044, https://doi.org/10.1137/0222044
[17] Harvey, D.J., Wood, D.R.: Parameters tied to treewidth. J. Graph Theory 84(4), 364–385 (2017). https://doi.org/10.1002/jgt.22030, https://doi.org/10.1002/jgt.22030
[18] Itsykson, D., Oparin, V.: Graph expansion, tseitin formulas and resolution proofs for CSP. In: Bulatov, A.A., Shur, A.M. (eds.) Computer Science - Theory and Applications - 8th International Computer Science Symposium in Russia, CSR 2013, Ekaterinburg, Russia, June 25-29, 2013. Proceedings. Lecture Notes in Computer Science, vol. 7913, pp. 162–173. Springer (2013). https://doi.org/10.1007/978-3-642-38536-0_14, https://doi.org/10.1007/978-3-642-38536-0_14
[19] Itsykson, D., Riazanov, A., Sagunov, D., Smirnov, P.: Almost tight lower bounds on regular resolution refutations of tseitin formulas for all constant-degree graphs. Electron. Colloquium Comput. Complex. 26, 178 (2019), https://eccc.weizmann.ac.il/report/2019/178
[20] Lovász, L., Naor, M., Newman, I., Wigderson, A.: Search problems in the decision tree model. SIAM J. Discret. Math. 8(1), 119–132 (1995). https://doi.org/10.1137/S0895480192233867, https://doi.org/10.1137/S0895480192233867
[21] Nordström, J.: On the interplay between proof complexity and SAT solving. ACM SIGLOG News 2(3), 19–44 (2015), https://dl.acm.org/citation.cfm?id=2815497
[22] Razgon, I.: On the read-once property of branching programs and cnfs of bounded treewidth. Algorithmica 75(2), 277–294 (2016). https://doi.org/10.1007/s00453-015-0059-x, https://doi.org/10.1007/s00453-015-0059-x
[23] Tseitin, G.: On the complexity of derivation in propositional calculus. Studies in Constructive Mathematics and Mathematical Logic Part 2, 115–125 (1968)
[24] Urquhart, A.: Hard examples for resolution. J. ACM 34(1), 209–219 (1987). https://doi.org/10.1145/7531.8928, https://doi.org/10.1145/7531.8928
[25] Urquhart, A.: A near-optimal separation of regular and general resolution. SIAM J. Comput. 40(1), 107–121 (2011). https://doi.org/10.1137/090772897, https://doi.org/10.1137/090772897
[26] Vatshelle, M.: New Width Parameters of Graphs. Ph.D. thesis, Department of Informatics, University of Bergen (2012)
[27] Vinyals, M., Elffers, J., Johannsen, J., Nordström, J.: Simplified and improved separations between regular and general resolution by lifting. In: Pulina, L., Seidl, M. (eds.) Theory and Applications of Satisfiability Testing - SAT 2020 - 23rd International Conference, Alghero, Italy, July 3-10, 2020, Proceedings. Lecture Notes in Computer Science, vol. 12178, pp. 182–200. Springer (2020). https://doi.org/10.1007/978-3-030-51825-7_14, https://doi.org/10.1007/978-3-030-51825-7_14

Characterizing Tseitin-formulas with short regular resolution refutations††thanks: This work has been partly supported by the PING/ACK project of the French National Agency for Research (ANR-18-CE40-0011).

Abstract

Keywords:

1 Introduction

Theorem 1.1

2 Preliminaries

Notions on Graphs.

Lemma 1

Variables, assignments, v-trees.

Tseitin-Formulas.

Proposition 1

Proposition 2

DNNF.

Lemma 2

Proof (sketch)

Branching programs.

Regular Resolution.

Theorem 2.1

Corollary 1

3 Reduction From Unsatisfiable to Satisfiable Formulas

Theorem 3.1

3.1 Well-structured branching programs for SearchVertex​(G,c)\textup{SearchVertex}(G,c)

Example 1

Definition 1

Lemma 3

3.2 Constructing DNNF from Well-structured branching programs

Lemma 4

Proof

4 Adversarial Rectangle Bounds

Definition 2

Definition 3

Theorem 4.1

Theorem 4.2

Proof

5 Splitting Parity Constraints

5.1 Rectangles Induce Sub-Constraints for Tseitin-Formulas

Definition 4

Lemma 5

Proof

5.2 Vertex Splitting and Sub-constraints for Tseitin-Formulas

Lemma 6

Proof

Lemma 7

Proof

Lemma 8

Proof

5.3 Vertex Splitting in 3-Connected Graphs

Lemma 9

Proof

6 DNNF Lower Bounds for Tseitin-Formulas

6.1 Reduction from Connected to 3-Connected Graphs

Lemma 10

Lemma 11

Proof

Lemma 12

Proof

6.2 Proof of the DNNF Lower Bound and of the Main Result

Lemma 13

Proof

7 Conclusion

References

Characterizing Tseitin-formulas with short regular resolution refutations^†^†thanks: This work has been partly supported by the PING/ACK project of the French National Agency for Research (ANR-18-CE40-0011).

3.1 Well-structured branching programs for $\textup{SearchVertex}(G,c)$