Forbidden Induced Subgraphs and the Łoś-Tarski Theorem

Yijia Chen
School of Computer Science
Fudan University
yijiachen@fudan.edu.cn
Jörg Flum
Mathematisches Institut
Universität Freiburg
joerg.flum@math.uni-freiburg.de

Abstract

Let $\mathscr{C}$ be a class of finite and infinite graphs that is closed under induced subgraphs. The well-known Łoś-Tarski Theorem from classical model theory implies that $\mathscr{C}$ is definable in first-order logic (FO) by a sentence $\varphi$ if and only if $\mathscr{C}$ has a finite set of forbidden induced finite subgraphs. It provides a powerful tool to show nontrivial characterizations of graphs of small vertex cover, of bounded tree-depth, of bounded shrub-depth, etc. in terms of forbidden induced finite subgraphs. Furthermore, by the Completeness Theorem, we can compute from $\varphi$ the corresponding forbidden induced subgraphs. We show that this machinery fails on finite graphs.

–

There is a class $\mathscr{C}$ of finite graphs which is definable in FO and closed under induced subgraphs but has no finite set of forbidden induced subgraphs.
–

Even if we only consider classes $\mathscr{C}$ of finite graphs which can be characterized by a finite set of forbidden induced subgraphs, such a characterization cannot be computed from an FO-sentence $\varphi$ , which defines $\mathscr{C}$ , and the size of the characterization cannot be bounded by $f(|\varphi|)$ for any computable function $f$ .

Besides their importance in graph theory, the above results also significantly strengthen similar known results for arbitrary structures.

1 Introduction

Many classes of graphs can be defined by a finite set of forbidden induced finite subgraphs. One of the simplest examples is the class of graphs of bounded degree. Let $d\geq 1$ and $\mathscr{F}_{d}$ consist of all graphs with vertex set $\{1,\ldots,d+2\}$ and maximum degree exactly $d+1$ . Then a graph $G$ has degree at most $d$ if and only if no graph in $\mathscr{F}_{d}$ is isomorphic to an induced subgraph of $G$ . Less trivial examples include graphs of small vertex cover (attributed to Lovász [9]), of bounded tree-depth [5], and of bounded shrub-depth [13]. As a matter of fact, understanding forbidden induced subgraphs for those graph classes is an important question in structural graph theory [7, 21, 12, 11]. However, a straightforward adaptation of a result in [10] shows that it is in general impossible to compute the forbidden induced subgraphs from a description of classes of graphs by Turing machines.

It is folklore [1, 17] that characterization by finitely many forbidden induced finite subgraphs is equivalent to definability by a universal sentence of first-order logic (FO). But only very recently, it was realized [2] that such a characterization can be further understood by the Łoś-Tarski theorem. Łoś [15] and Tarski [19] proved the first so-called preservation theorem of classical model theory. In its simplest form it says that the class $\textsc{Graph}(\varphi)$ of finite and infinite graphs that are models of a sentence $\varphi$ of first-order logic is closed under induced subgraphs (or, that $\varphi$ is preserved under induced subgraphs) if and only if there is a universal FO-sentence $\mu$ with $\textsc{Graph}(\varphi)=\textsc{Graph}(\mu)$ . Recall that a universal sentence $\mu$ is a sentence of the form $\forall x_{1}\ldots\forall x_{k}\,\mu_{0}$ , where $\mu_{0}$ is quantifier-free.

Such a universal sentence $\mu=\forall x_{1}\ldots\forall x_{k}\,\mu_{0}$ expresses that certain patterns of induced subgraphs with at most $k$ vertices are forbidden. In fact, let $\mathscr{F}$ be a finite set of finite graphs and denote by $\textsc{Forb}(\mathscr{F})$ the class of (finite and infinite) graphs that do not contain an induced subgraph isomorphic to a graph in $\mathscr{F}$ . Then for a universal sentence $\mu$ as above we have

\textsc{Graph}(\mu)=\textsc{Forb}\big{(}\mathscr{F}_{k}(\mu)\big{)}.

(1)

Here for any FO-sentence $\varphi$ and $k\geq 1$ by $\mathscr{F}_{k}(\varphi)$ we denote the class of graphs that are models of $\neg\varphi$ and whose universe is $\{1,\ldots,\ell\}$ for some $\ell$ with $1\leq\ell\leq k$ . Clearly, $\mathscr{F}_{k}(\varphi)$ is finite.

We say that a class $\mathscr{C}$ of finite and infinite graphs is definable by a finite set of forbidden induced subgraphs if there is a finite set $\mathscr{F}$ of finite graphs such that $\mathscr{C}=\textsc{Forb}(\mathscr{F})$ . Hence the graph-theoretic version of the Łoś-Tarski Theorem can be restated in the form:

(I)	Let $\mathscr{C}$ be a class of finite and infinite graphs. The following are equivalent:
	(i) $\mathscr{C}$ is closed under induced subgraphs and FO-axiomatizable.
	(ii) $\mathscr{C}$ is axiomatizable by a universal sentence.
	(iii) $\mathscr{C}$ is definable by a finite set of forbidden induced subgraphs.

This version of the Łoś-Tarski Theorem is already contained, at least implicitly, in the article [20] of Vaught published in 1954. In addition, it is easy to see that the equivalence between (ii) and (iii) holds for any class of finite graphs too.

Note that we have repeatedly mentioned that in the Łoś-Tarski Theorem graphs are allowed to be infinite. This is not merely a technicality. In [2], to obtain the forbidden induced subgraph characterization of graphs of bounded shrub-depth using the Łoś-Tarski Theorem, one simple but vital step is to extend the notion of shrub-depth to infinite graphs. Indeed, Tait [18] exhibited a class $\mathscr{C}$ of finite structures (which might be understood as colored directed graphs) which is closed under induced substructures and FO-axiomatizable. Yet, $\mathscr{C}$ is not definable by any universal sentence, thus cannot be characterized by a finite set of forbidden induced substructures. As the first result of this paper, we strengthen Tait’s result to graphs.

Theorem 1.1.

There is a class $\mathscr{C}$ of finite graphs with the following properties.

(i)

$\mathscr{C}$ is closed under induced subgraphs and FO-axiomatizable,
(ii)

$\mathscr{C}$ is not definable by a finite set of forbidden induced subgraphs.

Even though we are interested in structural and algorithmic results for classes of finite graphs, we see that in order to apply the Łoś-Tarski Theorem for such purposes we have to consider classes of finite and infinite graphs. So in this paper “graph” means finite or infinite graph. As in the preceding result we mention it explicitly if we only consider finite graphs.

Complementing Theorem 1.1 we show that it is even undecidable whether a given FO-definable class of finite graphs which is closed under induced subgraphs can be characterized by a finite set of forbidden induced subgraphs. More precisely:

Theorem 1.2.

There is no algorithm that for any FO-sentence $\varphi$ such that

\textsc{Graph}_{\textup{fin}}(\varphi):=\big{\{}G\;\big{|}\;\text{$G$ is a finite graph and a model of $\varphi$}\big{\}}

is closed under induced subgraphs decides whether $\varphi$ is equivalent to a universal sentence on finite graphs.

As mentioned at the beginning, for a class of finite graphs definable by a finite set of forbidden induced subgraphs, it is preferable to have an explicit construction of those graphs. This however turns out to be difficult for many natural classes of graphs. For example, the forbidden induced subgraphs are only known for tree-depth at most $3$ [7]. Let us consider the $k$ -vertex cover problem for a constant $k\geq 1$ . It asks whether a given graph has a vertex cover (i.e., a set of vertices that contains at least one endpoint of every edge) of size at most $k$ . The class of all yes-instances of this problem, finite and infinite, is closed under induced subgraphs and FO-axiomatizable by the FO-sentence

\varphi^{k}_{\textup{VC}}:=\varphi_{\textsc{Graph}}\wedge\exists x_{1}\ldots\exists x_{k}\forall y\forall z\Big{(}Eyz\to\bigvee_{1\leq\ell\leq k}(x_{\ell}=y\vee x_{\ell}=z)\Big{)},

where $\varphi_{\textsc{Graph}}$ axiomatizes the class of graphs. Hence, by (I) the class of yes-instances can be defined by a finite set of forbidden induced subgraphs. As the reader will notice it is by no means trivial to find a universal sentence equivalent to $\varphi^{k}_{\textup{VC}}$ . But on the other hand, by the Completeness Theorem, we can search for such a universal sentence by enumerating all possible universal sentences $\mu$ and all possible proofs for $\vdash\varphi^{k}_{\textup{VC}}\leftrightarrow\mu$ , and then extract the corresponding forbidden induced subgraphs from $\mu$ as in (1).

To explain the hardness of constructing forbidden induced subgraphs, we prove two negative results.

Theorem 1.3.

There is no algorithm that for any FO-sentence $\varphi$ which is equivalent to a universal sentence $\mu$ on finite graphs computes such a $\mu$ .

Or equivalently, there is no algorithm that for any FO-sentence $\varphi$ such that

\textsc{Graph}_{\textup{fin}}(\varphi)=\textsc{Forb}_{\textup{fin}}(\mathscr{F})

for a finite set $\mathscr{F}$ of graphs computes such an $\mathscr{F}$ . Here,

\textsc{Forb}_{\textup{fin}}(\mathscr{F}):=\big{\{}G\;\big{|}\;\text{$G$ is a finite graph without induced subgraph isomorphic to a graph in $\mathscr{F}$}\big{\}}.

Theorem 1.4.

Let $f:\mathbb{N}\to\mathbb{N}$ be a computable function. Then there is a class $\mathscr{C}$ of finite graphs and an FO-sentence $\varphi$ such that

(i)

$\mathscr{C}=\textsc{Graph}_{\textup{fin}}(\varphi)$ .
(ii)

$\mathscr{C}=\textsc{Graph}_{\textup{fin}}(\mu)$ for some universal sentence $\mu$ , in particular $\mathscr{C}$ is closed under induced subgraphs.
(iii)

For every universal sentence $\mu$ with $\mathscr{C}=\textsc{Graph}_{\textup{fin}}(\mu)$ we have $|\mu|\geq f(|\varphi|)$ .

Theorem 1.3 significantly strengthens the aforementioned result of [10]. Even if a class $\mathscr{C}$ of finite graphs definable by a finite set of forbidden induced subgraphs is given by an FO-sentence $\varphi$ with $\mathscr{C}=\textsc{Graph}_{\textup{fin}}(\varphi)$ , instead of a much more powerful Turing machine, we still cannot compute an appropriate finite set of forbidden induced subgraphs for $\mathscr{C}$ from $\varphi$ . On top of it, Theorem 1.4 implies that the size of forbidden subgraphs for $\mathscr{C}$ cannot be bounded by any computable function in terms of the size of $\varphi$ .

There is an important precursor for Theorem 1.4,

Theorem 1.5 (Gurevich’s Theorem [14]).

Let $f:\mathbb{N}\to\mathbb{N}$ be computable. Then there is an FO-sentence $\varphi$ such that the class $\textsc{Mod}(\varphi)$ of models of $\varphi$ is closed under induced substructures but for every universal sentence $\mu$ with $\textsc{Mod}_{\textup{fin}}(\mu)=\textsc{Mod}_{\textup{fin}}(\varphi)$ we have $|\mu|\geq f(|\varphi|)$ .

Hence, Theorem 1.4 can be viewed as the graph-theoretic version of Theorem 1.5.

Besides its importance in graph theory, Theorem 1.4 is also relevant in the context of algorithmic model theory. For algorithmic applications, the Łoś-Tarski theorem provides a normal form (i.e., a universal sentence) for any FO-sentence preserved under induced substructures. In [3], it is shown that on labelled trees there is no elementary bound on the length of the equivalent universal sentence in terms of the original one. We should point out that Theorem 1.4 is not comparable to Theorem 6.1 in [3], since our lower bound is uncomputable (and thus, much higher than non-elementary) while the classes of graphs we construct in the proof are dense (thus very far from trees).

Our technical contributions.

For every vocabulary it is well known that the class of structures of this vocabulary is FO-interpretable in graphs (see for example [8]). So one might expect that Theorem 1.1 and Theorem 1.4 can be derived easily from Tait’s Theorem and Gurevich’s Theorem using the standard FO-interpretations. However, an easy analysis shows that those interpretations result in classes of graphs that are not closed under induced subgraphs. So we introduce the notion of strongly existential interpretation which translates any class of structures preserved under induced substructures to a class of graphs closed under induced subgraphs. A lot of care is needed to construct strongly existential interpretations.

Related research.

Let us briefly mention some further results related to the Łoś-Tarski Theorem. Essentially one could divide them into three categories: (a) The positive results showing that for certain classes $\mathscr{C}$ of finite structures the analogue of the Łoś-Tarski Theorem holds if we restrict to structures in $\mathscr{C}$ . For example, this is the case if $\mathscr{C}$ is the class of all finite structures of tree-width $k$ or less for some $k\in\mathbb{N}$ [1] or if $\mathscr{C}$ is the class of all finite structures whose hypergraph satisfies certain properties [6]. (b) Both just mentioned papers contain also negative results, i.e, classes for which the analogue of the Łoś-Tarski Theorem fails: For example, in [1] this is shown for the class of finite planar graphs. (c) The third category contains generalizations of the Łoś-Tarski Theorem. For example, in [17] the authors, for $k\geq 1$ consider sentences of the form $\exists x_{1}\ldots\exists x_{k}\mu$ , where $\mu$ is universal. Then the role of the closure under induced substructures is taken over by a semantic “core property PS( $k$ )” which for $k=0$ coincides with closure under induced substructures. Finally, we mention that in [4] the authors strengthen Tait’s result by showing that for every $n\geq 1$ there are first-order definable classes of finite structures closed under substructures which are not definable with $n$ quantifier alternations.

Organization of this paper.

In Section 2 we fix some notations and recall or derive some results about universal sentences we need in this paper. For the reader’s convenience, in Section 3 we include a proof of Tait’s result. Moreover, we prove a technical result, i.e., Proposition 3.11, which is an important tool in Gurevich’s Theorem. We introduce the concept of strongly existential interpretation in Section 4 and show that the results of the preceding section remain true under such interpretations. We present an appropriate strongly existential interpretation for graphs (in Section 5). Hence, we get the results of Section 3 for graphs. In Section 6 we first derive Gurevich’s Theorem and apply our interpretations to get the results for graphs. Finally, in Section 7, we prove that various problems related to our results are undecidable.

2 Preliminaries

We denote by $\mathbb{N}$ the set of natural numbers greater or equal to 0. For $n\in\mathbb{N}$ let $[n]:=\{1,2,\ldots,n\}$ .

First-order logic FO.

A vocabulary $\tau$ is a finite set of relation symbols. Each relation symbol has an arity. A structure $\mathcal{A}$ of vocabulary $\tau$ , or $\tau$ -structure, consists of a (finite or infinite) nonempty set $A$ , called the universe of $\mathcal{A}$ and of an interpretation $R^{\mathcal{A}}\subseteq A^{r}$ of each $r$ -ary relation symbol $R\in\tau$ . If $\mathcal{A}$ and $\mathcal{B}$ are $\tau$ -structures, then $\mathcal{A}$ is a substructure of $\mathcal{B}$ , denoted by $\mathcal{A}\subseteq\mathcal{B}$ , if $A\subseteq B$ and $R^{\mathcal{A}}\subseteq R^{\mathcal{B}}$ , and $\mathcal{A}$ is an induced substructure of $\mathcal{B}$ , denoted by $\mathcal{A}\subseteq_{\textup{ind}}\mathcal{B}$ , if $A\subseteq B$ and $R^{\mathcal{A}}=R^{\mathcal{B}}\cap A^{r}$ , where $r$ is the arity of $R$ . If, in addition, $A\subsetneq B$ , then $\mathcal{A}$ is an proper induced substructure of $\mathcal{B}$ . By $\textsc{Str}[\tau]$ ( $\textsc{Str}_{\textup{fin}}[\tau]$ ) we denote the class of all (of all finite) $\tau$ -structures.

Formulas $\varphi$ of first-order logic FO of vocabulary $\tau$ are built up from atomic formulas $x_{1}=x_{2}$ and $Rx_{1}\ldots x_{r}$ (where $R\in\tau$ is of arity $r$ and $x_{1},x_{2},\ldots,x_{r}$ are variables) using the boolean connectives $\neg$ , $\wedge$ , and $\vee$ and the universal $\forall$ and existential $\exists$ quantifiers. A relation symbol $R$ is positive (negative) in $\varphi$ if all atomic subformulas $R\ldots$ in $\varphi$ appear in the scope of an even (odd) number of negation symbols. By the notation $\varphi(\bar{x})$ with $\bar{x}=x_{1},\ldots,x_{e}$ we indicate that the variables free in $\varphi$ are among $x_{1},\ldots,x_{e}$ . If then $\mathcal{A}$ is a $\tau$ -structure and $a_{1},\ldots,a_{e}\in A$ , then $\mathcal{A}\models\varphi(a_{1},\ldots,a_{e})$ means that $\varphi(\bar{x})$ holds in $\mathcal{A}$ if $x_{i}$ is interpreted by $a_{i}$ for $i\in[k]$ .

A sentence is a formula without free variables. For a sentence $\varphi$ we denote by $\textsc{Mod}(\varphi)$ the class of models of $\varphi$ and $\textsc{Mod}_{\textup{fin}}(\varphi)$ is its subclass consisting of the finite models of $\varphi$ . Sentences $\varphi$ and $\psi$ are equivalent if $\textsc{Mod}(\varphi)=\textsc{Mod}(\psi)$ and finitely equivalent if $\textsc{Mod}_{\textup{fin}}(\varphi)=\textsc{Mod}_{\textup{fin}}(\psi)$ .

Graphs.

Let $\tau_{E}:=\{E\}$ with binary $E$ . For all $\tau_{E}$ -structures we use the notation $G=(V(G),E(G))$ common in graph theory. Here $V(G)$ , the universe of $G$ , is the set of vertices, and $E(G)$ , the interpretation of the relation symbol $E$ , is the set of edges. The $\tau_{E}$ -structure $G=(V(G),E(G))$ is a directed graph if $E(G)$ does not contain self-loops, i.e., $(v,v)\notin E(G)$ for any $v\in V(G)$ . If moreover $(u,v)\in E(G)$ implies $(v,u)\in E(G)$ for any pair $(u,v)$ , then $G$ is an (undirected) graph. The graph $H=(V(H),E(H))$ is an induced subgraph of $G$ if

\displaystyle V(H)\subseteq V(G)

and

\displaystyle E(H)=E(G)\cap\big{(}V(H)\times V(H)\big{)}.

We denote by Graph and $\textsc{Graph}_{\textup{fin}}$ the class of all graphs and the class of finite graphs, respectively. Furthermore, for an $\textup{FO}[\tau_{E}]$ -sentence $\varphi$ by $\textsc{Graph}(\varphi)$ (and $\textsc{Graph}_{\textup{fin}}(\varphi)$ ) we denote the class of graphs (and the class of finite graphs) that are models of $\varphi$ .

Universal sentences and forbidden induced substructures.

An FO-formula is universal if it is built up from atomic and negated atomic formulas by means of the connectives $\wedge$ and $\vee$ and the universal quantifier $\forall$ . Often we say that a formula, say, containing the connective $\to$ is universal if by replacing $\varphi\to\psi$ by $\neg\varphi\vee\psi$ (and “simple manipulations”) we get an equivalent universal sentence. Every universal sentence $\mu$ is equivalent to a sentence of the form $\forall x_{1}\ldots\forall x_{k}\,\mu_{0}$ for some $k\in\mathbb{N}$ and some quantifier-free $\mu_{0}$ and moreover the length $|\mu|$ of $\mu$ is at most $|\varphi|$ . If in the definition of universal formula we replace the universal quantifier by the existential one we get the definition of an existential formula.

One easily verifies that the class of models of a universal sentence is closed under induced substructures. As already mentioned in the Introduction for classes of graphs, Łoś [15] and Tarski [19] proved:

Theorem 2.1 (Łoś-Tarski Theorem).

Let $\tau$ be a vocabulary and $\varphi$ an $\textup{FO}[\tau]$ -sentence. Then $\textsc{Mod}(\varphi)$ is closed under induced substructures if and only if $\varphi$ is equivalent to a universal sentence.

We fix a vocabulary $\tau$ . Let $\mathscr{F}$ be a finite set of finite $\tau$ -structures and denote by $\textsc{Forb}(\mathscr{F})$ (and $\textsc{Forb}_{\textup{fin}}(\mathscr{F})$ ) the class of structures (of finite structures) that do not contain an induced substructure isomorphic to a structure in $\mathscr{F}$ . Clearly for finite sets $F$ and $F^{\prime}$ of finite $\tau$ -structures we have

\mathscr{F}\subseteq\mathscr{F}^{\prime}

, then

\textsc{Forb}(\mathscr{F}^{\prime})\subseteq\textsc{Forb}(\mathscr{F})

(2)

We say that a class $\mathscr{C}$ of $\tau$ -structures (of finite $\tau$ -structures) is definable by a finite set of forbidden induced substructures if there is a finite set $\mathscr{F}$ of finite structures such that $\mathscr{C}=\textsc{Forb}(\mathscr{F})$ ( $\mathscr{C}=\textsc{Forb}_{\textup{fin}}(\mathscr{F})$ ).

Recall that $\tau_{E}=\{E\}$ with binary $E$ .

\displaystyle\varphi_{\textup{DG}}:=\forall x\neg Exx

and

\displaystyle\varphi_{\textsc{Graph}}:=\forall x\neg Exx\wedge\forall x\forall y(Exy\to Eyx)

(3)

axiomatize the classes of directed graphs and of graphs, respectively. Let the $\tau_{E}$ -structures $H_{0}=(V(H_{0}),E(H_{0}))$ and $H_{1}=(V(H_{1}),E({H_{1}}))$ be given by

\displaystyle V(H_{0}):=\{1\},\ E(H_{0}):=\big{\{}(1,1)\big{\}}

and

\displaystyle V(H_{1}):=\{1,2\},\ E(H_{0}):=\big{\{}(1,2)\big{\}}.

Then $\textsc{Forb}\big{(}\{H_{0}\}\big{)}$ and $\textsc{Forb}\big{(}\{H_{0},H_{1}\}\big{)}$ are the class of directed graphs and the class of graphs, respectively, i.e., $\textsc{Mod}(\varphi_{\textup{DG}})=\textsc{Forb}\big{(}\{H_{0}\}\big{)}$ and $\textsc{Mod}(\varphi_{\textsc{Graph}})=\textsc{Forb}\big{(}\{H_{0},H_{1}\}\big{)}$ .

The following result generalizes this simple fact and establishes the equivalence between axiomatizability by a universal sentence and definability by a finite set of forbidden induced substructures. For an arbitrary vocabulary $\tau$ , an $\textup{FO}[\tau]$ -sentence $\varphi$ , and $k\geq 1$ let

\mathscr{F}_{k}(\varphi):=\big{\{}\mathcal{A}\in\textsc{Str}[\tau]\;\big{|}\;\mathcal{A}\models\neg\varphi\text{\ and $A=[\ell]$ for some $\ell\in[k]$}\big{\}}.

(4)

Thus, $\mathscr{F}_{k}(\varphi)$ is, up to isomorphism, the class of structures with at most $k$ elements which fail to be a model of $\varphi$ . Note that $\mathscr{F}_{1}(\varphi_{\textup{DG}})=\{H_{0}\}$ and $\mathscr{F}_{1}(\varphi_{\textsc{Graph}})=\{H_{0},H_{1}\}$ . Clearly, for a $\tau$ -sentence we have:

	if $\textsc{Mod}(\varphi)$ is closed under indu	$\displaystyle\text{ced substructures},$
		$\displaystyle\text{then $\textsc{Mod}(\varphi)\subseteq\textsc{Forb}(\mathscr{F}_{k}(\varphi))$ for all $k\geq 1$}.$		(5)

Proposition 2.2.

For a class $\mathscr{C}$ of $\tau$ -structures and $k\geq 1$ the statements (i) and (ii) are equivalent.

(i)

$\mathscr{C}=\textsc{Mod}(\mu)$ for some universal sentence $\mu:=\forall x_{1}\ldots\forall x_{k}\,\mu_{0}$ with quantifier-free $\mu_{0}$ .
(ii)

$\mathscr{C}=\textsc{Forb}(\mathscr{F})$ for some finite set $\mathscr{F}$ of structures, all of at most $k$ elements.

If (i) holds for $\mu$ , then $\mathscr{C}=\textsc{Forb}(\mathscr{F}_{k}(\mu))$ .

Proof : (i) $\Rightarrow$ (ii) Let $\mathscr{C}=\textsc{Mod}(\mu)$ for $\mu$ as in (i). Then $\textsc{Mod}(\mu)$ is closed under induced substructures and hence, $\mathscr{C}\subseteq\textsc{Forb}\big{(}\mathscr{F}_{k}(\mu)\big{)}$ by (5).

Now assume that $\mathcal{A}\notin\mathscr{C}$ . Then $\mathcal{A}\models\neg\mu$ and hence there are $a_{1},\ldots,a_{k}\in A$ with $\mathcal{A}\models\neg\mu_{0}(a_{1},\ldots,a_{k})$ . For $\mathcal{B}:=[a_{1},\ldots,a_{k}]^{\mathcal{A}}$ , the substructure of $\mathcal{A}$ induced by $a_{1},\ldots,a_{k}$ , we have $\mathcal{B}\models\neg\mu_{0}(a_{1},\ldots,a_{k})$ (as $\mu_{0}$ is quantifier-free) and thus, $\mathcal{B}\models\neg\mu$ . Therefore, $\mathcal{B}$ is isomorphic to a structure in $\mathscr{F}_{k}(\mu)$ and therefore, $\mathcal{A}\notin\textsc{Forb}\big{(}\mathscr{F}_{k}(\mu)\big{)}$ .

(ii) $\Rightarrow$ (i) Let the $\tau$ -structure $\mathcal{A}$ have at most $k$ elements and let $a_{1},\ldots,a_{k}$ be an enumeration of the elements of $A$ (possibly with repetitions). Let $\delta(\mathcal{A};a_{1},\ldots,a_{k})$ be the conjunction of all literals (i.e., atomic or negated atomic formulas) $\lambda(x_{1},\ldots,x_{k})$ such that $\mathcal{A}\models\lambda(a_{1},\ldots,a_{k})$ . Then for every $\tau$ -structure $\mathcal{B}$ and $b_{1},\ldots,b_{k}\in B$ we have

	$\displaystyle\mathcal{B}\models\delta(\mathcal{A};a_{1},\ldots,a_{k})(b_{1},\ldots,b_{k})\iff$	the clauses $\pi(a_{i})=b_{i}$ for $i\in[k]$
		define an isomorphism from $\mathcal{A}$ onto $[b_{1},\ldots,b_{k}]^{\mathcal{B}}$ .		(6)

Now assume (ii), i.e., $\mathscr{C}=\textsc{Forb}(\mathscr{F})$ for some finite set $\mathscr{F}$ of structures, all of at most $k$ elements. If $\mathscr{F}$ is empty, then $\mathscr{C}=\textsc{Mod}(\forall x\,x=x)$ . Otherwise for every $\mathcal{A}\in\mathscr{F}$ we fix an enumeration $a^{\mathcal{A}}_{1},\ldots,a^{\mathcal{A}}_{k}$ of the elements of $A$ . We set

\mu:=\forall x_{1}\ldots\forall x_{k}\bigwedge_{\mathcal{A}\in\mathscr{F}}\neg\delta(\mathcal{A};a^{\mathcal{A}}_{1},\ldots,a^{\mathcal{A}}_{k}).

Then $\textsc{Forb}(\mathscr{F})=\textsc{Mod}(\mu)$ . In fact, assume first that $\mathcal{B}\notin\textsc{Mod}(\mu)$ . Then there are $b_{1},\ldots,b_{k}\in B$ and an $\mathcal{A}\in\mathscr{F}$ such that $\mathcal{B}\models\delta(\mathcal{A};a^{\mathcal{A}}_{1},\ldots,a^{\mathcal{A}}_{k})(b_{1},\ldots,b_{k})$ . By (6), then $\mathcal{A}$ is isomorphic to the induced substructure $[b_{1},\ldots,b_{k}]^{\mathcal{B}}$ of $\mathcal{B}$ ; hence, $\mathcal{B}\notin\textsc{Forb}(\mathscr{F})$ .

Now assume $\mathcal{B}\notin\textsc{Forb}(\mathscr{F})$ . Then there is an $\mathcal{A}\in\mathscr{F}$ and elements $b_{1},\ldots,b_{k}\in B$ such that the clauses $\pi(a^{\mathcal{A}}_{i})=b_{i}$ for $i\in[k]$ define an isomorphism from $\mathcal{A}$ onto $[b_{1},\ldots,b_{k}]^{\mathcal{B}}$ . By (6), then $\mathcal{B}\models\delta(\mathcal{A};a^{\mathcal{A}}_{1},\ldots,a^{\mathcal{A}}_{k})(b_{1},\ldots,b_{k})$ . Therefore, $\mathcal{B}\models\neg\mu$ , i.e., $\mathcal{B}\notin\textsc{Mod}(\mu)$ . $\Box$

Corollary 2.3.

Let $\varphi$ be a $\tau$ -sentence and $k\geq 1$ . Then

	$\displaystyle\textsc{Mod}(\varphi)=\textsc{Forb}\big{(}\mathscr{F}_{k}(\varphi)\big{)}$	$\displaystyle\iff$	$\varphi$ is equivalent to a universal sentence
			of the form $\forall x_{1}\ldots\forall x_{k}\,\mu_{0}$ with quantifier-free $\mu_{0}$ .

By (2) and (5) we get:

Corollary 2.4.

If $\textsc{Mod}(\mu)=\textsc{Forb}\big{(}\mathscr{F}_{k}(\mu)\big{)}$ for some universal $\mu$ and some $k\in\mathbb{N}$ , then $\textsc{Mod}(\mu)=\textsc{Forb}\big{(}\mathscr{F}_{\ell}(\mu)\big{)}$ for all $\ell\geq k$ .

Corollary 2.5.

It is decidable whether two universal sentences are equivalent.

Proof : Let $\mu$ and $\mu^{\prime}$ be universal sentences. W.l.o.g. we may assume that $\mu=\forall x_{1}\ldots\forall x_{k}\,\mu_{0}$ and $\mu^{\prime}=\forall x_{1}\ldots\forall x_{\ell}\,\mu^{\prime}_{0}$ with $k\leq\ell$ . By Corollary 2.3 and Corollary 2.4, we have

\displaystyle\textsc{Mod}(\mu)=\textsc{Forb}\big{(}\mathscr{F}_{\ell}(\mu)\big{)}

and

\displaystyle\textsc{Mod}(\mu^{\prime})=\textsc{Forb}\big{(}\mathscr{F}_{\ell}(\mu^{\prime})\big{)}.

Thus $\mu$ and $\mu^{\prime}$ are equivalent if and only if $\mathscr{F}_{\ell}(\mu)=\mathscr{F}_{\ell}(\mu^{\prime})$ . The right hand side of this equivalence is clearly decidable. $\Box$

The last equivalence of this corollary shows:

Corollary 2.6.

For universal sentences $\mu$ and $\mu^{\prime}$ we have

\text{$\mu$ and $\mu^{\prime}$ are equivalent}\iff\text{$\mu$ and $\mu^{\prime}$ are finitely equivalent.}

The following consequence of Corollary 2.2 will be used in the next section.

Corollary 2.7.

Let $m,k\in\mathbb{N}$ with $m>k$ and let $\psi_{0}$ and $\psi_{1}$ be $\textup{FO}[\tau]$ -sentences. Assume that $\mathcal{A}$ is a finite model of $\psi_{0}\wedge\psi_{1}$ with at least $m$ elements and all its proper induced substructures with at most $k$ elements are models of $\psi_{0}\wedge\neg\psi_{1}$ . Then $\psi_{0}\wedge\neg\psi_{1}$ is not finitely equivalent to a universal sentence of the form $\mu:=\forall x_{1}\ldots\forall x_{k}\,\mu_{0}$ with quantifier-free $\mu_{0}$ .

Proof : For a contradiction assume $\textsc{Mod}_{\textup{fin}}(\psi_{0}\wedge\neg\psi_{1})=\textsc{Mod}_{\textup{fin}}(\mu)$ for $\mu$ as above. As $\textsc{Mod}(\mu)=\textsc{Forb}\big{(}\mathscr{F}_{k}(\mu)\big{)}$ by Proposition 2.2, we get (applying the finitely equivalence of $\psi_{0}\wedge\neg\psi_{1}$ and $\mu$ to obtain the last equality)

\textsc{Mod}_{\textup{fin}}(\psi_{0}\wedge\neg\psi_{1})=\textsc{Mod}_{\textup{fin}}(\mu)=\textsc{Forb}_{\textup{fin}}\big{(}\mathscr{F}_{k}(\mu)\big{)}=\textsc{Forb}_{\textup{fin}}\big{(}\mathscr{F}_{k}(\psi_{0}\wedge\neg\psi_{1})\big{)}.

However, by the assumptions the structure $\mathcal{A}$ is contained in $\textsc{Mod}_{\textup{fin}}(\psi_{0}\wedge\neg\psi_{1})$ but not in the class $\textsc{Forb}_{\textup{fin}}(\mathscr{F}_{k}(\psi_{0}\wedge\neg\psi_{1}))$ . $\Box$

Remark 2.8.

Let $\mathscr{C}$ be a class of $\tau$ -structures closed under induced substructures. For an $\textup{FO}[\tau]$ -sentence $\varphi$ we set $\textsc{Mod}_{\mathscr{C}}(\varphi):=\{\mathcal{A}\in\mathscr{C}\mid\mathcal{A}\models\varphi\}$ . We say that the Łoś-Tarski Theorem holds for $\mathscr{C}$ if for every $\textup{FO}[\tau]$ -sentence $\varphi$ such that the class $\textsc{Mod}_{\mathscr{C}}(\varphi)$ is closed under induced substructures there is a universal sentence $\mu$ such that

\textsc{Mod}_{\mathscr{C}}(\varphi)=\textsc{Mod}_{\mathscr{C}}(\mu).

The following holds:

Let $\mathscr{C}$ and $\mathscr{C}^{\prime}$ be classes of $\tau$ -structures closed under induced substructures with $\mathscr{C}^{\prime}\subseteq\mathscr{C}$ . Furthermore assume that there is a universal sentence $\mu_{0}$ such that $\mathscr{C}^{\prime}=\textsc{Mod}_{\mathscr{C}}(\mu_{0})$ . If the analogue of the Łoś-Tarski Theorem holds for $\mathscr{C}$ , then it holds for $\mathscr{C}^{\prime}$ , too

In fact, for every $\textup{FO}[\tau]$ -sentence $\varphi$ we have $\textsc{Mod}_{\mathscr{C}^{\prime}}(\varphi)=\textsc{Mod}_{\mathscr{C}}(\mu_{0}\wedge\varphi)$ . Hence, if $\textsc{Mod}_{\mathscr{C}^{\prime}}(\varphi)$ is closed under induced substructures, then by assumption there is a universal $\mu$ such that $\textsc{Mod}_{\mathscr{C}}(\mu_{0}\wedge\varphi)=\textsc{Mod}_{\mathscr{C}}(\mu)$ . Therefore, $\textsc{Mod}_{\mathscr{C}^{\prime}}(\varphi)=\textsc{Mod}_{\mathscr{C}}(\mu)=\textsc{Mod}_{\mathscr{C}^{\prime}}(\mu)$ .

3 Basic ideas underlying the classical results

This section contains a proof of Tait’s Theorem telling us that the analogue of the Łoś-Tarski-Theorem fails if we only consider finite structures. Afterwards we refine the argument to derive a generalization, namely Proposition 3.11, which is a key result to get Gurevich’s Theorem.

We consider the vocabulary $\tau_{0}:=\{<,U_{\textup{min}},U_{\textup{max}},S\}$ , where $<$ and $S$ (the successor relation) are binary relation symbols and $U_{\textup{min}}$ and $U_{\textup{max}}$ are unary.

Let $\varphi_{0}$ be the conjunction of the universal sentences

–

$\forall x\neg x<x$ , $\forall x\forall y(x<y\vee x=y\vee y<x)$ , $\forall x\forall y\forall z((x<y\wedge y<z)\to x<z)$ , i.e., “ $<$ is an ordering”
–

$\forall x\forall y\big{(}(U_{\textup{min}}\,x\to(x=y\vee x<y)\big{)}$ i.e., “every element in $U_{\textup{min}}$ is a minimum w.r.t. $<$ ”
–

$\forall x\forall y\big{(}(U_{\textup{max}}\,x\to(x=y\vee y<x)\big{)}$ i.e., “every element in $U_{\textup{max}}$ is a maximum w.r.t. $<$ ”
–

$\forall xy(Sxy\to x<y)$
–

$\forall x\forall y\forall z(x<y<z\to\neg Sxz)$ .

Note that from the axioms it follows that there is at most one element in $U_{\textup{min}}$ , at most one in $U_{\textup{max}}$ , and that $S$ is a subset of the successor relation w.r.t. $<$ . We call $\tau_{0}$ -orderings the models of $\varphi_{0}$ .

For $\tau_{0}$ -structures $\mathcal{A}$ and $\mathcal{B}$ we write $\mathcal{B}\subseteq_{<}\mathcal{A}$ and say that $\mathcal{B}$ is a $<$ -substructure of $\mathcal{A}$ if $\mathcal{A}$ is a substructure of $\mathcal{B}$ with $<^{\mathcal{B}}=<^{\mathcal{A}}\cap\,(B\times B)$ .

We remark that the relation symbols $U_{\textup{min}},\ U_{\textup{max}}$ , and $S$ are negative in $\varphi_{0}$ . Therefore we have:

Lemma 3.1.

Let $\mathcal{B}\subseteq_{<}\mathcal{A}$ . If $\mathcal{A}\models\varphi_{0}$ , then $\mathcal{B}\models\varphi_{0}$ .

Let

\varphi_{1}:=\exists x\,U_{\textup{min}}\,x\wedge\exists xU_{\textup{max}}\,x\wedge\forall x\forall y(x<y\to\exists zSxz).

(7)

We call models of $\varphi_{0}\wedge\varphi_{1}$ complete $\tau_{0}$ -orderings. Clearly, for every $k\geq 1$ there is a unique, up to isomorphism, complete $\tau_{0}$ -ordering with exactly $k$ elements. The next lemma shows that all its proper $<$ -substructures are models of $\varphi_{0}\wedge\neg\varphi_{1}$ .

Lemma 3.2.

Let $\mathcal{A}$ and $\mathcal{B}$ be $\tau_{0}$ -structures. Assume that $\mathcal{A}\models\varphi_{0}$ and $\mathcal{B}$ is a finite $<$ -substructure of $\mathcal{A}$ that is a model of $\varphi_{1}$ . Then $\mathcal{B}=\mathcal{A}$ (in particular, $\mathcal{A}\models\varphi_{1}$ ).

Proof : By the previous lemma we know that $\mathcal{B}\models\varphi_{0}$ . Let $B:=\{b_{1},\ldots,b_{n}\}$ . As $<^{\mathcal{B}}$ is an ordering, we may assume that

b_{1}<^{\mathcal{B}}b_{2}<^{\mathcal{B}}\ldots<^{\mathcal{B}}b_{n-1}<^{\mathcal{B}}b_{n}.

As $\mathcal{B}\models(\varphi_{0}\wedge\varphi_{1})$ , we have $U_{\textup{min}}^{\mathcal{B}}b_{1}$ , $U_{\textup{max}}^{\mathcal{B}}b_{n}$ , and $S^{\mathcal{B}}b_{i}b_{i+1}$ for $i\in[n-1]$ . As $\mathcal{B}\subseteq\mathcal{A}$ , everywhere we can replace the upper index ^B by ^A.

We show $A=B$ : Let $a\in A$ . By $\mathcal{A}\models\varphi_{0}$ , we have $b_{1}\leq^{\mathcal{A}}a\leq^{\mathcal{A}}b_{n}$ . Let $i\in[n]$ be maximal with $b_{i}\leq^{\mathcal{A}}a$ . If $i=n$ , then $b_{n}=a$ . Otherwise $b_{i}\leq^{\mathcal{A}}a<^{\mathcal{A}}b_{i+1}$ . As $S^{\mathcal{A}}b_{i}b_{i+1}$ , we see that $b_{i}=a$ (by the last conjunct of $\varphi_{0}$ ). Now $\mathcal{A}=\mathcal{B}$ follows from $\mathcal{A}\models\varphi_{0}$ . $\Box$

Corollary 3.3.

Every proper $<$ -substructure of a finite model of $\varphi_{0}\wedge\varphi_{1}$ is a model of $\varphi_{0}\wedge\neg\varphi_{1}$ .

The class of finite $\tau_{0}$ -orderings that are not complete is closed under $<$ -substructures but not axiomatizable by a universal sentence:

Theorem 3.4 (Tait’s Theorem).

The class $\textsc{Mod}_{\textup{fin}}(\varphi_{0}\wedge\neg\varphi_{1})$ is closed under $<$ -substructures (and hence, closed under induced substructures) but $\varphi_{0}\wedge\neg\varphi_{1}$ is not finitely equivalent to a universal sentence.

Proof : $\textsc{Mod}_{\textup{fin}}(\varphi_{0}\wedge\neg\varphi_{1})$ is closed under $<$ -substructures: If $\mathcal{A}\models\varphi_{0}\wedge\neg\varphi_{1}$ and $\mathcal{B}$ is a finite $<$ -substructure of $\mathcal{A}$ , then $\mathcal{B}\models\varphi_{0}$ (by Lemma 3.1). If $\mathcal{B}\models\neg\varphi_{1}$ , we are done. If $\mathcal{B}\models\varphi_{1}$ , then $\mathcal{A}\models\varphi_{1}$ by Lemma 3.2, which contradicts our assumption $\mathcal{A}\models\neg\varphi_{1}$ .

Let $k\in\mathbb{N}$ . It is clear that there is a finite model $\mathcal{A}$ of $\varphi_{0}\wedge\varphi_{1}$ with at least $k+1$ elements. By Corollary 3.3 every proper induced substructure of $\mathcal{A}$ is a model of $\varphi_{0}\wedge\neg\varphi_{1}$ . Therefore, by Corollary 2.7, the sentence $\varphi_{0}\wedge\neg\varphi_{1}$ is not finitely equivalent to a universal sentence of the form $\mu:=\forall x_{1}\ldots\forall x_{k}\,\mu_{0}$ with quantifier-free $\mu_{0}$ . As $k$ was arbitrary, we get our claim. $\Box$

Remark 3.5.

A slight generalization of the previous proof shows that $\textsc{Mod}_{\textup{fin}}(\varphi_{0}\wedge\neg\varphi_{1})$ is not even axiomatizable by a $\Pi_{2}$ -sentence, i.e., by a sentence $\chi$ of the form $\forall x_{1}\ldots\forall x_{k}\exists y_{1}\ldots\exists y_{\ell}\,\chi_{0}$ for some $k,\ell\geq 1$ and quantifier-free $\chi_{0}$ . In fact, assume that $\textsc{Mod}_{\textup{fin}}(\varphi_{0}\wedge\neg\varphi_{1})=\textsc{Mod}_{\textup{fin}}(\chi)$ . Again we choose a finite model $\mathcal{A}$ of $\varphi_{0}\wedge\varphi_{1}$ with at least $k+1$ elements. Then $\mathcal{A}\not\models\chi$ . Hence there are $a_{1},\ldots,a_{k}\in A$ with $\mathcal{A}\models\neg\exists y_{1}\ldots\exists y_{\ell}\,\chi_{0}(a_{1},\ldots,a_{k})$ . Then $\mathcal{B}\models\neg\exists y_{1}\ldots\exists y_{\ell}\,\chi_{0}(a_{1},\ldots,a_{k})$ , where $\mathcal{B}:=[a_{1},\ldots,a_{k}]^{\mathcal{A}}$ is the substructure of $\mathcal{A}$ induced by $a_{1},\ldots,a_{k}$ . Hence, $\mathcal{B}\not\models\chi$ and therefore, $\mathcal{B}\not\models\varphi_{0}\wedge\neg\varphi_{1}$ . But this contradicts Corollary 3.3 as $\mathcal{B}$ is a proper induced substructure of $\mathcal{A}$ .

Note that $\varphi_{0}\wedge\neg\varphi_{1}$ is (equivalent to) a $\Sigma_{2}$ -sentence, i.e., equivalent to the negation of a $\Pi_{2}$ -sentence.

We turn to a refinement of the previous statement that will be helpful to get Gurevich’s Theorem.

Definition 3.6.

(a)

Let $\tau$ be obtained from the vocabulary $\tau_{0}$ by adding finitely many relation symbols “in pairs,” the standard $R$ together with its complement $R^{\textup{comp}}$ (intended as the complement of $R$ ). The symbols $R$ and $R^{\textup{comp}}$ have the same arity and for our purposes we can restrict ourselves to unary or binary relation symbols (even though all results can be generalized to arbitrary arities). We briefly say that $\tau$ is obtained from $\tau_{0}$ by adding pairs.
(b)
Let $\tau$ be obtained from $\tau_{0}$ by adding pairs. We say that $\varphi_{0\tau}$ is a $\tau$ -extension of $\varphi_{0}$ (where $\varphi_{0}$ is as above) if it is a universal sentence such that
- (i)
  
  the sentence $\varphi_{0}$ is a conjunct of $\varphi_{0\tau}$ ,
- (ii)
  
  the sentence $\bigwedge_{R\textup{ standard}}\forall\bar{x}(\neg R\bar{x}\vee\neg R^{\textup{comp}}\bar{x})$ is a conjunct of $\varphi_{0\tau}$ ,
- (iii)
  
  besides $<$ all relation symbols are negative in $\varphi_{0\tau}$ (if this is not the case for some new $R$ or $R^{\textup{comp}}$ , the idea is to replace any positive occurrence of $R$ or $R^{\textup{comp}}$ by $\neg R^{\textup{comp}}$ and $\neg R$ , respectively). For instance, we replace a subformula
  
  $\displaystyle x<y\wedge Rxy\ \$ by $\displaystyle\ \ x<y\wedge\neg R^{\textup{comp}}xy.$
(c)

Let $\tau$ be obtained from $\tau_{0}$ by adding pairs. Then we set

$\varphi_{1\tau}:=\varphi_{1}\wedge\bigwedge_{R\textup{ standard}}\forall\bar{x}(R\bar{x}\vee R^{\textup{comp}}\bar{x}),$ (8)

where $\varphi_{1}$ is as above (see (7)).

For a $\tau$ -structure $\mathcal{B}$ with $\mathcal{B}\models\varphi_{0\tau}\wedge\varphi_{1\tau}$ we have

\mathcal{B}\models\bigwedge_{R\textup{ standard}}\Big{(}\forall\bar{x}(\neg R\bar{x}\vee\neg R^{\textup{comp}}\bar{x})\wedge\forall\bar{x}(R\bar{x}\vee R^{\textup{comp}}\bar{x})\Big{)}.

Hence,

\text{if $\mathcal{B}\models\varphi_{0\tau}\wedge\varphi_{1\tau}$, then $(R^{\textup{comp}})^{\mathcal{B}}$ is the complement of $R^{\mathcal{B}}$ for standard $R\in\tau$}.

(9)

Now we derive the analogues of Lemma 3.1–Theorem 3.4 essentially by the same proofs.

Lemma 3.7.

Let $\tau$ be obtained from $\tau_{0}$ by adding pairs and let $\varphi_{0\tau}$ be an extension of $\varphi_{0}$ . If $\mathcal{B}\subseteq_{<}\mathcal{A}$ and $\mathcal{A}\models\varphi_{0\tau}$ , then $\mathcal{B}\models\varphi_{0\tau}$ .

Proof : By Definition 3.6 (b) (iii) all relation symbols distinct from $<$ are negative in $\varphi_{0\tau}$ . $\Box$

Lemma 3.8.

Let $\tau$ be obtained from $\tau_{0}$ by adding pairs and let $\varphi_{0\tau}$ be an extension of $\varphi_{0}$ . Assume that $\mathcal{A}\models\varphi_{0\tau}$ and that the finite $<$ -substructure $\mathcal{B}$ of $\mathcal{A}$ is a model of $\varphi_{1\tau}$ . Then $\mathcal{B}=\mathcal{A}$ (in particular, $\mathcal{A}\models\varphi_{1\tau}$ ).

Proof : Let $\mathcal{A}\upharpoonright\tau_{0}$ (and $\mathcal{B}\upharpoonright\tau_{0}$ ) be the $\tau_{0}$ -structure obtained from $\mathcal{A}$ (from $\mathcal{B}$ ) by removing all relations in $\tau\setminus\tau_{0}$ .

By Lemma 3.2 we know that $\mathcal{B}\upharpoonright\tau_{0}=\mathcal{A}\upharpoonright\tau_{0}$ . Furthermore, $\mathcal{B}\models\varphi_{0\tau}$ by the previous lemma; thus, $\mathcal{B}\models\varphi_{0\tau}\wedge\varphi_{1\tau}$ . Hence, by (9), $(R^{\textup{comp}})^{\mathcal{B}}$ is the complement of $R^{\mathcal{B}}$ for standard $R$ . Clearly, $R^{\mathcal{B}}\subseteq R^{\mathcal{A}}$ and $(R^{\textup{comp}})^{\mathcal{B}}\subseteq(R^{\textup{comp}})^{\mathcal{A}}$ . As $A=B$ and $\mathcal{A}$ is a model of the sentence $\bigwedge_{R\textup{ standard}}\forall\bar{x}(\neg R\bar{x}\vee\neg R^{\textup{comp}}\bar{x})$ , we get $R^{\mathcal{B}}=R^{\mathcal{A}}$ and $(R^{\textup{comp}})^{\mathcal{B}}=(R^{\textup{comp}})^{\mathcal{A}}$ . $\Box$

Corollary 3.9.

Every proper $<$ -substructure of a finite model of $\varphi_{0\tau}\wedge\varphi_{1\tau}$ is a model of $\varphi_{0\tau}\wedge\neg\varphi_{1\tau}$ .

By replacing in the proof of Tait’s Theorem the use of Lemma 3.1, Lemma 3.2, and Corollary 3.3 by Lemma 3.7, Lemma 3.8, and Corollary 3.9 respectively, we get:

Lemma 3.10.

Let $\tau$ be obtained from $\tau_{0}$ by adding pairs and let $\varphi_{0\tau}$ be an extension of $\varphi_{0}$ . The class $\textsc{Mod}_{\textup{fin}}(\varphi_{0\tau}\wedge\neg\varphi_{1\tau})$ is closed under $<$ -substructures (and hence, closed under induced substructures) but $\varphi_{0\tau}\wedge\neg\varphi_{1\tau}$ is not finitely equivalent to a universal sentence.

Perhaps the reader will ask why we do not introduce for $<$ the “complement relation symbol” $<^{\textup{comp}}$ and add the corresponding conjuncts to $\varphi_{0\tau}$ and $\varphi_{1\tau}$ (or, to $\varphi_{0}$ and $\varphi_{1}$ ) in order to get a result of the type of Lemma 3.8 (or already of the type of Lemma 3.2) where we can replace “ $<$ -substructure” by “substructure.” The reader will realize that corresponding proofs of $B=A$ break down.

The next proposition provides a uniform way to construct FO-sentences that are only equivalent to universal sentences of large size, which is the core of the proof of Gurevich’s Theorem.

Proposition 3.11.

Again let $\tau$ be obtained from $\tau_{0}$ by adding pairs and $\varphi_{0\tau}$ be an extension of $\varphi_{0}$ . Let $m\geq 1$ and $\gamma$ be an $\textup{FO}[\tau]$ -sentence such that

\text{$\varphi_{0\tau}\wedge\varphi_{1\tau}\wedge\gamma$ has no infinite model but a finite model with at least $m$ elements}.

(10)

For

\chi:=\varphi_{0\tau}\wedge(\varphi_{1\tau}\to\neg\gamma)

the statements (a) and (b) hold.

(a)

The class $\textsc{Mod}(\chi)$ is closed under $<$ -substructures.
(b)

If $\mu:=\forall x_{1}\ldots\forall x_{k}\,\mu_{0}$ with quantifier-free $\mu_{0}$ is finitely equivalent to $\chi$ , then $k\geq m$ .

Proof : (a) Let $\mathcal{A}\models\chi$ and $\mathcal{B}\subseteq_{<}\mathcal{A}$ . Thus, $\mathcal{B}\models\varphi_{0\tau}$ . If $\mathcal{B}\not\models\varphi_{1\tau}$ , we are done. Assume $\mathcal{B}\models\varphi_{1\tau}$ . In case $B$ is infinite, we conclude by (10) that $\mathcal{B}$ is a model of $\neg\gamma$ and hence of $\chi$ . Otherwise $B$ is finite; then $\mathcal{B}=\mathcal{A}$ (by Lemma 3.8) and thus, $\mathcal{B}\models\chi$ .

(b) According to (10) there is a finite model $\mathcal{A}$ of $\varphi_{0\tau}\wedge\varphi_{1\tau}\wedge\gamma$ , i.e., of $\varphi_{0\tau}\wedge\neg(\varphi_{1\tau}\to\neg\gamma)$ , with at least $m$ elements. By Corollary 3.9 every proper induced substructure of $\mathcal{A}$ is not a model of $\varphi_{1\tau}$ and therefore, it is a model of $\varphi_{0\tau}\wedge(\varphi_{1\tau}\to\neg\gamma)$ . Hence by Corollary 2.7, $\varphi_{0\tau}\wedge(\varphi_{1\tau}\to\neg\gamma)$ is not finitely equivalent to a universal sentence of the form $\mu:=\forall x_{1}\ldots\forall x_{k}\,\mu_{0}$ with $k<m$ and quantifier-free $\mu_{0}$ . $\Box$

Remark 3.12.

We can strengthen the statement (b) of the preceding proposition to:

If the $\Pi_{2}$ -sentence $\forall x_{1}\ldots\forall x_{k}\exists y_{1}\ldots\exists y_{\ell}\;\chi_{0}$ with quantifier-free $\chi_{0}$ is finitely equivalent to $\chi$ , then $k\geq m$ .

The proof is similar to that of the result in Remark 3.5 and is left to the reader.

4 The general machinery: strongly existential interpretations

We show that appropriate interpretations preserve the validity of Tait’s theorem and of the statement of Proposition 3.11. Later on these interpretations will allow us to get versions of the results for graphs.

Let $\tau_{E}:=\{E\}$ with binary $E$ . As already remarked in the Preliminaries for all $\tau_{E}$ -structures we use the notation $G=(V(G),E(G))$ common in graph theory.

Let $\tau$ be obtained from $\tau_{0}$ by adding pairs. Furthermore, let $I$ be an interpretation of width $2$ (we only need this case) of $\tau$ -structures in $\tau_{E}$ -structures. This means that $I$ assigns to every unary relation symbol $T\in\tau$ an $\textup{FO}[\tau_{E}]$ -formula $\varphi_{T}(x_{1},x_{2})$ and to every binary relation symbol $T\in\tau$ an $\textup{FO}[\tau_{E}]$ -formula $\varphi_{T}(x_{1},x_{2},y_{1},y_{2})$ ; moreover, $I$ selects an $\textup{FO}[\tau_{E}]$ -formula $\varphi_{\textup{uni}}(x_{1},x_{2})$ .

Then $I$ assigns to every $\tau_{E}$ -structure $G$ with $G\models\exists\bar{x}\varphi_{\textup{uni}}(\bar{x})$ a $\tau$ -structure $G_{I}$ , which we often denote by $\mathcal{O}_{I}(G)$ , defined by

–

$O_{I}(G):=\big{\{}\bar{a}\in V(G)\times V(G)\;\big{|}\;G\models\varphi_{\textup{uni}}(\bar{a})\big{\}}$
–

$T^{O_{I}(G)}:=\big{\{}\bar{a}\in O_{I}(G)\;\big{|}\;G\models\varphi_{T}(\bar{a})\big{\}}$ for unary $T\in\tau$
–

$T^{O_{I}(G)}:=\big{\{}(\bar{a},\bar{b})\in O_{I}(G)\times O_{I}(G)\;\big{|}\;G\models\varphi_{T}(\bar{a},\bar{b})\big{\}}$ for binary $T\in\tau$ .

As the interpretation $I$ is of width $2$ , we have

|O_{I}(G)|\leq|V(G)|^{2}.

(11)

Recall that for every sentence $\varphi\in\textup{FO}[\tau]$ there is a sentence $\varphi^{I}\in\textup{FO}[\tau_{E}]$ such that for all $\tau_{E}$ -structures $G$ with $G\models\exists\bar{x}\varphi_{\textup{uni}}(\bar{x})$ we have

\left(G_{I}=\right)\;\mathcal{O}_{I}(G)\models\varphi\iff G\models\varphi^{I}.

(12)

For example, for the sentence $\varphi=\forall x\forall y\,Txy$ we have

\varphi^{I}=\forall\bar{x}\Big{(}\varphi_{\textup{uni}}(\bar{x})\to\forall\bar{y}\big{(}\varphi_{\textup{uni}}(\bar{y})\to\varphi_{T}(\bar{x},\bar{y})\big{)}\Big{)}.

Furthermore there is a constant $c_{I}\in\mathbb{N}$ such that for all $\varphi\in\textup{FO}[\tau]$ ,

|\varphi^{I}|\leq c_{I}\cdot|\varphi|.

(13)

Definition 4.1.

Let $\tau$ be obtained from $\tau_{0}$ by adding pairs and let $I$ be an interpretation of $\tau_{0}$ -structures in $\tau_{E}$ as just described. We say that $I$ is strongly existential if all formulas of $I$ are existential and $\varphi_{<}$ is even quantifier-free.

Lemma 4.2.

Let $\tau$ be obtained from $\tau_{0}$ by adding pairs and let $\varphi_{0\tau}$ be an extension of $\varphi_{0}$ . Then for every strongly existential interpretation $I$ the sentence $\varphi^{I}_{0\tau}$ is (equivalent to) a universal sentence.

Proof : The claim holds as all relation symbols distinct from $<$ are negative in $\varphi_{0\tau}$ . For example, for $\varphi:=\forall x\forall y\big{(}U_{\textup{min}}\,x\to(x=y\vee x<y)\big{)}$ , we have

\varphi^{I}=\forall\bar{x}\Big{(}\varphi_{\textup{uni}}(\bar{x})\to\forall\bar{y}\big{(}\varphi_{\textup{uni}}(\bar{y})\to(\varphi_{U_{\textup{min}}}(\bar{x})\to((x_{1}=y_{1}\wedge x_{2}=y_{2})\vee\varphi_{<}(\bar{x},\bar{y})))\big{)}\Big{)}.

The following result shows that strongly existential interpretations preserve induced substructures in such a way that we can translate the results of the preceding section to the actual context.

Lemma 4.3.

Assume that $I$ is strongly existential. Then for all $\tau_{E}$ -structures $G$ and $H$ with $H\subseteq_{\textup{ind}}G$ and $O_{I}(H)\neq\emptyset$ , we have $\mathcal{O}_{I}(H)\subseteq_{<}\mathcal{O}_{I}(G)$ .

Proof : As $\varphi_{\textup{uni}}$ is existential, we have $O_{I}(H)\subseteq O_{I}(G)$ . Let $T\in\tau$ be distinct from $<$ and $\bar{b}\in T^{\mathcal{O}_{I}(H)}$ . Then $H\models\varphi_{T}(\bar{b})$ . As $\varphi_{T}$ is existential, $G\models\varphi_{T}(\bar{b})$ and thus, $\bar{b}\in T^{\mathcal{O}_{I}(G)}$ . Moreover, for $\bar{b},\bar{b}^{\prime}\in O_{I}(H)$ we have

$\displaystyle\bar{b}<^{\mathcal{O}_{I}(H)}\bar{b}^{\prime}$	$\displaystyle\iff$	$\displaystyle H\models\varphi_{<}(\bar{b},\bar{b}^{\prime})$
	$\displaystyle\iff$	$\displaystyle G\models\varphi_{<}(\bar{b},\bar{b}^{\prime})\qquad\text{(as $H\subseteq_{\textup{ind}}G$ and $\varphi_{<}$ is quantifier-free)}$
	$\displaystyle\iff$	$\displaystyle\bar{b}<^{\mathcal{O}_{I}(G)}\bar{b}^{\prime}.$

Putting all together we see that $\mathcal{O}_{I}(H)\subseteq_{<}\mathcal{O}_{I}(G)$ . $\Box$

We obtain from Lemma 3.8 the corresponding result in our framework.

Lemma 4.4.

Assume that $I$ is strongly existential. Let $\varphi_{0\tau}$ be an extension of $\varphi_{0}$ . Let $G$ be a $\tau_{E}$ -structure and $G\models\varphi_{0\tau}^{I}$ . Let $H\subseteq_{\textup{ind}}G$ with finite $O_{I}(H)$ . If $H\models\varphi_{1\tau}^{I}$ , then $\mathcal{O}_{I}(H)=\mathcal{O}_{I}(G)$ and $G\models\varphi^{I}_{1\tau}$ .

Proof : As $H\models\varphi_{1\tau}^{I}$ , in particular $H\models(\exists x\,U_{\textup{min}}\,x)^{I}$ ; thus, $O_{I}(H)\neq\emptyset$ . Therefore, $\mathcal{O}_{I}(H)\subseteq_{<}\mathcal{O}_{I}(G)$ by Lemma 4.3. By assumption and (12), $\mathcal{O}_{I}(G)\models\varphi_{0\tau}$ and $\mathcal{O}_{I}(H)\models\varphi_{1\tau}$ . As $O_{I}(H)$ is finite, Lemma 3.8 implies $\mathcal{O}_{I}(H)=\mathcal{O}_{I}(G)$ , and in particular $\mathcal{O}_{I}(G)\models\varphi_{1\tau}$ . Hence, $G\models\varphi^{I}_{1\tau}$ by (12). $\Box$

We now prove for strongly existential interpretations two results, Proposition 4.5 corresponds to Tait’s Theorem (Theorem 3.4), and Proposition 4.6 corresponds to Proposition 3.11 (relevant to Gurevich’s Theorem). In our application of these results to graphs in the next section the sentence $\psi$ will be $\forall x\neg Exx\wedge\forall x\forall y(Exy\to Eyx)$ , i.e., the sentence $\varphi_{\textsc{Graph}}$ (cf. (3)) axiomatizing the class of graphs.

Proposition 4.5.

Let $\psi$ be a universal $\tau_{E}$ -sentence. Assume that the interpretation $I$ of $\tau_{0}$ -structures in $\tau_{E}$ -structures is strongly existential. Furthermore, assume that for every sufficiently large finite complete $\tau_{0}$ -ordering $\mathcal{A}$ there is a finite $\tau_{E}$ -structure $G$ with $\mathcal{O}_{I}(G)\cong\mathcal{A}$ and $G\models\psi$ . Then there is an $\textup{FO}[\tau_{E}]$ -sentence $\varphi$ such that $\textsc{Mod}_{\textup{fin}}(\psi\wedge\varphi)$ is closed under induced substructures, but $\psi\wedge\varphi$ is not finitely equivalent to a universal sentence.

As $\varphi$ we an take the sentence

\varphi:=\forall\bar{x}\neg\varphi_{\textup{uni}}(\bar{x})\vee\big{(}\varphi^{I}_{0}\wedge\neg\varphi^{I}_{1}\big{)}

(for the definition of $\varphi_{0}$ and $\varphi_{1}$ see page 3 and (7), respectively).

Proof : First we verify that the class $\textsc{Mod}_{\textup{fin}}(\psi\wedge\varphi)$ is closed under induced substructures. Assume $G\models\psi\wedge\varphi$ and $H\subseteq_{\textup{ind}}G$ . Since $\psi$ is universal, we have $H\models\psi$ . If $G\models\forall\bar{x}\neg\varphi_{\textup{uni}}(\bar{x})$ , then $H\models\forall\bar{x}\neg\varphi_{\textup{uni}}(\bar{x})$ . Now assume that $G\models\varphi^{I}_{0}\wedge\neg\varphi^{I}_{1}$ . Then $H\models\varphi^{I}_{0}$ , as $\varphi_{0}^{I}$ is universal by Lemma 4.2. If $H\models\forall\bar{x}\neg\varphi_{\textup{uni}}(\bar{x})$ or $H\models\neg\varphi^{I}_{1}$ , we are done. Otherwise $O_{I}(H)\neq\emptyset$ and $H\models\varphi^{I}_{1}$ . Then $G\models\varphi^{I}_{1}$ (see Lemma 4.4), a contradiction.

Finally we show that for every $k\in\mathbb{N}$ the sentence $\psi\wedge\varphi$ is not finitely equivalent to a sentence of the form $\mu=\forall z_{1}\ldots\forall z_{k}\,\mu_{0}$ with quantifier-free $\mu_{0}$ . Let

\mathcal{A}:=\big{(}A,<^{\mathcal{A}},U_{\textup{min}}^{\mathcal{A}},U_{\textup{max}}^{\mathcal{A}},S^{\mathcal{A}}\big{)}

be a complete $\tau_{0}$ -ordering with at least $k^{2}+1$ elements. In particular, $\mathcal{A}\models\varphi_{0}\wedge\varphi_{1}$ . By assumption we can choose $\mathcal{A}$ in such a way that there is a finite $\tau_{E}$ -structure $G$ such that $\mathcal{O}_{I}(G)\cong\mathcal{A}$ and $G\models\psi$ . Then $\mathcal{O}_{I}(G)\models\varphi_{0}\wedge\varphi_{1}$ , hence, $G\models\varphi_{0}^{I}\wedge\varphi^{I}_{1}$ . Thus $G\models\psi\wedge\neg\varphi$ . As $|O_{I}(G)|=|A|\geq k^{2}+1$ , the graph $G$ must contain more than $k$ elements by (11).

We want to show that every induced substructure of $G$ with at most $k$ elements is a model of $\psi\wedge\varphi$ . Then the result follows from Corollary 2.7. So let $H$ be an induced substructure of $G$ with at most $k$ elements. Clearly, $H\models(\psi\wedge\varphi^{I}_{0})$ . If $H\models\forall\bar{x}\neg\varphi_{\textup{uni}}(\bar{x})$ or $H\models\neg\varphi^{I}_{1}$ , we are done. Otherwise $O_{I}(H)\neq\emptyset$ and $H\models\varphi^{I}_{1}$ . Then, Lemma 4.4 implies $O_{I}(H)=O_{I}(G)$ . Recall $|V(H)|\leq k$ , so $O_{I}(H)$ has at most $k^{2}$ elements by (11), a contradiction as $|O_{I}(G)|\geq k^{2}+1$ . $\Box$

Proposition 4.6.

Assume that $\psi$ is a universal $\tau_{E}$ -sentence. Let $\tau$ be obtained from $\tau_{0}$ by adding pairs and let $\varphi_{0\tau}$ be an extension of $\varphi_{0}$ . Let $I$ be a strongly existential interpretation of $\tau$ -structures in $\tau_{E}$ -structures with the property that for every finite $\tau$ -structure $\mathcal{A}$ , which is a model of $\varphi_{0\tau}\wedge\varphi_{1\tau}$ , there is a finite $\tau_{E}$ -structure $G$ with $\mathcal{O}_{I}(G)\cong\mathcal{A}$ and $G\models\psi$ .

Let $m\geq 1$ and $\gamma$ be an $\textup{FO}[\tau]$ -sentence such that

\text{$\varphi_{0\tau}\wedge\varphi_{1\tau}\wedge\gamma$ has no infinite model but a finite model with at least $m$ elements}.

(13)

For

\rho:=\forall\bar{x}\neg\varphi_{\textup{uni}}(\bar{x})\vee\big{(}\varphi_{0\tau}\wedge(\varphi_{1\tau}\to\neg\gamma)\big{)}^{I}

(14)

the statements (a) and (b) hold.

(a)

The class $\textsc{Mod}(\psi\wedge\rho)$ is closed under induced substructures.
(b)

If $\mu:=\forall x_{1}\ldots\forall x_{k}\,\mu_{0}$ with quantifier-free $\mu_{0}$ is finitely equivalent to $\psi\wedge\rho$ , then $k^{2}\geq m$ .

Proof : (a) Assume that $G\models\psi\wedge\rho$ and $H\subseteq_{\textup{ind}}G$ . Clearly $H\models\psi$ . If $H\models\forall\bar{x}\neg\varphi_{\textup{uni}}(\bar{x})$ , then we are done. Otherwise, the universe of $\mathcal{O}_{I}(H)$ and hence, that of $\mathcal{O}_{I}(G)$ , are not empty. Then $G\models\varphi^{I}_{0\tau}$ and as $H\subseteq_{\textup{ind}}G$ , we have $H\models\varphi^{I}_{0\tau}$ by Lemma 4.2.

If $H\not\models\varphi^{I}_{1\tau}$ , we are done. Otherwise, $H\models\varphi^{I}_{1\tau}$ . If $H_{I}$ is infinite, then $H_{I}\models\neg\gamma$ by (13) and we are again done. If $H_{I}$ is finite, then $\mathcal{O}_{I}(H)=\mathcal{O}_{I}(G)$ by Lemma 4.4. Thus $\mathcal{O}_{I}(G)\models\varphi_{1\tau}$ and hence, $\mathcal{O}_{I}(G)\models\neg\gamma$ as $G\models\rho$ . Therefore, $\mathcal{O}_{I}(H)\models\neg\gamma$ and thus, $H\models\rho$ .

(b) By (13) there is a finite model $\mathcal{A}$ of $\varphi_{0\tau}\wedge\varphi_{1\tau}\wedge\gamma$ with at least $m$ elements. By assumption there is a finite $\tau_{E}$ -structure $G$ with $\mathcal{O}_{I}(G)\cong\mathcal{A}$ and $G\models\psi$ . Clearly, $G\models\neg\forall\bar{x}\neg\varphi_{\textup{uni}}(\bar{x})$ and $G\models(\varphi_{0\tau}\wedge\varphi_{1\tau}\wedge\gamma)^{I}$ . Hence, $G\models\psi\wedge\neg\rho$ . Assume that $k^{2}<m$ . We want to show that every induced substructure of $G$ with at most $k$ elements is a model of $\psi\wedge\rho$ . Then the claim (b) follows from Corollary 2.7.

So let $H$ be an induced substructure of $G$ with at most $k$ elements. Clearly, $H\models(\psi\wedge\varphi^{I}_{0\tau})$ . If $H\models\forall\bar{x}\neg\varphi_{\textup{uni}}(\bar{x})$ or $H\models\neg\varphi^{I}_{1\tau}$ , we are done. Otherwise $O_{I}(H)\neq\emptyset$ and $H\models\varphi^{I}_{1\tau}$ . Then, $O_{I}(H)=O_{I}(G)$ by Lemma 4.4. This leads to a contradiction, as $O_{I}(H)$ has at most $k^{2}$ elements by (12), while $O_{I}(G)$ has $m$ elements and we assumed $k^{2}<m$ . $\Box$

Remark 4.7.

The results corresponding to Remark 3.5 and Remark 3.12 are valid for Proposition 4.5 and Proposition 4.6 too. In particular, the sentence $\psi\wedge\varphi\ \big{(}=\psi\wedge\forall\bar{x}\neg\varphi_{\textup{uni}}(\bar{x})\vee\big{(}\varphi^{I}_{0}\wedge\neg\varphi^{I}_{1}\big{)}\big{)}$ is not equivalent to a $\Pi_{2}$ -sentence. Furthermore $\psi\wedge\varphi$ itself is equivalent to a $\Sigma_{2}$ -sentence. In fact, as all relation symbols besides $<$ are negative in $\varphi_{0}$ , the sentence $\varphi_{0}^{I}$ is universal. Moreover, as $U_{\textup{min}}$ , $U_{\textup{max}}$ , and $S$ are positive in $\varphi_{1}$ , the sentence $\varphi_{1}^{I}$ (as $\varphi_{1}$ ) is equivalent to a $\Pi_{2}$ -sentence. Hence $\psi\wedge\varphi$ is equivalent to a $\Sigma_{2}$ -sentence.

5 Tait’s Theorem for finite graphs

In this section we introduce a strongly existential interpretation, which allows us to get Tait’s Theorem for graphs. The corresponding result for Gurevich’s Theorem will be derived in Section 6.

We first introduce a further concept. Let $G$ be a graph and $a,b\in V(G)$ . For $r,s\geq 3$ a path from vertex $a$ to vertex $b$ of length $r$ with an $s$ -ear is a path between $a$ and $b$ with a cycle of length $s$ ; one vertex of this cycle is adjacent to the vertex adjacent to $b$ on the path. Figure 1 is a path from $a$ to $b$ of length $6$ with a $4$ -ear.

Figure 1: A path of length 6 with a

4

-ear.

Lemma 5.1.

For $r,s\geq 3$ there are quantifier-free formulas $\varphi_{cr}(x,\bar{z})$ and $\varphi_{pe,r,s}(x,y,\bar{z},\bar{w})$ such that for all graphs $G$ we have

(a)

$G\models\varphi_{cr}(a,\bar{u})\iff\bar{u}$ is a cycle of length $r$ containing $a$ .
(b)

$G\models\varphi_{pe,r,s}(a,b,\bar{u},\bar{v})\iff\bar{u}$ is path from $a$ to $b$ of length $r$ with the $s$ -ear $\bar{v}$ .

Proof : (a) We can take as $\varphi_{cr}(x,z_{1},\ldots,z_{r})$ the formula

x=z_{1}\wedge Ez_{r}z_{1}\wedge\bigwedge_{1\leq i<r}Ez_{i}z_{i+1}\wedge\bigwedge_{1\leq i<j\leq r}\neg z_{i}=z_{j}.

(b) We can take as $\varphi_{pe,r,s}(x,y,z_{0},\ldots,z_{r},w_{1},\ldots,w_{s})$ the formula

	$\displaystyle x=z_{0}\wedge y=z_{r}\wedge\bigwedge_{0\leq i<r-1}Ez_{i}z_{i+1}\wedge\bigwedge_{0\leq i<j\leq r}\neg z_{i}$	$\displaystyle=z_{j}\wedge\bigwedge_{0\leq i\leq r,\ j\in[s]}\neg z_{i}=w_{j}$
		$\displaystyle\wedge\varphi_{cs}(w_{1},w_{1},\ldots,w_{r})\wedge Ez_{r-1}w_{1}.$		$\Box$

To understand better how we obtain the desired interpretation we first assign to every complete $\tau_{0}$ -ordering $\mathcal{A}$ , i.e., to every model of $\varphi_{0}\wedge\varphi_{1}$ , a $\tau_{E}$ -structure $G:=G(\mathcal{A})$ which is a graph.

In a first step we extend $\mathcal{A}$ to a $\tau^{*}_{0}$ -structure $\mathcal{A}^{*}$ , where $\tau^{*}_{0}:=\tau_{0}\cup\{B,C,L,F\}$ in the following way. Here $B,C$ are unary and $L,F$ are binary relation symbols.

For every original (or, basic) element $a$ , i.e., for every $a\in A$ , we introduce a new element $a^{\prime}$ , the companion of $a$ . We set

–

$A^{*}:=A\cup\{a^{\prime}\mid a\in A\}$ ,
–

$B^{\mathcal{A}^{*}}:=A$ , $C^{\mathcal{A}^{*}}:=\{a^{\prime}\mid a\in A\}$ ,
–

$L^{\mathcal{A}^{*}}:=\big{\{}(a,a^{\prime})\;\big{|}\;a\in A\big{\}},\qquad F^{\mathcal{A}^{*}}:=\big{\{}(a^{\prime},b),(b,a^{\prime})\;\big{|}\;a,b\in A,\ a<^{\mathcal{A}}b\big{\}}$ .

Note that the relation $F$ is irreflexive and symmetric, i.e., $\big{(}A^{*},F^{\mathcal{A}^{*}}\big{)}$ is already a graph, which is illustrated by Figure 2. Observe that $F$ contains the whole information of the ordering $<^{\mathcal{A}}$ up to isomorphism.

Figure 2: Turning an ordering to the relation

F

We use $\mathcal{A}^{*}$ to define the desired graph $G=G(\mathcal{A})$ . The vertex set $V(G)$ contains the elements of $A^{*}$ , and the edge relation $E(G)$ contains $F^{\mathcal{A}^{*}}$ . Furthermore $G$ contains just all the vertices and edges required by the following items:

–

To $a\in U_{\textup{min}}^{\mathcal{A}}$ we add a cycle of length $5$ consisting of new vertices, i.e., not in $A^{*}$ (besides $a$ ).
–

To $a\in U_{\textup{max}}^{\mathcal{A}}$ we add a cycle of length $7$ consisting of new vertices (besides $a$ ).
–

To $a\in B^{\mathcal{A}^{*}}$ we add a cycle of length $9$ consisting of new vertices (besides $a$ ).
–

To $a\in C^{\mathcal{A}^{*}}$ we add a cycle of length $11$ consisting of new vertices (besides $a$ ).
–

To $(a,b)\in S^{\mathcal{A}}$ we add a path from $a$ to $b$ of length $17$ with a $13$ -ear consisting of new vertices (besides $a$ and $b$ ).
–

To $(a,a^{\prime})\in L^{\mathcal{A}^{*}}$ we add a path from $a$ to $a^{\prime}$ of length $17$ with a $15$ -ear consisting of new vertices (besides $a$ and $a^{\prime}$ ).

Hereby we meant by “add a cycle” or “add a path with an ear” that we only add the edges required by the corresponding formulas in Lemma 5.1.

To ease the discussion, we divide cycles in $G\ (=G(\mathcal{A}))$ into four categories.

[ $F$ -cycle] These are cycles in $\big{(}A^{*},F^{\mathcal{A}^{*}}\big{)}$ , i.e., cycles using only edges of $F^{\mathcal{A}^{*}}$ .

[ $T$ -cycle] For every unary $T\in\big{\{}U_{\textup{min}},U_{\textup{max}},B,C\big{\}}$ , a $T$ -cycle is the cycle introduced for an $a\in T^{\mathcal{A}}$ .

[ear-cycle] These are the cycles constructed as ears on the gadgets for the relations $S^{\mathcal{A}^{*}}$ and $L^{\mathcal{A}^{*}}$ .

[mixed-cycle] All the other cycles are mixed.

For example, we get a mixed cycle if we start with $a_{2}$ , $a^{\prime}_{0}$ , $a_{1}$ in Figure 2 and then add the path introduced for $(a_{1},a_{2})\in S^{\mathcal{A}}$ (ignoring the ear).

A number of observations for these types of cycles are in order.

Lemma 5.2.

(i)

All the $F$ -cycles are of even length.¹¹1Moreover one can show that every chordless $F$ -cycle has length $4$ .
(ii)

Every $U_{\textup{min}}$ -, $U_{\textup{max}}$ -, $B$ -, and $C$ -cycle is of length $5$ , $7$ , $9$ , and $11$ , respectively.
(iii)

Every ear-cycle is of length $13$ or $15$ .
(iv)

Every mixed-cycle neither uses new vertices of any $T$ -cycle for $T\in\big{\{}U_{\textup{min}},U_{\textup{max}},B,C\big{\}}$ nor any vertex of any ear-cycle.
(v)

Every mixed-cycle has length at least $17$ .

Proof : (i) follows easily from the fact that $\big{(}A^{*},F^{\mathcal{A}^{*}}\big{)}$ is a bipartite graph; (ii) and (iii) are trivial.

For (iv) assume that a mixed-cycle uses a new vertex $b$ of a $T$ -cycle $\mathcal{C}$ introduced for some $a\in T^{\mathcal{A}^{*}}$ , where $T\in\big{\{}U_{\textup{min}},U_{\textup{max}},B,C\big{\}}$ . As $\mathcal{C}$ is mixed, it must contain a vertex $c\notin T^{\mathcal{A}^{*}}$ . To reach $b$ from $c$ the mixed cycle must pass through $a$ and hence must contain one of the two segments of $\mathcal{C}$ between $b$ and $a$ . As a consequence, in order for the mixed-cycle to go back from $b$ to $c$ , it must also use the other segment of $\mathcal{C}$ between $a$ and $b$ . This means that it must be the $T$ -cycle $\mathcal{C}$ itself, instead of a mixed one. A similar argument shows that mixed cycles do not contain vertices of any ear-cycle.

To prove (v), let $\mathcal{C}$ be a mixed-cycle. By (iv), $\mathcal{C}$ must contain all vertices of a (at least one) path introduced for a pair $(a,a^{\prime})\in L^{\mathcal{A}*}$ or $(a,b)\in S^{\mathcal{A}^{*}}$ (ignoring the ear). As this path has length $17$ , we get our claim. $\Box$

Conversely, given a $\tau_{E}$ -structure $G$ , which is a graph, we construct a $\tau_{0}$ -structure which we denote by $\mathcal{O}(G)$ , possibly the empty structure. Recall the definitions of “cycle” and of “path with ear” given by Lemma 5.1.

–

$O(G):=\big{\{}(a_{1},a_{2})\in V(G)\times V(G)\;\big{|}\;\text{$a_{1}$ is a member of a cycle of length $9$, \ $a_{2}$ is a member}\\ \text{of a cycle of length $11$, and there is a path from $a_{1}$ to $a_{2}$ of length $17$ with a $15$-ear}\big{\}}$
–

$<^{\mathcal{O}(G)}:=\big{\{}((a_{1},a_{2}),(b_{1},b_{2}))\in O(G)\times O(G)\;\big{|}\;\{a_{2},b_{1}\}\in E(G)\big{\}}$
–

$U_{\textup{min}}^{\mathcal{O}(G)}:=\big{\{}(a_{1},a_{2})\in O(G)\;\big{|}\;\text{$a_{1}$ is a member of a cycle of $5$ elements}\big{\}}$
–

$U_{\textup{max}}^{\mathcal{O}(G)}:=\big{\{}(a_{1},a_{2})\in O(G)\;\big{|}\;\text{$a_{1}$ is a member of a cycle of $7$ elements}\big{\}}$
–

$S^{\mathcal{O}(G)}:=\big{\{}((a_{1},a_{2}),(b_{1},b_{2}))\in O(G)\times O(G)\mid\text{there is a path from $a_{1}$ to $b_{1}$ of length $17$}\\ \text{with a $13$-ear}\big{\}}$ .

Lemma 5.3.

For every complete $\tau_{0}$ -ordering $\mathcal{A}$ we have $\mathcal{O}(G(\mathcal{A}))\cong\mathcal{A}$ .

Proof : Let $G:=G(\mathcal{A})$ and $\mathcal{A}^{+}:=\mathcal{O}(G)$ . We claim that the mapping $h:A\to A^{+}$ defined by

h(a):=(a,a^{\prime})\quad\text{for $a\in A$}

is an isomorphism from $\mathcal{A}$ to $\mathcal{A}^{+}$ . To that end, we first prove that

A^{+}=\big{\{}(a,a^{\prime})\;\big{|}\;a\in A\big{\}},

which implies that $h$ is well defined and a bijection. For every $a\in A$ it is easy to see that $(a,a^{\prime})\in O(G)\ (=A^{+})$ . For the converse, let $(a_{1},a_{2})\in O(G)$ . In particular, $a_{1}$ is a member of a cycle of length $9$ . By Lemma 5.2, this must be a $B$ -cycle which contains some $a\in A$ . Using the same argument, $a_{2}$ is a member of a $C$ -cycle which contains a vertex $b^{\prime}$ being the companion of some $b\in A$ . Furthermore, there is a path from $a_{1}$ to $a_{2}$ of length $17$ with a $15$ -ear. The $15$ -ear is a cycle of length $15$ . Again by Lemma 5.2 this cycle is an ear-cycle which belongs to the gadget we introduced for some $(c,c^{\prime})\in L^{\mathcal{A}^{*}}$ with $c\in A$ . Then it is easy to see that $a=c=b$ . This finishes the proof that $h$ is a bijection from $A$ to $A^{+}$ .

Similarly, we can prove that $h$ preserves all the relations. $\Box$

We want to show that we can obtain $\mathcal{O}(G)$ from $G$ by a strongly existential FO-interpretation. We set

	$\displaystyle\eta(x,x^{\prime},\bar{x},\bar{x}^{\prime},\bar{z},\bar{w}):=$	$\displaystyle\ \text{``$\bar{x}$ is a cycle of length $9$ containing $x$, \ $\bar{x}^{\prime}$ is a cycle of length $11$ containing $x^{\prime}$},$
		and $\bar{z}$ is a path from $x$ to $x^{\prime}$ of length $17$ with the $15$ -ear $\bar{w}$ ”
	$\displaystyle=$	$\displaystyle\ \varphi_{c9}(x,\bar{x})\wedge\varphi_{c11}(x^{\prime},\bar{x}^{\prime})\wedge\varphi_{pe,17,15}(x,x^{\prime}\bar{z},\bar{w}).$

We define the desired interpretation $I$ of width $2$ of $\tau_{0}$ -structures in graphs. We set

\varphi_{\textup{uni}}(x,x^{\prime}):=\exists\bar{x}\exists\bar{x}^{\prime}\exists\bar{z}\exists\bar{w}\,\eta(x,x^{\prime},\bar{x},\bar{x}^{\prime},\bar{z},\bar{w}).

Hence for every graph $G$ ,

O_{I}(G)=\big{\{}(a_{1},a_{2})\in V(G)\times V(G)\;\big{|}\;\text{$G\models\exists\bar{x}\exists\bar{x}^{\prime}\exists\bar{z}\exists\bar{w}\,\eta(a_{1},a_{2},\bar{x},\bar{x}^{\prime},\bar{z},\bar{w})$}\big{\}}.

Furthermore we define

–

$\varphi_{U_{\textup{min}}}(x,x^{\prime}):=\exists\bar{z}\,\varphi_{c5}(x,\bar{z})$ ,
–

$\varphi_{U_{\textup{max}}}(x,x^{\prime}):=\exists\bar{z}\,\varphi_{c7}(x,\bar{z})$ ,
–

$\varphi_{S}(x,x^{\prime},y,y^{\prime}):=\exists\bar{z}\exists\bar{w}\;\text{``$\bar{z}$ is a path of length $17$ from $x$ to $y$ with a $13$-ear $\bar{w}$''}$
$=\exists\bar{z}\exists\bar{w}\varphi_{pe,17,13}(x,\bar{z},\bar{w})$ .

Then we have:

Lemma 5.4.

The interpretation $I$ given by $\big{(}\varphi_{\textup{uni}},\varphi_{<},\varphi_{U_{\textup{min}}},\varphi_{U_{\textup{max}}},\varphi_{S}\big{)}$ is strongly existential. For every complete $\tau_{0}$ -ordering $\mathcal{A}$ we have $\mathcal{O}_{I}(G(\mathcal{A}))=\mathcal{O}(G(\mathcal{A}))$ and hence, by Lemma 5.3,

\mathcal{O}_{I}(G(\mathcal{A}))\cong\mathcal{A}.

Setting $\psi:=\varphi_{\textsc{Graph}}$ , the sentence axiomatizing the class of graphs, we get from Proposition 4.5:

Theorem 5.5 (Tait’s Theorem for graphs).

There is a $\tau_{E}$ -sentence $\varphi$ such that $\textsc{Graph}_{\textup{fin}}(\varphi)$ , the class of finite graphs that are models of $\varphi$ , is closed under induced subgraphs but $\varphi$ is not equivalent to a universal sentence in finite graphs.

In this section we presented a strongly existential interpretation of $\tau_{0}$ -structures and applied it to finite complete $\tau_{0}$ -orderings, i.e, to models of $\varphi_{0}\wedge\varphi_{1}$ . A straightforward generalization of the preceding proofs allows to show the following result for vocabularies obtained from $\tau_{0}$ by adding pairs. We shall use it in Section 6.

Lemma 5.6.

Let $\tau$ be obtained from $\tau_{0}$ by adding pairs. There is a strongly existential interpretation $I\ (=I_{\tau})$ that for every extension $\varphi_{0\tau}$ of $\varphi_{0}$ assigns to every $\tau$ -structure $\mathcal{A}$ that is a model of $\varphi_{0\tau}\wedge\varphi_{1\tau}$ a graph $G(\mathcal{A})$ with $\mathcal{O}_{I}(G(\mathcal{A}))\cong\mathcal{A}$ . For finite $\mathcal{A}$ the graph $G(\mathcal{A})$ is finite.

Proof : We get the graph $G(\mathcal{A})$ as in the case $\tau:=\tau_{0}$ : For the elements of new unary relations we add cycles such that the lengths of the cycles are odd and distinct for distinct unary relations in $\tau$ . Let $c$ be the maximal length of these cycles. Then we add paths with ears to the tuples of binary relations as above. For distinct binary relations the ears should have distinct length and again this length should be odd and greater than $c$ . On the other hand, the length of added new paths can be the same for all binary relations but should be greater than the length of all the cycles. $\Box$

Remark 5.7.

(a) Let $\mathscr{C}:=\textsc{Mod}_{\textup{fin}}(\forall x\neg Exx)$ be the class of directed graphs. Then $\mathscr{C}^{\prime}:=\textsc{Graph}_{\textup{fin}}$ , the class of finite graphs, is a subclass of $\mathscr{C}$ closed under induced substructures and definable in $\mathscr{C}$ by the universal sentence $\forall x\forall y(Exy\to Eyx)$ . As the Łoś-Tarski Theorem fails for the class of finite graphs, it fails for the class of directed graphs by Remark 2.8.

(b) Now let $\mathscr{C}:=\textsc{Graph}_{\textup{fin}}$ and $\mathscr{C}^{\prime}:=\textsc{Planar}_{\textup{fin}}$ be the class of finite planar graphs, a subclass of $\textsc{Graph}_{\textup{fin}}$ closed under induced subgraphs. As mentioned in the Introduction, in [1] it is shown that the Łoś-Tarski Theorem fails for $\textsc{Planar}_{\textup{fin}}$ . As $\textsc{Planar}_{\textup{fin}}$ is not axiomatizable in $\textsc{Graph}_{\textup{fin}}$ by a universal sentence, not even by a first-order sentence, we do not get the failure of the Łoś-Tarski Theorem for the class of finite graphs, i.e., Tait’s Theorem for graphs, by applying the result of Remark 2.8. We show that $\textsc{Planar}_{\textup{fin}}=\textsc{Forb}_{\textup{fin}}(\mathscr{F})$ for a finite set $\mathscr{F}$ of finite graphs (or, equivalently, $\textsc{Planar}_{\textup{fin}}=\textsc{Mod}_{\textup{fin}}(\mu)$ for a universal $\mu$ ) leads to a contradiction. Let $k$ be the maximum size of the set of vertices of graphs in $\mathscr{F}$ . Let $G$ be the graph obtained from the clique $K_{5}$ of 5 vertices by subdividing each edge by $k+1$ . Clearly, $G\notin\textsc{Planar}_{\textup{fin}}$ . However, every subgraph of $G$ induced on at most $k$ elements is planar. Hence, $G\in\textsc{Forb}_{\textup{fin}}(\mathscr{F})$ .

(c) Let $\tau$ be any vocabulary with at least one at least binary relation $T$ . Then the Łoś-Tarski Theorem fails for the class $\mathscr{C}:=\textsc{Str}_{\textup{fin}}[\tau]$ , the class of all finite $\tau$ -structures. By Remark 2.8 it suffices to show the existence of a universally definable subclass $\mathscr{C}^{\prime}$ of $\mathscr{C}$ which “essentially is the class of graphs.” We set

\mu:=\forall x\forall\bar{u}\neg Txx\bar{u}\wedge\forall x\forall y\forall\bar{u}\forall\bar{v}(Txy\bar{u}\to Tyx\bar{v})\wedge\bigwedge_{R\in\tau,\ R\neq T}\forall\bar{u}\neg R\bar{u}

and let $\mathscr{C}^{\prime}$ be $\textsc{Mod}_{\textup{fin}}(\mu)$ .

If $\tau$ only contains unary relation symbols, the Łoś-Tarski Theorem holds for $\textsc{Str}_{\textup{fin}}[\tau]$ . It is easy to see for an $\textup{FO}(\tau)$ -sentence $\varphi$ that the closure under induced substructures of $\textsc{Mod}_{\textup{fin}}(\varphi)$ implies that of $\textsc{Mod}(\varphi)$ .

6 Gurevich’s Theorem

The following discussion will eventually lead to a proof of Gurevich’s Theorem, i.e., Theorem 1.5. Our proof essentially follows Gurevich’s proof in [14], but it contains some elements of Rossman’s proof of the same result in [16]. ²²2The reader of [14] will realize that the definition of $\varphi^{n}$ on page 190 of [14] must be modified in order to ensure that the class of models of $\varphi^{n}$ is closed under induced substructures. Afterwards we show that it remains true if we restrict ourselves to graphs.

Our main tool is Proposition 3.11, and the goal is to construct a formula $\gamma$ in (10) whose size is much smaller than the number $m$ . Basically $\gamma$ will describe a very long computation of a Turing machine on a short input. We fix a universal Turing machine $M$ operating on an one-way infinite tape, the tape alphabet is $\{0,1\}$ , where $0$ is also considered as blank, and $Q$ is the set of states of $M$ . The initial state is $q_{0}$ , and $q_{h}$ is the halting state; thus $q_{0},q_{h}\in Q$ and we assume that $q_{0}\neq q_{h}$ . An instruction of $M$ has the form

qapbd,

where $q,p\in Q$ , $a,b\in\{0,1\}$ and $d\in\{-1,0,1\}$ . It indicates that if $M$ is in state $q$ and the head of $M$ reads an $a$ , then the head replaces $a$ by $b$ and moves to the left (if $d=-1$ ), stays still (if $d=0$ ), or moves to the right (if $d=1$ ). In order to describe computations of $M$ by FO-formulas we introduce binary predicates $H_{q}(x,t)$ for $q\in Q$ to indicate that at time $t$ the machine is in state $q$ and the head scans cell $x$ , and a binary predicate $C_{0}(x,t)$ to indicate that the content of cell $x$ at time $t$ is 0.

The vocabulary $\tau_{M}$ is obtained from $\tau_{0}$ by adding pairs (see Definition 3.6 (a)),

\tau_{M}:=\tau_{0}\cup\big{\{}H_{q},H_{q}^{\textup{comp}}\;\big{|}\;q\in Q\big{\}}\cup\big{\{}C_{0},C_{0}^{\textup{comp}}\big{\}}.

Intuitively, $H_{q}^{\textup{comp}}(x,t)$ says that “at time $t$ the machine is not in state $q$ or the head is not in cell $x$ ;” and $C_{0}^{\textup{comp}}(x,t)$ says that “at time $t$ the content of cell $x$ is (not 0 and thus is) 1.” Sometimes we write $C_{1}$ instead of $C_{0}^{\textup{comp}}$ (e.g., below in $\varphi_{2}$ if $a=1$ or $b=0$ ).

Let $\varphi_{0}$ and $\varphi_{1}$ be the sentences already introduced in Section 3. For $w\in\{0,1\}^{*}$ the sentence $\varphi_{0w}$ will be an extension of $\varphi_{0}$ (compare Definition 3.6 (b)); hence, $\varphi_{0w}$ will be a universal sentence and all relations symbols besides $<$ are negative in $\varphi_{0w}$ ; in particular, it contains as conjuncts $\varphi_{0}$ and

\forall x\forall t\big{(}\neg C_{0}(x,t)\vee\neg C^{\textup{comp}}_{0}(x,t)\big{)}\wedge\bigwedge_{q\in Q}\forall x\forall t\big{(}\neg H_{q}(x,t)\vee\neg H_{q}^{\textup{comp}}(x,t)\big{)}.

Finally, $\varphi_{0w}$ will contain the following sentences $\varphi_{2}$ and $\varphi_{w}$ as conjuncts. The sentence $\varphi_{2}$ describes one computation step. It contains for each instruction of $M$ one conjunct. For example, the instruction $qapb1$ contributes the conjunct

	$\displaystyle\forall xx^{\prime}\forall tt^{\prime}\forall y\Big{(}\big{(}H_{q}$	$\displaystyle(x,t)\wedge C_{a}(x,t)\wedge S(x,x^{\prime})\wedge S(t,t^{\prime})\big{)}$
	$\displaystyle\to\big{(}$	$\displaystyle(\neg C_{1-b}(x,t^{\prime})\wedge\neg H^{\textup{comp}}_{p}(x^{\prime},t^{\prime}))$
		$\displaystyle\wedge(y\neq x^{\prime}\to\bigwedge_{r\in Q}\neg H_{r}(y,t^{\prime}))$
		$\displaystyle\wedge(y\neq x\to((C_{0}(y,t)\to\neg C^{\textup{comp}}_{0}(y,t))\wedge(C^{\textup{comp}}_{0}(y,t^{\prime})\to\neg C_{0}(y,t^{\prime}))))\big{)}\Big{)}.$

For $w\in\{0,1\}^{*}$ the sentence $\varphi_{w}$ describes the initial configuration of $M$ with input $w$ (if $w=w_{1}\ldots w_{|w|}$ , the first $|w|$ cells contain $w_{1},\ldots,w_{|w|}$ , the remaining cells contain $0$ , and the head scans the first cell in the starting state $q_{0}$ ). Hence, as $\varphi_{w}$ we can take the conjunction of

–

$\forall x_{1}\ldots\forall x_{|w|}\big{(}(U_{\textup{min}}\,x_{1}\wedge\bigwedge_{i\in[|w|-1]}S{x_{i}}x_{i+1})\\ {\hskip 113.81102pt}\to(\bigwedge_{\begin{subarray}{c}i\in[|w|],\\ w_{i}=0\end{subarray}}\neg C^{\textup{comp}}_{0}(x_{i},x_{1})\wedge\bigwedge_{\begin{subarray}{c}i\in[|w|],\\ w_{i}=1\end{subarray}}\neg C_{0}(x_{i},x_{1}))\big{)}$
–

$\forall x_{1}\ldots\forall x_{|w|}\forall x\big{(}(U_{\textup{min}}\,x_{1}\wedge\bigwedge_{i\in[|w|-1]}S{x_{i}}x_{i+1}\wedge x_{|w|}<x)\to\neg C^{\textup{comp}}_{0}(x,x_{1})\big{)}$
–

$\forall x\forall y\big{(}U_{\textup{min}}\,x\to(\neg H_{q_{0}}^{\textup{comp}}(x,x)\wedge(y\neq x\to\bigwedge_{q\in Q}\neg H_{q}(y,x)))\big{)}$ .

Note that $U_{\textup{min}}$ , $U_{\textup{max}}$ , and $S$ are negative in $\varphi_{0w}$ . We set $\varphi_{1M}:=\varphi_{1\tau_{M}}$ ; recall that by Definition 3.6 (c),

\varphi_{1M}=\varphi_{1}\wedge\forall x\forall t\big{(}C_{0}(x,t)\vee C^{\textup{comp}}_{0}(x,t)\big{)}\wedge\bigwedge_{q\in Q}\forall x\forall t\big{(}H_{q}(x,t)\vee H_{q}^{\textup{comp}}(x,t)\big{)}.

Let $w\in\{0,1\}^{*}$ and $r\in\mathbb{N}$ . Furthermore, let $\mathcal{A}$ be a $\tau_{M}$ -structure where $<^{\mathcal{A}}$ is an ordering and $|A|\geq r+1$ . Let $a_{0},\ldots,a_{r}$ be the first $r+1$ elements of $<^{\mathcal{A}}$ . Assume that $M$ on the input $w\in\{0,1\}^{*}$ runs at least $r$ steps. We say that $\mathcal{A}$ correctly encodes $r$ steps of the computation of $M$ on $w$ if for $i,j$ with $0\leq i,j\leq r$ ,

\displaystyle(a_{i},a_{j})\in C_{0}^{\mathcal{A}}

\displaystyle\iff

the content of cell

i

after

j

steps is 0

(15)

and for $q\in Q$ ,

\displaystyle(a_{i},a_{j})\in H_{q}^{\mathcal{A}}

\displaystyle\iff

after

j

steps

M

is in state

q

and the head scans cell

j

(16)

From the definitions of the sentences $\varphi_{0w}$ and $\varphi_{1M}$ , we see:

Lemma 6.1.

Let $w\in\{0,1\}^{*}$ and $\mathcal{A}$ be a model $\varphi_{0w}\wedge\varphi_{1M}$ . If for $r\in\mathbb{N}$ we have $r+1\leq|A|$ (in particular, if $A$ is infinite) and $M$ on $w$ runs at least $r$ steps, then $\mathcal{A}$ correctly encodes $r$ steps of the computation of $M$ on $w$ .

Finally, let $\gamma_{M}$ be a sentence expressing that “the machine $M$ reaches the halting state $q_{h}$ in exactly ‘max’ steps,” more precisely,

\gamma_{M}:=\exists t\exists x\big{(}U_{\textup{max}}t\wedge H_{q_{h}}(x,t)\wedge\forall t^{\prime}\forall y(t^{\prime}<t\to\neg H_{q_{h}}(y,t^{\prime}))\big{)}.

(17)

As a consequence of the preceding lemma, we obtain:

Corollary 6.2.

Let $w\in\{0,1\}^{*}$ and assume that $M$ on input $w$ eventually halts, say in $h(w)$ steps, then

\varphi_{0w}\wedge\varphi_{1M}\wedge\gamma_{M}

has no infinite model but a model with exactly $h(w)+1$ elements (this model is unique up to isomorphism).

Proof : Let $\mathcal{A}\models\varphi_{0w}\wedge\varphi_{1M}\wedge\gamma_{M}$ . Then $\mathcal{A}\upharpoonright\tau_{0}$ is a complete $\tau_{0}$ -ordering and $\mathcal{A}$ contains the description of the complete halting computation of $M$ on the input $w$ . As the machine $M$ reaches the halting state in exactly $h(w)$ steps, we see that $|A|=h(w)+1$ ; in particular, $A$ is finite.

On the other hand, we can interpret (15) and (16) as defining relations $C_{0}^{\mathcal{A}}$ and $H_{q}^{\mathcal{A}}$ on the set $A:=\big{\{}a_{0},\ldots,a_{h(w)}\big{\}}$ equipped with the “natural” ordering and its corresponding relations $U_{\textup{min}}$ , $U_{\textup{max}}$ , and $S$ . If furthermore we let $(C^{\textup{comp}}_{0})^{\mathcal{A}}$ and $(H_{q}^{\textup{comp}})^{\mathcal{A}}$ be the complements in $A\times A$ of $C_{0}^{\mathcal{A}}$ and $H_{q}^{\mathcal{A}}$ , respectively, we get a model of $\varphi_{0w}\wedge\varphi_{1M}\wedge\gamma_{M}$ with exactly $h(w)+1$ elements. $\Box$

We set

\chi_{w}:=\varphi_{0w}\wedge(\varphi_{1M}\to\neg\gamma_{M}).

(18)

By Proposition 3.11 and Corollary 6.2, we get:

Lemma 6.3.

Let $M$ on input $w$ eventually halt, say in $h(w)$ steps. Then:

(a)

$\textsc{Mod}(\chi_{w})$ is closed under $<$ -substructures.
(b)

If $\chi_{w}$ is finitely equivalent to a universal sentence $\mu$ , then $|\mu|\geq h(w)+1$ .

Now we show the following version of Gurevich’s Theorem.

Theorem 6.4.

Let $f:\mathbb{N}\to\mathbb{N}$ be a computable function. Then there is a $w\in\{0,1\}^{*}$ such that $\textsc{Mod}(\chi_{w})$ is closed under $<$ -substructures but $\chi_{w}$ is not finitely equivalent to a universal sentence of length less than $f(|\chi_{w}|)$ .

Proof : By the previous lemma it suffices to find a $w\in\{0,1\}^{*}$ such that $M$ on input $w$ halts in $h(w)$ steps with

h(w)\geq f(|\chi_{w}|).

W.l.o.g. we assume that $f$ is increasing. An analysis of the formula $\chi_{w}$ shows that for some $c_{M}\in\mathbb{N}$ we have for all $w\in\{0,1\}^{*}$ ,

|\chi_{w}|\leq c_{M}\cdot|w|.

(19)

We define $g:\mathbb{N}\to\mathbb{N}$ by

g(k):=f(5\cdot c_{M}\cdot k).

(20)

Let $M_{0}$ be a Turing machine computing $g$ , more precisely, the function $1^{k}\mapsto 1^{g(k)}$ . We code $M_{0}$ and $1^{k}$ by a $\{0,1\}$ -string $\textit{code}(M_{0},1^{k})$ such that $M$ on $\textit{code}(M_{0},1^{k})$ simulates the computation of $M_{0}$ on $1^{k}$ .

Choose the least $k$ such that for $w:=\textit{code}(M_{0},1^{k})$ we have

|w|\leq 5k.

(21)

The universal Turing machine $M$ on input $w$ computes $1^{g(k)}$ and thus runs at least $g(k)$ steps, say, exactly $h(w)$ steps. By (19) – (21)

h(w)\geq g(k)=f(5\cdot c_{M}\cdot k)\geq f(c_{M}\cdot|w|)\geq f(|\chi_{w}|).

Finally we prove Gurevich’s Theorem for graphs. For $\tau:=\tau_{M}$ let $I$ be an interpretation according to Lemma 5.6. For $w\in\{0,1\}^{*}$ we consider the sentence

\rho_{w}:=\forall\bar{x}\neg\varphi_{\textup{uni}}(\bar{x})\vee(\varphi_{0w}\wedge(\varphi_{1M}\to\neg\gamma_{M}))^{I}=\forall\bar{x}\neg\varphi_{\textup{uni}}(\bar{x})\vee\chi_{w}^{I}.

(21)

That is, for $G\models\rho_{w}$ , either the graph $G$ interprets an empty $\tau_{M}$ -structure, or a $\tau_{M}$ -structure which is a model of $\chi_{w}$ . If $M$ halts in $h(w)$ steps on input $w$ , then $\varphi_{0w}\wedge\varphi_{1M}\wedge\gamma_{M}$ has no infinite model but a finite model with $h(w)+1$ elements by Corollary 6.2. Hence taking in Proposition 4.6 as $\psi$ the sentence $\psi_{\textsc{Graph}}$ axiomatizing the class of graphs we get the following analogue of Lemma 6.3.

Lemma 6.5.

Let $M$ on input $w$ halt in $h(w)$ steps. Then:

(a)

$\textsc{Graph}(\rho_{w})$ , the class of graphs that are model of $\rho_{w}$ , is closed under induced subgraphs (and hence equivalent in the class of graphs to a universal sentence).
(b)

If $\rho_{w}$ is equivalent in the class of finite graphs to a universal sentence $\mu$ , then $|\mu|^{2}\geq h(w)$ .

Theorem 6.6 (Gurevich’s Theorem for graphs).

Let $f:\mathbb{N}\to\mathbb{N}$ be a computable function. Furthermore, let $\rho_{w}$ be defined by (21), where $I$ is an interpretation for $\tau:=\tau_{M}$ according to Lemma 5.6. Then there is a $w\in\{0,1\}^{*}$ such that $\textsc{Graph}(\rho_{w})$ is closed under induced subgraphs but $\rho_{w}$ is not equivalent in the class of finite graphs to a universal sentence of length less than $f(|\rho_{w}|)$ .

Proof : Again we assume that $f$ is increasing. By the previous lemma it suffices to find a $w\in\{0,1\}^{*}$ such that $M$ on input $w$ halts in $h(w)$ steps with

h(w)\geq f(|\rho_{w}|)^{2}.

There is a $c\in\mathbb{N}$ , which depends on $I$ but not on $w$ , such that for $c_{I}$ as in (13) and $c_{M}$ as in (19) we have for $d_{M}:=c+c_{I}\cdot c_{M}$ ,

|\rho_{w}|\leq c+c_{I}\cdot|\chi_{w}|\leq c+c_{I}\cdot c_{M}\cdot|w|\leq d_{M}\cdot|w|.

(22)

We define $g:\mathbb{N}\to\mathbb{N}$ by

g(k):=f(5\cdot d_{M}\cdot k)^{2}

(23)

and then proceed as in the proof of Theorem 6.4. Let $M_{0}$ be a Turing machine computing the function $1^{k}\mapsto 1^{g(k)}$ . We code $M_{0}$ and $1^{k}$ by a $\{0,1\}$ -string $\textit{code}(M_{0},1^{k})$ such that $M$ on $\textit{code}(M_{0},1^{k})$ simulates the computation of $M_{0}$ on $1^{k}$ .

Choose the least $k$ such that for $w:=\textit{code}(M_{0},1^{k})$ we have

|w|\leq 5k.

(24)

The universal Turing machine $M$ on input $w$ computes $1^{g(k)}$ and thus runs at least $g(k)$ steps, say, exactly $h(w)$ steps. We have

h(w)\geq g(k)=f(5\cdot d_{M}\cdot k)^{2}\geq f(d_{M}\cdot|w|)^{2}\geq f(|\rho_{w}|)^{2}

by (22) – (24). $\Box$

Remark 6.7.

Using previous remarks (Remark 3.12 and Remark 4.7) one can even show that for every computable function $f:\mathbb{N}\to\mathbb{N}$ the sentence $\chi_{w}$ is not finitely equivalent to a $\Pi_{2}$ -sentence of length less than $f(|\chi_{w}|)$ and the sentence $\rho_{w}$ is not finitely equivalent in graphs to a $\Pi_{2}$ -sentence of length less than $f(|\chi_{w}|)$ . Moreover, $\chi_{w}$ and $\rho_{w}$ are equivalent to $\Sigma_{2}$ .

For this purpose note that in models of $\varphi_{0w}$ the sentence $\gamma_{M}$ is equivalent to

\exists t\exists x\big{(}U_{\textup{max}}t\wedge H_{q_{h}}(x,t)\big{)}\wedge\forall t_{1}\forall t_{2}\forall y\big{(}t_{1}<t_{2}\to\neg H_{q_{h}}(y,t_{2})\big{)}.

and hence equivalent to a $\Sigma_{2}$ and to a $\Pi_{2}$ -sentence. One easily verifies that the same holds for $\gamma_{M}^{I}$ .

7 Some undecidable problems

In this section we show that various problems related to the results of the preceding sections are undecidable. Among others, these results explain why it might be hard, in fact impossible in general, to obtain forbidden induced subgraphs for various classes of graphs.

Proposition 7.1.

There is no algorithm that applied to any $\textup{FO}[\tau_{E}]$ -sentence $\varphi$ decides whether the class $\textsc{Graph}(\varphi)$ is closed under induced subgraphs.

Proof : Assume $\mathbb{A}$ is such an algorithm. By the Completeness Theorem there is an algorithm $\mathbb{B}$ that assigns to every sentence $\varphi$ with $\textsc{Graph}(\varphi)$ closed under induced subgraphs a universal sentence equivalent to $\varphi$ in graphs. Define the function $g$ by

g(\varphi):=\begin{cases}0,&\text{if $\mathbb{A}$ rejects $\varphi$}\\ m,&\text{$\mathbb{B}$ needs $m$ steps to produce a universal sentence equivalent to $\varphi$}\end{cases}

and set $f(k):=\textup{max}\{g(\varphi)\mid|\varphi|\leq k\}$ . Then $f$ would contradict Gurevich’s Theorem for graphs, i.e., Theorem 6.6. $\Box$

Corollary 7.2.

There is no algorithm that applied to any $\textup{FO}[\tau_{E}]$ -sentence $\varphi$ either reports that $\textsc{Graph}(\varphi)$ is not closed under induced subgraphs or it computes for $\textsc{Graph}(\varphi)$ a class of forbidden induced subgraphs.

Proof : Otherwise we could use this algorithm as a decision algorithm for the previous result. $\Box$

The following proposition is the analog of Proposition 7.1 for classes of finite graphs. We state it for $\textup{FO}[\tau_{E}]$ -sentences and graphs even though we prove it for $\textup{FO}[\tau_{M}]$ -sentences. One gets the version for graphs using the machinery we developed in previous sections similarly as we do it to get Corollary 7.5 from Proposition 7.4 below.

We write $M:w\mapsto\infty$ for the universal Turing machine $M$ and a word $w\in\{0,1\}^{*}$ if $M$ on input $w$ does not halt. We make use of the sentences $\varphi_{0w}$ , $\varphi_{1M}$ , and $\gamma_{M}$ defined in the previous section.

Proposition 7.3.

There is no algorithm that applied to any $\textup{FO}[\tau_{E}]$ -sentence $\varphi$ decides whether the class $\textsc{Graph}\,_{\textup{fin}}(\varphi)$ is closed under induced subgraphs.

Proof : For the universal Turing machine $M$ and a word $w\in\{0,1\}^{*}$ consider the sentence

\pi_{w}:=\varphi_{0w}\wedge\varphi_{1M}\wedge\gamma_{M}.

Then

\textsc{Mod}\,_{\textup{fin}}(\pi_{w})

is closed under induced subgraphs

\displaystyle\iff

\displaystyle M:w\mapsto\infty.

(25)

In fact, if $M:w\mapsto\infty$ , then $\textsc{Mod}\,_{\textup{fin}}(\pi_{w})=\emptyset$ , hence $\textsc{Mod}\,_{\textup{fin}}(\pi_{w})$ is trivially closed under induced subgraphs. If $M$ on input $w$ halts after $h(w)$ steps, then, up to isomorphism, there is a unique model $\mathcal{A}_{w}$ of $\pi_{w}$ and it has $h(w)+1$ elements. By Lemma 3.8 every proper induced substructure of $\mathcal{A}_{w}$ is not a model of $\pi_{w}$ . Hence $\textsc{Mod}\,_{\textup{fin}}(\pi_{w})$ is not closed under induced subgraphs. As the halting problem for every universal Turing machine is not decidable, by (25) we get our claim. $\Box$

Proposition 7.4.

There is no algorithm that applied to any $\textup{FO}[\tau_{M}]$ -sentence, which is finitely equivalent to a universal sentence, computes such a universal sentence.

Proof : Again we show that such an algorithm would allow us to decide for every $w\in\{0,1\}^{*}$ whether the universal Turing machine $M$ halts on input $w$ . In (18) we defined $\chi_{w}$ by

\chi_{w}=\varphi_{0w}\wedge(\varphi_{1M}\to\neg\gamma_{M}).

If $M$ halts on $w$ , by Lemma 6.3 we know that $\textsc{Mod}(\chi_{w})$ is closed under $<$ -substructures and thus equivalent to a universal sentence. The claimed algorithm (or, even the Completeness Theorem) will produce such a universal $\mu$ . Furthermore, by Corollary 6.2 we know that there is a finite model with $h(w)+1$ elements, which is a model of $\varphi_{0w}\wedge\neg\chi_{w}$ , hence it is a model of $\varphi_{0w}\wedge\neg\mu$ .

If $M$ does not halt on $w$ , then we show that $\textsc{Mod}\,_{\textup{fin}}(\chi_{w})=\textsc{Mod}\,_{\textup{fin}}(\varphi_{0w})$ . Clearly $\textsc{Mod}_{\textup{fin}}(\chi_{w})\subseteq\textsc{Mod}_{\textup{fin}}(\varphi_{0w})$ . Now let $\mathcal{A}$ be a finite model of $\varphi_{0w}$ . If $\mathcal{A}\not\models\varphi_{1M}$ , then $\mathcal{A}\models\chi_{w}$ . Otherwise $\mathcal{A}\models\varphi_{1M}$ , then $\mathcal{A}$ correctly represents the first $|A|-1$ steps of the computation of $M$ on $w$ by Lemma 6.1. Thus $\mathcal{A}$ is a model of $\neg\gamma_{M}$ as $M$ does not halt on $w$ . Therefore, $\mathcal{A}$ is a model of $\chi_{w}$ .

Now we can see whether $M$ does not halt on $w$ by checking whether the universal sentence produced by the claimed algorithm is finitely equivalent to the universal sentence $\varphi_{0w}$ . This can be checked effectively by Corollary 2.5 and Corollary 2.6. $\Box$

Corollary 7.5.

There is no algorithm that applied to any $\textup{FO}[\tau_{E}]$ -sentence $\varphi$ such that $\textsc{Graph}\,_{\textup{fin}}(\varphi)$ has a finite set of forbidden induced subgraphs computes such a set.

Proof : Equivalently we show that there is no algorithm that applied to any $\textup{FO}[\tau_{E}]$ -sentence $\varphi$ such that $\textsc{Graph}\,_{\textup{fin}}(\varphi)=\textsc{Graph}\,_{\textup{fin}}(\mu)$ for some universal sentence $\mu$ computes such a $\mu$ .

For graphs let $I\ (=I_{\tau_{M}})$ be a strongly existential interpretation of $\tau_{M}$ -structures in graphs according to Lemma 5.6. We know that for every finite $\tau_{M}$ -structure $\mathcal{A}$ there is a finite graph $G$ such that $G_{I}\cong\mathcal{A}$ .

For $w\in\{0,1\}^{*}$ we consider the sentence $\rho_{w}$ defined in (21) in the proof of Theorem 6.4,

\rho_{w}=\forall\bar{x}\neg\varphi_{\textup{uni}}(\bar{x})\vee(\varphi_{0w}\wedge(\varphi_{1M}\to\neg\gamma_{M}))^{I}=\forall\bar{x}\neg\varphi_{\textup{uni}}(\bar{x})\vee\chi_{w}^{I}.

We show that $\rho_{w}$ is equivalent to a universal sentence $\mu$ on finite graphs. Moreover, $M$ does not halt on input $w$ if and only if $\mu$ is finitely equivalent to the universal sentence $\forall x\neg\varphi_{\textup{uni}}(\bar{x})\vee\varphi_{0w}^{I}$

If $M$ halts on $w$ , then $\varphi_{0w}\wedge\varphi_{1M}\wedge\gamma_{M}$ has no infinite model but a finite model $\mathcal{A}$ . Hence, by Proposition 4.6 we know that $\textsc{Graph}(\rho_{w})$ is closed under induced subgraphs. Therefore, $\rho_{w}$ is equivalent to a universal sentence $\mu$ in Graph. Let $G$ be a finite graph with $G_{I}\cong\mathcal{A}$ . Then $G\models(\varphi_{0w}\wedge\varphi_{1M}\wedge\gamma_{M})^{I}$ and thus, $G\models\neg\rho_{w}$ . Hence $G$ is a finite graph which is a model of $\varphi^{I}_{0w}\wedge\neg\mu$ . This means that $\mu$ is not equivalent to $\forall x\neg\varphi_{\textup{uni}}(\bar{x})\vee\varphi_{0w}^{I}$ on all finite graphs, as $G$ is also a model of $\forall x\neg\varphi_{\textup{uni}}(\bar{x})\vee\varphi_{0w}^{I}$ .

If $M:w\to\infty$ , then we show that $\textsc{Graph}\,_{\textup{fin}}(\rho_{w})=\textsc{Graph}\,_{\textup{fin}}(\forall x\neg\varphi_{\textup{uni}}(\bar{x})\vee\varphi_{0w}^{I})$ . Clearly $\textsc{Graph}_{\textup{fin}}(\rho_{w})\subseteq\textsc{Graph}_{\textup{fin}}(\forall x\neg\varphi_{\textup{uni}}(\bar{x})\vee\varphi_{0w}^{I})$ . Now let the graph $G$ be a model of $\forall x\neg\varphi_{\textup{uni}}(\bar{x})\vee\varphi_{0w}^{I}$ . Further we can assume that $G\models\exists x\varphi_{\textup{uni}}(\bar{x})$ . In particular, $\mathcal{A}:=G_{I}$ is well defined. If $G\not\models\varphi^{I}_{1M}$ , then $G\models\chi^{I}_{w}$ and therefore, $G\models\rho_{w}$ . If $G\models\varphi^{I}_{1M}$ , then $\mathcal{A}\models\varphi_{0w}\wedge\varphi^{I}_{1M}$ . As $M:w\to\infty$ , by Lemma 6.1 the structure $\mathcal{A}$ correctly represents the first $|A|-1$ steps of the computation of $M$ on $w$ . Thus, $\mathcal{A}$ is a model of $\neg\gamma_{M}$ , again as $M$ does not halt on input $w$ . It follows that $G$ is a model of $\neg\gamma_{M}^{I}$ , and then $G\models\rho_{w}$ .

Now we can decide the halting problem for $M$ . Given a word $w$ , we use the claimed algorithm to get a universal sentence $\mu$ equivalent to $\rho_{w}$ in the class of graphs. Finally we check whether $\mu$ is finitely equivalent to $\forall x\neg\varphi_{\textup{uni}}(\bar{x})\vee\varphi_{0w}^{I}$ . This can be checked effectively again by Corollary 2.5 and Corollary 2.6. $\Box$

Observe that Corollary 7.5 is precisely Theorem 1.3 as stated in the Introduction. Finally we prove Theorem 1.2, which is equivalent to the following result.

Theorem 7.6.

There is no algorithm that applied to an $\textup{FO}[\tau_{E}]$ -sentence $\varphi$ such that $\textsc{Graph}_{\textup{fin}}(\varphi)$ is closed under induced subgraphs decides whether there is a finite set $\mathscr{F}$ of graphs such that

\textsc{Graph}_{\textup{fin}}(\varphi)=\textsc{Forb}_{\textup{fin}}(\mathscr{F}).

Proof : Again we prove the corresponding result for $\tau_{M}$ -sentences and $\tau_{M}$ -structures and leave it to the reader to translate it to graphs as in the previous proof. That is, we show:

There is no algorithm that applied to an $\textup{FO}[\tau_{M}]$ -sentence $\varphi$ such that $\textsc{Mod}_{\textup{fin}}(\varphi)$ is closed under induced substructures decides whether there is a finite set $F$ of finite $\tau_{M}$ -structures such that

$\textsc{Mod}_{\textup{fin}}(\varphi)=\textsc{Forb}_{\textup{fin}}(\mathscr{F}).$

For $w\in\{0,1\}^{*}$ let

\alpha_{w}:=\varphi_{0w}\wedge(\varphi_{1M}\to\gamma_{M}).

We show that $\textsc{Mod}_{\textup{fin}}(\alpha_{w})$ is closed under induced subgraphs and that

\displaystyle M:w\to\infty

\displaystyle\iff

\displaystyle\text{$\alpha_{w}$ is not finitely equivalent to a universal sentence}.

Assume first that $M:w\to\infty$ . Then $\varphi_{0w}\wedge\varphi_{1M}\wedge\gamma_{M}$ has no finite model by Lemma 6.1 and the definition (17) of $\gamma_{M}$ . Therefore, $\textsc{Mod}_{\textup{fin}}(\alpha_{w})=\textsc{Mod}_{\textup{fin}}(\varphi_{0w}\wedge\neg\varphi_{1M})$ . By Lemma 3.10 we know that $\textsc{Mod}_{\textup{fin}}(\varphi_{0w}\wedge\neg\varphi_{1M})$ is closed under induced substructures but not finitely equivalent to a universal sentence.

Now assume that $M$ on input $w$ halts in $h(w)$ steps. Then Corollary 6.2 guarantees that there is a unique model $\mathcal{A}_{w}$ of $\varphi_{0w}\wedge\varphi_{1M}\wedge\gamma_{M}$ with $|A_{w}|=h(w)+1$ . We present a finite set $\mathscr{F}$ of finite $\tau_{M}$ -structures such that

\textsc{Mod}_{\textup{fin}}(\alpha_{w})=\textsc{Forb}_{\textup{fin}}(\mathscr{F}).

(26)

As $\varphi_{0w}$ is universal, there is a finite set $\mathscr{F}_{0}$ of finite $\tau_{M}$ -structures such that

\textsc{Mod}_{\textup{fin}}(\varphi_{0w})=\textsc{Forb}_{\textup{fin}}(\mathscr{F}_{0}).

Moveover, we set

\mathscr{F}_{1}:=\big{\{}\mathcal{B}\in\textsc{Str}[\tau_{M}]\;\big{|}\;\mathcal{B}\models\varphi_{0w}\wedge\varphi_{1M}\text{\ and $B=[\ell]$ for some $\ell\leq h(w)$}\big{\}}

and

	$\displaystyle\mathscr{F}_{2}:=\big{\{}\mathcal{B}\in\textsc{Str}[\tau_{M}]\;\big{\|}\;\mathcal{B}\models\varphi_{0w}\wedge\varphi^{*}_{1M}\wedge\forall t\forall t^{\prime}(t<t^{\prime}\to\forall y$	$\displaystyle\neg H_{q_{h}}(y,t))$
		$\displaystyle\text{and $B=[h(w)+2]$}\big{\}}.$

Here $\varphi^{*}_{1M}$ is obtained from $\varphi_{1M}$ by replacing the conjunct $\varphi_{1}$ (see (7)) by

\varphi^{*}_{1}:=\exists xU_{\textup{min}}x\wedge\forall x\forall y(x<y\to\exists zSxz).

The difference is that $\varphi^{*}_{1}$ does not require the set $U_{\textup{max}}$ to be nonempty. Hence,

\varphi^{*}_{1M}=\varphi^{*}_{1}\wedge\forall x\forall t\big{(}C_{0}(x,t)\vee C^{\textup{comp}}_{0}(x,t)\big{)}\wedge\bigwedge_{q\in Q}\forall x\forall t\big{(}H_{q}(x,t)\vee H_{q}^{\textup{comp}}(x,t)\big{)}.

Note that Lemma 6.1 remains true if in its statement we replace $\varphi_{1M}$ by $\varphi^{*}_{1M}$ .

For $\mathscr{F}:=\mathscr{F}_{0}\cup\mathscr{F}_{1}\cup\mathscr{F}_{2}$ we show (26). Assume first that a finite structure $\mathcal{C}$ is a model of $\alpha_{w}$ . In particular, $\mathcal{C}\models\varphi_{0w}$ and therefore, $\mathcal{C}$ has no induced substructure isomorphic to a structure in $\mathscr{F}_{0}$ .

Now, for a contradiction suppose that $\mathcal{B}$ is an induced substructure of $\mathcal{C}$ isomorphic to a structure in $\mathscr{F}_{1}$ . Then $\mathcal{B}\models\varphi_{1M}$ and thus, by Lemma 3.8, $\mathcal{C}=\mathcal{B}$ . As $\mathcal{C}\models\alpha_{w}$ , we get $\mathcal{C}\models\varphi_{0w}\wedge\varphi_{1M}\wedge\gamma_{M}$ . Hence, $\mathcal{C}\cong\mathcal{A}_{w}$ , a contradiction, as on the one hand $|C|=|B|\leq h(w)$ and on the other hand $|C|=|A_{w}|=h(w)+1$ .

Next we show that $\mathcal{C}$ has no induced substructure $\mathcal{B}$ isomorphic to a structure in $\mathscr{F}_{2}$ . As $\mathcal{B}\models\varphi_{0w}\wedge\varphi^{*}_{1M}$ and has $h(w)+2$ elements, the first $h(w)+1$ elements of $\mathcal{B}$ correctly encode the first $h(w)$ steps of the computation of $M$ on $w$ , hence the full computation. As $|B|=h(w)+2$ , this contradicts $\mathcal{B}\models\forall t\forall t^{\prime}\big{(}t<t^{\prime}\to\forall y\neg H_{q_{h}}(y,t)\big{)}$ .

As the final step let $\mathcal{C}\in\textsc{Forb}_{\textup{fin}}(\mathscr{F})$ . We show that $\mathcal{C}\models\alpha_{w}$ . As $\mathcal{C}$ omits the structures in $\mathscr{F}_{0}$ as induced substructures, we see that $\mathcal{C}\models\varphi_{0w}$ . If $\mathcal{C}\not\models\varphi_{1M}$ , we are done.

Recall that by Lemma 6.1 (more precisely, by the extension of Lemma 6.1 mentioned above) for finite structures $\mathcal{B}$ of $\varphi_{0w}\wedge\varphi^{*}_{1M}$ we know:

(a)

if $|B|\leq h(w)+1$ , then $\mathcal{B}$ encodes $|B|-1$ steps of the computation of $M$ on $w$ ,
(b)

if $|B|>h(w)+1$ , then the first $h(w)+1$ elements in the ordering $<^{\mathcal{B}}$ correctly encode the (full) computation of $M$ on $w$ .

Now assume that $\mathcal{C}\models\varphi_{1M}$ , then (a) and (b) apply to $\mathcal{C}$ . As no structure in $\mathscr{F}_{1}$ is isomorphic to an induced substructure of $\mathcal{C}$ , we see that $|C|\geq h(w)+1$ . But $\mathcal{C}$ cannot have more than $h(w)+1$ elements, as otherwise the substructure of $\mathcal{C}$ induced on the first $h(w)+2$ elements would be isomorphic to a structure $\mathcal{B}$ in $F_{2}$ , a contradiction. $\Box$

Remark 7.7.

Mainly using Remark 6.7 one easily verifies that in all results but Proposition 7.3 of this section we can replace

There is no algorithm that applied to an

\textup{FO}[\tau_{E}]

-sentence

\varphi

…

There is no algorithm that applied to a

\Sigma_{2}

-sentence

\varphi

…

In Proposition 7.3 we have to replace it by

There is no algorithm that applied to a

\Pi_{2}

-sentence

\varphi

…

as $\varphi_{1M}$ (and $\varphi^{I}_{1M}$ ) are $\Pi_{2}$ -sentences.

References

[1] A. Atserias, A. Dawar, and M. Grohe. Preservation under extensions on well-behaved finite structures. SIAM Journal on Computing, 38:1364–1381, 2008.
[2] Y. Chen and J. Flum. FO-definability of shrub-depth. In 28th EACSL Annual Conference on Computer Science Logic, CSL 2020, January 13-16, 2020, Barcelona, Spain, pages 15:1–15:16, 2020.
[3] A. Dawar, M. Grohe, S. Kreutzer, and N. Schweikardt. Model theory makes formulas large. In Automata, Languages and Programming, 34th International Colloquium, ICALP 2007, Wroclaw, Poland, July 9-13, 2007, Proceedings, pages 913–924, 2007.
[4] A. Dawar and A. Sankaran. Extension preservation in the finite and prefix classes of first order logic. CoRR, abs/2007.05459, 2020.
[5] G. Ding. Subgraphs and well-quasi-ordering. Journal of Graph Theory, 16(5):489–502, 1992.
[6] D. Duris. Extension preservation theorems on classes of acyclic finite structures. SIAM Journal on Computing, 39(8):3670–3681, 2010.
[7] Z. Dvorák, A. C. Giannopoulou, and D. M. Thilikos. Forbidden graphs for tree-depth. European Journal of Combinatorics, 33(5):969–979, 2012.
[8] H.-D. Ebbinghaus and J. Flum. Finite Model Theory. Perspectives in Mathematical Logic. Springer, 1999.
[9] M. R. Fellows. Private communication. 2019.
[10] M. R. Fellows and M. A. Langston. On search, decision, and the efficiency of polynomial-time algorithms. Journal of Computer and System Sciences, 49(3):769–779, 1994.
[11] J. Gajarský and S. Kreutzer. Computing shrub-depth decompositions. In 37th International Symposium on Theoretical Aspects of Computer Science, STACS 2020, March 10-13, 2020, Montpellier, France, pages 56:1–56:17, 2020.
[12] R. Ganian, P. Hlinený, J. Nesetril, J. Obdrzálek, and P. Ossona de Mendez. Shrub-depth: Capturing height of dense graphs. Logical Methods in Computer Science, 15(1), 2019.
[13] R. Ganian, P. Hlinený, J. Nesetril, J. Obdrzálek, P. Ossona de Mendez, and R. Ramadurai. When trees grow low: Shrubs and fast $\textup{MSO}_{1}$ . In Mathematical Foundations of Computer Science 2012 - 37th International Symposium, MFCS 2012, Bratislava, Slovakia, August 27-31, 2012. Proceedings, pages 419–430, 2012.
[14] Y. Gurevich. Toward logic tailored for computational complexity. Lecture Notes in Mathematics, 1104:175–216, 1984.
[15] J. Łoś. On the extending of models I. Fundamenta Mathematicae, 42:38–54, 1955.
[16] B. Rossman. Łoś-Tarski Theorem has non-recursive blow-up. Unpublished manuscript, pages 1–2, 2012.
[17] A. Sankaran, B. Adsul, and S. Chakraborty. A generalization of the Łoś-Tarski preservation theorem. Annals of Pure and Applied Logic, 167(3):189–210, 2016.
[18] W. W. Tait. A counterexample to a conjecture of Scott and Suppes. The Journal of Symbolic Logic, 24(1):15–16, 1959.
[19] A. Tarski. Contributions to the theory of models I, II. Indagationes Mathematicae, 16:589–588, 1954.
[20] R. Vaught. Remarks on universal classes of relational systems. Indagationes Mathematicae, 16:572–591, 1954.
[21] T. Zaslavsky. Forbidden induced subgraphs. Electronic Notes in Discrete Mathematics, 63:3–10, 2017.