Simplest Non-Regular Deterministic Context-Free Language

Petr Jančar, Dept of Computer Science, Faculty of Science, Palacký University Olomouc, Czechia, petr.jancar@upol.cz

Jiří Šíma, Institute of Computer Science of the Czech Academy of Sciences, Prague, Czechia, sima@cs.cas.cz

Abstract

We introduce a new notion of $\mathcal{C}$ -simple problems for a class $\mathcal{C}$ of decision problems (i.e. languages), w.r.t. a particular reduction. A problem is $\mathcal{C}$ -simple if it can be reduced to each problem in $\mathcal{C}$ . This can be viewed as a conceptual counterpart to $\mathcal{C}$ -hard problems to which all problems in $\mathcal{C}$ reduce. Our concrete example is the class of non-regular deterministic context-free languages (DCFL^′), with a truth-table reduction by Mealy machines (which proves to be a preorder). The main technical result is a proof that the DCFL^′ language $L_{\#}=\{0^{n}1^{n}\mid n\geq 1\}$ is DCFL^′-simple, which can thus be viewed as the simplest problem in the class DCFL^′.

This result has already provided an application, to the computational model of neural networks 1ANN at the first level of analog neuron hierarchy. This model was proven not to recognize $L_{\#}$ , by using a specialized technical argument that can hardly be generalized to other languages in DCFL^′. By the result that $L_{\#}$ is DCFL^′-simple, w.r.t. the reduction that can be implemented by 1ANN, we immediately obtain that 1ANN cannot accept any language in DCFL^′.

It thus seems worthwhile to explore if looking for $\mathcal{C}$ -simple problems in other classes $\mathcal{C}$ under suitable reductions could provide effective tools for expanding the lower-bound results known for single problems to the whole classes of problems.

Keywords: deterministic context-free language, truth-table reduction, Mealy automaton, pushdown automaton

1 Introduction

We introduce a new notion of $\mathcal{C}$ -simple problems for a class $\mathcal{C}$ of decision problems (i.e. languages). A problem is $\mathcal{C}$ -simple if it can be reduced to each problem in $\mathcal{C}$ ; if this problem is, moreover, in $\mathcal{C}$ , it can be viewed as a simplest problem in $\mathcal{C}$ . The $\mathcal{C}$ -simple problems are thus a conceptual counterpart to the common $\mathcal{C}$ -hard problems (like, e.g., NP-hard problems) to which conversely any problem in $\mathcal{C}$ reduces. These definitions (of $\mathcal{C}$ -simple and $\mathcal{C}$ -hard problems) are parametrized by a chosen reduction that does not have a higher computational complexity than the class $\mathcal{C}$ itself. Therefore, it may be said that if a $\mathcal{C}$ -hard problem has a (computationally) “easy” solution, then each problem in $\mathcal{C}$ has an “easy” solution. On the other hand, if we prove that a $\mathcal{C}$ -simple problem is not “easy”, in particular that it cannot be solved by machines of a type $\mathcal{M}$ that can implement the respective reduction, then all problems in $\mathcal{C}$ are not “easy”, that is, are not solvable by $\mathcal{M}$ ; this extends a lower-bound result for one problem to the whole class of problems.

In this paper, we consider $\mathcal{C}$ to be the class of non-regular deterministic context-free languages, which we denote by DCFL^′; we thus have DCFL^′ = DCFL $\smallsetminus$ REG (where REG denotes the class of regular languages). We use a truth-table reduction by Mealy machines (which is motivated below). Hence a DCFL^′-simple problem is a language $L_{0}\subseteq\Sigma^{*}$ (over an alphabet $\Sigma$ ) that can be reduced to each DCFL^′ language $L\subseteq\Delta^{*}$ by a Mealy machine $\mathcal{A}$ with an oracle $L$ , denoted $\mathcal{A}^{L}$ . More precisely, the finite-state transducer $\mathcal{A}$ transforms a given input word $w\in\Sigma^{*}$ to a prefix $\mathcal{A}(w)\in\Delta^{*}$ of queries for the oracle $L$ . In addition, each state $q$ of $\mathcal{A}^{L}$ is associated with a finite tuple $\sigma_{q}=(s_{q1},\ldots,s_{qr_{q}})$ of $r_{q}$ query suffixes from $\Delta^{*}$ , and with a truth table $f_{q}:\{0,1\}^{r_{q}}\rightarrow\{0,1\}$ . After $\mathcal{A}^{L}$ reads an input word $w$ (translating it to $\mathcal{A}(w)$ ), by which it enters a state $q$ , for each $i\in\{1,2,\dots,r_{q}\}$ it queries whether or not the string $\mathcal{A}(w)\cdot s_{qi}$ is in $L$ (or, equivalently, whether or not $\mathcal{A}(w)$ belongs to the quotient $L/s_{qi}=\{v\in\Delta^{*}\mid v\cdot s_{qi}\in L\}$ ), and aggregates the answers by the truth table $f_{q}$ for deciding if $w$ is accepted.

This truth-table reduction by Mealy machines proves to be a preorder, denoted as $\leq_{tt}^{\textsc{A}}$ . The main technical result of this paper is that the DCFL^′ language $L_{\#}=\{0^{n}1^{n}\mid n\geq 1\}$ (over the binary alphabet $\{0,1\}$ ) is DCFL^′-simple, since $L_{\#}\leq_{tt}^{\textsc{A}}L$ for each language $L$ in DCFL^′. The class DCFLS of DCFL^′-simple languages comprises REG and is a strict subclass of DCFL; e.g., the DCFL^′ language $L_{R}=\left\{wcw^{R}\mid w\in\{a,b\}^{*}\right\}$ over the alphabet $\{a,b,c\}$ proves to be not DCFL^′-simple. The closure properties of DCFLS are similar to those of DCFL: the class DCFLS is closed under complement and intersection with regular languages, but it is not closed under concatenation, intersection, and union.

The above definition of DCFL^′-simple problems was originally motivated by the analysis of the computational power of neural network (NN) models which is known to depend on the (descriptive) complexity of their weight parameters [8, 11]. The so-called analog neuron hierarchy [9] of binary-state NNs with an increasing number of $\alpha$ extra analog-state neurons, denoted as $\alpha$ ANN for $\alpha\geq 0$ , has been introduced for studying NNs with realistic weights between integers (finite automata) and rational numbers (Turing machines). We use the notation $\alpha$ ANN also for the class of languages accepted by $\alpha$ ANNs, which can clearly be distinguished by the context. The separation 1ANN $\subsetneq$ 2ANN has been witnessed by the DCFL^′ language $L_{\#}\in$ 2ANN $\setminus$ 1ANN. The proof of $L_{\#}\notin$ 1ANN is rather technical (based on the Bolzano-Weierstrass theorem) and can hardly be generalized to other DCFL^′ languages, while it was conjectured that $L\notin$ 1ANN for all DCFL^′ languages $L$ , that is, $\mbox{DCFL${}^{\prime}$}\subseteq(\mbox{2ANN}\,\setminus\,\mbox{1ANN})$ (implying $\mbox{1ANN}\,\cap\,\mbox{DCFL}=\mbox{0ANN}=\mbox{REG}$ ). One way to prove this conjecture is to show that $L_{\#}$ is in some sense the simplest problem in the class DCFL^′, namely, to reduce $L_{\#}$ to any DCFL^′ language $L$ by using a reduction that can be carried out by 1ANNs, which are at least as powerful as finite automata. This would imply that $L$ cannot be accepted by any 1ANN since it is at least as hard as $L_{\#}$ , which has been proven not to be recognized by 1ANNs.

The idea why $L_{\#}$ should serve as the simplest language in the class DCFL^′ comes from the fact that any reduced context-free grammar $G$ generating a non-regular language $L\subseteq\Delta^{*}$ is self-embedding [3, Theorem 4.10]. This means that there is a so-called self-embedding nonterminal $A$ admitting the derivation $A\Rightarrow^{*}xAy$ for some non-empty strings $x,y\in\Delta^{+}$ . Since $G$ is reduced, there are strings $v,w,z\in\Delta^{*}$ such that $S\Rightarrow^{*}vAz$ and $A\Rightarrow^{*}w$ where $S$ is the start nonterminal in $G$ , which implies $S\Rightarrow^{*}vx^{m}wy^{m}z\in L$ for every $m\geq 0$ . It is thus straightforward to suggest to reduce an input word $0^{m}1^{n}\in\{0,1\}^{*}$ where $m,n\geq 1$ , to the string $vx^{m}wy^{n}z\in\Delta^{*}$ (while the inputs outside $0^{+}1^{+}$ are mapped onto some fixed string outside $L$ ) since $0^{m}1^{n}\in L_{\#}$ entails $vx^{m}wy^{n}z\in L$ .

However, the suggested (one-one) reduction from $L_{\#}$ to $L$ is not consistent because $vx^{m}wy^{n}z\in L$ does not necessarily imply $0^{m}1^{n}\in L_{\#}$ . For example, consider the DCFL^′ language $L_{1}=\{0^{m}1^{n}\mid 1\leq m\leq n\}$ over the binary alphabet $\Delta=\{0,1\}$ for which there are no words $v,x,w,y,z\in\Delta^{*}$ such that $vx^{m}wy^{n}z\in L_{1}$ would ensure $m=n$ . Nevertheless, we can pick two inputs $0^{m}1^{n-1}$ and $0^{m}1^{n}$ instead of one, that is, $x=0$ , $y=1$ , and $v=w=z=\varepsilon$ ( $\varepsilon$ denoting the empty string), which satisfy $0^{m}1^{n}\in L_{\#}$ iff $m=n$ iff $vx^{m}wy^{n-1}z\notin L_{1}$ and $vx^{m}wy^{n}z\in L_{1}$ . It turns out that this can be generalized to any DCFL^′ language. Namely, we prove in this paper that for any DCFL^′ language $L\subseteq\Delta^{*}$ over any alphabet $\Delta$ , there are non-empty words $v,x,w,y,z\in\Delta^{+}$ and a language $L^{\prime}\in\{L,\overline{L}\}$ , where $\overline{L}=\Delta^{*}\smallsetminus L$ is the complement of $L$ , such that $0^{m}1^{n}\in L_{\#}$ iff $vx^{m}wy^{n-1}z\notin L^{\prime}$ and $vx^{m}wy^{n}z\in L^{\prime}$ .
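The two-query idea for $L_{1}$ can be checked mechanically on small exponents. The following Python sketch is only an illustration under our own naming (the helpers `in_L_hash`, `in_L1` and `two_query_test` are hypothetical, not part of the paper); it confirms that a single query does not yield a reduction, whereas the pair of queries characterizes $m=n$ on the tested range.

```python
import re

def in_L_hash(u: str) -> bool:
    """Membership in L_# = {0^n 1^n | n >= 1}."""
    m = re.fullmatch(r"(0+)(1+)", u)
    return bool(m) and len(m.group(1)) == len(m.group(2))

def in_L1(u: str) -> bool:
    """Membership in L_1 = {0^m 1^n | 1 <= m <= n}."""
    m = re.fullmatch(r"(0+)(1+)", u)
    return bool(m) and len(m.group(1)) <= len(m.group(2))

def two_query_test(m: int, n: int) -> bool:
    """Query 0^m 1^(n-1) and 0^m 1^n; accept iff the answers are (no, yes)."""
    return (not in_L1("0" * m + "1" * (n - 1))) and in_L1("0" * m + "1" * n)

# A single query is not a reduction: 011 is in L_1 but not in L_#.
single_query_fails = in_L1("011") and not in_L_hash("011")

# The two queries together decide L_# on all tested inputs 0^m 1^n with m, n >= 1.
two_queries_agree = all(
    two_query_test(m, n) == in_L_hash("0" * m + "1" * n)
    for m in range(1, 12) for n in range(1, 12)
)
```

The check covers only finitely many exponents, so it illustrates the equivalence rather than proving it.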

Therefore, the simple many-one (in fact, one-one) reduction from $L_{\#}$ with one query to the oracle $L$ is replaced by a truth-table reduction, that is, by a special Turing reduction in which all its finitely many (in our case two) oracle queries are presented at the same time and there is a Boolean function (a truth table) which, when given the answers to the queries, produces the final answer of the reduction. This truth-table reduction from $L_{\#}$ to $L$ can be implemented by a deterministic finite-state transducer (a Mealy machine) $\mathcal{A}$ with the oracle $L$ : It transforms the input $0^{m}1^{n}$ where $m,n\geq 1$ (the inputs outside $0^{+}1^{+}$ are rejected), to the output $vx^{m}wy^{n-1}\in\Delta^{+}$ and carries out two queries to $L$ that arise by concatenation of this output with two fixed suffixes $z$ and $yz$ ; hence the queries are $vx^{m}wy^{n-1}z\stackrel{{\scriptstyle?}}{{\in}}L$ and $vx^{m}wy^{n}z\stackrel{{\scriptstyle?}}{{\in}}L$ . The truth table is defined so that the input $0^{m}1^{n}$ is accepted by $\mathcal{A}^{L}$ iff the two answers to these queries are distinct and, at the same time, the first answer is negative in the case $L^{\prime}=L$ , and positive in the case $L^{\prime}=\overline{L}$ , which is equivalent to $0^{m}1^{n}\in L_{\#}$ .

It follows that the DCFL^′ language $L_{\#}$ is DCFL^′-simple under the truth-table reduction by Mealy machines. Since this reduction can be implemented by 1ANNs, we achieve the desired stronger separation $\mbox{DCFL${}^{\prime}$}\subseteq(\mbox{2ANN}\,\setminus\,\mbox{1ANN})$ in the analog neuron hierarchy [10]. This result constitutes a non-trivial application of the proposed concept of DCFL^′-simple problem. Moreover, if we could generalize the result to (nondeterministic) context-free languages (CFL), e.g. by proving that some DCFL^′ language is CFL^′-simple (where CFL^′ $=$ CFL $\smallsetminus$ REG), which would imply that $L_{\#}$ is CFL^′-simple by the transitivity of reduction, then we would achieve even stronger separation $\mbox{CFL${}^{\prime}$}\subseteq(\mbox{2ANN}\,\setminus\,\mbox{1ANN})$ . We note the interesting fact that $L_{\#}$ cannot be CSL^′-simple (under our reduction), since 1ANN accepts some context-sensitive languages outside CFL [9].

In general, if we show that some $\mathcal{C}$ -simple problem under a given reduction cannot be computed by a computational model $\mathcal{M}$ that implements this reduction, then all problems in the class $\mathcal{C}$ are not solvable by $\mathcal{M}$ either. The notion of $\mathcal{C}$ -simple problems can thus be useful for expanding known (e.g. technical) lower-bound results for individual problems to the whole classes of problems at once, as it was the case of the DCFL^′-simple problem $L_{\#}\notin\,\mbox{1ANN}$ , expanding to $\mbox{DCFL${}^{\prime}$}\cap\,\mbox{1ANN}\,=\emptyset$ . It seems worthwhile to explore if looking for $\mathcal{C}$ -simple problems in other complexity classes $\mathcal{C}$ could provide effective tools for strengthening known lower bounds.

We remark that the hardest context-free language by Greibach [2] can be viewed as CFL-hard under a special type of our reduction $\leq_{tt}^{\textsc{A}}$ . A related line of study concerns the types of reductions used in finite or pushdown automata with an oracle. For example, nondeterministic finite automata with an oracle complying with the many-one restriction have been applied to establishing oracle hierarchies over the context-free languages [7]. For the same purpose, oracle pushdown automata have been used for many-one, truth-table, and Turing reducibilities, respectively, inducing the underlying definitions also to oracle nondeterministic finite automata [13]. In addition, nondeterministic finite automata whose oracle queries are completed by the prefix of an input word that has been read so far and the remaining suffix, have been employed in defining a polynomial-size oracle hierarchy [1].

The preliminary study [12] already contained some considerations about the simplest DCFL^′ language, yet without a formal definition of DCFL^′-simple problems; it included only sketches of incomplete proofs of weaker results, based on the representation of DCFL by so-called deterministic monotonic restarting automata [5], which have initiated investigations of non-regularity degrees in DCFL [6].

In this paper we achieve a complete argument for $L_{\#}$ to be a DCFL^′-simple problem, within the framework of deterministic pushdown automata (DPDA) by using some ideas on regularity of pushdown processes from [4]. We now give an informal overview of the proof. Given a DPDA $\mathcal{M}$ recognizing a non-regular language $L\subseteq\Delta^{*}$ , it is easy to realize that some computations of $\mathcal{M}$ (from the initial configuration) must be reaching configurations where the stack is arbitrarily large while it can be (almost) erased afterwards. Hence the existence of words $v,x,w,y,z\in\Delta^{+}$ such that $vx^{m}wy^{m}z\in L$ for all $m\geq 0$ is obvious. However, we aim to guarantee that for all $m,n$ the equality $m=n$ holds if, and only if, $vx^{m}wy^{n-1}z\notin L^{\prime}$ and $vx^{m}wy^{n}z\in L^{\prime}$ , where $L^{\prime}$ is either the language $L$ or its complement. This is not so straightforward but it is confirmed by our detailed analysis (in section 3). We study the computation of $\mathcal{M}$ on an infinite word $a_{1}a_{2}a_{3}\cdots$ that visits infinitely many pairwise non-equivalent configurations. We use a natural congruence property of language equivalence on the set of configurations, and avoid some tedious technical details by a particular use of Ramsey’s theorem. This allows us to extract the required tuple $v,x,w,y,z\in\Delta^{+}$ from the mentioned infinite computation. We note that determinism of $\mathcal{M}$ is essential in the presented proof; we leave open if it can be relaxed to show that $L_{\#}$ is even CFL^′-simple.

The rest of the paper is organized as follows. In section 2 we recall basic definitions and notation regarding DPDA and Mealy machines, introduce the novel concept of DCFL^′-simple problems under truth-table reduction by Mealy machines and show some simple properties of the class DCFLS of DCFL^′-simple problems. In section 3 we present the proof of the main technical result which shows that $L_{\#}$ is DCFL^′-simple. Finally, we summarize the results and list some open problems in section 4.

2 DCFL^′-Simple Problem Under Truth-Table Mealy Reduction

In this section we define the truth-table reduction by Mealy machines, introduce the notion of DCFL^′-simple problems, show their basic properties, and formulate the main technical result (theorem 1). But first we recall standard definitions of pushdown automata.

A pushdown automaton (PDA) is a tuple $\mathcal{M}=(Q,\Sigma,\Gamma,R,q_{0},X_{0},F)$ where $Q$ is a finite set of states including the start state $q_{0}\in Q$ and the set $F\subseteq Q$ of accepting states, while the finite sets $\Sigma\not=\emptyset$ and $\Gamma\not=\emptyset$ represent the input and stack alphabets, respectively, with the initial stack symbol $X_{0}\in\Gamma$ . In addition, the set $R$ contains finitely many transition rules $pX\xrightarrow{a}q\gamma$ with the meaning that $\mathcal{M}$ in state $p\in Q$ , on the input $a\in\Sigma_{\varepsilon}=\Sigma\cup\{\varepsilon\}$ (recall $\varepsilon$ denotes the empty string), and with $X\in\Gamma$ as the topmost stack symbol may read $a$ , change the state to $q\in Q$ , and pop $X$ , replacing it by pushing $\gamma\in\Gamma^{*}$ .

By a configuration of $\mathcal{M}$ we mean $p\alpha\in Q\times\Gamma^{*}$ , and we define relations $\xrightarrow{a}$ for $a\in\Sigma_{\varepsilon}$ on $Q\times\Gamma^{*}$ : each rule $pX\xrightarrow{a}q\gamma$ in $R$ induces $pX\alpha\xrightarrow{a}q\gamma\alpha$ for all $\alpha\in\Gamma^{*}$ ; these relations are naturally extended to $\xrightarrow{w}$ for $w\in\Sigma^{*}$ . For a configuration $p\alpha$ we define $\mathcal{L}(p\alpha)=\{w\in\Sigma^{*}\mid p\alpha\xrightarrow{w}q\beta\mbox{ for some }q\in F\mbox{ and }\beta\in\Gamma^{*}\}$ , and $\mathcal{L}(\mathcal{M})=\mathcal{L}(q_{0}X_{0})$ is the language accepted by $\mathcal{M}$ . A PDA $\mathcal{M}$ is deterministic (a DPDA) if there is at most one rule $pX\xrightarrow{a}..$ for each tuple $p\in Q$ , $X\in\Gamma$ , $a\in\Sigma_{\varepsilon}$ ; moreover, if there is a rule $pX\xrightarrow{\varepsilon}..$ , then there is no rule $pX\xrightarrow{a}..$ for $a\in\Sigma$ . We also use the standard assumption that all $\varepsilon$ -steps are popping, that is, in each rule $pX\xrightarrow{\varepsilon}q\gamma$ in $R$ we have $\gamma=\varepsilon$ .
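To make these definitions concrete, here is a minimal Python sketch of a DPDA simulator together with one possible DPDA for $\{0^{n}1^{n}\mid n\geq 1\}$. The rule set `RULES`, the state names `p0`, `p1`, `pf`, and the stack symbols `Z`, `A` are our own illustrative assumptions, not taken from the paper. The stack is a string with its top at index 0, matching the notation $pX\alpha$, and the empty string stands for $\varepsilon$.

```python
def dpda_accepts(rules, start, bottom, final, word):
    """Simulate a DPDA; rules map (state, top symbol, input symbol or "") to
    (new state, string pushed in place of the popped top symbol)."""
    state, stack = start, bottom

    def step(symbol):
        """Apply the rule for (state, top, symbol) if there is one; report success."""
        nonlocal state, stack
        key = (state, stack[:1], symbol)
        if not stack or key not in rules:
            return False
        state, pushed = rules[key]
        stack = pushed + stack[1:]
        return True

    for a in word:
        # Popping epsilon-steps may be needed before the symbol a can be read.
        while not step(a):
            if not step(""):
                return False      # stuck: neither an a-rule nor an epsilon-rule
    # The word is accepted if an accepting state is reached right after reading
    # it, or after some further (popping) epsilon-steps.
    while state not in final:
        if not step(""):
            return False
    return True

# Hypothetical DPDA for L_#: push one A per 0, pop one A per 1, and use a
# popping epsilon-step on the bottom symbol Z to enter the accepting state.
RULES = {
    ("p0", "Z", "0"): ("p0", "AZ"),
    ("p0", "A", "0"): ("p0", "AA"),
    ("p0", "A", "1"): ("p1", ""),
    ("p1", "A", "1"): ("p1", ""),
    ("p1", "Z", ""): ("pf", ""),
}

def in_L_hash(word):
    return dpda_accepts(RULES, "p0", "Z", {"pf"}, word)
```

The simulator relies on the determinism conditions above (a pair $pX$ with an $\varepsilon$-rule has no other rule) and on the assumption that $\varepsilon$-steps are popping, which guarantees that both loops terminate.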

The languages accepted by (deterministic) pushdown automata constitute the class of (deterministic) context-free languages; the classes are denoted by DCFL and CFL, respectively, whereas DCFL^′ $=$ DCFL $\smallsetminus$ REG.

In the following theorem we formulate the main technical result: any language in DCFL^′ includes a certain “projection” of the language $L_{\#}=\{0^{n}1^{n}\mid n\geq 1\}$ , which means that $L_{\#}$ is in some sense the simplest language in the class DCFL^′. The theorem, whose proof will be presented in section 3, thus provides an interesting property of DCFL^′.

Theorem 1.

Let $L\subseteq\Delta^{*}$ be a non-regular deterministic context-free language over an alphabet $\Delta$ . There exist non-empty words $v,x,w,y,z\in\Delta^{+}$ and a language $L^{\prime}\in\{L,\overline{L}\}$ (where $\overline{L}=\Delta^{*}\smallsetminus L$ is the complement of $L$ ) such that for all $m\geq 0$ and $n>0$ we have

$$\left(vx^{m}wy^{n-1}z\notin L^{\prime}\mbox{ and }\,vx^{m}wy^{n}z\in L^{\prime}\right)\quad\mbox{iff}\quad m=n\,.\tag{1}$$

In order to formalize the DCFL^′-simple problems, we now define a Mealy machine $\mathcal{A}$ with an oracle: it is a tuple $\mathcal{A}=(Q,\Sigma,\Delta,\delta,\lambda,q_{0},\{(\sigma_{q},f_{q})\mid q\in Q\})$ where $Q$ is a finite set of states including the start state $q_{0}\in Q$ , and the finite sets $\Sigma\not=\emptyset$ and $\Delta\not=\emptyset$ represent the input and output (oracle) alphabets, respectively. Moreover, $\delta:Q\times\Sigma\rightarrow Q$ is a (partial) state-transition function which extends to input strings as $\delta:Q\times\Sigma^{*}\rightarrow Q$ where $\delta(q,\varepsilon)=q$ for every $q\in Q$ , while $\delta(q,wa)=\delta(\delta(q,w),a)$ for all $q\in Q$ , $w\in\Sigma^{*}$ , $a\in\Sigma$ . Similarly, $\lambda:Q\times\Sigma\rightarrow\Delta^{*}$ is an output function which extends to input strings as $\lambda:Q\times\Sigma^{*}\rightarrow\Delta^{*}$ where $\lambda(q,\varepsilon)=\varepsilon$ for all $q\in Q$ , and $\lambda(q,wa)=\lambda(q,w)\cdot\lambda(\delta(q,w),a)$ for all $q\in Q$ , $w\in\Sigma^{*}$ , $a\in\Sigma$ . In addition, for each $q\in Q$ , the tuple $\sigma_{q}=(s_{q1},\ldots,s_{qr_{q}})$ of strings in $\Delta^{*}$ contains $r_{q}$ query suffixes, while $f_{q}:\{0,1\}^{r_{q}}\rightarrow\{0,1\}$ is a truth table that aggregates the answers to the $r_{q}$ oracle queries.

The above Mealy machine $\mathcal{A}$ starts in the start state $q_{0}$ and operates as a deterministic finite-state transducer that transforms an input word $w\in\Sigma^{*}$ to the output string $\mathcal{A}(w)=\lambda(q_{0},w)\in\Delta^{*}$ written to a so-called oracle tape. The oracle tape is a semi-infinite, write-only tape which is empty at the beginning and its contents are only extended in the course of computation by appending the strings to the right. Namely, given a current state $q\in Q$ and an input symbol $a\in\Sigma$ , the machine $\mathcal{A}$ moves to the next state $\delta(q,a)\in Q$ and writes the string $\lambda(q,a)\in\Delta^{*}$ to the oracle tape, if $\delta(q,a)$ is defined; otherwise $\mathcal{A}$ rejects the input. After reading the whole input word $w\in\Sigma^{*}$ , the machine $\mathcal{A}$ is in the state $p=\delta(q_{0},w)\in Q$ , while the oracle tape contains the output $\mathcal{A}(w)=\lambda(q_{0},w)\in\Delta^{*}$ .

Finally, the Mealy machine $\mathcal{A}$ , equipped with an oracle $L\subseteq\Delta^{*}$ , in this case denoted $\mathcal{A}^{L}$ , queries the oracle whether $\mathcal{A}(w)$ belongs to the (right) quotient $L/s_{pi}=\{u\in\Delta^{*}\mid u\cdot s_{pi}\in L\}$ , for each suffix $s_{pi}$ in $\sigma_{p}$ , and the answers are aggregated by the truth table $f_{p}$ . Thus, the oracle Mealy machine $\mathcal{A}^{L}$ accepts the input word $w\in\Sigma^{*}$ iff

$$f_{p}\left(\chi_{L/s_{p1}}(\mathcal{A}(w)),\chi_{L/s_{p2}}(\mathcal{A}(w)),\ldots,\chi_{L/s_{pr_{p}}}(\mathcal{A}(w))\right)=1$$

where $p=\delta(q_{0},w)$ and $\chi_{L/s_{pi}}:\Delta^{*}\rightarrow\{0,1\}$ is the characteristic function of $L/s_{pi}$ , that is, $\chi_{L/s_{pi}}(u)=1$ if $u\cdot s_{pi}\in L$ , and $\chi_{L/s_{pi}}(u)=0$ if $u\cdot s_{pi}\notin L$ . The language accepted by the machine $\mathcal{A}^{L}$ is defined as $\mathcal{L}(\mathcal{A}^{L})=\{w\in\Sigma^{*}\mid w\mbox{ is accepted by }\mathcal{A}^{L}\}$ .¹

¹ Note that the described protocol works also for non-prefix-free languages since for any input prefix that has been read so far, the output value from the truth table determines whether the oracle Mealy machine is in an “accepting” state, deciding about this prefix analogously as a deterministic finite automaton. The truth-table reduction only requires that the given oracle answers do not influence further computation when subsequent input symbols are read.
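The acceptance condition can be sketched in a few lines of Python. The class `OracleMealy`, its field names, and the example machine `RENAME` are hypothetical and serve only as an illustration; the oracle is modelled as a Boolean function on strings over $\Delta$.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple

@dataclass
class OracleMealy:
    start: str
    delta: Dict[Tuple[str, str], str]        # partial transition function
    lam: Dict[Tuple[str, str], str]          # output function, values in Delta^*
    suffixes: Dict[str, Tuple[str, ...]]     # sigma_q: query suffixes of state q
    table: Dict[str, Callable[[Tuple[bool, ...]], bool]]  # truth table f_q

    def accepts(self, oracle: Callable[[str], bool], w: str) -> bool:
        q, tape = self.start, ""
        for a in w:
            if (q, a) not in self.delta:
                return False                 # undefined transition: reject
            tape += self.lam[(q, a)]         # append lambda(q, a) to the oracle tape
            q = self.delta[(q, a)]
        # One query per suffix of the final state; the answers do not
        # influence the transducer, only the final truth-table value.
        answers = tuple(oracle(tape + s) for s in self.suffixes[q])
        return self.table[q](answers)

def in_L_hash(u: str) -> bool:
    """Oracle for L_# = {0^n 1^n | n >= 1}."""
    n = len(u) // 2
    return n >= 1 and u == "0" * n + "1" * n

# Illustrative one-state machine: it renames a -> 0 and b -> 1 and asks a single
# query with the empty suffix, so it reduces {a^n b^n | n >= 1} to L_#.
RENAME = OracleMealy(
    start="r",
    delta={("r", "a"): "r", ("r", "b"): "r"},
    lam={("r", "a"): "0", ("r", "b"): "1"},
    suffixes={"r": ("",)},
    table={"r": lambda answers: answers[0]},
)
```

For instance, `RENAME.accepts(in_L_hash, "aabb")` evaluates the single query `0011` against the oracle; replacing the truth table by its negation would accept the complement language instead.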

We say that $L_{1}\subseteq\Sigma^{*}$ is truth-table reducible to $L_{2}\subseteq\Delta^{*}$ by a Mealy machine, which is denoted as $L_{1}\leq_{tt}^{\textsc{A}}L_{2}$ , if $L_{1}=\mathcal{L}(\mathcal{A}^{L_{2}})$ for some Mealy machine $\mathcal{A}$ running with the oracle $L_{2}$ . The following lemma shows that we can chain these reductions together since the relation $\leq_{tt}^{\textsc{A}}$ is a preorder.

Lemma 2.

The relation $\leq_{tt}^{\textsc{A}}$ is reflexive and transitive.

Proof: The relation $\leq_{tt}^{\textsc{A}}$ is reflexive since $L=\mathcal{L}(\mathcal{A}^{L})\subseteq\Sigma^{*}$ for the oracle Mealy machine $\mathcal{A}^{L}=(\{q\},\Sigma,\Sigma,\delta,\lambda,q,\{(\sigma_{q},f_{q})\})$ where $\delta(q,a)=q$ and $\lambda(q,a)=a$ for every $a\in\Sigma$ , $\sigma_{q}=(\varepsilon)$ , and $f_{q}$ is the identity.

Now we show that the relation $\leq_{tt}^{\textsc{A}}$ is transitive. Let $L_{1}\leq_{tt}^{\textsc{A}}L_{2}$ and $L_{2}\leq_{tt}^{\textsc{A}}L_{3}$ which means $L_{1}=\mathcal{L}(\mathcal{A}_{1}^{L_{2}})\subseteq\Sigma^{*}$ and $L_{2}=\mathcal{L}(\mathcal{A}_{2}^{L_{3}})\subseteq\Delta^{*}$ for some oracle Mealy machines $\mathcal{A}_{1}^{L_{2}}=(Q_{1},\Sigma,\Delta,\delta_{1},\lambda_{1},q_{0}^{1},\{(\pi_{q},g_{q})\mid q\in Q_{1}\})$ and $\mathcal{A}_{2}^{L_{3}}=(Q_{2},\Delta,\Theta,\delta_{2},\lambda_{2},q_{0}^{2},\{(\varrho_{q},h_{q})\mid q\in Q_{2}\})$ , respectively. We will construct the oracle Mealy machine $\mathcal{A}^{L_{3}}=(Q,\Sigma,\Theta,\delta,\lambda,q_{0},\{(\sigma_{q},f_{q})\mid q\in Q\})$ such that $L_{1}=\mathcal{L}(\mathcal{A}^{L_{3}})\subseteq\Sigma^{*}$ which implies the transitivity $L_{1}\leq_{tt}^{\textsc{A}}L_{3}$ . We define $Q=Q_{1}\times Q_{2}$ with $q_{0}=(q_{0}^{1},q_{0}^{2})$ , $\delta((q_{1},q_{2}),a)=(\delta_{1}(q_{1},a),\delta_{2}(q_{2},\lambda_{1}(q_{1},a)))$ and $\lambda((q_{1},q_{2}),a)=\lambda_{2}(q_{2},\lambda_{1}(q_{1},a))$ for every $(q_{1},q_{2})\in Q$ and $a\in\Sigma$ , which ensures $\mathcal{A}(w)=\lambda(q_{0},w)=\lambda_{2}(q_{0}^{2},\lambda_{1}(q_{0}^{1},w))=\mathcal{A}_{2}(\mathcal{A}_{1}(w))\in\Theta^{*}$ for every $w\in\Sigma^{*}$ . For each state $p=(p_{1},p_{2})\in Q$ in $\mathcal{A}$ , we define the tuple of query suffixes from $\Theta^{*}$ ,

$$\sigma_{p}=\left(\lambda_{2}(p_{2},s_{p_{1},i})\cdot s_{p_{2}(i),j}\,\big|\,i=1,\ldots,r_{p_{1}}\,,\,j=1,\ldots,r_{p_{2}(i)}\right)$$

where $\pi_{p_{1}}=(s_{p_{1},1},s_{p_{1},2},\ldots,s_{p_{1},r_{p_{1}}})\in(\Delta^{*})^{r_{p_{1}}}$ and $\varrho_{p_{2}(i)}=(s_{p_{2}(i),1},s_{p_{2}(i),2},\ldots,s_{p_{2}(i),r_{p_{2}(i)}})\in(\Theta^{*})^{r_{p_{2}(i)}}$ are the query suffixes associated with $p_{1}\in Q_{1}$ and $p_{2}(i)=\delta_{2}(p_{2},s_{p_{1},i})\in Q_{2}$ for $i\in\{1,\ldots,r_{p_{1}}\}$ , respectively, and the truth table $f_{p}=g_{p_{1}}(h_{p_{2}(1)},\ldots,h_{p_{2}(r_{p_{1}})})$ aggregates the answers to the corresponding oracle queries, which ensures $L_{1}=\mathcal{L}(\mathcal{A}^{L_{3}})\subseteq\Sigma^{*}$ . ∎
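The product construction from the transitivity proof can be sketched in Python as follows. This is a minimal illustration, not a definitive implementation: machines are plain dictionaries of our own design, the sketch assumes that the transition function of the second machine is total (a partial machine can be completed by a rejecting sink state, as done for `A2` below), and the example machines `A1`, `A2` with the words $v=00$, $x=0$, $w=y=z=1$ are illustrative choices.

```python
import re

def run(m, q, s):
    """Extended delta and lambda: (state reached, output written) from q on s."""
    out = ""
    for a in s:
        out += m["lam"][(q, a)]
        q = m["delta"][(q, a)]
    return q, out

def accepts(m, oracle, w):
    try:
        q, out = run(m, m["start"], w)
    except KeyError:                       # undefined transition: reject
        return False
    return m["table"][q](tuple(oracle(out + s) for s in m["suffixes"][q]))

def make_table(g, m2, blocks):
    """f_p = g_{p1}(h_{p2(1)}, ...); blocks lists (p2(i), number of its suffixes)."""
    def f(answers):
        inner, pos = [], 0
        for state, r in blocks:
            inner.append(m2["table"][state](tuple(answers[pos:pos + r])))
            pos += r
        return g(tuple(inner))
    return f

def compose(m1, m2, sigma):
    """Reachable part of the product machine; assumes delta of m2 is total."""
    start = (m1["start"], m2["start"])
    prod = {"start": start, "delta": {}, "lam": {}, "suffixes": {}, "table": {}}
    todo, seen = [start], {start}
    while todo:
        p = todo.pop()
        p1, p2 = p
        sufs, blocks = [], []
        for s in m1["suffixes"][p1]:
            p2i, out = run(m2, p2, s)      # p2(i) and lambda2(p2, s)
            blocks.append((p2i, len(m2["suffixes"][p2i])))
            sufs += [out + t for t in m2["suffixes"][p2i]]
        prod["suffixes"][p] = tuple(sufs)
        prod["table"][p] = make_table(m1["table"][p1], m2, blocks)
        for a in sigma:
            if (p1, a) in m1["delta"]:
                q2, out = run(m2, p2, m1["lam"][(p1, a)])
                nxt = (m1["delta"][(p1, a)], q2)
                prod["delta"][(p, a)], prod["lam"][(p, a)] = nxt, out
                if nxt not in seen:
                    seen.add(nxt)
                    todo.append(nxt)
    return prod

# A1 reduces {a^n b^n | n >= 1} to L_# by renaming a -> 0, b -> 1.
A1 = {"start": "r", "delta": {("r", "a"): "r", ("r", "b"): "r"},
      "lam": {("r", "a"): "0", ("r", "b"): "1"},
      "suffixes": {"r": ("",)}, "table": {"r": lambda ans: ans[0]}}

# A2 reduces L_# to L_1 = {0^m 1^n | 1 <= m <= n}; "dead" is a rejecting sink.
reject = lambda ans: False
A2 = {"start": "q0",
      "delta": {("q0", "0"): "q1", ("q0", "1"): "dead", ("q1", "0"): "q1",
                ("q1", "1"): "q2", ("q2", "0"): "dead", ("q2", "1"): "q2",
                ("dead", "0"): "dead", ("dead", "1"): "dead"},
      "lam": {("q0", "0"): "000", ("q0", "1"): "", ("q1", "0"): "0",
              ("q1", "1"): "1", ("q2", "0"): "", ("q2", "1"): "1",
              ("dead", "0"): "", ("dead", "1"): ""},
      "suffixes": {"q0": (), "q1": (), "q2": ("1", "11"), "dead": ()},
      "table": {"q0": reject, "q1": reject, "dead": reject,
                "q2": lambda ans: (not ans[0]) and ans[1]}}

def in_L1(u):
    m = re.fullmatch(r"(0+)(1+)", u)
    return bool(m) and len(m.group(1)) <= len(m.group(2))

A = compose(A1, A2, "ab")   # decides {a^n b^n | n >= 1} with the oracle L_1
```

On any input, `accepts(A, in_L1, w)` should agree with running `A1` against the oracle that is itself computed by `A2` with `in_L1`, which is exactly what transitivity asserts.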

We say that a (decision) problem $L_{0}\subseteq\Sigma^{*}$ is DCFL^′-simple if $L_{0}\leq_{tt}^{\textsc{A}}L$ for every non-regular deterministic context-free language $L\subseteq\Delta^{*}$ . It follows from theorem 1 that the DCFL^′ language $L_{\#}$ is an example of a DCFL^′-simple problem. In addition, we denote by DCFLS the class of DCFL^′-simple problems and formulate its basic properties.

Corollary 3 (of theorem 1).

The non-regular deterministic context-free language $L_{\#}=\{0^{n}1^{n}\mid n\geq 1\}$ is DCFL^′-simple.

Proof: Let $L\subseteq\Delta^{*}$ be any DCFL^′ language. According to theorem 1, there are $v,x,w,y,z\in\Delta^{+}$ and $L^{\prime}\in\{L,\overline{L}\}$ such that condition (1) holds for $L^{\prime}$ . We define the Mealy machine $\mathcal{A}^{L}=(\{q_{0},q_{1},q_{2}\},\{0,1\},\Delta,\delta,\lambda,q_{0},\{(\sigma_{q},f_{q})\mid q\in Q\})$ with the oracle $L$ , as $\delta(q_{0},0)=\delta(q_{1},0)=q_{1}$ , $\delta(q_{1},1)=\delta(q_{2},1)=q_{2}$ , $\lambda(q_{0},0)=vx$ , $\lambda(q_{1},0)=x$ , $\lambda(q_{1},1)=w$ , $\lambda(q_{2},1)=y$ , $\sigma_{q_{2}}=(z,yz)$ , $f_{q_{0}}=f_{q_{1}}=0$ , $f_{q_{2}}(0,0)=f_{q_{2}}(1,1)=0$ , and $f_{q_{2}}(1,0)=1-f_{q_{2}}(0,1)$ where $f_{q_{2}}(0,1)=1$ iff $L^{\prime}=L$ . It is easy to verify that $L_{\#}=\mathcal{L}(\mathcal{A}^{L})$ , which implies $L_{\#}\leq_{tt}^{\textsc{A}}L$ . Hence, $L_{\#}$ is DCFL^′-simple. ∎
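The three-state machine from this proof can be written down directly. The Python sketch below is an illustration under stated assumptions: the function name `make_reduction` is ours, and the concrete oracle $L_{1}=\{0^{m}1^{n}\mid 1\leq m\leq n\}$ with the words $v=00$, $x=0$, $w=y=z=1$ is one choice that satisfies condition (1) with $L^{\prime}=L_{1}$ (then $vx^{m}wy^{n}z=0^{m+2}1^{n+2}$); it is not prescribed by the paper.

```python
import re

def make_reduction(v, x, w, y, z, use_complement):
    """Return the acceptance test of the oracle Mealy machine from the proof."""
    delta = {("q0", "0"): "q1", ("q1", "0"): "q1",
             ("q1", "1"): "q2", ("q2", "1"): "q2"}
    lam = {("q0", "0"): v + x, ("q1", "0"): x, ("q1", "1"): w, ("q2", "1"): y}

    def accepts(oracle, word):
        q, tape = "q0", ""
        for a in word:
            if (q, a) not in delta:
                return False                # inputs outside 0^+ 1^+ are rejected
            tape += lam[(q, a)]
            q = delta[(q, a)]
        if q != "q2":                       # f_{q0} = f_{q1} = 0
            return False
        # sigma_{q2} = (z, yz): the tape holds v x^m w y^(n-1) at this point.
        first, second = oracle(tape + z), oracle(tape + y + z)
        # f_{q2}(0,1) = 1 iff L' = L, f_{q2}(1,0) = 1 - f_{q2}(0,1),
        # and equal answers always give 0.
        wanted = (True, False) if use_complement else (False, True)
        return (first, second) == wanted
    return accepts

def in_L1(u):
    """Oracle L_1 = {0^m 1^n | 1 <= m <= n}."""
    m = re.fullmatch(r"(0+)(1+)", u)
    return bool(m) and len(m.group(1)) <= len(m.group(2))

def in_co_L1(u):
    """Oracle for the complement of L_1 over {0,1}^*."""
    return not in_L1(u)

# Case L' = L with the oracle L_1, and case L' = complement with the oracle co-L_1.
reduce_to_L1 = make_reduction("00", "0", "1", "1", "1", use_complement=False)
reduce_to_co_L1 = make_reduction("00", "0", "1", "1", "1", use_complement=True)
```

Both `reduce_to_L1(in_L1, u)` and `reduce_to_co_L1(in_co_L1, u)` are meant to decide whether `u` belongs to $L_{\#}$; the second call shows how the truth table absorbs the complement.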

Proposition 4.

1. $\mbox{REG}\,\subsetneq\,\mbox{DCFLS}$ .

2. $\mbox{DCFLS}\,\subsetneq\,\mbox{DCFL}$ , and $L_{R}=\{wcw^{R}\mid w\in\{a,b\}^{*}\}\in\mbox{DCFL}\smallsetminus\mbox{DCFLS}$ .

3. The class DCFLS is closed under complement and intersection with regular languages.

4. The class DCFLS is not closed under concatenation, intersection and union.

Proof: [Sketch.]
1. For any regular language $L$ , consider a Mealy machine $\mathcal{A}^{L_{\#}}$ with the DCFL^′-simple oracle $L_{\#}$ , that simulates a deterministic finite automaton recognizing $L$ ; its truth tables are constant and produce 1 exactly for the states that correspond to the accepting states of the automaton. Hence, $L\leq_{tt}^{\textsc{A}}L_{\#}$ which means $L$ is DCFL^′-simple according to lemma 2 and corollary 3. Since $L_{\#}$ itself is non-regular and DCFL^′-simple by corollary 3, we also get $\mbox{REG}\,\not=\,\mbox{DCFLS}$ .

2. We first observe that $\mbox{DCFLS}\,\subseteq\,\mbox{DCFL}$ . Let $L\in$ DCFLS be any DCFL^′-simple language which ensures $L\leq_{tt}^{\textsc{A}}L_{\#}$ by an oracle Mealy machine $\mathcal{A}^{L_{\#}}$ . The machine $\mathcal{A}^{L_{\#}}$ can be simulated by a DPDA $\mathcal{M}$ which extends a suitable DPDA $\mathcal{M}_{\#}$ (e.g. with no $\varepsilon$ -transitions) accepting $L_{\#}=\mathcal{L}(\mathcal{M}_{\#})$ , so that the finite control of $\mathcal{M}$ implements the finite-state transducer $\mathcal{A}$ whose output is presented online as an input to $\mathcal{M}_{\#}$ . Moreover, for each state $q$ of $\mathcal{A}$ , the finite control of $\mathcal{M}$ evaluates the truth table $f_{q}$ which aggregates the answers to the queries with $r_{q}$ suffixes associated with $q$ , by inspecting at most a constant number of topmost stack symbols. Hence $L=\mathcal{L}(\mathcal{M})\in$ DCFL.

In order to show that $\mbox{DCFLS}\,\not=\,\mbox{DCFL}$ , we prove that the DCFL $L_{R}=\{wcw^{R}\mid w\in\{a,b\}^{*}\}$ over the alphabet $\{a,b,c\}$ is not DCFL^′-simple. For the sake of contradiction, suppose that $L_{R}\leq_{tt}^{\textsc{A}}L_{\#}$ by a Mealy machine $\mathcal{A}^{L_{\#}}=(Q,\{a,b,c\},\{0,1\},\delta,\lambda,q_{0},\{(\sigma_{q},f_{q})\mid q\in Q\})$ with the oracle $L_{\#}=\{0^{n}1^{n}\mid n\geq 1\}$ , which means $L_{R}=\mathcal{L}(\mathcal{A}^{L_{\#}})$ . Consider all the $2^{k}$ possible prefixes $w\in\{a,b\}^{k}$ of inputs presented to $\mathcal{A}^{L_{\#}}$ that have the length $|w|=k$ . These strings can bring $\mathcal{A}^{L_{\#}}$ into a finite number $|\{\delta(q_{0},w)\mid w\in\{a,b\}^{k}\}|\leq|Q|$ of distinct states while the length $|\lambda(q_{0},w)|$ of outputs written to the oracle tape is bounded by $O(k)$ . For $\lambda(q_{0},w)$ outside $0^{*}1^{*}$ , the acceptance of words $wu$ where $u\in\{a,b,c\}^{*}$ , depends only on the truth values $f_{q}(0,\ldots,0)$ associated with the states $q$ from the finite set $Q$ , due to $\lambda(q_{0},wu)\notin L_{\#}/s$ for any $s\in\{0,1\}^{*}$ . On the other hand, the number of distinct outputs $\lambda(q_{0},w)$ in $0^{*}1^{*}$ is bounded by $O(k)$ . This means that for a sufficiently large $k\geq 1$ , there must be two distinct prefixes $w_{1},w_{2}\in\{a,b\}^{k}$ such that $\delta(q_{0},w_{1})=\delta(q_{0},w_{2})$ and $\lambda(q_{0},w_{1})=\lambda(q_{0},w_{2})$ in $0^{*}1^{*}$ , which results in the contradiction $w_{1}cw_{2}^{R}\in\mathcal{L}(\mathcal{A}^{L_{\#}})\smallsetminus L_{R}$ .

3. The class DCFLS is closed under complement since the truth tables can be negated. Furthermore, any oracle Mealy machine can be modified so that it simulates another given finite automaton in parallel and is forced to reject if this automaton rejects, which shows DCFLS to be closed under intersection with regular languages.

4. Observe that $(L_{\#})^{2}$ is not DCFL^′-simple under truth-table reduction. In addition, $L_{1}=\{0^{m}1^{m}0^{n}\mid m,n\geq 1\}$ and $L_{2}=\{0^{m}1^{n}0^{n}\mid m,n\geq 1\}$ are DCFL^′-simple while $L_{1}\cap L_{2}=\{0^{n}1^{n}0^{n}\mid n\geq 1\}$ is not context-free, and hence not in DCFLS by item 2. Non-closure under union then follows from closure under complement (item 3) and De Morgan’s law. ∎

3 Proof of the Main Result (Theorem 1)

Theorem 1 follows from the (more specific) next lemma that we prove in this section.

By $\mathbb{N}$ we denote the set $\{0,1,2,\dots\}$ , and by $[i,j]$ the set $\{i,i{+}1,\dots,j\}$ (for $i,j\in\mathbb{N}$ ).

Lemma 5.

Let $\mathcal{M}=(Q,\Sigma,\Gamma,R,p_{0},X_{0},F)$ be a DPDA where $L=\mathcal{L}(p_{0}X_{0})$ is non-regular (hence $L$ belongs to DCFL^′). There are $v\in\Sigma^{*}$ , $x,w,y,z\in\Sigma^{+}$ , $p,q\in Q$ , $X\in\Gamma$ , $\gamma\in\Gamma^{+}$ , $\delta\in\Gamma^{*}$ such that the following four conditions hold:

1. $p_{0}X_{0}\xrightarrow{v}pX\delta$ and $pX\xrightarrow{x}pX\gamma$ , which entails the infinite (stack increasing) computation

$$p_{0}X_{0}\xrightarrow{v}pX\delta\xrightarrow{x}pX\gamma\delta\xrightarrow{x}pX\gamma\gamma\delta\xrightarrow{x}pX\gamma\gamma\gamma\delta\xrightarrow{x}\cdots;\tag{2}$$

2. $pX\xrightarrow{w}q$ ;

3. $q\gamma\xrightarrow{y}q$ , hence $q\gamma^{\ell}\delta^{\prime}\xrightarrow{y^{\ell}}q\delta^{\prime}$ for all $\ell\in\mathbb{N}$ and $\delta^{\prime}\in\Gamma^{*}$ ;

4. one of the following cases is valid (depending on whether $z\in\mathcal{L}(q\delta)$ or $z\not\in\mathcal{L}(q\delta)$ ):

   (a) $\mathcal{L}(q\gamma^{k}\delta)\ni y^{\ell}z$ iff $k=\ell$ (for all $k,\ell\in\mathbb{N}$ ), or $\mathcal{L}(q\gamma^{k}\delta)\ni y^{\ell}z$ iff $k\leq\ell$ (for all $k,\ell\in\mathbb{N}$ );

   (b) $\mathcal{L}(q\gamma^{k}\delta)\ni y^{\ell}z$ iff $k\neq\ell$ (for all $k,\ell\in\mathbb{N}$ ), or $\mathcal{L}(q\gamma^{k}\delta)\ni y^{\ell}z$ iff $k>\ell$ (for all $k,\ell\in\mathbb{N}$ ).

We note that $p_{0}X_{0}\xrightarrow{v}pX\delta\xrightarrow{x^{m}}pX\gamma^{m}\delta\xrightarrow{w}q\gamma^{m}\delta\xrightarrow{y^{m}}q\delta$ (for each $m\in\mathbb{N}$ ); hence $vx^{m}wy^{m}z\in L$ iff $z\in\mathcal{L}(q\delta)$ (since $z$ is nonempty). Theorem 1 indeed follows from the lemma: there is $L^{\prime}\in\{L,\overline{L}\}$ such that either $vx^{m}wy^{n}z\in L^{\prime}$ iff $m=n$ (for all $m,n\in\mathbb{N}$ ), or $vx^{m}wy^{n}z\in L^{\prime}$ iff $m\leq n$ (for all $m,n\in\mathbb{N}$ ). (In theorem 1 we also stated that $v$ is nonempty. If $v=\varepsilon$ here, then we simply take $vx$ and $yz$ as the new $v,z$ , respectively.)
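As a sanity check of the word pattern, the following Python sketch verifies the equivalence "$vx^{m}wy^{n}z\in L$ iff $m=n$" for the language $L=L_{\#}$ itself on a finite range of $m,n$. The concrete words $v=0$, $x=0$, $w=01$, $y=1$, $z=1$ and all function names are our own illustrative assumptions; they are chosen by hand and are not extracted from a DPDA as in the proof below.

```python
import re

def in_l_hash(word: str) -> bool:
    """Membership in L_# = {0^n 1^n | n >= 1}."""
    match = re.fullmatch(r"(0+)(1+)", word)
    return match is not None and len(match.group(1)) == len(match.group(2))

# Hypothetical witness words for L = L_# (hand-picked for illustration only).
V, X, W, Y, Z = "0", "0", "01", "1", "1"

def witness(m: int, n: int) -> str:
    """Build the word v x^m w y^n z."""
    return V + X * m + W + Y * n + Z

def pattern_holds(bound: int) -> bool:
    """Check 'v x^m w y^n z in L_# iff m == n' for all 0 <= m, n < bound."""
    return all(
        in_l_hash(witness(m, n)) == (m == n)
        for m in range(bound)
        for n in range(bound)
    )
```

With these words, `witness(m, n)` equals $0^{m+2}1^{n+2}$, so the check succeeds for every bound; for a general DCFL^′ language the lemma only guarantees that some such words exist.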

Proof of Lemma 5

In the rest of this section we provide a proof of lemma 5, assuming a fixed DPDA $\mathcal{M}=(Q,\Sigma,\Gamma,R,p_{0},X_{0},F)$ where $L=\mathcal{L}(p_{0}X_{0})$ is non-regular. The proof structure is visible from the auxiliary claims that we state and prove on the way.

Convention. W.l.o.g. we assume that $\mathcal{M}$ always reads the whole input $w\in\Sigma^{*}$ from $p_{0}X_{0}$ . This can be accomplished in the standard way, by adding a special bottom-of-stack symbol $\bot$ and a (non-accepting) fail-state. (Each empty-stack configuration $q\varepsilon$ becomes $q\bot$ , and each originally stuck computation enters the fail-state where it loops. We also recall that all $\varepsilon$ -steps are popping, and thus infinite $\varepsilon$ -sequences are impossible.) Hence for any infinite word $a_{1}a_{2}a_{3}\cdots$ in $\Sigma^{\omega}$ there is the unique infinite computation of $\mathcal{M}$ starting in $p_{0}X_{0}$ ; it stepwise reads the whole infinite word $a_{1}a_{2}a_{3}\cdots$ .

The left quotient of $L$ by $u\in\Sigma^{*}$ is the set $u\backslash L=\{v\in\Sigma^{*}\mid uv\in L\}$ ; concatenation has priority over $\backslash$ , hence $u_{1}u_{2}\backslash L=(u_{1}u_{2})\backslash L$ . (The next claim is valid for any non-regular $L$ .)

Claim 6.

We can fix an infinite word $a_{1}a_{2}a_{3}\cdots$ in $\Sigma^{\omega}$ ( $a_{i}\in\Sigma$ ) such that $a_{1}a_{2}\cdots a_{i}\backslash L\neq a_{1}a_{2}\cdots a_{j}\backslash L$ for all $i\neq j$ .

Proof: Let us consider the labelled transition system $\mathcal{T}=(\textsc{LQ}(L),\Sigma,(\xrightarrow{a})_{a\in\Sigma})$ where $\textsc{LQ}(L)=\{u\backslash L\mid u\in\Sigma^{*}\}$ and $\mathop{\xrightarrow{a}}=\{(L^{\prime},a\backslash L^{\prime})\mid L^{\prime}\in\textsc{LQ}(L)\}$ . (We recall that $L^{\prime}=u\backslash L$ entails $a\backslash L^{\prime}=ua\backslash L$ .) Since $L$ is non-regular, the set of states reachable from $L=\varepsilon\backslash L$ in $\mathcal{T}$ is infinite. The out-degree of states in $\mathcal{T}$ is finite (in fact, bounded by $|\Sigma|$ ), hence an application of König’s lemma yields an infinite acyclic path $L\xrightarrow{a_{1}}L_{1}\xrightarrow{a_{2}}L_{2}\xrightarrow{a_{3}}\cdots$ . ∎
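For $L=L_{\#}$ one possible choice of the infinite word in Claim 6 is $0^{\omega}$. The sketch below illustrates this on a finite scale: it samples each left quotient $0^{i}\backslash L_{\#}$ on all words up to a given length and checks that the samples are pairwise distinct. The function names and the sampling bound are assumptions made for this illustration; distinct finite samples imply distinct quotients, but the code is not a proof for all $i$.

```python
from itertools import product

def in_l_hash(word: str) -> bool:
    """Membership in L_# = {0^n 1^n | n >= 1}."""
    zeros = len(word) - len(word.lstrip("0"))
    ones = len(word) - zeros
    return zeros >= 1 and zeros == ones and word == "0" * zeros + "1" * ones

def quotient_sample(prefix: str, max_len: int) -> frozenset:
    """All words v with |v| <= max_len such that prefix + v is in L_#."""
    return frozenset(
        "".join(letters)
        for length in range(max_len + 1)
        for letters in product("01", repeat=length)
        if in_l_hash(prefix + "".join(letters))
    )

def prefixes_pairwise_distinct(count: int, max_len: int) -> bool:
    """Check that the sampled quotients by 0^0, 0^1, ..., 0^(count-1) all differ."""
    samples = [quotient_sample("0" * i, max_len) for i in range(count)]
    return len(set(samples)) == count
```

For instance, the word $1^{i}$ belongs to the quotient by $0^{i}$ (for $i\geq 1$) but to no quotient by $0^{j}$ with $j\neq i$, which is why the samples differ once `max_len` is at least `count - 1`.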

We call a configuration $p\alpha$ of $\mathcal{M}$ unstable if $\alpha=Y\beta$ and $R$ contains a rule $pY\xrightarrow{\varepsilon}q$ (we recall that $\varepsilon$ -steps are only popping); otherwise $p\alpha$ is stable. Since $\mathcal{M}$ is a deterministic PDA, for each unstable $p\alpha$ we can soundly define the stable successor of $p\alpha$ as the unique stable configuration $p^{\prime}\alpha^{\prime}$ where $p\alpha\xrightarrow{\varepsilon}p^{\prime}\alpha^{\prime}$ ( $\alpha^{\prime}$ being a suffix of $\alpha$ ). The path $p\alpha\xrightarrow{\varepsilon}p^{\prime}\alpha^{\prime}$ might (not) go via an accepting state (in $F$ ), hence $\mathcal{L}(p\alpha)=\mathcal{L}(p^{\prime}\alpha^{\prime})$ or $\mathcal{L}(p\alpha)=\{\varepsilon\}\cup\mathcal{L}(p^{\prime}\alpha^{\prime})$ . (We note that the configurations in the computation (2) that start with $pX$ are necessarily stable.)
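The notions of unstable configurations and stable successors can be sketched in Python as follows. The DPDA below, with states `p`, `r`, `f`, stack symbols `Z`, `A`, and accepting state `f`, is a hypothetical machine for $L_{\#}$ that we introduce only for illustration; the dictionary encoding of rules is likewise an assumption. Instead of adding a fail-state as in the convention above, the sketch simply rejects when the computation gets stuck.

```python
# Rules: (state, top symbol, letter) -> (new state, pushed string).
# The letter "" denotes an epsilon-step; such steps only pop (pushed string "").
RULES = {
    ("p", "Z", "0"): ("p", "AZ"),
    ("p", "A", "0"): ("p", "AA"),
    ("p", "A", "1"): ("r", ""),
    ("r", "A", "1"): ("r", ""),
    ("r", "Z", ""): ("f", ""),
}
ACCEPTING = {"f"}

def is_unstable(state: str, stack: str) -> bool:
    """A configuration is unstable if an epsilon-rule applies to its top symbol."""
    return bool(stack) and (state, stack[0], "") in RULES

def stable_successor(state: str, stack: str):
    """Follow popping epsilon-steps; also report whether an accepting state was seen."""
    seen_accepting = state in ACCEPTING
    while is_unstable(state, stack):
        state, _ = RULES[(state, stack[0], "")]
        stack = stack[1:]
        seen_accepting = seen_accepting or state in ACCEPTING
    return state, stack, seen_accepting

def accepts(word: str, state: str = "p", stack: str = "Z") -> bool:
    """Decide whether word belongs to the language of the configuration (state, stack)."""
    state, stack, accepted = stable_successor(state, stack)
    for letter in word:
        key = (state, stack[0], letter) if stack else None
        if key not in RULES:
            return False  # stuck computation: plays the role of the fail-state
        new_state, pushed = RULES[key]
        state, stack, accepted = stable_successor(new_state, pushed + stack[1:])
    return accepted
```

Here the configuration `("r", "Z")` is unstable and its stable successor is `("f", "")`; the path goes via the accepting state `f`, which corresponds to the case $\mathcal{L}(p\alpha)=\{\varepsilon\}\cup\mathcal{L}(p^{\prime}\alpha^{\prime})$.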

Claim 7.

Each configuration is visited at most twice by

the computation of $\mathcal{M}$ from $p_{0}X_{0}$ on $a_{1}a_{2}a_{3}\cdots$ that is fixed by Claim 6. (3)

Proof: The computation (3) is infinite, stepwise reading the whole word $a_{1}a_{2}a_{3}\cdots$ , and it can be presented as

$r_{0}\gamma_{0}\xrightarrow{a_{1}}r_{1}\gamma_{1}\xrightarrow{a_{2}}r_{2}\gamma_{2}\xrightarrow{a_{3}}\cdots$ (for $r_{0}\gamma_{0}=p_{0}X_{0}$ )

where each $r_{i}\gamma_{i}$ is stable; each segment $r_{i}\gamma_{i}\xrightarrow{a_{i+1}}r_{i+1}\gamma_{i+1}$ starts with a (visible) $a_{i+1}$ -step that is followed by a (maybe empty) sequence of (popping) $\varepsilon$ -steps via unstable configurations. Since such an $\varepsilon$ -sequence might go through an accepting state, we can have $r_{i}\gamma_{i}=r_{j}\gamma_{j}$ for $i\neq j$ though $a_{1}a_{2}\cdots a_{i}\backslash L\neq a_{1}a_{2}\cdots a_{j}\backslash L$ ; in this case $L$ contains precisely one of the words $a_{1}a_{2}\cdots a_{i}$ and $a_{1}a_{2}\cdots a_{j}$ , and the languages $a_{1}a_{2}\cdots a_{i}\backslash L$ and $a_{1}a_{2}\cdots a_{j}\backslash L$ differ just on $\varepsilon$ . Nevertheless, this reasoning entails that we cannot have $r_{i}\gamma_{i}=r_{j}\gamma_{j}=r_{\ell}\gamma_{\ell}$ for pairwise different $i,j,\ell$ .

Since each segment $r_{i}\gamma_{i}\xrightarrow{a_{i+1}}r_{i+1}\gamma_{i+1}$ visits any unstable configuration at most once and $r_{i+1}\gamma_{i+1}$ is the stable successor for all unstable configurations in the segment, we deduce that also each unstable configuration can be visited at most twice in the computation (3). ∎

Claim 8.

The computation (3) on $a_{1}a_{2}a_{3}\cdots$ can be “stair-factorized”, that is, written

$p_{0}X_{0}\xrightarrow{v_{0}}p_{1}X_{1}\alpha_{1}\xrightarrow{v_{1}}p_{2}X_{2}\alpha_{2}\alpha_{1}\xrightarrow{v_{2}}p_{3}X_{3}\alpha_{3}\alpha_{2}\alpha_{1}\xrightarrow{v_{3}}\cdots$ (4)

so that for each $i\in\mathbb{N}$ we have $v_{i}\in\Sigma^{+}$ and $p_{i}X_{i}\xrightarrow{v_{i}}p_{i+1}X_{i+1}\alpha_{i+1}$ where $\alpha_{i+1}$ is a nonempty suffix of the right-hand side of a rule in $R$ (i.e., a nonempty suffix of $\gamma$ in a rule $pX\xrightarrow{a}q\gamma$ ).

Proof: We consider the computation (3), and call a stable configuration $pX\beta$ a level, with position $i\in\mathbb{N}$ , if $p_{0}X_{0}\xrightarrow{a_{1}\cdots a_{i}}pX\beta$ and all configurations visited by the computation $pX\beta\xrightarrow{a_{i+1}a_{i+2}\cdots}$ after $pX\beta$ have the stack longer than $|X\beta|$ ; we note that each level $pX\beta$ has a unique position $\textsc{pos}(pX\beta)$ . Since each configuration is visited at most twice in (3), the set of levels is infinite, with elements $p^{\prime}_{0}X^{\prime}_{0}$ , $p_{1}X_{1}\beta_{1}$ , $p_{2}X_{2}\beta_{2}$ , $\dots$ where $0\leq\textsc{pos}(p^{\prime}_{0}X^{\prime}_{0})<\textsc{pos}(p_{1}X_{1}\beta_{1})<\textsc{pos}(p_{2}X_{2}\beta_{2})<\cdots$ . The computation (3) can thus be presented as

$p_{0}X_{0}\xrightarrow{v^{\prime}_{0}}p^{\prime}_{0}X^{\prime}_{0}\xrightarrow{v^{\prime\prime}_{0}}p_{1}X_{1}\beta_{1}\xrightarrow{v_{1}}p_{2}X_{2}\beta_{2}\xrightarrow{v_{2}}p_{3}X_{3}\beta_{3}\xrightarrow{v_{3}}\cdots$

where $|v^{\prime}_{0}|=\textsc{pos}(p^{\prime}_{0}X^{\prime}_{0})$ , and $|v_{0}v_{1}\cdots v_{j-1}|=\textsc{pos}(p_{j}X_{j}\beta_{j})$ for $j\geq 1$ , putting $v_{0}=v^{\prime}_{0}v^{\prime\prime}_{0}$ .

Each segment $pX\beta\xrightarrow{v}p^{\prime}X^{\prime}\beta^{\prime}$ between two neighbouring levels can be obviously written as $pX\beta\xrightarrow{a}q\gamma_{1}\gamma_{2}\beta\xrightarrow{v^{\prime}}p^{\prime}X^{\prime}\gamma_{2}\beta$ where $pX\xrightarrow{a}q\gamma_{1}\gamma_{2}$ is a rule in $R$ , both $\gamma_{1}$ and $\gamma_{2}$ are nonempty, $v=av^{\prime}$ , and $q\gamma_{1}\xrightarrow{v^{\prime}}p^{\prime}X^{\prime}$ . Hence the validity of the claim is clear. ∎

We define the natural equivalence relation $\sim$ on the set of configurations of $\mathcal{M}$ : we put $p\alpha\sim q\beta$ if $\mathcal{L}(p\alpha)=\mathcal{L}(q\beta)$ .

We fix the presentation (4), calling $p_{i}X_{i}\alpha_{i}\alpha_{i-1}\cdots\alpha_{1}$ the level-configurations (for all $i\in\mathbb{N}$ ). Since we have $\mathcal{L}(p_{i}X_{i}\alpha_{i}\alpha_{i-1}\cdots\alpha_{1})\smallsetminus\{\varepsilon\}=(v_{0}v_{1}\cdots v_{i-1}\backslash L)\smallsetminus\{\varepsilon\}$ , there cannot be three level-configurations in the same $\sim$ -class (i.e., in the same equivalence class w.r.t. $\sim$ ). Hence any infinite set of level-configurations represents infinitely many $\sim$ -classes. Now we show a congruence-property that might enable us to shorten a level-configuration while keeping its $\sim$ -class. We use the notation $\textsc{DS}(p\alpha)$ (the “down-states” of $p\alpha$ ), putting

$\textsc{DS}(p\alpha)=\{q\mid p\alpha\xrightarrow{w}q$ for some $w\in\Sigma^{*}\}$ .

Claim 9.

If $q\gamma\sim q\gamma^{\prime}$ for each $q\in\textsc{DS}(p\beta)$ , then $p\beta\gamma\sim p\beta\gamma^{\prime}$ .

Proof: Let us consider $w\in\Sigma^{*}$ . If $w\in\mathcal{L}(p\beta)$ , then $w\in\mathcal{L}(p\beta\mu)$ for all $\mu\in\Gamma^{*}$ . If $w\not\in\mathcal{L}(p\beta)$ and there is no prefix $v$ of $w$ such that $p\beta\xrightarrow{v}q$ , then $w\not\in\mathcal{L}(p\beta\mu)$ for all $\mu\in\Gamma^{*}$ . If $w\not\in\mathcal{L}(p\beta)$ and $w=vv^{\prime}$ where $p\beta\xrightarrow{v}q$ (necessarily for some $q\in\textsc{DS}(p\beta)$ ), then $w\in\mathcal{L}(p\beta\mu)$ iff $v^{\prime}\in\mathcal{L}(q\mu)$ . Hence the claim is clear. ∎
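The down-states $\textsc{DS}(p\alpha)$ can be computed by a simple fixpoint iteration, sketched below in Python. The DPDA for $L_{\#}$ (states `p`, `r`, `f`, stack symbols `Z`, `A`) and the rule encoding are hypothetical and serve only as an example; the iteration itself is a standard saturation that we assume here, not a construction taken from the proof. It first determines $\textsc{DS}(pX)$ for single stack symbols and then composes them symbol by symbol along the stack.

```python
# Rules: (state, top symbol, letter) -> (new state, pushed string); "" is epsilon.
RULES = {
    ("p", "Z", "0"): ("p", "AZ"),
    ("p", "A", "0"): ("p", "AA"),
    ("p", "A", "1"): ("r", ""),
    ("r", "A", "1"): ("r", ""),
    ("r", "Z", ""): ("f", ""),
}
STATES = {"p", "r", "f"}
SYMBOLS = {"Z", "A"}

def compute_down_states(rules, states, symbols):
    """Return a function DS(state, stack) giving the set of down-states."""
    ds = {(q, y): set() for q in states for y in symbols}

    def ds_of(state, stack):
        # Compose single-symbol down-states from the top of the stack downwards.
        current = {state}
        for symbol in stack:
            current = set().union(*(ds[(q, symbol)] for q in current))
        return current

    changed = True
    while changed:  # least fixpoint: repeat until no set grows any more
        changed = False
        for (p, x, _letter), (q, pushed) in rules.items():
            reached = ds_of(q, pushed)
            if not reached <= ds[(p, x)]:
                ds[(p, x)] |= reached
                changed = True
    return ds_of

DS = compute_down_states(RULES, STATES, SYMBOLS)
```

In this example `DS("p", "AA")` is `{"r"}` and `DS("p", "AAZ")` is `{"f"}`; the hypothesis of Claim 9 then only needs to be checked for the states in such a set.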

The next claim is an immediate corollary.

Claim 10.

Any computation $p_{0}X_{0}\xrightarrow{w_{1}}pX\beta_{1}\xrightarrow{w_{2}}pX\beta_{2}\beta_{1}\xrightarrow{w_{3}}p^{\prime}X^{\prime}\beta_{3}\beta_{2}\beta_{1}$ where $pX\xrightarrow{w_{2}}pX\beta_{2}$ ( $w_{2}\in\Sigma^{+}$ ), $pX\xrightarrow{w_{3}}p^{\prime}X^{\prime}\beta_{3}$ , and $q\beta_{2}\beta_{1}\sim q\beta_{1}$ for each $q\in\textsc{DS}(p^{\prime}X^{\prime}\beta_{3})$ can be shortened to $p_{0}X_{0}\xrightarrow{w_{1}}pX\beta_{1}\xrightarrow{w_{3}}p^{\prime}X^{\prime}\beta_{3}\beta_{1}$ where $p^{\prime}X^{\prime}\beta_{3}\beta_{1}\sim p^{\prime}X^{\prime}\beta_{3}\beta_{2}\beta_{1}$ .

The $i$ -th level-configuration in (4) is reached by the computation $p_{0}X_{0}\xrightarrow{v_{0}v_{1}\cdots v_{i-1}}p_{i}X_{i}\alpha_{i}\alpha_{i-1}\cdots\alpha_{1}$ . It can happen that there are $j_{1},j_{2}$ , $0\leq j_{1}<j_{2}\leq i$ such that $p_{j_{1}}X_{j_{1}}=p_{j_{2}}X_{j_{2}}$ and $q\alpha_{j_{2}}\alpha_{j_{2}-1}\cdots\alpha_{1}\sim q\alpha_{j_{1}}\alpha_{j_{1}-1}\cdots\alpha_{1}$ for all $q\in\textsc{DS}(p_{i}X_{i}\alpha_{i}\alpha_{i-1}\cdots\alpha_{j_{2}+1})$ . In this case we can shorten the computation as in Claim 10, where $v_{j_{1}}v_{j_{1}+1}\cdots v_{j_{2}-1}$ corresponds to the omitted $w_{2}$ . It might be possible to shorten the resulting computation further, repeatedly (if it can be presented so that the conditions of Claim 10 are satisfied). Now for each $i\geq 1$ we fix a (stair-factorized) computation

$p_{i,0}X_{i,0}\xrightarrow{v_{i,0}}p_{i,1}X_{i,1}\alpha_{i,1}\xrightarrow{v_{i,1}}p_{i,2}X_{i,2}\alpha_{i,2}\alpha_{i,1}\ \cdots\ \xrightarrow{v_{i,n_{i}-1}}p_{i,n_{i}}X_{i,n_{i}}\alpha_{i,n_{i}}\alpha_{i,n_{i}-1}\cdots\alpha_{i,1}$ (5)

that has arisen by a maximal sequence of the above shortenings of the prefix

$p_{0}X_{0}\xrightarrow{v_{0}v_{1}\cdots v_{i-1}}p_{i}X_{i}\alpha_{i}\alpha_{i-1}\cdots\alpha_{1}$ of (4).

Hence $p_{i,0}X_{i,0}=p_{0}X_{0}$ , $p_{i,n_{i}}X_{i,n_{i}}=p_{i}X_{i}$ , $\alpha_{i,n_{i}},\alpha_{i,n_{i}-1},\dots,\alpha_{i,1}$ is a subsequence of
$\alpha_{i},\alpha_{i-1},\dots,\alpha_{1}$ , and $p_{i,n_{i}}X_{i,n_{i}}\alpha_{i,n_{i}}\alpha_{i,n_{i}-1}\cdots\alpha_{i,1}\sim p_{i}X_{i}\alpha_{i}\alpha_{i-1}\cdots\alpha_{1}$ .

Claim 11.

For each $\ell\in\mathbb{N}$ there is $i$ such that $n_{i}>\ell$ (where $n_{i}$ is from (5)).

Proof: As already discussed, the set of level-configurations represents infinitely many $\sim$ -classes. The last configurations of computations (5) represent the same infinite set of $\sim$ -classes, and their lengths thus cannot be bounded; since the lengths of all $\alpha_{i,j}$ are bounded (they are shorter than the longest right-hand sides of the rules in $R$ ), the claim is clear. ∎

Now we come to a crucial claim in our proof of lemma 5. Besides the notation $\textsc{DS}(p\alpha)$ we also introduce $\textsc{ES}(p\alpha)$ (the by- $\varepsilon$ -reached down-states of $p\alpha$ ), by putting

$\textsc{ES}(p\alpha)=\{q\mid p\alpha\xrightarrow{\varepsilon}q\}$ .

Hence $\textsc{ES}(p\alpha)\subseteq\textsc{DS}(p\alpha)$ , and $|\textsc{ES}(p\alpha)|\leq 1$ (due to the determinism of the DPDA $\mathcal{M}$ ).

We recall that $p\alpha\sim q\beta$ means $\mathcal{L}(p\alpha)=\mathcal{L}(q\beta)$ . To handle the special case of the empty word $\varepsilon$ , we also define a (much) coarser equivalence $\sim_{0}$ : we put $p\alpha\sim_{0}q\beta$ if $\varepsilon$ either belongs to both $\mathcal{L}(p\alpha)$ and $\mathcal{L}(q\beta)$ , or belongs to none of them.

Claim 12.

There is a constant $\textsc{B}\in\mathbb{N}$ determined by the DPDA $\mathcal{M}$ such that for all $i\in\mathbb{N}$ where $n_{i}>B$ the final configuration in (5) can be written as

$p_{i,n_{i}}X_{i,n_{i}}\alpha_{i,n_{i}}\alpha_{i,n_{i}-1}\cdots\alpha_{i,1}=\bar{p}\bar{X}\beta\gamma\delta$

where the following conditions hold:

1. $\gamma=\alpha_{i,j}\alpha_{i,j-1}\cdots\alpha_{i,j^{\prime}{+}1}$ where $n_{i}\geq j>j^{\prime}\geq n_{i}{-}B$ and $p_{i,j}X_{i,j}=p_{i,j^{\prime}}X_{i,j^{\prime}}$ (and $\beta=\alpha_{i,n_{i}}\alpha_{i,n_{i}-1}\cdots\alpha_{i,j{+}1}$ , $\delta=\alpha_{i,j^{\prime}}\alpha_{i,j^{\prime}-1}\cdots\alpha_{i,1}$ );

2. the sets $\textsc{DS}(\bar{p}\bar{X}\beta)$ and $\textsc{DS}(\bar{p}\bar{X}\beta\gamma)$ are equal, further being denoted by $\bar{Q}$ ;

3. for each $q\in\bar{Q}$ , if $\textsc{ES}(q\gamma)=\{q^{\prime}\}$ , then $\textsc{ES}(q^{\prime}\gamma)=\{q^{\prime}\}$ (and $q^{\prime}\in\bar{Q}$ );

4. each $q^{\prime}\in\bar{Q}$ belongs to $\textsc{DS}(q\gamma)$ for some self-containing $q\in\bar{Q}$ , where $q\in\bar{Q}$ is self-containing if $q\in\textsc{DS}(q\gamma)$ ;

5. there is a state $q^{\prime}\in\bar{Q}$ for which $q^{\prime}\gamma\delta\not\sim q^{\prime}\delta$ and $q^{\prime}\gamma\delta\sim_{0}q^{\prime}\delta$ .

Proof: We fix some $i$ with $n_{i}$ larger than a constant $B$ determined by $\mathcal{M}$ as described below (there are such $i$ by Claim 11). For convenience we put $p_{i,n_{i}}X_{i,n_{i}}=\bar{p}\bar{X}$ , $n_{i}=n$ , and $\alpha_{i,j}=\bar{\alpha}_{j}$ , hence the final configuration in (5) is $p_{i,n_{i}}X_{i,n_{i}}\alpha_{i,n_{i}}\alpha_{i,n_{i}-1}\cdots\alpha_{i,1}=\bar{p}\bar{X}\bar{\alpha}_{n}\bar{\alpha}_{n-1}\cdots\bar{\alpha}_{1}$ . We view the $n{+}1$ prefixes

$\bar{p}\bar{X},\ \bar{p}\bar{X}\bar{\alpha}_{n},\ \bar{p}\bar{X}\bar{\alpha}_{n}\bar{\alpha}_{n-1},\ \bar{p}\bar{X}\bar{\alpha}_{n}\bar{\alpha}_{n-1}\bar{\alpha}_{n-2},\ \dots,\ \bar{p}\bar{X}\bar{\alpha}_{n}\bar{\alpha}_{n-1}\cdots\bar{\alpha}_{1}$

as the vertices of a complete graph with coloured edges.

For $\bar{p}\bar{X}\bar{\alpha}_{n}\bar{\alpha}_{n-1}\cdots\bar{\alpha}_{1}=\bar{p}\bar{X}\mu\nu\rho$ , where $\mu=\bar{\alpha}_{n}\bar{\alpha}_{n-1}\cdots\bar{\alpha}_{j{+}1}$ , $\nu=\bar{\alpha}_{j}\bar{\alpha}_{j-1}\cdots\bar{\alpha}_{j^{\prime}{+}1}$ , and $\rho=\bar{\alpha}_{j^{\prime}}\bar{\alpha}_{j^{\prime}-1}\cdots\bar{\alpha}_{1}$ , $n\geq j>j^{\prime}\geq 0$ , the edge between the vertices $\bar{p}\bar{X}\mu$ and $\bar{p}\bar{X}\mu\nu$ has the following tuple as its colour:

$\left(\,p_{i,j}X_{i,j},\ p_{i,j^{\prime}}X_{i,j^{\prime}},\ \textsc{DS}(\bar{p}\bar{X}\mu),\ \textsc{DS}(\bar{p}\bar{X}\mu\nu),\ (\textsc{DS}(q\nu),\textsc{ES}(q\nu))_{q\in\textsc{DS}(\bar{p}\bar{X}\mu)},\ \textsc{Q}_{\not\sim},\ \textsc{Q}_{0}\right)$

where $\textsc{Q}_{\not\sim}=\{q^{\prime}\in\textsc{DS}(\bar{p}\bar{X}\mu)\mid q^{\prime}\nu\rho\not\sim q^{\prime}\rho\}$ and $\textsc{Q}_{0}=\{q^{\prime}\in\textsc{Q}_{\not\sim}\mid q^{\prime}\nu\rho\sim_{0}q^{\prime}\rho\}$ (and $p_{i,j}X_{i,j},\ p_{i,j^{\prime}}X_{i,j^{\prime}}$ are taken from (5)).

Since the set of colours is bounded (by a constant determined by $\mathcal{M}$ ), Ramsey’s theorem yields a bound $B$ guaranteeing that there is a monochromatic clique of size $3$ among the vertices $\bar{p}\bar{X}$ , $\bar{p}\bar{X}\bar{\alpha}_{n}$ , $\bar{p}\bar{X}\bar{\alpha}_{n}\bar{\alpha}_{n-1}$ , $\dots$ , $\bar{p}\bar{X}\bar{\alpha}_{n}\bar{\alpha}_{n-1}\cdots\bar{\alpha}_{n-B}$ . (We have soundly chosen $i$ so that $n=n_{i}$ is bigger than $B$ .) We fix such a monochromatic clique MC, denoting its $3$ vertices as

$\bar{p}\bar{X}\beta$ , $\bar{p}\bar{X}\beta\gamma$ , $\bar{p}\bar{X}\beta\gamma\bar{\gamma}$ , and its colour as $\,\textsc{C}=(p^{\prime}X^{\prime},p^{\prime}X^{\prime},\bar{Q},\bar{Q},(\mathcal{D}_{q},\mathcal{E}_{q})_{q\in\bar{Q}},Q^{\prime},Q^{\prime}_{0})$ .

This is sound, since the fact that both edges $\{\bar{p}\bar{X}\beta,\bar{p}\bar{X}\beta\gamma\}$ and $\{\bar{p}\bar{X}\beta\gamma,\bar{p}\bar{X}\beta\gamma\bar{\gamma}\}$ have the same colour entails that the first component in this colour is the same as the second component, and the third component is the same as the fourth component.
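The Ramsey-type step can be illustrated on a toy scale by a brute-force search for a monochromatic clique of size $3$ in an edge-coloured complete graph. The Python sketch below is only an illustration under our own assumptions: it uses two colours and tiny graphs, whereas the proof above works with a larger (but still bounded) set of colour tuples, and the function names are hypothetical.

```python
from itertools import combinations, product

def find_monochromatic_triangle(num_vertices, colour):
    """Return vertices (a, b, c) whose three edges share one colour, or None.

    `colour` maps each pair (i, j) with i < j to an arbitrary hashable colour.
    """
    for a, b, c in combinations(range(num_vertices), 3):
        if colour[(a, b)] == colour[(b, c)] == colour[(a, c)]:
            return (a, b, c)
    return None

def every_two_colouring_has_triangle(num_vertices):
    """Exhaustively test all 2-colourings of the complete graph on num_vertices."""
    edges = list(combinations(range(num_vertices), 2))
    return all(
        find_monochromatic_triangle(num_vertices, dict(zip(edges, colours))) is not None
        for colours in product((0, 1), repeat=len(edges))
    )
```

For two colours the threshold is the classical Ramsey number $R(3,3)=6$: the exhaustive check succeeds for $6$ vertices and fails for $5$. In the proof, the analogous threshold $B$ depends on the number of possible colour tuples and hence only on $\mathcal{M}$.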

We now show that the conditions $1$ – $5$ are satisfied for the presentation of $\bar{p}\bar{X}\bar{\alpha}_{n}\bar{\alpha}_{n-1}\cdots\bar{\alpha}_{1}$ as $\bar{p}\bar{X}\beta\gamma\delta$ , where $\delta=\bar{\gamma}\bar{\alpha}_{k}\bar{\alpha}_{k-1}\cdots\bar{\alpha}_{1}$ for the respective $k$ .

Conditions $1$ and $2$ are trivial (due to the colour C).

Condition 3: Let $q\in\bar{Q}$ and $\textsc{ES}(q\gamma)=\{q^{\prime}\}$ (hence also $q^{\prime}\in\bar{Q}$ ). Then $\mathcal{E}_{q}=\textsc{ES}(q\gamma)=\textsc{ES}(q\gamma\bar{\gamma})=\{q^{\prime}\}$ (since MC is monochromatic). This entails $\textsc{ES}(q^{\prime}\bar{\gamma})=\{q^{\prime}\}$ , hence $\mathcal{E}_{q^{\prime}}=\{q^{\prime}\}$ , which in turn entails $\textsc{ES}(q^{\prime}\gamma)=\{q^{\prime}\}$ .

Condition $4$ : We first note a general fact: $\textsc{DS}(p\mu\nu)=\bigcup_{q\in\textsc{DS}(p\mu)}\textsc{DS}(q\nu)$ . Since $\bar{Q}=\textsc{DS}(\bar{p}\bar{X}\beta)=\textsc{DS}(\bar{p}\bar{X}\beta\gamma)=\textsc{DS}(\bar{p}\bar{X}\beta\gamma\bar{\gamma})$ , for each $q^{\prime}\in\bar{Q}$ there is thus $q\in\bar{Q}$ such that $q^{\prime}\in\mathcal{D}_{q}$ . We also have the following “transitivity”: if $q_{1},q_{2},q_{3}\in\bar{Q}$ , $q_{1}\in\mathcal{D}_{q_{2}}$ , and $q_{2}\in\mathcal{D}_{q_{3}}$ , then $q_{1}\in\mathcal{D}_{q_{3}}$ (since MC is monochromatic). For any $q^{\prime}\in\bar{Q}$ there is clearly a “chain” $q^{\prime}=q_{1},q_{2},q_{3},\dots,q_{\ell}$ where $\ell>1$ , $q_{j}\in\mathcal{D}_{q_{j+1}}$ for all $j\in[1,\ell{-}1]$ , and $q_{j}=q_{\ell}$ for some $j<\ell$ . By the above transitivity, $q_{\ell}$ is self-containing ( $q_{\ell}\in\mathcal{D}_{q_{\ell}}$ and thus $q_{\ell}\in\textsc{DS}(q_{\ell}\gamma)$ ) and $q^{\prime}\in\mathcal{D}_{q_{\ell}}$ (hence $q^{\prime}\in\textsc{DS}(q_{\ell}\gamma)$ ).

Condition $5$ : For any three configurations at least two belong to the same $\sim_{0}$ -class. Since the edges among the vertices $\bar{p}\bar{X}\beta$ , $\bar{p}\bar{X}\beta\gamma$ , $\bar{p}\bar{X}\beta\gamma\bar{\gamma}$ have the same $Q^{\prime}_{0}$ in their colour C, we get that $Q^{\prime}_{0}=Q^{\prime}$ , and thus also $q^{\prime}\gamma\delta\sim_{0}q^{\prime}\delta$ for all $q^{\prime}\in\bar{Q}$ such that $q^{\prime}\gamma\delta\not\sim q^{\prime}\delta$ . Now if for all $q^{\prime}\in\bar{Q}$ we had $q^{\prime}\gamma\delta\sim q^{\prime}\delta$ (which includes the case $\bar{Q}=\emptyset$ ), then we would get a contradiction with our choice of (5) since it could have been shortened as in Claim 10. ∎

Now we are already close to lemma 5:

Claim 13.

There are $v\in\Sigma^{*}$ , $x,w,y,z\in\Sigma^{+}$ , $p,q\in Q$ , $X\in\Gamma$ , $\gamma\in\Gamma^{+}$ , $\delta\in\Gamma^{*}$ such that $p_{0}X_{0}\xrightarrow{v}pX\delta$ , $pX\xrightarrow{x}pX\gamma$ , $pX\xrightarrow{w}q$ , $q\gamma\xrightarrow{y}q$ , and

• either $z\in\mathcal{L}(q\delta)$ and $z\not\in\mathcal{L}(q\gamma^{\ell}\delta)$ for all $\ell>0$ ,

• or $z\not\in\mathcal{L}(q\delta)$ and $z\in\mathcal{L}(q\gamma^{\ell}\delta)$ for all $\ell>0$ .

Proof: We fix one $\bar{p}\bar{X}\beta\gamma\delta$ guaranteed by Claim 12 (satisfying the respective conditions $1$ – $5$ ). There are $v\in\Sigma^{*}$ , $x,w,y,\bar{z}\in\Sigma^{+}$ , $p,q\in Q$ , $X\in\Gamma$ , $\gamma\in\Gamma^{+}$ , $\delta\in\Gamma^{*}$ , $q^{\prime}\in\textsc{DS}(q\gamma)$ such that

$p_{0}X_{0}\xrightarrow{v}pX\delta$ , $pX\xrightarrow{x}pX\gamma$ , $pX\xrightarrow{w}q$ , $q\gamma\xrightarrow{y}q$ , and $\mathcal{L}(q^{\prime}\gamma\delta)$ and $\mathcal{L}(q^{\prime}\delta)$ differ on $\bar{z}$

(i.e., $\bar{z}\in(\mathcal{L}(q^{\prime}\gamma\delta)\smallsetminus\mathcal{L}(q^{\prime}\delta))\cup(\mathcal{L}(q^{\prime}\delta)\smallsetminus\mathcal{L}(q^{\prime}\gamma\delta))$ ).
(Indeed: The respective computation (5) can be written $p_{0}X_{0}\xrightarrow{v}pX\delta\xrightarrow{x}pX\gamma\delta\xrightarrow{w^{\prime}}\bar{p}\bar{X}\beta\gamma\delta$ where $x$ and $\gamma$ are nonempty. The claimed $q^{\prime}$ and [nonempty] $\bar{z}$ are guaranteed by condition $5$ in Claim 12, and $q$ is a respective self-containing state from condition $4$ . Since $q\in\textsc{DS}(\bar{p}\bar{X}\beta)$ and $q\in\textsc{DS}(q\gamma)$ , we get $pX\gamma\delta\xrightarrow{w^{\prime}w^{\prime\prime}}q\gamma\delta\xrightarrow{y}q\delta$ , where $w^{\prime\prime}\neq\varepsilon$ . We also have $y\neq\varepsilon$ , since otherwise $\textsc{DS}(q\gamma)=\textsc{ES}(q\gamma)=\{q\}$ , $q^{\prime}=q$ , and we could not have $q\gamma\delta\not\sim q\delta$ and $q\gamma\delta\sim_{0}q\delta$ .)

Since $q^{\prime}\in\textsc{DS}(q\gamma)$ , we can fix $z^{\prime}$ such that $q\gamma\xrightarrow{z^{\prime}}q^{\prime}$ . Hence the languages $\mathcal{L}(q\gamma\gamma\delta)$ and $\mathcal{L}(q\gamma\delta)$ differ on $z=z^{\prime}\bar{z}$ ; more generally, $\mathcal{L}(q\gamma^{\ell+1}\gamma\delta)$ and $\mathcal{L}(q\gamma^{\ell}\gamma\delta)$ differ on $y^{\ell}z$ for all $\ell\geq 0$ . Now we aim to find out for which $\ell$ we have $z\in\mathcal{L}(q\gamma^{\ell}\delta)$ .

We recall that $\bar{Q}=\textsc{DS}(\bar{p}\bar{X}\beta)=\textsc{DS}(\bar{p}\bar{X}\beta\gamma)$ ; hence $\bigcup_{\bar{q}\in\bar{Q}}\textsc{DS}(\bar{q}\gamma)=\bar{Q}$ . Since $q\in\bar{Q}$ , we get that $\textsc{DS}(q\gamma^{d})\subseteq\bar{Q}$ for all $d\in\mathbb{N}$ (by induction). We now distinguish two cases:

1. For each prefix $z_{1}$ of $z$ and each $d\leq|z|$ we have: if $q\gamma^{d}\xrightarrow{z_{1}}\bar{q}$ , then $\textsc{ES}(\bar{q}\gamma)=\emptyset$ .

2. There are a prefix $z_{1}$ of $z$ , $d\leq|z|$ , and $\bar{q},q^{\prime\prime}\in\bar{Q}$ such that $q\gamma^{d}\xrightarrow{z_{1}}\bar{q}$ and $\textsc{ES}(\bar{q}\gamma)=\{q^{\prime\prime}\}$ .

In the case $1$ we clearly have either $\forall\ell>|z|:z\in\mathcal{L}(q\gamma^{\ell}\delta)$ or $\forall\ell>|z|:z\not\in\mathcal{L}(q\gamma^{\ell}\delta)$ (here $\delta$ plays no role). In the case $2$ we recall that $\bar{q}\gamma\xrightarrow{\varepsilon}q^{\prime\prime}$ entails that $\bar{q}\gamma^{k}\delta\xrightarrow{\varepsilon}q^{\prime\prime}\delta$ for all $k\geq 1$ (since $\textsc{ES}(q^{\prime\prime}\gamma)=\{q^{\prime\prime}\}$ by condition $3$ in Claim 12). Hence we have either $\forall\ell>|z|+1:z\in\mathcal{L}(q\gamma^{\ell}\delta)$ or $\forall\ell>|z|+1:z\not\in\mathcal{L}(q\gamma^{\ell}\delta)$ .

Since $\mathcal{L}(q\gamma^{2}\delta)$ and $\mathcal{L}(q\gamma^{1}\delta)$ differ on $z$ , we deduce that there is $\ell_{0}\geq 1$ such that either $z\in\mathcal{L}(q\gamma^{\ell_{0}}\delta)$ and $z\not\in\mathcal{L}(q\gamma^{\ell}\delta)$ for all $\ell>\ell_{0}$ , or $z\not\in\mathcal{L}(q\gamma^{\ell_{0}}\delta)$ and $z\in\mathcal{L}(q\gamma^{\ell}\delta)$ for all $\ell>\ell_{0}$ . Hence for $\bar{\delta}=\gamma^{\ell_{0}}\delta$ we have either $z\in\mathcal{L}(q\bar{\delta})$ and $z\not\in\mathcal{L}(q\gamma^{\ell}\bar{\delta})$ for all $\ell>0$ , or $z\not\in\mathcal{L}(q\bar{\delta})$ and $z\in\mathcal{L}(q\gamma^{\ell}\bar{\delta})$ for all $\ell>0$ . Since for $\bar{v}=vx^{\ell_{0}}$ we have $p_{0}X_{0}\xrightarrow{\bar{v}}pX\bar{\delta}$ , the claim is proven. ∎

Claim 13 is a weaker version of lemma 5; it shows that there is $L^{\prime}\in\{L,\overline{L}\}$ such that $vx^{m}wy^{m}z\in L^{\prime}$ and $vx^{m}wy^{n}z\not\in L^{\prime}$ for $m>n$ . To handle the case $m<n$ , we have to find out for which $\ell$ we have $y^{\ell}z\in\mathcal{L}(q\delta)$ . We thus look at the computation from $q\delta$ on the infinite word $y^{\omega}$ (recalling our convention that this computation is infinite, stepwise reading the word $yyy\cdots$ ), and use the obvious fact that after a prefix this computation becomes “periodic” (either cycling among finitely many configurations, or increasing the stack forever).

Claim 14.

For any configuration $q\delta$ and words $y,z$ there are numbers $k\geq 0$ and $\textsc{p}>0$ (“period”) such that for all $\ell\geq k$ the remainder $(\ell\bmod\textsc{p})$ determines whether or not $\mathcal{L}(q\delta)\ni y^{\ell}z$ .

Proof: We assume $y\neq\varepsilon$ (otherwise the claim is trivial). For the infinite computation from $q\delta$ on $yyy\cdots$ there are obviously $k_{1}\geq 0$ , $k_{2}>0$ , $\bar{q}\in Q$ , and $\rho,\mu,\nu\in\Gamma^{*}$ such that the computation can be written $q\delta\xrightarrow{y^{k_{1}}}\bar{q}\rho\nu\xrightarrow{y^{k_{2}}}\bar{q}\rho\mu\nu\xrightarrow{y^{k_{2}}}\bar{q}\rho\mu\mu\nu\xrightarrow{y^{k_{2}}}\bar{q}\rho\mu\mu\mu\nu\xrightarrow{y^{k_{2}}}\cdots$ where $\bar{q}\rho\xrightarrow{y^{k_{2}}}\bar{q}\rho\mu$ . (We have $\mu=\varepsilon$ if the computation visits only finitely many configurations, and otherwise we consider the stair-factorization of the computation.)

For each $j\in[0,k_{2}{-}1]$ we put $\bar{q}\rho\xrightarrow{y^{j}}\bar{q}\rho_{j}$ , and we have two possible cases:

1. There is $d_{0}\geq 0$ such that for all $d\geq d_{0}$ performing $z$ from $\bar{q}\rho_{j}\mu^{d}\nu$ does not reach $\nu$ at the bottom.

2. There are $d_{0}\geq 0$ , a prefix $z^{\prime}$ of $z$ , $q^{\prime}\in Q$ , and $\bar{d}\in[1,|Q|]$ such that $\bar{q}\rho_{j}\mu^{d_{0}}\xrightarrow{z^{\prime}}q^{\prime}$ and $q^{\prime}\mu^{\bar{d}}\xrightarrow{\varepsilon}q^{\prime}$ .

In the case $1$ either $\mathcal{L}(q\delta)\ni y^{d\cdot k_{2}+j}z$ for all $d\geq d_{0}$ , or $\mathcal{L}(q\delta)\not\ni y^{d\cdot k_{2}+j}z$ for all $d\geq d_{0}$ .
In the case $2$ , for each $d\geq 0$ we have $q^{\prime}\mu^{d}\xrightarrow{\varepsilon}q_{d}$ where $q_{d_{1}}=q_{d_{2}}$ if $d_{1}\equiv d_{2}\ (\bmod\ \bar{d})$ . Hence for each $d\geq d_{0}$ , the (non)membership of $y^{d\cdot k_{2}+j}z$ in $\mathcal{L}(q\delta)$ is determined by $(d\bmod\bar{d})$ .

The claim is thus clear. ∎

Now we finish the proof of lemma 5. We take the notation from Claim 13; for the respective $q\delta,y,z$ we add $k,\textsc{p}$ from Claim 14. Let $k_{0}$ be a multiple of $\textsc{p}$ that is bigger than $k$ . We now view $x^{k_{0}}$ , $y^{k_{0}}$ , $\gamma^{k_{0}}$ as new $x,y,\gamma$ , respectively. Claims 13 and 14 now yield the statement of lemma 5.

4 Conclusion and Open Problems

In this paper, we have introduced a new notion of the $\mathcal{C}$ -simple problem that reduces to each problem in $\mathcal{C}$ , being thus a conceptual counterpart to the $\mathcal{C}$ -hard problem to which each problem in $\mathcal{C}$ reduces. We have illustrated this concept on the definition of the DCFL^′-simple problem that reduces to each DCFL^′ language under the truth-table reduction by Mealy machines. We have proven that the DCFL^′ language $L_{\#}=\{0^{n}1^{n}\mid n\geq 1\}$ is DCFL^′-simple, and thus represents the simplest languages in the class DCFL^′. This result finds its application in expanding the known lower bound for $L_{\#}$ , namely that $L_{\#}$ cannot be recognized by the neural network model 1ANN, to all DCFL^′ languages. Moreover, the class DCFLS of DCFL^′-simple problems containing the regular languages is a strict subclass of DCFL and has similar closure properties as DCFL.

We note that the hardest context-free language $L_{0}$ by Greibach [2], where each $L$ in CFL is an inverse homomorphic image of $L_{0}$ or $L_{0}\smallsetminus\{\varepsilon\}$ , can be viewed as CFL-hard w.r.t. a many-one reduction based on Mealy machines realizing the respective homomorphisms. Our aims in the definition of DCFL^′-simple problems cannot be achieved by such a many-one reduction, hence we have generalized it to a truth-table reduction. We can alternatively consider a general Turing reduction that is implemented by a Mealy machine which queries the oracle at special query states, each associated with a corresponding query suffix, while its next transition from the query state depends on the given oracle answer. The oracle Mealy machine then accepts an input word if it reaches an accept state after reading the input. The language $L_{\#}$ proves to be DCFL^′-simple under this Turing reduction allowing for an unbounded number of online oracle queries; this can be shown by Claim 13 (a weaker version of lemma 5).

It is natural to try extending our result to non-regular nondeterministic (or at least unambiguous) context-free languages, by possibly showing that $L_{\#}$ is CFL^′-simple. Another important challenge for further research is looking for $\mathcal{C}$ -simple problems for other complexity classes $\mathcal{C}$ and suitable reductions. This could provide an effective tool for strengthening lower-bound results known for single problems to the whole classes of problems, which deserves a deeper study.

Acknowledgements

Presented research has been partially supported by the Czech Science Foundation, grant GA19-05704S, and by the institutional support RVO: 67985807 (J. Šíma). J. Šíma also thanks Martin Plátek for his intensive collaboration at the first stages of this research.

References

[1] Anabtawi, M., Hassan, S., Kapoutsis, C.A., Zakzok, M.: An oracle hierarchy for small one-way finite automata. In: Proceedings of LATA 2019. pp. 57–69. LNCS 11417, Springer (2019). https://doi.org/10.1007/978-3-030-13435-8_4
[2] Greibach, S.A.: The hardest context-free language. SIAM J. Comput. 2(4), 304–310 (1973). https://doi.org/10.1137/0202025
[3] Hopcroft, J.E., Ullman, J.D.: Formal languages and their relation to automata. Addison-Wesley (1969), https://www.worldcat.org/oclc/00005012
[4] Jančar, P.: Deciding semantic finiteness of pushdown processes and first-order grammars w.r.t. bisimulation equivalence. J. Comput. Syst. Sci. 109, 22–44 (2020). https://doi.org/10.1016/j.jcss.2019.10.002
[5] Jančar, P., Mráz, F., Plátek, M., Vogel, J.: Restarting automata. In: Proceedings of FCT 1995. pp. 283–292. LNCS 965, Springer (1995). https://doi.org/10.1007/3-540-60249-6_60
[6] Mráz, F., Pardubská, D., Plátek, M., Šíma, J.: Pumping deterministic monotone restarting automata and DCFL. In: Proceedings of ITAT 2020. pp. 51–58. CEUR Workshop Proceedings 2718 (2020), http://ceur-ws.org/Vol-2718/paper13.pdf
[7] Reinhardt, K.: Hierarchies over the context-free languages. In: Proceedings of IMYCS 1990. pp. 214–224. LNCS 464, Springer (1990). https://doi.org/10.1007/3-540-53414-8_44
[8] Siegelmann, H.T.: Neural networks and analog computation – Beyond the Turing limit. Birkhäuser (1999)
[9] Šíma, J.: Analog neuron hierarchy. Neural Netw. 128, 199–215 (2020). https://doi.org/10.1016/j.neunet.2020.05.006
[10] Šíma, J.: Stronger separation of analog neuron hierarchy by deterministic context-free languages (2021), arXiv:2102.01633 (submitted to a journal)
[11] Šíma, J., Orponen, P.: General-purpose computation with neural networks: A survey of complexity theoretic results. Neural Comput. 15(12), 2727–2778 (2003). https://doi.org/10.1162/089976603322518731
[12] Šíma, J., Plátek, M.: One analog neuron cannot recognize deterministic context-free languages. In: Proceedings of ICONIP 2019, Part III. pp. 77–89. LNCS 11955, Springer (2019). https://doi.org/10.1007/978-3-030-36718-3_7
[13] Yamakami, T.: Oracle pushdown automata, nondeterministic reducibilities, and the hierarchy over the family of context-free languages. In: Proceedings of SOFSEM 2014. pp. 514–525. LNCS 8327, Springer (2014). https://doi.org/10.1007/978-3-319-04298-5_45, (full version arXiv:1303.1717)