This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\secondaddress

Università di Roma “La Sapienza”, Dipartimento di Matematica, Piazzale Aldo Moro 2, 00185 Roma, Italy

\sameaddress

1

LANGUAGE CLASSES ASSOCIATED WITH AUTOMATA OVER MATRIX GROUPS

Özlem Salehi ozlem.salehi@boun.edu.tr & say@boun.edu.tr Boğaziçi University, Department of Computer Engineering, Bebek 34342 İstanbul, Turkey Flavio D’Alessandro Boğaziçi University, Department of Mathematics, Bebek 34342, İstanbul, Turkey dalessan@mat.uniroma1.it  and  A. C. Cem Say
(Date: …)
Abstract.

We investigate the language classes recognized by group automata over matrix groups. For the case of 2×22\times 2 matrices, we prove that the corresponding group automata for rational matrix groups are more powerful than the corresponding group automata for integer matrix groups. Finite automata over some special matrix groups, such as the discrete Heisenberg group and the Baumslag-Solitar group are also examined. We also introduce the notion of time complexity for group automata and demonstrate some separations among related classes. The case of linear-time bounds is examined in detail throughout our repertory of matrix group automata.

Key words and phrases:
group automata, time complexity
1991 Mathematics Subject Classification:
68Q45, 68Q05
This work was supported by Boğaziçi University Research Fund under grant number 11760. A preliminary version of this work was presented at the 8’th Workshop on Non-Classical Models of Automata and Applications (NCMA), Debrecen, Hungary, August 29-30, 2016.
Özlem Salehi was partially supported by TÜBİTAK (Scientific and Technological Research Council of Turkey).
The research of F. D’Alessandro was supported by a EC-FP7 Marie Curie-TÜBİTAK Co-Funded Brain Circulation Scheme Project 2236 Fellowship.

1. Introduction

Many extensions of the classical finite automaton model have been examined. One such variant is the group automaton (finite automaton over groups), which is a nondeterministic finite automaton equipped with a register that holds an element from a group [18]. The register is initialized to the identity element of the group, and a computation is deemed successful if the register is equal to the identity element at the end of the computation after being multiplied at every step. This setup generalizes various models such as nondeterministic blind multicounter automata [8] and finite automata with multiplication [13].

The theory of group automata has been essentially developed in the case of free groups [5, 4, 14], and in the case of free Abelian groups [7, 6], where strong theorems allow to characterize the power of such models and the combinatorial properties of the languages recognized by these automata. For groups that are not of the types mentioned above, even in the case of groups of matrices of low dimension, the study of group automata quickly beomes nontrivial, and there are remarkable classes of linear groups for which little is known about the automaton models that they define.

In this paper, we present several new results about the classes of languages recognized by finite automata over matrix groups. We focus on matrix groups with integer and rational entries. For the case of 2×22\times 2 matrices, we prove that the corresponding group automata for rational matrix groups are more powerful than the corresponding group automata for integer matrix groups. We also explore finite automata over some special matrix groups, such as the discrete Heisenberg group and the Baumslag-Solitar group. The “zoo” of language classes associated with different groups is presented, visualizing known relationships and open problems.

We also introduce the notion of time complexity for group automata, and use this additional dimension to analyze the relationships among the language families of various automata using different groups. We develop a method for proving that automata over matrix groups where the growth rate of the group and the time are bounded can not recognize certain languages, even if one uses a very weak definition of time-bounded computation, and use this to demonstrate some new relationships between time-bounded versions of our language classes. The case of linear-time bounds is examined in detail throughout our repertory of matrix groups.

2. Preliminaries

2.1. Notation and terminology

The following notation will be used throughout the paper: QQ is the set of states, q0Qq_{0}\in Q denotes the initial state, QaQQ_{a}\subseteq Q denotes the set of accepting states, and Σ\Sigma is the input alphabet.

By wrw^{r}, we represent the reverse of the string ww. The length of ww is denoted by |w||w|.

𝖱𝖤𝖦\mathsf{REG}, 𝖢𝖥\mathsf{CF}, and 𝖱𝖤\mathsf{RE} denote the families of regular languages, context-free languages, and recursively enumerable languages, respectively.

We assume a familiarity with some basic notions from algebra and group theory (see [9],[17] for references on this topic). For a finitely generated group GG and a set XX of generators, the word problem language of GG is the language W(G,X)W(G,X) over Σ={XX1}\Sigma=\{X\cup X^{-1}\} which consists of all words that represent the identity element of GG. Most of the time, the statements about the word problem are independent of the generating set and in these cases the word problem language is denoted by W(G)W(G). For a string w=w1w2wnW(G)w=w_{1}w_{2}\dots w_{n}\in W(G), w1=wn1w11w^{-1}=w_{n}^{-1}\dots w_{1}^{-1} where each wiΣw_{i}\in\Sigma represents a generator.

2.2. Group automata

Group automata first appear explicitly in the paper [18] under the name of extended finite automaton. The definition is formally given as follows.

Let K=(M,,e)K=(M,\circ,e) be a group under the operation denoted by \circ with the neutral element denoted by ee. An extended finite automaton over the group K=(M,,e)K=(M,\circ,e) is a 6-tuple

=(Q,Σ,K,δ,q0,Qa),\mathcal{F}=(Q,\Sigma,K,\delta,q_{0},Q_{a}),

where the transition function δ\delta is defined as

δ:Q×(Σ{ε})(Q×M).\delta:Q\times(\Sigma\cup\{\varepsilon\})\rightarrow\mathbb{P}(Q\times M).

δ(q,σ)(q,m)\delta(q,\sigma)\ni(q^{\prime},m) means that when \mathcal{F} reads the symbol (or empty string) σΣ{ε}\sigma\in\Sigma\cup\{\varepsilon\} in state qq, it can move to state qq^{\prime}, and write xmx\circ m in the register, where xx is the old content of the register. The initial value of the register is the neutral element ee of the group KK. The string is accepted if, after completely reading the string, \mathcal{F} enters an accept state with the content of the register being equal to the neutral element of KK.

We will prefer using the name group automaton (GG-automaton) instead of extended finite automaton over group GG.

Monoid automata are defined analogously where the group GG is replaced by some monoid NN.

The class of languages recognized by GG-automata will be denoted as 𝔏(G)\mathfrak{L}(G).

3. Matrix groups and associated language classes

In this section, we are going to prove some new results about the classes of languages recognized by finite automata over various groups, focusing on linear groups.

3.1. Basic results

We will denote the free group over rr generators by 𝐅r\mathbf{F}_{r}. Note that 𝐅0\mathbf{F}_{0} is the trivial group, and 𝐅1\mathbf{F}_{1} is isomorphic to \mathbb{Z}, the additive group of integers. The class of regular languages is characterized as the set of languages recognized by finite automata over the trivial group 𝐅0\mathbf{F}_{0} in [5].

The relation between the classes of languages recognized by free group automata is summarized as follows.

Fact \thethrm.

[5] 𝖱𝖤𝖦=𝔏(𝐅0)𝔏(𝐅1)=𝔏()𝔏(𝐅2)\mathsf{REG}=\mathfrak{L}(\mathbf{F}_{0})\subsetneq\mathfrak{L}(\mathbf{F}_{1})=\mathfrak{L}(\mathbb{Z})\subsetneq\mathfrak{L}(\mathbf{F}_{2}).

A characterization of context-free languages by group automata was first stated by Dassow and Mitrana [5], and proven in [4]. Let us note that 𝐅2\mathbf{F}_{2} contains any free group of rank n2n\geq 2 [17].

Fact \thethrm.

[5, 4, 14] 𝔏(𝐅2)\mathfrak{L}(\mathbf{F}_{2}) is the family of context-free languages.

We will denote by k\mathbb{Z}^{k} the additive group of integer vectors of dimension kk. This group is isomorphic to the free Abelian group of rank kk, and k\mathbb{Z}^{k}-automata are equivalent to nondeterministic blind kk-counter automata [11].

The following result states the hierarchy between the classes of languages recognized by k\mathbb{Z}^{k}-automata. This result also follows from the hierarchy between the class of languages recognized by nondeterministic blind kk-counter automata.

Fact \thethrm.

[3] 𝔏(k)𝔏(k+1)\mathfrak{L}(\mathbb{Z}^{k})\subsetneq\mathfrak{L}(\mathbb{Z}^{k+1}) for k1k\geq 1.

We denote by +\mathbb{Q}^{+} the multiplicative group of positive rational numbers, which is isomorphic to a free Abelian group of infinite rank. A +\mathbb{Q}^{+}-automaton is also equivalent to a nondeterministic finite automaton with multiplication without equality (1NFAMW) of Ibarra et al. [13].

The following fact characterizes the class of languages recognized for the case where the alphabet is unary,

Fact \thethrm.

[13] All 1NFAMW-recognizable languages over a unary alphabet are regular.

Let us mention that the class of context-free languages and the class of languages recognized by nondeterministic blind counter automata are incomparable.

Fact \thethrm.

𝖢𝖥\mathsf{CF} and 𝔏(k)\mathfrak{L}(\mathbb{Z}^{k}) are incomparable for all k2k\geq 2.

Proof.

Consider the language 𝙻={anbn|n0}\mathtt{L}=\{a^{n}b^{n}|n\geq 0\} which is a context-free language. Since context-free languages are closed under star, 𝙻\mathtt{L}^{*} is a context-free language whereas it cannot be recognized by any k\mathbb{Z}^{k}-automaton for all k1k\geq 1 by [11]. On the other hand, the non-context-free language 𝙻={anbncn|n0}\mathtt{L}^{\prime}=\{a^{n}b^{n}c^{n}|n\geq 0\} can be recognized by a 2\mathbb{Z}^{2}-automaton. ∎

3.2. Automata on groups of 2×22\times 2 and 3×33\times 3 matrices

We denote by GL(2,)GL(2,\mathbb{Z}) the general linear group of degree two over the field of integers, that is, the group of 2×22\times 2 invertible matrices with integer entries. Note that these matrices have determinant ±1\pm 1. Restricting the matrices in GL(2,)GL(2,\mathbb{Z}) to those that have determinant 1, we obtain the special linear group of degree two over the field of integers, SL(2,)SL(2,\mathbb{Z}).

Let 𝐆\mathbf{G} be the group generated by the matrices

Ma=[1201]andMb=[1021].M_{a}=\left[\begin{array}[]{cc}1&2\\ 0&1\\ \end{array}\right]~~~\mbox{and}~~~M_{b}=\left[\begin{array}[]{cc}1&0\\ 2&1\\ \end{array}\right].

There exists an isomorphism φ\varphi from 𝐅2\mathbf{F}_{2} onto 𝐆\mathbf{G} by [15]. Note that MaM_{a} and MbM_{b} are integer matrices with determinant 1, which proves that 𝐅2\mathbf{F}_{2} is a subgroup of SL(2,)SL(2,\mathbb{Z}).

Now the question is whether 𝔏(GL(2,))\mathfrak{L}(GL(2,\mathbb{Z})) and 𝔏(SL(2,))\mathfrak{L}(SL(2,\mathbb{Z})) correspond to larger classes of languages than the class of context-free languages. We are going to use the following fact to prove that the answer is negative.

Fact \thethrm.

[4] Suppose GG is a finitely generated group and HH is a subgroup of finite index. Then 𝔏(G)=𝔏(H)\mathfrak{L}(G)=\mathfrak{L}(H).

Now we are ready to state our theorem.

{thrm}

𝖢𝖥=𝔏(𝐅2)=𝔏(SL(2,))=𝔏(GL(2,))\mathsf{CF}=\mathfrak{L}(\mathbf{F}_{2})=\mathfrak{L}(SL(2,\mathbb{Z}))=\mathfrak{L}(GL(2,\mathbb{Z})).

Proof.

We are going to use Fact 3.2 to prove the result. Since SL(2,)SL(2,\mathbb{Z}) has index 2 in GL(2,)GL(2,\mathbb{Z}) and GL(2,)GL(2,\mathbb{Z}) is finitely generated, 𝔏(GL(2,))=𝔏(SL(2,))\mathfrak{L}(GL(2,\mathbb{Z}))=\mathfrak{L}(SL(2,\mathbb{Z})). Since 𝐅2\mathbf{F}_{2} has index 12 in SL(2,)SL(2,\mathbb{Z}) [2] and SL(2,)SL(2,\mathbb{Z}) is finitely generated, 𝔏(SL(2,))=𝔏(𝐅2)\mathfrak{L}(SL(2,\mathbb{Z}))=\mathfrak{L}(\mathbf{F}_{2}) which is equal to the family of context-free languages by Fact 3.1. ∎

Let us now investigate the group SL(3,)SL(3,\mathbb{Z}), the group of 3×33\times 3 integer matrices with determinant 11.

We start by looking at an important subgroup of SL(3,)SL(3,\mathbb{Z}), the discrete Heisenberg group. The discrete Heisenberg group 𝐇\mathbf{H} is defined as a,b|ab=bac,ac=ca,bc=cb\langle a,b|ab=bac,ac=ca,bc=cb\rangle, where c=a1b1abc=a^{-1}b^{-1}ab is the commutator of aa and bb.

a=[110010001]b=[100011001]c=[101010001]a=\left[\begin{array}[]{ccc}1&1&0\\ 0&1&0\\ 0&0&1\end{array}\right]~~~b=\left[\begin{array}[]{ccc}1&0&0\\ 0&1&1\\ 0&0&1\end{array}\right]~~~c=\left[\begin{array}[]{ccc}1&0&1\\ 0&1&0\\ 0&0&1\end{array}\right]

Any element g𝐇g\in\mathbf{H} can be written uniquely as bjaickb^{j}a^{i}c^{k}.

g=[1ik01j001]=bjaickg=\left[\begin{array}[]{ccc}1&i&k\\ 0&1&j\\ 0&0&1\end{array}\right]=b^{j}a^{i}c^{k}

It is shown in [19] that the languages 𝙼𝚄𝙻𝚃={xpyqzpq|p,q0}\mathtt{MULT}=\{x^{p}y^{q}z^{pq}|p,q\geq 0\}, 𝙲𝙾𝙼𝙿𝙾𝚂𝙸𝚃𝙴={xpq|p,q>1}\mathtt{COMPOSITE}=\{x^{pq}|p,q>1\} and 𝙼𝚄𝙻𝚃𝙸𝙿𝙻𝙴={xpypn|p}\mathtt{MULTIPLE}=\{x^{p}y^{pn}|p\in\mathbb{N}\} can be recognized by 𝐇\mathbf{H}-automata, using the special multiplication property of the group.

Correcting a small error in [19], we rewrite the multiplication property of the elements of 𝐇\mathbf{H}.

(bxaycz)(bxaycz)=bx+xay+ycz+z+yx(b^{x}a^{y}c^{z})(b^{x^{\prime}}a^{y^{\prime}}c^{z^{\prime}})=b^{x+x^{\prime}}a^{y+y^{\prime}}c^{z+z^{\prime}+yx^{\prime}}

We can make the following observation using the fact that 𝔏(𝐇)\mathfrak{L}(\mathbf{H}) contains non-context-free languages.

{thrm}

𝔏(SL(2,))𝔏(SL(3,))\mathfrak{L}(SL(2,\mathbb{Z}))\subsetneq\mathfrak{L}(SL(3,\mathbb{Z})).

Proof.

It is obvious that an SL(2,)SL(2,\mathbb{Z})-automaton can be simulated by an SL(3,)SL(3,\mathbb{Z})-automaton. Note that 𝔏(SL(2,))\mathfrak{L}(SL(2,\mathbb{Z})) is the family of context-free languages by Theorem 3.2. Since 𝔏(𝐇)𝔏(SL(3,))\mathfrak{L}(\mathbf{H})\subsetneq\mathfrak{L}(SL(3,\mathbb{Z})) and the non-context-free language 𝙼𝚄𝙻𝚃={xpyqzpq|p,q0}\mathtt{MULT}=\{x^{p}y^{q}z^{pq}|p,q\geq 0\} can be recognized by an 𝐇\mathbf{H}-automaton [19], the result follows. ∎

The following result is a direct consequence of Fact 3.2.

{thrm}

𝔏(SL(3,))=𝔏(GL(3,))\mathfrak{L}(SL(3,\mathbb{Z}))=\mathfrak{L}(GL(3,\mathbb{Z})).

Proof.

Since GL(3,)GL(3,\mathbb{Z}) is a finitely generated group and SL(3,)SL(3,\mathbb{Z}) has finite index in GL(3,)GL(3,\mathbb{Z}), the result follows by Fact 3.2. ∎

We have talked about the discrete Heisenberg group H. Now let us look at a subgroup of 𝐇\mathbf{H} generated by the matrices BB and CC, which we will call 𝐆𝟐\mathbf{G_{2}}.

B=[100011001]C=[101010001]B=\left[\begin{array}[]{ccc}1&0&0\\ 0&1&1\\ 0&0&1\end{array}\right]~~~C=\left[\begin{array}[]{ccc}1&0&1\\ 0&1&0\\ 0&0&1\end{array}\right]~~~

𝐆𝟐=B,C|BC=CB\mathbf{G_{2}}=\langle B,C|BC=CB\rangle is a free Abelian group of rank 2, and therefore it is isomorphic to 2\mathbb{Z}^{2}.

We conclude the following about the language recognition power of 2\mathbb{Z}^{2} and 𝐇\mathbf{H}.

{thrm}

𝔏(2)𝔏(𝐇)\mathfrak{L}(\mathbb{Z}^{2})\subsetneq\mathfrak{L}(\mathbf{H}).

Proof.

Since 2\mathbb{Z}^{2} is a subgroup of 𝐇\mathbf{H}, 𝔏(2)𝔏(𝐇)\mathfrak{L}(\mathbb{Z}^{2})\subseteq\mathfrak{L}(\mathbf{H}) follows. The inclusion is proper since 𝐇\mathbf{H} can recognize the language 𝙼𝚄𝙻𝚃={xpyqzpq|p,q0}\mathtt{MULT}=\{x^{p}y^{q}z^{pq}|p,q\geq 0\} [19], whereas any bounded language in 𝔏(+)\mathfrak{L}(\mathbb{Q}^{+}) is semilinear [13]. ∎

Now let us move on to the discussion about matrix groups with rational entries.

Let us denote by GL(2,)GL(2,\mathbb{Q}) the general linear group of degree two over the field of rational numbers, that is, the group of invertible matrices with rational entries. Restricting the matrices in GL(2,)GL(2,\mathbb{Q}) to those that have determinant 1, we obtain the special linear group of degree two over the field of rationals, SL(2,)SL(2,\mathbb{Q}).

We will start by proving that allowing rational entries enlarges the class of languages recognized by matrices with determinant 1.

{thrm}

𝔏(SL(2,))𝔏(SL(2,))\mathfrak{L}(SL(2,\mathbb{Z}))\subsetneq\mathfrak{L}(SL(2,\mathbb{Q})).

Proof.

It is obvious that 𝔏(SL(2,))𝔏(SL(2,))\mathfrak{L}(SL(2,\mathbb{Z}))\subseteq\mathfrak{L}(SL(2,\mathbb{Q})). We will prove that the inclusion is proper.

Let us construct an SL(2,)SL(2,\mathbb{Q})-automaton 𝒢\mathcal{G} recognizing the language 𝙻={a22n+1|\mathtt{L}=\{a^{2^{2n+1}}| n0}n\geq 0\}. The state diagram of 𝒢\mathcal{G} and the matrices are given in Figure 1. Without scanning any input symbol, 𝒢\mathcal{G} first multiplies its register with the matrix A1A_{1}. 𝒢\mathcal{G} then multiplies its register with the matrix A2A_{2} successively until nondeterministically moving to the next state. After that point, 𝒢\mathcal{G} starts reading the string and multiplies its register with the matrix A3A_{3} for each scanned aa. At some point, 𝒢\mathcal{G} nondeterministically stops reading the rest of the string and multiplies its register with the matrix A4A_{4}. After successive multiplications with A4A_{4}, 𝒢\mathcal{G} nondeterministically decides moving to an accept state.

Refer to caption
Figure 1. State diagram of 𝒢\mathcal{G} accepting the language 𝙻={a22n+1|n0}\mathtt{L}=\{a^{2^{2n+1}}|n\geq 0\}

Let us trace the value of the register at different stages of the computation. Before reading the first input symbol, the register has the value

[2x+102x12x+1]\left[\begin{array}[]{cc}2^{x+1}&0\\ 2^{x}&\frac{1}{2^{x+1}}\\ \end{array}\right]

as a result of the multiplications with the matrix A1A_{1} and xx times the matrix A2A_{2}. Multiplication with each A3A_{3} leaves 2x+12^{x+1} and 12x+1\frac{1}{2^{x+1}} unchanged while subtracting 12x+1\frac{1}{2^{x+1}} from 2x2^{x} for each scanned aa. As a result of yy multiplications with A3A_{3}, the register will have the value

[2x+102xy2x+112x+1].\left[\begin{array}[]{cc}2^{x+1}&0\\ 2^{x}-\frac{y}{2^{x+1}}&\frac{1}{2^{x+1}}\\ \end{array}\right].

For the rest of the computation, 𝒢\mathcal{G} will multiply its register with A4A_{4} until nondeterministically moving to the final state. As a result of zz multiplications with A4A_{4}, the register will have the value

[2x+12z0(2xy2x+1)12z2z2x+1].\left[\begin{array}[]{cc}\frac{2^{x+1}}{2^{z}}&0\\ \bigl{(}2^{x}-\frac{y}{2^{x+1}}\bigr{)}\frac{1}{2^{z}}&\frac{2^{z}}{2^{x+1}}\\ \end{array}\right].

The final value of the register is equal to the identity matrix when y=22x+1y=2^{2x+1} and z=x+1z=x+1, which is possible only when the length of the input string is 22x+12^{2x+1} for some x0x\geq 0. In the successful branch, the register will be equal to the identity matrix and 𝒢\mathcal{G} will end up in the final state having successfully read the input string. For input strings which are not members of 𝙻\mathtt{L}, either the computation will end before reading the whole input string, or the final state will be reached with the register value not equaling the identity matrix.

Since the matrices used during the computation are 2 by 2 rational matrices with determinant 1, 𝙻𝔏(SL(2,))\mathtt{L}\in\mathfrak{L}(SL(2,\mathbb{Q})). 𝔏(SL(2,))\mathfrak{L}(SL(2,\mathbb{Q})) contains a unary nonregular language, which is not true for 𝔏(SL(2,))\mathfrak{L}(SL(2,\mathbb{Z})) by Theorem 3.2 and we conclude the result. ∎

Let us note that the set of languages recognized by +\mathbb{Q}^{+}-automata is a proper subset of the set of languages recognized by SL(2,)SL(2,\mathbb{Q})-automata, which can be concluded with the help of the following fact.

{thrm}

𝔏(+)𝔏(SL(2,))\mathfrak{L}(\mathbb{Q}^{+})\subsetneq\mathfrak{L}(SL(2,\mathbb{Q})).

Proof.

Let 𝙻𝔏(+)\mathtt{L}\in\mathfrak{L}(\mathbb{Q}^{+}) and let 𝒢\mathcal{G} be a +\mathbb{Q}^{+}-automaton recognizing 𝙻\mathtt{L}. We will construct an SL(2,)SL(2,\mathbb{Q})-automaton 𝒢\mathcal{G}^{\prime} recognizing 𝙻\mathtt{L}. Let S={s1,,sn}S=\{s_{1},\dots,s_{n}\} be the set of elements multiplied with the register during the computation of 𝒢\mathcal{G}. We define the mapping φ\varphi as follows.

φ:si[si001si]\varphi:s_{i}\mapsto\left[\begin{array}[]{cc}s_{i}&0\\ 0&\frac{1}{s_{i}}\\ \end{array}\right]~~~

The elements φ(si)\varphi(s_{i}) are 2×22\times 2 rational matrices with determinant 1. Let δ\delta and δ\delta^{\prime} be the transition functions of 𝒢\mathcal{G} and 𝒢\mathcal{G}^{\prime} respectively. We let (q,si)δ(q,σ)(q,φ(si))δ(q,σ)(q^{\prime},s_{i})\in\delta(q,\sigma)\iff(q^{\prime},\varphi(s_{i}))\in\delta^{\prime}(q,\sigma) for every q,qQq,q^{\prime}\in Q, σΣ\sigma\in\Sigma and siSs_{i}\in S. The resulting 𝒢\mathcal{G}^{\prime} recognizes 𝙻\mathtt{L}.

The inclusion is proper since 𝙻={a22n+1|n0}𝔏(SL(2,))\mathtt{L}=\{a^{2^{2n+1}}|n\geq 0\}\in\mathfrak{L}(SL(2,\mathbb{Q})) by Theorem 3.2, and 𝔏(+)\mathfrak{L}(\mathbb{Q}^{+}) does not contain any unary nonregular languages by Fact 3.1, noting that +\mathbb{Q}^{+}-automata are equivalent to 1NFAMW’s. ∎

We will now look at a special subgroup of GL(2,)GL(2,\mathbb{Q}).

For two integers mm and nn, the Baumslag-Solitar group BS(m,n)BS(m,n) is defined as BS(m,n)=a,b|bamb1=anBS(m,n)=\langle a,b|ba^{m}b^{-1}=a^{n}\rangle. We are going to focus on BS(1,2)=a,b|bab1=a2BS(1,2)=\langle a,b|bab^{-1}=a^{2}\rangle.

Consider the matrix group GBSG_{BS} generated by the matrices

A=[1011]andB=[1/2001].A=\left[\begin{array}[]{cc}1&0\\ -1&1\\ \end{array}\right]~~~\mbox{and}~~~B=\left[\begin{array}[]{cc}1/2&0\\ 0&1\\ \end{array}\right].

Consider the isomorphism aAa\mapsto A, bBb\mapsto B. The matrices AA and BB satisfy the property BAB1=A2BAB^{-1}=A^{2},

[1/2001][1011][2001]=[1021],\left[\begin{array}[]{cc}1/2&0\\ 0&1\\ \end{array}\right]\left[\begin{array}[]{cc}1&0\\ -1&1\\ \end{array}\right]\left[\begin{array}[]{cc}2&0\\ 0&1\\ \end{array}\right]=\left[\begin{array}[]{cc}1&0\\ -2&1\\ \end{array}\right],

and we conclude that GBSG_{BS} is isomorphic to BS(1,2)BS(1,2).

We will prove that there exists a BS(1,2)BS(1,2)-automaton which recognizes a non-context-free language.

{thrm}

𝔏(BS(1,2))𝖢𝖥\mathfrak{L}(BS(1,2))\nsubseteq\mathsf{CF}.

Proof.

Let us construct a BS(1,2)BS(1,2)-automaton 𝒢\mathcal{G} recognizing the language 𝚄𝙿𝙾𝚆={a2n|n0}\mathtt{UPOW}=\{a^{2^{n}}|n\geq 0\}. The state diagram of 𝒢\mathcal{G} and the matrices are given in Figure 2. Without scanning any input symbol, 𝒢\mathcal{G} multiplies its register with the matrix A1A_{1} successively. 𝒢\mathcal{G} nondeterministically moves to the next state reading the first input symbol without modifying the register. After that point, 𝒢\mathcal{G} starts reading the string and multiplies its register with the matrix A2A_{2} for each scanned aa. At some point, 𝒢\mathcal{G} nondeterministically stops reading the rest of the string and multiplies its register with the element A3A_{3}. After successive multiplications with A3A_{3}, 𝒢\mathcal{G} nondeterministically decides to move to an accept state.

Refer to caption
Figure 2. State diagram of 𝒢\mathcal{G} recognizing 𝚄𝙿𝙾𝚆={a2n|n0}\mathtt{UPOW}=\{a^{2^{n}}|n\geq 0\}

As a result of ii multiplications with A1A_{1}, the register has the value

[2i02i11]\left[\begin{array}[]{cc}2^{i}&0\\ 2^{i}-1&1\\ \end{array}\right]

before reading the first input symbol. Multiplication with each A2A_{2} leaves 2i2^{i} unchanged while subtracting 1 from 2i12^{i}-1 for each scanned aa. The register will have the value

[2i02i1j1]\left[\begin{array}[]{cc}2^{i}&0\\ 2^{i}-1-j&1\\ \end{array}\right]

as a result of jj multiplications with the matrix A2A_{2}.

For the rest of the computation, 𝒢\mathcal{G} will multiply its register with A3A_{3} resulting in the register value

[2i2k02i1j1]\left[\begin{array}[]{cc}\frac{2^{i}}{2^{k}}&0\\ 2^{i}-1-j&1\\ \end{array}\right]

since each multiplication with A3A_{3} divides 2i2^{i} by 2.

The register contains the identity matrix at the end of the computation if i=ki=k and j=2i1j=2^{i}-1 which is possible if the input string is of the form a1+2i1=a2ia^{1+2^{i}-1}=a^{2^{i}}. In the successful branch, the register will be equal to the identity matrix and 𝒢\mathcal{G} will end up in the final state having successfully read the input string.

For input strings which are not members of 𝚄𝙿𝙾𝚆\mathtt{UPOW}, either the computation will end before reading the whole input string or the final state will be reached with the register value being different from the identity matrix. Note that A1=B1A1A_{1}=B^{-1}A^{-1}, A2=AA_{2}=A and A3=BA_{3}=B, where AA and BB are the generators of the group GBSG_{BS} and recall that GBSG_{BS} is isomorphic to BS(1,2)BS(1,2). Since 𝚄𝙿𝙾𝚆\mathtt{UPOW} is a unary nonregular language, it is not context-free and we conclude the result. ∎

Note that 𝔏()𝔏(BS(1,2))\mathfrak{L}(\mathbb{Z})\subsetneq\mathfrak{L}(BS(1,2)) since the subgroup generated by aa in BS(1,2)BS(1,2) is isomorphic to \mathbb{Z} and 𝔏(BS(1,2))\mathfrak{L}(BS(1,2)) contains a unary nonregular language.

3.3. Automata on matrices of higher dimensions

In [18], it is proven that 𝐅2×𝐅2\mathbf{F}_{2}\times\mathbf{F}_{2}-automata are as powerful as Turing machines.

Fact \thethrm.

[18] 𝔏(𝐅2×𝐅2)\mathfrak{L}(\mathbf{F}_{2}\times\mathbf{F}_{2}) is the family of recursively enumerable languages.

We make the following observation.

{thrm}

𝖱𝖤=𝔏(𝐅2×𝐅2)=𝔏(SL(4,))\mathsf{RE}=\mathfrak{L}(\mathbf{F}_{2}\times\mathbf{F}_{2})=\mathfrak{L}(SL(4,\mathbb{Z})).

Proof.

The first equality is Fact 3.3. Recall from Section 3.2 that φ\varphi is an isomorphism from 𝐅2\mathbf{F}_{2} onto 𝐆\mathbf{G}, the matrix group generated by the matrices MaM_{a} and MbM_{b}. Let 𝐆\mathbf{G}^{\prime} be the following group of matrices

{[M10000 M200],M1,M2𝐆}.\left\{\left[\begin{array}[]{clll}\lx@intercol\hbox{\multirowsetup$M_{1}$}\hfil\lx@intercol&0&0\\ &&0&0\\ 0&0&\lx@intercol\hfil\hbox{\multirowsetup $M_{2}$}\hfil\lx@intercol\\ 0&0&&\\ \end{array}\right],\ M_{1},\ M_{2}\in\mathbf{G}\right\}.

We will define the mapping ψ:𝐅2×𝐅2𝐆\psi:\mathbf{F}_{2}\times\mathbf{F}_{2}\rightarrow\mathbf{G}^{\prime} as ψ(g1,g2)=(φ(g1),φ(g2))\psi(g_{1},g_{2})=(\varphi(g_{1}),\varphi(g_{2})) for all (g1,g2)𝐅2×𝐅2(g_{1},g_{2})\in\mathbf{F}_{2}\times\mathbf{F}_{2} which is an isomorphism from 𝐅2×𝐅2\mathbf{F}_{2}\times\mathbf{F}_{2} onto 𝐆\mathbf{G}^{\prime}.

This proves that 𝐅2×𝐅2\mathbf{F}_{2}\times\mathbf{F}_{2} is isomorphic to a subgroup of SL(4,)SL(4,\mathbb{Z}). The fact that 𝔏(𝐅2×𝐅2)\mathfrak{L}(\mathbf{F}_{2}\times\mathbf{F}_{2}) is the set of recursively enumerable languages helps us to conclude that 𝔏(SL(n,))\mathfrak{L}(SL(n,\mathbb{Z})) is the set of recursively enumerable languages for n4n\geq 4. ∎

Let us also state that the classes of languages recognized by automata over supergroups of SL(4,)SL(4,\mathbb{Z}) such as GL(4,)GL(4,\mathbb{Z}) or SL(4,)SL(4,\mathbb{Q}) are also identical to the class of recursively enumerable languages. {thrm} 𝔏(G)=𝖱𝖤\mathfrak{L}(G)=\mathsf{RE}, where GG is any matrix group whose matrix entries are computable numbers and SL(4,)SL(4,\mathbb{Z}) is a subgroup of GG.

Proof.

Note that any finite automaton over a matrix group can be simulated by a nondeterministic Turing machine which keeps track of the register simply by multiplying the matrices and checking whether the identity matrix is reached at the end of the computation, provided that the matrix entries are computable numbers. Since 𝖱𝖤=𝔏(SL(4,))\mathsf{RE}=\mathfrak{L}(SL(4,\mathbb{Z})) and GG contains SL(4,)SL(4,\mathbb{Z}) as a subgroup, 𝔏(G)\mathfrak{L}(G) is the set of recursively enumerable languages. ∎

We summarize the results in Figure 3. Solid arrows represent proper inclusion, dashed arrows represent inclusion and dashed lines represent incomparability.

Refer to caption
Figure 3. Language classes associated with groups

4. Time complexity

In the previous section, we compared various automaton models solely on the basis of the groups they employed as a computational resource. The theory of computational complexity deals with various different types of such resources, the allowed runtime of the machines being the most prominent among them. Some of the automata we saw in Section 3 (e.g. Figure 2) have arbitrarily long computations, and it is a legitimate question to ask whether our results, for instance, the relationships in Figure 3, would still hold if one imposed common time bounds on the automata. We study such questions in this section.

4.1. Definitions

A group automaton 𝒢\mathcal{G} recognizing language 𝙻\mathtt{L} is said to be strongly t(n)t(n) time-bounded if for any input string xx with |x|=n|x|=n, every computation of 𝒢\mathcal{G} on xx takes at most t(n)t(n) steps. We will denote the set of languages recognized by strongly t(n)t(n)-time bounded GG-automata by 𝔏(G)t(n)s\mathfrak{L}(G)_{t(n)}^{s}.

Although the strong mode of recognition defined above is standard in studies of time complexity, we will be able to prove the impossibility results of the next subsection even when the machines are subjected to the following, looser requirement:

A group automaton 𝒢\mathcal{G} recognizing language 𝙻\mathtt{L} is said to be weakly t(n)t(n) time-bounded if for each accepted input string x𝙻x\in\mathtt{L} with |x|=n|x|=n, 𝒢\mathcal{G} has a successful computation which takes at most t(n)t(n) steps. So any input string is allowed to cause longer computations, as long as none of those are accepting for inputs which are not members of 𝙻\mathtt{L}. We will denote the set of languages recognized by weakly t(n)t(n)-time bounded GG-automata by 𝔏(G)t(n)w\mathfrak{L}(G)_{t(n)}^{w}.

A machine is real-time if every transition consumes an input symbol.

Note that the statement 𝔏(G)t(n)s𝔏(G)t(n)w\mathfrak{L}(G)_{t(n)}^{s}\subseteq\mathfrak{L}(G)_{t(n)}^{w} is true by definition.

Let XX be a generator set for the group GG. The length of gGg\in G, denoted |g|X|g|_{X}, is the length of the shortest representative for gg in (XX1)(X\cup X^{-1})^{*}. Let BGX(n)={gG,|g|Xn}B^{X}_{G}(n)=\{g\in G,|g|_{X}\leq n\} be the set of all elements in GG which can be represented by a word of length at most nn. The growth function of a group GG with respect to a generating set XX, denoted gGX(n)g^{X}_{G}(n), is the cardinality of the set BGX(n)B^{X}_{G}(n), that is gGX(n)=|BGX(n)|g^{X}_{G}(n)=|B^{X}_{G}(n)|. The growth function is asymptotically independent of the generating set, and we will denote the growth function of a group GG by gG(n)g_{G}(n).

For a positive integer nn, two strings w,wΣw,w^{\prime}\in\Sigma^{*} are nn-dissimilar for 𝙻\mathtt{L} if |w|n|w|\leq n, |w|n|w^{\prime}|\leq n, and there exists a string vΣv\in\Sigma^{*} with |wv|n|wv|\leq n, |wv|n|w^{\prime}v|\leq n such that wv𝙻wv\in\mathtt{L} iff wv𝙻w^{\prime}v\notin\mathtt{L}. Let A𝙻(n)A_{\mathtt{L}}(n) be the maximum kk such that there exist kk distinct strings that are pairwise nn-dissimilar.

A finite set of strings SS is said to be a set of uniformly nn-dissimilar strings for 𝙻\mathtt{L} if for each string wSw\in S, there exists a string vv such that |wv|n|wv|\leq n and wv𝙻wv\in\mathtt{L} and for any string wSw^{\prime}\in S such that www\neq w^{\prime}, |wv|n|w^{\prime}v|\leq n and wv𝙻w^{\prime}v\notin\mathtt{L}. Let U𝙻(n)U_{\mathtt{L}}(n) be the maximum kk such that there exist kk distinct strings that are uniformly nn-dissimilar.

Note that the following is always true by definition, since the strings in a uniformly nn-dissimilar set are pairwise nn-dissimilar.

{lmm}

U𝙻(n)A𝙻(n)U_{\mathtt{L}}(n)\leq A_{\mathtt{L}}(n) for all n0n\geq 0.

4.2. Limitations of machines on slow groups running in short time

{thrm}

Let GG be a group with growth function gG(n)g_{G}(n). 𝙻𝔏(G)t(n)w\mathtt{L}\notin\mathfrak{L}(G)_{t(n)}^{w} if gG(t(n))o(U𝙻(n))g_{G}(t(n))\in o(U_{\mathtt{L}}(n)).

Proof.

Suppose for a contradiction that there exists a weakly t(n)t(n) time-bounded GG-automaton 𝒢\mathcal{G} recognizing 𝙻\mathtt{L} in time t(n)t(n). For a sufficiently large nn, let SS be the set of uniformly nn-dissimilar strings such that |S|=U𝙻(n)|S|=U_{\mathtt{L}}(n). For every string wiSw_{i}\in S, there exists a string viv_{i} such that wivi𝙻w_{i}v_{i}\in\mathtt{L} and wjvi𝙻w_{j}v_{i}\notin\mathtt{L} for all wjSw_{j}\in S with iji\neq j .

Let SaccS_{acc} be the set of accepted extended strings of the form wivi𝙻w_{i}v_{i}\in\mathtt{L} with |wivi|n|w_{i}v_{i}|\leq n where wiSw_{i}\in S and wjvi𝙻w_{j}v_{i}\notin\mathtt{L} for all wjSw_{j}\in S with iji\neq j and |wjvi|n|w_{j}v_{i}|\leq n. Let CC be the set of t(n)t(n) time bounded accepting computation paths for the strings in SaccS_{acc}. The computation cwiviCc_{w_{i}v_{i}}\in C on the string wiviw_{i}v_{i} can be written as

cwivi=cwiviwicwivivic_{w_{i}v_{i}}=c_{w_{i}v_{i}}^{w_{i}}c_{w_{i}v_{i}}^{v_{i}}

where cwiviwic_{w_{i}v_{i}}^{w_{i}} represents the computation up to the end of the prefix wiw_{i} and cwivivic_{w_{i}v_{i}}^{v_{i}} represents the rest of the computation on the string viv_{i}.

A configuration of a group automaton is a pair consisting of a state and a group element. Let us count the number of configurations that can be reached at the end of the computation cwiviwic_{w_{i}v_{i}}^{w_{i}}. Since the number of states is constant, the number of configurations that can be reached is dependent on the number of different group elements that can appear in the register. After reading a prefix wiw_{i} with |wi|=mn|w_{i}|=m\leq n, the product of the labels on the edges can be given by l=gi1gi2gikl=g_{i_{1}}g_{i_{2}}\dots g_{i_{k}} for some kt(m)k\leq t(m), since the computation in consideration is time bounded. ll can be expressed as a product of κ\kappa generators, where κ\kappa is at most CkC\cdot k for some constant CC, since each group element labeling a transition in 𝒢\mathcal{G} is composed of at most some constant number of generators, which is independent of the length of the string. The number of elements in GG which can be represented as a product of at most κ\kappa generators is given by gG(κ)g_{G}(\kappa) by the definition of the growth function of GG. Hence, the number of different values that can appear in the register after reading a string of length exactly mm is less than or equal to gG(κ)g_{G}(\kappa). Since κCk\kappa\leq C\cdot k and kt(m)k\leq t(m) and gG(t(n))o(U𝙻(n))g_{G}(t(n))\in o(U_{\mathtt{L}}(n)), we can conclude that

gG(κ)gG(Ct(m))o(U𝙻(n)).g_{G}(\kappa)\leq g_{G}(C\cdot t(m))\in o(U_{\mathtt{L}}(n)).

Now it is easy to see that the number of different configurations that can be reached at the end of a computation cwiviwic_{w_{i}v_{i}}^{w_{i}} is o(U𝙻(n))o(U_{\mathtt{L}}(n)). Note that the cardinality of the set SS, and thus that of SaccS_{acc}, is equal to UL(n)U_{L}(n). Due to the pigeonhole principle, the same configuration must be reached at the end of two computations cwiviwic_{w_{i}v_{i}}^{w_{i}} and cwjvjwjc_{w_{j}v_{j}}^{w_{j}} for some iji\neq j. This will result in the acceptance of the strings wivjw_{i}v_{j} and wjviw_{j}v_{i}, which are not members of 𝙻\mathtt{L}. We arrive at a contradiction and conclude that 𝙻\mathtt{L} cannot be recognized by any weakly t(n)t(n) time-bounded GG-automaton. ∎

In the next lemma, we set a lower bound on maximum cardinality of the set of uniformly nn-dissimilar strings in the word problem language of some group GG.

{lmm}

Let GG be a finitely generated group with growth function gG(n)g_{G}(n). Then UW(G)(n)gG(n2)U_{W(G)}(n)\geq g_{G}(\lfloor\frac{n}{2}\rfloor).

Proof.

Let XX be the generator set of GG. The number of distinct elements gg in GG which can be represented by a word of length less than or equal to n2\lfloor\frac{n}{2}\rfloor is gG(n2)g_{G}(\lfloor\frac{n}{2}\rfloor), which is the cardinality of the set BGX(n2)={gG,|g|Xn2}B_{G}^{X}(\lfloor\frac{n}{2}\rfloor)=\{g\in G,|g|_{X}\leq\lfloor\frac{n}{2}\rfloor\}. Let SS be the set containing the string representations of the elements in BGX(n2)B_{G}^{X}(\lfloor\frac{n}{2}\rfloor). Every wiSw_{i}\in S can be extended with wi1w_{i}^{-1} so that the extended string represents the identity element of GG and has length less than or equal to nn. Since the strings in W(G)W(G) are those which belong to (XX1)(X\cup X^{-1})^{*} and represent the identity element of GG, the extended string wiwi1W(G)w_{i}w^{-1}_{i}\in W(G). For every string wjSw_{j}\in S such that iji\neq j, wjwi1W(G)w_{j}w_{i}^{-1}\notin W(G) since it is not possible for wjwi1w_{j}w_{i}^{-1} to represent the identity element of GG. We conclude that the set SS is uniformly nn-dissimilar. Since |S|=|BGX(n2)|=gG(n2)|S|=|B_{G}^{X}(\lfloor\frac{n}{2}\rfloor)|=g_{G}(\lfloor\frac{n}{2}\rfloor), it follows that UW(G)(n)gG(n2)U_{W(G)}(n)\geq g_{G}(\lfloor\frac{n}{2}\rfloor). ∎

The following theorem is about the language recognition power of finite automata over polynomial-growth groups which are weakly polynomial time-bounded. {thrm} Let GG and HH be groups with polynomial and exponential growth functions gG(n)g_{G}(n) and gH(n)g_{H}(n), respectively. For any polynomial t(n)t(n), 𝔏(H)𝔏(G)t(n)w\mathfrak{L}(H)\nsubseteq\mathfrak{L}(G)_{t(n)}^{w}.

Proof.

Since UW(H)(n)gH(n2)U_{W(H)}(n)\geq g_{H}(\lfloor\frac{n}{2}\rfloor) by Lemma 4.2, and gH(n)g_{H}(n) is an exponential function, UW(H)(n)U_{W(H)}(n) is also at least exponential. gG(t(n))g_{G}(t(n)) is a polynomial function, since both gG(n)g_{G}(n) and t(n)t(n) are polynomial. Hence, W(H)𝔏(G)t(n)wW(H)\notin\mathfrak{L}(G)_{t(n)}^{w} by Theorem 4.2, and the result follows since W(H)W(H) is trivially in 𝔏(H)\mathfrak{L}(H). ∎

{thrm}

Let GG be a group with a polynomial growth function. For any polynomial t(n)t(n), 𝖢𝖥𝔏(G)t(n)w\mathsf{CF}\nsubseteq\mathfrak{L}(G)_{t(n)}^{w}.

Proof.

It is known that the word problem of the free group of rank 2, W(𝐅2)W(\mathbf{F}_{2}), has an exponential growth function [12]. Assuming that GG is a group with polynomial growth function, W(𝐅2)W(\mathbf{F}_{2}) cannot be recognized by any weakly t(n)t(n) time-bounded GG-automaton by Theorem 4.2. Since W(𝐅2)W(\mathbf{F}_{2}) is a context-free language, the proof is complete. ∎

4.3. Group automata under linear time bounds

In this section, we focus on linear-time computation.

Let XX be a generator set. For each symbol xXx\in X, the functions PxP_{x} and QxQ_{x} are defined as follows.

Px:XXwwx\displaystyle P_{x}:X^{*}\rightarrow X^{*}\hskip 36.135ptw\mapsto wx
Qx:XxXwxw\displaystyle Q_{x}:X^{*}x\rightarrow X^{*}\hskip 36.135ptwx\mapsto w

Let KXK_{X} be the submonoid of all partial functions on XX^{*} generated by PxP_{x} and QxQ_{x} for all xXx\in X. KXK_{X} is called the polycyclic monoid on XX. A KXK_{X}-automaton is equivalent to a pushdown automaton, where PxP_{x} and QxQ_{x} can be interpreted as pushing and popping symbols on the stack. The equivalence between the two models is due to the nature of the functions PxP_{x} and QxQ_{x}, and is described in detail in [14]. The resemblance between the free group and KXK_{X} is used to prove that 𝔏(𝐅2)=𝖢𝖥\mathfrak{L}(\mathbf{F}_{2})=\mathsf{CF} in [14] and [4].

Our aim is to show that 𝐅2\mathbf{F}_{2}-automata working in linear time can recognize all context-free languages. It is stated in [21] that KXK_{X}-automata which consume at least one input symbol at each step are as powerful as KXK_{X}-automata without any time bound. However, it is not straightforward to see whether the same is true for 𝐅2\mathbf{F}_{2}-automata.

{thrm}

𝔏(𝐅2)O(n)w=𝖢𝖥\mathfrak{L}(\mathbf{F}_{2})_{O(n)}^{w}=\mathsf{CF}.

Proof.

We are going to use the construction of Kambites [14] to prove that any context-free language can be recognized by a weakly linear-time bounded 𝐅2\mathbf{F}_{2}-automaton.

Let 𝙻\mathtt{L} be a context-free language and let ={Q,Σ,KX,δ,q0,Qa}\mathcal{M}=\{Q,\Sigma,K_{X},\delta,q_{0},Q_{a}\} be a polycyclic monoid automaton recognizing 𝙻\mathtt{L}. KXK_{X} is the polycyclic monoid on XX where the cardinality of the set XX is nn for some n2n\geq 2. Let ee be the identity element of KxK_{x}. The construction of Kambites provides an 𝐅n+1\mathbf{F}_{n+1}-automaton 𝒢={Q,Σ,𝐅n+1,δ,q0,Qa}\mathcal{G}=\{Q^{\prime},\Sigma,\mathbf{F}_{n+1},\delta^{\prime},q_{0}^{\prime},Q_{a}^{\prime}\} recognizing the language 𝙻\mathtt{L}. The generator set for 𝐅n+1\mathbf{F}_{n+1} is XX^{\prime}, where X=X#X^{\prime}=X\cup\#.

Let us analyze the construction in more detail.

  • Q=QQ+Q^{\prime}=Q_{-}\cup Q_{+} where Q={q|qQ}Q_{-}=\{q_{-}|q\in Q\} and Q+={q+|qQ}Q_{+}=\{q_{+}|q\in Q\}

  • q0q_{0}^{\prime}=q+q_{+} where q=q0q=q_{0}.

  • Qa={q|qQa}Q_{a}^{\prime}=\{q_{-}|q\in Q_{a}\}.

  • δ(p+,σ)=(q+,x#)\delta^{\prime}(p_{+},\sigma)=(q_{+},x\#) if δ(p,σ)=(q,x#)\delta(p,\sigma)=(q,x\#) where xx is a positive generator for all σΣ\sigma\in\Sigma.

  • δ(p,σ)=(q+,x#)\delta^{\prime}(p_{-},\sigma)=(q_{+},x^{\prime}\#) if δ(p,σ)=(q,x#)\delta(p,\sigma)=(q,x^{\prime}\#) where xx^{\prime} is a negative generator for all σΣ\sigma\in\Sigma.

  • δ(p+,σ)=(q+,e)\delta^{\prime}(p_{+},\sigma)=(q_{+},e) if δ(p,σ)=(q,e)\delta(p,\sigma)=(q,e) for all σΣ\sigma\in\Sigma.

  • δ(q+,ϵ)=(q,e)\delta^{\prime}(q_{+},\epsilon)=(q_{-},e) for each qQq\in Q.

  • δ(q,ϵ)=(q,#1)\delta^{\prime}(q_{-},\epsilon)=(q_{-},\#^{-1}) for each qQq\in Q.

We will prove that 𝒢\mathcal{G} actually runs in linear time. There are two transitions where the automaton is allowed to move without consuming any input symbols.

For each state qQq\in Q, there are two states q+q_{+} and qq_{-} in 𝒢\mathcal{G} which are connected with an edge labeled (ϵ,e)(\epsilon,e). These transitions do not change the register value, and cannot contribute more than half of the runtime of the machine, since at least one input symbol has to be consumed between any two executions of such transitions.

ϵ\epsilon-loops exist in the machine 𝒢\mathcal{G} for each state qq_{-} where the loop is labeled by (ϵ,#1)(\epsilon,\#^{-1}). Although this looks worrisome at first for the purpose of bounding the runtime, the number of times these loops are traversed is actually bounded, as the following argument suggests. Suppose that the register is multiplied with l1l_{1}, l2l_{2}, \cdots, lml_{m} while reading some input string ww of length nn, resulting in the register value l=l1l2lm(#1)kl=l_{1}l_{2}\cdots l_{m}(\#^{-1})^{k}, where kk\in\mathbb{N}, at the end of the computation. If ww is accepted by the machine, ll should satisfy the following, as well as being equal to the identity element:

li={(#1)pxi# for some p,if xi is a negative generatorxi#,if xi is a positive generatorl_{i}=\Biggl{\{}\begin{array}[]{lr}(\#^{-1})^{p}x_{i}\#\mbox{ for some }p\in\mathbb{N},&\mbox{if $x_{i}$ is a negative generator}\\ x_{i}\#,&\mbox{if $x_{i}$ is a positive generator}\\ \end{array}

This is called a permissible padding in [14]. By looking at the transition function of 𝒢\mathcal{G}, one can see that the register is multiplied by a #\# only when an input symbol is consumed. Hence, the number of #\#’s that occur in ll is less than or equal to the length of the string. The register is multiplied with #1\#^{-1} without consuming any input symbol. In order for the #\#’s and #1\#^{-1}’s to cancel each other, they should be equal in number. Therefore, it can be concluded that the ϵ\epsilon-loops are traversed at most nn times.

We can conclude that any context-free language can be recognized by a weakly linear-time bounded free group automaton. Since 𝐅2\mathbf{F}_{2} contains every free group of countable rank, the proof is complete. ∎

We state the following theorem, which is the linear-time equivalent of Fact 3.2 [4].

{thrm}

Suppose GG is a finitely generated group and HH is a subgroup of finite index. Then 𝔏(G)O(n)w=𝔏(H)O(n)w\mathfrak{L}(G)_{O(n)}^{w}=\mathfrak{L}(H)_{O(n)}^{w}.

Proof.

We know that the statement is true in general when there is no time bound by [4]. The proof in [4] still works when all automata in the constructions are required to work in linear time. ∎

Now we can show that Theorem 3.2 also holds for linear-time bounded group automaton.

{thrm}

𝖢𝖥=𝔏(𝐅2)O(n)w=𝔏(SL(2,))O(n)w=𝔏(GL(2,))O(n)w\mathsf{CF}=\mathfrak{L}(\mathbf{F}_{2})_{O(n)}^{w}=\mathfrak{L}(SL(2,\mathbb{Z}))_{O(n)}^{w}=\mathfrak{L}(GL(2,\mathbb{Z}))_{O(n)}^{w}.

Proof.

The proof is identical with the proof of Theorem 3.2 by using Theorem 4.3. ∎

By using the results proven in Subsection 4.2, we can demonstrate the language recognition power of weakly linear-time bounded 𝐇\mathbf{H}-automata.

{thrm}

𝔏(H)O(n)w𝔏(SL(3,))O(n)w\mathfrak{L}(\textbf{H})_{O(n)}^{w}\subsetneq\mathfrak{L}(SL(3,\mathbb{Z}))_{O(n)}^{w}.

Proof.

𝔏(𝐇)O(n)w𝔏(SL(3,))O(n)w\mathfrak{L}(\mathbf{H})^{w}_{O(n)}\subseteq\mathfrak{L}(SL(3,\mathbb{Z}))_{O(n)}^{w} since 𝐇\mathbf{H} is a subgroup of SL(3,)SL(3,\mathbb{Z}). Since the Heisenberg group has polynomial growth function [16], there exists a context-free language which can not be recognized by any H-automaton in polynomial time by Theorem 4.2. Since 𝖢𝖥=𝔏(SL(2,))O(n)w\mathsf{CF}=\mathfrak{L}(SL(2,\mathbb{Z}))_{O(n)}^{w} by Theorem 4.3, the result follows. ∎

{thrm}

i. For k5k\geq 5, 𝔏(𝐇)O(n)w\mathfrak{L}(\mathbf{H})^{w}_{O(n)} and 𝔏(k)O(n)w\mathfrak{L}(\mathbb{Z}^{k})^{w}_{O(n)} are incomparable.
ii. 𝔏(𝐇)O(n)w\mathfrak{L}(\mathbf{H})^{w}_{O(n)} and 𝖢𝖥\mathsf{CF} are incomparable.

Proof.

i. In [19], a weakly linear-time bounded 𝐇\mathbf{H}-automaton which recognizes the language 𝙼𝚄𝙻𝚃={xpyqzpq|p,q0}\mathtt{MULT}=\{x^{p}y^{q}z^{pq}|p,q\geq 0\} is constructed. The language 𝙼𝚄𝙻𝚃\mathtt{MULT} can not be recognized by any k\mathbb{Z}^{k}-automaton, since any bounded language in 𝔏(+)\mathfrak{L}(\mathbb{Q}^{+}) is semilinear by [13].

In [10], it is implicitly proven there exists a uniformly nn-dissimilar set of size Θ(nk)\Theta(n^{k}) for the language 𝙻k={0a110a210ak10a110a210ak1}\mathtt{L}_{k}=\{0^{a_{1}}10^{a_{2}}1\dots 0^{a_{k}}10^{a_{1}}10^{a_{2}}1\dots 0^{a_{k}}1\} for all integers kk. For k=5k=5, there exists a uniformly nn-dissimilar set of size Θ(n5)\Theta(n^{5}) for the language 𝙻5\mathtt{L}_{5} and U𝙻5(n)n5U_{\mathtt{L}_{5}}(n)\geq n^{5}. Since g𝐇(n)g_{\mathbf{H}}(n) is a polynomial of order 4 [16] and t(n)=O(n)t(n)=O(n), g𝐇(t(n))o(U𝙻5(n))g_{\mathbf{H}}(t(n))\in o(U_{\mathtt{L}_{5}}(n)). By Theorem 4.2, we conclude the result.

ii.ii. The language 𝙼𝚄𝙻𝚃={xpyqzpq|p,q0}\mathtt{MULT}=\{x^{p}y^{q}z^{pq}|p,q\geq 0\} is not a context-free language. Since 𝐇\mathbf{H} has a polynomial growth function [16], there exists a context-free language which can not be recognized by any 𝐇\mathbf{H}-automaton in polynomial-time by Theorem 4.2.

Let us note that 𝙻5\mathtt{L}_{5} can be recognized by a 5\mathbb{Z}^{5}-automaton in real time. The existence of the languages 𝙻k\mathtt{L}_{k} can be used to prove the linear-time nondeterministic counter hierarchy, with the help of Theorem 4.2.

{thrm}

𝔏(k)O(n)w𝔏(k+1)O(n)w\mathfrak{L}(\mathbb{Z}^{k})^{w}_{O(n)}\subsetneq\mathfrak{L}(\mathbb{Z}^{k+1})^{w}_{O(n)} for k1k\geq 1.

Proof.

The language 𝙻k+1={0a110a210ak+110a110a210ak+11}\mathtt{L}_{k+1}=\{0^{a_{1}}10^{a_{2}}1\dots 0^{a_{k+1}}10^{a_{1}}10^{a_{2}}1\dots 0^{a_{k+1}}1\} can be recognized by a k+1\mathbb{Z}^{k+1}-automaton in real time. While scanning the first k+1k+1 segments of 0’s, the ii’th counter is increased for each scanned 0 as 0ai0^{a_{i}} is read. In the remainder of the computation, the ii’th counter is decreased for each scanned 0 when 0ai0^{a_{i}} is read.

There exists a uniformly nn-dissimilar set of size Θ(nk+1)\Theta(n^{k+1}) for the language 𝙻k+1\mathtt{L}_{k+1}, so U𝙻k+1(n)nk+1U_{\mathtt{L}_{k+1}}(n)\geq n^{k+1}. Since t(n)=O(n)t(n)=O(n) and gk(n)=nkg_{\mathbb{Z}^{k}}(n)=n^{k} [12], gk(t(n))o(U𝙻5(n))g_{\mathbb{Z}^{k}}(t(n))\in o(U_{\mathtt{L}_{5}}(n)). We conclude by Theorem 4.2. ∎

A celebrated result of the field of computational complexity, the nondeterministic time hierarchy theorem, will enable us to demonstrate that the computational power 𝐅2×𝐅2\mathbf{F}_{2}\times\mathbf{F}_{2}-automata is dependent on the time allotted for their execution.

Fact \thethrm.

[20] If g(n)g(n) is a time-constructible function, and f(n+1)=o(g(n))f(n+1)=o(g(n)), then there exists a language which cannot be recognized by any nondeterministic Turing machine in time f(n)f(n), but can be recognized by a nondeterministic Turing machine in time g(n)g(n).

Assume that any recursively enumerable language can be recognized by some linear-time 𝐅2×𝐅2\mathbf{F}_{2}\times\mathbf{F}_{2}-automaton. One can easily build a nondeterministic Turing machine that simulates such a 𝐅2×𝐅2\mathbf{F}_{2}\times\mathbf{F}_{2}-automaton with only a polynomial slowdown. But this would mean that any recursively enumerable language can be recognized by some nondeterministic TM in polynomial time, contradicting Fact 4.3, which implies that there exist languages which can only be recognized by nondeterministic Turing machines which run in at least exponential time. We have proven the following theorem.

{thrm}

𝔏(𝐅2×𝐅2)O(n)w𝖱𝖤\mathfrak{L}(\mathbf{F}_{2}\times\mathbf{F}_{2})_{O(n)}^{w}\subsetneq\mathsf{RE}.

Using the ability of Turing machines to simulate any finite automaton over a computable matrix group, the statement of the above theorem can be extended as follows.

{thrm}

𝔏(G)O(n)w𝖱𝖤\mathfrak{L}(G)_{O(n)}^{w}\subsetneq\mathsf{RE} for any matrix group GG whose matrix entries are computable numbers.

Proof.

In Theorem 3.3, we have mentioned that Turing machines can simulate any finite automaton over a computable matrix group. By the nondeterministic time hierarchy theorem, it can be shown that there exist some languages which can not be recognized by any finite automata over matrix groups in linear time. ∎

{thrm}

𝔏(𝐅2)O(n)w𝔏(𝐅2×𝐅2)O(n)w\mathfrak{L}(\mathbf{F}_{2})_{O(n)}^{w}\subsetneq\mathfrak{L}(\mathbf{F}_{2}\times\mathbf{F}_{2})_{O(n)}^{w}.

Proof.

It is obvious that an 𝐅2\mathbf{F}_{2}-automaton can be simulated by an 𝐅2×𝐅2\mathbf{F}_{2}\times\mathbf{F}_{2}-automaton. 𝔏(𝐅2)O(n)w=𝖢𝖥\mathfrak{L}(\mathbf{F}_{2})_{O(n)}^{w}=\mathsf{CF} by Theorem 4.3. The inclusion is proper since the non-context-free language 𝙻={anbncn|n0}\mathtt{L}=\{a^{n}b^{n}c^{n}|n\geq 0\} can be recognized by an 𝐅2×𝐅2\mathbf{F}_{2}\times\mathbf{F}_{2}-automaton in real time by using the two registers as two counters. ∎

In the rest of the section, the linear-time counterparts of the relationships in Figure 3 will be stated.

{thrm}
  1. i.

    𝔏(+)O(n)w𝔏(SL(2,))O(n)w\mathfrak{L}(\mathbb{Q}^{+})^{w}_{O(n)}\subsetneq\mathfrak{L}(SL(2,\mathbb{Q}))^{w}_{O(n)}.

  2. ii.

    𝔏()O(n)w𝔏(BS(1,2))O(n)w𝖢𝖥\mathfrak{L}(\mathbb{Z})^{w}_{O(n)}\subsetneq\mathfrak{L}(BS(1,2))^{w}_{O(n)}\nsubseteq\mathsf{CF}.

  3. iii.

    𝔏(SL(2,))O(n)w𝔏(SL(3,))O(n)w\mathfrak{L}(SL(2,\mathbb{Z}))^{w}_{O(n)}\subsetneq\mathfrak{L}(SL(3,\mathbb{Z}))^{w}_{O(n)}.

  4. iv.

    𝔏(2)O(n)w𝔏(𝐇)O(n)w\mathfrak{L}(\mathbb{Z}^{2})^{w}_{O(n)}\subsetneq\mathfrak{L}(\mathbf{H})^{w}_{O(n)}.

  5. v.

    𝖢𝖥\mathsf{CF} and 𝔏(k)O(n)w\mathfrak{L}(\mathbb{Z}^{k})^{w}_{O(n)} are incomparable for all k2k\geq 2.

  6. vi.

    𝔏(SL(3,))O(n)w=𝔏(GL(3,))O(n)w\mathfrak{L}(SL(3,\mathbb{Z}))^{w}_{O(n)}=\mathfrak{L}(GL(3,\mathbb{Z}))^{w}_{O(n)}.

  7. vii.

    𝖱𝖤𝖦=𝔏(𝐅0)O(n)w𝔏(𝐅1)O(n)w=𝔏()O(n)w𝔏(𝐅2)O(n)w\mathsf{REG}=\mathfrak{L}(\mathbf{F}_{0})^{w}_{O(n)}\subsetneq\mathfrak{L}(\mathbf{F}_{1})^{w}_{O(n)}=\mathfrak{L}(\mathbb{Z})^{w}_{O(n)}\subsetneq\mathfrak{L}(\mathbf{F}_{2})^{w}_{O(n)}.

Proof.

(i.,ii.,iii.,iv.)(i.,ii.,iii.,iv.) Analogous results where no time bound was imposed on the machines were proven in Theorems 3.2, 3.2, 3.2, and 3.2, respectively. The group automata recognizing the witness languages 𝙻={a22n+1|n0}\mathtt{L}=\{a^{2^{2n+1}}|n\geq 0\}, 𝚄𝙿𝙾𝚆={a2n|n0}\mathtt{UPOW}=\{a^{2^{n}}|n\geq 0\} and 𝙼𝚄𝙻𝚃={xpyqzpq|p,q0}\mathtt{MULT}=\{x^{p}y^{q}z^{pq}|p,q\geq 0\} operate in weakly linear time in all cases.

v. The equivalent result for the general case is given in Fact 3.1. The non-context-free language 𝙻={anbncn|n0}\mathtt{L}^{\prime}=\{a^{n}b^{n}c^{n}|n\geq 0\} can be recognized by a 2\mathbb{Z}^{2}-automaton in real time.

vi. The equivalent result for the general case is given in Theorem 3.2. The result follows by Theorem 4.3.

vii. The equivalent result for the general case is given in Fact 3.1. 𝐅0\mathbf{F}_{0} is the trivial group, and any regular language can be recognized by a deterministic finite automaton, which can be seen as finite automaton over 𝐅0\mathbf{F}_{0}, in real time. Since 𝐅1\mathbf{F}_{1} is isomorophic to \mathbb{Z}, the equality is obvious. Since the nonregular language 𝙻={anbn|n0}\mathtt{L}=\{a^{n}b^{n}|n\geq 0\} can be recognized by a \mathbb{Z}-automaton in real time, the proper inclusion follows. Lastly, since 𝔏(𝐅2)O(n)w\mathfrak{L}(\mathbf{F}_{2})^{w}_{O(n)} is equivalent to 𝖢𝖥\mathsf{CF} by Theorem 4.3, the last proper inclusion is still valid. ∎

The results are summarized in Figure 4.

Refer to caption
Figure 4. Language classes recognized by weakly linear-time bounded group automata

5. Open questions

Does there exist an SL(3,)SL(3,\mathbb{Z})-automaton recognizing W(3)W(\mathbb{Z}^{3})? 111Corollary 2 of [3] states that the word problem of a finitely generated Abelian group HH is recognized by a GG-automaton if and only if HH has a finite index subgroup isomorphic to a subgroup of GG. That corollary could be used to give an affirmative answer to this open question. Unfortunately, the corollary is wrong: Let HH be an Abelian group and let G=𝐅2×𝐅2G=\mathbf{F}_{2}\times\mathbf{F}_{2}. 𝔏(𝐅2×𝐅2)\mathfrak{L}(\mathbf{F}_{2}\times\mathbf{F}_{2}) contains the word problem of any finitely generated Abelian group. Since 𝐅2×𝐅2\mathbf{F}_{2}\times\mathbf{F}_{2} is finitely generated, any finite index subgroup of 𝐅2×𝐅2\mathbf{F}_{2}\times\mathbf{F}_{2} is also finitely generated. Any finite index subgroup of 𝐅2×𝐅2\mathbf{F}_{2}\times\mathbf{F}_{2} is either free or has a subgroup of finite index that is a direct product of free groups [1]. Any subgroup of an Abelian group is again Abelian. Hence, it is not possible that GG has a finite index subgroup isomorphic to a subgroup of HH.

Can we prove a stronger version of Theorem 4.2, which is independent of the time component? For instance, for the case of 𝐅2\mathbf{F}_{2}, is it true that W(𝐅2)𝔏(𝐇)W(\mathbf{F}_{2})\notin\mathfrak{L}(\mathbf{H}) in general?

Can we describe the necessary properties of a group GG so that 𝔏(G)\mathfrak{L}(G) contains W(F2)W(\textbf{F}_{2})?

Little is known about BS(1,2)BS(1,2)-automata. Does 𝔏(BS(1,2))\mathfrak{L}(BS(1,2)) contain every context-free language?

Which, if any, of the subset relationships in Figure 3 are proper inclusions?

Can we add other classes above RE in Figure 3 by examining groups on matrices with uncomputable entries?

Theorem 4.2 uses the definition of uniform nn-dissimilarity requiring that gG(t(n))g_{G}(t(n)) o(U𝙻(n))\in o(U_{\mathtt{L}}(n)). Would the theorem be still true if we replace U𝙻(n)U_{\mathtt{L}}(n) by A𝙻(n)A_{\mathtt{L}}(n) ? The gap between U𝙻(n)U_{\mathtt{L}}(n) and A𝙻(n)A_{\mathtt{L}}(n) might be large as mentioned in [10]. Consider the language 𝙻={aibj|ij}\mathtt{L}=\{a^{i}b^{j}|i\neq j\}. It is stated in [10] that a set of uniformly nn-dissimilar strings for 𝙻\mathtt{L} can not contain more than two strings. However, A𝙻(n)O(1)A_{\mathtt{L}}(n)\notin O(1), since 𝙻\mathtt{L} is not a regular language.

Can real-time 𝐅2\mathbf{F}_{2}-automata recognize every context-free language?

References

  • [1] Gilbert Baumslag and James E Roseblade. Subgroups of direct products of free groups. Journal of the London Mathematical Society, 2(1):44–52, 1984.
  • [2] Nathanial Patrick Brown and Narutaka Ozawa. C*-algebras and finite-dimensional approximations, volume 88. American Mathematical Soc., 2008.
  • [3] Sean Cleary, Murray Elder, and Gretchen Ostheimer. The word problem distinguishes counter languages. arXiv preprint math/0606415, 2006.
  • [4] Jon M. Corson. Extended finite automata and word problems. International Journal of Algebra and Computation, 15(03):455–466, 2005.
  • [5] Jürgen Dassow and Victor Mitrana. Finite automata over free groups. International Journal of Algebra and Computation, 10(06):725–737, 2000.
  • [6] Murray Elder, Mark Kambites, and Gretchen Ostheimer. On groups and counter automata. International Journal of Algebra and Computation, 18(08):1345–1364, 2008.
  • [7] Gillian Z. Elston and Gretchen Ostheimer. On groups whose word problem is solved by a counter automaton. Theoretical Computer Science, 320(2–3):175 – 185, 2004.
  • [8] Patrick C. Fischer, Albert R. Meyer, and Arnold L. Rosenberg. Real time counter machines. In Proceedings of the 8th Annual Symposium on Switching and Automata Theory (SWAT 1967), FOCS ’67, pages 148–154, 1967.
  • [9] J.B. Fraleigh and V.J. Katz. A first course in abstract algebra. Addison-Wesley world student series. Addison-Wesley, 2003.
  • [10] Ian Glaister and Jeffrey Shallit. Automaticity iii: Polynomial automaticity and context-free languages. Computational Complexity, 7(4):371–387, 1998.
  • [11] S. A. Greibach. Remarks on blind and partially blind one-way multicounter machines. Theoretical Computer Science, 7:311–324, 1978.
  • [12] Rostislav I Grigorchuk. On growth in group theory. In Proceedings of the International Congress of Mathematicians, volume 1, pages 325–338, 1990.
  • [13] Oscar H. Ibarra, Sartaj K. Sahni, and Chul E. Kim. Finite automata with multiplication. Theoretical Computer Science, 2(3):271 – 294, 1976.
  • [14] Mark Kambites. Formal languages and groups as memory. Communications in Algebra, 37(1):193–208, 2009.
  • [15] Mikhail I. Kargapolov and Ju. I. Merzljakov. Fundamentals of the Theory of Groups. Springer-Verlag, 1979.
  • [16] Pierre De La Harpe. Topics in geometric group theory. The University Of Chicago Press, Chicago, 2000.
  • [17] Roger C. Lyndon and Paul E. Schupp. Combinatorial Group Theory. Springer-Verlag, 1977.
  • [18] Victor Mitrana and Ralf Stiebe. The accepting power of finite automata over groups. In New Trends in Formal Languages, pages 39–48. Springer-Verlag, 1997.
  • [19] Elaine Render. Rational monoid and semigroup automata. PhD thesis, University of Manchester, 2010.
  • [20] Stanisław Żak. A Turing machine time hierarchy. Theoretical Computer Science, 26(3):327 – 333, 1983.
  • [21] Georg Zetzsche. Silent transitions in automata with storage. In International Colloquium on Automata, Languages, and Programming, pages 434–445. Springer Berlin Heidelberg, 2013.