\secondaddress

Università di Roma “La Sapienza”, Dipartimento di Matematica, Piazzale Aldo Moro 2, 00185 Roma, Italy

\sameaddress

LANGUAGE CLASSES ASSOCIATED WITH AUTOMATA OVER MATRIX GROUPS

Özlem Salehi ozlem.salehi@boun.edu.tr & say@boun.edu.tr Boğaziçi University, Department of Computer Engineering, Bebek 34342 İstanbul, Turkey , Flavio D’Alessandro Boğaziçi University, Department of Mathematics, Bebek 34342, İstanbul, Turkey dalessan@mat.uniroma1.it and A. C. Cem Say

(Date: …)

Abstract.

We investigate the language classes recognized by group automata over matrix groups. For the case of $2\times 2$ matrices, we prove that the corresponding group automata for rational matrix groups are more powerful than the corresponding group automata for integer matrix groups. Finite automata over some special matrix groups, such as the discrete Heisenberg group and the Baumslag-Solitar group are also examined. We also introduce the notion of time complexity for group automata and demonstrate some separations among related classes. The case of linear-time bounds is examined in detail throughout our repertory of matrix group automata.

Key words and phrases:

group automata, time complexity

1991 Mathematics Subject Classification:

68Q45, 68Q05

This work was supported by Boğaziçi University Research Fund under grant number 11760. A preliminary version of this work was presented at the 8’th Workshop on Non-Classical Models of Automata and Applications (NCMA), Debrecen, Hungary, August 29-30, 2016.

Özlem Salehi was partially supported by TÜBİTAK (Scientific and Technological Research Council of Turkey).

The research of F. D’Alessandro was supported by a EC-FP7 Marie Curie-TÜBİTAK Co-Funded Brain Circulation Scheme Project 2236 Fellowship.

1. Introduction

Many extensions of the classical finite automaton model have been examined. One such variant is the group automaton (finite automaton over groups), which is a nondeterministic finite automaton equipped with a register that holds an element from a group [18]. The register is initialized to the identity element of the group, and a computation is deemed successful if the register is equal to the identity element at the end of the computation after being multiplied at every step. This setup generalizes various models such as nondeterministic blind multicounter automata [8] and finite automata with multiplication [13].

The theory of group automata has been essentially developed in the case of free groups [5, 4, 14], and in the case of free Abelian groups [7, 6], where strong theorems allow to characterize the power of such models and the combinatorial properties of the languages recognized by these automata. For groups that are not of the types mentioned above, even in the case of groups of matrices of low dimension, the study of group automata quickly beomes nontrivial, and there are remarkable classes of linear groups for which little is known about the automaton models that they define.

In this paper, we present several new results about the classes of languages recognized by finite automata over matrix groups. We focus on matrix groups with integer and rational entries. For the case of $2\times 2$ matrices, we prove that the corresponding group automata for rational matrix groups are more powerful than the corresponding group automata for integer matrix groups. We also explore finite automata over some special matrix groups, such as the discrete Heisenberg group and the Baumslag-Solitar group. The “zoo” of language classes associated with different groups is presented, visualizing known relationships and open problems.

We also introduce the notion of time complexity for group automata, and use this additional dimension to analyze the relationships among the language families of various automata using different groups. We develop a method for proving that automata over matrix groups where the growth rate of the group and the time are bounded can not recognize certain languages, even if one uses a very weak definition of time-bounded computation, and use this to demonstrate some new relationships between time-bounded versions of our language classes. The case of linear-time bounds is examined in detail throughout our repertory of matrix groups.

2. Preliminaries

2.1. Notation and terminology

The following notation will be used throughout the paper: $Q$ is the set of states, $q_{0}\in Q$ denotes the initial state, $Q_{a}\subseteq Q$ denotes the set of accepting states, and $\Sigma$ is the input alphabet.

By $w^{r}$ , we represent the reverse of the string $w$ . The length of $w$ is denoted by $|w|$ .

$\mathsf{REG}$ , $\mathsf{CF}$ , and $\mathsf{RE}$ denote the families of regular languages, context-free languages, and recursively enumerable languages, respectively.

We assume a familiarity with some basic notions from algebra and group theory (see [9],[17] for references on this topic). For a finitely generated group $G$ and a set $X$ of generators, the word problem language of $G$ is the language $W(G,X)$ over $\Sigma=\{X\cup X^{-1}\}$ which consists of all words that represent the identity element of $G$ . Most of the time, the statements about the word problem are independent of the generating set and in these cases the word problem language is denoted by $W(G)$ . For a string $w=w_{1}w_{2}\dots w_{n}\in W(G)$ , $w^{-1}=w_{n}^{-1}\dots w_{1}^{-1}$ where each $w_{i}\in\Sigma$ represents a generator.

2.2. Group automata

Group automata first appear explicitly in the paper [18] under the name of extended finite automaton. The definition is formally given as follows.

Let $K=(M,\circ,e)$ be a group under the operation denoted by $\circ$ with the neutral element denoted by $e$ . An extended finite automaton over the group $K=(M,\circ,e)$ is a 6-tuple

\mathcal{F}=(Q,\Sigma,K,\delta,q_{0},Q_{a}),

where the transition function $\delta$ is defined as

\delta:Q\times(\Sigma\cup\{\varepsilon\})\rightarrow\mathbb{P}(Q\times M).

$\delta(q,\sigma)\ni(q^{\prime},m)$ means that when $\mathcal{F}$ reads the symbol (or empty string) $\sigma\in\Sigma\cup\{\varepsilon\}$ in state $q$ , it can move to state $q^{\prime}$ , and write $x\circ m$ in the register, where $x$ is the old content of the register. The initial value of the register is the neutral element $e$ of the group $K$ . The string is accepted if, after completely reading the string, $\mathcal{F}$ enters an accept state with the content of the register being equal to the neutral element of $K$ .

We will prefer using the name group automaton ( $G$ -automaton) instead of extended finite automaton over group $G$ .

Monoid automata are defined analogously where the group $G$ is replaced by some monoid $N$ .

The class of languages recognized by $G$ -automata will be denoted as $\mathfrak{L}(G)$ .

3. Matrix groups and associated language classes

In this section, we are going to prove some new results about the classes of languages recognized by finite automata over various groups, focusing on linear groups.

3.1. Basic results

We will denote the free group over $r$ generators by $\mathbf{F}_{r}$ . Note that $\mathbf{F}_{0}$ is the trivial group, and $\mathbf{F}_{1}$ is isomorphic to $\mathbb{Z}$ , the additive group of integers. The class of regular languages is characterized as the set of languages recognized by finite automata over the trivial group $\mathbf{F}_{0}$ in [5].

The relation between the classes of languages recognized by free group automata is summarized as follows.

Fact \thethrm.

[5] $\mathsf{REG}=\mathfrak{L}(\mathbf{F}_{0})\subsetneq\mathfrak{L}(\mathbf{F}_{1})=\mathfrak{L}(\mathbb{Z})\subsetneq\mathfrak{L}(\mathbf{F}_{2})$ .

A characterization of context-free languages by group automata was first stated by Dassow and Mitrana [5], and proven in [4]. Let us note that $\mathbf{F}_{2}$ contains any free group of rank $n\geq 2$ [17].

Fact \thethrm.

[5, 4, 14] $\mathfrak{L}(\mathbf{F}_{2})$ is the family of context-free languages.

We will denote by $\mathbb{Z}^{k}$ the additive group of integer vectors of dimension $k$ . This group is isomorphic to the free Abelian group of rank $k$ , and $\mathbb{Z}^{k}$ -automata are equivalent to nondeterministic blind $k$ -counter automata [11].

The following result states the hierarchy between the classes of languages recognized by $\mathbb{Z}^{k}$ -automata. This result also follows from the hierarchy between the class of languages recognized by nondeterministic blind $k$ -counter automata.

Fact \thethrm.

[3] $\mathfrak{L}(\mathbb{Z}^{k})\subsetneq\mathfrak{L}(\mathbb{Z}^{k+1})$ for $k\geq 1$ .

We denote by $\mathbb{Q}^{+}$ the multiplicative group of positive rational numbers, which is isomorphic to a free Abelian group of infinite rank. A $\mathbb{Q}^{+}$ -automaton is also equivalent to a nondeterministic finite automaton with multiplication without equality (1NFAMW) of Ibarra et al. [13].

The following fact characterizes the class of languages recognized for the case where the alphabet is unary,

Fact \thethrm.

[13] All 1NFAMW-recognizable languages over a unary alphabet are regular.

Let us mention that the class of context-free languages and the class of languages recognized by nondeterministic blind counter automata are incomparable.

Fact \thethrm.

$\mathsf{CF}$ and $\mathfrak{L}(\mathbb{Z}^{k})$ are incomparable for all $k\geq 2$ .

Proof.

Consider the language $\mathtt{L}=\{a^{n}b^{n}|n\geq 0\}$ which is a context-free language. Since context-free languages are closed under star, $\mathtt{L}^{*}$ is a context-free language whereas it cannot be recognized by any $\mathbb{Z}^{k}$ -automaton for all $k\geq 1$ by [11]. On the other hand, the non-context-free language $\mathtt{L}^{\prime}=\{a^{n}b^{n}c^{n}|n\geq 0\}$ can be recognized by a $\mathbb{Z}^{2}$ -automaton. ∎

3.2. Automata on groups of $2\times 2$ and $3\times 3$ matrices

We denote by $GL(2,\mathbb{Z})$ the general linear group of degree two over the field of integers, that is, the group of $2\times 2$ invertible matrices with integer entries. Note that these matrices have determinant $\pm 1$ . Restricting the matrices in $GL(2,\mathbb{Z})$ to those that have determinant 1, we obtain the special linear group of degree two over the field of integers, $SL(2,\mathbb{Z})$ .

Let $\mathbf{G}$ be the group generated by the matrices

M_{a}=\left[\begin{array}[]{cc}1&2\\ 0&1\\ \end{array}\right]~~~\mbox{and}~~~M_{b}=\left[\begin{array}[]{cc}1&0\\ 2&1\\ \end{array}\right].

There exists an isomorphism $\varphi$ from $\mathbf{F}_{2}$ onto $\mathbf{G}$ by [15]. Note that $M_{a}$ and $M_{b}$ are integer matrices with determinant 1, which proves that $\mathbf{F}_{2}$ is a subgroup of $SL(2,\mathbb{Z})$ .

Now the question is whether $\mathfrak{L}(GL(2,\mathbb{Z}))$ and $\mathfrak{L}(SL(2,\mathbb{Z}))$ correspond to larger classes of languages than the class of context-free languages. We are going to use the following fact to prove that the answer is negative.

Fact \thethrm.

[4] Suppose $G$ is a finitely generated group and $H$ is a subgroup of finite index. Then $\mathfrak{L}(G)=\mathfrak{L}(H)$ .

Now we are ready to state our theorem.

{thrm}

$\mathsf{CF}=\mathfrak{L}(\mathbf{F}_{2})=\mathfrak{L}(SL(2,\mathbb{Z}))=\mathfrak{L}(GL(2,\mathbb{Z}))$ .

Proof.

We are going to use Fact 3.2 to prove the result. Since $SL(2,\mathbb{Z})$ has index 2 in $GL(2,\mathbb{Z})$ and $GL(2,\mathbb{Z})$ is finitely generated, $\mathfrak{L}(GL(2,\mathbb{Z}))=\mathfrak{L}(SL(2,\mathbb{Z}))$ . Since $\mathbf{F}_{2}$ has index 12 in $SL(2,\mathbb{Z})$ [2] and $SL(2,\mathbb{Z})$ is finitely generated, $\mathfrak{L}(SL(2,\mathbb{Z}))=\mathfrak{L}(\mathbf{F}_{2})$ which is equal to the family of context-free languages by Fact 3.1. ∎

Let us now investigate the group $SL(3,\mathbb{Z})$ , the group of $3\times 3$ integer matrices with determinant $1$ .

We start by looking at an important subgroup of $SL(3,\mathbb{Z})$ , the discrete Heisenberg group. The discrete Heisenberg group $\mathbf{H}$ is defined as $\langle a,b|ab=bac,ac=ca,bc=cb\rangle$ , where $c=a^{-1}b^{-1}ab$ is the commutator of $a$ and $b$ .

a=\left[\begin{array}[]{ccc}1&1&0\\ 0&1&0\\ 0&0&1\end{array}\right]~~~b=\left[\begin{array}[]{ccc}1&0&0\\ 0&1&1\\ 0&0&1\end{array}\right]~~~c=\left[\begin{array}[]{ccc}1&0&1\\ 0&1&0\\ 0&0&1\end{array}\right]

Any element $g\in\mathbf{H}$ can be written uniquely as $b^{j}a^{i}c^{k}$ .

g=\left[\begin{array}[]{ccc}1&i&k\\ 0&1&j\\ 0&0&1\end{array}\right]=b^{j}a^{i}c^{k}

It is shown in [19] that the languages $\mathtt{MULT}=\{x^{p}y^{q}z^{pq}|p,q\geq 0\}$ , $\mathtt{COMPOSITE}=\{x^{pq}|p,q>1\}$ and $\mathtt{MULTIPLE}=\{x^{p}y^{pn}|p\in\mathbb{N}\}$ can be recognized by $\mathbf{H}$ -automata, using the special multiplication property of the group.

Correcting a small error in [19], we rewrite the multiplication property of the elements of $\mathbf{H}$ .

(b^{x}a^{y}c^{z})(b^{x^{\prime}}a^{y^{\prime}}c^{z^{\prime}})=b^{x+x^{\prime}}a^{y+y^{\prime}}c^{z+z^{\prime}+yx^{\prime}}

We can make the following observation using the fact that $\mathfrak{L}(\mathbf{H})$ contains non-context-free languages.

{thrm}

$\mathfrak{L}(SL(2,\mathbb{Z}))\subsetneq\mathfrak{L}(SL(3,\mathbb{Z}))$ .

Proof.

It is obvious that an $SL(2,\mathbb{Z})$ -automaton can be simulated by an $SL(3,\mathbb{Z})$ -automaton. Note that $\mathfrak{L}(SL(2,\mathbb{Z}))$ is the family of context-free languages by Theorem 3.2. Since $\mathfrak{L}(\mathbf{H})\subsetneq\mathfrak{L}(SL(3,\mathbb{Z}))$ and the non-context-free language $\mathtt{MULT}=\{x^{p}y^{q}z^{pq}|p,q\geq 0\}$ can be recognized by an $\mathbf{H}$ -automaton [19], the result follows. ∎

The following result is a direct consequence of Fact 3.2.

{thrm}

$\mathfrak{L}(SL(3,\mathbb{Z}))=\mathfrak{L}(GL(3,\mathbb{Z}))$ .

Proof.

Since $GL(3,\mathbb{Z})$ is a finitely generated group and $SL(3,\mathbb{Z})$ has finite index in $GL(3,\mathbb{Z})$ , the result follows by Fact 3.2. ∎

We have talked about the discrete Heisenberg group H. Now let us look at a subgroup of $\mathbf{H}$ generated by the matrices $B$ and $C$ , which we will call $\mathbf{G_{2}}$ .

B=\left[\begin{array}[]{ccc}1&0&0\\ 0&1&1\\ 0&0&1\end{array}\right]~~~C=\left[\begin{array}[]{ccc}1&0&1\\ 0&1&0\\ 0&0&1\end{array}\right]~~~

$\mathbf{G_{2}}=\langle B,C|BC=CB\rangle$ is a free Abelian group of rank 2, and therefore it is isomorphic to $\mathbb{Z}^{2}$ .

We conclude the following about the language recognition power of $\mathbb{Z}^{2}$ and $\mathbf{H}$ .

{thrm}

$\mathfrak{L}(\mathbb{Z}^{2})\subsetneq\mathfrak{L}(\mathbf{H})$ .

Proof.

Since $\mathbb{Z}^{2}$ is a subgroup of $\mathbf{H}$ , $\mathfrak{L}(\mathbb{Z}^{2})\subseteq\mathfrak{L}(\mathbf{H})$ follows. The inclusion is proper since $\mathbf{H}$ can recognize the language $\mathtt{MULT}=\{x^{p}y^{q}z^{pq}|p,q\geq 0\}$ [19], whereas any bounded language in $\mathfrak{L}(\mathbb{Q}^{+})$ is semilinear [13]. ∎

Now let us move on to the discussion about matrix groups with rational entries.

Let us denote by $GL(2,\mathbb{Q})$ the general linear group of degree two over the field of rational numbers, that is, the group of invertible matrices with rational entries. Restricting the matrices in $GL(2,\mathbb{Q})$ to those that have determinant 1, we obtain the special linear group of degree two over the field of rationals, $SL(2,\mathbb{Q})$ .

We will start by proving that allowing rational entries enlarges the class of languages recognized by matrices with determinant 1.

{thrm}

$\mathfrak{L}(SL(2,\mathbb{Z}))\subsetneq\mathfrak{L}(SL(2,\mathbb{Q}))$ .

Proof.

It is obvious that $\mathfrak{L}(SL(2,\mathbb{Z}))\subseteq\mathfrak{L}(SL(2,\mathbb{Q}))$ . We will prove that the inclusion is proper.

Let us construct an $SL(2,\mathbb{Q})$ -automaton $\mathcal{G}$ recognizing the language $\mathtt{L}=\{a^{2^{2n+1}}|$ $n\geq 0\}$ . The state diagram of $\mathcal{G}$ and the matrices are given in Figure 1. Without scanning any input symbol, $\mathcal{G}$ first multiplies its register with the matrix $A_{1}$ . $\mathcal{G}$ then multiplies its register with the matrix $A_{2}$ successively until nondeterministically moving to the next state. After that point, $\mathcal{G}$ starts reading the string and multiplies its register with the matrix $A_{3}$ for each scanned $a$ . At some point, $\mathcal{G}$ nondeterministically stops reading the rest of the string and multiplies its register with the matrix $A_{4}$ . After successive multiplications with $A_{4}$ , $\mathcal{G}$ nondeterministically decides moving to an accept state.

Refer to caption — Figure 1. State diagram of $\mathcal{G}$ accepting the language $\mathtt{L}=\{a^{2^{2n+1}}|n\geq 0\}$

Let us trace the value of the register at different stages of the computation. Before reading the first input symbol, the register has the value

\left[\begin{array}[]{cc}2^{x+1}&0\\ 2^{x}&\frac{1}{2^{x+1}}\\ \end{array}\right]

as a result of the multiplications with the matrix $A_{1}$ and $x$ times the matrix $A_{2}$ . Multiplication with each $A_{3}$ leaves $2^{x+1}$ and $\frac{1}{2^{x+1}}$ unchanged while subtracting $\frac{1}{2^{x+1}}$ from $2^{x}$ for each scanned $a$ . As a result of $y$ multiplications with $A_{3}$ , the register will have the value

\left[\begin{array}[]{cc}2^{x+1}&0\\ 2^{x}-\frac{y}{2^{x+1}}&\frac{1}{2^{x+1}}\\ \end{array}\right].

For the rest of the computation, $\mathcal{G}$ will multiply its register with $A_{4}$ until nondeterministically moving to the final state. As a result of $z$ multiplications with $A_{4}$ , the register will have the value

\left[\begin{array}[]{cc}\frac{2^{x+1}}{2^{z}}&0\\ \bigl{(}2^{x}-\frac{y}{2^{x+1}}\bigr{)}\frac{1}{2^{z}}&\frac{2^{z}}{2^{x+1}}\\ \end{array}\right].

The final value of the register is equal to the identity matrix when $y=2^{2x+1}$ and $z=x+1$ , which is possible only when the length of the input string is $2^{2x+1}$ for some $x\geq 0$ . In the successful branch, the register will be equal to the identity matrix and $\mathcal{G}$ will end up in the final state having successfully read the input string. For input strings which are not members of $\mathtt{L}$ , either the computation will end before reading the whole input string, or the final state will be reached with the register value not equaling the identity matrix.

Since the matrices used during the computation are 2 by 2 rational matrices with determinant 1, $\mathtt{L}\in\mathfrak{L}(SL(2,\mathbb{Q}))$ . $\mathfrak{L}(SL(2,\mathbb{Q}))$ contains a unary nonregular language, which is not true for $\mathfrak{L}(SL(2,\mathbb{Z}))$ by Theorem 3.2 and we conclude the result. ∎

Let us note that the set of languages recognized by $\mathbb{Q}^{+}$ -automata is a proper subset of the set of languages recognized by $SL(2,\mathbb{Q})$ -automata, which can be concluded with the help of the following fact.

{thrm}

$\mathfrak{L}(\mathbb{Q}^{+})\subsetneq\mathfrak{L}(SL(2,\mathbb{Q}))$ .

Proof.

Let $\mathtt{L}\in\mathfrak{L}(\mathbb{Q}^{+})$ and let $\mathcal{G}$ be a $\mathbb{Q}^{+}$ -automaton recognizing $\mathtt{L}$ . We will construct an $SL(2,\mathbb{Q})$ -automaton $\mathcal{G}^{\prime}$ recognizing $\mathtt{L}$ . Let $S=\{s_{1},\dots,s_{n}\}$ be the set of elements multiplied with the register during the computation of $\mathcal{G}$ . We define the mapping $\varphi$ as follows.

\varphi:s_{i}\mapsto\left[\begin{array}[]{cc}s_{i}&0\\ 0&\frac{1}{s_{i}}\\ \end{array}\right]~~~

The elements $\varphi(s_{i})$ are $2\times 2$ rational matrices with determinant 1. Let $\delta$ and $\delta^{\prime}$ be the transition functions of $\mathcal{G}$ and $\mathcal{G}^{\prime}$ respectively. We let $(q^{\prime},s_{i})\in\delta(q,\sigma)\iff(q^{\prime},\varphi(s_{i}))\in\delta^{\prime}(q,\sigma)$ for every $q,q^{\prime}\in Q$ , $\sigma\in\Sigma$ and $s_{i}\in S$ . The resulting $\mathcal{G}^{\prime}$ recognizes $\mathtt{L}$ .

The inclusion is proper since $\mathtt{L}=\{a^{2^{2n+1}}|n\geq 0\}\in\mathfrak{L}(SL(2,\mathbb{Q}))$ by Theorem 3.2, and $\mathfrak{L}(\mathbb{Q}^{+})$ does not contain any unary nonregular languages by Fact 3.1, noting that $\mathbb{Q}^{+}$ -automata are equivalent to 1NFAMW’s. ∎

We will now look at a special subgroup of $GL(2,\mathbb{Q})$ .

For two integers $m$ and $n$ , the Baumslag-Solitar group $BS(m,n)$ is defined as $BS(m,n)=\langle a,b|ba^{m}b^{-1}=a^{n}\rangle$ . We are going to focus on $BS(1,2)=\langle a,b|bab^{-1}=a^{2}\rangle$ .

Consider the matrix group $G_{BS}$ generated by the matrices

A=\left[\begin{array}[]{cc}1&0\\ -1&1\\ \end{array}\right]~~~\mbox{and}~~~B=\left[\begin{array}[]{cc}1/2&0\\ 0&1\\ \end{array}\right].

Consider the isomorphism $a\mapsto A$ , $b\mapsto B$ . The matrices $A$ and $B$ satisfy the property $BAB^{-1}=A^{2}$ ,

\left[\begin{array}[]{cc}1/2&0\\ 0&1\\ \end{array}\right]\left[\begin{array}[]{cc}1&0\\ -1&1\\ \end{array}\right]\left[\begin{array}[]{cc}2&0\\ 0&1\\ \end{array}\right]=\left[\begin{array}[]{cc}1&0\\ -2&1\\ \end{array}\right],

and we conclude that $G_{BS}$ is isomorphic to $BS(1,2)$ .

We will prove that there exists a $BS(1,2)$ -automaton which recognizes a non-context-free language.

{thrm}

$\mathfrak{L}(BS(1,2))\nsubseteq\mathsf{CF}$ .

Proof.

Let us construct a $BS(1,2)$ -automaton $\mathcal{G}$ recognizing the language $\mathtt{UPOW}=\{a^{2^{n}}|n\geq 0\}$ . The state diagram of $\mathcal{G}$ and the matrices are given in Figure 2. Without scanning any input symbol, $\mathcal{G}$ multiplies its register with the matrix $A_{1}$ successively. $\mathcal{G}$ nondeterministically moves to the next state reading the first input symbol without modifying the register. After that point, $\mathcal{G}$ starts reading the string and multiplies its register with the matrix $A_{2}$ for each scanned $a$ . At some point, $\mathcal{G}$ nondeterministically stops reading the rest of the string and multiplies its register with the element $A_{3}$ . After successive multiplications with $A_{3}$ , $\mathcal{G}$ nondeterministically decides to move to an accept state.

As a result of $i$ multiplications with $A_{1}$ , the register has the value

\left[\begin{array}[]{cc}2^{i}&0\\ 2^{i}-1&1\\ \end{array}\right]

before reading the first input symbol. Multiplication with each $A_{2}$ leaves $2^{i}$ unchanged while subtracting 1 from $2^{i}-1$ for each scanned $a$ . The register will have the value

\left[\begin{array}[]{cc}2^{i}&0\\ 2^{i}-1-j&1\\ \end{array}\right]

as a result of $j$ multiplications with the matrix $A_{2}$ .

For the rest of the computation, $\mathcal{G}$ will multiply its register with $A_{3}$ resulting in the register value

\left[\begin{array}[]{cc}\frac{2^{i}}{2^{k}}&0\\ 2^{i}-1-j&1\\ \end{array}\right]

since each multiplication with $A_{3}$ divides $2^{i}$ by 2.

The register contains the identity matrix at the end of the computation if $i=k$ and $j=2^{i}-1$ which is possible if the input string is of the form $a^{1+2^{i}-1}=a^{2^{i}}$ . In the successful branch, the register will be equal to the identity matrix and $\mathcal{G}$ will end up in the final state having successfully read the input string.

For input strings which are not members of $\mathtt{UPOW}$ , either the computation will end before reading the whole input string or the final state will be reached with the register value being different from the identity matrix. Note that $A_{1}=B^{-1}A^{-1}$ , $A_{2}=A$ and $A_{3}=B$ , where $A$ and $B$ are the generators of the group $G_{BS}$ and recall that $G_{BS}$ is isomorphic to $BS(1,2)$ . Since $\mathtt{UPOW}$ is a unary nonregular language, it is not context-free and we conclude the result. ∎

Note that $\mathfrak{L}(\mathbb{Z})\subsetneq\mathfrak{L}(BS(1,2))$ since the subgroup generated by $a$ in $BS(1,2)$ is isomorphic to $\mathbb{Z}$ and $\mathfrak{L}(BS(1,2))$ contains a unary nonregular language.

3.3. Automata on matrices of higher dimensions

In [18], it is proven that $\mathbf{F}_{2}\times\mathbf{F}_{2}$ -automata are as powerful as Turing machines.

Fact \thethrm.

[18] $\mathfrak{L}(\mathbf{F}_{2}\times\mathbf{F}_{2})$ is the family of recursively enumerable languages.

We make the following observation.

{thrm}

$\mathsf{RE}=\mathfrak{L}(\mathbf{F}_{2}\times\mathbf{F}_{2})=\mathfrak{L}(SL(4,\mathbb{Z}))$ .

Proof.

The first equality is Fact 3.3. Recall from Section 3.2 that $\varphi$ is an isomorphism from $\mathbf{F}_{2}$ onto $\mathbf{G}$ , the matrix group generated by the matrices $M_{a}$ and $M_{b}$ . Let $\mathbf{G}^{\prime}$ be the following group of matrices

\left\{\left[\begin{array}[]{clll}\lx@intercol\hbox{\multirowsetup$M_{1}$}\hfil\lx@intercol&0&0\\ &&0&0\\ 0&0&\lx@intercol\hfil\hbox{\multirowsetup $M_{2}$}\hfil\lx@intercol\\ 0&0&&\\ \end{array}\right],\ M_{1},\ M_{2}\in\mathbf{G}\right\}.

We will define the mapping $\psi:\mathbf{F}_{2}\times\mathbf{F}_{2}\rightarrow\mathbf{G}^{\prime}$ as $\psi(g_{1},g_{2})=(\varphi(g_{1}),\varphi(g_{2}))$ for all $(g_{1},g_{2})\in\mathbf{F}_{2}\times\mathbf{F}_{2}$ which is an isomorphism from $\mathbf{F}_{2}\times\mathbf{F}_{2}$ onto $\mathbf{G}^{\prime}$ .

This proves that $\mathbf{F}_{2}\times\mathbf{F}_{2}$ is isomorphic to a subgroup of $SL(4,\mathbb{Z})$ . The fact that $\mathfrak{L}(\mathbf{F}_{2}\times\mathbf{F}_{2})$ is the set of recursively enumerable languages helps us to conclude that $\mathfrak{L}(SL(n,\mathbb{Z}))$ is the set of recursively enumerable languages for $n\geq 4$ . ∎

Let us also state that the classes of languages recognized by automata over supergroups of $SL(4,\mathbb{Z})$ such as $GL(4,\mathbb{Z})$ or $SL(4,\mathbb{Q})$ are also identical to the class of recursively enumerable languages. {thrm} $\mathfrak{L}(G)=\mathsf{RE}$ , where $G$ is any matrix group whose matrix entries are computable numbers and $SL(4,\mathbb{Z})$ is a subgroup of $G$ .

Proof.

Note that any finite automaton over a matrix group can be simulated by a nondeterministic Turing machine which keeps track of the register simply by multiplying the matrices and checking whether the identity matrix is reached at the end of the computation, provided that the matrix entries are computable numbers. Since $\mathsf{RE}=\mathfrak{L}(SL(4,\mathbb{Z}))$ and $G$ contains $SL(4,\mathbb{Z})$ as a subgroup, $\mathfrak{L}(G)$ is the set of recursively enumerable languages. ∎

We summarize the results in Figure 3. Solid arrows represent proper inclusion, dashed arrows represent inclusion and dashed lines represent incomparability.

4. Time complexity

In the previous section, we compared various automaton models solely on the basis of the groups they employed as a computational resource. The theory of computational complexity deals with various different types of such resources, the allowed runtime of the machines being the most prominent among them. Some of the automata we saw in Section 3 (e.g. Figure 2) have arbitrarily long computations, and it is a legitimate question to ask whether our results, for instance, the relationships in Figure 3, would still hold if one imposed common time bounds on the automata. We study such questions in this section.

4.1. Definitions

A group automaton $\mathcal{G}$ recognizing language $\mathtt{L}$ is said to be strongly $t(n)$ time-bounded if for any input string $x$ with $|x|=n$ , every computation of $\mathcal{G}$ on $x$ takes at most $t(n)$ steps. We will denote the set of languages recognized by strongly $t(n)$ -time bounded $G$ -automata by $\mathfrak{L}(G)_{t(n)}^{s}$ .

Although the strong mode of recognition defined above is standard in studies of time complexity, we will be able to prove the impossibility results of the next subsection even when the machines are subjected to the following, looser requirement:

A group automaton $\mathcal{G}$ recognizing language $\mathtt{L}$ is said to be weakly $t(n)$ time-bounded if for each accepted input string $x\in\mathtt{L}$ with $|x|=n$ , $\mathcal{G}$ has a successful computation which takes at most $t(n)$ steps. So any input string is allowed to cause longer computations, as long as none of those are accepting for inputs which are not members of $\mathtt{L}$ . We will denote the set of languages recognized by weakly $t(n)$ -time bounded $G$ -automata by $\mathfrak{L}(G)_{t(n)}^{w}$ .

A machine is real-time if every transition consumes an input symbol.

Note that the statement $\mathfrak{L}(G)_{t(n)}^{s}\subseteq\mathfrak{L}(G)_{t(n)}^{w}$ is true by definition.

Let $X$ be a generator set for the group $G$ . The length of $g\in G$ , denoted $|g|_{X}$ , is the length of the shortest representative for $g$ in $(X\cup X^{-1})^{*}$ . Let $B^{X}_{G}(n)=\{g\in G,|g|_{X}\leq n\}$ be the set of all elements in $G$ which can be represented by a word of length at most $n$ . The growth function of a group $G$ with respect to a generating set $X$ , denoted $g^{X}_{G}(n)$ , is the cardinality of the set $B^{X}_{G}(n)$ , that is $g^{X}_{G}(n)=|B^{X}_{G}(n)|$ . The growth function is asymptotically independent of the generating set, and we will denote the growth function of a group $G$ by $g_{G}(n)$ .

For a positive integer $n$ , two strings $w,w^{\prime}\in\Sigma^{*}$ are $n$ -dissimilar for $\mathtt{L}$ if $|w|\leq n$ , $|w^{\prime}|\leq n$ , and there exists a string $v\in\Sigma^{*}$ with $|wv|\leq n$ , $|w^{\prime}v|\leq n$ such that $wv\in\mathtt{L}$ iff $w^{\prime}v\notin\mathtt{L}$ . Let $A_{\mathtt{L}}(n)$ be the maximum $k$ such that there exist $k$ distinct strings that are pairwise $n$ -dissimilar.

A finite set of strings $S$ is said to be a set of uniformly $n$ -dissimilar strings for $\mathtt{L}$ if for each string $w\in S$ , there exists a string $v$ such that $|wv|\leq n$ and $wv\in\mathtt{L}$ and for any string $w^{\prime}\in S$ such that $w\neq w^{\prime}$ , $|w^{\prime}v|\leq n$ and $w^{\prime}v\notin\mathtt{L}$ . Let $U_{\mathtt{L}}(n)$ be the maximum $k$ such that there exist $k$ distinct strings that are uniformly $n$ -dissimilar.

Note that the following is always true by definition, since the strings in a uniformly $n$ -dissimilar set are pairwise $n$ -dissimilar.

{lmm}

$U_{\mathtt{L}}(n)\leq A_{\mathtt{L}}(n)$ for all $n\geq 0$ .

4.2. Limitations of machines on slow groups running in short time

{thrm}

Let $G$ be a group with growth function $g_{G}(n)$ . $\mathtt{L}\notin\mathfrak{L}(G)_{t(n)}^{w}$ if $g_{G}(t(n))\in o(U_{\mathtt{L}}(n))$ .

Proof.

Suppose for a contradiction that there exists a weakly $t(n)$ time-bounded $G$ -automaton $\mathcal{G}$ recognizing $\mathtt{L}$ in time $t(n)$ . For a sufficiently large $n$ , let $S$ be the set of uniformly $n$ -dissimilar strings such that $|S|=U_{\mathtt{L}}(n)$ . For every string $w_{i}\in S$ , there exists a string $v_{i}$ such that $w_{i}v_{i}\in\mathtt{L}$ and $w_{j}v_{i}\notin\mathtt{L}$ for all $w_{j}\in S$ with $i\neq j$ .

Let $S_{acc}$ be the set of accepted extended strings of the form $w_{i}v_{i}\in\mathtt{L}$ with $|w_{i}v_{i}|\leq n$ where $w_{i}\in S$ and $w_{j}v_{i}\notin\mathtt{L}$ for all $w_{j}\in S$ with $i\neq j$ and $|w_{j}v_{i}|\leq n$ . Let $C$ be the set of $t(n)$ time bounded accepting computation paths for the strings in $S_{acc}$ . The computation $c_{w_{i}v_{i}}\in C$ on the string $w_{i}v_{i}$ can be written as

c_{w_{i}v_{i}}=c_{w_{i}v_{i}}^{w_{i}}c_{w_{i}v_{i}}^{v_{i}}

where $c_{w_{i}v_{i}}^{w_{i}}$ represents the computation up to the end of the prefix $w_{i}$ and $c_{w_{i}v_{i}}^{v_{i}}$ represents the rest of the computation on the string $v_{i}$ .

A configuration of a group automaton is a pair consisting of a state and a group element. Let us count the number of configurations that can be reached at the end of the computation $c_{w_{i}v_{i}}^{w_{i}}$ . Since the number of states is constant, the number of configurations that can be reached is dependent on the number of different group elements that can appear in the register. After reading a prefix $w_{i}$ with $|w_{i}|=m\leq n$ , the product of the labels on the edges can be given by $l=g_{i_{1}}g_{i_{2}}\dots g_{i_{k}}$ for some $k\leq t(m)$ , since the computation in consideration is time bounded. $l$ can be expressed as a product of $\kappa$ generators, where $\kappa$ is at most $C\cdot k$ for some constant $C$ , since each group element labeling a transition in $\mathcal{G}$ is composed of at most some constant number of generators, which is independent of the length of the string. The number of elements in $G$ which can be represented as a product of at most $\kappa$ generators is given by $g_{G}(\kappa)$ by the definition of the growth function of $G$ . Hence, the number of different values that can appear in the register after reading a string of length exactly $m$ is less than or equal to $g_{G}(\kappa)$ . Since $\kappa\leq C\cdot k$ and $k\leq t(m)$ and $g_{G}(t(n))\in o(U_{\mathtt{L}}(n))$ , we can conclude that

g_{G}(\kappa)\leq g_{G}(C\cdot t(m))\in o(U_{\mathtt{L}}(n)).

Now it is easy to see that the number of different configurations that can be reached at the end of a computation $c_{w_{i}v_{i}}^{w_{i}}$ is $o(U_{\mathtt{L}}(n))$ . Note that the cardinality of the set $S$ , and thus that of $S_{acc}$ , is equal to $U_{L}(n)$ . Due to the pigeonhole principle, the same configuration must be reached at the end of two computations $c_{w_{i}v_{i}}^{w_{i}}$ and $c_{w_{j}v_{j}}^{w_{j}}$ for some $i\neq j$ . This will result in the acceptance of the strings $w_{i}v_{j}$ and $w_{j}v_{i}$ , which are not members of $\mathtt{L}$ . We arrive at a contradiction and conclude that $\mathtt{L}$ cannot be recognized by any weakly $t(n)$ time-bounded $G$ -automaton. ∎

In the next lemma, we set a lower bound on maximum cardinality of the set of uniformly $n$ -dissimilar strings in the word problem language of some group $G$ .

{lmm}

Let $G$ be a finitely generated group with growth function $g_{G}(n)$ . Then $U_{W(G)}(n)\geq g_{G}(\lfloor\frac{n}{2}\rfloor)$ .

Proof.

Let $X$ be the generator set of $G$ . The number of distinct elements $g$ in $G$ which can be represented by a word of length less than or equal to $\lfloor\frac{n}{2}\rfloor$ is $g_{G}(\lfloor\frac{n}{2}\rfloor)$ , which is the cardinality of the set $B_{G}^{X}(\lfloor\frac{n}{2}\rfloor)=\{g\in G,|g|_{X}\leq\lfloor\frac{n}{2}\rfloor\}$ . Let $S$ be the set containing the string representations of the elements in $B_{G}^{X}(\lfloor\frac{n}{2}\rfloor)$ . Every $w_{i}\in S$ can be extended with $w_{i}^{-1}$ so that the extended string represents the identity element of $G$ and has length less than or equal to $n$ . Since the strings in $W(G)$ are those which belong to $(X\cup X^{-1})^{*}$ and represent the identity element of $G$ , the extended string $w_{i}w^{-1}_{i}\in W(G)$ . For every string $w_{j}\in S$ such that $i\neq j$ , $w_{j}w_{i}^{-1}\notin W(G)$ since it is not possible for $w_{j}w_{i}^{-1}$ to represent the identity element of $G$ . We conclude that the set $S$ is uniformly $n$ -dissimilar. Since $|S|=|B_{G}^{X}(\lfloor\frac{n}{2}\rfloor)|=g_{G}(\lfloor\frac{n}{2}\rfloor)$ , it follows that $U_{W(G)}(n)\geq g_{G}(\lfloor\frac{n}{2}\rfloor)$ . ∎

The following theorem is about the language recognition power of finite automata over polynomial-growth groups which are weakly polynomial time-bounded. {thrm} Let $G$ and $H$ be groups with polynomial and exponential growth functions $g_{G}(n)$ and $g_{H}(n)$ , respectively. For any polynomial $t(n)$ , $\mathfrak{L}(H)\nsubseteq\mathfrak{L}(G)_{t(n)}^{w}$ .

Proof.

Since $U_{W(H)}(n)\geq g_{H}(\lfloor\frac{n}{2}\rfloor)$ by Lemma 4.2, and $g_{H}(n)$ is an exponential function, $U_{W(H)}(n)$ is also at least exponential. $g_{G}(t(n))$ is a polynomial function, since both $g_{G}(n)$ and $t(n)$ are polynomial. Hence, $W(H)\notin\mathfrak{L}(G)_{t(n)}^{w}$ by Theorem 4.2, and the result follows since $W(H)$ is trivially in $\mathfrak{L}(H)$ . ∎

{thrm}

Let $G$ be a group with a polynomial growth function. For any polynomial $t(n)$ , $\mathsf{CF}\nsubseteq\mathfrak{L}(G)_{t(n)}^{w}$ .

Proof.

It is known that the word problem of the free group of rank 2, $W(\mathbf{F}_{2})$ , has an exponential growth function [12]. Assuming that $G$ is a group with polynomial growth function, $W(\mathbf{F}_{2})$ cannot be recognized by any weakly $t(n)$ time-bounded $G$ -automaton by Theorem 4.2. Since $W(\mathbf{F}_{2})$ is a context-free language, the proof is complete. ∎

4.3. Group automata under linear time bounds

In this section, we focus on linear-time computation.

Let $X$ be a generator set. For each symbol $x\in X$ , the functions $P_{x}$ and $Q_{x}$ are defined as follows.

	$\displaystyle P_{x}:X^{}\rightarrow X^{}\hskip 36.135ptw\mapsto wx$
	$\displaystyle Q_{x}:X^{}x\rightarrow X^{}\hskip 36.135ptwx\mapsto w$

Let $K_{X}$ be the submonoid of all partial functions on $X^{*}$ generated by $P_{x}$ and $Q_{x}$ for all $x\in X$ . $K_{X}$ is called the polycyclic monoid on $X$ . A $K_{X}$ -automaton is equivalent to a pushdown automaton, where $P_{x}$ and $Q_{x}$ can be interpreted as pushing and popping symbols on the stack. The equivalence between the two models is due to the nature of the functions $P_{x}$ and $Q_{x}$ , and is described in detail in [14]. The resemblance between the free group and $K_{X}$ is used to prove that $\mathfrak{L}(\mathbf{F}_{2})=\mathsf{CF}$ in [14] and [4].

Our aim is to show that $\mathbf{F}_{2}$ -automata working in linear time can recognize all context-free languages. It is stated in [21] that $K_{X}$ -automata which consume at least one input symbol at each step are as powerful as $K_{X}$ -automata without any time bound. However, it is not straightforward to see whether the same is true for $\mathbf{F}_{2}$ -automata.

{thrm}

$\mathfrak{L}(\mathbf{F}_{2})_{O(n)}^{w}=\mathsf{CF}$ .

Proof.

We are going to use the construction of Kambites [14] to prove that any context-free language can be recognized by a weakly linear-time bounded $\mathbf{F}_{2}$ -automaton.

Let $\mathtt{L}$ be a context-free language and let $\mathcal{M}=\{Q,\Sigma,K_{X},\delta,q_{0},Q_{a}\}$ be a polycyclic monoid automaton recognizing $\mathtt{L}$ . $K_{X}$ is the polycyclic monoid on $X$ where the cardinality of the set $X$ is $n$ for some $n\geq 2$ . Let $e$ be the identity element of $K_{x}$ . The construction of Kambites provides an $\mathbf{F}_{n+1}$ -automaton $\mathcal{G}=\{Q^{\prime},\Sigma,\mathbf{F}_{n+1},\delta^{\prime},q_{0}^{\prime},Q_{a}^{\prime}\}$ recognizing the language $\mathtt{L}$ . The generator set for $\mathbf{F}_{n+1}$ is $X^{\prime}$ , where $X^{\prime}=X\cup\#$ .

Let us analyze the construction in more detail.

•

$Q^{\prime}=Q_{-}\cup Q_{+}$ where $Q_{-}=\{q_{-}|q\in Q\}$ and $Q_{+}=\{q_{+}|q\in Q\}$
•

$q_{0}^{\prime}$ = $q_{+}$ where $q=q_{0}$ .
•

$Q_{a}^{\prime}=\{q_{-}|q\in Q_{a}\}$ .
•

$\delta^{\prime}(p_{+},\sigma)=(q_{+},x\#)$ if $\delta(p,\sigma)=(q,x\#)$ where $x$ is a positive generator for all $\sigma\in\Sigma$ .
•

$\delta^{\prime}(p_{-},\sigma)=(q_{+},x^{\prime}\#)$ if $\delta(p,\sigma)=(q,x^{\prime}\#)$ where $x^{\prime}$ is a negative generator for all $\sigma\in\Sigma$ .
•

$\delta^{\prime}(p_{+},\sigma)=(q_{+},e)$ if $\delta(p,\sigma)=(q,e)$ for all $\sigma\in\Sigma$ .
•

$\delta^{\prime}(q_{+},\epsilon)=(q_{-},e)$ for each $q\in Q$ .
•

$\delta^{\prime}(q_{-},\epsilon)=(q_{-},\#^{-1})$ for each $q\in Q$ .

We will prove that $\mathcal{G}$ actually runs in linear time. There are two transitions where the automaton is allowed to move without consuming any input symbols.

For each state $q\in Q$ , there are two states $q_{+}$ and $q_{-}$ in $\mathcal{G}$ which are connected with an edge labeled $(\epsilon,e)$ . These transitions do not change the register value, and cannot contribute more than half of the runtime of the machine, since at least one input symbol has to be consumed between any two executions of such transitions.

$\epsilon$ -loops exist in the machine $\mathcal{G}$ for each state $q_{-}$ where the loop is labeled by $(\epsilon,\#^{-1})$ . Although this looks worrisome at first for the purpose of bounding the runtime, the number of times these loops are traversed is actually bounded, as the following argument suggests. Suppose that the register is multiplied with $l_{1}$ , $l_{2}$ , $\cdots$ , $l_{m}$ while reading some input string $w$ of length $n$ , resulting in the register value $l=l_{1}l_{2}\cdots l_{m}(\#^{-1})^{k}$ , where $k\in\mathbb{N}$ , at the end of the computation. If $w$ is accepted by the machine, $l$ should satisfy the following, as well as being equal to the identity element:

l_{i}=\Biggl{\{}\begin{array}[]{lr}(\#^{-1})^{p}x_{i}\#\mbox{ for some }p\in\mathbb{N},&\mbox{if $x_{i}$ is a negative generator}\\ x_{i}\#,&\mbox{if $x_{i}$ is a positive generator}\\ \end{array}

This is called a permissible padding in [14]. By looking at the transition function of $\mathcal{G}$ , one can see that the register is multiplied by a $\#$ only when an input symbol is consumed. Hence, the number of $\#$ ’s that occur in $l$ is less than or equal to the length of the string. The register is multiplied with $\#^{-1}$ without consuming any input symbol. In order for the $\#$ ’s and $\#^{-1}$ ’s to cancel each other, they should be equal in number. Therefore, it can be concluded that the $\epsilon$ -loops are traversed at most $n$ times.

We can conclude that any context-free language can be recognized by a weakly linear-time bounded free group automaton. Since $\mathbf{F}_{2}$ contains every free group of countable rank, the proof is complete. ∎

We state the following theorem, which is the linear-time equivalent of Fact 3.2 [4].

{thrm}

Suppose $G$ is a finitely generated group and $H$ is a subgroup of finite index. Then $\mathfrak{L}(G)_{O(n)}^{w}=\mathfrak{L}(H)_{O(n)}^{w}$ .

Proof.

We know that the statement is true in general when there is no time bound by [4]. The proof in [4] still works when all automata in the constructions are required to work in linear time. ∎

Now we can show that Theorem 3.2 also holds for linear-time bounded group automaton.

{thrm}

$\mathsf{CF}=\mathfrak{L}(\mathbf{F}_{2})_{O(n)}^{w}=\mathfrak{L}(SL(2,\mathbb{Z}))_{O(n)}^{w}=\mathfrak{L}(GL(2,\mathbb{Z}))_{O(n)}^{w}$ .

Proof.

The proof is identical with the proof of Theorem 3.2 by using Theorem 4.3. ∎

By using the results proven in Subsection 4.2, we can demonstrate the language recognition power of weakly linear-time bounded $\mathbf{H}$ -automata.

{thrm}

$\mathfrak{L}(\textbf{H})_{O(n)}^{w}\subsetneq\mathfrak{L}(SL(3,\mathbb{Z}))_{O(n)}^{w}$ .

Proof.

$\mathfrak{L}(\mathbf{H})^{w}_{O(n)}\subseteq\mathfrak{L}(SL(3,\mathbb{Z}))_{O(n)}^{w}$ since $\mathbf{H}$ is a subgroup of $SL(3,\mathbb{Z})$ . Since the Heisenberg group has polynomial growth function [16], there exists a context-free language which can not be recognized by any H-automaton in polynomial time by Theorem 4.2. Since $\mathsf{CF}=\mathfrak{L}(SL(2,\mathbb{Z}))_{O(n)}^{w}$ by Theorem 4.3, the result follows. ∎

{thrm}

i. For $k\geq 5$ , $\mathfrak{L}(\mathbf{H})^{w}_{O(n)}$ and $\mathfrak{L}(\mathbb{Z}^{k})^{w}_{O(n)}$ are incomparable.
ii. $\mathfrak{L}(\mathbf{H})^{w}_{O(n)}$ and $\mathsf{CF}$ are incomparable.

Proof.

i. In [19], a weakly linear-time bounded $\mathbf{H}$ -automaton which recognizes the language $\mathtt{MULT}=\{x^{p}y^{q}z^{pq}|p,q\geq 0\}$ is constructed. The language $\mathtt{MULT}$ can not be recognized by any $\mathbb{Z}^{k}$ -automaton, since any bounded language in $\mathfrak{L}(\mathbb{Q}^{+})$ is semilinear by [13].

In [10], it is implicitly proven there exists a uniformly $n$ -dissimilar set of size $\Theta(n^{k})$ for the language $\mathtt{L}_{k}=\{0^{a_{1}}10^{a_{2}}1\dots 0^{a_{k}}10^{a_{1}}10^{a_{2}}1\dots 0^{a_{k}}1\}$ for all integers $k$ . For $k=5$ , there exists a uniformly $n$ -dissimilar set of size $\Theta(n^{5})$ for the language $\mathtt{L}_{5}$ and $U_{\mathtt{L}_{5}}(n)\geq n^{5}$ . Since $g_{\mathbf{H}}(n)$ is a polynomial of order 4 [16] and $t(n)=O(n)$ , $g_{\mathbf{H}}(t(n))\in o(U_{\mathtt{L}_{5}}(n))$ . By Theorem 4.2, we conclude the result.

$ii.$ The language $\mathtt{MULT}=\{x^{p}y^{q}z^{pq}|p,q\geq 0\}$ is not a context-free language. Since $\mathbf{H}$ has a polynomial growth function [16], there exists a context-free language which can not be recognized by any $\mathbf{H}$ -automaton in polynomial-time by Theorem 4.2.

∎

Let us note that $\mathtt{L}_{5}$ can be recognized by a $\mathbb{Z}^{5}$ -automaton in real time. The existence of the languages $\mathtt{L}_{k}$ can be used to prove the linear-time nondeterministic counter hierarchy, with the help of Theorem 4.2.

{thrm}

$\mathfrak{L}(\mathbb{Z}^{k})^{w}_{O(n)}\subsetneq\mathfrak{L}(\mathbb{Z}^{k+1})^{w}_{O(n)}$ for $k\geq 1$ .

Proof.

The language $\mathtt{L}_{k+1}=\{0^{a_{1}}10^{a_{2}}1\dots 0^{a_{k+1}}10^{a_{1}}10^{a_{2}}1\dots 0^{a_{k+1}}1\}$ can be recognized by a $\mathbb{Z}^{k+1}$ -automaton in real time. While scanning the first $k+1$ segments of $0$ ’s, the $i$ ’th counter is increased for each scanned $0$ as $0^{a_{i}}$ is read. In the remainder of the computation, the $i$ ’th counter is decreased for each scanned $0$ when $0^{a_{i}}$ is read.

There exists a uniformly $n$ -dissimilar set of size $\Theta(n^{k+1})$ for the language $\mathtt{L}_{k+1}$ , so $U_{\mathtt{L}_{k+1}}(n)\geq n^{k+1}$ . Since $t(n)=O(n)$ and $g_{\mathbb{Z}^{k}}(n)=n^{k}$ [12], $g_{\mathbb{Z}^{k}}(t(n))\in o(U_{\mathtt{L}_{5}}(n))$ . We conclude by Theorem 4.2. ∎

A celebrated result of the field of computational complexity, the nondeterministic time hierarchy theorem, will enable us to demonstrate that the computational power $\mathbf{F}_{2}\times\mathbf{F}_{2}$ -automata is dependent on the time allotted for their execution.

Fact \thethrm.

[20] If $g(n)$ is a time-constructible function, and $f(n+1)=o(g(n))$ , then there exists a language which cannot be recognized by any nondeterministic Turing machine in time $f(n)$ , but can be recognized by a nondeterministic Turing machine in time $g(n)$ .

Assume that any recursively enumerable language can be recognized by some linear-time $\mathbf{F}_{2}\times\mathbf{F}_{2}$ -automaton. One can easily build a nondeterministic Turing machine that simulates such a $\mathbf{F}_{2}\times\mathbf{F}_{2}$ -automaton with only a polynomial slowdown. But this would mean that any recursively enumerable language can be recognized by some nondeterministic TM in polynomial time, contradicting Fact 4.3, which implies that there exist languages which can only be recognized by nondeterministic Turing machines which run in at least exponential time. We have proven the following theorem.

{thrm}

$\mathfrak{L}(\mathbf{F}_{2}\times\mathbf{F}_{2})_{O(n)}^{w}\subsetneq\mathsf{RE}$ .

Using the ability of Turing machines to simulate any finite automaton over a computable matrix group, the statement of the above theorem can be extended as follows.

{thrm}

$\mathfrak{L}(G)_{O(n)}^{w}\subsetneq\mathsf{RE}$ for any matrix group $G$ whose matrix entries are computable numbers.

Proof.

In Theorem 3.3, we have mentioned that Turing machines can simulate any finite automaton over a computable matrix group. By the nondeterministic time hierarchy theorem, it can be shown that there exist some languages which can not be recognized by any finite automata over matrix groups in linear time. ∎

{thrm}

$\mathfrak{L}(\mathbf{F}_{2})_{O(n)}^{w}\subsetneq\mathfrak{L}(\mathbf{F}_{2}\times\mathbf{F}_{2})_{O(n)}^{w}$ .

Proof.

It is obvious that an $\mathbf{F}_{2}$ -automaton can be simulated by an $\mathbf{F}_{2}\times\mathbf{F}_{2}$ -automaton. $\mathfrak{L}(\mathbf{F}_{2})_{O(n)}^{w}=\mathsf{CF}$ by Theorem 4.3. The inclusion is proper since the non-context-free language $\mathtt{L}=\{a^{n}b^{n}c^{n}|n\geq 0\}$ can be recognized by an $\mathbf{F}_{2}\times\mathbf{F}_{2}$ -automaton in real time by using the two registers as two counters. ∎

In the rest of the section, the linear-time counterparts of the relationships in Figure 3 will be stated.

{thrm}

i.

$\mathfrak{L}(\mathbb{Q}^{+})^{w}_{O(n)}\subsetneq\mathfrak{L}(SL(2,\mathbb{Q}))^{w}_{O(n)}$ .
ii.

$\mathfrak{L}(\mathbb{Z})^{w}_{O(n)}\subsetneq\mathfrak{L}(BS(1,2))^{w}_{O(n)}\nsubseteq\mathsf{CF}$ .
iii.

$\mathfrak{L}(SL(2,\mathbb{Z}))^{w}_{O(n)}\subsetneq\mathfrak{L}(SL(3,\mathbb{Z}))^{w}_{O(n)}$ .
iv.

$\mathfrak{L}(\mathbb{Z}^{2})^{w}_{O(n)}\subsetneq\mathfrak{L}(\mathbf{H})^{w}_{O(n)}$ .
v.

$\mathsf{CF}$ and $\mathfrak{L}(\mathbb{Z}^{k})^{w}_{O(n)}$ are incomparable for all $k\geq 2$ .
vi.

$\mathfrak{L}(SL(3,\mathbb{Z}))^{w}_{O(n)}=\mathfrak{L}(GL(3,\mathbb{Z}))^{w}_{O(n)}$ .
vii.

$\mathsf{REG}=\mathfrak{L}(\mathbf{F}_{0})^{w}_{O(n)}\subsetneq\mathfrak{L}(\mathbf{F}_{1})^{w}_{O(n)}=\mathfrak{L}(\mathbb{Z})^{w}_{O(n)}\subsetneq\mathfrak{L}(\mathbf{F}_{2})^{w}_{O(n)}$ .

Proof.

$(i.,ii.,iii.,iv.)$ Analogous results where no time bound was imposed on the machines were proven in Theorems 3.2, 3.2, 3.2, and 3.2, respectively. The group automata recognizing the witness languages $\mathtt{L}=\{a^{2^{2n+1}}|n\geq 0\}$ , $\mathtt{UPOW}=\{a^{2^{n}}|n\geq 0\}$ and $\mathtt{MULT}=\{x^{p}y^{q}z^{pq}|p,q\geq 0\}$ operate in weakly linear time in all cases.

v. The equivalent result for the general case is given in Fact 3.1. The non-context-free language $\mathtt{L}^{\prime}=\{a^{n}b^{n}c^{n}|n\geq 0\}$ can be recognized by a $\mathbb{Z}^{2}$ -automaton in real time.

vi. The equivalent result for the general case is given in Theorem 3.2. The result follows by Theorem 4.3.

vii. The equivalent result for the general case is given in Fact 3.1. $\mathbf{F}_{0}$ is the trivial group, and any regular language can be recognized by a deterministic finite automaton, which can be seen as finite automaton over $\mathbf{F}_{0}$ , in real time. Since $\mathbf{F}_{1}$ is isomorophic to $\mathbb{Z}$ , the equality is obvious. Since the nonregular language $\mathtt{L}=\{a^{n}b^{n}|n\geq 0\}$ can be recognized by a $\mathbb{Z}$ -automaton in real time, the proper inclusion follows. Lastly, since $\mathfrak{L}(\mathbf{F}_{2})^{w}_{O(n)}$ is equivalent to $\mathsf{CF}$ by Theorem 4.3, the last proper inclusion is still valid. ∎

The results are summarized in Figure 4.

5. Open questions

Does there exist an $SL(3,\mathbb{Z})$ -automaton recognizing $W(\mathbb{Z}^{3})$ ? ¹¹1Corollary 2 of [3] states that the word problem of a finitely generated Abelian group $H$ is recognized by a $G$ -automaton if and only if $H$ has a finite index subgroup isomorphic to a subgroup of $G$ . That corollary could be used to give an affirmative answer to this open question. Unfortunately, the corollary is wrong: Let $H$ be an Abelian group and let $G=\mathbf{F}_{2}\times\mathbf{F}_{2}$ . $\mathfrak{L}(\mathbf{F}_{2}\times\mathbf{F}_{2})$ contains the word problem of any finitely generated Abelian group. Since $\mathbf{F}_{2}\times\mathbf{F}_{2}$ is finitely generated, any finite index subgroup of $\mathbf{F}_{2}\times\mathbf{F}_{2}$ is also finitely generated. Any finite index subgroup of $\mathbf{F}_{2}\times\mathbf{F}_{2}$ is either free or has a subgroup of finite index that is a direct product of free groups [1]. Any subgroup of an Abelian group is again Abelian. Hence, it is not possible that $G$ has a finite index subgroup isomorphic to a subgroup of $H$ .

Can we prove a stronger version of Theorem 4.2, which is independent of the time component? For instance, for the case of $\mathbf{F}_{2}$ , is it true that $W(\mathbf{F}_{2})\notin\mathfrak{L}(\mathbf{H})$ in general?

Can we describe the necessary properties of a group $G$ so that $\mathfrak{L}(G)$ contains $W(\textbf{F}_{2})$ ?

Little is known about $BS(1,2)$ -automata. Does $\mathfrak{L}(BS(1,2))$ contain every context-free language?

Which, if any, of the subset relationships in Figure 3 are proper inclusions?

Can we add other classes above RE in Figure 3 by examining groups on matrices with uncomputable entries?

Theorem 4.2 uses the definition of uniform $n$ -dissimilarity requiring that $g_{G}(t(n))$ $\in o(U_{\mathtt{L}}(n))$ . Would the theorem be still true if we replace $U_{\mathtt{L}}(n)$ by $A_{\mathtt{L}}(n)$ ? The gap between $U_{\mathtt{L}}(n)$ and $A_{\mathtt{L}}(n)$ might be large as mentioned in [10]. Consider the language $\mathtt{L}=\{a^{i}b^{j}|i\neq j\}$ . It is stated in [10] that a set of uniformly $n$ -dissimilar strings for $\mathtt{L}$ can not contain more than two strings. However, $A_{\mathtt{L}}(n)\notin O(1)$ , since $\mathtt{L}$ is not a regular language.

Can real-time $\mathbf{F}_{2}$ -automata recognize every context-free language?

References

[1] Gilbert Baumslag and James E Roseblade. Subgroups of direct products of free groups. Journal of the London Mathematical Society, 2(1):44–52, 1984.
[2] Nathanial Patrick Brown and Narutaka Ozawa. C*-algebras and finite-dimensional approximations, volume 88. American Mathematical Soc., 2008.
[3] Sean Cleary, Murray Elder, and Gretchen Ostheimer. The word problem distinguishes counter languages. arXiv preprint math/0606415, 2006.
[4] Jon M. Corson. Extended finite automata and word problems. International Journal of Algebra and Computation, 15(03):455–466, 2005.
[5] Jürgen Dassow and Victor Mitrana. Finite automata over free groups. International Journal of Algebra and Computation, 10(06):725–737, 2000.
[6] Murray Elder, Mark Kambites, and Gretchen Ostheimer. On groups and counter automata. International Journal of Algebra and Computation, 18(08):1345–1364, 2008.
[7] Gillian Z. Elston and Gretchen Ostheimer. On groups whose word problem is solved by a counter automaton. Theoretical Computer Science, 320(2–3):175 – 185, 2004.
[8] Patrick C. Fischer, Albert R. Meyer, and Arnold L. Rosenberg. Real time counter machines. In Proceedings of the 8th Annual Symposium on Switching and Automata Theory (SWAT 1967), FOCS ’67, pages 148–154, 1967.
[9] J.B. Fraleigh and V.J. Katz. A first course in abstract algebra. Addison-Wesley world student series. Addison-Wesley, 2003.
[10] Ian Glaister and Jeffrey Shallit. Automaticity iii: Polynomial automaticity and context-free languages. Computational Complexity, 7(4):371–387, 1998.
[11] S. A. Greibach. Remarks on blind and partially blind one-way multicounter machines. Theoretical Computer Science, 7:311–324, 1978.
[12] Rostislav I Grigorchuk. On growth in group theory. In Proceedings of the International Congress of Mathematicians, volume 1, pages 325–338, 1990.
[13] Oscar H. Ibarra, Sartaj K. Sahni, and Chul E. Kim. Finite automata with multiplication. Theoretical Computer Science, 2(3):271 – 294, 1976.
[14] Mark Kambites. Formal languages and groups as memory. Communications in Algebra, 37(1):193–208, 2009.
[15] Mikhail I. Kargapolov and Ju. I. Merzljakov. Fundamentals of the Theory of Groups. Springer-Verlag, 1979.
[16] Pierre De La Harpe. Topics in geometric group theory. The University Of Chicago Press, Chicago, 2000.
[17] Roger C. Lyndon and Paul E. Schupp. Combinatorial Group Theory. Springer-Verlag, 1977.
[18] Victor Mitrana and Ralf Stiebe. The accepting power of finite automata over groups. In New Trends in Formal Languages, pages 39–48. Springer-Verlag, 1997.
[19] Elaine Render. Rational monoid and semigroup automata. PhD thesis, University of Manchester, 2010.
[20] Stanisław Żak. A Turing machine time hierarchy. Theoretical Computer Science, 26(3):327 – 333, 1983.
[21] Georg Zetzsche. Silent transitions in automata with storage. In International Colloquium on Automata, Languages, and Programming, pages 434–445. Springer Berlin Heidelberg, 2013.

LANGUAGE CLASSES ASSOCIATED WITH AUTOMATA OVER MATRIX GROUPS

Abstract.

Key words and phrases:

1991 Mathematics Subject Classification:

1. Introduction

2. Preliminaries

2.1. Notation and terminology

2.2. Group automata

3. Matrix groups and associated language classes

3.1. Basic results

Fact \thethrm.

Fact \thethrm.

Fact \thethrm.

Fact \thethrm.

Fact \thethrm.

Proof.

3.2. Automata on groups of 2×22\times 2 and 3×33\times 3 matrices

Fact \thethrm.

Proof.

Proof.

Proof.

Proof.

Proof.

Proof.

Proof.

3.3. Automata on matrices of higher dimensions

Fact \thethrm.

Proof.

Proof.

4. Time complexity

4.1. Definitions

4.2. Limitations of machines on slow groups running in short time

Proof.

Proof.

Proof.

Proof.

4.3. Group automata under linear time bounds

Proof.

Proof.

Proof.

Proof.

Proof.

Proof.

Fact \thethrm.

Proof.

Proof.

Proof.

5. Open questions

References

3.2. Automata on groups of $2\times 2$ and $3\times 3$ matrices