Two-Dimensional Pattern Languages^†^†thanks: This document is a full version (i. e., it contains all proofs) of the conference paper [3].

Henning Fernau Fachbereich 4 – Abteilung Informatikwissenschaften, Universität Trier, 54286 Trier, Germany, {fernau,mschmid}@uni-trier.de Markus L. Schmid Fachbereich 4 – Abteilung Informatikwissenschaften, Universität Trier, 54286 Trier, Germany, {fernau,mschmid}@uni-trier.de K. G. Subramanian School of Computer Sciences, Universiti Sains Malaysia, 11800 Penang, Malaysia, kgsmani1948@yahoo.com

Abstract

We introduce several classes of array languages obtained by generalising Angluin’s pattern languages to the two-dimensional case. These classes of two-dimensional pattern languages are compared with respect to their expressive power and their closure properties are investigated.

1 Introduction

Several methods of generation of two-dimensional languages (also called array languages or picture languages) have been proposed in the literature, extending the techniques and results of formal string language theory. A picture is considered as a rectangular array of terminal symbols in the two-dimensional plane. Models based on grammars or automata as well as those based on theoretical properties of the string languages are well-known and have been extensively investigated. We refer the interested readers to books and surveys like the ones by Rosenfeld [12], Wang [15], Rosenfeld and Siromoney [13], Giammarresi and Restivo [7], or Morita [11]. For example, regular string languages (also known as recognizable string languages) can be characterized in terms of local languages and projections. Based on a similar idea, the class REC of recognizable picture languages (see Giammarresi and Restivo [6]) was proposed as a two dimensional counterpart of regular string languages. In this work, we attempt to generalise a class of string languages to the two-dimensional case, which also provides several desirable features and has therefore attracted considerable interest over the last three decades in the formal language theory community as well as in the learning theory community: Angluin’s pattern languages (see [1]).

In this context, a pattern is a string over an alphabet $\{x_{1},x_{2},x_{3},\ldots\}$ of variables, e. g., $\alpha:=x_{1}\,x_{1}\,x_{2}\,x_{2}\,x_{1}$ . For some finite alphabet $\Sigma$ of terminal symbols, the pattern language described by $\alpha$ (with respect to $\Sigma$ ) is the set of all words over $\Sigma$ that can be derived from $\alpha$ by uniformly substituting the variables in $\alpha$ by (non-empty) terminal words. For example, if $\Sigma:=\{\mathtt{a},\mathtt{b},\mathtt{c}\}$ , then $u:=\mathtt{b}\mathtt{c}\mathtt{b}\mathtt{b}\mathtt{c}\mathtt{b}\mathtt{c}\mathtt{c}\mathtt{a}\mathtt{c}\mathtt{c}\mathtt{a}\mathtt{b}\mathtt{c}\mathtt{b}$ and $v:=\mathtt{a}\mathtt{b}\mathtt{a}\mathtt{b}\mathtt{a}\mathtt{b}\mathtt{a}\mathtt{a}\mathtt{b}\mathtt{a}\mathtt{a}\mathtt{b}$ are words of the pattern language given by $\alpha$ , since replacing $x_{1}$ by $\mathtt{b}\mathtt{c}\mathtt{b}$ and $x_{2}$ by $\mathtt{c}\mathtt{c}\mathtt{a}$ turns $\alpha$ into $u$ and replacing $x_{1}$ by $\mathtt{a}\mathtt{b}$ and $x_{2}$ by $\mathtt{a}\mathtt{b}\mathtt{a}$ turns $\alpha$ into $v$ . On the other hand, the word $\mathtt{c}\mathtt{a}\mathtt{b}\mathtt{a}\mathtt{c}\mathtt{a}\mathtt{b}\mathtt{a}\mathtt{b}\mathtt{a}\mathtt{b}\mathtt{a}$ is not a member of the pattern language of $\alpha$ .

One of the most notable features of pattern languages is that they have natural and compact human readable descriptors (or generators), namely the patterns. In particular, this advantage becomes evident when patterns are compared to other language descriptors as, e. g., grammars or automata, which are usually quite involved even though the language they describe is rather simple. Nevertheless, patterns can compete with common automata models and grammars in terms of expressive power and their practical relevance is demonstrated by the widespread use of so-called extended regular expressions with backreferences, which implicitly use the concept of patterns and are capable of defining all pattern languages.¹¹1In fact, these extended regular expressions with backreferences are nowadays a standard element of most text editors and programming languages (cf. Friedl [5]).

The main goal of this paper is to generalise the concept of patterns as language descriptors to the two-dimensional case, while preserving the desirable features of (one-dimensional) pattern languages, i. e., the simplicity and compactness of their descriptors. The work done so far on two-dimensional languages demonstrates that there are difficulties that seem to be symptomatic for the task of generalising a class of string languages to the two-dimensional case. Firstly, such a generalisation is usually accompanied with a substantial increase in complexity of the descriptors (e. g., when extending context-free or contextual grammars to the two-dimensional case (see Fernau et al. [2], Freund et al. [4])) and, secondly, there are often many competing and seemingly different ways to generalise a specific class of string languages, which all can be considered natural (e. g., it is still on debate what the appropriate two-dimensional counterpart of the class of regular languages might be (see Giammarresi et al. [8], Matz [10])). Our two-dimensional patterns, to be introduced in this work, are as simple and compact as their one-dimensional counterparts. Although there are several different possibilities of how these two-dimensional patterns can describe two-dimensional languages, one of these sticks out as the intuitively most natural one. Hence, the model of Angluin’s pattern languages seems to be comparatively two-dimensional friendly.

Besides the conceptional contribution of this paper, we present a comparison between the expressive power of different classes of two-dimensional pattern languages and an investigation of their closure properties. We conclude the paper by outlining further research questions and possible extensions to the model of two-dimensional pattern languages.

2 Preliminaries

In this section, we briefly recall the standard definitions and notations regarding one- and two-dimensional words and languages.

Let $\mathbb{N}:=\{1,2,3,\ldots\}$ and let $\mathbb{N}_{0}:=\mathbb{N}\cup\{0\}$ . For a finite alphabet $\Sigma$ , a string or word (over $\Sigma$ ) is a finite sequence of symbols from $\Sigma$ , and $\varepsilon$ stands for the empty string. The notation $\Sigma^{+}$ denotes the set of all nonempty strings over $\Sigma$ , and $\Sigma^{*}:=\Sigma^{+}\cup\{\varepsilon\}$ . For the concatenation of two strings $w_{1},w_{2}$ we write $w_{1}\cdot w_{2}$ or simply $w_{1}w_{2}$ . We say that a string $v\in\Sigma^{*}$ is a factor of a string $w\in\Sigma^{*}$ if there are $u_{1},u_{2}\in\Sigma^{*}$ such that $w=u_{1}\cdot v\cdot u_{2}$ . If $u_{1}$ or $u_{2}$ is the empty string, then $v$ is a prefix (or a suffix, respectively) of $w$ . The notation $|w|$ stands for the length of a string $w$ .

A two-dimensional word (or array) over $\Sigma$ is a tuple

W:=((a_{1,1},a_{1,2},\ldots,a_{1,n}),(a_{2,1},a_{2,2},\ldots,a_{2,n}),\ldots,(a_{m,1},a_{m,2},\ldots,a_{m,n}))\,,

where $m,n\in\mathbb{N}$ and, for every $i$ , $1\leq i\leq m$ , and $j$ , $1\leq j\leq n$ , $a_{i,j}\in\Sigma$ . We define the number of columns (or width) and number of rows (or height) of $W$ by $|W|_{c}:=n$ and $|W|_{r}:=m$ , respectively. The empty array is denoted by $\lambda$ , i. e., $|\lambda|_{c}=|\lambda|_{r}=0$ . For the sake of convenience, we also denote $W$ by $[a_{i,j}]_{m,n}$ or by a matrix of one of the following forms:

\begin{smallmatrix}a_{1,1}&a_{1,2}&\ldots&a_{1,n}\\ a_{2,1}&a_{2,2}&\ldots&a_{2,n}\\ \vdots&\vdots&\ddots&\vdots\\ a_{m,1}&a_{m,2}&\ldots&a_{m,n}\end{smallmatrix},\>\>\>\>\>\>\begin{bmatrix}a_{1,1}&a_{1,2}&\ldots&a_{1,n}\\ a_{2,1}&a_{2,2}&\ldots&a_{2,n}\\ \vdots&\vdots&\ddots&\vdots\\ a_{m,1}&a_{m,2}&\ldots&a_{m,n}\end{bmatrix}\,.

If we want to refer to the $j^{\text{th}}$ symbol in row $i$ of the array $W$ , then we use $W[i,j]=a_{i,j}$ . By $\Sigma^{++}$ , we denote the set of all nonempty arrays over $\Sigma$ , and $\Sigma^{**}:=\Sigma^{++}\cup\{\lambda\}$ . Every subset $L\subseteq\Sigma^{**}$ is an array language.

Let $W:=[a_{i,j}]_{m,n}$ and $W^{\prime}:=[a^{\prime}_{i,j}]_{m^{\prime},n^{\prime}}$ be two non-empty arrays over $\Sigma$ . The column concatenation of $W$ and $W^{\prime}$ , denoted by $W\varobar W^{\prime}$ , is undefined if $m\neq m^{\prime}$ and is the array

\begin{smallmatrix}a_{1,1}&a_{1,2}&\ldots&a_{1,n}&b_{1,1}&b_{1,2}&\ldots&b_{1,n^{\prime}}\\ a_{2,1}&a_{2,2}&\ldots&a_{2,n}&b_{2,1}&b_{2,2}&\ldots&b_{2,n^{\prime}}\\ \vdots&\vdots&\ddots&\vdots&\vdots&\vdots&\ddots&\vdots\\ a_{m,1}&a_{m,2}&\ldots&a_{m,n}&b_{m^{\prime},1}&b_{m^{\prime},2}&\ldots&b_{m^{\prime},n^{\prime}}\end{smallmatrix}

otherwise. The row concatenation of $W$ and $W^{\prime}$ , denoted by $W\varominus W^{\prime}$ , is undefined if $n\neq n^{\prime}$ and is the array

\begin{smallmatrix}a_{1,1}&a_{1,2}&\ldots&a_{1,n}\\ a_{2,1}&a_{2,2}&\ldots&a_{2,n}\\ \vdots&\vdots&\ddots&\vdots\\ a_{m,1}&a_{m,2}&\ldots&a_{m,n}\\ b_{1,1}&b_{1,2}&\ldots&b_{1,n^{\prime}}\\ b_{2,1}&b_{2,2}&\ldots&b_{2,n^{\prime}}\\ \vdots&\vdots&\ddots&\vdots\\ b_{m^{\prime},1}&b_{m^{\prime},2}&\ldots&b_{m^{\prime},n^{\prime}}\end{smallmatrix}

otherwise. Intuitively speaking, the vertical line and the horizontal line in the symbols $\varobar$ and $\varominus$ , respectively, indicate the edge where the arrays are concatenated. In order to denote that, e. g., $U\varominus V$ is undefined, we also write $U\varominus V=\operatorname{\texttt{undef}}$ .

Furthermore, for every array $U$ , $U\varobar\lambda=\lambda\varobar U=U\varominus\lambda=\lambda\varominus U=U$ and $U\varobar\operatorname{\texttt{undef}}=\operatorname{\texttt{undef}}\varobar U=U\varominus\operatorname{\texttt{undef}}=\operatorname{\texttt{undef}}\varominus U=\operatorname{\texttt{undef}}$ . Algebraically speaking, if $\Sigma^{**}_{\operatorname{\texttt{undef}}}:=\Sigma^{**}\cup\{\operatorname{\texttt{undef}}\}$ , then $(\Sigma^{**}_{\operatorname{\texttt{undef}}},\varobar,\lambda)$ and $(\Sigma^{**}_{\operatorname{\texttt{undef}}},\varominus,\lambda)$ both form monoids with $\operatorname{\texttt{undef}}$ as an absorbing element.

Example 1.

Let

W_{1}:=\begin{bmatrix}\mathtt{a}&\mathtt{b}&\mathtt{a}\\ \mathtt{b}&\mathtt{c}&\mathtt{a}\\ \mathtt{a}&\mathtt{b}&\mathtt{b}\\ \end{bmatrix},W_{2}:=\begin{bmatrix}\mathtt{b}&\mathtt{c}\\ \mathtt{b}&\mathtt{a}\\ \mathtt{c}&\mathtt{a}\end{bmatrix},W_{3}:=\begin{bmatrix}\mathtt{a}&\mathtt{b}&\mathtt{c}\\ \mathtt{c}&\mathtt{b}&\mathtt{b}\end{bmatrix}\text{ and }W_{4}:=\begin{bmatrix}\mathtt{a}&\mathtt{a}\\ \mathtt{a}&\mathtt{b}\end{bmatrix}\,.

Then $W_{1}\varominus W_{2}$ and $W_{1}\varobar W_{3}=\operatorname{\texttt{undef}}$ , but

W_{1}\varobar W_{2}=\begin{bmatrix}\mathtt{a}&\mathtt{b}&\mathtt{a}&\mathtt{b}&\mathtt{c}\\ \mathtt{b}&\mathtt{c}&\mathtt{a}&\mathtt{b}&\mathtt{a}\\ \mathtt{a}&\mathtt{b}&\mathtt{b}&\mathtt{c}&\mathtt{a}\end{bmatrix}\text{ and }W_{1}\varominus W_{3}=\begin{bmatrix}\mathtt{a}&\mathtt{b}&\mathtt{a}\\ \mathtt{b}&\mathtt{c}&\mathtt{a}\\ \mathtt{a}&\mathtt{b}&\mathtt{b}\\ \mathtt{a}&\mathtt{b}&\mathtt{c}\\ \mathtt{c}&\mathtt{b}&\mathtt{b}\end{bmatrix}\,.

Example 2.

Let $W_{1},W_{2},W_{3}$ and $W_{4}$ be defined as in Example 1.

	$\displaystyle(W_{1}\varobar W_{2})\varominus(W_{3}\varobar W_{4})=(W_{1}\varominus W_{3})\varobar(W_{2}\varominus W_{4})=\begin{bmatrix}\mathtt{a}&\mathtt{b}&\mathtt{a}&\mathtt{b}&\mathtt{c}\\ \mathtt{b}&\mathtt{c}&\mathtt{a}&\mathtt{b}&\mathtt{a}\\ \mathtt{a}&\mathtt{b}&\mathtt{b}&\mathtt{c}&\mathtt{a}\\ \mathtt{a}&\mathtt{b}&\mathtt{c}&\mathtt{a}&\mathtt{a}\\ \mathtt{c}&\mathtt{b}&\mathtt{b}&\mathtt{a}&\mathtt{b}\end{bmatrix}\,,$
	$\displaystyle(W_{1}\varobar W_{2})\varominus(W_{4}\varobar W_{3})=\begin{bmatrix}\mathtt{a}&\mathtt{b}&\mathtt{a}&\mathtt{b}&\mathtt{c}\\ \mathtt{b}&\mathtt{c}&\mathtt{a}&\mathtt{b}&\mathtt{a}\\ \mathtt{a}&\mathtt{b}&\mathtt{b}&\mathtt{c}&\mathtt{a}\\ \mathtt{a}&\mathtt{a}&\mathtt{a}&\mathtt{b}&\mathtt{c}\\ \mathtt{a}&\mathtt{b}&\mathtt{c}&\mathtt{b}&\mathtt{b}\end{bmatrix}\,,\>\>(W_{1}\varominus W_{4})\varobar(W_{2}\varominus W_{3})=\operatorname{\texttt{undef}}\,.$

Next, we define some operations for array languages. The row and column concatenation for array languages $L_{1}$ and $L_{2}$ is defined by $L_{1}\varominus L_{2}:=\{U\varominus V\mid U\in L_{1},V\in L_{2},U\varominus V\neq\operatorname{\texttt{undef}}\}$ and $L_{1}\varobar L_{2}:=\{U\varobar V\mid U\in L_{1},V\in L_{2},U\varobar V\neq\operatorname{\texttt{undef}}\}$ , respectively. For an array language $L$ and $k\in\mathbb{N}$ , $L^{\varominus k}$ denotes the $k$ -fold row concatenation of $L$ , i. e., $L^{\varominus k}:=L_{1}\varominus L_{2}\varominus\ldots\varominus L_{k}$ , $L_{i}=L$ , $1\leq i\leq k$ . The $k$ -fold column concatenation, denoted by $L^{\varobar k}$ , is defined analogously. The row and column concatenation closure of an array language $L$ is defined by $L^{\varominus*}:=\bigcup_{k=1}^{\infty}L^{\varominus k}$ and $L^{\varobar*}:=\bigcup_{k=1}^{\infty}L^{\varobar k}$ , respectively. Obviously, the row and column concatenation closure of an array language correspond to the Kleene closure of a string language.

Now, we turn our attention to some geometric operations for arrays. The transposition of an array $U$ , denoted by ${U}^{\texttt{T}}$ , is obtained by reflecting $U$ along the main diagonal. The $\varominus$ -reflection and $\varobar$ -reflection of $U$ , denoted by $U^{\varominus\texttt{R}}$ and $U^{\varobar\texttt{R}}$ , respectively, are obtained by reflecting $U$ along the horizontal and vertical axis, respectively. The right turn and left turn of $U$ , denoted by $U^{\curvearrowright}$ and $U^{\curvearrowleft}$ , respectively, is obtained by turning $U$ through $90$ degrees to the right and to the left, respectively. For example, if $U:=\begin{bmatrix}\mathtt{a}&\mathtt{b}&\mathtt{c}&\mathtt{d}\\ \mathtt{e}&\mathtt{f}&\mathtt{g}&\mathtt{h}\end{bmatrix}$ , then

\displaystyle{U}^{\texttt{T}}=\begin{bmatrix}\mathtt{a}&\mathtt{e}\\ \mathtt{b}&\mathtt{f}\\ \mathtt{c}&\mathtt{g}\\ \mathtt{d}&\mathtt{h}\\ \end{bmatrix}\,,U^{\varominus\texttt{R}}=\begin{bmatrix}\mathtt{e}&\mathtt{f}&\mathtt{g}&\mathtt{h}\\ \mathtt{a}&\mathtt{b}&\mathtt{c}&\mathtt{d}\\ \end{bmatrix}\,,U^{\varobar\texttt{R}}=\begin{bmatrix}\mathtt{d}&\mathtt{c}&\mathtt{b}&\mathtt{a}\\ \mathtt{h}&\mathtt{g}&\mathtt{f}&\mathtt{e}\\ \end{bmatrix}\,,U^{\curvearrowright}=\begin{bmatrix}\mathtt{e}&\mathtt{a}\\ \mathtt{f}&\mathtt{b}\\ \mathtt{g}&\mathtt{c}\\ \mathtt{h}&\mathtt{d}\\ \end{bmatrix}\,,U^{\curvearrowleft}=\begin{bmatrix}\mathtt{d}&\mathtt{h}\\ \mathtt{c}&\mathtt{g}\\ \mathtt{b}&\mathtt{f}\\ \mathtt{a}&\mathtt{e}\\ \end{bmatrix}\,,(U^{\curvearrowright})^{\curvearrowright}=\begin{bmatrix}\mathtt{h}&\mathtt{g}&\mathtt{f}&\mathtt{e}\\ \mathtt{d}&\mathtt{c}&\mathtt{b}&\mathtt{a}\end{bmatrix}\,.

We address left and right turn also as quarter-turns below. Moreover, the twofold right turn (displayed right-most in the example above) is also known as a half-turn.

A special operation considered in the context of arrays is the conjugation of an array $U\in\Sigma^{**}$ with $|\Sigma|=2$ , denoted by $U^{\texttt{C}}$ , which means that the two symbols of $\Sigma$ are exchanged in $U$ , e. g., $\left(\begin{bmatrix}\mathtt{a}&\mathtt{b}&\mathtt{a}&\mathtt{a}\\ \mathtt{b}&\mathtt{a}&\mathtt{a}&\mathtt{b}\\ \end{bmatrix}\right)^{\texttt{C}}=\begin{bmatrix}\mathtt{b}&\mathtt{a}&\mathtt{b}&\mathtt{b}\\ \mathtt{a}&\mathtt{b}&\mathtt{b}&\mathtt{a}\\ \end{bmatrix}$ . This can be also viewed as a quite restricted form of a two-dimensional morphism defined in the next section.

All these operations for arrays are extended to array languages in the obvious way.

Next, we briefly summarise the concept of (one-dimensional) pattern languages as introduced in [1] by Angluin. Technically, the version of pattern languages used here are called nonerasing terminal-free pattern languages (for an overview of different versions of one-dimensional pattern languages, the reader is referred to [9] by Mateescu and Salomaa).

A (one-dimensional) pattern is a string over an alphabet $X:=\{x_{1},x_{2},x_{3},\ldots\}$ of variables, e. g., $\alpha:=x_{1}\,x_{1}\,x_{2}\,x_{2}\,x_{1}$ . In Section 1, we have seen an intuitive definition of the language described by a pattern $\alpha$ . This intuition can be formalised in an elegant way by using the concept of (word) morphisms, i. e., mappings $h:\Sigma_{1}^{+}\rightarrow\Sigma_{2}^{+}$ , which satisfy $h(u\,v)=h(u)\,h(v)$ , for all $u,v\in\Sigma_{1}^{+}$ . In this regard, for some finite alphabet $\Sigma$ , the (one-dimensional) pattern language of $\alpha$ (with respect to $\Sigma$ ) is the set $L^{\operatorname{\texttt{1D}}}_{\Sigma}(\alpha):=\{h(\alpha)\mid h:X^{+}\rightarrow\Sigma^{+}\text{ is a morphism}\}$ . An alternative, yet equivalent, way to define pattern languages is by means of factorisations. To this end, let $\alpha:=y_{1}\,y_{2}\ldots y_{n}$ , $y_{i}\in X$ , $1\leq i\leq n$ . Then $L^{\operatorname{\texttt{1D}}}_{\Sigma}(\alpha)$ is the set of all words $w\in\Sigma^{+}$ that have a characteristic factorisation for $\alpha$ , i. e., a factorisation $w=u_{1}\,u_{2}\cdots u_{n}$ , such that, for every $i$ , $1\leq i\leq j\leq n$ , $y_{i}=y_{j}$ implies $u_{i}=u_{j}$ . It can be easily seen, that these two definitions are equivalent. However, for the two-dimensional case, we shall see that a generalisation of these two approaches will lead to different versions of two-dimensional pattern languages. The class of all one-dimensional pattern languages over the alphabet $\Sigma$ is denoted by $\mathcal{L}^{\operatorname{\texttt{1D}}}_{\Sigma}$ . We recall the example pattern $\alpha=x_{1}\,x_{1}\,x_{2}\,x_{2}\,x_{1}$ and the words $u:=\mathtt{b}\mathtt{c}\mathtt{b}\mathtt{b}\mathtt{c}\mathtt{b}\mathtt{c}\mathtt{c}\mathtt{a}\mathtt{c}\mathtt{c}\mathtt{a}\mathtt{b}\mathtt{c}\mathtt{b}$ and $v:=\mathtt{a}\mathtt{b}\mathtt{a}\mathtt{b}\mathtt{a}\mathtt{b}\mathtt{a}\mathtt{a}\mathtt{b}\mathtt{a}\mathtt{a}\mathtt{b}$ of Section 1 Since $h(\alpha)=u$ and $g(\alpha)=v$ , where $h$ and $g$ are the morphisms induced by $h(x_{1}):=\mathtt{b}\mathtt{c}\mathtt{b}$ , $h(x_{2}):=\mathtt{c}\mathtt{c}\mathtt{a}$ and $g(x_{1}):=\mathtt{a}\mathtt{b}$ , $g(x_{2}):=\mathtt{a}\mathtt{b}\mathtt{a}$ , we can conclude that $u,v\in L^{\operatorname{\texttt{1D}}}_{\Sigma}(\alpha)$ , where $\Sigma:=\{\mathtt{a},\mathtt{b},\mathtt{c}\}$ .

3 Two-Dimensional Pattern Languages

As already mentioned, this work deals with the task of generalising pattern languages from the one-dimensional to the two-dimensional case. In order to motivate our approach to solve this task, we first spent some effort on illustrating the general difficulties and obstacles that arise.

Abstractly speaking, a pattern language for a given pattern $\alpha$ is the collection of all elements that satisfy $\alpha$ . Thus, a sound definition of how elements satisfy patterns directly entails a sound definition of a class of pattern languages. In the one-dimensional case, the situation that a word satisfies a pattern is intuitively clear and it can be defined in several equivalent ways, i. e., a word $w$ satisfies the pattern $\alpha$ if and only if

•

$w$ can be derived from $\alpha$ by uniformly substituting the variables in $\alpha$ ,
•

$w$ is a morphic image of $\alpha$ ,
•

$w$ has a characteristic factorisation for $\alpha$ .

We shall now demonstrate that for a two-dimensional pattern, i. e., a two-dimensional word over the set of variables $X$ , e. g., $\alpha:=\begin{bmatrix}x_{1}&x_{2}\\ x_{3}&x_{1}\end{bmatrix}$ , these concepts do not work anymore or they describe fundamentally different situations. For instance, the basic operation of substituting a single symbol in a word by another word cannot that easily be extended to the two-dimensional case. For example, the replacements $x_{1}\mapsto\begin{bmatrix}\mathtt{a}&\mathtt{a}\end{bmatrix},x_{2}\mapsto\begin{bmatrix}\mathtt{c}\\ \mathtt{c}\end{bmatrix}$ and $x_{3}\mapsto\begin{bmatrix}\mathtt{b}\end{bmatrix}$ may turn $\alpha$ into one of the following objects,

\begin{bmatrix}&&\mathtt{c}&\\ \mathtt{a}&\mathtt{a}&\mathtt{c}&\\ &\mathtt{b}&\mathtt{a}&\mathtt{a}\end{bmatrix},\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{c}\\ &&\mathtt{c}\\ \mathtt{b}&\mathtt{a}&\mathtt{a}\end{bmatrix},\begin{bmatrix}&&\mathtt{c}\\ \mathtt{a}&\mathtt{a}&\mathtt{c}\\ \mathtt{b}&\mathtt{a}&\mathtt{a}\end{bmatrix},\begin{bmatrix}&&&\mathtt{c}\\ \mathtt{a}&\mathtt{a}&&\mathtt{c}\\ &\mathtt{b}&\mathtt{a}&\mathtt{a}\end{bmatrix},

which are not two-dimensional words, since they all contain holes or are not of rectangular shape and, most importantly, are not uniquely defined. On the other hand, it is straightforward to generalise the concept of a morphism to the two-dimensional case:

Definition 1.

A mapping $h:\Sigma_{1}^{++}\rightarrow\Sigma_{2}^{++}$ is a two-dimensional morphism if it satisfies $h(V\varobar W)=h(V)\varobar h(W)$ and $h(V\varominus W)=h(V)\varominus h(W)$ for all $V,W\in\Sigma_{1}^{++}$ .

Hence, we may say that a two-dimensional word $W$ satisfies a two-dimensional pattern $\alpha$ if and only if there exists a two-dimensional morphism which maps $\alpha$ to $W$ . Unfortunately, this definition seems to be too strong as demonstrated by the following example. From an intuitive point of view, the two-dimensional word $W:=\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{b}&\mathtt{a}&\mathtt{a}&\mathtt{b}\\ \mathtt{a}&\mathtt{a}&\mathtt{b}&\mathtt{a}&\mathtt{a}&\mathtt{b}\\ \mathtt{c}&\mathtt{c}&\mathtt{c}&\mathtt{c}&\mathtt{c}&\mathtt{c}\end{bmatrix}$ should satisfy the two-dimensional pattern $\alpha:=\begin{bmatrix}x_{1}&x_{1}\\ x_{2}&x_{2}\end{bmatrix}$ , but there is no two-dimensional morphism mapping $\alpha$ to $W$ . This is due to the fact that, as pointed out by the following proposition (which has also been mentioned by Siromoney et al. in [14]), a two-dimensional morphism is a mapping with a surprisingly strong condition.

Proposition 1.

Let $\Sigma_{1}:=\{a_{1},a_{2},\ldots,a_{k}\}$ and $\Sigma_{2}$ be alphabets. If a mapping $h:\Sigma_{1}^{++}\rightarrow\Sigma_{2}^{++}$ is a two-dimensional morphism, then

|h(\begin{bmatrix}a_{1}\end{bmatrix})|_{c}=|h(\begin{bmatrix}a_{2}\end{bmatrix})|_{c}=\ldots=|h(\begin{bmatrix}a_{k}\end{bmatrix})|_{c}\text{ and }|h(\begin{bmatrix}a_{1}\end{bmatrix})|_{r}=|h(\begin{bmatrix}a_{2}\end{bmatrix})|_{r}=\ldots=|h(\begin{bmatrix}a_{k}\end{bmatrix})|_{r}\,.

Proof.

We prove the statement of the proposition by contraposition. To this end, we assume that, for some $i,j$ , $1\leq i<j\leq k$ , $|h(\begin{bmatrix}a_{i}\end{bmatrix})|_{r}\neq|h(\begin{bmatrix}a_{j}\end{bmatrix})|_{r}$ , which implies $h(\begin{bmatrix}a_{i}\end{bmatrix})\varobar h(\begin{bmatrix}a_{j}\end{bmatrix})=\operatorname{\texttt{undef}}$ . Hence, since $h(\begin{bmatrix}a_{i}\end{bmatrix}\varobar\begin{bmatrix}a_{j}\end{bmatrix})=h(\begin{bmatrix}a_{i}&a_{j}\end{bmatrix})\in\Sigma_{2}^{++}$ , we can conclude that $h(\begin{bmatrix}a_{i}\end{bmatrix}\varobar\begin{bmatrix}a_{j}\end{bmatrix})\neq h(\begin{bmatrix}a_{i}\end{bmatrix})\varobar h(\begin{bmatrix}a_{j}\end{bmatrix})$ , which contradicts the morphism property. Similarly, if $|h(\begin{bmatrix}a_{i}\end{bmatrix})|_{c}\neq|h(\begin{bmatrix}a_{j}\end{bmatrix})|_{c}$ , then $h(\begin{bmatrix}a_{i}\end{bmatrix}\varominus\begin{bmatrix}a_{j}\end{bmatrix})\neq h(\begin{bmatrix}a_{i}\end{bmatrix})\varominus h(\begin{bmatrix}a_{j}\end{bmatrix})$ . ∎

Similarly as in the string case, homomorphisms $h:\Sigma_{1}^{++}\rightarrow\Sigma_{2}^{++}$ are uniquely defined by giving the images $h(\Sigma_{1})$ . If in particular $h(\Sigma_{1})\subseteq\Sigma_{2}$ , we term the resulting morphism a letter-to-letter morphism, while in the even more restricted case when the restriction $h_{\Sigma_{1}}$ of $h$ to $\Sigma_{1}$ yields a surjective mapping $h_{\Sigma_{1}}:\Sigma_{1}\to\Sigma_{2}$ , $h$ is referred to as a projection. We can conclude that the existence of a two-dimensional morphism seems to be a reasonable sufficient criterion for the situation that a two-dimensional word satisfies a two-dimensional pattern, but not a necessary one.

In fact, it turns out that characteristic factorisations provide the most promising approach to formalise how a two-dimensional word satisfies a two-dimensional pattern. Recall the example pattern $\alpha=\begin{bmatrix}x_{1}&x_{1}\\ x_{2}&x_{2}\end{bmatrix}$ from above. Since $\alpha=(\begin{bmatrix}x_{1}\end{bmatrix}\varobar\begin{bmatrix}x_{1}\end{bmatrix})\varominus(\begin{bmatrix}x_{2}\end{bmatrix}\varobar\begin{bmatrix}x_{2}\end{bmatrix})$ , a characteristic factorisation of a two-dimensional word $U$ for $\alpha$ is a factorisation of the form $U=(V_{1}\varobar V_{1})\varominus(V_{2}\varobar V_{2})$ . On the other hand, since $\alpha=(\begin{bmatrix}x_{1}\end{bmatrix}\varominus\begin{bmatrix}x_{2}\end{bmatrix})\varobar(\begin{bmatrix}x_{1}\end{bmatrix}\varominus\begin{bmatrix}x_{2}\end{bmatrix})$ , we could as well regard a factorisation $U=(V_{1}\varominus V_{2})\varobar(V_{1}\varominus V_{2})$ as characteristic for $\alpha$ . For the sake of convenience, we say that the former factorisation is of column-row type and the latter one is of row-column type. Obviously, the two-dimensional word $W$ from above has a characteristic factorisation of column-row type and a characteristic factorisation of row-column type (with respect to $\alpha$ ):

	$\displaystyle W=(V_{1}\varobar V_{1})\varominus(V_{2}\varobar V_{2})=(\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{b}\\ \mathtt{a}&\mathtt{a}&\mathtt{b}\end{bmatrix}\varobar\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{b}\\ \mathtt{a}&\mathtt{a}&\mathtt{b}\end{bmatrix})\varominus(\begin{bmatrix}\mathtt{c}&\mathtt{c}&\mathtt{c}\end{bmatrix}\varobar\begin{bmatrix}\mathtt{c}&\mathtt{c}&\mathtt{c}\end{bmatrix})=\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{b}&\mathtt{a}&\mathtt{a}&\mathtt{b}\\ \mathtt{a}&\mathtt{a}&\mathtt{b}&\mathtt{a}&\mathtt{a}&\mathtt{b}\\ \mathtt{c}&\mathtt{c}&\mathtt{c}&\mathtt{c}&\mathtt{c}&\mathtt{c}\end{bmatrix}\,,$
	$\displaystyle W=(V_{1}\varominus V_{2})\varobar(V_{1}\varominus V_{2})=(\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{b}\\ \mathtt{a}&\mathtt{a}&\mathtt{b}\end{bmatrix}\varominus\begin{bmatrix}\mathtt{c}&\mathtt{c}&\mathtt{c}\end{bmatrix})\varobar(\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{b}\\ \mathtt{a}&\mathtt{a}&\mathtt{b}\end{bmatrix}\varominus\begin{bmatrix}\mathtt{c}&\mathtt{c}&\mathtt{c}\end{bmatrix})=\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{b}&\mathtt{a}&\mathtt{a}&\mathtt{b}\\ \mathtt{a}&\mathtt{a}&\mathtt{b}&\mathtt{a}&\mathtt{a}&\mathtt{b}\\ \mathtt{c}&\mathtt{c}&\mathtt{c}&\mathtt{c}&\mathtt{c}&\mathtt{c}\end{bmatrix}\,.$

As a matter of fact, for every two-dimensional word $U$ there exists a characteristic factorisation for $\alpha=\begin{bmatrix}x_{1}&x_{1}\\ x_{2}&x_{2}\end{bmatrix}$ of column-row type if and only if there exists a characteristic factorisation for $\alpha$ of row-column type. However, this is a particularity of $\alpha$ and, e. g., for $\alpha^{\prime}=\begin{bmatrix}x_{1}&x_{2}&x_{3}\\ x_{2}&x_{3}&x_{1}\end{bmatrix}$ and $W^{\prime}:=\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{a}&\mathtt{b}&\mathtt{c}\\ \mathtt{b}&\mathtt{c}&\mathtt{a}&\mathtt{a}&\mathtt{a}\\ \end{bmatrix}$ , there exists a characteristic factorisation of column-row type $W^{\prime}=(V_{1}\varobar V_{2}\varobar V_{3})\varominus(V_{2}\varobar V_{3}\varobar V_{1})=(\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{a}\end{bmatrix}\varobar\begin{bmatrix}\mathtt{b}\end{bmatrix}\varobar\begin{bmatrix}\mathtt{c}\end{bmatrix})\varominus(\begin{bmatrix}\mathtt{b}\end{bmatrix}\varobar\begin{bmatrix}\mathtt{c}\end{bmatrix}\varobar\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{a}\end{bmatrix})$ , but no characteristic factorisation of row-column type. Furthermore, the column-row factorisation of $W^{\prime}$ is somewhat at odds with our intuitive understanding of what it means that a two-dimensional word satisfies a two-dimensional pattern. This is due to the fact that factorising $W^{\prime}$ into $(\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{a}\end{bmatrix}\varobar\begin{bmatrix}\mathtt{b}\end{bmatrix}\varobar\begin{bmatrix}\mathtt{c}\end{bmatrix})\varominus(\begin{bmatrix}\mathtt{b}\end{bmatrix}\varobar\begin{bmatrix}\mathtt{c}\end{bmatrix}\varobar\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{a}\end{bmatrix})$ means that we associate the two-dimensional factors $\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{a}\end{bmatrix}$ , $\begin{bmatrix}\mathtt{b}\end{bmatrix}$ and $\begin{bmatrix}\mathtt{c}\end{bmatrix}$ with the variables $x_{1}$ , $x_{2}$ and $x_{3}$ , respectively, but in the pattern $\alpha^{\prime}$ the vertical neighbourship relation between the occurrence of $x_{2}$ in the first row and the occurrence of $x_{3}$ in the second row is not preserved in $W^{\prime}$ with respect to the corresponding two-dimensional factors $\begin{bmatrix}\mathtt{b}\end{bmatrix}$ and $\begin{bmatrix}\mathtt{c}\end{bmatrix}$ . More precisely, while a column-row factorisation preserves the horizontal neighbourship relation of the variables, it may violate their vertical neighbourship relation, where for row-column factorisations it is the other way around. Consequently, if we want both the vertical as well as the horizontal neighbourship relation to be preserved, we should require that the two-dimensional word $U$ can be disassembled into two-dimensional factors that induce both a column-row as well as a row-column factorisation. More precisely, we say that $U$ satisfies $\alpha^{\prime}=\begin{bmatrix}x_{1}&x_{2}&x_{3}\\ x_{2}&x_{3}&x_{1}\end{bmatrix}$ if and only if there exist two-dimensional words $V_{1},V_{2}$ and $V_{3}$ , such that $U=(V_{1}\varobar V_{2}\varobar V_{3})\varominus(V_{2}\varobar V_{3}\varobar V_{1})=(V_{1}\varominus V_{2})\varobar(V_{2}\varominus V_{3})\varobar(V_{3}\varominus V_{1})$ , which we call a proper characteristic factorisation of $U$ .

We are now ready to formalise the ideas developed so far and we can finally give a sound definition of two-dimensional pattern languages. Although we consider the class of two-dimensional pattern languages that results from the proper characteristic factorisations as the natural two-dimensional counterpart of the class of one-dimensional pattern languages, we shall also define the other classes of two-dimensional pattern languages which were sketched above.

For the definition of two-dimensional patterns, we use the same set of variables $X$ that has already been used in the definition of one-dimensional pattern languages. An array pattern is a non-empty two-dimensional word over $X$ and a terminal array is a non-empty two-dimensional word over a terminal alphabet $\Sigma$ . If it is clear from the context that we are concerned with array patterns and terminal arrays, then we simply say pattern and array, respectively. Any mapping $h:X\rightarrow\Sigma^{++}$ is called a substitution. For any substitution $h$ , by $h_{\varobar,\varominus}$ , we denote the mapping $X^{++}\rightarrow\Sigma^{++}$ defined in the following way. For any $\alpha:=[y_{i,j}]_{m,n}\in X^{++}$ , we define

	$\displaystyle h_{\varobar,\varominus}(\alpha):=\>$	$\displaystyle(h(y_{1,1})\varobar h(y_{1,2})\varobar\ldots\varobar h(y_{1,n}))\varominus$
		$\displaystyle(h(y_{2,1})\varobar h(y_{2,2})\varobar\ldots\varobar h(y_{2,n}))\varominus\ldots\varominus$
		$\displaystyle(h(y_{m,1})\varobar h(y_{m,2})\varobar\ldots\varobar h(y_{m,n}))\,.$

Similarly, $h_{\varominus,\varobar}:X^{++}\rightarrow\Sigma^{++}$ is defined by

	$\displaystyle h_{\varominus,\varobar}(\alpha):=\>$	$\displaystyle(h(y_{1,1})\varominus h(y_{2,1})\varominus\ldots\varominus h(y_{m,1}))\varobar$
		$\displaystyle(h(y_{1,2})\varominus h(y_{2,2})\varominus\ldots\varominus h(y_{m,2}))\varobar\ldots\varobar$
		$\displaystyle(h(y_{1,n})\varominus h(y_{2,n})\varominus\ldots\varominus h(y_{m,n}))\,.$

Intuitively speaking, both mappings $h_{\varobar,\varominus}$ and $h_{\varobar,\varominus}$ , when applied to an array pattern $\alpha$ , first substitute every variable occurrence of $\alpha$ by a terminal array according to the substitution $h$ and then these $m\times n$ individual terminal arrays are assembled to one terminal array by either first column-concatenating all the $n$ terminal arrays in every individual row and then row-concatenating the resulting $m$ terminal arrays, or by first row-concatenating all the $m$ terminal arrays in every individual column and then column-concatenating the resulting $n$ terminal arrays.

Let $\alpha\in X^{++}$ , $W\in\Sigma^{++}$ and let $h:X\rightarrow\Sigma^{++}$ . The array $W$ is a (1) column-row image of $\alpha$ (with respect to $h$ ), (2) a row-column image of $\alpha$ (with respect to $h$ ) or (3) a proper image of $\alpha$ (with respect to $h$ ) if and only if (1) $h_{\varobar,\varominus}(\alpha)=W$ , (2) $h_{\varominus,\varobar}(\alpha)=W$ or (3) $h_{\varobar,\varominus}(\alpha)=h_{\varominus,\varobar}(\alpha)=W$ , respectively. The mapping $h$ is called a column-row substitution for $\alpha$ and $W$ , a row-column substitution for $\alpha$ and $W$ or a proper substitution for $\alpha$ and $W$ , respectively. We say that $W$ is a column-row, a row-column or a proper image of $\alpha$ if there exists a column-row, a row-column or a proper substitution, respectively, for $\alpha$ and $W$ .

A nice and intuitive way to interpret the different kinds of images of array patterns is to imagine a grid to be placed over the terminal array. The vertical lines of the grid represent a column concatenation and the horizontal lines of the grid represent a row concatenation of the corresponding factorisation. This means that every rectangular area of the grid corresponds to an occurrence of a variable $x$ in the array pattern or, more precisely, to the array $h(x)$ substituted for $x$ . The fact that an array satisfies a pattern is then represented by the situation that each two rectangular areas of the grid that correspond to occurrences of the same variable must have identical content. In Figure 1, an example for each a morphic image, a proper image, a column-row image and a row-column image of a $5\times 4$ pattern is represented in this illustrative way.

Figure 1: Illustrating possible image partitions.

Alternatively, we can interpret the property that a terminal array $W$ is a certain type of image of an array pattern as a tiling of $W$ . More precisely, $W$ satisfies a given array pattern $\alpha$ with $n$ different variables if and only if $n$ tiles can be allocated to the $n$ variables of $\alpha$ such that combining the tiles as indicated by the structure of $\alpha$ yields $W$ . The grids depicted in Figure 1 then illustrate the structure of such a tiling. The definitions of the corresponding classes of pattern languages are now straightforward:

Definition 2.

Let $\alpha\in X^{++}$ be an array pattern. We define the following variants of two-dimensional pattern languages:

•

$L_{\Sigma,h}(\alpha):=\{W\in\Sigma^{**}\mid W\text{ is a morphic image of }\alpha\}$ ,
•

$L_{\Sigma,p}(\alpha):=\{W\in\Sigma^{**}\mid W\text{ is a proper image of }\alpha\}$ ,
•

$L_{\Sigma,r}(\alpha):=\{W\in\Sigma^{**}\mid W\text{ is a column-row image of }\alpha\}$ ,
•

$L_{\Sigma,c}(\alpha):=\{W\in\Sigma^{**}\mid W\text{ is a row-column image of }\alpha\}$ ,
•

$L_{\Sigma,rc}(\alpha):=L_{\Sigma,r}(\alpha)\cup L_{\Sigma,c}(\alpha)$ .

For a pattern $\alpha$ , we also denote the above languages by $Z$ pattern language of $\alpha$ , where $Z\in\{h,p,r,c,rc\}$ . For every $x\in\{r,c,rc,p,h\}$ , we define $\mathcal{L}_{\Sigma,x}:=\{L_{\Sigma,x}(\alpha)\mid\alpha\in X^{++}\}$ and $\mathcal{L}_{x}:=\{\mathcal{L}_{\Sigma,x}\mid\Sigma\text{ is some alphabet}\}$ .

Since, for a fixed array pattern $\alpha$ , every morphic image is a proper image and every proper image is a row-column image as well as a column-row image, the following subset relations between the different types of pattern languages hold (in the following diagram, an arrow denotes a subset relation):

Remark 1.

As indicated in the introductory part of this section, we consider the class of $p$ pattern languages as the most natural class of two-dimensional pattern languages. Another observation that supports this claim is that the $p$ pattern languages are compatible, in a certain sense, to the one-dimensional pattern languages. More precisely, for a one-dimensional (i. e., $1\times n$ ) array pattern $\alpha$ the set $L_{\Sigma,p}(\alpha)\cap\{W\in\Sigma^{++}\mid|W|_{r}=1\}$ coincides with the one-dimensional pattern language of $\alpha$ . This does not hold for the $h$ pattern languages (since in the one-dimensional case the words variables are mapped to can differ in length), but holds for the $r$ , $c$ and $rc$ pattern languages. However, as pointed out above, the $r$ , $c$ and $rc$ pattern language of a given pattern $\alpha$ may contain arrays that, from an intuitive point of view, do not satisfy $\alpha$ .

4 General Observations

In this section, we state some general lemmas about two-dimensional morphisms and array pattern languages, which shall be important for proving the further results presented in this paper. First, we refine Proposition 1, by giving a convenient characterisation for the morphism property for mappings on arrays. To this end, we define a substitution $h$ to be $(m,n)$ -uniform if, for every $x\in X$ , $|h(x)|_{r}=m$ and $|h(x)|_{c}=n$ and a substitution is uniform if it is $(m,n)$ -uniform, for some $m,n\in\mathbb{N}$ .

Lemma 1.

A mapping $h:\Sigma^{**}\to\Gamma^{**}$ is a two-dimensional morphism if and only if $h=g_{\varominus,\varobar}=g_{\varobar,\varominus}$ , where $g:\Sigma\to\Gamma^{**}$ is a uniform substitution.

Proof.

We first observe that if $g$ is uniform, then $g_{\varominus,\varobar}=g_{\varobar,\varominus}$ obviously holds (so it is sufficient to prove the statement of the lemma only for one of these two mappings). Furthermore, if $g$ is uniform, then, for every $U,V\in\Sigma^{**}$ , $g_{\varobar,\varominus}(U\varominus V)=g_{\varobar,\varominus}(U)\varominus g_{\varobar,\varominus}(V)$ and $g_{\varobar,\varominus}(U\varobar V)=g_{\varobar,\varominus}(U)\varobar g_{\varobar,\varominus}(V)$ , which proves the if direction. In order to prove the only if direction, we assume that $h$ is a two-dimensional morphism and we define a substitution $g$ by $g(b):=h(b)$ , $b\in\Sigma$ . Furthermore, let $\widehat{g}\in\{g_{\varominus,\varobar},g_{\varobar,\varominus}\}$ . We now show that $h$ equals $\widehat{g}$ by induction. By definition, for every $b\in\Sigma$ , $h(b)=\widehat{g}(b)$ . Now let $U,V\in\Sigma^{**}$ with $h(U)=g^{\prime}(U)$ and $h(V)=g^{\prime}(V)$ . Then $h(U\varominus V)=h(U)\varominus h(V)=g^{\prime}(U)\varominus g^{\prime}(V)=g^{\prime}(U\varominus V)$ and, analogously, $h(U\varobar V)=h(U)\varobar h(V)=g^{\prime}(U)\varobar g^{\prime}(V)=g^{\prime}(U\varobar V)$ . By induction, it follows that $h=\widehat{g}$ . Consequently, we can conclude that $g$ is uniform. ∎

The next lemma states that the composition of two two-dimensional morphisms is again a two-dimensional morphism.

Lemma 2.

Let $h_{1}:\Gamma^{**}\to\Sigma^{**}$ and $h_{2}:\Sigma^{**}\to\Delta^{**}$ be two-dimensional morphisms. Then, the composition $h_{1,2}:=h_{1}\circ h_{2}:\Gamma^{**}\to\Delta^{**}$ is also a two-dimensional morphism.

Proof.

We first observe the following. If $g$ and $f$ are some uniform substitutions, then $h:=g\circ f$ is a uniform substitution as well. Furthermore, it can be easily verified that $g_{\varominus,\varobar}\circ f_{\varominus,\varobar}=h_{\varominus,\varobar}$ . With Lemma 1, this directly implies the statement of the lemma. ∎

It is intuitively clear that the structure of a pattern fully determines the corresponding pattern language and the actual names of the variables are irrelevant, e. g., the patterns $\begin{bmatrix}x_{1}&x_{2}&x_{1}\\ x_{2}&x_{3}&x_{3}\end{bmatrix}$ and $\begin{bmatrix}x_{7}&x_{3}&x_{7}\\ x_{3}&x_{5}&x_{5}\end{bmatrix}$ should be considered identical. In the following we formalise this intuition. Two array patterns $\alpha:=[y_{i,j}]_{m,n}$ and $\beta:=[z_{i,j}]_{m^{\prime},n^{\prime}}$ are equivalent up to a renaming, denoted by $\alpha\sim\beta$ , if and only if $m=m^{\prime}$ , $n=n^{\prime}$ and, for every $i,j,i^{\prime},j^{\prime}$ , $1\leq i,i^{\prime}\leq m$ , $1\leq j,j^{\prime}\leq n$ , $y_{i,j}=y_{i^{\prime},j^{\prime}}$ if and only if $z_{i,j}=z_{i^{\prime},j^{\prime}}$ .

Lemma 3.

Let $z,z^{\prime}\in\{h,p,r,c,rc\}$ , let $\Sigma$ be an alphabet with $|\Sigma|\geq 2$ and let $\alpha,\beta\in X^{++}$ . If $L_{\Sigma,z}(\alpha)=L_{\Sigma,z^{\prime}}(\beta)$ , then $\alpha\sim\beta$ .

Proof.

We assume that $L_{\Sigma,z}(\alpha)=L_{\Sigma,z^{\prime}}(\beta)$ and note that this implies that $|\alpha|_{c}=|\beta|_{c}=m$ and $|\alpha|_{r}=|\beta|_{r}=n$ . This is due to the fact that if $|\alpha|_{c}<|\beta|_{c}$ or $|\alpha|_{r}<|\beta|_{r}$ , then the array obtained from $\alpha$ by replacing every variable by a single symbol $\mathtt{a}\in\Sigma$ is in $L_{\Sigma,z}(\alpha)$ , but not in $L_{\Sigma,z^{\prime}}(\beta)$ . We further assume that $\alpha\nsim\beta$ , which implies that there are $i,j,i^{\prime},j^{\prime}$ with $1\leq i,i^{\prime}\leq m$ and $1\leq j,j^{\prime}\leq n$ , such that $\alpha[i,j]=\alpha[i^{\prime},j^{\prime}]$ and $\beta[i,j]\neq\beta[i^{\prime},j^{\prime}]$ (or $\alpha[i,j]\neq\alpha[i^{\prime},j^{\prime}]$ and $\beta[i,j]=\beta[i^{\prime},j^{\prime}]$ , for which an analogous argument applies). We now define a substitution $h:X\rightarrow\Sigma$ in the following way. For every $x\in X$ , if $\beta[i,j]=x$ , then $h(x):=\mathtt{a}$ and if $\beta[i,j]\neq x$ , then $h(x):=\mathtt{b}$ . We observe that, since $h_{\varominus,\varobar}(\beta)$ is a morphic image, a proper image, a row-column image and a column-row image of $\beta$ , $h_{\varominus,\varobar}(\beta)\in L_{\Sigma,z^{\prime}}(\beta)$ . Furthermore, $h_{\varominus,\varobar}(\beta)[i,j]\neq h_{\varominus,\varobar}(\beta)[i^{\prime},j^{\prime}]$ . On the other hand, for every $W\in L_{\Sigma,z}(\alpha)$ , with $|W|_{c}=|h_{\varominus,\varobar}(\beta)|_{c}$ and $|W|_{r}=|h_{\varominus,\varobar}(\beta)|_{r}$ , $W[i,j]=W[i^{\prime},j^{\prime}]$ is satisfied. Thus, $h_{\varominus,\varobar}(\beta)\notin L_{\Sigma,z}(\alpha)$ , which implies that $L_{\Sigma,z}(\alpha)\neq L_{\Sigma,z^{\prime}}(\beta)$ . ∎

For every $z,z^{\prime}\in\{h,p,r,c,rc\}$ , $z\neq z^{\prime}$ , $\alpha\sim\beta$ does not necessarily imply $L_{\Sigma,z}(\alpha)=L_{\Sigma,z^{\prime}}(\beta)$ , as pointed out by, e. g., $L_{\Sigma,h}(\begin{bmatrix}x_{1}&x_{2}\end{bmatrix})\neq L_{\Sigma,p}(\begin{bmatrix}x_{1}&x_{2}\end{bmatrix})$ or $L_{\Sigma,p}(\begin{bmatrix}x_{1}&x_{2}\\ x_{3}&x_{1}\end{bmatrix})\neq L_{\Sigma,c}(\begin{bmatrix}x_{1}&x_{2}\\ x_{3}&x_{1}\end{bmatrix})$ . On the other hand, since, for every $z,\in\{h,p,r,c,rc\}$ , $\alpha\sim\beta$ obviously implies $L_{\Sigma,z}(\alpha)=L_{\Sigma,z}(\beta)$ , two $z$ pattern languages are equivalent if and only if they are described by two patterns that are equivalent up to a renaming.

In the remainder of this work, we do not distinguish anymore between patterns that are equivalent up to a renaming, i. e., from now on we say that $\alpha$ and $\beta$ are equivalent, denoted by $\alpha=\beta$ for simplicity, if they are actually the same arrays or if they are equivalent up to a renaming.

5 Comparison of Array Pattern Language Classes

In this section, we provide a pairwise comparison of our different classes of array pattern languages and, furthermore, we compare them with the class of recognisable array languages, denoted by REC, which is one of the most prominent classes of array languages. For a detailed description of REC, the reader is referred to the survey [7] by Giammarresi and Restivo. Next, we show that, for every alphabet $\Sigma$ with $|\Sigma|\geq 2$ , the language classes REC, $\mathcal{L}_{\Sigma,h}$ , $\mathcal{L}_{\Sigma,p}$ , $\mathcal{L}_{\Sigma,r}$ , $\mathcal{L}_{\Sigma,c}$ and $\mathcal{L}_{\Sigma,rc}$ are pairwise incomparable. More precisely, for every $\mathcal{L}_{1},\mathcal{L}_{2}\in\{\text{REC},\mathcal{L}_{\Sigma,h},\mathcal{L}_{\Sigma,p},\mathcal{L}_{\Sigma,r},\mathcal{L}_{\Sigma,c},\mathcal{L}_{\Sigma,rc}\}$ with $\mathcal{L}_{1}\neq\mathcal{L}_{2}$ , we show that $\mathcal{L}_{1}\setminus\mathcal{L}_{2}\neq\emptyset$ , $\mathcal{L}_{2}\setminus\mathcal{L}_{1}\neq\emptyset$ and $\mathcal{L}_{1}\cap\mathcal{L}_{2}\neq\emptyset$ . The non-emptiness of the pairwise intersections of these language classes can be easily seen:

Proposition 2.

For every $z\in\{h,p,r,c,rc\}$ , $L_{z,\Sigma}(\begin{bmatrix}x_{1}\end{bmatrix})=\Sigma^{++}$ and $\Sigma^{++}\in\text{REC}$ .

It remains to find, for every $\mathcal{L}_{1},\mathcal{L}_{2}\in\{\text{REC},\mathcal{L}_{\Sigma,h},\mathcal{L}_{\Sigma,p},\mathcal{L}_{\Sigma,r},\mathcal{L}_{\Sigma,c},\mathcal{L}_{\Sigma,rc}\}$ , a separating language $L_{1}\in\mathcal{L}_{1}\setminus\mathcal{L}_{2}$ and a separating language $L_{2}\in\mathcal{L}_{2}\setminus\mathcal{L}_{1}$ . We first present all these separating languages in a table and then we formally prove their separating property. In rows $2$ to $6$ of the following table, if a pattern $\alpha$ is the entry that corresponds to the row labeled by class $\mathcal{L}_{\Sigma,z}$ and the column labeled by class $\mathcal{L}_{\Sigma,z^{\prime}}$ , where $z,z^{\prime}\in\{h,p,r,c,rc\}$ , $z\neq z^{\prime}$ , then this means that $L\in\mathcal{L}_{\Sigma,z}\setminus\mathcal{L}_{\Sigma,z^{\prime}}$ . Row $1$ , on the other hand, contains recognisable array languages that are not array pattern languages.

	REC	$\mathcal{L}_{\Sigma,h}$	$\mathcal{L}_{\Sigma,p}$	$\mathcal{L}_{\Sigma,r}$	$\mathcal{L}_{\Sigma,c}$	$\mathcal{L}_{\Sigma,rc}$

REC	–	$\{\begin{bmatrix}\mathtt{a}\end{bmatrix}\}$	$\{\begin{bmatrix}\mathtt{a}\end{bmatrix}\}$	$\{\begin{bmatrix}\mathtt{a}\end{bmatrix}\}$	$\{\begin{bmatrix}\mathtt{a}\end{bmatrix}\}$	$\{\begin{bmatrix}\mathtt{a}\end{bmatrix}\}$
$\mathcal{L}_{\Sigma,h}$	$\begin{bmatrix}x_{1}\\ x_{1}\end{bmatrix}$	–	$\begin{bmatrix}x_{1}&x_{2}\\ x_{3}&x_{4}\\ \end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{3}&x_{4}\\ \end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{3}&x_{4}\\ \end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{3}&x_{4}\\ \end{bmatrix}$
$\mathcal{L}_{\Sigma,p}$	$\begin{bmatrix}x_{1}\\ x_{1}\end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{3}&x_{4}\\ \end{bmatrix}$	–	$\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\end{bmatrix}$
$\mathcal{L}_{\Sigma,r}$	$\begin{bmatrix}x_{1}\\ x_{1}\end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{3}&x_{4}\\ \end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\end{bmatrix}$	–	$\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\end{bmatrix}$
$\mathcal{L}_{\Sigma,c}$	$\begin{bmatrix}x_{1}\\ x_{1}\end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{3}&x_{4}\\ \end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\end{bmatrix}$	–	$\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\end{bmatrix}$
$\mathcal{L}_{\Sigma,rc}$	$\begin{bmatrix}x_{1}\\ x_{1}\end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{3}&x_{4}\\ \end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\end{bmatrix}$	$\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\end{bmatrix}$	–

Lemma 4.

$L_{\Sigma,h}(\begin{bmatrix}x_{1}\\ x_{1}\end{bmatrix})\notin\text{REC}$ .

Proof.

In this proof, we use the characterisation of REC by local array languages and projections (see Giammarresi and Restivo [7] and also the next section). Let $\alpha:=\begin{bmatrix}x_{1}\\ x_{1}\end{bmatrix}$ and $L:=L_{\Sigma,h}(\alpha)=\{W\varominus W\mid W\in\Sigma^{++}\}$ . Suppose $L\in REC$ . Then there is a local array language $L^{\prime}$ over an alphabet $\Gamma$ so that $L$ is a projection of $L^{\prime}$ . For the sake of convenience, we define $r:=|\Gamma|$ and $s:=|\Sigma|$ . For every $m,n\in\mathbb{N}$ , let

L_{m,n}:=\{W\varominus W\mid W\in\Sigma^{++}\text{ and }|W|_{r}=m,|W|_{c}=n\}\subseteq L\,.

Obviously, $|L_{m,n}|=s^{mn}$ . Now let $L^{\prime}_{m,n}$ be the set of pictures in $L^{\prime},$ whose projections are in $L_{m,n}$ . In the arrays of $L^{\prime}_{m,n},$ there are at most $r^{2n}$ possibilities of how the $m^{\text{th}}$ and $(m+1)^{\text{th}}$ row can look like. For sufficiently large $m,$ $s^{mn}>r^{2n}.$ Thus, there exist two arrays $W_{1}\varominus W_{1}$ and $W_{2}\varominus W_{2},$ $W_{1}\neq W_{2}$ in $L_{m,n}$ such that the corresponding arrays $W_{1}^{\prime}\varominus W_{1}^{\prime}$ and $W_{2}^{\prime}\varominus W_{2}^{\prime}$ in $L^{\prime}_{m,n}$ have the same $m^{\text{th}}$ row and the same $(m+1)^{\text{th}}$ row. Hence, since $L^{\prime}$ is a local array language, $W_{1}^{\prime}\varominus W_{2}^{\prime},W_{2}^{\prime}\varominus W_{1}^{\prime}\in L^{\prime}_{m,n}$ and therefore $W_{1}\varominus W_{2},W_{2}\varominus W_{1}\in L_{m,n}\subseteq L$ , which is a contradiction. ∎

It can be easily verified that, for every $z\in\{p,r,c,rc\}$ , $L_{\Sigma,z}(\alpha)=L_{\Sigma,h}(\alpha)$ , where $\alpha:=\begin{bmatrix}x_{1}\\ x_{1}\end{bmatrix}$ . Hence, for every $z\in\{h,p,r,c,rc\}$ , $L_{\Sigma,z}(\alpha)\notin\text{REC}$ , which implies the first column of the table. Furthermore, for every $z\in\{h,p,r,c,rc\}$ , $\{\begin{bmatrix}\mathtt{a}\end{bmatrix}\}\notin L_{\Sigma,z}(\alpha)$ , but $\{\begin{bmatrix}\mathtt{a}\end{bmatrix}\}\in\text{REC}$ , which implies the first row of the table.

We point out that, by Lemma 3, for every $z,z^{\prime}\in\{h,p,r,c,rc\}$ , $z\neq z^{\prime}$ , if there exists a pattern $\beta$ with $L_{\Sigma,z}(\beta)\neq L_{\Sigma,z^{\prime}}(\beta)$ , then $L_{\Sigma,z}(\beta)\in\mathcal{L}_{\Sigma,z}\setminus\mathcal{L}_{\Sigma,z^{\prime}}$ and $L_{\Sigma,z^{\prime}}(\beta)\in\mathcal{L}_{\Sigma,z^{\prime}}\setminus\mathcal{L}_{\Sigma,z}$ . Consequently, in order to prove the remaining entries of the table, it is sufficient to identify, for every $z,z^{\prime}\in\{h,p,r,c,rc\}$ , $z\neq z^{\prime}$ , a pattern $\beta$ with $L_{\Sigma,z}(\beta)\neq L_{\Sigma,z^{\prime}}(\beta)$ , which is done by the following two lemmas.

Lemma 5.

For every $z\in\{p,c,r,cr\}$ , $L_{\Sigma,h}(\begin{bmatrix}x_{1}&x_{2}\\ x_{3}&x_{4}\\ \end{bmatrix})\neq L_{\Sigma,z}(\begin{bmatrix}x_{1}&x_{2}\\ x_{3}&x_{4}\\ \end{bmatrix})$ .

Proof.

Let $\beta:=\begin{bmatrix}x_{1}&x_{2}\\ x_{3}&x_{4}\\ \end{bmatrix}$ and let $W:=\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{a}\\ \mathtt{a}&\mathtt{a}&\mathtt{a}\\ \mathtt{a}&\mathtt{a}&\mathtt{a}\\ \end{bmatrix}$ . For every $z\in\{p,c,r,cr\}$ , $W\in\mathcal{L}_{\Sigma,z}(\beta)$ , since $g_{\varominus,\varobar}(\beta)=g_{\varobar,\varominus}(\beta)=W$ , where $g$ is defined by

g(x_{1}):=\begin{bmatrix}\mathtt{a}&\mathtt{a}\\ \mathtt{a}&\mathtt{a}\\ \end{bmatrix},\>\>\\ g(x_{2}):=\begin{bmatrix}\mathtt{a}\\ \mathtt{a}\\ \end{bmatrix},\>\>\\ g(x_{3}):=\begin{bmatrix}\mathtt{a}&\mathtt{a}\\ \end{bmatrix},\>\>\\ g(x_{4}):=\begin{bmatrix}\mathtt{a}\\ \end{bmatrix}\,.

By Lemma 1, it is obvious that there does not exist any morphism $h$ with $h(\beta)=W$ . Thus, $W\notin L_{\Sigma,h}(\beta)$ and, for every $z\in\{p,c,r,cr\}$ , $L_{\Sigma,h}(\beta)\neq L_{\Sigma,z}(\beta)$ . ∎

Lemma 6.

For every $z,z^{\prime}\in\{p,c,r,cr\}$ , $z\neq z^{\prime}$ , $L_{\Sigma,z}(\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\end{bmatrix})\neq L_{\Sigma,z^{\prime}}(\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\\ \end{bmatrix})$ .

Proof.

Let $\gamma:=\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\end{bmatrix}$ and let $W_{1}:=\begin{bmatrix}\mathtt{a}&\mathtt{a}\\ \mathtt{a}&\mathtt{a}\\ \mathtt{a}&\mathtt{a}\\ \end{bmatrix},W_{2}:=\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{a}\\ \mathtt{a}&\mathtt{a}&\mathtt{a}\\ \end{bmatrix}$ . We observe that $g_{\varominus,\varobar}(\gamma)=W_{1}$ , $g^{\prime}_{\varobar,\varominus}(\gamma)=W_{2},$ where $g,g^{\prime}$ are defined by

g(x_{1}):=\begin{bmatrix}\mathtt{a}\\ \end{bmatrix},\,g(x_{2}):=\begin{bmatrix}\mathtt{a}\\ \mathtt{a}\\ \end{bmatrix},\,g^{\prime}(x_{1}):=\begin{bmatrix}\mathtt{a}\\ \end{bmatrix},\,g^{\prime}(x_{2}):=\begin{bmatrix}\mathtt{a}&\mathtt{a}\\ \end{bmatrix}\,.

Thus, $W_{1}\in L_{\Sigma,c}(\gamma)$ , $W_{2}\in L_{\Sigma,r}(\gamma)$ and $W_{1},W_{2}\in L_{\Sigma,rc}(\gamma)$ . On the other hand, $W_{1},W_{2}\notin L_{\Sigma,p}(\gamma)$ , since every proper image of $\gamma$ must have an even number of columns and an even number of rows. Consequently, for every $z\in\{r,c,rc\}$ , $L_{\Sigma,p}(\gamma)\neq L_{\Sigma,z}(\gamma)$ . Similarly, $W_{1}\notin L_{\Sigma,r}$ and $W_{2}\notin L_{\Sigma,c}$ , since every column-row image of $\gamma$ must have an even number of rows and every row-column image of $\gamma$ must have an even number of columns. This implies that, for every $z,z^{\prime}\in\{c,r,cr\}$ , $z\neq z^{\prime}$ , $L_{\Sigma,z}(\gamma)\neq L_{\Sigma,z^{\prime}}(\gamma)$ , which concludes the proof. ∎

6 Closure Properties of Array Pattern Languages

The research of closure properties of classes of formal languages is a classical topic in this area. However, the number of natural properties is richer in the case of arrays compared to the more conventional string case. Thus, in this section, we classify the operations that shall be investigated in this regard according to whether or not they correspond to string language operations.

First, in Section 6.1, we investigate operations that correspond to string language operations. These are the Boolean operations of union, intersection and complementation, and also two special cases of (inverse) morphisms: letter-to-letter morphisms, and more special surjective letter-to-letter morphisms, known as projections in the terminology of array languages, and, more generally, the two-dimensional morphisms as defined in Section 3

Next, in Section 6.2, we take a closer look at operations similar to string language operations. More precisely, we investigate closure under concatenation and concatenation closure (or Kleene star), which constitute classical operations for string languages, but, with respect to the array case, we encounter an important difference, namely, there are two different types of concatenations: row and column concatenation. In particular, the concatenation of two arrays could be undefined (just because the dimensions do not match), but the concatenation of the two corresponding languages need not be empty.

Finally, in Section 6.3, we investigate operations special to arrays, that are usually not considered or even defined for string languages. These are mainly geometric operations like quarter turn, half turn, reflection or transposition of an array.

6.1 String Language Operations

We first point out that, due to Lemma 7 below, whenever a non-closure result is known for terminal-free non-erasing string pattern languages, this would straightforwardly transfer to the array case. We will therefore focus on finding proofs for the string case for non-closure properties, and conversely, we will try to give proofs for the array case for closure properties. Interestingly enough, (non-)closure properties have not been studied for the (classical) terminal-free non-erasing string pattern languages, all published proofs that we are aware of for this topic use terminals or erasing. So, our study also contributes to the theory of string pattern languages. Conversely, if we do not manage to find proofs or examples as required for the mentioned approach, this implicitly always raises an open classical string language question. For any mode $z\in\{h,p,r,c,rc\}$ and any pattern $\pi$ , let $L_{\Sigma,z}^{\operatorname{\texttt{1D}}}(\pi)$ denote those arrays from $L_{\Sigma,z}(\pi)$ that have just one row, i. e., $L_{\Sigma,z}^{\operatorname{\texttt{1D}}}(\pi):=\{W\in L_{\Sigma,z}(\pi)\mid|W|_{r}=1\}$ . Clearly, such arrays can be interpreted as strings and vice versa. In this sense, $L_{\Sigma,z}^{\operatorname{\texttt{1D}}}(\pi)$ and the string language $L^{\operatorname{\texttt{1D}}}_{\Sigma}(\pi)$ generated by the pattern $\pi$ coincide, as long as $z\neq h$ . For $z=h$ , we encounter the special case that all inserted words have to be of the same length. Let us formulate this more formally:

Lemma 7.

Let $\pi$ be an array pattern of height one. Then, $\pi$ is, at the same time, a string pattern. Moreover, for any $z\in\{p,r,c,rc\}$ , $L_{\Sigma,z}^{\operatorname{\texttt{1D}}}(\pi)=L^{\operatorname{\texttt{1D}}}_{\Sigma}(\pi)$ , while $L_{\Sigma,h}^{\operatorname{\texttt{1D}}}(\pi)\subseteq L^{\operatorname{\texttt{1D}}}_{\Sigma}(\pi)$ .

We shall now prove non-closure properties for $L^{\operatorname{\texttt{1D}}}_{\Sigma}(\pi)$ , which directly carry over to the classes $L_{\Sigma,z}(\pi)$ , $z\in\{h,p,r,c,rc\}$ (for some operations, however, the class $L_{\Sigma,h}(\pi)$ constitutes a special case, which is treated separately). To this end, we will mostly focus on two patterns: $\alpha=xyx$ and $\beta=xxy$ . The next lemma states an immediate observation for these patterns.

Lemma 8.

Over the terminal alphabet $\Sigma=\{\mathtt{a},\mathtt{b}\}$ , let $L_{s}(\alpha)$ ( $L_{s}(\beta)$ ) denote the shortest words that can be described by $\alpha$ and $\beta$ , respectively, disallowing erasing. Then,

	$\displaystyle L_{s}(\alpha)$	$\displaystyle=$	$\displaystyle\{\mathtt{a}\mathtt{a}\mathtt{a},\mathtt{a}\mathtt{b}\mathtt{a},\mathtt{b}\mathtt{a}\mathtt{b},\mathtt{b}\mathtt{b}\mathtt{b}\}\,,$
	$\displaystyle L_{s}(\beta)$	$\displaystyle=$	$\displaystyle\{\mathtt{a}\mathtt{a}\mathtt{a},\mathtt{a}\mathtt{a}\mathtt{b},\mathtt{b}\mathtt{b}\mathtt{a},\mathtt{b}\mathtt{b}\mathtt{b}\}\,.$

Proposition 3.

For any non-unary alphabet $\Sigma$ , $\mathcal{L}^{\operatorname{\texttt{1D}}}_{\Sigma}$ is not closed under union.

Proof.

Without loss of generality, $\{\mathtt{a},\mathtt{b}\}\subseteq\Sigma$ . In the following argument, we actually focus on $\Sigma=\{\mathtt{a},\mathtt{b}\}$ , but this can be easily extended to the more general case. Assume that there was a pattern $\gamma$ with $\mathcal{L}^{\operatorname{\texttt{1D}}}_{\Sigma}(\gamma)=\mathcal{L}^{\operatorname{\texttt{1D}}}_{\Sigma}(\alpha)\cup\mathcal{L}^{\operatorname{\texttt{1D}}}_{\Sigma}(\beta)$ . Then,

L_{s}(\gamma)=L_{s}(\alpha)\cup L_{s}(\beta)=\{\mathtt{a}\mathtt{a}\mathtt{a},\mathtt{a}\mathtt{a}\mathtt{b},\mathtt{a}\mathtt{b}\mathtt{a},\mathtt{b}\mathtt{a}\mathtt{b},\mathtt{b}\mathtt{b}\mathtt{a},\mathtt{b}\mathtt{b}\mathtt{b}\}.

This means that $\gamma$ contains exactly three variable occurrences (with more, these words cannot be generated, with less, shorter words could be generated), i.e., $\gamma=y_{1}\,y_{2}\,y_{3}$ , $y_{i}\in X$ , $1\leq i\leq 3$ . As $L_{s}(\gamma)\neq\{\mathtt{a},\mathtt{b}\}^{3}$ , some of these variables must coincide, which leads to a contradiction. More precisely, if $y_{1}=y_{2}$ or $y_{2}=y_{3}$ , then $\mathtt{a}\mathtt{b}\mathtt{a}$ cannot be generated and if $y_{1}=y_{3}$ , then $\mathtt{a}\mathtt{a}\mathtt{b}$ cannot be generated. Hence, $\gamma$ with $L_{s}(\gamma)=L_{s}(\alpha)\cup L_{s}(\beta)$ cannot exist. ∎

Now if there was a pattern $\gamma$ such that $L_{\Sigma,z}(\gamma)=L_{\Sigma,z}(\alpha)\cup L_{\Sigma,z}(\beta)$ , $z\in\{p,r,c,rc\}$ , then, by Lemma 7, this would imply $L_{\Sigma}^{\operatorname{\texttt{1D}}}(\gamma)=L_{\Sigma}^{\operatorname{\texttt{1D}}}(\alpha)\cup L_{\Sigma}^{\operatorname{\texttt{1D}}}(\beta)$ , contradicting Proposition 3. We point out that in the proof of Proposition 3, we do not use any replacements by words of different lengths to obtain our contradiction. Hence, this argument is also valid in the case when $z=h$ .

Corollary 1.

None of the array pattern language classes under consideration (over some non-unary alphabet) is closed under union.

We proceed with the intersection operation.

Proposition 4.

For any non-unary alphabet $\Sigma$ , $\mathcal{L}^{\operatorname{\texttt{1D}}}_{\Sigma}$ is not closed under intersection.

Proof.

The argument resembles the previous proof. Assume that $\gamma$ describes $L^{\operatorname{\texttt{1D}}}_{\Sigma}(\alpha)\cap L^{\operatorname{\texttt{1D}}}_{\Sigma}(\beta)$ . Notice that $L_{s}(\gamma)=\{\mathtt{a}\mathtt{a}\mathtt{a},\mathtt{b}\mathtt{b}\mathtt{b}\}$ , which clearly implies that $\gamma=xxx$ . However, $\mathtt{a}\mathtt{a}\mathtt{b}\mathtt{a}\mathtt{a}\in(L^{\operatorname{\texttt{1D}}}_{\Sigma}(\alpha)\cap L^{\operatorname{\texttt{1D}}}_{\Sigma}(\beta))\setminus L^{\operatorname{\texttt{1D}}}_{\Sigma}(\gamma)$ . ∎

Notice that the replacement words we used for deriving a contradiction are of different lengths, meaning that $\mathtt{a}\mathtt{a}\mathtt{b}\mathtt{a}\mathtt{a}\in L_{\Sigma}(\alpha)$ because of the replacement $x\mapsto\mathtt{a}\mathtt{a}$ and $y\mapsto\mathtt{b}$ , but $\mathtt{a}\mathtt{a}\mathtt{b}\mathtt{a}\mathtt{a}\in L_{\Sigma}(\beta)$ because of $x\mapsto\mathtt{a}$ and $y\mapsto\mathtt{b}\mathtt{a}\mathtt{a}$ . Hence, we cannot conclude non-closure for the $h$ -mode in the following corollary:

Corollary 2.

None of the array pattern language classes under consideration (over some non-unary alphabet and apart from the $h$ -case) is closed under intersection.

Indeed, the $h$ -mode plays a special rôle, as can be seen by the following result.

Proposition 5.

Let $\Sigma$ be some alphabet. Then, $\mathcal{L}_{\Sigma,h}$ is closed under intersection.

Proof.

Assume that $a\in\Sigma$ . Let $\alpha,\beta$ be two array patterns. Let $m_{\alpha}$ be the height (number of rows) of $\alpha$ and $n_{\alpha}$ be the width (number of columns) of $\alpha$ . Likewise, $m_{\beta}$ and $n_{\beta}$ are understood. Then, the width of the smallest arrays in $L_{\Sigma,h}(\alpha)\cap L_{\Sigma,h}(\beta)$ equals $n=\operatorname{lcm}(n_{\alpha},n_{\beta})$ , and their height equals $m=\operatorname{lcm}(m_{\alpha},m_{\beta})$ . This can be easily seen by substituting each variable in $\alpha$ by the unique array of width $n^{\prime}_{\alpha}=\frac{\operatorname{lcm}(n_{\alpha},n_{\beta})}{n_{\alpha}}$ and height $m^{\prime}_{\alpha}=\frac{\operatorname{lcm}(m_{\alpha},m_{\beta})}{m_{\alpha}}$ over the alphabet $\{a\}$ into the pattern $\alpha$ , as $n=n_{\alpha}\times n^{\prime}_{\alpha}$ , $m=m_{\alpha}\times m^{\prime}_{\alpha}$ , and a similar substitution within $\beta$ .

For each variable $x$ that occurs in $\alpha$ , take $m^{\prime}_{\alpha}\times n^{\prime}_{\alpha}$ new variables $x_{i,j}$ with $1\leq i\leq m^{\prime}_{\alpha}$ and $1\leq j\leq n^{\prime}_{\alpha}$ . Define a morphism $h_{\alpha}$ by replacing the variable $x$ by the following array of height $n^{\prime}_{\alpha}$ and width $m^{\prime}_{\alpha}$ , consisting of the previously introduced variables:

\begin{smallmatrix}x_{11}&x_{12}&\cdots&x_{1n^{\prime}_{\alpha}}\\ x_{21}&x_{22}&\cdots&x_{2n^{\prime}_{\alpha}}\\ \vdots&\vdots&\vdots&\vdots\\ x_{m^{\prime}_{\alpha}1}&x_{m^{\prime}_{\alpha}2}&\cdots&x_{m^{\prime}_{\alpha}n^{\prime}_{\alpha}}\end{smallmatrix}

Hence, $h_{\alpha}(\alpha)$ is some array of height $m$ and of width $n$ . Accordingly, one can define a morphism $h_{\beta}$ such that $h_{\beta}(\beta)$ is again some array of height $m$ and of width $n$ . Due to Lemma 2, $L_{\Sigma,h}(h_{\alpha}(\alpha))\subseteq L_{\Sigma,h}(\alpha)$ and $L_{\Sigma,h}(h_{\beta}(\beta))\subseteq L_{\Sigma,h}(\beta)$ (*). Namely, if $U\in L_{\Sigma,h}(h_{\alpha}(\alpha))$ , then there exists some two-dimensional morphism $g$ such that $U=g(h_{\alpha}(\alpha))$ , i.e., there also exists some two-dimensional morphism $f:=h_{\alpha}\circ g$ with $U=f(\alpha)$ , so that $U\in L_{\Sigma,h}(\alpha)$ . Now, define an array pattern $\gamma$ of height $m$ and of width $n$ , consisting exclusively of variable entries, as follows: The variables occurring at positions $(i,j)$ and $(i^{\prime},j^{\prime})$ in $\gamma$ (where $1\leq i,i^{\prime}\leq m$ and $1\leq j,j^{\prime}\leq n$ ) are identical if and only if the corresponding variables in at least one of the arrays $h_{\alpha}(\alpha)$ or $h_{\beta}(\beta)$ are identical.

We claim that $L_{\Sigma,h}(\gamma)=L_{\Sigma,h}(\alpha)\cap L_{\Sigma,h}(\beta)$ .

As $\gamma$ is obtained from $h_{\alpha}(\alpha)$ by identifying certain variables, due to (*) we find that:

L_{\Sigma,h}(\gamma)\subseteq L_{\Sigma,h}(h_{\alpha}(\alpha))\subseteq L_{\Sigma,h}(\alpha),

and likewise for $\beta$ , so that $L_{\Sigma,h}(\gamma)\subseteq L_{\Sigma,h}(\alpha)\cap L_{\Sigma,h}(\beta)$ .

Conversely, we have already argued that the smallest arrays in $L_{\Sigma,h}(\alpha)\cap L_{\Sigma,h}(\beta)$ have height $m$ and width $n$ . More generally, any array $U\in L_{\Sigma,h}(\alpha)\cap L_{\Sigma,h}(\beta)$ has height $k\cdot m$ and width $l\cdot n$ . As $U\in L_{\Sigma,h}(\alpha)$ , there is some two-dimensional morphism $h_{U,\alpha}$ such that $U=h_{U,\alpha}(\alpha)$ . Moreover, for each variable $x$ in $\alpha$ , $|h_{U,\alpha}(x)|_{r}=k_{\alpha}=\frac{k\cdot m}{m_{\alpha}}$ and $|h_{U,\alpha}(x)|_{c}=l_{\alpha}=\frac{l\cdot n}{n_{\alpha}}$ . We can make an analogous reasoning with $\beta$ , introducing the constants $k_{\beta}=\frac{k\cdot m}{m_{\beta}}$ and $l_{\beta}=\frac{l\cdot n}{n_{\beta}}$ for the morphism $h_{U,\beta}$ . As $U=h_{U,\alpha}(\alpha)=h_{U,\beta}(\beta)$ , entries in $U$ must coincide both according to $\alpha$ and to $\beta$ . This is exactly reflected in the construction of $\gamma$ provided above, so that $h_{U,\alpha}=h_{\alpha}\circ h^{\prime}$ and $h_{U,\beta}=h_{\beta}\circ h^{\prime}$ for some two-dimensional morphism $h^{\prime}$ with $h^{\prime}(\gamma)=U$ . Hence, $U\in L_{\Sigma,h}(\gamma)$ . ∎

Arguments as in Propositions 3 and 4 can be given for any non-trivial binary set operation, for instance, symmetric difference or set difference. This also gives the according result for complementation, but there is also an easier argument in that case. Notice that, as non-erasing pattern languages or array patterns cannot reasonably cope with the empty word or the empty array, we disregard this in the complement operation.

Proposition 6.

For any alphabet $\Sigma$ , $\mathcal{L}^{\operatorname{\texttt{1D}}}_{\Sigma}$ is not closed under complementation.

Proof.

Let $\Sigma$ be some alphabet with $a\in\Sigma$ . Consider the pattern $\alpha=xy$ . The complement (disregarding the empty word) contains as shortest words a word of the form $a$ . This implies that a pattern $\beta$ describing the complement must have length one, i.e., $\beta=x$ . Hence, $L(\beta)=\Sigma^{+}$ , which is not the complement of $L(\alpha)$ . ∎

Corollary 3.

None of the array pattern language classes under consideration (over any alphabet) is closed under complementation.

Notice that in the other cases (but complementation), we cannot cope with unary alphabets. This might need some different arguments.

We shall now turn to operations that are described by different kinds of morphisms. For the array case, codings (or projections) is a common such operation.

Theorem 1.

Any of our array pattern language classes (over arbitrary alphabets) is closed under projections.

Proof.

The following proof works for every $z\in\{h,p,r,c,rc\}$ . Let $\alpha$ be some array pattern and let $\pi:\Sigma\to\Gamma$ be a surjective mapping that describes some projection. We shall now show that $\pi(L_{\Sigma,z}(\alpha))=L_{\Gamma,z}(\alpha)$ .

Namely, consider some array $U\in L_{\Sigma,z}(\alpha)$ . $U$ is obtained from $\alpha$ by replacing any variable $x$ occurring in $\alpha$ by some array $h(x)$ . If we replace $x$ by $\pi(h(x))$ instead (which has the same dimensions as $h(x)$ ), we can see that $\pi(U)\in L_{\Gamma,z}(\alpha)$ .

Conversely, fix some letter from $\pi^{-1}(\mathtt{a})$ for each $\mathtt{a}\in\Gamma$ for now and denote it by $\kappa(\mathtt{a})$ . This is possible, as $\pi$ is a surjective mapping. As $\pi(\kappa(\mathtt{a}))=\mathtt{a}$ for each $\mathtt{a}\in\Gamma$ by construction, for any $U\in L_{\Gamma,z}(\alpha)$ we can describe some $V\in L_{\Sigma,x}(\alpha)$ such that $\pi(V)=U$ , just by taking $V=\kappa(U)$ . ∎

The result does not generalize to (string) morphisms where each image is of the same length.

Proposition 7.

For any non-unary alphabet $\Sigma$ , $\mathcal{L}^{\operatorname{\texttt{1D}}}_{\Sigma}$ is not closed under morphisms that map every letter to a word of length two.

Proof.

Consider the pattern $x$ on the alphabet $\Sigma=\{\mathtt{a},\mathtt{b}\}$ , describing the universal language $\Sigma^{+}$ , and the morphism $h$ with $h(\mathtt{a})=\mathtt{a}\mathtt{b}$ and $h(\mathtt{b})=\mathtt{a}\mathtt{b}$ . Then, $h(\Sigma^{+})=\{\mathtt{a}\mathtt{b}\}^{+}$ , which is not a pattern language, as an easy analysis shows. ∎

Corollary 4.

None of the array pattern language classes under consideration (over some non-unary alphabet) is closed under two-dimensional morphisms.

This is also true for the more general operation of substitution, with the same examples.

Let us also remark that Proposition 7 did not rely on the fact that we restricted our attention to one specific non-unary alphabet $\Sigma$ . However, if we have a specific alphabet, then we can even state:

Proposition 8.

Any of our array pattern language classes (over some fixed alphabet $\Sigma$ ) is closed under some projection $\pi:\Sigma\to\Sigma$ if and only if $\pi$ is a bijection.

Proof.

If $\pi$ is some bijection, then the argument given in the proof of Theorem 1 applies. Recall now that for finite sets as $\Sigma$ , $\pi$ is a bijection if and only if $\pi$ is a surjection. If $\pi$ is not a bijection, then $\pi$ is not a surjection; hence, there is some $b\in\Sigma\setminus\pi(\Sigma)$ , Consider the pattern $\alpha=x$ . Clearly, $\pi(L_{\Sigma,z}(x))=(\pi(\Sigma))^{**}$ , $z\in\{h,p,r,c,rc\}$ , but any array pattern language over $\Sigma$ will also contain patterns with the letter $b$ . ∎

This immediately implies the following two results:

Corollary 5.

Any of our array pattern language classes (over binary alphabets) is closed under conjugation.

Corollary 6.

None of our array pattern language classes (over some fixed alphabet $\Sigma$ ) is closed under all letter-to-letter morphisms.

Alternatively, we could also look at inverse morphisms. Here, we already get negative results for inverse codings.

Proposition 9.

For any alphabet $\Sigma$ with at least four letters, $\mathcal{L}^{\operatorname{\texttt{1D}}}_{\Sigma}$ is not closed under inverse letter-to-letter morphisms.

Proof.

Consider the coding $h:\mathtt{a}\mapsto 1,\mathtt{b}\mapsto 1,\mathtt{c}\mapsto 2,\mathtt{d}\mapsto 3$ . The pattern $xx$ generates (over the alphabet $\{1,2,3\}$ ) the language $L=\{ww\mid w\in\{1,2,3\}^{+}\}$ . However, $L^{\prime}=h^{-1}(L)$ is not a pattern language. The shortest words in $L^{\prime}$ are of length two, so that a hypothetical pattern for $L^{\prime}$ must be of the form $yz$ or $yy$ . In the first case, $\mathtt{c}\mathtt{d}\notin L^{\prime}$ is produced, while in the second case, $\mathtt{a}\mathtt{b}\in L^{\prime}$ cannot be produced. ∎

Corollary 7.

None of the array pattern language classes under consideration (over some sufficiently large alphabet) is closed under inverse (two-dimensional) morphisms.

6.2 Operations Similar to String Language Operations

Let us first turn to the concatenation operation. As a warm-up, we first consider the string case.

Lemma 9.

For any alphabet $\Sigma$ , $\mathcal{L}^{\operatorname{\texttt{1D}}}_{\Sigma}$ is closed under concatenation.

Proof.

Consider two patterns $\alpha$ and $\beta$ . After renaming, we can assume that $\alpha$ and $\beta$ do not contain any identical variables. Then, $L^{\operatorname{\texttt{1D}}}_{\Sigma}(\alpha\beta)=L^{\operatorname{\texttt{1D}}}_{\Sigma}(\alpha)L^{\operatorname{\texttt{1D}}}_{\Sigma}(\beta)$ . ∎

The first thing one should note is that the concatenation of two arrays could be undefined (i. e., if their dimensions do not match), even though the concatenation of the two according languages need not be empty. However, we can prove:

Theorem 2.

Fix some alphabet $\Sigma$ .

•

$\mathcal{L}_{\Sigma,r}$ is closed under row concatenation $\varominus$ ;
•

$\mathcal{L}_{\Sigma,c}$ is closed under column concatenation $\varobar$ ;
•

$\mathcal{L}_{\Sigma,p}$ and $\mathcal{L}_{\Sigma,h}$ are closed both under row and under column concatenation.

Proof.

We only prove the first item. The others follow similarly. Observe that in the case of $\mathcal{L}_{\Sigma,h}$ , we need Lemmas 1 and 2 to finish the argument. Let $\alpha$ and $\beta$ be two array patterns. We can assume that the variable alphabets of $\alpha$ and $\beta$ are disjoint. We want to construct an array pattern $\gamma$ such that

L_{\Sigma,r}(\gamma)=L_{\Sigma,r}(\alpha)\varominus L_{\Sigma,r}(\beta).\qquad(*)

Let $m_{\alpha}$ be the number of rows of $\alpha$ and $n_{\alpha}$ be the number of columns of $\alpha$ . Accordingly, $m_{\beta}$ is the number of rows of $\beta$ and $n_{\beta}$ is the number of columns of $\beta$ . If $n_{\alpha}=n_{\beta}$ , then we can set $\gamma=\alpha\varominus\beta$ to satisfy $(*)$ as in the string case of Lemma 9. More generally, set $n=\operatorname{lcm}(n_{\alpha},n_{\beta})$ . $n$ is the width of the smallest arrays in $L_{\Sigma,r}(\alpha)\varominus L_{\Sigma,r}(\beta)$ . More generally speaking, any array in $L_{\Sigma,r}(\alpha)\varominus L_{\Sigma,r}(\beta)$ has some width that is a multiple of $n$ . We are going to exploit this property by constructing two array patterns $\alpha^{\prime}$ and $\beta^{\prime}$ of width $n$ such that

L_{\Sigma,r}(\alpha)\varominus L_{\Sigma,r}(\beta)=L_{\Sigma,r}(\alpha^{\prime})\varominus L_{\Sigma,r}(\beta^{\prime}),

so that we can apply our previous reasoning, defining now $\gamma=\alpha^{\prime}\varominus\beta^{\prime}$ . Consider the two-dimensional morphism $h_{\alpha}$ that maps every variable $x$ of $\alpha$ on an array of height one and width $n^{\prime}_{\alpha}=\frac{\operatorname{lcm}(n_{\alpha},n_{\beta})}{n_{\alpha}}$ , more precisely, onto $x_{1}x_{2}\cdots x_{n^{\prime}_{\alpha}}$ . Here, $x_{1},\dots,x_{n^{\prime}_{\alpha}}$ are “new variables.” Then, $h_{\alpha}(\alpha)$ is an array pattern of width $n=n_{\alpha}\cdot n^{\prime}_{\alpha}$ . Similarly, we define a morphism $h_{\beta}$ , yielding an array pattern $h_{\beta}(\beta)$ of width $n=n_{\beta}\cdot n^{\prime}_{\beta}$ . Now, any array $U$ in $L_{\Sigma,r}(\alpha^{\prime})$ has a width that is a multiple of $n$ and also belongs to $L_{\Sigma,r}(\alpha)$ , and conversely any $U\in L_{\Sigma,r}(\alpha)$ that has a width that is a multiple of $n$ belongs to $L_{\Sigma,r}(\alpha^{\prime})$ . Together with similar statements for the pattern $\beta$ , the claim follows. ∎

It is not a coincidence that for $\mathcal{L}_{\Sigma,r}$ and $\mathcal{L}_{\Sigma,c}$ , we had to focus on the “correct” concatenation operation in the preceding theorem. More precisely, we can show:

Theorem 3.

Fix some non-unary alphabet $\Sigma$ .

•

$\mathcal{L}_{\Sigma,r}$ is not closed under column concatenation $\varobar$ ;
•

$\mathcal{L}_{\Sigma,c}$ is not closed under row concatenation $\varominus$ ;
•

$\mathcal{L}_{\Sigma,rc}$ is neither closed under row nor under column concatenation.

Proof.

Again, we only prove the first item; the others can be seen in a similar fashion.

Consider the array patterns $\alpha=\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{3}\end{bmatrix}$ and $\beta=\begin{bmatrix}y_{1}&y_{2}\\ y_{2}&y_{3}\end{bmatrix}$ . Notice that $L_{\Sigma,r}(\alpha)=L_{\Sigma,r}(\beta)$ , There are arrays of width four and height two in $L:=L_{\Sigma,r}(\alpha)\varobar L_{\Sigma,r}(\beta)$ , and these are the smallest arrays in $L$ . Hence, any array pattern $\gamma$ with $L=L_{\Sigma,r}(\gamma)$ has width four and height two. Let $\gamma=\begin{bmatrix}z_{1}&z_{2}&z_{3}&z_{4}\\ z_{5}&z_{6}&z_{7}&z_{8}\end{bmatrix}$ , with possibly some of the variables being the same. As $\begin{bmatrix}\mathtt{a}&\mathtt{b}&\mathtt{a}&\mathtt{b}\\ \mathtt{b}&\mathtt{a}&\mathtt{b}&\mathtt{a}\\ \end{bmatrix}\in L$ , but $\begin{bmatrix}\mathtt{a}&\mathtt{b}&\mathtt{a}&\mathtt{a}\\ \mathtt{b}&\mathtt{a}&\mathtt{b}&\mathtt{a}\\ \end{bmatrix}\notin L$ and also $\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{a}&\mathtt{b}\\ \mathtt{b}&\mathtt{a}&\mathtt{b}&\mathtt{a}\\ \end{bmatrix}\notin L$ , $z_{2}=z_{5}$ and $z_{4}=z_{7}$ . More generally, it can be verified that the array pattern $\zeta=\begin{bmatrix}z_{1}&z_{2}&z_{3}&z_{4}\\ z_{2}&z_{6}&z_{4}&z_{8}\end{bmatrix}$ describes those and only those arrays of width four and height two that belong to $L$ . However, $U=\begin{bmatrix}\mathtt{a}&\mathtt{a}&\mathtt{b}&\mathtt{a}&\mathtt{b}&\mathtt{b}\\ \mathtt{b}&\mathtt{a}&\mathtt{b}&\mathtt{b}&\mathtt{a}&\mathtt{b}\end{bmatrix}$ is an array from $L_{\Sigma,r}(\zeta)$ (namely, consider $z_{1}=\mathtt{a}\mathtt{a}$ , $z_{2}=\mathtt{b}\mathtt{a}$ , $z_{3}=z_{4}=z_{6}=\mathtt{b}$ and $z_{8}=\mathtt{a}\mathtt{b}$ ) that does not belong to $L$ , as this would mean that $U=U_{1}\varobar U_{2}$ for some array $U_{1}$ or $U_{2}$ of width two and height two from $L_{\Sigma,r}(\alpha)=L_{\Sigma,r}(\beta)$ . However, neither $U_{1}=\begin{bmatrix}\mathtt{a}&\mathtt{a}\\ \mathtt{b}&\mathtt{a}\end{bmatrix}$ nor $U_{2}=\begin{bmatrix}\mathtt{b}&\mathtt{b}\\ \mathtt{a}&\mathtt{b}\end{bmatrix}$ belongs to $L_{\Sigma,r}(\alpha)=L_{\Sigma,r}(\beta)$ . ∎

Notice that the proofs of negative closure properties necessitate a non-unary alphabet to work.

We now turn to the Kleene closure. Here, we can again first show a non-closure result for the string case that then readily transfers to the array cases.

Lemma 10.

Let $\Sigma$ be a non-unary alphabet. Consider $\alpha=xx$ . Then, $(L^{\operatorname{\texttt{1D}}}_{\Sigma}(\alpha))^{+}\notin\mathcal{L}^{\operatorname{\texttt{1D}}}_{\Sigma}$ .

Proof.

The shortest words in $L:=(L^{\operatorname{\texttt{1D}}}_{\Sigma}(\alpha))^{+}$ are of length two. Hence, there are only two different possibilities for any pattern $\beta$ with $L^{\operatorname{\texttt{1D}}}_{\Sigma}(\beta)=L$ : If $\beta=\alpha=xx$ , then $\mathtt{a}\mathtt{a}\mathtt{b}\mathtt{b}\in L\setminus L^{\operatorname{\texttt{1D}}}_{\Sigma}(\beta)$ , while if $\beta=xy$ , then $\mathtt{a}\mathtt{b}\in L^{\operatorname{\texttt{1D}}}_{\Sigma}(\beta)\setminus L$ . ∎

Proposition 10.

Let $\Sigma$ be a non-unary alphabet. Then, none of the array language families $\mathcal{L}_{\Sigma,x}$ with $x\in\{r,c,rc,p,h\}$ is closed under column concatenation closure nor under row concatenation closure.

Proof.

Consider $\mathcal{L}_{\Sigma,r}$ . Due to Lemmas 7 and 10, this class is not closed under row concatenation closure. For the case of column contentation closure, reconsider the proof of Theorem 3. There, we have presented a language $L\in\mathcal{L}_{\Sigma,r}$ such that $L\varobar L\notin\mathcal{L}_{\Sigma,r}$ . But that argument also shows that the column contentation closure of $L$ does not belong to $\mathcal{L}_{\Sigma,r}$ .

The other cases are simillarly seen. For the case of morphisms, observe that the contradiction in Lemma 10 was derived by substituting the variables by words of the same length. ∎

6.3 Operations Special to Arrays

Recall that the transposition operation is first defined for arrays (or patterns) and can then be lifted to languages and even to language classes. Nearly by definition, we find:

Lemma 11.

Let $\Sigma$ be some alphabet. Let $\alpha$ be a pattern. Then, ${L_{\Sigma,r}(\alpha)}^{\texttt{T}}=L_{\Sigma,c}({\alpha}^{\texttt{T}})$ and ${L_{\Sigma,c}(\alpha)}^{\texttt{T}}=L_{\Sigma,r}({\alpha}^{\texttt{T}})$ .

Corollary 8.

Let $\Sigma$ be some alphabet. Then, ${\mathcal{L}_{\Sigma,r}}^{\texttt{T}}=\mathcal{L}_{\Sigma,c}$ and ${\mathcal{L}_{\Sigma,c}}^{\texttt{T}}=\mathcal{L}_{\Sigma,r}$ .

Since $\alpha:=\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\\ \end{bmatrix}$ is identical to its transposition and, as shown in the proof of Lemma 6, describes an $r$ pattern language (a $c$ pattern language), which is not a $c$ pattern language (not an $r$ pattern language, respectively), we can conclude the following:

Proposition 11.

Let $\Sigma$ be an alphabet. Neither $\mathcal{L}_{\Sigma,c}$ nor $\mathcal{L}_{\Sigma,r}$ are closed under transposition.

Proof.

Consider the pattern $\alpha:=\begin{bmatrix}x_{1}&x_{2}\\ x_{2}&x_{1}\\ \end{bmatrix}$ . As $\alpha={\alpha}^{\texttt{T}}$ ,

{L_{\Sigma,r}(\alpha)}^{\texttt{T}}=L_{\Sigma,c}({\alpha}^{\texttt{T}})=L_{\Sigma,c}(\alpha)\notin\mathcal{L}_{\Sigma,r},

as we have shown in Lemma 6. Symmetrically, the other claim follows. ∎

Proposition 12.

For any alphabet $\Sigma$ and $x\in\{h,p,rc\}$ , $\mathcal{L}_{\Sigma,x}$ is closed under transposition.

Proof.

For $h$ and $p$ , this claim is immediate from the fact that we have proper factorizations. For the case $x=rc$ , we use Lemma 11. Let $\alpha$ be some pattern. Then,

$\displaystyle{L_{\Sigma,rc}(\alpha)}^{\texttt{T}}$	$\displaystyle=$	$\displaystyle{({L_{\Sigma,r}(\alpha)}\cup{L_{\Sigma,c}(\alpha)})}^{\texttt{T}}$
	$\displaystyle=$	$\displaystyle L_{\Sigma,c}({\alpha}^{\texttt{T}})\cup L_{\Sigma,r}({\alpha}^{\texttt{T}})$
	$\displaystyle=$	$\displaystyle L_{\Sigma,rc}({\alpha}^{\texttt{T}})$

This immediately implies the claim. ∎

With respect to purely geometric operations as turns and reflections, we find the following:

Proposition 13.

Let $\Sigma$ be some alphabet.

•

$\mathcal{L}_{\Sigma,rc}$ , $\mathcal{L}_{\Sigma,p}$ and $\mathcal{L}_{\Sigma,h}$ are closed under quarter-turn.
•

For every $x\in\{r,c,rc,p,h\}$ , $\mathcal{L}_{\Sigma,x}$ is closed under half-turn and reflections.
•

$\mathcal{L}_{\Sigma,r}$ and $\mathcal{L}_{\Sigma,c}$ are closed neither under left nor under right turn.

Proof.

For the positive closure results, simply observe that the language described by the quarter-turn, by the half-turn or by a reflection of the array pattern $\alpha$ is just the quarter-turn, the half-turn or the reflection of the language described by $\alpha$ .

For the non-closure properties, by symmetry it suffices to show that there is a language in $\mathcal{L}_{\Sigma,r}$ whose quarter-turn is not in $\mathcal{L}_{\Sigma,r}$ . To this end, consider $L:=L_{\Sigma,r}\left(\begin{bmatrix}x&y\\ y&x\end{bmatrix}\right)$ . Observe that the quarter-turn of $L$ is the same as $L_{\Sigma,c}\left(\begin{bmatrix}x&y\\ y&x\end{bmatrix}\right)$ , which was proven not to be in $\mathcal{L}_{\Sigma,r}$ in Lemma 6. ∎

The positive closure properties can be easily observed by applying the geometric operation directly on the array pattern. In order to show non-closure of $\mathcal{L}_{\Sigma,r}$ and $\mathcal{L}_{\Sigma,c}$ with respect to left and right turn, it is again sufficient to observe that the pattern $\alpha$ from above is identical to its left or right turn and then apply a similar argument as in the proof of Lemma 6.

Due to symmetry, it does not matter if we consider horizontal or vertical reflections. Notice that both half-turns and reflections coincide in the string case in any meaningful, non-trivial interpretation; in that case, the operation is also known as mirror image.

7 Future Research Directions

A thorough investigation of the typical decision problems for two-dimensional pattern languages like the membership, inclusion and equivalence problem is left for future research. It can be easily seen that the NP-completeness of the membership problem for string pattern languages carries over to $\mathcal{L}_{\Sigma,x}$ , $x\in\{p,r,c,rc\}$ . On the other hand, for a given array pattern $\alpha$ and a terminal array $W$ , the question whether or not $W\in L_{\Sigma,h}(\alpha)$ can be decided in polynomial time by checking whether $W$ is a morphic image of $\alpha$ with respect to a $(\frac{|W|_{r}}{|\alpha|_{r}},\frac{|W|_{c}}{|\alpha|_{c}})$ -uniform substitution. As shown by Lemma 3, the equivalence problem for all the classes $\mathcal{L}_{\Sigma,x}$ with $x\in\{h,p,r,c,rc\}$ and $|\Sigma|\geq 2$ can be easily solved by simply comparing the patterns. However, for every $z,z^{\prime}\in\{h,p,r,c,rc\}$ , $z\neq z^{\prime}$ , the problem to decide for given patterns $\alpha$ and $\beta$ whether or not $L_{\Sigma,z}(\alpha)=L_{\Sigma,z^{\prime}}(\beta)$ might be worth investigating. The inclusion problem for terminal-free nonerasing string pattern languages is still open. Hence, with respect to the inclusion problem, a positive decidability result for two-dimensional pattern languages implies a positive decidability result for terminal-free nonerasing string pattern languages.

For string pattern languages it is common to use terminal symbols in the patterns as well as to consider the erasing case, i. e., variables can be replaced by the empty word. The $p$ pattern languages can be adapted to the erasing case by allowing variables to be substituted by the empty array. Furthermore, the situation of having a terminal symbol at position $(i,j)$ of an array pattern simply forces all the variables in the $i^{\text{th}}$ row to be substituted by arrays of height $1$ and all the variables in the $j^{\text{th}}$ column to be substituted by arrays of width $1$ . As in the string case, it is likely that in the two-dimensional case the difference between erasing and nonerasing substitutions and patterns with and without terminal symbols lead to different language classes with different decidability properties, too.

Finally, we wish to point out that it is straightforward to generalise our different classes of two-dimensional pattern languages to the three-dimensional or even $n$ -dimensional case.

References

[1] D. Angluin. Finding patterns common to a set of strings. Journal of Computer and System Sciences, 21:46–62, 1980.
[2] H. Fernau, R. Freund, and M. Holzer. The generative power of $d$ -dimensional #-context-free array grammars. In M. Margenstern, editor, Proceedings of MCU’98, Volume 2, pages 43–56. University of Metz, 1998.
[3] H. Fernau, M. L. Schmid, and K. G. Subramanian. Two-dimensional pattern languages. In S. Bensch, F. Drewes, R. Freund, and F. Otto, editors, Fifth Workshop on Non-Classical Models for Automata and Applications, NCMA, volume 294 of books@ocg.at, pages 117–132. Österreichische Computer Gesellschaft, 2013.
[4] R. Freund, G. Păun, and G. Rozenberg. Chapter 8: Contextual array grammars. In C. Martín-Vide, V. Mitrana, and G. Păun, editors, Series in Machine Perception and Artificial Intelligence: Volume 66 - Formal Models, Languages and Applications, pages 112–136. World Scientific, 2007.
[5] J. E. F. Friedl. Mastering Regular Expressions. O’Reilly, Sebastopol, CA, third edition, 2006.
[6] D. Giammarresi and A. Restivo. Recognizable picture languages. International Journal of Pattern Recognition and Artificial Intelligence, 6:31–46, 1992.
[7] D. Giammarresi and A. Restivo. Two-dimensional languages. In G. Rozenberg and A. Salomaa, editors, Handbook of Formal Languages, volume 3, chapter 4, pages 215–267. Springer, 1997.
[8] D. Giammarresi, A. Restivo, S. Seibert, and W. Thomas. Monadic second-order logic over rectangular pictures and recognizability by tiling systems. Information and Computation (formerly Information and Control), 125:32–45, 1996.
[9] A. Mateescu and A. Salomaa. Patterns. In G. Rozenberg and A. Salomaa, editors, Handbook of Formal Languages, volume 1, pages 230–242. Springer, 1997.
[10] O. Matz. Recognizable vs. regular picture languages. In Proc. 2nd International Conference on Algebraic Informatics, CAI 2007, volume 4728 of Lecture Notes in Computer Science, pages 75–86, 2007.
[11] K. Morita. Two-dimensional languages. In C. Martín-Vide, V. Mitrana, and G. Păun, editors, Studies in Fuzziness and Soft Computing - Formal Languages and Applications, pages 427–437. Springer, 2004.
[12] A. Rosenfeld. Picture Languages: Formal Models for Picture Recognition. Academic Press, Inc., Orlando, 1979.
[13] A. Rosenfeld and R. Siromoney. Picture languages – a survey. Languages of Design, 1:229–245, 1993.
[14] G. Siromoney, R. Siromoney, and K. Krithivasan. Picture languages with array rewriting rules. Information and Control, 22:447–470, 1973.
[15] P. S. P. Wang. Array Grammars, Patterns and Recognizers. World Scientific Publishing Co., Inc., NJ, USA, 1989.

Two-Dimensional Pattern Languages††thanks: This document is a full version (i. e., it contains all proofs) of the conference paper [3].

Abstract

1 Introduction

2 Preliminaries

Example 1.

Example 2.

3 Two-Dimensional Pattern Languages

Definition 1.

Proposition 1.

Proof.

Definition 2.

Remark 1.

4 General Observations

Lemma 1.

Proof.

Lemma 2.

Proof.

Lemma 3.

Proof.

5 Comparison of Array Pattern Language Classes

Proposition 2.

Lemma 4.

Proof.

Lemma 5.

Proof.

Lemma 6.

Proof.

6 Closure Properties of Array Pattern Languages

6.1 String Language Operations

Lemma 7.

Lemma 8.

Proposition 3.

Proof.

Corollary 1.

Proposition 4.

Proof.

Corollary 2.

Proposition 5.

Proof.

Proposition 6.

Proof.

Corollary 3.

Theorem 1.

Proof.

Proposition 7.

Proof.

Corollary 4.

Proposition 8.

Proof.

Corollary 5.

Corollary 6.

Proposition 9.

Proof.

Corollary 7.

6.2 Operations Similar to String Language Operations

Lemma 9.

Proof.

Theorem 2.

Proof.

Theorem 3.

Proof.

Lemma 10.

Proof.

Proposition 10.

Proof.

6.3 Operations Special to Arrays

Lemma 11.

Corollary 8.

Proposition 11.

Proof.

Proposition 12.

Proof.

Proposition 13.

Proof.

7 Future Research Directions

References

Two-Dimensional Pattern Languages^†^†thanks: This document is a full version (i. e., it contains all proofs) of the conference paper [3].