A Local Approach to Studying the Time and Space Complexity of Deterministic and Nondeterministic Decision Trees

Kerven Durdymyradov and Mikhail Moshkov
Computer, Electrical and Mathematical Sciences & Engineering Division
and Computational Bioscience Research Center
King Abdullah University of Science and Technology (KAUST)
Thuwal 23955-6900, Saudi Arabia
{kerven.durdymyradov,mikhail.moshkov}@kaust.edu.sa

Abstract

Decision trees and decision rules are intensively studied and used in different areas of computer science. The questions important for the theory of decision trees and rules include relations between decision trees and decision rule systems, time-space tradeoff for decision trees, and time-space tradeoff for decision rule systems. In this paper, we study arbitrary infinite binary information systems each of which consists of an infinite set called universe and an infinite set of two-valued functions (attributes) defined on the universe. We consider the notion of a problem over information system, which is described by a finite number of attributes and a mapping associating a decision to each tuple of attribute values. As algorithms for problem solving, we investigate deterministic and nondeterministic decision trees that use only attributes from the problem description. Nondeterministic decision trees are representations of decision rule systems that sometimes have less space complexity than the original rule systems. As time and space complexity, we study the depth and the number of nodes in the decision trees. In the worst case, with the growth of the number of attributes in the problem description, (i) the minimum depth of deterministic decision trees grows either as a logarithm or linearly, (ii) the minimum depth of nondeterministic decision trees either is bounded from above by a constant or grows linearly, (iii) the minimum number of nodes in deterministic decision trees has either polynomial or exponential growth, and (iv) the minimum number of nodes in nondeterministic decision trees has either polynomial or exponential growth. Based on these results, we divide the set of all infinite binary information systems into three complexity classes. This allows us to identify nontrivial relationships between deterministic decision trees and decision rules systems represented by nondeterministic decision trees. For each class, we study issues related to time-space trade-off for deterministic and nondeterministic decision trees.

Keywords: Deterministic decision trees, Nondeterministic decision trees, Time complexity, Space complexity, Complexity classes, Time-space trade-off.

1 Introduction

In this paper, instead of decision rule systems we study nondeterministic decision trees. These trees can be considered as representations of decision rule systems that sometimes have less space complexity than the original rule systems. We study problems over infinite binary information systems and divide the set of all infinite binary information systems into three complexity classes depending on the worst case time and space complexity of deterministic and nondeterministic decision trees solving problems. This allows us to identify nontrivial relationships between deterministic decision trees and decision rule systems represented by nondeterministic decision trees. For each complexity class, we study issues related to time-space trade-off for deterministic and nondeterministic decision trees.

Decision trees [1, 2, 6, 22, 26, 32] and systems of decision rules [4, 5, 7, 11, 25, 29, 30, 31, 35] are widely used as classifiers to predict a decision for a new object, as a means of knowledge representation, and as algorithms for solving problems of fault diagnosis, computational geometry, combinatorial optimization, etc.

Decision trees and rules are among the most interpretable models for classifying and representing knowledge [15]. In order to better understand decision trees, we should not only minimize the number of their nodes, but also the depth of decision trees to avoid the consideration of long conjunctions of conditions corresponding to long paths in these trees. Similarly, for decision rule systems, we should minimize both the total length of rules and the maximum length of a rule in the system. When we consider decision trees and decision rule systems as algorithms (usually sequential for decision trees and parallel for decision rule systems), we should have in mind the same bi-criteria optimization problems to minimize space and time complexity of these algorithms.

In this paper, we represent systems of decision rules as nondeterministic decision trees to compress them and to emphasize the possibility of processing different decision rules in parallel. We consider deterministic and nondeterministic decision trees as algorithms and study their space and time complexity, paying particular attention to time and space complexity relationships. Examples of deterministic and nondeterministic decision trees computing Boolean function $x_{1}\wedge x_{2}$ can be found in Fig. 1.

Refer to caption — Figure 1: Deterministic and nondeterministic decision trees computing function $x_{1}\wedge x_{2}$

Infinite systems of attributes and decision trees over these systems have been intensively studied, especially systems of linear and algebraic attributes and the corresponding linear [8, 9, 16] and algebraic decision trees [3, 12, 13, 14, 36, 37, 38]. Years ago, one of the authors initiated the study of decision trees over arbitrary infinite systems of attributes [17, 18, 19, 20, 21]. In this paper, we study decision trees over arbitrary infinite systems of binary attributes represented in the form of infinite binary information systems.

General information system introduced by Pawlak [28] consists of a universe (a set of objects) and a set of attributes (functions with finite image) defined on the universe. An information system is called infinite, if both its universe and the set of attributes are infinite. An information system is called binary if each of its attributes has values from the set $\{0,1\}$ .

Any problem over an information system is described by a finite number of attributes that divide the universe into domains in which these attributes have fixed values. A decision is attached to each domain. For a given object from the universe, it is required to find the decision attached to the domain containing this object.

As algorithms solving these problems, deterministic and nondeterministic decision trees are studied. As time complexity of a decision tree, we consider its depth, i.e., the maximum number of nodes labeled with attributes in a path from the root to a terminal node. As space complexity of a decision tree, we consider the number of its nodes.

There are two approaches to the study of infinite information systems: local when in decision trees solving a problem we can use only attributes from the problem description, and global when in the decision trees solving a problem we can use arbitrary attributes from the considered information system. In this paper, we study decision trees in the framework of the local approach.

To the best of our knowledge, time-space trade-offs for decision trees over infinite information systems were not studied in the framework of the local approach prior to the present paper except for its conference version [10], which does not contain proofs. The paper [24] was the first one in which the time-space trade-offs for decision trees over infinite information systems were studied in the framework of the global approach.

Results obtained in [24] are different from the results obtained in the present paper. Apart from the difference in approach, in [24], the set of all infinite information systems is divided not into three but into five families and the criteria for the behavior of functions characterizing the minimum depth of the deterministic and nondeterministic decision tree are completely different in comparison with the present paper. However, many of the definitions and results of the two papers appear similar. In the present paper, we use some auxiliary statements proved in [24] and adapt some proofs from [24] to the case of the local approach.

Based on the results obtained in the present paper and in [22, 26], we describe possible types of behavior of four functions $h_{U}^{ld},h_{U}^{la},L_{U}^{ld},L_{U}^{la}$ that characterize worst case time and space complexity of deterministic and nondeterministic decision trees over an infinite binary information system $U$ (index $l$ refers to the local approach). Decision trees solving a problem can use only attributes from the problem description.

The function $h_{U}^{ld}$ characterizes the growth in the worst case of the minimum depth of a deterministic decision tree solving a problem with the growth of the number of attributes in the problem description. The function $h_{U}^{ld}$ is either grows as a logarithm or linearly.

The function $h_{U}^{la}$ characterizes the growth in the worst case of the minimum depth of a nondeterministic decision tree solving a problem with the growth of the number of attributes in the problem description. The function $h_{U}^{la}$ is either bounded from above by a constant or grows linearly.

The function $L_{U}^{ld}$ characterizes the growth in the worst case of the minimum number of nodes in a deterministic decision tree solving a problem with the growth of the number of attributes in the problem description. The function $L_{U}^{ld}$ has either polynomial or exponential growth.

The function $L_{U}^{la}$ characterizes the growth in the worst case of the minimum number of nodes in a nondeterministic decision tree solving a problem with the growth of the number of attributes in the problem description. The function $L_{U}^{la}$ has either polynomial or exponential growth.

Each of the functions $h_{U}^{ld},h_{U}^{la},L_{U}^{ld},L_{U}^{la}$ has two types of behavior. The tuple $(h_{U}^{ld},h_{U}^{la},L_{U}^{ld},L_{U}^{la})$ has three types of behavior. All these types are described in the paper and each type is illustrated by an example.

There are three complexity classes of infinite binary information systems corresponding to the three possible types of the tuple $(h_{U}^{ld},h_{U}^{la},L_{U}^{ld},L_{U}^{la})$ . For each class, we study joint behavior of time and space complexity of decision trees. The obtained results are related to time-space trade-off for deterministic and nondeterministic decision trees.

A pair of functions $(\varphi,\psi)$ is called a boundary $ld$ -pair of the information system $U$ if, for any problem over $U$ , there exists a deterministic decision tree over $z$ , which solves this problem and for which the depth is at most $\varphi(n)$ and the number of nodes is at most $\psi(n)$ , where $n$ is the number of attributes in the problem description. An information system $U$ is called $ld$ -reachable if the pair $(h_{U}^{ld},L_{U}^{ld})$ is a boundary $ld$ -pair of the system $U$ . For nondeterministic decision trees, the notions of a boundary $la$ -pair of an information system and $la$ -reachable information system are defined in a similar way. For deterministic decision trees, the best situation is when the considered information system is $ld$ -reachable: for any boundary $ld$ -pair $(\varphi,\psi)$ for an information system $U$ and any natural $n$ , $\varphi(n)\geq h_{U}^{ld}(n)$ and $\psi(n)\geq L_{U}^{ld}(n)$ . For nondeterministic decision trees, the best situation is when the information system is $la$ -reachable.

For all complexity classes, all information systems from the class are $ld$ -reachable. For two out of the three complexity classes, all information systems from the class are $la$ -reachable. For the remaining class, all information systems from the class are not $la$ -reachable. For all information systems $U$ that are not $la$ -reachable, we find nontrivial boundary $la$ -pairs, which are sufficiently close to $(h_{U}^{la},L_{U}^{la})$ .

The rest of the paper is organized as follows: Section 2 contains main results, Sections 3-5 – proofs of these results, and Section 6 – short conclusions.

2 Main Results

Let $A$ be an infinite set and $F$ be an infinite set of functions that are defined on $A$ and have values from the set $\{0,1\}$ . The pair $U=(A,F)$ is called an infinite binary information system [28], the elements of the set $A$ are called objects, and the functions from $F$ are called attributes. The set $A$ is called sometimes the universe of the information system $U$ .

A problem over $U$ is a tuple of the form $z=(\nu,f_{1},\ldots,f_{n})$ , where $\nu:\{0,1\}^{n}\rightarrow\mathbb{N}$ , $\mathbb{N}$ is the set of natural numbers $\{1,2,\ldots\}$ , and $f_{1},\ldots,f_{n}\in F$ . We do not require attributes $f_{1},\ldots,f_{n}$ to be pairwise distinct. The problem $z$ consists in finding the value of the function $z(x)=\nu(f_{1}(x),\ldots,f_{n}(x))$ for a given object $a\in A$ . The value $\dim z=n$ is called the dimension of the problem $z$ .

Various problems of combinatorial optimization, pattern recognition, fault diagnosis, probabilistic reasoning, computational geometry, etc., can be represented in this form.

As algorithms for problem solving we consider decision trees. A decision tree over the information system $U$ is a directed tree with a root in which the root and edges leaving the root are not labeled, each terminal node is labeled with a number from $\mathbb{N}$ , each working node (which is neither the root nor a terminal node) is labeled with an attribute from $F$ , and each edge leaving a working node is labeled with a number from the set $\{0,1\}$ . A decision tree is called deterministic if only one edge leaves the root and edges leaving an arbitrary working node are labeled with different numbers.

Let $\Gamma$ be a decision tree over $U$ and

\xi=v_{0},d_{0},v_{1},d_{1},\ldots,v_{m},d_{m},v_{m+1}

be a directed path from the root $v_{0}$ to a terminal node $v_{m+1}$ of $\Gamma$ (we call such path complete). Define a subset $A(\xi)$ of the set $A$ as follows. If $m=0$ , then $A(\xi)=A$ . Let $m>0$ and, for $i=1,\ldots,m$ , the node $v_{i}$ be labeled with the attribute $f_{j_{i}}$ and the edge $d_{i}$ be labeled with the number $\delta_{i}$ . Then

A(\xi)=\{a:a\in A,f_{j_{1}}(a)=\delta_{1},\ldots,f_{j_{m}}(a)=\delta_{m}\}.

The depth of the decision tree $\Gamma$ is the maximum number of working nodes in a complete path of $\Gamma$ . Denote by $h(\Gamma)$ the depth of $\Gamma$ and by $L(\Gamma)$ – the number of nodes in $\Gamma$ .

A decision tree over the information system $U$ is called a decision tree over the problem $z=(\nu,f_{1},\ldots,f_{n})$ if each working node of $\Gamma$ is labeled with an attribute from the set $\{f_{1},\ldots,f_{n}\}$ .

The decision tree $\Gamma$ over $z$ solves the problem $z$ nondeterministically if, for any object $a\in A$ , there exists a complete path $\xi$ of $\Gamma$ such that $a\in A(\xi)$ and, for each $a\in A$ and each complete path $\xi$ such that $a\in A(\xi)$ , the terminal node of $\xi$ is labeled with the number $z(a)$ (in this case, we can say that $\Gamma$ is a nondeterministic decision tree solving the problem $z$ ). In particular, if the decision tree $\Gamma$ solves the problem $z$ nondeterministically, then, for each complete path $\xi$ of $\Gamma$ , either the set $A(\xi)$ is empty or the function $z(x)$ is constant on the set $A(\xi)$ . The decision tree $\Gamma$ over $z$ solves the problem $z$ deterministically if $\Gamma$ is a deterministic decision tree, which solves the problem $z$ nondeterministically (in this case, we can say that $\Gamma$ is a deterministic decision tree solving the problem $z$ ).

Let $P(U)$ be the set of all problems over $U$ . For a problem $z$ from $P(U)$ , let $h_{U}^{ld}(z)$ be the minimum depth of a decision tree over $z$ solving the problem $z$ deterministically, $h_{U}^{la}(z)$ be the minimum depth of a decision tree over $z$ solving the problem $z$ nondeterministically, $L_{U}^{ld}(z)$ be the minimum number of nodes in a decision tree over $z$ solving the problem $z$ deterministically, and $L_{U}^{la}(z)$ be the minimum number of nodes in a decision tree over $z$ solving the problem $z$ nondeterministically.

We consider four functions defined on the set $\mathbb{N}$ in the following way: $h_{U}^{ld}(n)=\max$ $h_{U}^{ld}(z)$ , $h_{U}^{la}(n)=\max$ $h_{U}^{la}(z)$ , $L_{U}^{ld}(n)=\max$ $L_{U}^{ld}(z)$ , and $L_{U}^{la}(n)=\max$ $L_{U}^{la}(z)$ , where the maximum is taken among all problems $z$ over $U$ with $\dim z\leq n$ . These functions describe how the minimum depth and the minimum number of nodes of deterministic and nondeterministic decision trees solving problems are growing in the worst case with the growth of problem dimension. To describe possible types of behavior of these four functions, we need to define some properties of infinite binary information systems.

Definition 1.

We will say that the information system $U=(A,F)$ satisfies the condition of reduction if there exists $m\in\mathbb{N}$ such that, for each compatible on $A$ system of equations $\{f_{1}(x)=\delta_{1},\ldots,f_{r}(x)=\delta_{r}\},$ where $r\in\mathbb{N}$ , $f_{1},\ldots,f_{r}\in F$ and $\delta_{1},\ldots,\delta_{r}\in\{0,1\}$ , there exists a subsystem of this system, which has the same set of solutions from $A$ and contains at most $m$ equations. In this case, we will say that $U$ satisfies the condition of reduction with parameter $m$ .

We now consider two examples of infinite binary information systems that satisfy the condition of reduction. These examples are close to ones considered in Section 3.4 of the book [1].

Example 1.

Let $d,t\in\mathbb{N}$ , $f_{1},\ldots,f_{t}$ be functions from $\mathbb{R}^{d}$ to $\mathbb{R}$ , where $\mathbb{R}$ is the set of real numbers, and $s$ be a function from $\mathbb{R}$ to $\{0,1\}$ such that $s(x)=0$ if $x<0$ and $s(x)=1$ if $x\geq 0$ . Then the infinite binary information system $(\mathbb{R}^{d},F)$ , where $F=\{s(f_{i}+c):i=1,\ldots,t,c\in\mathbb{R}\}$ , satisfies the condition of reduction with parameter $2t$ . If $f_{1},\ldots,f_{t}$ are linear functions, then we deal with attributes corresponding to $t$ families of parallel hyperplanes in $\mathbb{R}^{d}$ what is common for decision trees for datasets with $t$ numerical attributes only [6].

Example 2.

Let $P$ be the Euclidean plane and $l$ be a straight line (line in short) in the plane. This line divides the plane into two open half-planes $H_{1}$ and $H_{2}$ , and the line $l$ . Two attributes correspond to the line $l$ . The first attribute takes value $0$ on points from $H_{1}$ , and value $1$ on points from $H_{2}$ and $l$ . The second one takes value $0$ on points from $H_{2}$ , and value $1$ on points from $H_{1}$ and $l$ . We denote by $\mathcal{L}$ the set of all attributes corresponding to lines in the plane. Infinite binary information systems of the form $(P,L)$ , where $L\subseteq\mathcal{L}$ , are called linear information systems.

Let $l$ be a line in the plane. Let us denote by $\mathcal{L}(l)$ the set of all attributes corresponding to lines, which are parallel to $l$ . Let $p$ be a point in the plane. We denote by $\mathcal{L}(p)$ the set of all attributes corresponding to lines, which pass through $p$ . A set $C$ of attributes from $\mathcal{L}$ is called a clone if $C\subseteq\mathcal{L}(l)$ for some line $l$ or $C\subseteq\mathcal{L}(p)$ for some point $p$ . In [23], it was proved that a linear information system $(P,L)$ satisfies the condition of reduction if and only if $L$ is the union of a finite number of clones.

Definition 2.

Let $U=(A,F)$ be an infinite binary information system. A subset $\{f_{1},\ldots,f_{m}\}$ of the set $F$ will be called independent if, for any $\delta_{1},\ldots,\delta_{m}\in\{0,1\}$ , the system of equations $\{f_{1}(x)=\delta_{1},\ldots,f_{m}(x)=\delta_{m}\},$ has a solution from the set $A$ . The empty set of attributes is independent by definition.

Definition 3.

We define the parameter $I(U)$ , which is called the independence dimension or I-dimension of the information system $U$ (this notion is similar to the notion of independence number of family of sets [27]) as follows. If, for each $m\in\mathbb{N}$ , the set $F$ contains an independent subset of cardinality $m$ , then $I(U)=\infty$ . Otherwise, $I(U)$ is the maximum cardinality of an independent subset of the set $F$ .

We now consider examples of infinite binary information systems with finite I-dimension and with infinite I-dimension. More examples can be found in Lemmas 7-9.

Example 3.

Let $m,t\in\mathbb{N}$ . We denote by $Pol(m)$ the set of all polynomials, which have integer coefficients and depend on variables $x_{1},\ldots,x_{m}$ . We denote by $Pol(m,t)$ the set of all polynomials from $Pol(m)$ such that the degree of each polynomial is at most $t$ . We define infinite binary information systems $U(m)$ and $U(m,t)$ as follows: $U(m)=(\mathbb{R}^{m},F(m))$ and $U(m,t)=(\mathbb{R}^{m},F(m,t))$ , where $F(m)=\{s(p):p\in Pol(m)\}$ , $F(m,t)=\{s(p):p\in Pol(m,t)\}$ , and $s(x)=0$ if $x<0$ and $s(x)=1$ if $x\geq 0$ . One can show that the system $U(m)$ has infinite I-dimension and the system $U(m,t)$ has finite I-dimension.

We now consider four statements that describe possible types of behavior of functions $h_{U}^{ld}(n)$ , $h_{U}^{la}(n)$ , $L_{U}^{ld}(n)$ , and $L_{U}^{la}(n)$ . The next statement follows immediately from Theorem 4.3 [22].

Proposition 1.

For any infinite binary information system $U$ , the function $h_{U}^{ld}(n)$ has one of the following two types of behavior:

(LOG) If the system $U$ satisfies the condition of reduction, then $h_{U}^{ld}(n)=\Theta(\log n)$ .

(LIN) If the system $U$ does not satisfy the condition of reduction, then $h_{U}^{ld}(n)=n$ for any $n\in\mathbb{N}$ .

The next statement follows immediately from Theorem 8.2 [26].

Proposition 2.

For any infinite binary information system $U=(A,F)\,$ , the function $h_{U}^{la}(n)$ has one of the following two types of behavior:

(CON) If the system $U$ satisfies the condition of reduction, then $h_{U}^{la}(n)=O(1)$ .

(LIN) If the system $U$ does not satisfy the condition of reduction, then $h_{U}^{la}(n)=n$ for any $n\in\mathbb{N}$ .

Proposition 3.

For any infinite binary information system $U$ , the function $L_{U}^{ld}(n)$ has one of the following two types of behavior:

(POL) If the system $U$ has finite I-dimension, then for any $n\in\mathbb{N}$ ,

2(n+1)\leq L_{U}^{ld}(n)\leq 2(4n)^{I(U)}.

(EXP) If the system $U$ has infinite I-dimension, then for any $n\in\mathbb{N}$ ,

L_{U}^{ld}(n)=2^{n+1}.

Proposition 4.

For any infinite binary information system $U$ and any $n\in\mathbb{N}$ ,

L_{U}^{la}(n)=L_{U}^{ld}(n).

Let $U$ be an infinite binary information system. Proposition 1 allows us to correspond to the function $h_{U}^{ld}(n)$ its type of behavior from the set $\{\mathrm{LOG},\mathrm{LIN}\}$ . Proposition 2 allows us to correspond to the function $h_{U}^{la}(n)$ its type of behavior from the set $\{\mathrm{CON},\mathrm{LIN}\}$ . Propositions 3 and 4 allow us to correspond to each of the functions $L_{U}^{ld}(n)$ and $L_{U}^{la}(n)$ its type of behavior from the set $\{\mathrm{POL},\mathrm{EXP}\}$ . A tuple obtained from the tuple

(h_{U}^{ld}(n),h_{U}^{la}(n),L_{U}^{ld}(n),L_{U}^{la}(n))

by replacing functions with their types of behavior is called the local type of the information system $U$ . We now describe all possible local types of infinite binary information systems.

Theorem 1.

For any infinite binary information system, its local type coincides with one of the rows of Table 1. Each row of Table 1 is the local type of some infinite binary information system.

Table 1: Possible local types of infinite binary information systems

	$h_{U}^{ld}(n)$	$h_{U}^{la}(n)$	$L_{U}^{ld}(n)$	$L_{U}^{la}(n)$
1	$\mathrm{LOG}$	$\mathrm{CON}$	$\mathrm{POL}$	$\mathrm{POL}$
2	$\mathrm{LIN}$	$\mathrm{LIN}$	$\mathrm{POL}$	$\mathrm{POL}$
3	$\mathrm{LIN}$	$\mathrm{LIN}$	$\mathrm{EXP}$	$\mathrm{EXP}$

For $i=1,2,3$ , we denote by $W_{i}^{l}$ the class of all infinite binary information systems, whose local type coincides with the $i$ th row of Table 1. We now study for each of these complexity classes joint behavior of the depth and number of nodes in decision trees solving problems.

For a given infinite binary information system $U$ , we will consider pairs of functions $(\varphi,\psi)$ such that, for any problem $z$ over $U$ , there exists a deterministic decision tree over $z$ solving $z$ with the depth at most $\varphi(\dim z)$ and the number of nodes at most $\psi(\dim z)$ . We will study such pairs and will call them boundary $ld$ -pairs.

Definition 4.

A pair of functions $(\varphi,\psi)$ , where $\varphi:\mathbb{N}\rightarrow\mathbb{N}\cup\{0\}$ and $\psi:\mathbb{N}\rightarrow\mathbb{N}\cup\{0\}$ , will be called a boundary $ld$ -pair of the information system $U$ if, for any problem $z$ over $U$ , there exists a decision tree $\Gamma$ over $z$ , which solves the problem $z$ deterministically and for which $h(\Gamma)\leq\varphi(n)$ and $L(\Gamma)\leq\psi(n)$ , where $n=\dim z$ .

We are interested in finding boundary $ld$ -pairs with functions that grow as slowly as possible. It is clear that, for any boundary $ld$ -pair $(\varphi,\psi)$ of the information system $U$ , the following inequalities hold: $\varphi(n)\geq h_{U}^{ld}(n)$ and $\psi(n)\geq L_{U}^{ld}(n)$ . So the best possible situation is when $(h_{U}^{ld},L_{U}^{ld})$ is a boundary $ld$ -pair of $U$ .

Definition 5.

An information system $U$ will be called $ld$ -reachable if the pair $(h_{U}^{ld},L_{U}^{ld})$ is a boundary $ld$ -pair of the system $U$ .

We now consider similar notions for nondeterministic decision trees: the notion of boundary $la$ -pair and the notion of $la$ -reachable information system.

Definition 6.

A pair of functions $(\varphi,\psi)$ , where $\varphi:\mathbb{N}\rightarrow\mathbb{N}\cup\{0\}$ and $\psi:\mathbb{N}\rightarrow\mathbb{N}\cup\{0\}$ , will be called a boundary $la$ -pair of the information system $U$ if, for any problem $z$ over $U$ , there exists a decision tree $\Gamma$ over $z$ , which solves the problem $z$ nondeterministically and for which $h(\Gamma)\leq\varphi(n)$ and $L(\Gamma)\leq\psi(n)$ , where $n=\dim z$ .

Definition 7.

An information system $U$ will be called $la$ -reachable if the pair $(h_{U}^{la},L_{U}^{la})$ is a boundary $la$ -pair of the system $U$ .

Note that for deterministic decision trees, the best situation is when the considered information system is $ld$ -reachable and for nondeterministic decision trees – when the information system is $la$ -reachable.

Each information system from the classes $W_{1}^{l},W_{2}^{l}$ , and $W_{3}^{l}$ is $ld$ -reachable. Each information system from the classes $W_{2}^{l}$ and $W_{3}^{l}$ is $la$ -reachable. Each information system from the class $W_{1}^{l}$ is not $la$ -reachable. For all information systems $U$ , which are not $la$ -reachable, we find nontrivial boundary $la$ -pairs that are sufficiently close to $(h_{U}^{la},L_{U}^{la})$ .

The obtained results are related to time-space trade-off for deterministic and nondeterministic decision trees. Details can be found in the following three theorems.

Theorem 2.

Let $U$ be an information system from the class $W_{1}^{l}$ . Then

(a) The system $U$ is $ld$ -reachable.

(b) The system $U$ is not $la$ -reachable and there exists $m\in\mathbb{N}$ such that

(m,(m+1)L_{U}^{la}(n)/2+1)

is a boundary $la$ -pair of the system $U$ .

Theorem 3.

Let $U$ be an information system from the class $W_{2}^{l}$ . Then

(a) The system $U$ is $ld$ -reachable.

(b) The system $U$ is $la$ -reachable.

Theorem 4.

Let $U$ be an information system from the class $W_{3}^{l}$ . Then

(a) The system $U$ is $ld$ -reachable.

(b) The system $U$ is $la$ -reachable.

Table 2 summarizes Theorems 1-4. The first column contains the name of the complexity class. The next four columns describe the local type of information systems from this class. The last two columns “ $ld$ -pairs” and “ $la$ -pairs” contain information about boundary $ld$ -pairs and boundary $la$ -pairs for information systems from the considered class: “ $ld$ -reachable” means that all information systems from the class are $ld$ -reachable, “ $la$ -reachable” means that all information systems from the class are $la$ -reachable, Th. 2 (b) is a link to the corresponding statement Theorem 2 (b).

Table 2: Summary of Theorems 1-4

	$h_{U}^{ld}(n)$	$h_{U}^{la}(n)$	$L_{U}^{ld}(n)$	$L_{U}^{la}(n)$	$ld$ -pairs	$la$ -pairs
$W_{1}^{l}$	$\mathrm{LOG}$	$\mathrm{CON}$	$\mathrm{POL}$	$\mathrm{POL}$	$ld$ -reachable	Th. 2 (b)
$W_{2}^{l}$	$\mathrm{LIN}$	$\mathrm{LIN}$	$\mathrm{POL}$	$\mathrm{POL}$	$ld$ -reachable	$la$ -reachable
$W_{3}^{l}$	$\mathrm{LIN}$	$\mathrm{LIN}$	$\mathrm{EXP}$	$\mathrm{EXP}$	$ld$ -reachable	$la$ -reachable

3 Proofs of Propositions 3 and 4

In this section, we consider a number of auxiliary statements and prove the two mentioned propositions.

Let $\Gamma$ be a decision tree over an infinite binary information system $U=(A,F)$ and $d$ be an edge of $\Gamma$ entering a node $w$ . We denote by $\Gamma(d)$ a subtree of $\Gamma$ , whose root is the node $w$ . We say that a complete path $\xi$ of $\Gamma$ is realizable if $A(\xi)\neq\emptyset$ .

Lemma 1.

Let $U=(A,F)$ be an infinite binary information system, $z=(\nu,f_{1},\ldots,f_{n})$ be a problem over $U$ , and $\Gamma$ be a decision tree over $z$ , which solves the problem $z$ deterministically and for which $L(\Gamma)=L_{U}^{ld}(z)$ . Then

(a) For each node of $\Gamma$ , there exists a realizable complete path that passes through this node.

(b) Each working node of $\Gamma$ has two edges leaving this node.

Proof.

(a) It is clear that there exists at least one realizable complete path that passes through the root of $\Gamma$ . Let us assume that $w$ is a node of $\Gamma$ different from the root and such that there is no a realizable complete path, which passes through $w$ . Let $d$ be an edge entering the node $w$ . We remove from $\Gamma$ the edge $d$ and the subtree $\Gamma(d)$ . As a result, we obtain a decision tree $\Gamma^{\prime}$ over $z$ , which solves $z$ deterministically and for which $L(\Gamma^{\prime})<L(\Gamma)$ but this is impossible by definition of $\Gamma$ .

(b) Let us assume that in $\Gamma$ there exists a working node $w$ , which has only one leaving edge $d$ entering a node $w_{1}$ . We remove from $\Gamma$ the node $w$ and the edge $d$ and connect the edge entering the node $w$ to the node $w_{1}$ . As a result, we obtain a decision tree $\Gamma^{\prime}$ over $z$ , which solves the problem $z$ deterministically and for which $L(\Gamma^{\prime})<L(\Gamma)$ but this is impossible by definition of $\Gamma$ . ∎∎

Let $U$ be an infinite binary information system, $\Gamma$ be a decision tree over $U$ , and $d$ be an edge of $\Gamma$ . The subtree $\Gamma(d)$ will be called full if there exist edges $d_{1},\ldots,d_{m}$ in $\Gamma(d)$ such that the removal of these edges and subtrees $\Gamma(d_{1}),\ldots,\Gamma(d_{m})$ transforms the subtree $\Gamma(d)$ into a tree $G$ such that each terminal node of $G$ is a terminal node of $\Gamma$ , and exactly two edges labeled with the numbers $0$ and $1$ respectively leave each working node of $G$ .

Lemma 2.

Let $U=(A,F)$ be an infinite binary information system, $z=(\nu,f_{1},\ldots,f_{n})$ be a problem over $U$ , and $\Gamma$ be a decision tree over $z$ , which solves the problem $z$ nondeterministically and for which $L(\Gamma)=L_{U}^{la}(z)$ . Then

(a) For each node of $\Gamma$ , there exists a realizable complete path that passes through this node.

(b) If a working node $w$ of $\Gamma$ has $m$ leaving edges $d_{1},\ldots,d_{m}$ labeled with the same number and $m\geq 2$ , then the subtrees $\Gamma(d_{1}),\ldots,\Gamma(d_{m})$ are not full.

(c) If the root $r$ of $\Gamma$ has $m$ leaving edges $d_{1},\ldots,d_{m}$ and $m\geq 2$ , then the subtrees $\Gamma(d_{1}),\ldots,\Gamma(d_{m})$ are not full.

Proof.

(a) The proof of item (a) is almost identical to the proof of item (a) of Lemma 1.

(b) Let $w$ be a working node of $\Gamma$ , which has $m$ leaving edges $d_{1},\ldots,d_{m}$ labeled with the same number, $m\geq 2$ , and at least one of the subtrees $\Gamma(d_{1}),\ldots,$ $\Gamma(d_{m})$ is full. For the definiteness, we assume that $\Gamma(d_{1})$ is full. Remove from $\Gamma$ the edges $d_{2},\ldots,d_{m}$ and subtrees $\Gamma(d_{2}),\ldots,\Gamma(d_{m})$ . We now show that the obtained tree $\Gamma^{\prime}$ solves the problem $z$ nondeterministically. Assume the contrary. Then there exists an object $a\in A$ such that, for each complete path $\xi$ with $a\in A(\xi)$ , the path $\xi$ passes through one of the edges $d_{2},\ldots,d_{m}$ but it is not true. Let $\xi$ be a complete path such that $a\in A(\xi)$ . Then, according to the assumption, this path passes through the node $w$ . Let $\xi^{\prime}$ be the part of this path from the root of $\Gamma$ to the node $w$ . Since the edges $d_{1},\ldots,d_{m}$ are labeled with the same number and $\Gamma(d_{1})$ is a full subtree, we can find in $\Gamma(d_{1})$ the continuation of $\xi^{\prime}$ to a terminal node of $\Gamma(d_{1})$ such that the obtained complete path $\xi^{\prime\prime}$ of $\Gamma$ satisfies the condition $a\in A(\xi^{\prime\prime})$ . Hence $\Gamma^{\prime}$ is a decision tree over $z$ , which solves the problem $z$ nondeterministically and for which $L(\Gamma^{\prime})<L(\Gamma)$ , but this is impossible by definition of $\Gamma$ .

We now consider two statements about classes of decision trees proved in [24]. Let $\Gamma$ be a decision tree. We denote by $L_{t}(\Gamma)$ the number of terminal nodes in $\Gamma$ and by $L_{w}(\Gamma)$ the number of working nodes in $\Gamma$ . It is clear that $L(\Gamma)=1+L_{t}(\Gamma)+L_{w}(\Gamma)$ .

Let $U$ be an infinite binary information systems. We denote by $G_{d}(U)$ the set of all deterministic decision trees over $U$ and by $G_{d}^{2}(U)$ the set of all decision trees from $G_{d}(U)$ such that each working node of the tree has two leaving edges.

Lemma 3 (Lemma 14 from [24]).

Let $U$ be an infinite binary information system. Then

(a) If $\Gamma\in G_{d}^{2}(U)$ , then $L_{w}(\Gamma)=L_{t}(\Gamma)-1$ .

(b) If $\Gamma\in G_{d}(U)\setminus G_{d}^{2}(U)$ , then $L_{w}(\Gamma)>L_{t}(\Gamma)-1$ .

We denote by $G_{a}^{f}(U)$ the set of all decision trees $\Gamma$ over $U$ that satisfy the following conditions: (i) if a working node of $\Gamma$ has $m$ leaving edges $d_{1},\ldots,d_{m}$ labeled with the same number and $m\geq 2$ , then the subtrees $\Gamma(d_{1}),\ldots,\Gamma(d_{m})$ are not full, and (ii) if the root of $\Gamma$ has $m$ leaving edges $d_{1},\ldots,d_{m}$ and $m\geq 2$ , then the subtrees $\Gamma(d_{1}),\ldots,\Gamma(d_{m})$ are not full. One can show that $G_{d}^{2}(U)\subseteq G_{d}(U)\subseteq G_{a}^{f}(U)$ .

Lemma 4 (Lemma 15 from [24]).

Let $U$ be an infinite binary information system. If $\Gamma\in$ $G_{a}^{f}(U)\setminus G_{d}^{2}(U)$ , then $L_{w}(\Gamma)>L_{t}(\Gamma)-1$ .

Let $U=(A,F)$ be an infinite binary information system. For $f_{1},\ldots,f_{n}\in F$ , we denote by $N_{U}(f_{1},\ldots,f_{n})$ the number of $n$ -tuples $(\delta_{1},\ldots,\delta_{n})\in\{0,1\}^{n}$ for which the system of equations

\{f_{1}(x)=\delta_{1},\ldots,f_{n}(x)=\delta_{n}\}

has a solution from $A$ . For $n\in\mathbb{N}$ , denote

N_{U}(n)=\max\{N_{U}(f_{1},\ldots,f_{n}):f_{1},\ldots,f_{n}\in F\}.

It is clear that, for any $m,n\in\mathbb{N}$ , if $m\leq n$ then $N_{U}(m)\leq N_{U}(n)$ .

Proposition 5.

Let $U=(A,F)$ be an infinite binary information system. Then, for any $n\in\mathbb{N}$ ,

L_{U}^{la}(n)=L_{U}^{ld}(n)=2N_{U}(n).

Proof.

Let $z=(\nu,f_{1},\ldots,f_{m})$ be a problem over $U$ and $m\leq n$ . Let $\Gamma$ be a decision tree over $z$ , which solves the problem $z$ deterministically and for which $L(\Gamma)=L_{U}^{ld}(z)$ . From Lemma 1 it follows that each working node of $\Gamma$ has two edges leaving this node and, for each node of $\Gamma$ , there exists a realizable complete path that passes through this node. Let $\xi_{1}$ and $\xi_{2}$ be different complete paths in $\Gamma$ , $a_{1}\in A(\xi_{1})$ , and $a_{2}\in A(\xi_{2})$ . It is easy to show that $(f_{1}(a_{1}),\ldots,f_{m}(a_{1}))\neq(f_{1}(a_{2}),\ldots,f_{m}(a_{2}))$ . Therefore $L_{t}(\Gamma)\leq N_{U}(f_{1},\ldots,f_{m})\leq N_{U}(n)$ . It is clear that $\Gamma\in G_{d}^{2}(U)$ . By Lemma 3, $L_{w}(\Gamma)=L_{t}(\Gamma)-1$ . Hence $L(\Gamma)\leq 2N_{U}(n)$ . Taking into account that $z$ is an arbitrary problem over $U$ with $\dim z\leq n$ we obtain

L_{U}^{ld}(n)\leq 2N_{U}(n).

Since any decision tree solving the problem $z$ deterministically solves it nondeterministically we obtain

L_{U}^{la}(n)\leq L_{U}^{ld}(n).

We now show that $2N_{U}(n)\leq L_{U}^{la}(n)$ . Let us consider a problem $z=(\nu,f_{1},\ldots,f_{n})$ over $U$ such that

N_{U}(f_{1},\ldots,f_{n})=N_{U}(n)

and, for any $\bar{\delta}_{1},\bar{\delta}_{2}\in\{0,1\}^{n}$ , if $\bar{\delta}_{1}\neq\bar{\delta}_{2}$ , then $\nu(\bar{\delta}_{1})\neq\nu(\bar{\delta}_{2})$ . Let $\Gamma$ be a decision tree over $z$ , which solves the problem $z$ nondeterministically and for which $L(\Gamma)=L_{U}^{la}(z)$ . By Lemma 2, $\Gamma\in G_{a}^{f}(U)$ . Using Lemmas 3 and 4 we obtain $L_{w}(\Gamma)\geq L_{t}(\Gamma)-1$ . It is clear that $L_{t}(\Gamma)\geq N_{U}(f_{1},\ldots,f_{n})=N_{U}(n)$ . Therefore $L(\Gamma)\geq 2N_{U}(n)$ , $L_{U}^{la}(z)\geq 2N_{U}(n)$ , and $L_{U}^{la}(n)\geq 2N_{U}(n)$ . ∎∎

The next statement follows directly from Lemmas 5.1 and 5.2 [22] and the evident inequality $N_{U}(n)\leq 2^{n}$ , which is true for any infinite binary information system $U$ . The proof of Lemma 5.1 from [22] is based on Theorems 4.6 and 4.7 from the same monograph that are similar to results obtained in [33, 34].

Proposition 6.

For any infinite binary information system $U$ , the function $N_{U}(n)$ has one of the following two types of behavior:

(POL) If the system $U$ has finite I-dimension, then for any $n\in\mathbb{N}$ ,

n+1\leq N_{U}(n)\leq(4n)^{I(U)}.

(EXP) If the system $U$ has infinite I-dimension, then for any $n\in\mathbb{N}$ ,

N_{U}(n)=2^{n}.

We now prove Propositions 3 and 4.

Proof of Proposition 3.

The statement of the proposition follows immediately from Propositions 5 and 6. ∎∎

Proof of Proposition 4.

The statement of the proposition follows immediately from Proposition 5. ∎∎

4 Proof of Theorem 1

First, we prove five auxiliary statements.

Lemma 5.

Let $U=(A,F)$ be an infinite binary information system, which has infinite I-dimension. Then $U$ does not satisfy the condition of reduction.

Proof.

Let us assume the contrary: $U$ satisfies the condition of reduction. Then $U$ satisfies the condition of reduction with parameter $m$ for some $m\in\mathbb{N}$ . Since $I(U)=\infty$ , there exists an independent subset $\{f_{1},\ldots,f_{m+1}\}$ of the set $F$ . It is clear that the system of equations

S=\{f_{1}(x)=0,\ldots,f_{m+1}(x)=0\}

is compatible on $A$ and each proper subsystem of the system $S$ has the set of solutions different from the set of solutions of $S$ . Therefore $U$ does not satisfy the condition of reduction with parameter $m$ . ∎∎

Lemma 6.

For any infinite binary information system, its local type coincides with one of the rows of Table 1.

Proof.

To prove this statement we fill Table 3. In the first column “I-dim.” we have either “Fin” or “Inf”: “Fin” if the considered information system has finite I-dimension and “Inf” if the considered information system has infinite I-dimension. In the second column “Reduct.”, we have either “Yes” or “No”: “Yes” if the considered information system satisfies the condition of reduction and “No” otherwise.

By Lemma 5, if an information system has infinite I-dimension, then this information system does not satisfy the condition of reduction. It means that there are only three possible tuples of values of the considered two parameters of information systems, which correspond to the three rows of Table 3. The values of the considered two parameters define the types of behavior of functions $h_{U}^{ld}(n)$ , $h_{U}^{la}(n)$ , $L_{U}^{ld}(n)$ , and $L_{U}^{la}(n)$ according to Propositions 1-4. We see that the set of possible tuples of values in the last four columns coincides with the set of rows of Table 1. ∎∎

Table 3: Parameters and local types of infinite binary information systems

I-dim.	Reduct.	$h_{U}^{ld}(n)$	$h_{U}^{la}(n)$	$L_{U}^{ld}(n)$	$L_{U}^{la}(n)$
Fin	Yes	$\mathrm{LOG}$	$\mathrm{CON}$	$\mathrm{POL}$	$\mathrm{POL}$
Fin	No	$\mathrm{LIN}$	$\mathrm{LIN}$	$\mathrm{POL}$	$\mathrm{POL}$
Inf	No	$\mathrm{LIN}$	$\mathrm{LIN}$	$\mathrm{EXP}$	$\mathrm{EXP}$

For each row of Table 1, we consider an example of infinite binary information system, whose local type coincides with this row.

For any $i\in\mathbb{N}$ , we define two functions $p_{i}:\mathbb{N}\rightarrow\{0,1\}$ and $l_{i}:\mathbb{N}\rightarrow\{0,1\}$ . Let $j\in\mathbb{N}$ . Then $p_{i}(j)=1$ if and only if $j=i$ , and $l_{i}(j)=1$ if and only if $j>i$ .

Define an information system $U_{1}=(A_{1},F_{1})$ as follows: $A_{1}=\mathbb{N}$ and $F_{1}=\{l_{i}:i\in\mathbb{N}\}$ .

Lemma 7.

The information system $U_{1}$ belongs to the class $W_{1}^{l}$ , $h_{U_{1}}^{ld}(n)=\lceil\log_{2}(n$ $+1)\rceil$ , $h_{U_{1}}^{la}(1)=1$ and $h_{U_{1}}^{la}(n)=2$ if $n>1$ , $L_{U_{1}}^{ld}(n)=2(n+1)$ , and $L_{U_{1}}^{la}(n)=2(n+1)$ for any $n\in\mathbb{N}$ . This information system satisfies the condition of reduction with parameter $2$ and has finite I-dimension equal $1$ .

Proof.

It is easy to show that $N_{U_{1}}(n)=n+1$ for any $n\in\mathbb{N}$ . Using Proposition 5 we obtain $L_{U_{1}}^{ld}(n)=L_{U_{1}}^{la}(n)=2(n+1)$ for any $n\in\mathbb{N}$ . Let $n\in\mathbb{N}$ . Consider a problem $z=(\nu,l_{1},\ldots,l_{n})$ over $U_{1}$ such that, for each $\bar{\delta}_{1},\bar{\delta}_{2}\in\{0,1\}^{n}$ with $\bar{\delta}_{1}\neq\bar{\delta}_{2}$ , $\nu(\bar{\delta}_{1})\neq\nu(\bar{\delta}_{2})$ . It is clear that $N_{U_{1}}(l_{1},\ldots,l_{n})=n+1$ . Therefore each decision tree $\Gamma$ over $z$ that solves the problem $z$ deterministically has at least $n+1$ terminal nodes. One can show that the number of terminal nodes in $\Gamma$ is at most $2^{h(\Gamma)}$ . Hence $n+1\leq 2^{h(\Gamma)}$ and $\log_{2}(n+1)\leq h(\Gamma)$ . Since $h(\Gamma)$ is an integer, $\lceil\log_{2}(n+1)\rceil\leq h(\Gamma)$ . Thus, $h_{U_{1}}^{ld}(n)\geq\lceil\log_{2}(n+1)\rceil$ . Set $m=\lceil\log_{2}(n+1)\rceil$ . Then $n\leq 2^{m}-1$ . One can show that $h_{U_{1}}^{ld}(2^{m}-1)\leq m$ (the construction of an appropriate decision tree is based on an analog of binary search, and we use only attributes from the problem description) and $h_{U_{1}}^{ld}(n)\leq h_{U_{1}}^{ld}(2^{m}-1)$ . Therefore $h_{U_{1}}^{ld}(n)\leq\lceil\log_{2}(n+1)\rceil$ and $h_{U_{1}}^{ld}(n)=\lceil\log_{2}(n+1)\rceil$ . It is clear that $h_{U_{1}}^{la}(1)=1$ . Let $n\geq 2$ , $z=(\nu,f_{1},\ldots,f_{n})$ be an arbitrary problem over $U_{1}$ and $l_{i_{1}},\ldots,l_{i_{m}}$ be all pairwise different attributes from the set $\{f_{1},\ldots,f_{n}\}$ ordered such that $i_{1}<\ldots<i_{m}$ . Then these attributes divide the set $\mathbb{N}$ into $m+1$ nonempty domains that are sets of solutions on $\mathbb{N}$ of the following systems of equations: $\{l_{i_{1}}(x)=0\}$ , $\{l_{i_{1}}(x)=1,l_{i_{2}}(x)=0\}$ , …, $\{l_{i_{m-1}}(x)=1,l_{i_{m}}(x)=0\}$ , $\{l_{i_{m}}(x)=1\}$ . The value $z(x)$ is constant in each of the considered domains. Using these facts it is easy to show that there exists a decision tree $\Gamma$ over $z$ , which solves the problem $z$ nondeterministically and for which $h(\Gamma)=2$ if $m\geq 2$ . Therefore $h_{U_{1}}^{la}(n)\leq 2$ . One can show that there exists a problem $z$ over $U_{1}$ such that $\dim z=n$ and $h_{U_{1}}^{la}(z)\geq 2$ . Therefore $h_{U_{1}}^{la}(n)=2$ .

Since the function $h_{U_{1}}^{ld}$ has the type of behavior LOG, the information system $U_{1}$ belongs to the class $W_{1}^{l}$ – see Lemma 6 and Table 1. One can show that the information system $U_{1}$ satisfies the condition of reduction with parameter $2$ and has finite I-dimension equal $1$ . ∎∎

Define an information system $U_{2}=(A_{2},F_{2})$ as follows: $A_{2}=\mathbb{N}$ and $F_{2}=\{p_{i}:i\in\mathbb{N}\}$ .

Lemma 8.

The information system $U_{2}$ belongs to the class $W_{2}^{l}$ , $h_{U_{2}}^{ld}(n)=n$ , $h_{U_{2}}^{la}(n)=n$ , $L_{U_{2}}^{ld}(n)=2(n+1)$ , and $L_{U_{2}}^{la}(n)=2(n+1)$ for any $n\in\mathbb{N}$ . This information system does not satisfy the condition of reduction and has finite I-dimension equal $1$ .

Proof.

It is easy to show that $N_{U_{2}}(n)=n+1$ for any $n\in\mathbb{N}$ . Using Proposition 5, we obtain $L_{U_{2}}^{ld}(n)=L_{U_{2}}^{la}(n)=2(n+1)$ for any $n\in\mathbb{N}$ .

Let $n\in\mathbb{N}$ . It is clear that the system of equations

S_{n}=\{p_{1}(x)=0,\ldots,p_{n}(x)=0\}

is compatible on $A_{2}$ and each proper subsystem of the system $S_{n}$ has the set of solutions different from the set of solutions of $S_{n}$ . Therefore $U_{2}$ does not satisfy the condition of reduction. Using Propositions 1 and 2, we obtain $h_{U_{2}}^{ld}(n)=h_{U_{2}}^{la}(n)=n$ for any $n\in\mathbb{N}$ .

Since the function $h_{U_{2}}^{ld}$ has the type of behavior LIN and the function $L_{U_{2}}^{ld}$ has the type of behavior POL, the information system $U_{2}$ belongs to the class $W_{2}^{l}$ – see Lemma 6 and Table 1. One can show that this information system has finite I-dimension equal $1$ . ∎∎

Define an information system $U_{3}=(A_{3},F_{3})$ as follows: $A_{3}=\mathbb{N}$ and $F_{3}$ is the set of all functions from $\mathbb{N}$ to $\{0,1\}$ .

Lemma 9.

The information system $U_{3}$ belongs to the class $W_{3}^{l}$ , $h_{U_{3}}^{ld}(n)=n$ , $h_{U_{3}}^{la}(n)=n$ , $L_{U_{3}}^{ld}(n)=2^{n+1}$ , and $L_{U_{3}}^{la}(n)=2^{n+1}$ for any $n\in\mathbb{N}$ . This information system does not satisfy the condition of reduction and has infinite I-dimension.

Proof.

It is easy to show that the information system $U_{3}$ has infinite I-dimension. Using Lemma 5, we obtain that the system $U_{3}$ does not satisfy the condition of reduction. Let $n\in\mathbb{N}$ . By Propositions 1 and 2, $h_{U_{3}}^{ld}(n)=h_{U_{3}}^{la}(n)=n$ . By Propositions 3 and 4, $L_{U_{3}}^{ld}(n)=L_{U_{3}}^{la}(n)=2^{n+1}$ .

Since the function $L_{U_{3}}^{ld}$ has the type of behavior EXP, the information system $U_{3}$ belongs to the class $W_{3}^{l}$ – see Lemma 6 and Table 1. ∎∎

Proof of Theorem 1.

The statements of the theorem follow from Lemmas 6-9. ∎∎

5 Proofs of Theorems 2-4

First, we prove a number of auxiliary statements.

Lemma 10.

Let $U=(A,F)$ be an infinite binary information system. Then the information system $U$ is $ld$ -reachable.

Proof.

Let $z=(\nu,f_{1},\ldots,f_{n})$ be a problem over $U$ . Then there exists a decision tree $\Gamma$ over $z$ , which solves this problem deterministically and whose depth is at most $h_{U}^{ld}(n)$ . By removal of some nodes and edges from $\Gamma$ , we can obtain a decision tree $\Gamma^{\prime}$ over $z$ , which solves the problem $z$ deterministically and in which each working node has exactly two leaving edges and each complete path is realizable. Let $\xi_{1}$ and $\xi_{2}$ be different complete paths in $\Gamma^{\prime}$ , $a_{1}\in A(\xi_{1})$ , and $a_{2}\in A(\xi_{2})$ . It is easy to show that $(f_{1}(a_{1}),\ldots,f_{n}(a_{1}))\neq(f_{1}(a_{2}),\ldots,f_{n}(a_{2}))$ . Therefore $L_{t}(\Gamma^{\prime})\leq N_{U}(f_{1},\ldots,f_{n})\leq N_{U}(n)$ . It is clear that $\Gamma^{\prime}\in G_{d}^{2}(U)$ . By Lemma 3, $L_{w}(\Gamma^{\prime})=L_{t}(\Gamma^{\prime})-1$ . Therefore $L(\Gamma^{\prime})\leq 2N_{U}(n)$ . By Proposition 5, $2N_{U}(n)=L_{U}^{ld}(n)$ . Taking into account that $h(\Gamma^{\prime})\leq h_{U}^{ld}(n)$ and $z$ is an arbitrary problem over $U$ with $\dim z=n$ , we obtain that $U$ is $ld$ -reachable. ∎∎

Lemma 11.

Let $U$ be an infinite binary information system such that $h_{U}^{la}(n)=n$ for any $n\in\mathbb{N}$ . Then the information system $U$ is $la$ -reachable.

Proof.

Let $z=(\nu,f_{1},\ldots,f_{n})$ be a problem over $U$ and $\Gamma$ be a decision tree over $z$ that solves the problem $z$ deterministically and satisfies the following conditions: the number of working nodes in each complete path of $\Gamma$ is equal to $n$ and these nodes in the order from the root to a terminal node are labeled with attributes $f_{1},\ldots,f_{n}$ . Remove from $\Gamma$ all nodes and edges that do not belong to realizable complete paths. Let $w$ be a working node in the obtained tree that has only one leaving edge $d$ entering a node $v$ . We remove the node $w$ and edge $d$ and connect the edge $e$ entering $w$ to the node $v$ . We do the same with all working nodes with only one leaving edge. Denote by $\Gamma^{\prime}$ the obtained decision tree. It is clear that $\Gamma^{\prime}$ solves the problem $z$ deterministically and hence nondeterministically, $\Gamma^{\prime}\in G_{d}^{2}(U)$ , and $L_{t}(\Gamma^{\prime})\leq N_{U}(f_{1},\ldots,f_{n})\leq N_{U}(n)$ . By Lemma 3, $L_{w}(\Gamma^{\prime})=L_{t}(\Gamma^{\prime})-1$ . Therefore $L(\Gamma^{\prime})\leq 2N_{U}(n)$ . Using Proposition 5, we obtain $L(\Gamma^{\prime})\leq L_{U}^{la}(n)$ . It is clear that $h(\Gamma^{\prime})\leq n=h_{U}^{la}(n)$ . Therefore $U$ is $la$ -reachable. ∎∎

Lemma 12.

Let $U$ be an infinite binary information system, which satisfies the condition of reduction. Then the information system $U$ is not $la$ -reachable.

Proof.

By Proposition 2, the function $h_{U}^{la}(n)$ is bounded from above by a positive constant $c$ . By Proposition 6, the function $N_{U}(n)$ is not bounded from above by a constant. Choose $n\in\mathbb{N}$ such that $N_{U}(n)>2^{2c}$ . Let $z=(\nu,f_{1},\ldots,f_{n})$ be a problem over $U$ such that $\nu(\bar{\delta}_{1})\neq\nu(\bar{\delta}_{2})$ for any $\bar{\delta}_{1},\bar{\delta}_{2}\in\{0,1\}^{n}$ , $\bar{\delta}_{1}\neq\bar{\delta}_{2}$ , and $N_{U}(f_{1},\ldots,f_{n})=N_{U}(n)$ . Let $\Gamma$ be a decision tree over $z$ , which solves the problem $z$ nondeterministically, for which $h(\Gamma)\leq h_{U}^{la}(n)\leq c$ , and which has the minimum number of nodes among such trees. In the same way as it was done in the proof of Lemma 2, we can prove that $\Gamma\in G_{a}^{f}(U)$ . It is clear that $L_{t}(\Gamma)\geq N_{U}(f_{1},\ldots,f_{n})=N_{U}(n)$ . Let us assume that $\Gamma\in G_{d}^{2}(U)$ . Then it is easy to show that $h(\Gamma)\geq\log_{2}L_{t}(\Gamma)\geq\log_{2}N_{U}(n)>2c$ , which is impossible by the choice of $\Gamma$ . Therefore $\Gamma\in G_{a}^{f}(U)\setminus G_{d}^{2}(U)$ . By Lemma 4, $L_{w}(\Gamma)>L_{t}(\Gamma)-1\geq N_{U}(n)-1$ . Using Proposition 5, we obtain $L(\Gamma)>2N_{U}(n)=L_{U}^{la}(n)$ . Therefore $U$ is not $la$ -reachable. ∎∎

Lemma 13.

Let $U=(A,F)$ be an infinite binary information system, which satisfies the condition of reduction with parameter $m$ . Then $(m,(m+1)L_{U}^{la}(n)/2+1)$ is a boundary $la$ -pair of the system $U$ .

Proof.

Let $z=(\nu,f_{1},\ldots,f_{n})$ be a problem over $U$ . We now describe a decision tree $\Gamma$ over $z$ , which solves the problem $z$ nondeterministically and for which $h(\Gamma)\leq m$ and $L(\Gamma)\leq(m+1)L_{U}^{la}(n)/2+1$ . For each tuple $\bar{\delta}=(\delta_{1},\ldots,\delta_{n})\in\{0,1\}^{n}$ for which the system of equations

S_{\bar{\delta}}=\{f_{1}(x)=\delta_{1},\ldots,f_{n}(x)=\delta_{n}\}

has a solution from $A$ , we describe a complete path $\xi_{\bar{\delta}}$ . Since the information system $U$ satisfies the condition of reduction with parameter $m$ , there exists a subsystem

S_{\bar{\delta}}^{\prime}=\{f_{i_{1}}(x)=\delta_{i_{1}},\ldots,f_{i_{t}}(x)=\delta_{i_{t}}\}

of the system $S_{\bar{\delta}}$ , which has the same set of solutions and for which $t\leq m$ . Then

\xi_{\bar{\delta}}=v_{0},d_{0},v_{1},d_{1},\ldots,v_{t},d_{t},v_{t+1},

where the node $v_{0}$ and the edge $d_{0}$ are not labeled, for $j=1,\ldots,t$ , the node $v_{j}$ is labeled with the attribute $f_{i_{j}}$ and the edge $d_{j}$ is labeled with the number $\delta_{i_{j}}$ , and the node $v_{t+1}$ is labeled with the number $\nu(\bar{\delta})$ . We merge initial nodes of all such complete paths and denote by $\Gamma$ the obtained tree. One can show that $\Gamma$ is a decision tree over $z$ , which solves the problem $z$ nondeterministically and for which $h(\Gamma)\leq m$ . The number of the considered complete paths is equal to $N_{U}(f_{1},\ldots,f_{n})\leq N_{U}(n)$ . The number of nodes in each complete paths is at most $m+2$ . Therefore $L(\Gamma)\leq(m+1)N_{U}(n)+1$ . By Proposition 5, $N_{U}(n)=L_{U}^{la}(n)/2$ . Hence $L(\Gamma)\leq(m+1)L_{U}^{la}(n)/2+1$ . Thus, $(m,(m+1)L_{U}^{la}(n)/2+1)$ is a boundary $la$ -pair of the system $U$ . ∎∎

Proof of Theorem 2.

Each information system from the class $W_{1}^{l}$ satisfies the condition of reduction (see Table 3).

(a) Let $U$ be an information system from the class $W_{1}^{l}$ . Using Lemma 10, we obtain that the system $U$ is $ld$ -reachable.

(b) Let $U$ be an information system from the class $W_{1}^{l}$ . Then, for some $m\in\mathbb{N}$ , the system $U$ satisfies the condition of reduction with parameter $m$ . Using Lemma 12, we obtain that the system $U$ is not $la$ -reachable. Using Lemma 13, we obtain that $(m,(m+1)L_{U}^{la}(n)/2+1)$ is a boundary $la$ -pair of the system $U$ . ∎∎

Proof of Theorem 3.

Each information system from the class $W_{2}^{l}$ does not satisfy the condition of reduction (see Table 3).

(a) Let $U$ be an information system from the class $W_{2}^{l}$ . Using Lemma 10, we obtain that the system $U$ is $ld$ -reachable.

(b) Let $U$ be an information system from the class $W_{2}^{l}$ . By Proposition 2, $h_{U}^{la}(n)=n$ for any $n\in\mathbb{N}$ . Using Lemma 11, we obtain that the system $U$ is $la$ -reachable. ∎∎

Proof of Theorem 4.

Each information system from the class $W_{3}^{l}$ does not satisfy the condition of reduction (see Table 3).

(a) Let $U$ be an information system from the class $W_{3}^{l}$ . Using Lemma 10, we obtain that the system $U$ is $ld$ -reachable.

(b) Let $U$ be an information system from the class $W_{3}^{l}$ . By Proposition 2, $h_{U}^{la}(n)=n$ for any $n\in\mathbb{N}$ . Using Lemma 11, we obtain that the system $U$ is $la$ -reachable. ∎∎

6 Conclusions

In this paper, we divided the set of all infinite binary information systems into three complexity classes depending on the worst case time and space complexity of deterministic and nondeterministic decision trees. This allowed us to identify nontrivial relationships between deterministic decision trees and decision rule systems represented by nondeterministic decision trees. For each complexity class, we studied issues related to time-space trade-off for deterministic and nondeterministic decision trees. In the future, we are planning to generalize the obtained results to the case of classes of decision tables closed under operations of removal of attributes and changing decisions attached to rows of decision tables.

Acknowledgements

Research reported in this publication was supported by King Abdullah University of Science and Technology (KAUST).

References

[1] AbouEisha, H., Amin, T., Chikalov, I., Hussain, S., Moshkov, M.: Extensions of Dynamic Programming for Combinatorial Optimization and Data Mining, Intelligent Systems Reference Library, vol. 146. Springer, Cham (2019)
[2] Alsolami, F., Azad, M., Chikalov, I., Moshkov, M.: Decision and Inhibitory Trees and Rules for Decision Tables with Many-valued Decisions, Intelligent Systems Reference Library, vol. 156. Springer, Cham (2020)
[3] Ben-Or, M.: Lower bounds for algebraic computation trees (preliminary report). In: 15th Annual ACM Symposium on Theory of Computing, STOC 1983, pp. 80–86 (1983)
[4] Boros, E., Hammer, P.L., Ibaraki, T., Kogan, A.: Logical analysis of numerical data. Math. Program. 79, 163–190 (1997)
[5] Boros, E., Hammer, P.L., Ibaraki, T., Kogan, A., Mayoraz, E., Muchnik, I.B.: An implementation of logical analysis of data. IEEE Trans. Knowl. Data Eng. 12(2), 292–306 (2000)
[6] Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Wadsworth, Belmont, CA (1984)
[7] Chikalov, I., Lozin, V.V., Lozina, I., Moshkov, M., Nguyen, H.S., Skowron, A., Zielosko, B.: Three Approaches to Data Analysis - Test Theory, Rough Sets and Logical Analysis of Data, Intelligent Systems Reference Library, vol. 41. Springer, Berlin Heidelberg (2013)
[8] Dobkin, D.P., Lipton, R.J.: A lower bound of the $(1/2)n^{2}$ on linear search programs for the knapsack problem. J. Comput. Syst. Sci. 16(3), 413–417 (1978)
[9] Dobkin, D.P., Lipton, R.J.: On the complexity of computations under varying sets of primitives. J. Comput. Syst. Sci. 18(1), 86–91 (1979)
[10] Durdymyradov, K., Moshkov, M.: Time and space complexity of deterministic and nondeterministic decision trees. Local approach. In: 2023 IEEE International Conference on Big Data. 9th Special Session on Information Granulation in Data Science and Scalable Computing (2023). (to appear)
[11] Fürnkranz, J., Gamberger, D., Lavrac, N.: Foundations of Rule Learning. Cognitive Technologies. Springer, Berlin Heidelberg (2012)
[12] Gabrielov, A., Vorobjov, N.: On topological lower bounds for algebraic computation trees. Found. Comput. Math. 17(1), 61–72 (2017)
[13] Grigoriev, D., Karpinski, M., Vorobjov, N.: Improved lower bound on testing membership to a polyhedron by algebraic decision trees. In: 36th Annual Symposium on Foundations of Computer Science, FOCS 1995, pp. 258–265 (1995)
[14] Grigoriev, D., Karpinski, M., Yao, A.C.: An exponential lower bound on the size of algebraic decision trees for Max. Computational Complexity 7(3), 193–203 (1998)
[15] Molnar, C.: Interpretable Machine Learning. A Guide for Making Black Box Models Explainable, 2 edn. (2022). URL christophm.github.io/interpretable-ml-book/
[16] Morávek, J.: A localization problem in geometry and complexity of discrete programming. Kybernetika 8(6), 498–516 (1972)
[17] Moshkov, M.: Decision Trees. Theory and Applications (in Russian). Nizhny Novgorod University Publishers, Nizhny Novgorod (1994)
[18] Moshkov, M.: Optimization problems for decision trees. Fundam. Inform. 21(4), 391–401 (1994)
[19] Moshkov, M.: Two approaches to investigation of deterministic and nondeterministic decision trees complexity. In: 2nd World Conference on the Fundamentals of Artificial Intelligence, WOCFAI 1995, pp. 275–280 (1995)
[20] Moshkov, M.: Comparative analysis of deterministic and nondeterministic decision tree complexity. Global approach. Fundam. Inform. 25(2), 201–214 (1996)
[21] Moshkov, M.: Comparative analysis of deterministic and nondeterministic decision tree complexity. Local approach. In: Trans. Rough Sets IV, Lecture Notes in Computer Science, vol. 3700, pp. 125–143. Springer, Berlin Heidelberg (2005)
[22] Moshkov, M.: Time complexity of decision trees. In: Trans. Rough Sets III, Lecture Notes in Computer Science, vol. 3400, pp. 244–459. Springer, Berlin Heidelberg (2005)
[23] Moshkov, M.: On the class of restricted linear information systems. Discret. Math. 307(22), 2837–2844 (2007)
[24] Moshkov, M.: Time and space complexity of deterministic and nondeterministic decision trees. Ann. Math. Artif. Intell. 91(1), 45–74 (2023)
[25] Moshkov, M., Piliszczuk, M., Zielosko, B.: Partial Covers, Reducts and Decision Rules in Rough Sets - Theory and Applications, Studies in Computational Intelligence, vol. 145. Springer, Berlin Heidelberg (2008)
[26] Moshkov, M., Zielosko, B.: Combinatorial Machine Learning - A Rough Set Approach, Studies in Computational Intelligence, vol. 360. Springer, Berlin Heidelberg (2011)
[27] Naiman, D.Q., Wynn, H.P.: Independence number and the complexity of families of sets. Discr. Math. 154, 203–216 (1996)
[28] Pawlak, Z.: Information systems theoretical foundations. Inf. Syst. 6(3), 205–218 (1981)
[29] Pawlak, Z.: Rough Sets - Theoretical Aspects of Reasoning about Data, Theory and Decision Library : series D, vol. 9. Kluwer (1991)
[30] Pawlak, Z., Polkowski, L., Skowron, A.: Rough set theory. In: B.W. Wah (ed.) Wiley Encyclopedia of Computer Science and Engineering. John Wiley & Sons, Inc. (2008). URL https://doi.org/10.1002/9780470050118.ecse466
[31] Pawlak, Z., Skowron, A.: Rudiments of rough sets. Inf. Sci. 177(1), 3–27 (2007)
[32] Rokach, L., Maimon, O.: Data Mining with Decision Trees - Theory and Applications, Series in Machine Perception and Artificial Intelligence, vol. 69. WorldScientific, Singapore (2007)
[33] Sauer, N.: On the density of families of sets. J. of Combinatorial Theory (A) 13, 145–147 (1972)
[34] Shelah, S.: A combinatorial problem; stability and order for models and theories in infinitary languages. Pacific J. of Mathematics 41, 241–261 (1972)
[35] Skowron, A., Rauszer, C.: The discernibility matrices and functions in information systems. In: R. Slowinski (ed.) Intelligent Decision Support - Handbook of Applications and Advances of the Rough Sets Theory, Theory and Decision Library, vol. 11, pp. 331–362. Springer (1992)
[36] Steele, J.M., Yao, A.C.: Lower bounds for algebraic decision trees. J. Algorithms 3(1), 1–8 (1982)
[37] Yao, A.C.: Algebraic decision trees and Euler characteristics. In: 33rd Annual Symposium on Foundations of Computer Science, FOCS 1992, pp. 268–277 (1992)
[38] Yao, A.C.: Decision tree complexity and Betti numbers. In: 26th Annual ACM Symposium on Theory of Computing, STOC 1994, pp. 615–624 (1994)