This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A Local Approach to Studying the Time and Space Complexity of Deterministic and Nondeterministic Decision Trees

Kerven Durdymyradov and Mikhail Moshkov
Computer, Electrical and Mathematical Sciences & Engineering Division
and Computational Bioscience Research Center
King Abdullah University of Science and Technology (KAUST)
Thuwal 23955-6900, Saudi Arabia
{kerven.durdymyradov,mikhail.moshkov}@kaust.edu.sa
Abstract

Decision trees and decision rules are intensively studied and used in different areas of computer science. The questions important for the theory of decision trees and rules include relations between decision trees and decision rule systems, time-space tradeoff for decision trees, and time-space tradeoff for decision rule systems. In this paper, we study arbitrary infinite binary information systems each of which consists of an infinite set called universe and an infinite set of two-valued functions (attributes) defined on the universe. We consider the notion of a problem over information system, which is described by a finite number of attributes and a mapping associating a decision to each tuple of attribute values. As algorithms for problem solving, we investigate deterministic and nondeterministic decision trees that use only attributes from the problem description. Nondeterministic decision trees are representations of decision rule systems that sometimes have less space complexity than the original rule systems. As time and space complexity, we study the depth and the number of nodes in the decision trees. In the worst case, with the growth of the number of attributes in the problem description, (i) the minimum depth of deterministic decision trees grows either as a logarithm or linearly, (ii) the minimum depth of nondeterministic decision trees either is bounded from above by a constant or grows linearly, (iii) the minimum number of nodes in deterministic decision trees has either polynomial or exponential growth, and (iv) the minimum number of nodes in nondeterministic decision trees has either polynomial or exponential growth. Based on these results, we divide the set of all infinite binary information systems into three complexity classes. This allows us to identify nontrivial relationships between deterministic decision trees and decision rules systems represented by nondeterministic decision trees. For each class, we study issues related to time-space trade-off for deterministic and nondeterministic decision trees.

Keywords: Deterministic decision trees, Nondeterministic decision trees, Time complexity, Space complexity, Complexity classes, Time-space trade-off.

1 Introduction

Decision trees and decision rules are intensively studied and used in different areas of computer science. The questions important for the theory of decision trees and rules include relations between decision trees and decision rule systems, time-space tradeoff for decision trees, and time-space tradeoff for decision rule systems.

In this paper, instead of decision rule systems we study nondeterministic decision trees. These trees can be considered as representations of decision rule systems that sometimes have less space complexity than the original rule systems. We study problems over infinite binary information systems and divide the set of all infinite binary information systems into three complexity classes depending on the worst case time and space complexity of deterministic and nondeterministic decision trees solving problems. This allows us to identify nontrivial relationships between deterministic decision trees and decision rule systems represented by nondeterministic decision trees. For each complexity class, we study issues related to time-space trade-off for deterministic and nondeterministic decision trees.

Decision trees [1, 2, 6, 22, 26, 32] and systems of decision rules [4, 5, 7, 11, 25, 29, 30, 31, 35] are widely used as classifiers to predict a decision for a new object, as a means of knowledge representation, and as algorithms for solving problems of fault diagnosis, computational geometry, combinatorial optimization, etc.

Decision trees and rules are among the most interpretable models for classifying and representing knowledge [15]. In order to better understand decision trees, we should not only minimize the number of their nodes, but also the depth of decision trees to avoid the consideration of long conjunctions of conditions corresponding to long paths in these trees. Similarly, for decision rule systems, we should minimize both the total length of rules and the maximum length of a rule in the system. When we consider decision trees and decision rule systems as algorithms (usually sequential for decision trees and parallel for decision rule systems), we should have in mind the same bi-criteria optimization problems to minimize space and time complexity of these algorithms.

In this paper, we represent systems of decision rules as nondeterministic decision trees to compress them and to emphasize the possibility of processing different decision rules in parallel. We consider deterministic and nondeterministic decision trees as algorithms and study their space and time complexity, paying particular attention to time and space complexity relationships. Examples of deterministic and nondeterministic decision trees computing Boolean function x1x2x_{1}\wedge x_{2} can be found in Fig. 1.

Refer to caption
Figure 1: Deterministic and nondeterministic decision trees computing function x1x2x_{1}\wedge x_{2}

Infinite systems of attributes and decision trees over these systems have been intensively studied, especially systems of linear and algebraic attributes and the corresponding linear [8, 9, 16] and algebraic decision trees [3, 12, 13, 14, 36, 37, 38]. Years ago, one of the authors initiated the study of decision trees over arbitrary infinite systems of attributes [17, 18, 19, 20, 21]. In this paper, we study decision trees over arbitrary infinite systems of binary attributes represented in the form of infinite binary information systems.

General information system introduced by Pawlak [28] consists of a universe (a set of objects) and a set of attributes (functions with finite image) defined on the universe. An information system is called infinite, if both its universe and the set of attributes are infinite. An information system is called binary if each of its attributes has values from the set {0,1}\{0,1\}.

Any problem over an information system is described by a finite number of attributes that divide the universe into domains in which these attributes have fixed values. A decision is attached to each domain. For a given object from the universe, it is required to find the decision attached to the domain containing this object.

As algorithms solving these problems, deterministic and nondeterministic decision trees are studied. As time complexity of a decision tree, we consider its depth, i.e., the maximum number of nodes labeled with attributes in a path from the root to a terminal node. As space complexity of a decision tree, we consider the number of its nodes.

There are two approaches to the study of infinite information systems: local when in decision trees solving a problem we can use only attributes from the problem description, and global when in the decision trees solving a problem we can use arbitrary attributes from the considered information system. In this paper, we study decision trees in the framework of the local approach.

To the best of our knowledge, time-space trade-offs for decision trees over infinite information systems were not studied in the framework of the local approach prior to the present paper except for its conference version [10], which does not contain proofs. The paper [24] was the first one in which the time-space trade-offs for decision trees over infinite information systems were studied in the framework of the global approach.

Results obtained in [24] are different from the results obtained in the present paper. Apart from the difference in approach, in [24], the set of all infinite information systems is divided not into three but into five families and the criteria for the behavior of functions characterizing the minimum depth of the deterministic and nondeterministic decision tree are completely different in comparison with the present paper. However, many of the definitions and results of the two papers appear similar. In the present paper, we use some auxiliary statements proved in [24] and adapt some proofs from [24] to the case of the local approach.

Based on the results obtained in the present paper and in [22, 26], we describe possible types of behavior of four functions hUld,hUla,LUld,LUlah_{U}^{ld},h_{U}^{la},L_{U}^{ld},L_{U}^{la} that characterize worst case time and space complexity of deterministic and nondeterministic decision trees over an infinite binary information system UU (index ll refers to the local approach). Decision trees solving a problem can use only attributes from the problem description.

The function hUldh_{U}^{ld} characterizes the growth in the worst case of the minimum depth of a deterministic decision tree solving a problem with the growth of the number of attributes in the problem description. The function hUldh_{U}^{ld} is either grows as a logarithm or linearly.

The function hUlah_{U}^{la} characterizes the growth in the worst case of the minimum depth of a nondeterministic decision tree solving a problem with the growth of the number of attributes in the problem description. The function hUlah_{U}^{la} is either bounded from above by a constant or grows linearly.

The function LUldL_{U}^{ld} characterizes the growth in the worst case of the minimum number of nodes in a deterministic decision tree solving a problem with the growth of the number of attributes in the problem description. The function LUldL_{U}^{ld} has either polynomial or exponential growth.

The function LUlaL_{U}^{la} characterizes the growth in the worst case of the minimum number of nodes in a nondeterministic decision tree solving a problem with the growth of the number of attributes in the problem description. The function LUlaL_{U}^{la} has either polynomial or exponential growth.

Each of the functions hUld,hUla,LUld,LUlah_{U}^{ld},h_{U}^{la},L_{U}^{ld},L_{U}^{la} has two types of behavior. The tuple (hUld,hUla,LUld,LUla)(h_{U}^{ld},h_{U}^{la},L_{U}^{ld},L_{U}^{la}) has three types of behavior. All these types are described in the paper and each type is illustrated by an example.

There are three complexity classes of infinite binary information systems corresponding to the three possible types of the tuple (hUld,hUla,LUld,LUla)(h_{U}^{ld},h_{U}^{la},L_{U}^{ld},L_{U}^{la}). For each class, we study joint behavior of time and space complexity of decision trees. The obtained results are related to time-space trade-off for deterministic and nondeterministic decision trees.

A pair of functions (φ,ψ)(\varphi,\psi) is called a boundary ldld-pair of the information system UU if, for any problem over UU, there exists a deterministic decision tree over zz, which solves this problem and for which the depth is at most φ(n)\varphi(n) and the number of nodes is at most ψ(n)\psi(n), where nn is the number of attributes in the problem description. An information system UU is called ldld-reachable if the pair (hUld,LUld)(h_{U}^{ld},L_{U}^{ld}) is a boundary ldld-pair of the system UU. For nondeterministic decision trees, the notions of a boundary lala-pair of an information system and lala-reachable information system are defined in a similar way. For deterministic decision trees, the best situation is when the considered information system is ldld-reachable: for any boundary ldld-pair (φ,ψ)(\varphi,\psi) for an information system UU and any natural nn, φ(n)hUld(n)\varphi(n)\geq h_{U}^{ld}(n) and ψ(n)LUld(n)\psi(n)\geq L_{U}^{ld}(n). For nondeterministic decision trees, the best situation is when the information system is lala-reachable.

For all complexity classes, all information systems from the class are ldld-reachable. For two out of the three complexity classes, all information systems from the class are lala-reachable. For the remaining class, all information systems from the class are not lala-reachable. For all information systems UU that are not lala-reachable, we find nontrivial boundary lala-pairs, which are sufficiently close to (hUla,LUla)(h_{U}^{la},L_{U}^{la}).

The rest of the paper is organized as follows: Section 2 contains main results, Sections 3-5 – proofs of these results, and Section 6 – short conclusions.

2 Main Results

Let AA be an infinite set and FF be an infinite set of functions that are defined on AA and have values from the set {0,1}\{0,1\}. The pair U=(A,F)U=(A,F) is called an infinite binary information system [28], the elements of the set AA are called objects, and the functions from FF are called attributes. The set AA is called sometimes the universe of the information system UU.

A problem over UU is a tuple of the form z=(ν,f1,,fn)z=(\nu,f_{1},\ldots,f_{n}), where ν:{0,1}n\nu:\{0,1\}^{n}\rightarrow\mathbb{N}, \mathbb{N} is the set of natural numbers {1,2,}\{1,2,\ldots\}, and f1,,fnFf_{1},\ldots,f_{n}\in F. We do not require attributes f1,,fnf_{1},\ldots,f_{n} to be pairwise distinct. The problem zz consists in finding the value of the function z(x)=ν(f1(x),,fn(x))z(x)=\nu(f_{1}(x),\ldots,f_{n}(x)) for a given object aAa\in A. The value dimz=n\dim z=n is called the dimension of the problem zz.

Various problems of combinatorial optimization, pattern recognition, fault diagnosis, probabilistic reasoning, computational geometry, etc., can be represented in this form.

As algorithms for problem solving we consider decision trees. A decision tree over the information system UU is a directed tree with a root in which the root and edges leaving the root are not labeled, each terminal node is labeled with a number from \mathbb{N}, each working node (which is neither the root nor a terminal node) is labeled with an attribute from FF, and each edge leaving a working node is labeled with a number from the set {0,1}\{0,1\}. A decision tree is called deterministic if only one edge leaves the root and edges leaving an arbitrary working node are labeled with different numbers.

Let Γ\Gamma be a decision tree over UU and

ξ=v0,d0,v1,d1,,vm,dm,vm+1\xi=v_{0},d_{0},v_{1},d_{1},\ldots,v_{m},d_{m},v_{m+1}

be a directed path from the root v0v_{0} to a terminal node vm+1v_{m+1} of Γ\Gamma (we call such path complete). Define a subset A(ξ)A(\xi) of the set AA as follows. If m=0m=0, then A(ξ)=AA(\xi)=A. Let m>0m>0 and, for i=1,,mi=1,\ldots,m, the node viv_{i} be labeled with the attribute fjif_{j_{i}} and the edge did_{i} be labeled with the number δi\delta_{i}. Then

A(ξ)={a:aA,fj1(a)=δ1,,fjm(a)=δm}.A(\xi)=\{a:a\in A,f_{j_{1}}(a)=\delta_{1},\ldots,f_{j_{m}}(a)=\delta_{m}\}.

The depth of the decision tree Γ\Gamma is the maximum number of working nodes in a complete path of Γ\Gamma. Denote by h(Γ)h(\Gamma) the depth of Γ\Gamma and by L(Γ)L(\Gamma) – the number of nodes in Γ\Gamma.

A decision tree over the information system UU is called a decision tree over the problem z=(ν,f1,,fn)z=(\nu,f_{1},\ldots,f_{n}) if each working node of Γ\Gamma is labeled with an attribute from the set {f1,,fn}\{f_{1},\ldots,f_{n}\}.

The decision tree Γ\Gamma over zz solves the problem zz nondeterministically if, for any object aAa\in A, there exists a complete path ξ\xi of Γ\Gamma such that aA(ξ)a\in A(\xi) and, for each aAa\in A and each complete path ξ\xi such that aA(ξ)a\in A(\xi), the terminal node of ξ\xi is labeled with the number z(a)z(a) (in this case, we can say that Γ\Gamma is a nondeterministic decision tree solving the problem zz). In particular, if the decision tree Γ\Gamma solves the problem zz nondeterministically, then, for each complete path ξ\xi of Γ\Gamma, either the set A(ξ)A(\xi) is empty or the function z(x)z(x) is constant on the set A(ξ)A(\xi). The decision tree Γ\Gamma over zz solves the problem zz deterministically if Γ\Gamma is a deterministic decision tree, which solves the problem zz nondeterministically (in this case, we can say that Γ\Gamma is a deterministic decision tree solving the problem zz).

Let P(U)P(U) be the set of all problems over UU. For a problem zz from P(U)P(U), let hUld(z)h_{U}^{ld}(z) be the minimum depth of a decision tree over zz solving the problem zz deterministically, hUla(z)h_{U}^{la}(z) be the minimum depth of a decision tree over zz solving the problem zz nondeterministically, LUld(z)L_{U}^{ld}(z) be the minimum number of nodes in a decision tree over zz solving the problem zz deterministically, and LUla(z)L_{U}^{la}(z) be the minimum number of nodes in a decision tree over zz solving the problem zz nondeterministically.

We consider four functions defined on the set \mathbb{N} in the following way: hUld(n)=maxh_{U}^{ld}(n)=\max hUld(z)h_{U}^{ld}(z), hUla(n)=maxh_{U}^{la}(n)=\max hUla(z)h_{U}^{la}(z), LUld(n)=maxL_{U}^{ld}(n)=\max LUld(z)L_{U}^{ld}(z), and LUla(n)=maxL_{U}^{la}(n)=\max LUla(z)L_{U}^{la}(z), where the maximum is taken among all problems zz over UU with dimzn\dim z\leq n. These functions describe how the minimum depth and the minimum number of nodes of deterministic and nondeterministic decision trees solving problems are growing in the worst case with the growth of problem dimension. To describe possible types of behavior of these four functions, we need to define some properties of infinite binary information systems.

Definition 1.

We will say that the information system U=(A,F)U=(A,F) satisfies the condition of reduction if there exists mm\in\mathbb{N} such that, for each compatible on AA system of equations {f1(x)=δ1,,fr(x)=δr},\{f_{1}(x)=\delta_{1},\ldots,f_{r}(x)=\delta_{r}\}, where rr\in\mathbb{N}, f1,,frFf_{1},\ldots,f_{r}\in F and δ1,,δr{0,1}\delta_{1},\ldots,\delta_{r}\in\{0,1\}, there exists a subsystem of this system, which has the same set of solutions from AA and contains at most mm equations. In this case, we will say that UU satisfies the condition of reduction with parameter mm.

We now consider two examples of infinite binary information systems that satisfy the condition of reduction. These examples are close to ones considered in Section 3.4 of the book [1].

Example 1.

Let d,td,t\in\mathbb{N}, f1,,ftf_{1},\ldots,f_{t} be functions from d\mathbb{R}^{d} to \mathbb{R}, where \mathbb{R} is the set of real numbers, and ss be a function from \mathbb{R} to {0,1}\{0,1\} such that s(x)=0s(x)=0 if x<0x<0 and s(x)=1s(x)=1 if x0x\geq 0. Then the infinite binary information system (d,F)(\mathbb{R}^{d},F), where F={s(fi+c):i=1,,t,c}F=\{s(f_{i}+c):i=1,\ldots,t,c\in\mathbb{R}\}, satisfies the condition of reduction with parameter 2t2t. If f1,,ftf_{1},\ldots,f_{t} are linear functions, then we deal with attributes corresponding to tt families of parallel hyperplanes in d\mathbb{R}^{d} what is common for decision trees for datasets with tt numerical attributes only [6].

Example 2.

Let PP be the Euclidean plane and ll be a straight line (line in short) in the plane. This line divides the plane into two open half-planes H1H_{1} and H2H_{2}, and the line ll. Two attributes correspond to the line ll. The first attribute takes value 0 on points from H1H_{1}, and value 11 on points from H2H_{2} and ll. The second one takes value 0 on points from H2H_{2}, and value 11 on points from H1H_{1} and ll. We denote by \mathcal{L} the set of all attributes corresponding to lines in the plane. Infinite binary information systems of the form (P,L)(P,L), where LL\subseteq\mathcal{L}, are called linear information systems.

Let ll be a line in the plane. Let us denote by (l)\mathcal{L}(l) the set of all attributes corresponding to lines, which are parallel to ll. Let pp be a point in the plane. We denote by (p)\mathcal{L}(p) the set of all attributes corresponding to lines, which pass through pp. A set CC of attributes from \mathcal{L} is called a clone if C(l)C\subseteq\mathcal{L}(l) for some line ll or C(p)C\subseteq\mathcal{L}(p) for some point pp. In [23], it was proved that a linear information system (P,L)(P,L) satisfies the condition of reduction if and only if LL is the union of a finite number of clones.

Definition 2.

Let U=(A,F)U=(A,F) be an infinite binary information system. A subset {f1,,fm}\{f_{1},\ldots,f_{m}\} of the set FF will be called independent if, for any δ1,,δm{0,1}\delta_{1},\ldots,\delta_{m}\in\{0,1\}, the system of equations {f1(x)=δ1,,fm(x)=δm},\{f_{1}(x)=\delta_{1},\ldots,f_{m}(x)=\delta_{m}\}, has a solution from the set AA. The empty set of attributes is independent by definition.

Definition 3.

We define the parameter I(U)I(U), which is called the independence dimension or I-dimension of the information system UU (this notion is similar to the notion of independence number of family of sets [27]) as follows. If, for each mm\in\mathbb{N}, the set FF contains an independent subset of cardinality mm, then I(U)=I(U)=\infty. Otherwise, I(U)I(U) is the maximum cardinality of an independent subset of the set FF.

We now consider examples of infinite binary information systems with finite I-dimension and with infinite I-dimension. More examples can be found in Lemmas 7-9.

Example 3.

Let m,tm,t\in\mathbb{N}. We denote by Pol(m)Pol(m) the set of all polynomials, which have integer coefficients and depend on variables x1,,xmx_{1},\ldots,x_{m}. We denote by Pol(m,t)Pol(m,t) the set of all polynomials from Pol(m)Pol(m) such that the degree of each polynomial is at most tt. We define infinite binary information systems U(m)U(m) and U(m,t)U(m,t) as follows: U(m)=(m,F(m))U(m)=(\mathbb{R}^{m},F(m)) and U(m,t)=(m,F(m,t))U(m,t)=(\mathbb{R}^{m},F(m,t)), where F(m)={s(p):pPol(m)}F(m)=\{s(p):p\in Pol(m)\}, F(m,t)={s(p):pPol(m,t)}F(m,t)=\{s(p):p\in Pol(m,t)\}, and s(x)=0s(x)=0 if x<0x<0 and s(x)=1s(x)=1 if x0x\geq 0. One can show that the system U(m)U(m) has infinite I-dimension and the system U(m,t)U(m,t) has finite I-dimension.

We now consider four statements that describe possible types of behavior of functions hUld(n)h_{U}^{ld}(n), hUla(n)h_{U}^{la}(n), LUld(n)L_{U}^{ld}(n), and LUla(n)L_{U}^{la}(n). The next statement follows immediately from Theorem 4.3 [22].

Proposition 1.

For any infinite binary information system UU, the function hUld(n)h_{U}^{ld}(n) has one of the following two types of behavior:

(LOG) If the system UU satisfies the condition of reduction, then hUld(n)=Θ(logn)h_{U}^{ld}(n)=\Theta(\log n).

(LIN) If the system UU does not satisfy the condition of reduction, then hUld(n)=nh_{U}^{ld}(n)=n for any nn\in\mathbb{N}.

The next statement follows immediately from Theorem 8.2 [26].

Proposition 2.

For any infinite binary information system U=(A,F)U=(A,F)\,, the function hUla(n)h_{U}^{la}(n) has one of the following two types of behavior:

(CON) If the system UU satisfies the condition of reduction, then hUla(n)=O(1)h_{U}^{la}(n)=O(1).

(LIN) If the system UU does not satisfy the condition of reduction, then hUla(n)=nh_{U}^{la}(n)=n for any nn\in\mathbb{N}.

Proposition 3.

For any infinite binary information system UU, the function LUld(n)L_{U}^{ld}(n) has one of the following two types of behavior:

(POL) If the system UU has finite I-dimension, then for any nn\in\mathbb{N},

2(n+1)LUld(n)2(4n)I(U).2(n+1)\leq L_{U}^{ld}(n)\leq 2(4n)^{I(U)}.

(EXP) If the system UU has infinite I-dimension, then for any nn\in\mathbb{N},

LUld(n)=2n+1.L_{U}^{ld}(n)=2^{n+1}.
Proposition 4.

For any infinite binary information system UU and any nn\in\mathbb{N},

LUla(n)=LUld(n).L_{U}^{la}(n)=L_{U}^{ld}(n).

Let UU be an infinite binary information system. Proposition 1 allows us to correspond to the function hUld(n)h_{U}^{ld}(n) its type of behavior from the set {LOG,LIN}\{\mathrm{LOG},\mathrm{LIN}\}. Proposition 2 allows us to correspond to the function hUla(n)h_{U}^{la}(n) its type of behavior from the set {CON,LIN}\{\mathrm{CON},\mathrm{LIN}\}. Propositions 3 and 4 allow us to correspond to each of the functions LUld(n)L_{U}^{ld}(n) and LUla(n)L_{U}^{la}(n) its type of behavior from the set {POL,EXP}\{\mathrm{POL},\mathrm{EXP}\}. A tuple obtained from the tuple

(hUld(n),hUla(n),LUld(n),LUla(n))(h_{U}^{ld}(n),h_{U}^{la}(n),L_{U}^{ld}(n),L_{U}^{la}(n))

by replacing functions with their types of behavior is called the local type of the information system UU. We now describe all possible local types of infinite binary information systems.

Theorem 1.

For any infinite binary information system, its local type coincides with one of the rows of Table 1. Each row of Table 1 is the local type of some infinite binary information system.

Table 1: Possible local types of infinite binary information systems
hUld(n)h_{U}^{ld}(n) hUla(n)h_{U}^{la}(n) LUld(n)L_{U}^{ld}(n) LUla(n)L_{U}^{la}(n)
1 LOG\mathrm{LOG} CON\mathrm{CON} POL\mathrm{POL} POL\mathrm{POL}
2 LIN\mathrm{LIN} LIN\mathrm{LIN} POL\mathrm{POL} POL\mathrm{POL}
3 LIN\mathrm{LIN} LIN\mathrm{LIN} EXP\mathrm{EXP} EXP\mathrm{EXP}

For i=1,2,3i=1,2,3, we denote by WilW_{i}^{l} the class of all infinite binary information systems, whose local type coincides with the iith row of Table 1. We now study for each of these complexity classes joint behavior of the depth and number of nodes in decision trees solving problems.

For a given infinite binary information system UU, we will consider pairs of functions (φ,ψ)(\varphi,\psi) such that, for any problem zz over UU, there exists a deterministic decision tree over zz solving zz with the depth at most φ(dimz)\varphi(\dim z) and the number of nodes at most ψ(dimz)\psi(\dim z). We will study such pairs and will call them boundary ldld-pairs.

Definition 4.

A pair of functions (φ,ψ)(\varphi,\psi), where φ:{0}\varphi:\mathbb{N}\rightarrow\mathbb{N}\cup\{0\} and ψ:{0}\psi:\mathbb{N}\rightarrow\mathbb{N}\cup\{0\}, will be called a boundary ldld-pair of the information system UU if, for any problem zz over UU, there exists a decision tree Γ\Gamma over zz, which solves the problem zz deterministically and for which h(Γ)φ(n)h(\Gamma)\leq\varphi(n) and L(Γ)ψ(n)L(\Gamma)\leq\psi(n), where n=dimzn=\dim z.

We are interested in finding boundary ldld-pairs with functions that grow as slowly as possible. It is clear that, for any boundary ldld-pair (φ,ψ)(\varphi,\psi) of the information system UU, the following inequalities hold: φ(n)hUld(n)\varphi(n)\geq h_{U}^{ld}(n) and ψ(n)LUld(n)\psi(n)\geq L_{U}^{ld}(n). So the best possible situation is when (hUld,LUld)(h_{U}^{ld},L_{U}^{ld}) is a boundary ldld-pair of UU.

Definition 5.

An information system UU will be called ldld-reachable if the pair (hUld,LUld)(h_{U}^{ld},L_{U}^{ld}) is a boundary ldld-pair of the system UU.

We now consider similar notions for nondeterministic decision trees: the notion of boundary lala-pair and the notion of lala-reachable information system.

Definition 6.

A pair of functions (φ,ψ)(\varphi,\psi), where φ:{0}\varphi:\mathbb{N}\rightarrow\mathbb{N}\cup\{0\} and ψ:{0}\psi:\mathbb{N}\rightarrow\mathbb{N}\cup\{0\}, will be called a boundary lala-pair of the information system UU if, for any problem zz over UU, there exists a decision tree Γ\Gamma over zz, which solves the problem zz nondeterministically and for which h(Γ)φ(n)h(\Gamma)\leq\varphi(n) and L(Γ)ψ(n)L(\Gamma)\leq\psi(n), where n=dimzn=\dim z.

Definition 7.

An information system UU will be called lala-reachable if the pair (hUla,LUla)(h_{U}^{la},L_{U}^{la}) is a boundary lala-pair of the system UU.

Note that for deterministic decision trees, the best situation is when the considered information system is ldld-reachable and for nondeterministic decision trees – when the information system is lala-reachable.

Each information system from the classes W1l,W2lW_{1}^{l},W_{2}^{l}, and W3lW_{3}^{l} is ldld-reachable. Each information system from the classes W2lW_{2}^{l} and W3lW_{3}^{l} is lala-reachable. Each information system from the class W1lW_{1}^{l} is not lala-reachable. For all information systems UU, which are not lala-reachable, we find nontrivial boundary lala-pairs that are sufficiently close to (hUla,LUla)(h_{U}^{la},L_{U}^{la}).

The obtained results are related to time-space trade-off for deterministic and nondeterministic decision trees. Details can be found in the following three theorems.

Theorem 2.

Let UU be an information system from the class W1lW_{1}^{l}. Then

(a) The system UU is ldld-reachable.

(b) The system UU is not lala-reachable and there exists mm\in\mathbb{N} such that

(m,(m+1)LUla(n)/2+1)(m,(m+1)L_{U}^{la}(n)/2+1)

is a boundary lala-pair of the system UU.

Theorem 3.

Let UU be an information system from the class W2lW_{2}^{l}. Then

(a) The system UU is ldld-reachable.

(b) The system UU is lala-reachable.

Theorem 4.

Let UU be an information system from the class W3lW_{3}^{l}. Then

(a) The system UU is ldld-reachable.

(b) The system UU is lala-reachable.

Table 2 summarizes Theorems 1-4. The first column contains the name of the complexity class. The next four columns describe the local type of information systems from this class. The last two columns “ldld-pairs” and “lala-pairs” contain information about boundary ldld-pairs and boundary lala-pairs for information systems from the considered class: “ldld-reachable” means that all information systems from the class are ldld-reachable, “lala-reachable” means that all information systems from the class are lala-reachable, Th. 2 (b) is a link to the corresponding statement Theorem 2 (b).

Table 2: Summary of Theorems 1-4
hUld(n)h_{U}^{ld}(n) hUla(n)h_{U}^{la}(n) LUld(n)L_{U}^{ld}(n) LUla(n)L_{U}^{la}(n) ldld-pairs lala-pairs
W1lW_{1}^{l} LOG\mathrm{LOG} CON\mathrm{CON} POL\mathrm{POL} POL\mathrm{POL} ldld-reachable Th. 2 (b)
W2lW_{2}^{l} LIN\mathrm{LIN} LIN\mathrm{LIN} POL\mathrm{POL} POL\mathrm{POL} ldld-reachable lala-reachable
W3lW_{3}^{l} LIN\mathrm{LIN} LIN\mathrm{LIN} EXP\mathrm{EXP} EXP\mathrm{EXP} ldld-reachable lala-reachable

3 Proofs of Propositions 3 and 4

In this section, we consider a number of auxiliary statements and prove the two mentioned propositions.

Let Γ\Gamma be a decision tree over an infinite binary information system U=(A,F)U=(A,F) and dd be an edge of Γ\Gamma entering a node ww. We denote by Γ(d)\Gamma(d) a subtree of Γ\Gamma, whose root is the node ww. We say that a complete path ξ\xi of Γ\Gamma is realizable if A(ξ)A(\xi)\neq\emptyset.

Lemma 1.

Let U=(A,F)U=(A,F) be an infinite binary information system, z=(ν,f1,,fn)z=(\nu,f_{1},\ldots,f_{n}) be a problem over UU, and Γ\Gamma be a decision tree over zz, which solves the problem zz deterministically and for which L(Γ)=LUld(z)L(\Gamma)=L_{U}^{ld}(z). Then

(a) For each node of Γ\Gamma, there exists a realizable complete path that passes through this node.

(b) Each working node of Γ\Gamma has two edges leaving this node.

Proof.

(a) It is clear that there exists at least one realizable complete path that passes through the root of Γ\Gamma. Let us assume that ww is a node of Γ\Gamma different from the root and such that there is no a realizable complete path, which passes through ww. Let dd be an edge entering the node ww. We remove from Γ\Gamma the edge dd and the subtree Γ(d)\Gamma(d). As a result, we obtain a decision tree Γ\Gamma^{\prime} over zz, which solves zz deterministically and for which L(Γ)<L(Γ)L(\Gamma^{\prime})<L(\Gamma) but this is impossible by definition of Γ\Gamma.

(b) Let us assume that in Γ\Gamma there exists a working node ww, which has only one leaving edge dd entering a node w1w_{1}. We remove from Γ\Gamma the node ww and the edge dd and connect the edge entering the node ww to the node w1w_{1}. As a result, we obtain a decision tree Γ\Gamma^{\prime} over zz, which solves the problem zz deterministically and for which L(Γ)<L(Γ)L(\Gamma^{\prime})<L(\Gamma) but this is impossible by definition of Γ\Gamma. ∎∎

Let UU be an infinite binary information system, Γ\Gamma be a decision tree over UU, and dd be an edge of Γ\Gamma. The subtree Γ(d)\Gamma(d) will be called full if there exist edges d1,,dmd_{1},\ldots,d_{m} in Γ(d)\Gamma(d) such that the removal of these edges and subtrees Γ(d1),,Γ(dm)\Gamma(d_{1}),\ldots,\Gamma(d_{m}) transforms the subtree Γ(d)\Gamma(d) into a tree GG such that each terminal node of GG is a terminal node of Γ\Gamma, and exactly two edges labeled with the numbers 0 and 11 respectively leave each working node of GG.

Lemma 2.

Let U=(A,F)U=(A,F) be an infinite binary information system, z=(ν,f1,,fn)z=(\nu,f_{1},\ldots,f_{n}) be a problem over UU, and Γ\Gamma be a decision tree over zz, which solves the problem zz nondeterministically and for which L(Γ)=LUla(z)L(\Gamma)=L_{U}^{la}(z). Then

(a) For each node of Γ\Gamma, there exists a realizable complete path that passes through this node.

(b) If a working node ww of Γ\Gamma has mm leaving edges d1,,dmd_{1},\ldots,d_{m} labeled with the same number and m2m\geq 2, then the subtrees Γ(d1),,Γ(dm)\Gamma(d_{1}),\ldots,\Gamma(d_{m}) are not full.

(c) If the root rr of Γ\Gamma has mm leaving edges d1,,dmd_{1},\ldots,d_{m} and m2m\geq 2, then the subtrees Γ(d1),,Γ(dm)\Gamma(d_{1}),\ldots,\Gamma(d_{m}) are not full.

Proof.

(a) The proof of item (a) is almost identical to the proof of item (a) of Lemma 1.

(b) Let ww be a working node of Γ\Gamma, which has mm leaving edges d1,,dmd_{1},\ldots,d_{m} labeled with the same number, m2m\geq 2, and at least one of the subtrees Γ(d1),,\Gamma(d_{1}),\ldots, Γ(dm)\Gamma(d_{m}) is full. For the definiteness, we assume that Γ(d1)\Gamma(d_{1}) is full. Remove from Γ\Gamma the edges d2,,dmd_{2},\ldots,d_{m} and subtrees Γ(d2),,Γ(dm)\Gamma(d_{2}),\ldots,\Gamma(d_{m}). We now show that the obtained tree Γ\Gamma^{\prime} solves the problem zz nondeterministically. Assume the contrary. Then there exists an object aAa\in A such that, for each complete path ξ\xi with aA(ξ)a\in A(\xi), the path ξ\xi passes through one of the edges d2,,dmd_{2},\ldots,d_{m} but it is not true. Let ξ\xi be a complete path such that aA(ξ)a\in A(\xi). Then, according to the assumption, this path passes through the node ww. Let ξ\xi^{\prime} be the part of this path from the root of Γ\Gamma to the node ww. Since the edges d1,,dmd_{1},\ldots,d_{m} are labeled with the same number and Γ(d1)\Gamma(d_{1}) is a full subtree, we can find in Γ(d1)\Gamma(d_{1}) the continuation of ξ\xi^{\prime} to a terminal node of Γ(d1)\Gamma(d_{1}) such that the obtained complete path ξ′′\xi^{\prime\prime} of Γ\Gamma satisfies the condition aA(ξ′′)a\in A(\xi^{\prime\prime}). Hence Γ\Gamma^{\prime} is a decision tree over zz, which solves the problem zz nondeterministically and for which L(Γ)<L(Γ)L(\Gamma^{\prime})<L(\Gamma), but this is impossible by definition of Γ\Gamma.

(c) Item (c) can be proven in the same way as item (b). ∎∎

We now consider two statements about classes of decision trees proved in [24]. Let Γ\Gamma be a decision tree. We denote by Lt(Γ)L_{t}(\Gamma) the number of terminal nodes in Γ\Gamma and by Lw(Γ)L_{w}(\Gamma) the number of working nodes in Γ\Gamma. It is clear that L(Γ)=1+Lt(Γ)+Lw(Γ)L(\Gamma)=1+L_{t}(\Gamma)+L_{w}(\Gamma).

Let UU be an infinite binary information systems. We denote by Gd(U)G_{d}(U) the set of all deterministic decision trees over UU and by Gd2(U)G_{d}^{2}(U) the set of all decision trees from Gd(U)G_{d}(U) such that each working node of the tree has two leaving edges.

Lemma 3 (Lemma 14 from [24]).

Let UU be an infinite binary information system. Then

(a) If ΓGd2(U)\Gamma\in G_{d}^{2}(U), then Lw(Γ)=Lt(Γ)1L_{w}(\Gamma)=L_{t}(\Gamma)-1.

(b) If ΓGd(U)Gd2(U)\Gamma\in G_{d}(U)\setminus G_{d}^{2}(U), then Lw(Γ)>Lt(Γ)1L_{w}(\Gamma)>L_{t}(\Gamma)-1.

We denote by Gaf(U)G_{a}^{f}(U) the set of all decision trees Γ\Gamma over UU that satisfy the following conditions: (i) if a working node of Γ\Gamma has mm leaving edges d1,,dmd_{1},\ldots,d_{m} labeled with the same number and m2m\geq 2, then the subtrees Γ(d1),,Γ(dm)\Gamma(d_{1}),\ldots,\Gamma(d_{m}) are not full, and (ii) if the root of Γ\Gamma has mm leaving edges d1,,dmd_{1},\ldots,d_{m} and m2m\geq 2, then the subtrees Γ(d1),,Γ(dm)\Gamma(d_{1}),\ldots,\Gamma(d_{m}) are not full. One can show that Gd2(U)Gd(U)Gaf(U)G_{d}^{2}(U)\subseteq G_{d}(U)\subseteq G_{a}^{f}(U).

Lemma 4 (Lemma 15 from [24]).

Let UU be an infinite binary information system. If Γ\Gamma\in Gaf(U)Gd2(U)G_{a}^{f}(U)\setminus G_{d}^{2}(U), then Lw(Γ)>Lt(Γ)1L_{w}(\Gamma)>L_{t}(\Gamma)-1.

Let U=(A,F)U=(A,F) be an infinite binary information system. For f1,,fnFf_{1},\ldots,f_{n}\in F, we denote by NU(f1,,fn)N_{U}(f_{1},\ldots,f_{n}) the number of nn-tuples (δ1,,δn){0,1}n(\delta_{1},\ldots,\delta_{n})\in\{0,1\}^{n} for which the system of equations

{f1(x)=δ1,,fn(x)=δn}\{f_{1}(x)=\delta_{1},\ldots,f_{n}(x)=\delta_{n}\}

has a solution from AA. For nn\in\mathbb{N}, denote

NU(n)=max{NU(f1,,fn):f1,,fnF}.N_{U}(n)=\max\{N_{U}(f_{1},\ldots,f_{n}):f_{1},\ldots,f_{n}\in F\}.

It is clear that, for any m,nm,n\in\mathbb{N}, if mnm\leq n then NU(m)NU(n)N_{U}(m)\leq N_{U}(n).

Proposition 5.

Let U=(A,F)U=(A,F) be an infinite binary information system. Then, for any nn\in\mathbb{N},

LUla(n)=LUld(n)=2NU(n).L_{U}^{la}(n)=L_{U}^{ld}(n)=2N_{U}(n).
Proof.

Let z=(ν,f1,,fm)z=(\nu,f_{1},\ldots,f_{m}) be a problem over UU and mnm\leq n. Let Γ\Gamma be a decision tree over zz, which solves the problem zz deterministically and for which L(Γ)=LUld(z)L(\Gamma)=L_{U}^{ld}(z). From Lemma 1 it follows that each working node of Γ\Gamma has two edges leaving this node and, for each node of Γ\Gamma, there exists a realizable complete path that passes through this node. Let ξ1\xi_{1} and ξ2\xi_{2} be different complete paths in Γ\Gamma, a1A(ξ1)a_{1}\in A(\xi_{1}), and a2A(ξ2)a_{2}\in A(\xi_{2}). It is easy to show that (f1(a1),,fm(a1))(f1(a2),,fm(a2))(f_{1}(a_{1}),\ldots,f_{m}(a_{1}))\neq(f_{1}(a_{2}),\ldots,f_{m}(a_{2})). Therefore Lt(Γ)NU(f1,,fm)NU(n)L_{t}(\Gamma)\leq N_{U}(f_{1},\ldots,f_{m})\leq N_{U}(n). It is clear that ΓGd2(U)\Gamma\in G_{d}^{2}(U). By Lemma 3, Lw(Γ)=Lt(Γ)1L_{w}(\Gamma)=L_{t}(\Gamma)-1. Hence L(Γ)2NU(n)L(\Gamma)\leq 2N_{U}(n). Taking into account that zz is an arbitrary problem over UU with dimzn\dim z\leq n we obtain

LUld(n)2NU(n).L_{U}^{ld}(n)\leq 2N_{U}(n).

Since any decision tree solving the problem zz deterministically solves it nondeterministically we obtain

LUla(n)LUld(n).L_{U}^{la}(n)\leq L_{U}^{ld}(n).

We now show that 2NU(n)LUla(n)2N_{U}(n)\leq L_{U}^{la}(n). Let us consider a problem z=(ν,f1,,fn)z=(\nu,f_{1},\ldots,f_{n}) over UU such that

NU(f1,,fn)=NU(n)N_{U}(f_{1},\ldots,f_{n})=N_{U}(n)

and, for any δ¯1,δ¯2{0,1}n\bar{\delta}_{1},\bar{\delta}_{2}\in\{0,1\}^{n}, if δ¯1δ¯2\bar{\delta}_{1}\neq\bar{\delta}_{2}, then ν(δ¯1)ν(δ¯2)\nu(\bar{\delta}_{1})\neq\nu(\bar{\delta}_{2}). Let Γ\Gamma be a decision tree over zz, which solves the problem zz nondeterministically and for which L(Γ)=LUla(z)L(\Gamma)=L_{U}^{la}(z). By Lemma 2, ΓGaf(U)\Gamma\in G_{a}^{f}(U). Using Lemmas 3 and 4 we obtain Lw(Γ)Lt(Γ)1L_{w}(\Gamma)\geq L_{t}(\Gamma)-1. It is clear that Lt(Γ)NU(f1,,fn)=NU(n)L_{t}(\Gamma)\geq N_{U}(f_{1},\ldots,f_{n})=N_{U}(n). Therefore L(Γ)2NU(n)L(\Gamma)\geq 2N_{U}(n), LUla(z)2NU(n)L_{U}^{la}(z)\geq 2N_{U}(n), and LUla(n)2NU(n)L_{U}^{la}(n)\geq 2N_{U}(n). ∎∎

The next statement follows directly from Lemmas 5.1 and 5.2 [22] and the evident inequality NU(n)2nN_{U}(n)\leq 2^{n}, which is true for any infinite binary information system UU. The proof of Lemma 5.1 from [22] is based on Theorems 4.6 and 4.7 from the same monograph that are similar to results obtained in [33, 34].

Proposition 6.

For any infinite binary information system UU, the function NU(n)N_{U}(n) has one of the following two types of behavior:

(POL) If the system UU has finite I-dimension, then for any nn\in\mathbb{N},

n+1NU(n)(4n)I(U).n+1\leq N_{U}(n)\leq(4n)^{I(U)}.

(EXP) If the system UU has infinite I-dimension, then for any nn\in\mathbb{N},

NU(n)=2n.N_{U}(n)=2^{n}.

We now prove Propositions 3 and 4.

Proof of Proposition 3.

The statement of the proposition follows immediately from Propositions 5 and 6. ∎∎

Proof of Proposition 4.

The statement of the proposition follows immediately from Proposition 5. ∎∎

4 Proof of Theorem 1

First, we prove five auxiliary statements.

Lemma 5.

Let U=(A,F)U=(A,F) be an infinite binary information system, which has infinite I-dimension. Then UU does not satisfy the condition of reduction.

Proof.

Let us assume the contrary: UU satisfies the condition of reduction. Then UU satisfies the condition of reduction with parameter mm for some mm\in\mathbb{N}. Since I(U)=I(U)=\infty, there exists an independent subset {f1,,fm+1}\{f_{1},\ldots,f_{m+1}\} of the set FF. It is clear that the system of equations

S={f1(x)=0,,fm+1(x)=0}S=\{f_{1}(x)=0,\ldots,f_{m+1}(x)=0\}

is compatible on AA and each proper subsystem of the system SS has the set of solutions different from the set of solutions of SS. Therefore UU does not satisfy the condition of reduction with parameter mm. ∎∎

Lemma 6.

For any infinite binary information system, its local type coincides with one of the rows of Table 1.

Proof.

To prove this statement we fill Table 3. In the first column “I-dim.” we have either “Fin” or “Inf”: “Fin” if the considered information system has finite I-dimension and “Inf” if the considered information system has infinite I-dimension. In the second column “Reduct.”, we have either “Yes” or “No”: “Yes” if the considered information system satisfies the condition of reduction and “No” otherwise.

By Lemma 5, if an information system has infinite I-dimension, then this information system does not satisfy the condition of reduction. It means that there are only three possible tuples of values of the considered two parameters of information systems, which correspond to the three rows of Table 3. The values of the considered two parameters define the types of behavior of functions hUld(n)h_{U}^{ld}(n), hUla(n)h_{U}^{la}(n), LUld(n)L_{U}^{ld}(n), and LUla(n)L_{U}^{la}(n) according to Propositions 1-4. We see that the set of possible tuples of values in the last four columns coincides with the set of rows of Table 1. ∎∎

Table 3: Parameters and local types of infinite binary information systems
I-dim. Reduct. hUld(n)h_{U}^{ld}(n) hUla(n)h_{U}^{la}(n) LUld(n)L_{U}^{ld}(n) LUla(n)L_{U}^{la}(n)
Fin Yes LOG\mathrm{LOG} CON\mathrm{CON} POL\mathrm{POL} POL\mathrm{POL}
Fin No LIN\mathrm{LIN} LIN\mathrm{LIN} POL\mathrm{POL} POL\mathrm{POL}
Inf No LIN\mathrm{LIN} LIN\mathrm{LIN} EXP\mathrm{EXP} EXP\mathrm{EXP}

For each row of Table 1, we consider an example of infinite binary information system, whose local type coincides with this row.

For any ii\in\mathbb{N}, we define two functions pi:{0,1}p_{i}:\mathbb{N}\rightarrow\{0,1\} and li:{0,1}l_{i}:\mathbb{N}\rightarrow\{0,1\}. Let jj\in\mathbb{N}. Then pi(j)=1p_{i}(j)=1 if and only if j=ij=i, and li(j)=1l_{i}(j)=1 if and only if j>ij>i.

Define an information system U1=(A1,F1)U_{1}=(A_{1},F_{1}) as follows: A1=A_{1}=\mathbb{N} and F1={li:i}F_{1}=\{l_{i}:i\in\mathbb{N}\}.

Lemma 7.

The information system U1U_{1} belongs to the class W1lW_{1}^{l}, hU1ld(n)=log2(nh_{U_{1}}^{ld}(n)=\lceil\log_{2}(n +1)+1)\rceil, hU1la(1)=1h_{U_{1}}^{la}(1)=1 and hU1la(n)=2h_{U_{1}}^{la}(n)=2 if n>1n>1, LU1ld(n)=2(n+1)L_{U_{1}}^{ld}(n)=2(n+1), and LU1la(n)=2(n+1)L_{U_{1}}^{la}(n)=2(n+1) for any nn\in\mathbb{N}. This information system satisfies the condition of reduction with parameter 22 and has finite I-dimension equal 11.

Proof.

It is easy to show that NU1(n)=n+1N_{U_{1}}(n)=n+1 for any nn\in\mathbb{N}. Using Proposition 5 we obtain LU1ld(n)=LU1la(n)=2(n+1)L_{U_{1}}^{ld}(n)=L_{U_{1}}^{la}(n)=2(n+1) for any nn\in\mathbb{N}. Let nn\in\mathbb{N}. Consider a problem z=(ν,l1,,ln)z=(\nu,l_{1},\ldots,l_{n}) over U1U_{1} such that, for each δ¯1,δ¯2{0,1}n\bar{\delta}_{1},\bar{\delta}_{2}\in\{0,1\}^{n} with δ¯1δ¯2\bar{\delta}_{1}\neq\bar{\delta}_{2}, ν(δ¯1)ν(δ¯2)\nu(\bar{\delta}_{1})\neq\nu(\bar{\delta}_{2}). It is clear that NU1(l1,,ln)=n+1N_{U_{1}}(l_{1},\ldots,l_{n})=n+1. Therefore each decision tree Γ\Gamma over zz that solves the problem zz deterministically has at least n+1n+1 terminal nodes. One can show that the number of terminal nodes in Γ\Gamma is at most 2h(Γ)2^{h(\Gamma)}. Hence n+12h(Γ)n+1\leq 2^{h(\Gamma)} and log2(n+1)h(Γ)\log_{2}(n+1)\leq h(\Gamma). Since h(Γ)h(\Gamma) is an integer, log2(n+1)h(Γ)\lceil\log_{2}(n+1)\rceil\leq h(\Gamma). Thus, hU1ld(n)log2(n+1)h_{U_{1}}^{ld}(n)\geq\lceil\log_{2}(n+1)\rceil. Set m=log2(n+1)m=\lceil\log_{2}(n+1)\rceil. Then n2m1n\leq 2^{m}-1. One can show that hU1ld(2m1)mh_{U_{1}}^{ld}(2^{m}-1)\leq m (the construction of an appropriate decision tree is based on an analog of binary search, and we use only attributes from the problem description) and hU1ld(n)hU1ld(2m1)h_{U_{1}}^{ld}(n)\leq h_{U_{1}}^{ld}(2^{m}-1). Therefore hU1ld(n)log2(n+1)h_{U_{1}}^{ld}(n)\leq\lceil\log_{2}(n+1)\rceil and hU1ld(n)=log2(n+1)h_{U_{1}}^{ld}(n)=\lceil\log_{2}(n+1)\rceil. It is clear that hU1la(1)=1h_{U_{1}}^{la}(1)=1. Let n2n\geq 2, z=(ν,f1,,fn)z=(\nu,f_{1},\ldots,f_{n}) be an arbitrary problem over U1U_{1} and li1,,liml_{i_{1}},\ldots,l_{i_{m}} be all pairwise different attributes from the set {f1,,fn}\{f_{1},\ldots,f_{n}\} ordered such that i1<<imi_{1}<\ldots<i_{m}. Then these attributes divide the set \mathbb{N} into m+1m+1 nonempty domains that are sets of solutions on \mathbb{N} of the following systems of equations: {li1(x)=0}\{l_{i_{1}}(x)=0\}, {li1(x)=1,li2(x)=0}\{l_{i_{1}}(x)=1,l_{i_{2}}(x)=0\}, …, {lim1(x)=1,lim(x)=0}\{l_{i_{m-1}}(x)=1,l_{i_{m}}(x)=0\}, {lim(x)=1}\{l_{i_{m}}(x)=1\}. The value z(x)z(x) is constant in each of the considered domains. Using these facts it is easy to show that there exists a decision tree Γ\Gamma over zz, which solves the problem zz nondeterministically and for which h(Γ)=2h(\Gamma)=2 if m2m\geq 2. Therefore hU1la(n)2h_{U_{1}}^{la}(n)\leq 2. One can show that there exists a problem zz over U1U_{1} such that dimz=n\dim z=n and hU1la(z)2h_{U_{1}}^{la}(z)\geq 2. Therefore hU1la(n)=2h_{U_{1}}^{la}(n)=2.

Since the function hU1ldh_{U_{1}}^{ld} has the type of behavior LOG, the information system U1U_{1} belongs to the class W1lW_{1}^{l} – see Lemma 6 and Table 1. One can show that the information system U1U_{1} satisfies the condition of reduction with parameter 22 and has finite I-dimension equal 11. ∎∎

Define an information system U2=(A2,F2)U_{2}=(A_{2},F_{2}) as follows: A2=A_{2}=\mathbb{N} and F2={pi:i}F_{2}=\{p_{i}:i\in\mathbb{N}\}.

Lemma 8.

The information system U2U_{2} belongs to the class W2lW_{2}^{l}, hU2ld(n)=nh_{U_{2}}^{ld}(n)=n, hU2la(n)=nh_{U_{2}}^{la}(n)=n, LU2ld(n)=2(n+1)L_{U_{2}}^{ld}(n)=2(n+1), and LU2la(n)=2(n+1)L_{U_{2}}^{la}(n)=2(n+1) for any nn\in\mathbb{N}. This information system does not satisfy the condition of reduction and has finite I-dimension equal 11.

Proof.

It is easy to show that NU2(n)=n+1N_{U_{2}}(n)=n+1 for any nn\in\mathbb{N}. Using Proposition 5, we obtain LU2ld(n)=LU2la(n)=2(n+1)L_{U_{2}}^{ld}(n)=L_{U_{2}}^{la}(n)=2(n+1) for any nn\in\mathbb{N}.

Let nn\in\mathbb{N}. It is clear that the system of equations

Sn={p1(x)=0,,pn(x)=0}S_{n}=\{p_{1}(x)=0,\ldots,p_{n}(x)=0\}

is compatible on A2A_{2} and each proper subsystem of the system SnS_{n} has the set of solutions different from the set of solutions of SnS_{n}. Therefore U2U_{2} does not satisfy the condition of reduction. Using Propositions 1 and 2, we obtain hU2ld(n)=hU2la(n)=nh_{U_{2}}^{ld}(n)=h_{U_{2}}^{la}(n)=n for any nn\in\mathbb{N}.

Since the function hU2ldh_{U_{2}}^{ld} has the type of behavior LIN and the function LU2ldL_{U_{2}}^{ld} has the type of behavior POL, the information system U2U_{2} belongs to the class W2lW_{2}^{l} – see Lemma 6 and Table 1. One can show that this information system has finite I-dimension equal 11. ∎∎

Define an information system U3=(A3,F3)U_{3}=(A_{3},F_{3}) as follows: A3=A_{3}=\mathbb{N} and F3F_{3} is the set of all functions from \mathbb{N} to {0,1}\{0,1\}.

Lemma 9.

The information system U3U_{3} belongs to the class W3lW_{3}^{l}, hU3ld(n)=nh_{U_{3}}^{ld}(n)=n, hU3la(n)=nh_{U_{3}}^{la}(n)=n, LU3ld(n)=2n+1L_{U_{3}}^{ld}(n)=2^{n+1}, and LU3la(n)=2n+1L_{U_{3}}^{la}(n)=2^{n+1} for any nn\in\mathbb{N}. This information system does not satisfy the condition of reduction and has infinite I-dimension.

Proof.

It is easy to show that the information system U3U_{3} has infinite I-dimension. Using Lemma 5, we obtain that the system U3U_{3} does not satisfy the condition of reduction. Let nn\in\mathbb{N}. By Propositions 1 and 2, hU3ld(n)=hU3la(n)=nh_{U_{3}}^{ld}(n)=h_{U_{3}}^{la}(n)=n. By Propositions 3 and 4, LU3ld(n)=LU3la(n)=2n+1L_{U_{3}}^{ld}(n)=L_{U_{3}}^{la}(n)=2^{n+1}.

Since the function LU3ldL_{U_{3}}^{ld} has the type of behavior EXP, the information system U3U_{3} belongs to the class W3lW_{3}^{l} – see Lemma 6 and Table 1. ∎∎

Proof of Theorem 1.

The statements of the theorem follow from Lemmas 6-9. ∎∎

5 Proofs of Theorems 2-4

First, we prove a number of auxiliary statements.

Lemma 10.

Let U=(A,F)U=(A,F) be an infinite binary information system. Then the information system UU is ldld-reachable.

Proof.

Let z=(ν,f1,,fn)z=(\nu,f_{1},\ldots,f_{n}) be a problem over UU. Then there exists a decision tree Γ\Gamma over zz, which solves this problem deterministically and whose depth is at most hUld(n)h_{U}^{ld}(n). By removal of some nodes and edges from Γ\Gamma, we can obtain a decision tree Γ\Gamma^{\prime} over zz, which solves the problem zz deterministically and in which each working node has exactly two leaving edges and each complete path is realizable. Let ξ1\xi_{1} and ξ2\xi_{2} be different complete paths in Γ\Gamma^{\prime}, a1A(ξ1)a_{1}\in A(\xi_{1}), and a2A(ξ2)a_{2}\in A(\xi_{2}). It is easy to show that (f1(a1),,fn(a1))(f1(a2),,fn(a2))(f_{1}(a_{1}),\ldots,f_{n}(a_{1}))\neq(f_{1}(a_{2}),\ldots,f_{n}(a_{2})). Therefore Lt(Γ)NU(f1,,fn)NU(n)L_{t}(\Gamma^{\prime})\leq N_{U}(f_{1},\ldots,f_{n})\leq N_{U}(n). It is clear that ΓGd2(U)\Gamma^{\prime}\in G_{d}^{2}(U). By Lemma 3, Lw(Γ)=Lt(Γ)1L_{w}(\Gamma^{\prime})=L_{t}(\Gamma^{\prime})-1. Therefore L(Γ)2NU(n)L(\Gamma^{\prime})\leq 2N_{U}(n). By Proposition 5, 2NU(n)=LUld(n)2N_{U}(n)=L_{U}^{ld}(n). Taking into account that h(Γ)hUld(n)h(\Gamma^{\prime})\leq h_{U}^{ld}(n) and zz is an arbitrary problem over UU with dimz=n\dim z=n, we obtain that UU is ldld-reachable. ∎∎

Lemma 11.

Let UU be an infinite binary information system such that hUla(n)=nh_{U}^{la}(n)=n for any nn\in\mathbb{N}. Then the information system UU is lala-reachable.

Proof.

Let z=(ν,f1,,fn)z=(\nu,f_{1},\ldots,f_{n}) be a problem over UU and Γ\Gamma be a decision tree over zz that solves the problem zz deterministically and satisfies the following conditions: the number of working nodes in each complete path of Γ\Gamma is equal to nn and these nodes in the order from the root to a terminal node are labeled with attributes f1,,fnf_{1},\ldots,f_{n}. Remove from Γ\Gamma all nodes and edges that do not belong to realizable complete paths. Let ww be a working node in the obtained tree that has only one leaving edge dd entering a node vv. We remove the node ww and edge dd and connect the edge ee entering ww to the node vv. We do the same with all working nodes with only one leaving edge. Denote by Γ\Gamma^{\prime} the obtained decision tree. It is clear that Γ\Gamma^{\prime} solves the problem zz deterministically and hence nondeterministically, ΓGd2(U)\Gamma^{\prime}\in G_{d}^{2}(U), and Lt(Γ)NU(f1,,fn)NU(n)L_{t}(\Gamma^{\prime})\leq N_{U}(f_{1},\ldots,f_{n})\leq N_{U}(n). By Lemma 3, Lw(Γ)=Lt(Γ)1L_{w}(\Gamma^{\prime})=L_{t}(\Gamma^{\prime})-1. Therefore L(Γ)2NU(n)L(\Gamma^{\prime})\leq 2N_{U}(n). Using Proposition 5, we obtain L(Γ)LUla(n)L(\Gamma^{\prime})\leq L_{U}^{la}(n). It is clear that h(Γ)n=hUla(n)h(\Gamma^{\prime})\leq n=h_{U}^{la}(n). Therefore UU is lala-reachable. ∎∎

Lemma 12.

Let UU be an infinite binary information system, which satisfies the condition of reduction. Then the information system UU is not lala-reachable.

Proof.

By Proposition 2, the function hUla(n)h_{U}^{la}(n) is bounded from above by a positive constant cc. By Proposition 6, the function NU(n)N_{U}(n) is not bounded from above by a constant. Choose nn\in\mathbb{N} such that NU(n)>22cN_{U}(n)>2^{2c}. Let z=(ν,f1,,fn)z=(\nu,f_{1},\ldots,f_{n}) be a problem over UU such that ν(δ¯1)ν(δ¯2)\nu(\bar{\delta}_{1})\neq\nu(\bar{\delta}_{2}) for any δ¯1,δ¯2{0,1}n\bar{\delta}_{1},\bar{\delta}_{2}\in\{0,1\}^{n}, δ¯1δ¯2\bar{\delta}_{1}\neq\bar{\delta}_{2}, and NU(f1,,fn)=NU(n)N_{U}(f_{1},\ldots,f_{n})=N_{U}(n). Let Γ\Gamma be a decision tree over zz, which solves the problem zz nondeterministically, for which h(Γ)hUla(n)ch(\Gamma)\leq h_{U}^{la}(n)\leq c, and which has the minimum number of nodes among such trees. In the same way as it was done in the proof of Lemma 2, we can prove that ΓGaf(U)\Gamma\in G_{a}^{f}(U). It is clear that Lt(Γ)NU(f1,,fn)=NU(n)L_{t}(\Gamma)\geq N_{U}(f_{1},\ldots,f_{n})=N_{U}(n). Let us assume that ΓGd2(U)\Gamma\in G_{d}^{2}(U). Then it is easy to show that h(Γ)log2Lt(Γ)log2NU(n)>2ch(\Gamma)\geq\log_{2}L_{t}(\Gamma)\geq\log_{2}N_{U}(n)>2c, which is impossible by the choice of Γ\Gamma. Therefore ΓGaf(U)Gd2(U)\Gamma\in G_{a}^{f}(U)\setminus G_{d}^{2}(U). By Lemma 4, Lw(Γ)>Lt(Γ)1NU(n)1L_{w}(\Gamma)>L_{t}(\Gamma)-1\geq N_{U}(n)-1. Using Proposition 5, we obtain L(Γ)>2NU(n)=LUla(n)L(\Gamma)>2N_{U}(n)=L_{U}^{la}(n). Therefore UU is not lala-reachable. ∎∎

Lemma 13.

Let U=(A,F)U=(A,F) be an infinite binary information system, which satisfies the condition of reduction with parameter mm. Then (m,(m+1)LUla(n)/2+1)(m,(m+1)L_{U}^{la}(n)/2+1) is a boundary lala-pair of the system UU.

Proof.

Let z=(ν,f1,,fn)z=(\nu,f_{1},\ldots,f_{n}) be a problem over UU. We now describe a decision tree Γ\Gamma over zz, which solves the problem zz nondeterministically and for which h(Γ)mh(\Gamma)\leq m and L(Γ)(m+1)LUla(n)/2+1L(\Gamma)\leq(m+1)L_{U}^{la}(n)/2+1. For each tuple δ¯=(δ1,,δn){0,1}n\bar{\delta}=(\delta_{1},\ldots,\delta_{n})\in\{0,1\}^{n} for which the system of equations

Sδ¯={f1(x)=δ1,,fn(x)=δn}S_{\bar{\delta}}=\{f_{1}(x)=\delta_{1},\ldots,f_{n}(x)=\delta_{n}\}

has a solution from AA, we describe a complete path ξδ¯\xi_{\bar{\delta}}. Since the information system UU satisfies the condition of reduction with parameter mm, there exists a subsystem

Sδ¯={fi1(x)=δi1,,fit(x)=δit}S_{\bar{\delta}}^{\prime}=\{f_{i_{1}}(x)=\delta_{i_{1}},\ldots,f_{i_{t}}(x)=\delta_{i_{t}}\}

of the system Sδ¯S_{\bar{\delta}}, which has the same set of solutions and for which tmt\leq m. Then

ξδ¯=v0,d0,v1,d1,,vt,dt,vt+1,\xi_{\bar{\delta}}=v_{0},d_{0},v_{1},d_{1},\ldots,v_{t},d_{t},v_{t+1},

where the node v0v_{0} and the edge d0d_{0} are not labeled, for j=1,,tj=1,\ldots,t, the node vjv_{j} is labeled with the attribute fijf_{i_{j}} and the edge djd_{j} is labeled with the number δij\delta_{i_{j}}, and the node vt+1v_{t+1} is labeled with the number ν(δ¯)\nu(\bar{\delta}). We merge initial nodes of all such complete paths and denote by Γ\Gamma the obtained tree. One can show that Γ\Gamma is a decision tree over zz, which solves the problem zz nondeterministically and for which h(Γ)mh(\Gamma)\leq m. The number of the considered complete paths is equal to NU(f1,,fn)NU(n)N_{U}(f_{1},\ldots,f_{n})\leq N_{U}(n). The number of nodes in each complete paths is at most m+2m+2. Therefore L(Γ)(m+1)NU(n)+1L(\Gamma)\leq(m+1)N_{U}(n)+1. By Proposition 5, NU(n)=LUla(n)/2N_{U}(n)=L_{U}^{la}(n)/2. Hence L(Γ)(m+1)LUla(n)/2+1L(\Gamma)\leq(m+1)L_{U}^{la}(n)/2+1. Thus, (m,(m+1)LUla(n)/2+1)(m,(m+1)L_{U}^{la}(n)/2+1) is a boundary lala-pair of the system UU. ∎∎

Proof of Theorem 2.

Each information system from the class W1lW_{1}^{l} satisfies the condition of reduction (see Table 3).

(a) Let UU be an information system from the class W1lW_{1}^{l}. Using Lemma 10, we obtain that the system UU is ldld-reachable.

(b) Let UU be an information system from the class W1lW_{1}^{l}. Then, for some mm\in\mathbb{N}, the system UU satisfies the condition of reduction with parameter mm. Using Lemma 12, we obtain that the system UU is not lala-reachable. Using Lemma 13, we obtain that (m,(m+1)LUla(n)/2+1)(m,(m+1)L_{U}^{la}(n)/2+1) is a boundary lala-pair of the system UU. ∎∎

Proof of Theorem 3.

Each information system from the class W2lW_{2}^{l} does not satisfy the condition of reduction (see Table 3).

(a) Let UU be an information system from the class W2lW_{2}^{l}. Using Lemma 10, we obtain that the system UU is ldld-reachable.

(b) Let UU be an information system from the class W2lW_{2}^{l}. By Proposition 2, hUla(n)=nh_{U}^{la}(n)=n for any nn\in\mathbb{N}. Using Lemma 11, we obtain that the system UU is lala-reachable. ∎∎

Proof of Theorem 4.

Each information system from the class W3lW_{3}^{l} does not satisfy the condition of reduction (see Table 3).

(a) Let UU be an information system from the class W3lW_{3}^{l}. Using Lemma 10, we obtain that the system UU is ldld-reachable.

(b) Let UU be an information system from the class W3lW_{3}^{l}. By Proposition 2, hUla(n)=nh_{U}^{la}(n)=n for any nn\in\mathbb{N}. Using Lemma 11, we obtain that the system UU is lala-reachable. ∎∎

6 Conclusions

In this paper, we divided the set of all infinite binary information systems into three complexity classes depending on the worst case time and space complexity of deterministic and nondeterministic decision trees. This allowed us to identify nontrivial relationships between deterministic decision trees and decision rule systems represented by nondeterministic decision trees. For each complexity class, we studied issues related to time-space trade-off for deterministic and nondeterministic decision trees. In the future, we are planning to generalize the obtained results to the case of classes of decision tables closed under operations of removal of attributes and changing decisions attached to rows of decision tables.

Acknowledgements

Research reported in this publication was supported by King Abdullah University of Science and Technology (KAUST).

References

  • [1] AbouEisha, H., Amin, T., Chikalov, I., Hussain, S., Moshkov, M.: Extensions of Dynamic Programming for Combinatorial Optimization and Data Mining, Intelligent Systems Reference Library, vol. 146. Springer, Cham (2019)
  • [2] Alsolami, F., Azad, M., Chikalov, I., Moshkov, M.: Decision and Inhibitory Trees and Rules for Decision Tables with Many-valued Decisions, Intelligent Systems Reference Library, vol. 156. Springer, Cham (2020)
  • [3] Ben-Or, M.: Lower bounds for algebraic computation trees (preliminary report). In: 15th Annual ACM Symposium on Theory of Computing, STOC 1983, pp. 80–86 (1983)
  • [4] Boros, E., Hammer, P.L., Ibaraki, T., Kogan, A.: Logical analysis of numerical data. Math. Program. 79, 163–190 (1997)
  • [5] Boros, E., Hammer, P.L., Ibaraki, T., Kogan, A., Mayoraz, E., Muchnik, I.B.: An implementation of logical analysis of data. IEEE Trans. Knowl. Data Eng. 12(2), 292–306 (2000)
  • [6] Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Wadsworth, Belmont, CA (1984)
  • [7] Chikalov, I., Lozin, V.V., Lozina, I., Moshkov, M., Nguyen, H.S., Skowron, A., Zielosko, B.: Three Approaches to Data Analysis - Test Theory, Rough Sets and Logical Analysis of Data, Intelligent Systems Reference Library, vol. 41. Springer, Berlin Heidelberg (2013)
  • [8] Dobkin, D.P., Lipton, R.J.: A lower bound of the (1/2)n2(1/2)n^{2} on linear search programs for the knapsack problem. J. Comput. Syst. Sci. 16(3), 413–417 (1978)
  • [9] Dobkin, D.P., Lipton, R.J.: On the complexity of computations under varying sets of primitives. J. Comput. Syst. Sci. 18(1), 86–91 (1979)
  • [10] Durdymyradov, K., Moshkov, M.: Time and space complexity of deterministic and nondeterministic decision trees. Local approach. In: 2023 IEEE International Conference on Big Data. 9th Special Session on Information Granulation in Data Science and Scalable Computing (2023). (to appear)
  • [11] Fürnkranz, J., Gamberger, D., Lavrac, N.: Foundations of Rule Learning. Cognitive Technologies. Springer, Berlin Heidelberg (2012)
  • [12] Gabrielov, A., Vorobjov, N.: On topological lower bounds for algebraic computation trees. Found. Comput. Math. 17(1), 61–72 (2017)
  • [13] Grigoriev, D., Karpinski, M., Vorobjov, N.: Improved lower bound on testing membership to a polyhedron by algebraic decision trees. In: 36th Annual Symposium on Foundations of Computer Science, FOCS 1995, pp. 258–265 (1995)
  • [14] Grigoriev, D., Karpinski, M., Yao, A.C.: An exponential lower bound on the size of algebraic decision trees for Max. Computational Complexity 7(3), 193–203 (1998)
  • [15] Molnar, C.: Interpretable Machine Learning. A Guide for Making Black Box Models Explainable, 2 edn. (2022). URL christophm.github.io/interpretable-ml-book/
  • [16] Morávek, J.: A localization problem in geometry and complexity of discrete programming. Kybernetika 8(6), 498–516 (1972)
  • [17] Moshkov, M.: Decision Trees. Theory and Applications (in Russian). Nizhny Novgorod University Publishers, Nizhny Novgorod (1994)
  • [18] Moshkov, M.: Optimization problems for decision trees. Fundam. Inform. 21(4), 391–401 (1994)
  • [19] Moshkov, M.: Two approaches to investigation of deterministic and nondeterministic decision trees complexity. In: 2nd World Conference on the Fundamentals of Artificial Intelligence, WOCFAI 1995, pp. 275–280 (1995)
  • [20] Moshkov, M.: Comparative analysis of deterministic and nondeterministic decision tree complexity. Global approach. Fundam. Inform. 25(2), 201–214 (1996)
  • [21] Moshkov, M.: Comparative analysis of deterministic and nondeterministic decision tree complexity. Local approach. In: Trans. Rough Sets IV, Lecture Notes in Computer Science, vol. 3700, pp. 125–143. Springer, Berlin Heidelberg (2005)
  • [22] Moshkov, M.: Time complexity of decision trees. In: Trans. Rough Sets III, Lecture Notes in Computer Science, vol. 3400, pp. 244–459. Springer, Berlin Heidelberg (2005)
  • [23] Moshkov, M.: On the class of restricted linear information systems. Discret. Math. 307(22), 2837–2844 (2007)
  • [24] Moshkov, M.: Time and space complexity of deterministic and nondeterministic decision trees. Ann. Math. Artif. Intell. 91(1), 45–74 (2023)
  • [25] Moshkov, M., Piliszczuk, M., Zielosko, B.: Partial Covers, Reducts and Decision Rules in Rough Sets - Theory and Applications, Studies in Computational Intelligence, vol. 145. Springer, Berlin Heidelberg (2008)
  • [26] Moshkov, M., Zielosko, B.: Combinatorial Machine Learning - A Rough Set Approach, Studies in Computational Intelligence, vol. 360. Springer, Berlin Heidelberg (2011)
  • [27] Naiman, D.Q., Wynn, H.P.: Independence number and the complexity of families of sets. Discr. Math. 154, 203–216 (1996)
  • [28] Pawlak, Z.: Information systems theoretical foundations. Inf. Syst. 6(3), 205–218 (1981)
  • [29] Pawlak, Z.: Rough Sets - Theoretical Aspects of Reasoning about Data, Theory and Decision Library : series D, vol. 9. Kluwer (1991)
  • [30] Pawlak, Z., Polkowski, L., Skowron, A.: Rough set theory. In: B.W. Wah (ed.) Wiley Encyclopedia of Computer Science and Engineering. John Wiley & Sons, Inc. (2008). URL https://doi.org/10.1002/9780470050118.ecse466
  • [31] Pawlak, Z., Skowron, A.: Rudiments of rough sets. Inf. Sci. 177(1), 3–27 (2007)
  • [32] Rokach, L., Maimon, O.: Data Mining with Decision Trees - Theory and Applications, Series in Machine Perception and Artificial Intelligence, vol. 69. WorldScientific, Singapore (2007)
  • [33] Sauer, N.: On the density of families of sets. J. of Combinatorial Theory (A) 13, 145–147 (1972)
  • [34] Shelah, S.: A combinatorial problem; stability and order for models and theories in infinitary languages. Pacific J. of Mathematics 41, 241–261 (1972)
  • [35] Skowron, A., Rauszer, C.: The discernibility matrices and functions in information systems. In: R. Slowinski (ed.) Intelligent Decision Support - Handbook of Applications and Advances of the Rough Sets Theory, Theory and Decision Library, vol. 11, pp. 331–362. Springer (1992)
  • [36] Steele, J.M., Yao, A.C.: Lower bounds for algebraic decision trees. J. Algorithms 3(1), 1–8 (1982)
  • [37] Yao, A.C.: Algebraic decision trees and Euler characteristics. In: 33rd Annual Symposium on Foundations of Computer Science, FOCS 1992, pp. 268–277 (1992)
  • [38] Yao, A.C.: Decision tree complexity and Betti numbers. In: 26th Annual ACM Symposium on Theory of Computing, STOC 1994, pp. 615–624 (1994)