A Local Approach to Studying the Time and Space Complexity of Deterministic and Nondeterministic Decision Trees
Abstract
Decision trees and decision rules are intensively studied and used in different areas of computer science. The questions important for the theory of decision trees and rules include relations between decision trees and decision rule systems, time-space tradeoff for decision trees, and time-space tradeoff for decision rule systems. In this paper, we study arbitrary infinite binary information systems each of which consists of an infinite set called universe and an infinite set of two-valued functions (attributes) defined on the universe. We consider the notion of a problem over information system, which is described by a finite number of attributes and a mapping associating a decision to each tuple of attribute values. As algorithms for problem solving, we investigate deterministic and nondeterministic decision trees that use only attributes from the problem description. Nondeterministic decision trees are representations of decision rule systems that sometimes have less space complexity than the original rule systems. As time and space complexity, we study the depth and the number of nodes in the decision trees. In the worst case, with the growth of the number of attributes in the problem description, (i) the minimum depth of deterministic decision trees grows either as a logarithm or linearly, (ii) the minimum depth of nondeterministic decision trees either is bounded from above by a constant or grows linearly, (iii) the minimum number of nodes in deterministic decision trees has either polynomial or exponential growth, and (iv) the minimum number of nodes in nondeterministic decision trees has either polynomial or exponential growth. Based on these results, we divide the set of all infinite binary information systems into three complexity classes. This allows us to identify nontrivial relationships between deterministic decision trees and decision rules systems represented by nondeterministic decision trees. For each class, we study issues related to time-space trade-off for deterministic and nondeterministic decision trees.
Keywords: Deterministic decision trees, Nondeterministic decision trees, Time complexity, Space complexity, Complexity classes, Time-space trade-off.
1 Introduction
Decision trees and decision rules are intensively studied and used in different areas of computer science. The questions important for the theory of decision trees and rules include relations between decision trees and decision rule systems, time-space tradeoff for decision trees, and time-space tradeoff for decision rule systems.
In this paper, instead of decision rule systems we study nondeterministic decision trees. These trees can be considered as representations of decision rule systems that sometimes have less space complexity than the original rule systems. We study problems over infinite binary information systems and divide the set of all infinite binary information systems into three complexity classes depending on the worst case time and space complexity of deterministic and nondeterministic decision trees solving problems. This allows us to identify nontrivial relationships between deterministic decision trees and decision rule systems represented by nondeterministic decision trees. For each complexity class, we study issues related to time-space trade-off for deterministic and nondeterministic decision trees.
Decision trees [1, 2, 6, 22, 26, 32] and systems of decision rules [4, 5, 7, 11, 25, 29, 30, 31, 35] are widely used as classifiers to predict a decision for a new object, as a means of knowledge representation, and as algorithms for solving problems of fault diagnosis, computational geometry, combinatorial optimization, etc.
Decision trees and rules are among the most interpretable models for classifying and representing knowledge [15]. In order to better understand decision trees, we should not only minimize the number of their nodes, but also the depth of decision trees to avoid the consideration of long conjunctions of conditions corresponding to long paths in these trees. Similarly, for decision rule systems, we should minimize both the total length of rules and the maximum length of a rule in the system. When we consider decision trees and decision rule systems as algorithms (usually sequential for decision trees and parallel for decision rule systems), we should have in mind the same bi-criteria optimization problems to minimize space and time complexity of these algorithms.
In this paper, we represent systems of decision rules as nondeterministic decision trees to compress them and to emphasize the possibility of processing different decision rules in parallel. We consider deterministic and nondeterministic decision trees as algorithms and study their space and time complexity, paying particular attention to time and space complexity relationships. Examples of deterministic and nondeterministic decision trees computing Boolean function can be found in Fig. 1.

Infinite systems of attributes and decision trees over these systems have been intensively studied, especially systems of linear and algebraic attributes and the corresponding linear [8, 9, 16] and algebraic decision trees [3, 12, 13, 14, 36, 37, 38]. Years ago, one of the authors initiated the study of decision trees over arbitrary infinite systems of attributes [17, 18, 19, 20, 21]. In this paper, we study decision trees over arbitrary infinite systems of binary attributes represented in the form of infinite binary information systems.
General information system introduced by Pawlak [28] consists of a universe (a set of objects) and a set of attributes (functions with finite image) defined on the universe. An information system is called infinite, if both its universe and the set of attributes are infinite. An information system is called binary if each of its attributes has values from the set .
Any problem over an information system is described by a finite number of attributes that divide the universe into domains in which these attributes have fixed values. A decision is attached to each domain. For a given object from the universe, it is required to find the decision attached to the domain containing this object.
As algorithms solving these problems, deterministic and nondeterministic decision trees are studied. As time complexity of a decision tree, we consider its depth, i.e., the maximum number of nodes labeled with attributes in a path from the root to a terminal node. As space complexity of a decision tree, we consider the number of its nodes.
There are two approaches to the study of infinite information systems: local when in decision trees solving a problem we can use only attributes from the problem description, and global when in the decision trees solving a problem we can use arbitrary attributes from the considered information system. In this paper, we study decision trees in the framework of the local approach.
To the best of our knowledge, time-space trade-offs for decision trees over infinite information systems were not studied in the framework of the local approach prior to the present paper except for its conference version [10], which does not contain proofs. The paper [24] was the first one in which the time-space trade-offs for decision trees over infinite information systems were studied in the framework of the global approach.
Results obtained in [24] are different from the results obtained in the present paper. Apart from the difference in approach, in [24], the set of all infinite information systems is divided not into three but into five families and the criteria for the behavior of functions characterizing the minimum depth of the deterministic and nondeterministic decision tree are completely different in comparison with the present paper. However, many of the definitions and results of the two papers appear similar. In the present paper, we use some auxiliary statements proved in [24] and adapt some proofs from [24] to the case of the local approach.
Based on the results obtained in the present paper and in [22, 26], we describe possible types of behavior of four functions that characterize worst case time and space complexity of deterministic and nondeterministic decision trees over an infinite binary information system (index refers to the local approach). Decision trees solving a problem can use only attributes from the problem description.
The function characterizes the growth in the worst case of the minimum depth of a deterministic decision tree solving a problem with the growth of the number of attributes in the problem description. The function is either grows as a logarithm or linearly.
The function characterizes the growth in the worst case of the minimum depth of a nondeterministic decision tree solving a problem with the growth of the number of attributes in the problem description. The function is either bounded from above by a constant or grows linearly.
The function characterizes the growth in the worst case of the minimum number of nodes in a deterministic decision tree solving a problem with the growth of the number of attributes in the problem description. The function has either polynomial or exponential growth.
The function characterizes the growth in the worst case of the minimum number of nodes in a nondeterministic decision tree solving a problem with the growth of the number of attributes in the problem description. The function has either polynomial or exponential growth.
Each of the functions has two types of behavior. The tuple has three types of behavior. All these types are described in the paper and each type is illustrated by an example.
There are three complexity classes of infinite binary information systems corresponding to the three possible types of the tuple . For each class, we study joint behavior of time and space complexity of decision trees. The obtained results are related to time-space trade-off for deterministic and nondeterministic decision trees.
A pair of functions is called a boundary -pair of the information system if, for any problem over , there exists a deterministic decision tree over , which solves this problem and for which the depth is at most and the number of nodes is at most , where is the number of attributes in the problem description. An information system is called -reachable if the pair is a boundary -pair of the system . For nondeterministic decision trees, the notions of a boundary -pair of an information system and -reachable information system are defined in a similar way. For deterministic decision trees, the best situation is when the considered information system is -reachable: for any boundary -pair for an information system and any natural , and . For nondeterministic decision trees, the best situation is when the information system is -reachable.
For all complexity classes, all information systems from the class are -reachable. For two out of the three complexity classes, all information systems from the class are -reachable. For the remaining class, all information systems from the class are not -reachable. For all information systems that are not -reachable, we find nontrivial boundary -pairs, which are sufficiently close to .
2 Main Results
Let be an infinite set and be an infinite set of functions that are defined on and have values from the set . The pair is called an infinite binary information system [28], the elements of the set are called objects, and the functions from are called attributes. The set is called sometimes the universe of the information system .
A problem over is a tuple of the form , where , is the set of natural numbers , and . We do not require attributes to be pairwise distinct. The problem consists in finding the value of the function for a given object . The value is called the dimension of the problem .
Various problems of combinatorial optimization, pattern recognition, fault diagnosis, probabilistic reasoning, computational geometry, etc., can be represented in this form.
As algorithms for problem solving we consider decision trees. A decision tree over the information system is a directed tree with a root in which the root and edges leaving the root are not labeled, each terminal node is labeled with a number from , each working node (which is neither the root nor a terminal node) is labeled with an attribute from , and each edge leaving a working node is labeled with a number from the set . A decision tree is called deterministic if only one edge leaves the root and edges leaving an arbitrary working node are labeled with different numbers.
Let be a decision tree over and
be a directed path from the root to a terminal node of (we call such path complete). Define a subset of the set as follows. If , then . Let and, for , the node be labeled with the attribute and the edge be labeled with the number . Then
The depth of the decision tree is the maximum number of working nodes in a complete path of . Denote by the depth of and by – the number of nodes in .
A decision tree over the information system is called a decision tree over the problem if each working node of is labeled with an attribute from the set .
The decision tree over solves the problem nondeterministically if, for any object , there exists a complete path of such that and, for each and each complete path such that , the terminal node of is labeled with the number (in this case, we can say that is a nondeterministic decision tree solving the problem ). In particular, if the decision tree solves the problem nondeterministically, then, for each complete path of , either the set is empty or the function is constant on the set . The decision tree over solves the problem deterministically if is a deterministic decision tree, which solves the problem nondeterministically (in this case, we can say that is a deterministic decision tree solving the problem ).
Let be the set of all problems over . For a problem from , let be the minimum depth of a decision tree over solving the problem deterministically, be the minimum depth of a decision tree over solving the problem nondeterministically, be the minimum number of nodes in a decision tree over solving the problem deterministically, and be the minimum number of nodes in a decision tree over solving the problem nondeterministically.
We consider four functions defined on the set in the following way: , , , and , where the maximum is taken among all problems over with . These functions describe how the minimum depth and the minimum number of nodes of deterministic and nondeterministic decision trees solving problems are growing in the worst case with the growth of problem dimension. To describe possible types of behavior of these four functions, we need to define some properties of infinite binary information systems.
Definition 1.
We will say that the information system satisfies the condition of reduction if there exists such that, for each compatible on system of equations where , and , there exists a subsystem of this system, which has the same set of solutions from and contains at most equations. In this case, we will say that satisfies the condition of reduction with parameter .
We now consider two examples of infinite binary information systems that satisfy the condition of reduction. These examples are close to ones considered in Section 3.4 of the book [1].
Example 1.
Let , be functions from to , where is the set of real numbers, and be a function from to such that if and if . Then the infinite binary information system , where , satisfies the condition of reduction with parameter . If are linear functions, then we deal with attributes corresponding to families of parallel hyperplanes in what is common for decision trees for datasets with numerical attributes only [6].
Example 2.
Let be the Euclidean plane and be a straight line (line in short) in the plane. This line divides the plane into two open half-planes and , and the line . Two attributes correspond to the line . The first attribute takes value on points from , and value on points from and . The second one takes value on points from , and value on points from and . We denote by the set of all attributes corresponding to lines in the plane. Infinite binary information systems of the form , where , are called linear information systems.
Let be a line in the plane. Let us denote by the set of all attributes corresponding to lines, which are parallel to . Let be a point in the plane. We denote by the set of all attributes corresponding to lines, which pass through . A set of attributes from is called a clone if for some line or for some point . In [23], it was proved that a linear information system satisfies the condition of reduction if and only if is the union of a finite number of clones.
Definition 2.
Let be an infinite binary information system. A subset of the set will be called independent if, for any , the system of equations has a solution from the set . The empty set of attributes is independent by definition.
Definition 3.
We define the parameter , which is called the independence dimension or I-dimension of the information system (this notion is similar to the notion of independence number of family of sets [27]) as follows. If, for each , the set contains an independent subset of cardinality , then . Otherwise, is the maximum cardinality of an independent subset of the set .
We now consider examples of infinite binary information systems with finite I-dimension and with infinite I-dimension. More examples can be found in Lemmas 7-9.
Example 3.
Let . We denote by the set of all polynomials, which have integer coefficients and depend on variables . We denote by the set of all polynomials from such that the degree of each polynomial is at most . We define infinite binary information systems and as follows: and , where , , and if and if . One can show that the system has infinite I-dimension and the system has finite I-dimension.
We now consider four statements that describe possible types of behavior of functions , , , and . The next statement follows immediately from Theorem 4.3 [22].
Proposition 1.
For any infinite binary information system , the function has one of the following two types of behavior:
(LOG) If the system satisfies the condition of reduction, then .
(LIN) If the system does not satisfy the condition of reduction, then for any .
The next statement follows immediately from Theorem 8.2 [26].
Proposition 2.
For any infinite binary information system , the function has one of the following two types of behavior:
(CON) If the system satisfies the condition of reduction, then .
(LIN) If the system does not satisfy the condition of reduction, then for any .
Proposition 3.
For any infinite binary information system , the function has one of the following two types of behavior:
(POL) If the system has finite I-dimension, then for any ,
(EXP) If the system has infinite I-dimension, then for any ,
Proposition 4.
For any infinite binary information system and any ,
Let be an infinite binary information system. Proposition 1 allows us to correspond to the function its type of behavior from the set . Proposition 2 allows us to correspond to the function its type of behavior from the set . Propositions 3 and 4 allow us to correspond to each of the functions and its type of behavior from the set . A tuple obtained from the tuple
by replacing functions with their types of behavior is called the local type of the information system . We now describe all possible local types of infinite binary information systems.
Theorem 1.
1 | ||||
---|---|---|---|---|
2 | ||||
3 |
For , we denote by the class of all infinite binary information systems, whose local type coincides with the th row of Table 1. We now study for each of these complexity classes joint behavior of the depth and number of nodes in decision trees solving problems.
For a given infinite binary information system , we will consider pairs of functions such that, for any problem over , there exists a deterministic decision tree over solving with the depth at most and the number of nodes at most . We will study such pairs and will call them boundary -pairs.
Definition 4.
A pair of functions , where and , will be called a boundary -pair of the information system if, for any problem over , there exists a decision tree over , which solves the problem deterministically and for which and , where .
We are interested in finding boundary -pairs with functions that grow as slowly as possible. It is clear that, for any boundary -pair of the information system , the following inequalities hold: and . So the best possible situation is when is a boundary -pair of .
Definition 5.
An information system will be called -reachable if the pair is a boundary -pair of the system .
We now consider similar notions for nondeterministic decision trees: the notion of boundary -pair and the notion of -reachable information system.
Definition 6.
A pair of functions , where and , will be called a boundary -pair of the information system if, for any problem over , there exists a decision tree over , which solves the problem nondeterministically and for which and , where .
Definition 7.
An information system will be called -reachable if the pair is a boundary -pair of the system .
Note that for deterministic decision trees, the best situation is when the considered information system is -reachable and for nondeterministic decision trees – when the information system is -reachable.
Each information system from the classes , and is -reachable. Each information system from the classes and is -reachable. Each information system from the class is not -reachable. For all information systems , which are not -reachable, we find nontrivial boundary -pairs that are sufficiently close to .
The obtained results are related to time-space trade-off for deterministic and nondeterministic decision trees. Details can be found in the following three theorems.
Theorem 2.
Let be an information system from the class . Then
(a) The system is -reachable.
(b) The system is not -reachable and there exists such that
is a boundary -pair of the system .
Theorem 3.
Let be an information system from the class . Then
(a) The system is -reachable.
(b) The system is -reachable.
Theorem 4.
Let be an information system from the class . Then
(a) The system is -reachable.
(b) The system is -reachable.
Table 2 summarizes Theorems 1-4. The first column contains the name of the complexity class. The next four columns describe the local type of information systems from this class. The last two columns “-pairs” and “-pairs” contain information about boundary -pairs and boundary -pairs for information systems from the considered class: “-reachable” means that all information systems from the class are -reachable, “-reachable” means that all information systems from the class are -reachable, Th. 2 (b) is a link to the corresponding statement Theorem 2 (b).
3 Proofs of Propositions 3 and 4
In this section, we consider a number of auxiliary statements and prove the two mentioned propositions.
Let be a decision tree over an infinite binary information system and be an edge of entering a node . We denote by a subtree of , whose root is the node . We say that a complete path of is realizable if .
Lemma 1.
Let be an infinite binary information system, be a problem over , and be a decision tree over , which solves the problem deterministically and for which . Then
(a) For each node of , there exists a realizable complete path that passes through this node.
(b) Each working node of has two edges leaving this node.
Proof.
(a) It is clear that there exists at least one realizable complete path that passes through the root of . Let us assume that is a node of different from the root and such that there is no a realizable complete path, which passes through . Let be an edge entering the node . We remove from the edge and the subtree . As a result, we obtain a decision tree over , which solves deterministically and for which but this is impossible by definition of .
(b) Let us assume that in there exists a working node , which has only one leaving edge entering a node . We remove from the node and the edge and connect the edge entering the node to the node . As a result, we obtain a decision tree over , which solves the problem deterministically and for which but this is impossible by definition of . ∎∎
Let be an infinite binary information system, be a decision tree over , and be an edge of . The subtree will be called full if there exist edges in such that the removal of these edges and subtrees transforms the subtree into a tree such that each terminal node of is a terminal node of , and exactly two edges labeled with the numbers and respectively leave each working node of .
Lemma 2.
Let be an infinite binary information system, be a problem over , and be a decision tree over , which solves the problem nondeterministically and for which . Then
(a) For each node of , there exists a realizable complete path that passes through this node.
(b) If a working node of has leaving edges labeled with the same number and , then the subtrees are not full.
(c) If the root of has leaving edges and , then the subtrees are not full.
Proof.
(a) The proof of item (a) is almost identical to the proof of item (a) of Lemma 1.
(b) Let be a working node of , which has leaving edges labeled with the same number, , and at least one of the subtrees is full. For the definiteness, we assume that is full. Remove from the edges and subtrees . We now show that the obtained tree solves the problem nondeterministically. Assume the contrary. Then there exists an object such that, for each complete path with , the path passes through one of the edges but it is not true. Let be a complete path such that . Then, according to the assumption, this path passes through the node . Let be the part of this path from the root of to the node . Since the edges are labeled with the same number and is a full subtree, we can find in the continuation of to a terminal node of such that the obtained complete path of satisfies the condition . Hence is a decision tree over , which solves the problem nondeterministically and for which , but this is impossible by definition of .
(c) Item (c) can be proven in the same way as item (b). ∎∎
We now consider two statements about classes of decision trees proved in [24]. Let be a decision tree. We denote by the number of terminal nodes in and by the number of working nodes in . It is clear that .
Let be an infinite binary information systems. We denote by the set of all deterministic decision trees over and by the set of all decision trees from such that each working node of the tree has two leaving edges.
Lemma 3 (Lemma 14 from [24]).
Let be an infinite binary information system. Then
(a) If , then .
(b) If , then .
We denote by the set of all decision trees over that satisfy the following conditions: (i) if a working node of has leaving edges labeled with the same number and , then the subtrees are not full, and (ii) if the root of has leaving edges and , then the subtrees are not full. One can show that .
Lemma 4 (Lemma 15 from [24]).
Let be an infinite binary information system. If , then .
Let be an infinite binary information system. For , we denote by the number of -tuples for which the system of equations
has a solution from . For , denote
It is clear that, for any , if then .
Proposition 5.
Let be an infinite binary information system. Then, for any ,
Proof.
Let be a problem over and . Let be a decision tree over , which solves the problem deterministically and for which . From Lemma 1 it follows that each working node of has two edges leaving this node and, for each node of , there exists a realizable complete path that passes through this node. Let and be different complete paths in , , and . It is easy to show that . Therefore . It is clear that . By Lemma 3, . Hence . Taking into account that is an arbitrary problem over with we obtain
Since any decision tree solving the problem deterministically solves it nondeterministically we obtain
We now show that . Let us consider a problem over such that
and, for any , if , then . Let be a decision tree over , which solves the problem nondeterministically and for which . By Lemma 2, . Using Lemmas 3 and 4 we obtain . It is clear that . Therefore , , and . ∎∎
The next statement follows directly from Lemmas 5.1 and 5.2 [22] and the evident inequality , which is true for any infinite binary information system . The proof of Lemma 5.1 from [22] is based on Theorems 4.6 and 4.7 from the same monograph that are similar to results obtained in [33, 34].
Proposition 6.
For any infinite binary information system , the function has one of the following two types of behavior:
(POL) If the system has finite I-dimension, then for any ,
(EXP) If the system has infinite I-dimension, then for any ,
Proof of Proposition 3.
4 Proof of Theorem 1
First, we prove five auxiliary statements.
Lemma 5.
Let be an infinite binary information system, which has infinite I-dimension. Then does not satisfy the condition of reduction.
Proof.
Let us assume the contrary: satisfies the condition of reduction. Then satisfies the condition of reduction with parameter for some . Since , there exists an independent subset of the set . It is clear that the system of equations
is compatible on and each proper subsystem of the system has the set of solutions different from the set of solutions of . Therefore does not satisfy the condition of reduction with parameter . ∎∎
Lemma 6.
For any infinite binary information system, its local type coincides with one of the rows of Table 1.
Proof.
To prove this statement we fill Table 3. In the first column “I-dim.” we have either “Fin” or “Inf”: “Fin” if the considered information system has finite I-dimension and “Inf” if the considered information system has infinite I-dimension. In the second column “Reduct.”, we have either “Yes” or “No”: “Yes” if the considered information system satisfies the condition of reduction and “No” otherwise.
By Lemma 5, if an information system has infinite I-dimension, then this information system does not satisfy the condition of reduction. It means that there are only three possible tuples of values of the considered two parameters of information systems, which correspond to the three rows of Table 3. The values of the considered two parameters define the types of behavior of functions , , , and according to Propositions 1-4. We see that the set of possible tuples of values in the last four columns coincides with the set of rows of Table 1. ∎∎
I-dim. | Reduct. | ||||
---|---|---|---|---|---|
Fin | Yes | ||||
Fin | No | ||||
Inf | No |
For each row of Table 1, we consider an example of infinite binary information system, whose local type coincides with this row.
For any , we define two functions and . Let . Then if and only if , and if and only if .
Define an information system as follows: and .
Lemma 7.
The information system belongs to the class , , and if , , and for any . This information system satisfies the condition of reduction with parameter and has finite I-dimension equal .
Proof.
It is easy to show that for any . Using Proposition 5 we obtain for any . Let . Consider a problem over such that, for each with , . It is clear that . Therefore each decision tree over that solves the problem deterministically has at least terminal nodes. One can show that the number of terminal nodes in is at most . Hence and . Since is an integer, . Thus, . Set . Then . One can show that (the construction of an appropriate decision tree is based on an analog of binary search, and we use only attributes from the problem description) and . Therefore and . It is clear that . Let , be an arbitrary problem over and be all pairwise different attributes from the set ordered such that . Then these attributes divide the set into nonempty domains that are sets of solutions on of the following systems of equations: , , …, , . The value is constant in each of the considered domains. Using these facts it is easy to show that there exists a decision tree over , which solves the problem nondeterministically and for which if . Therefore . One can show that there exists a problem over such that and . Therefore .
Define an information system as follows: and .
Lemma 8.
The information system belongs to the class , , , , and for any . This information system does not satisfy the condition of reduction and has finite I-dimension equal .
Proof.
It is easy to show that for any . Using Proposition 5, we obtain for any .
Define an information system as follows: and is the set of all functions from to .
Lemma 9.
The information system belongs to the class , , , , and for any . This information system does not satisfy the condition of reduction and has infinite I-dimension.
5 Proofs of Theorems 2-4
First, we prove a number of auxiliary statements.
Lemma 10.
Let be an infinite binary information system. Then the information system is -reachable.
Proof.
Let be a problem over . Then there exists a decision tree over , which solves this problem deterministically and whose depth is at most . By removal of some nodes and edges from , we can obtain a decision tree over , which solves the problem deterministically and in which each working node has exactly two leaving edges and each complete path is realizable. Let and be different complete paths in , , and . It is easy to show that . Therefore . It is clear that . By Lemma 3, . Therefore . By Proposition 5, . Taking into account that and is an arbitrary problem over with , we obtain that is -reachable. ∎∎
Lemma 11.
Let be an infinite binary information system such that for any . Then the information system is -reachable.
Proof.
Let be a problem over and be a decision tree over that solves the problem deterministically and satisfies the following conditions: the number of working nodes in each complete path of is equal to and these nodes in the order from the root to a terminal node are labeled with attributes . Remove from all nodes and edges that do not belong to realizable complete paths. Let be a working node in the obtained tree that has only one leaving edge entering a node . We remove the node and edge and connect the edge entering to the node . We do the same with all working nodes with only one leaving edge. Denote by the obtained decision tree. It is clear that solves the problem deterministically and hence nondeterministically, , and . By Lemma 3, . Therefore . Using Proposition 5, we obtain . It is clear that . Therefore is -reachable. ∎∎
Lemma 12.
Let be an infinite binary information system, which satisfies the condition of reduction. Then the information system is not -reachable.
Proof.
By Proposition 2, the function is bounded from above by a positive constant . By Proposition 6, the function is not bounded from above by a constant. Choose such that . Let be a problem over such that for any , , and . Let be a decision tree over , which solves the problem nondeterministically, for which , and which has the minimum number of nodes among such trees. In the same way as it was done in the proof of Lemma 2, we can prove that . It is clear that . Let us assume that . Then it is easy to show that , which is impossible by the choice of . Therefore . By Lemma 4, . Using Proposition 5, we obtain . Therefore is not -reachable. ∎∎
Lemma 13.
Let be an infinite binary information system, which satisfies the condition of reduction with parameter . Then is a boundary -pair of the system .
Proof.
Let be a problem over . We now describe a decision tree over , which solves the problem nondeterministically and for which and . For each tuple for which the system of equations
has a solution from , we describe a complete path . Since the information system satisfies the condition of reduction with parameter , there exists a subsystem
of the system , which has the same set of solutions and for which . Then
where the node and the edge are not labeled, for , the node is labeled with the attribute and the edge is labeled with the number , and the node is labeled with the number . We merge initial nodes of all such complete paths and denote by the obtained tree. One can show that is a decision tree over , which solves the problem nondeterministically and for which . The number of the considered complete paths is equal to . The number of nodes in each complete paths is at most . Therefore . By Proposition 5, . Hence . Thus, is a boundary -pair of the system . ∎∎
Proof of Theorem 2.
Each information system from the class satisfies the condition of reduction (see Table 3).
(a) Let be an information system from the class . Using Lemma 10, we obtain that the system is -reachable.
Proof of Theorem 3.
Each information system from the class does not satisfy the condition of reduction (see Table 3).
(a) Let be an information system from the class . Using Lemma 10, we obtain that the system is -reachable.
6 Conclusions
In this paper, we divided the set of all infinite binary information systems into three complexity classes depending on the worst case time and space complexity of deterministic and nondeterministic decision trees. This allowed us to identify nontrivial relationships between deterministic decision trees and decision rule systems represented by nondeterministic decision trees. For each complexity class, we studied issues related to time-space trade-off for deterministic and nondeterministic decision trees. In the future, we are planning to generalize the obtained results to the case of classes of decision tables closed under operations of removal of attributes and changing decisions attached to rows of decision tables.
Acknowledgements
Research reported in this publication was supported by King Abdullah University of Science and Technology (KAUST).
References
- [1] AbouEisha, H., Amin, T., Chikalov, I., Hussain, S., Moshkov, M.: Extensions of Dynamic Programming for Combinatorial Optimization and Data Mining, Intelligent Systems Reference Library, vol. 146. Springer, Cham (2019)
- [2] Alsolami, F., Azad, M., Chikalov, I., Moshkov, M.: Decision and Inhibitory Trees and Rules for Decision Tables with Many-valued Decisions, Intelligent Systems Reference Library, vol. 156. Springer, Cham (2020)
- [3] Ben-Or, M.: Lower bounds for algebraic computation trees (preliminary report). In: 15th Annual ACM Symposium on Theory of Computing, STOC 1983, pp. 80–86 (1983)
- [4] Boros, E., Hammer, P.L., Ibaraki, T., Kogan, A.: Logical analysis of numerical data. Math. Program. 79, 163–190 (1997)
- [5] Boros, E., Hammer, P.L., Ibaraki, T., Kogan, A., Mayoraz, E., Muchnik, I.B.: An implementation of logical analysis of data. IEEE Trans. Knowl. Data Eng. 12(2), 292–306 (2000)
- [6] Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Wadsworth, Belmont, CA (1984)
- [7] Chikalov, I., Lozin, V.V., Lozina, I., Moshkov, M., Nguyen, H.S., Skowron, A., Zielosko, B.: Three Approaches to Data Analysis - Test Theory, Rough Sets and Logical Analysis of Data, Intelligent Systems Reference Library, vol. 41. Springer, Berlin Heidelberg (2013)
- [8] Dobkin, D.P., Lipton, R.J.: A lower bound of the on linear search programs for the knapsack problem. J. Comput. Syst. Sci. 16(3), 413–417 (1978)
- [9] Dobkin, D.P., Lipton, R.J.: On the complexity of computations under varying sets of primitives. J. Comput. Syst. Sci. 18(1), 86–91 (1979)
- [10] Durdymyradov, K., Moshkov, M.: Time and space complexity of deterministic and nondeterministic decision trees. Local approach. In: 2023 IEEE International Conference on Big Data. 9th Special Session on Information Granulation in Data Science and Scalable Computing (2023). (to appear)
- [11] Fürnkranz, J., Gamberger, D., Lavrac, N.: Foundations of Rule Learning. Cognitive Technologies. Springer, Berlin Heidelberg (2012)
- [12] Gabrielov, A., Vorobjov, N.: On topological lower bounds for algebraic computation trees. Found. Comput. Math. 17(1), 61–72 (2017)
- [13] Grigoriev, D., Karpinski, M., Vorobjov, N.: Improved lower bound on testing membership to a polyhedron by algebraic decision trees. In: 36th Annual Symposium on Foundations of Computer Science, FOCS 1995, pp. 258–265 (1995)
- [14] Grigoriev, D., Karpinski, M., Yao, A.C.: An exponential lower bound on the size of algebraic decision trees for Max. Computational Complexity 7(3), 193–203 (1998)
- [15] Molnar, C.: Interpretable Machine Learning. A Guide for Making Black Box Models Explainable, 2 edn. (2022). URL christophm.github.io/interpretable-ml-book/
- [16] Morávek, J.: A localization problem in geometry and complexity of discrete programming. Kybernetika 8(6), 498–516 (1972)
- [17] Moshkov, M.: Decision Trees. Theory and Applications (in Russian). Nizhny Novgorod University Publishers, Nizhny Novgorod (1994)
- [18] Moshkov, M.: Optimization problems for decision trees. Fundam. Inform. 21(4), 391–401 (1994)
- [19] Moshkov, M.: Two approaches to investigation of deterministic and nondeterministic decision trees complexity. In: 2nd World Conference on the Fundamentals of Artificial Intelligence, WOCFAI 1995, pp. 275–280 (1995)
- [20] Moshkov, M.: Comparative analysis of deterministic and nondeterministic decision tree complexity. Global approach. Fundam. Inform. 25(2), 201–214 (1996)
- [21] Moshkov, M.: Comparative analysis of deterministic and nondeterministic decision tree complexity. Local approach. In: Trans. Rough Sets IV, Lecture Notes in Computer Science, vol. 3700, pp. 125–143. Springer, Berlin Heidelberg (2005)
- [22] Moshkov, M.: Time complexity of decision trees. In: Trans. Rough Sets III, Lecture Notes in Computer Science, vol. 3400, pp. 244–459. Springer, Berlin Heidelberg (2005)
- [23] Moshkov, M.: On the class of restricted linear information systems. Discret. Math. 307(22), 2837–2844 (2007)
- [24] Moshkov, M.: Time and space complexity of deterministic and nondeterministic decision trees. Ann. Math. Artif. Intell. 91(1), 45–74 (2023)
- [25] Moshkov, M., Piliszczuk, M., Zielosko, B.: Partial Covers, Reducts and Decision Rules in Rough Sets - Theory and Applications, Studies in Computational Intelligence, vol. 145. Springer, Berlin Heidelberg (2008)
- [26] Moshkov, M., Zielosko, B.: Combinatorial Machine Learning - A Rough Set Approach, Studies in Computational Intelligence, vol. 360. Springer, Berlin Heidelberg (2011)
- [27] Naiman, D.Q., Wynn, H.P.: Independence number and the complexity of families of sets. Discr. Math. 154, 203–216 (1996)
- [28] Pawlak, Z.: Information systems theoretical foundations. Inf. Syst. 6(3), 205–218 (1981)
- [29] Pawlak, Z.: Rough Sets - Theoretical Aspects of Reasoning about Data, Theory and Decision Library : series D, vol. 9. Kluwer (1991)
- [30] Pawlak, Z., Polkowski, L., Skowron, A.: Rough set theory. In: B.W. Wah (ed.) Wiley Encyclopedia of Computer Science and Engineering. John Wiley & Sons, Inc. (2008). URL https://doi.org/10.1002/9780470050118.ecse466
- [31] Pawlak, Z., Skowron, A.: Rudiments of rough sets. Inf. Sci. 177(1), 3–27 (2007)
- [32] Rokach, L., Maimon, O.: Data Mining with Decision Trees - Theory and Applications, Series in Machine Perception and Artificial Intelligence, vol. 69. WorldScientific, Singapore (2007)
- [33] Sauer, N.: On the density of families of sets. J. of Combinatorial Theory (A) 13, 145–147 (1972)
- [34] Shelah, S.: A combinatorial problem; stability and order for models and theories in infinitary languages. Pacific J. of Mathematics 41, 241–261 (1972)
- [35] Skowron, A., Rauszer, C.: The discernibility matrices and functions in information systems. In: R. Slowinski (ed.) Intelligent Decision Support - Handbook of Applications and Advances of the Rough Sets Theory, Theory and Decision Library, vol. 11, pp. 331–362. Springer (1992)
- [36] Steele, J.M., Yao, A.C.: Lower bounds for algebraic decision trees. J. Algorithms 3(1), 1–8 (1982)
- [37] Yao, A.C.: Algebraic decision trees and Euler characteristics. In: 33rd Annual Symposium on Foundations of Computer Science, FOCS 1992, pp. 268–277 (1992)
- [38] Yao, A.C.: Decision tree complexity and Betti numbers. In: 26th Annual ACM Symposium on Theory of Computing, STOC 1994, pp. 615–624 (1994)