Classifying All Transducer Degrees Below
Abstract
We answer an open question in the theory of transducer degrees initially posed in [3], on the structure of polynomial transducer degrees, in particular the question of what degrees, if any, lie below the degree of . Transducer degrees are the equivalence classes formed by word transformations which can be realized by a finite-state transducer. While there are no general techniques to tell if a word can be transformed into via an FST, the work of Endrullis et al. in [2] provides a test for the class of spiralling functions, which includes all polynomials. We classify fully the degrees of all cubic polynomials which are below , and many of the methods can also be used to classify the degrees of polynomials of higher orders.
1 Introduction
Finite-state transducers (FSTs) and infinite sequences are both fairly natural objects in mathematics, but not much is known about the structure of the order induced by the transducibility relation on infinite sequences ( if can be transduced into via an FST). However, there are many parallels between the degrees induced by transducibility and Turing degrees, so it’s natural to begin by asking the same questions that have been asked (and answered) for Turing degrees. Many initial results have already been obtained (mostly in [1]), for instance:
-
•
The bottom degree 0 is the set of all ultimately periodic sequences (sequences of the form for some words ,)
-
•
The partial order of degrees is not dense or well-founded
-
•
There are no maximal degrees
-
•
A set of degrees has an upper bound if and only if it is countable
-
•
There are at least atom degrees (degrees which are exactly one step above 0)
The last result in particular was proven in (Theorem 32, [4]) when it was shown that there is a way to construct an atomic degree for every , where is a polynomial of order . (To avoid confusion between degrees of transducibility and degrees of polynomials, we use the term ”order” for the degree of a polynomial.) However, until now, all that was known about the structure of degrees of polynomials is that polynomials of different orders are always incomparable, and that the degree of is strictly greater than the degree of . We will provide a full classification of all degrees below the degree of , with methods that will be useful for classifying degrees of higher order polynomials.
2 Preliminaries
Let be an alphabet. Recall that the empty word is denoted by . is the set of all finite words over , and is \. Denote by the set of infinite sequences over . is the set of all finite and infinite words over , i.e. . For and k in , the -th shift of f is denoted by for all in .
We use a boldface letter like to denote an ordered tuple , and we use to denote the cyclic shift of : .
3 Finite-State Transducers and Transducer Degrees
We will mostly focus on the definitions and results which are useful for the original material in this paper. For a more complete background of the known results for transducer degrees and their proofs, see [1, 2, 3, 4].
For the following definitions for FSTs, we restrict ourselves to only considering complete pure sequential FSTs, where for every state and every input letter , there is exactly one successor state , and the function realized by such a transducer preserves prefixes (if is a prefix of , then is a prefix of ). We also restrict ourselves to using only 2 = {0,1} as both the input and output alphabet for convenience.
Definition 3.1.
A finite-state transducer is a tuple where is a finite set of states, is the initial state, is the transition function, and is the output function.
Note that and can be extended () as follows:
Intuitively, the above equations simply correspond to inputting a string of letters as opposed to just one letter, and reading the string one letter at a time. So then the function induced by the FST on (finite or infinite) words is defined by , the value of the output after inputting , starting at the initial state . This allows us to give the following definition:
Definition 3.2.
Let be an FST, and let be infinite sequences. We say that to , or that is the - of , if . In general, for any two infinite sequences we say that if there exists some so that .
This relation is reflexive, and can be shown to be transitive by composition of FSTs (See Lemma 8, [1]). If for some we have but not vice versa, we say . If we do have and then we say that , and we use to denote the equivalence class of . We call the degree of .
4 Weight Products and Useful Results
Definition 4.1.
For a function we define the sequence by
Having defined in this way, some relatively simple initial results have been obtained in [2], which we state here.
Lemma 4.2.
Let We have the following equivalences and inequalities:
-
1.
for
-
2.
-
3.
-
4.
for
-
5.
One interesting consequence of the third equality is that any polynomial with a positive leading coefficient can be thought of as a stream and thus associated with a transducer degree, even if some of its values happen to be negative. For instance, the polynomial can’t directly be interpreted as a stream, since it is negative for . However, if we take , then this polynomial is nonnegative, and therefore corresponds to a stream (and thus a degree). So even though it’s technically incorrect, it will be convenient sometimes to refer to a degree such as , when we mean more precisely the degree for any . Similarly, the first equality allows us to refer to the degree of a function with rational coefficients, where we really mean the degree of the corresponding function multiplied by the appropriate scalar to eliminate any fractional coefficients.
Now we move on to weight products. For this paper we don’t provide the full context or proof for the main result we need (Theorem 4.4, which provides us with an incredibly useful result for comparing polynomial degrees), but a more detailed explanation can be found in (Theorem 21, [3]). We start by defining a weight.
Definition 4.3.
A weight is a tuple with each . If for all then we say the weight is constant. To distinguish between weights and tuples of weights, weights will not be bolded but tuples of weights will.
Given a weight as above and a function we can define as:
We are ready to define the weighted product. Let be a tuple of weights, with being the cyclic shift .
Then the weighted product of with , written as , is defined in the following way:
Here and indicates the length of the tuple . We call a weighted product if for all . Note that since is a finite tuple of finite tuples in , we can take the LCM of all of the denominators and multiply through to make the product natural. Since this does not change the degree of the resulting function (by Lemma 4.2.1), from now on we will assume that all weight products are natural.
The following image provides a more intuitive picture of how the weight product works, by showing pictorially how to compute the weight product of the tuple of weights with an arbitrary function , where :
Now that we have defined weight products we can proceed to state the main result that we need for the rest of the paper. The full proof of this result can be found in (Theorem 21, [3]).
Theorem 4.4.
Let be polynomials. Then if and only if there exists a tuple of weights and integers such that .
The usefulness of this theorem for comparing polynomial degrees cannot be overstated. Instead of being forced to consider all possible transductions of polynomial streams by any possible transducer, we can instead narrow our focus strictly to weight products. We also have a similar theorem from [2] which allows us to narrow our focus to only polynomial degrees. The following theorem guarantees that all degrees (not just polynomial degrees) below are equivalent to the degree of a weight product of . Therefore we do not miss anything by only considering weight products of .
Theorem 4.5.
Let be a polynomial, and . Then if and only if for some integer , and a tuple of weights .
We can narrow our focus even further with a new result that lets us focus on a particular subset of weight tuples, the tuples with only one weight. Note that by the definition of weight product, if we take the product of a function with a weight tuple with a single weight , it is equal to the sum .
Lemma 4.6.
Let and suppose is a polynomial such that there exists a tuple of weights with . Then there exists a single weight such that .
Proof.
Let , , be as above, and suppose has more than one weight. By shifting as necessary, we can assume WLOG that has the shortest length of all of the weights in . Let .
We have for all mod ,
Now by assumption, had the shortest length of the weights in , which means that if we take this length (which is ) times the number of weights , this must be less than or equal to . Therefore we can construct the following weight :
where each is in position , and after we have zeroes, so that this weight has length L. Then for all mod , . However, all three expressions in this equation are polynomials. Since they are equal for infinitely many , they must be equal for all , and thus is indeed the desired weight. ∎
Now we are ready to introduce a key definition that forms the basis for all of the results in this paper.
Definition 4.7.
Let be a weight with for exactly ’s. Then we call the weight product of with a function an m-transform of . We will often write instead of , since Lemma 4.6 guarantees that for this paper we only need to consider 1-element tuples of weights.
With this definition, we could restate Lemma 4.6 as ”All degrees below are equivalent to an -transform of a shift of for some ”. For this paper we only need to consider the case , and thus if we are interested in classifying all degrees below , we need to classify the degrees of all -transforms of shifts of .
In fact, we can classify almost all -transforms of in one theorem, a generalization of Theorem 32 in [4]. We first need to introduce a couple of definitions from [3]:
Definition 4.8.
Let be a polynomial. We define to be the column vector . We also define for a weight another column vector , given by . Finally, we define to be the by matrix whose th column is given by .
The key feature of these definitions is that it allows us to rewrite the weight product as a matrix product in the following sense:
The proof can be done by checking the definition of a weight product (see Lemma 30 in [4] for more details). Now we need a few more lemmas to set up our first theorem on -transforms.
Lemma 4.9.
For all , with pairwise distinct and , the vectors form an invertible matrix.
Proof.
The determinant of this matrix is , which is indeed nonzero if are distinct and as assumed. ∎
Lemma 4.10.
Let q(n) be a polynomial of order k. Then for every we have for some rational coefficients .
We refer to the polynomial that can be found by this lemma as , as in [3]. The following lemma is due to Robert Israel in [5].
Lemma 4.11.
Let M be a 3 by n matrix of rank 3 with , let be an n by 1 vector with the first three entries strictly positive and all other entries nonnegative, and let . Let be a matrix the same size as M which has all entries within of M. Then there exists a vector such that , and has the first three entries strictly positive, and all entries nonnegative.
Proof.
Since has rank 3, we can assume WLOG that the first three columns are linearly independent. Then we can write in block form as , where is an invertible 3 by 3 matrix, and is everything else. We can also split into blocks, and writing these blocks in terms of gives us .
Now if is close enough to , is still invertible, and we can define a vector , and by construction . Further, since converges to , the first three entries can be made strictly positive with a sufficiently small , and its other entries (the ”W” block) are equal to the entries of and thus nonnegative.
∎
5 Classifying Cubic Degrees
Now we are ready to begin classifying all cubic degrees below . We begin by classifying almost all -transforms of in one result.
Theorem 5.1.
Let . Then the degrees of every -transform of are all equal, and every cubic polynomial can be transduced to this degree. We refer to this degree as the bottom degree for cubics.
Proof.
The basic idea of this proof is that we want to narrow our focus to only three columns of our matrix (and three nonzero rows of our vector) in order to replicate the proof method from Theorem 32 in [4].
Let , where each is nonnegative, and at least 3 are strictly positive. By definition we have , where is the weight . We will assume for now that it is the first three ’s which are nonnegative, and justify this assumption later.
We want to prove that any cubic polynomial can be transduced into . Let be an arbitrary cubic polynomial, and let be as defined in Lemma 4.9. Consider the matrix . We have the right conditions to apply Lemma 4.11 and obtain a vector as stated in the lemma. In particular, we have that and has all nonnegative entries. But this means that is not just a vector but in fact a weight vector, since we can form a weight out of the entries of . Then the equation corresponds to a weight product, and in fact what it says is that can be transduced into via the weight corresponding to the vector . Combining this with the way was defined, we have for an arbitrary cubic polynomial , . Therefore is in the bottom degree for cubics.
However, we do still have to justify why we can assume WLOG that our weight had the first three entries strictly positive. In general, the nonzero entries of the weight could be in any location. But the fundamental argument behind Lemma 4.11 works the same regardless, as long as an invertible matrix can be formed by the columns which correspond to the nonzero entries of the vector (i.e. if ’s strictly positive entries were in the positions, the columns of need to be linearly independent). If the nonzero entries are in different places we simply have a messier block matrix to draw and more details to write out. And because Lemma 4.9 guarantees that any 3 columns of can form an invertible 3 by 3 matrix, we are justified in using our assumption to simplify the proof.
∎
We also note that this proof would generalize fairly naturally to prove the following statement: ”All -transforms of are in the bottom degree for polynomials of order , for ”.
So now we have completely eliminated all but two cases: 1-transforms and 2-transforms of . Next we prove that the 2-transforms are also in the bottom degree.
Theorem 5.2.
All 2-transforms of are in the bottom degree for cubics.
Proof.
We first observe that all 2-transforms of are of the form , with both positive rational numbers, , and .
We need the following equality:
where
This can be easily verified, and holds for all . We also observe that if and are fixed positive numbers, for a large enough value of each is strictly positive.
Now we proceed to the proof, which essentially consists of applying algebraic operations to this equation to end up with a 2-transform on the left hand side, and a 3-transform (or higher) on the right hand side. Then applying Thm 5.1 lets us conclude both sides of the equation are in the bottom degree for cubics.
Let be a 2-transform of . Assume WLOG .
We have two cases.
Case 1:
If r 0, in the previous equation let , let , and choose an integer k large enough so each is positive.Then on both sides of the equation replace with to obtain:
Now we use that , and also replace with :
Multiplying through by gives us what we want on the left hand side:
Now let’s look at the right hand side. Since and , . Since and is an integer, , so . Finally, so .
Then we can rewrite the right hand side as the following: , where for all i except for . Since and the ’s are all positive, this is 3-transform of . Therefore it is in the bottom degree, and so is the left hand side.
Case 2:
This will proceed similarly to the first case, except at the end we get a transformation of . Again let , , and choose an integer k large enough so each is positive. We have:
Now rewrite the right hand side as follows:
Multiply both sides by and replace with to obtain the desired left hand side:
Again we examine the right hand side. Since and both are integers, and thus , and we also have , . So we can rewrite the right hand side into the following sum: , where for all i except for . Since and the ’s are all positive, this is a 3-transform of . Therefore it is in the bottom degree by Theorem 5.2, and so is the left hand side. ∎
Now we have shown that everything except the 1-transforms is equal to the bottom degree for cubics. Fortunately, the 1-transforms are not in the bottom degree, and we will show that their degrees form an interesting structure. First we prove that they are not in the bottom degree.
Theorem 5.3.
Every 1-transform of is strictly above the bottom degree for cubics.
Proof.
We will proceed by contradiction. Assume that there is some 1-transform of which is equal to the bottom degree. From the definition of the weight product, all 1-transforms of are of the form , with . To say that such a function is in the bottom degree is to say that all other cubic polynomials satisfy . In particular, we can choose f to be a 3-transform of , say for some weight with three nonzero entries.
This means that we have .
A bit of algebra gives us that , where and is the composition of weights (Lemma 12, [3]). The key thing to note here is that because had three nonzero entries, also must have at least three nonzero entries (The composition of a -transform with an -transform is a -transform). Let .
Therefore we have:
and thus
where at least 3 of the are nonzero. We can simplify this equation with the goal of getting just on the left hand side. First we multiply both sides by , as well as multiplying the right hand side by and distributing the top to the inside of the parentheses. This gives:
We can replace with to obtain:
Now replace with :
Replace with , and then divide both sides by (this leaves the left hand side unchanged):
Finally, define for all and 0 otherwise, and let .
Then we have:
Now the right hand side is clearly equal to a weight transformation of , and since at least three of the ’s were nonzero, at least three of the ’s will be nonzero. This would mean that is equal to a function which is in the bottom degree for cubics, by Thm 5.1. However, it has been shown that this is not the case in (Theorem 23, [4]), therefore this contradiction means that is not equal to the bottom degree. ∎
So now we have shown that all of the 1-transforms of are strictly above the bottom degree. We still need to consider how they relate to each other. The following lemma will give us a step in that direction by narrowing down the amount of 1-transforms we need to consider.
Lemma 5.4.
Suppose is relatively prime to . Then .
Proof.
We prove this via double inequality. One direction is trivial:
The other direction requires a bit more work. First, we prove the following: For all ,
Proof:
Now we want to show that for some , . Since is relatively prime to , it is invertible in the multiplicative group , which means for some , mod . So there exists some number with . Then we have:
Since we proved that earlier, this completes the proof. ∎
Note that in the case that are not relatively prime, we can simply divide out by their common factor and reduce. For example, .
This means that the degrees of all 1-transforms can be represented by some . The only question now is whether or not some (or all) of these degrees are equal, and how they compare to each other. The following theorem answers this question definitively.
Theorem 5.5.
if and only if .
Proof.
The reverse implication is relatively straightforward: if , then let and we have .
Now for the forward implication. The key observation is that if , this means that there exists , and a weight such that , and this weight must have only one nonzero entry. Otherwise, the right hand side of the previous equation would be an -transform of for , which (by Thms 5.1 and 5.2) would put it in the bottom degree. ( is a 1-transform of , and an -transform of a 1-transform is also an -transform of the original function, via weight product composition.) To say that must have only one nonzero entry means that has the form . Plugging this into the above equation means that
This means that and therefore . ∎
This theorem also gives us an interesting corollary that is somewhat counterintuitive.
Corollary 5.6.
Suppose , and let be a weight. Then it is possible that .
Proof.
Let , , . Then , but , and by the theorem.
∎
6 Conclusion
What Theorem 5.5 tells us is that the structure of the degrees of 1-transforms of is that of a divisibility lattice, with at the top, in the second row for all primes , etc, and below this lattice is the bottom degree. The results of Theorems 4.5, 5.1, and 5.2 tell us that these are the only degrees that are below the degree of , and thus the structure of all transducer degrees below is given by the following picture (we remove the notation used for degrees to make the image cleaner):
= the bottom degree for cubics (includes all 2-transforms of and all -transforms of for all and all )
Therefore all degrees below have been classified. It remains an open question whether or not there are other cubic polynomials that are either above or incomparable with . The classification of higher order polynomial degrees is also unknown, but many of the methods used in this paper would be useful to answer such questions.
References
- [1] Jörg Endrullis, Dimitri Hendriks and Jan Willem Klop “Degrees of Streams” Proceedings of the Leiden Numeration Conference 2010 In Journal of Integers 11B.A6, 2011, pp. 1–40
- [2] Jörg Endrullis, Clemens Grabmayer, Dimitri Hendriks and Hans Zantema “The Degree of Squares is an Atom” In Proc. Conf. on Combinatorics on Words (WORDS 2015) 9304, LNCS Springer, 2015, pp. 109–121 DOI: 10.1007/978-3-319-23660-5“˙10
- [3] Jörg Endrullis, Juhani Karhumäki, Jan Willem Klop and Aleksi Saarela “Degrees of Infinite Words, Polynomials and Atoms” In Proc. Conf. on Developments in Language Theory (DLT 2016) 9840, LNCS Springer, 2016, pp. 164–176 DOI: 10.1007/978-3-662-53132-7“˙14
- [4] Jörg Endrullis, Juhani Karhumäki, Jan Willem Klop and Aleksi Saarela “Degrees of Infinite Words, Polynomials and Atoms” In International Journal of Foundations of Computer Science 29.5, 2018, pp. 825–843 DOI: 10.1142/S0129054118420066
- [5] Jörg Endrullis, Jan Willem Klop and Rena Bakhshi “Transducer Degrees: Atoms, Infima and Suprema” In Acta Informatica 57.3-5, 2020, pp. 727–758 DOI: 10.1007/s00236-019-00353-7