This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Classifying All Transducer Degrees Below N3N^{3}

Noah Kaufmann
Abstract

We answer an open question in the theory of transducer degrees initially posed in [3], on the structure of polynomial transducer degrees, in particular the question of what degrees, if any, lie below the degree of n3n^{3}. Transducer degrees are the equivalence classes formed by word transformations which can be realized by a finite-state transducer. While there are no general techniques to tell if a word w1w_{1} can be transformed into w2w_{2} via an FST, the work of Endrullis et al. in [2] provides a test for the class of spiralling functions, which includes all polynomials. We classify fully the degrees of all cubic polynomials which are below n3n^{3}, and many of the methods can also be used to classify the degrees of polynomials of higher orders.

1 Introduction

Finite-state transducers (FSTs) and infinite sequences are both fairly natural objects in mathematics, but not much is known about the structure of the order induced by the transducibility relation on infinite sequences (στ\sigma\geq\tau if σ\sigma can be transduced into τ\tau via an FST). However, there are many parallels between the degrees induced by transducibility and Turing degrees, so it’s natural to begin by asking the same questions that have been asked (and answered) for Turing degrees. Many initial results have already been obtained (mostly in [1]), for instance:

  • The bottom degree 0 is the set of all ultimately periodic sequences (sequences of the form uvvvvuvvvv... for some words uu,vv)

  • The partial order of degrees is not dense or well-founded

  • There are no maximal degrees

  • A set of degrees has an upper bound if and only if it is countable

  • There are at least ω\omega atom degrees (degrees which are exactly one step above 0)

The last result in particular was proven in (Theorem 32, [4]) when it was shown that there is a way to construct an atomic degree pk(n)\langle p_{k}(n)\rangle for every kk, where pk(n)p_{k}(n) is a polynomial of order kk. (To avoid confusion between degrees of transducibility and degrees of polynomials, we use the term ”order” for the degree of a polynomial.) However, until now, all that was known about the structure of degrees of polynomials is that polynomials of different orders are always incomparable, and that the degree of nkn^{k} is strictly greater than the degree of pk(n)p_{k}(n). We will provide a full classification of all degrees below the degree of n3n^{3}, with methods that will be useful for classifying degrees of higher order polynomials.

2 Preliminaries

Let Σ\Sigma be an alphabet. Recall that the empty word is denoted by ϵ\epsilon. Σ\Sigma^{*} is the set of all finite words over Σ\Sigma, and Σ+\Sigma^{+} is Σ\Sigma^{*} \ϵ\epsilon. Denote by Σ\Sigma^{\mathbb{N}} the set of infinite sequences over Σ\Sigma. Σ\Sigma^{\infty} is the set of all finite and infinite words over Σ\Sigma, i.e. ΣΣ\Sigma^{*}\cup\Sigma^{\mathbb{N}}. For f:f:\mathbb{N}\rightarrow\mathbb{N} and k in \mathbb{N}, the kk-th shift of f is denoted by Sk(f)(n)=f(n+k)S^{k}(f)(n)=f(n+k) for all nn in \mathbb{N}.

We use a boldface letter like 𝜶\bm{\alpha} to denote an ordered tuple a0,a1,,ak1\langle a_{0},a_{1},\ldots,a_{k-1}\rangle, and we use 𝜶\bm{\alpha^{\prime}} to denote the cyclic shift of 𝜶\bm{\alpha}: a1,a2,,ak1,a0\langle a_{1},a_{2},\ldots,a_{k-1},a_{0}\rangle.

3 Finite-State Transducers and Transducer Degrees

We will mostly focus on the definitions and results which are useful for the original material in this paper. For a more complete background of the known results for transducer degrees and their proofs, see [1, 2, 3, 4].

For the following definitions for FSTs, we restrict ourselves to only considering complete pure sequential FSTs, where for every state qq and every input letter aa, there is exactly one successor state δ(q,a)\delta(q,a), and the function realized by such a transducer preserves prefixes (if uu is a prefix of vv, then f(u)f(u) is a prefix of f(v)f(v)). We also restrict ourselves to using only 2 = {0,1} as both the input and output alphabet for convenience.

Definition 3.1.

A finite-state transducer is a tuple T=Q,q0,δ,λT=\langle Q,q_{0},\delta,\lambda\rangle where QQ is a finite set of states, q0Qq_{0}\in Q is the initial state, δ:Q×2Q\delta:Q\times\textbf{2}\rightarrow Q is the transition function, and λ:Q×22\lambda:Q\times\textbf{2}\rightarrow\textbf{2}^{*} is the output function.

Note that δ\delta and λ\lambda can be extended (δ:Q×2Q,λ:Q×22\delta:Q\times\textbf{2}^{*}\rightarrow Q,\lambda:Q\times\textbf{2}^{\infty}\rightarrow\textbf{2}^{\infty}) as follows:

δ(q,ϵ)=q,δ(q,au)=δ(δ(q,a),u), where qQ,a2,u2\delta(q,\epsilon)=q,\delta(q,au)=\delta(\delta(q,a),u),\textrm{ where }q\in Q,a\in\textbf{2},u\in\textbf{2}^{*}
λ(q,ϵ)=ϵ,λ(q,au)=λ(q,a)λ(δ(q,a),u), where qQ,a2,u2\lambda(q,\epsilon)=\epsilon,\lambda(q,au)=\lambda(q,a)\cdot\lambda(\delta(q,a),u),\textrm{ where }q\in Q,a\in\textbf{2},u\in\textbf{2}^{\infty}

Intuitively, the above equations simply correspond to inputting a string of letters as opposed to just one letter, and reading the string one letter at a time. So then the function T:22T:\textbf{2}^{\infty}\rightarrow\textbf{2}^{\infty} induced by the FST TT on (finite or infinite) words is defined by T(u)=λ(q0,u)T(u)=\lambda(q_{0},u), the value of the output λ\lambda after inputting uu, starting at the initial state q0q_{0}. This allows us to give the following definition:

Definition 3.2.

Let TT be an FST, and let σ,τ2\sigma,\tau\in\textbf{2}^{\mathbb{N}} be infinite sequences. We say that TT transducestransduces σ\sigma to τ\tau, or that τ\tau is the TT-transducttransduct of σ\sigma, if T(σ)=τT(\sigma)=\tau. In general, for any two infinite sequences σ,τ\sigma,\tau we say that στ\sigma\geq\tau if there exists some TT so that T(σ)=τT(\sigma)=\tau.

This relation \geq is reflexive, and can be shown to be transitive by composition of FSTs (See Lemma 8, [1]). If for some σ,τ\sigma,\tau we have στ\sigma\geq\tau but not vice versa, we say σ>τ\sigma>\tau. If we do have στ\sigma\geq\tau and τσ\tau\geq\sigma then we say that στ\sigma\equiv\tau, and we use [σ][\sigma] to denote the equivalence class of σ\sigma. We call [σ][\sigma] the degree of σ\sigma.

4 Weight Products and Useful Results

Definition 4.1.

For a function f:f:\mathbb{N}\rightarrow\mathbb{N} we define the sequence f2\langle f\rangle\in\textbf{2}^{\mathbb{N}} by

f=i=010f(i)=10f(0)10f(1)10f(2)\langle f\rangle=\prod_{i=0}^{\infty}10^{f(i)}=10^{f(0)}10^{f(1)}10^{f(2)}\ldots

Having defined f\langle f\rangle in this way, some relatively simple initial results have been obtained in [2], which we state here.

Lemma 4.2.

Let f:,a,b.f:\mathbb{N}\rightarrow\mathbb{N},a,b\in\mathbb{N}. We have the following equivalences and inequalities:

  1. 1.

    af(n)f(n),\langle af(n)\rangle\equiv\langle f(n)\rangle, for a>0a>0

  2. 2.

    f(n+a)f(n)\langle f(n+a)\rangle\equiv\langle f(n)\rangle

  3. 3.

    f(n)+af(n)\langle f(n)+a\rangle\equiv\langle f(n)\rangle

  4. 4.

    f(n)f(an),\langle f(n)\rangle\geq\langle f(an)\rangle, for a>0a>0

  5. 5.

    f(n)af(2n)+bf(2n+1)\langle f(n)\rangle\geq\langle af(2n)+bf(2n+1)\rangle

One interesting consequence of the third equality is that any polynomial with a positive leading coefficient can be thought of as a stream and thus associated with a transducer degree, even if some of its values happen to be negative. For instance, the polynomial (n2)3(n-2)^{3} can’t directly be interpreted as a stream, since it is negative for n=0,1n=0,1. However, if we take (n2)3+8(n-2)^{3}+8, then this polynomial is nonnegative, and therefore corresponds to a stream (and thus a degree). So even though it’s technically incorrect, it will be convenient sometimes to refer to a degree such as (n2)3\langle(n-2)^{3}\rangle, when we mean more precisely the degree (n2)3+k\langle(n-2)^{3}+k\rangle for any k8k\geq 8. Similarly, the first equality allows us to refer to the degree of a function with rational coefficients, where we really mean the degree of the corresponding function multiplied by the appropriate scalar to eliminate any fractional coefficients.

Now we move on to weight products. For this paper we don’t provide the full context or proof for the main result we need (Theorem 4.4, which provides us with an incredibly useful result for comparing polynomial degrees), but a more detailed explanation can be found in (Theorem 21, [3]). We start by defining a weight.

Definition 4.3.

A weight is a tuple α=a0,a1,,ak1,b\alpha=\langle a_{0},a_{1},\ldots,a_{k-1},b\rangle\in\mathbb{Q} with each ai0a_{i}\geq 0. If ai=0a_{i}=0 for all ii then we say the weight is constant. To distinguish between weights and tuples of weights, weights will not be bolded but tuples of weights will.

Given a weight α\alpha as above and a function f:f:\mathbb{N}\rightarrow\mathbb{N} we can define αf\alpha\cdot f as:

αf=a0f(0)+a1f(1)++ak1f(k1)+b\alpha\cdot f=a_{0}f(0)+a_{1}f(1)+\ldots+a_{k-1}f(k-1)+b

We are ready to define the weighted product. Let 𝜶=α0,α1,,αm1\bm{\alpha}=\langle\alpha_{0},\alpha_{1},\ldots,\alpha_{m-1}\rangle be a tuple of weights, with 𝜶\bm{\alpha^{\prime}} being the cyclic shift α1,α2,,αm1,α0\langle\alpha_{1},\alpha_{2},\ldots,\alpha_{m-1},\alpha_{0}\rangle.

Then the weighted product of 𝜶\bm{\alpha} with ff, written as 𝜶f\bm{\alpha}\otimes f, is defined in the following way:

(𝜶f)(0)=α0f(\bm{\alpha}\otimes f)(0)=\alpha_{0}\cdot f
(𝜶f)(n+1)=(𝜶S|α0|1(f))(n)(\bm{\alpha}\otimes f)(n+1)=(\bm{\alpha^{\prime}}\otimes S^{|\alpha_{0}|-1}(f))(n)

Here Sk(f)(n)=f(n+k)S^{k}(f)(n)=f(n+k) and |α0||\alpha_{0}| indicates the length of the tuple α0\alpha_{0}. We call a weighted product naturalnatural if 𝜶f(n)\bm{\alpha}\otimes f(n)\in\mathbb{N} for all nn. Note that since 𝜶\bm{\alpha} is a finite tuple of finite tuples in \mathbb{Q}, we can take the LCM of all of the denominators and multiply through to make the product natural. Since this does not change the degree of the resulting function (by Lemma 4.2.1), from now on we will assume that all weight products are natural.

The following image provides a more intuitive picture of how the weight product works, by showing pictorially how to compute the weight product of the tuple of weights 𝜶=α0,α1\bm{\alpha}=\langle\alpha_{0},\alpha_{1}\rangle with an arbitrary function f(n)f(n), where α0=1,3,3,7,α1=4,2,3\alpha_{0}=\langle 1,3,3,7\rangle,\alpha_{1}=\langle 4,2,3\rangle:

  1. \Tree(𝜶f)(0)=f(0)+3f(1)+3f(2)+7(\bm{\alpha}\otimes f)(0)=f(0)+3f(1)+3f(2)+7×3\times 3f(2)f(2)×3\times 3f(1)f(1)×1\times 1f(0)f(0)\Tree(𝜶f)(1)=4f(3)+2f(4)+3(\bm{\alpha}\otimes f)(1)=4f(3)+2f(4)+3×2\times 2f(4)f(4)×4\times 4f(3)f(3)
  1. \Tree(𝜶f)(2)=f(5)+3f(6)+3f(7)+7(\bm{\alpha}\otimes f)(2)=f(5)+3f(6)+3f(7)+7×3\times 3f(7)f(7)×3\times 3f(6)f(6)×1\times 1f(5)f(5)\Tree(𝜶f)(3)=4f(8)+2f(9)+3(\bm{\alpha}\otimes f)(3)=4f(8)+2f(9)+3×2\times 2f(9)f(9)×4\times 4f(8)f(8)

Now that we have defined weight products we can proceed to state the main result that we need for the rest of the paper. The full proof of this result can be found in (Theorem 21, [3]).

Theorem 4.4.

Let f,g:f,g:\mathbb{N}\rightarrow\mathbb{N} be polynomials. Then gf\langle g\rangle\geq\langle f\rangle if and only if there exists a tuple of weights 𝛂\bm{\alpha} and integers n0,m0n_{0},m_{0} such that Sn0(f)=𝛂Sm0(g)S^{n_{0}}(f)=\bm{\alpha}\otimes S^{m_{0}}(g).

The usefulness of this theorem for comparing polynomial degrees cannot be overstated. Instead of being forced to consider all possible transductions of polynomial streams by any possible transducer, we can instead narrow our focus strictly to weight products. We also have a similar theorem from [2] which allows us to narrow our focus to only polynomial degrees. The following theorem guarantees that all degrees (not just polynomial degrees) below n3n^{3} are equivalent to the degree of a weight product of n3n^{3}. Therefore we do not miss anything by only considering weight products of n3n^{3}.

Theorem 4.5.

Let f:f:\mathbb{N}\rightarrow\mathbb{N} be a polynomial, and σ2\sigma\in 2^{\mathbb{N}}. Then fσ\langle f\rangle\geq\sigma if and only if σαSn0(f)\sigma\equiv\langle\alpha\otimes S^{n_{0}}(f)\rangle for some integer n00n_{0}\geq 0, and a tuple of weights α\alpha.

We can narrow our focus even further with a new result that lets us focus on a particular subset of weight tuples, the tuples with only one weight. Note that by the definition of weight product, if we take the product of a function ff with a weight tuple with a single weight 𝜶=a0,a1,,ak1,b\bm{\alpha}=\langle a_{0},a_{1},\ldots,a_{k-1},b\rangle, it is equal to the sum b+i=0k1aif(kn+i)b+\sum_{i=0}^{k-1}a_{i}f(kn+i).

Lemma 4.6.

Let f(n)=nkf(n)=n^{k} and suppose g:g:\mathbb{N}\rightarrow\mathbb{N} is a polynomial such that there exists a tuple of weights 𝛂\bm{\alpha} with g=𝛂fg=\bm{\alpha}\otimes f. Then there exists a single weight β\beta such that g=βfg=\langle\beta\rangle\otimes f.

Proof.

Let ff, gg, 𝜶\bm{\alpha} be as above, and suppose 𝜶=α0,α1,,αm1\bm{\alpha}=\langle\alpha_{0},\alpha_{1},\ldots,\alpha_{m-1}\rangle has more than one weight. By shifting gg as necessary, we can assume WLOG that α0=a0,a1,,ap,b\alpha_{0}=\langle a_{0},a_{1},\ldots,a_{p},b\rangle has the shortest length of all of the weights in 𝜶\bm{\alpha}. Let L=i=0m1(|αi|1)L=\sum_{i=0}^{m-1}(|\alpha_{i}|-1).

We have for all n0n\equiv 0 mod mm,

(𝜶f)(n)=a0f(Lnm)+a1f(Lnm+1)++apf(Lnm+p)+b(\bm{\alpha}\otimes f)(n)=a_{0}f(\frac{Ln}{m})+a_{1}f(\frac{Ln}{m}+1)+\cdots+a_{p}f(\frac{Ln}{m}+p)+b
=a0(Lnm)k+a1(Lnm+1)k++ap(Lnm+p)k+b=a_{0}(\frac{Ln}{m})^{k}+a_{1}(\frac{Ln}{m}+1)^{k}+\cdots+a_{p}(\frac{Ln}{m}+p)^{k}+b
=1mk(a0(Ln)k+a1(Ln+m)k++ap(Ln+pm)k+bmk)=\frac{1}{m^{k}}(a_{0}(Ln)^{k}+a_{1}(Ln+m)^{k}+\cdots+a_{p}(Ln+pm)^{k}+bm^{k})

Now by assumption, α0\alpha_{0} had the shortest length of the weights in 𝜶\bm{\alpha}, which means that if we take this length (which is p+1p+1) times the number of weights mm, this must be less than or equal to LL. Therefore we can construct the following weight β\beta:

β=1mk(a0,0,0,,a1,0,0,,0,0,ap,0,0,,0,bmk)\beta=\frac{1}{m^{k}}(a_{0},0,0,\cdots,a_{1},0,0,\cdots,0,0,a_{p},0,0,\cdots,0,bm^{k})

where each aia_{i} is in position im+1im+1, and after apa_{p} we have L(pm+1)L-(pm+1) zeroes, so that this weight has length L. Then for all n0n\equiv 0 mod mm, (βf)(n)=(αf)(n)=g(n)(\langle\beta\rangle\otimes f)(n)=(\langle\alpha\rangle\otimes f)(n)=g(n). However, all three expressions in this equation are polynomials. Since they are equal for infinitely many nn, they must be equal for all nn, and thus β\beta is indeed the desired weight. ∎

Now we are ready to introduce a key definition that forms the basis for all of the results in this paper.

Definition 4.7.

Let α=a0,a1,,ak1,b\alpha=\langle a_{0},a_{1},\ldots,a_{k-1},b\rangle be a weight with ai>0a_{i}>0 for exactly mm aia_{i}’s. Then we call the weight product of 𝜶=α\bm{\alpha}=\langle\alpha\rangle with a function f:f:\mathbb{N}\rightarrow\mathbb{N} an m-transform of ff. We will often write αf\alpha\otimes f instead of 𝜶f\bm{\alpha}\otimes f, since Lemma 4.6 guarantees that for this paper we only need to consider 1-element tuples of weights.

With this definition, we could restate Lemma 4.6 as ”All degrees below nk\langle n^{k}\rangle are equivalent to an mm-transform of a shift of nkn^{k} for some mm \in\mathbb{N}”. For this paper we only need to consider the case k=3k=3, and thus if we are interested in classifying all degrees below n3\langle n^{3}\rangle, we need to classify the degrees of all mm-transforms of shifts of n3n^{3}.

In fact, we can classify almost all mm-transforms of n3n^{3} in one theorem, a generalization of Theorem 32 in [4]. We first need to introduce a couple of definitions from [3]:

Definition 4.8.

Let f(n)=aknk++a1n+a0f(n)=a_{k}n^{k}+\cdots+a_{1}n+a_{0} be a polynomial. We define V(f(n))V(f(n)) to be the column vector [a1a2ak]T\begin{bmatrix}a_{1}\hskip 2.84544pta_{2}\hskip 2.84544pt\cdots\hskip 2.84544pta_{k}\end{bmatrix}^{T}. We also define for a weight α=α0,α1,,αm1,b\alpha=\langle\alpha_{0},\alpha_{1},\cdots,\alpha_{m-1},b\rangle another column vector U(α)U(\alpha), given by U(α)=[α0α1αm1]TU(\alpha)=\begin{bmatrix}\alpha_{0}\hskip 2.84544pt\alpha_{1}\hskip 2.84544pt\cdots\hskip 2.84544pt\alpha_{m-1}\end{bmatrix}^{T}. Finally, we define Mm(f(n))M_{m}(f(n)) to be the kk by mm matrix whose iith column is given by V(f(mn+i1))V(f(mn+i-1)).

The key feature of these definitions is that it allows us to rewrite the weight product as a matrix product in the following sense:

Mm(f(n))U(α)=V((αf)(n))M_{m}(f(n))U(\alpha)=V((\alpha\otimes f)(n))

The proof can be done by checking the definition of a weight product (see Lemma 30 in [4] for more details). Now we need a few more lemmas to set up our first theorem on mm-transforms.

Lemma 4.9.

For all m,a,b,cm,a,b,c\in\mathbb{N}, with a,b,ca,b,c pairwise distinct and m0m\neq 0, the vectors V((mn+a)3),V((mn+b)3),V((mn+c)3)V((mn+a)^{3}),V((mn+b)^{3}),V((mn+c)^{3}) form an invertible matrix.

Proof.

The determinant of this matrix is 9m6(ab)(bc)(ac)9m^{6}(a-b)(b-c)(a-c), which is indeed nonzero if a,b,ca,b,c are distinct and m0m\neq 0 as assumed. ∎

Lemma 4.10.

Let q(n) be a polynomial of order k. Then for every ϵ>0\epsilon>0 we have q(n)nk+bk1nk1++b1n\langle q(n)\rangle\geq\langle n^{k}+b_{k-1}n^{k-1}+\cdots+b_{1}n\rangle for some rational coefficients 0bk1,,b1<ϵ0\leq b_{k-1},\cdots,b_{1}<\epsilon.

We refer to the polynomial nk+bk1nk1++b1nn^{k}+b_{k-1}n^{k-1}+\cdots+b_{1}n that can be found by this lemma as qϵ(n)q_{\epsilon}(n), as in [3]. The following lemma is due to Robert Israel in [5].

Lemma 4.11.

Let M be a 3 by n matrix of rank 3 with n3n\geq 3, let UU be an n by 1 vector with the first three entries strictly positive and all other entries nonnegative, and let MU=PMU=P. Let MϵM_{\epsilon} be a matrix the same size as M which has all entries within ϵ\epsilon of M. Then there exists a vector UϵU_{\epsilon} such that MϵUϵ=PM_{\epsilon}U_{\epsilon}=P, and UϵU_{\epsilon} has the first three entries strictly positive, and all entries nonnegative.

Proof.

Since MM has rank 3, we can assume WLOG that the first three columns are linearly independent. Then we can write MM in block form as [BC][B\mid C], where BB is an invertible 3 by 3 matrix, and CC is everything else. We can also split UU into blocks, and writing these blocks in terms of B,C,PB,C,P gives us U=[B1(PCW)W]TU=[B^{-1}(P-CW)\mid W]^{T}.

Now if Mϵ=[BϵCϵ]M_{\epsilon}=[B_{\epsilon}\mid C_{\epsilon}] is close enough to MM, BϵB_{\epsilon} is still invertible, and we can define a vector Uϵ=[Bϵ1(PCϵW)W]TU_{\epsilon}=[B_{\epsilon}^{-1}(P-C_{\epsilon}W)\mid W]^{T}, and by construction MϵUϵ=PM_{\epsilon}U_{\epsilon}=P. Further, since UϵU_{\epsilon} converges to UU, the first three entries can be made strictly positive with a sufficiently small ϵ\epsilon, and its other entries (the ”W” block) are equal to the entries of UU and thus nonnegative.

5 Classifying Cubic Degrees

Now we are ready to begin classifying all cubic degrees below n3n^{3}. We begin by classifying almost all mm-transforms of n3n^{3} in one result.

Theorem 5.1.

Let m3,tm\geq 3,t\in\mathbb{Z}. Then the degrees of every mm-transform of (n+t)3(n+t)^{3} are all equal, and every cubic polynomial can be transduced to this degree. We refer to this degree as the bottom degree for cubics.

Proof.

The basic idea of this proof is that we want to narrow our focus to only three columns of our matrix (and three nonzero rows of our vector) in order to replicate the proof method from Theorem 32 in [4].

Let f(n)=i=0m1ai(mn+t+i)3f(n)=\sum_{i=0}^{m-1}a_{i}(mn+t+i)^{3}, where each aia_{i} is nonnegative, and at least 3 are strictly positive. By definition we have Mm((n+t)3)U(α)=V(f(n))M_{m}((n+t)^{3})U(\alpha)=V(f(n)), where α\alpha is the weight a0,a1,,am1,0\langle a_{0},a_{1},...,a_{m-1},0\rangle. We will assume for now that it is the first three aia_{i}’s which are nonnegative, and justify this assumption later.

We want to prove that any cubic polynomial can be transduced into f(n)f(n). Let q(n)q(n) be an arbitrary cubic polynomial, and let qϵ(n)q_{\epsilon}(n) be as defined in Lemma 4.9. Consider the matrix Mm(qϵ(n+t))M_{m}(q_{\epsilon}(n+t)). We have the right conditions to apply Lemma 4.11 and obtain a vector UϵU_{\epsilon} as stated in the lemma. In particular, we have that Mm(qϵ(n+t))Uϵ=V(f(n))M_{m}(q_{\epsilon}(n+t))U_{\epsilon}=V(f(n)) and UϵU_{\epsilon} has all nonnegative entries. But this means that UϵU_{\epsilon} is not just a vector but in fact a weight vector, since we can form a weight out of the entries of UϵU_{\epsilon}. Then the equation Mm(qϵ(n+t))Uϵ=V(f(n))M_{m}(q_{\epsilon}(n+t))U_{\epsilon}=V(f(n)) corresponds to a weight product, and in fact what it says is that qϵ(n+t)q_{\epsilon}(n+t) can be transduced into f(n)f(n) via the weight corresponding to the vector UϵU_{\epsilon}. Combining this with the way qϵq_{\epsilon} was defined, we have for an arbitrary cubic polynomial q(n)q(n), q(n)q(n+t)qϵ(n+t)f(n)\langle q(n)\rangle\equiv\langle q(n+t)\rangle\geq\langle q_{\epsilon}(n+t)\rangle\geq\langle f(n)\rangle. Therefore f(n)f(n) is in the bottom degree for cubics.

However, we do still have to justify why we can assume WLOG that our weight had the first three entries strictly positive. In general, the nonzero entries of the weight could be in any location. But the fundamental argument behind Lemma 4.11 works the same regardless, as long as an invertible matrix can be formed by the columns which correspond to the nonzero entries of the vector UU (i.e. if UU’s strictly positive entries were in the ith,jth,kthith,jth,kth positions, the ith,jth,kthith,jth,kth columns of MM need to be linearly independent). If the nonzero entries are in different places we simply have a messier block matrix to draw and more details to write out. And because Lemma 4.9 guarantees that any 3 columns of Mm((n+t)3)M_{m}((n+t)^{3}) can form an invertible 3 by 3 matrix, we are justified in using our assumption to simplify the proof.

We also note that this proof would generalize fairly naturally to prove the following statement: ”All mm-transforms of (n+t)k(n+t)^{k} are in the bottom degree for polynomials of order kk, for mkm\geq k”.

So now we have completely eliminated all but two cases: 1-transforms and 2-transforms of n3n^{3}. Next we prove that the 2-transforms are also in the bottom degree.

Theorem 5.2.

All 2-transforms of n3n^{3} are in the bottom degree for cubics.

Proof.

We first observe that all 2-transforms of n3n^{3} are of the form a(pn+r)3+b(pn+s)3a(pn+r)^{3}+b(pn+s)^{3}, with a,ba,b both positive rational numbers, 0r,s<p,rs0\leq r,s<p,r\neq s, and p,r,sp,r,s\in\mathbb{N}.

We need the following equality:

n3+d(n+j)3=a1(kn+jk1)3+a2(knk+1)3+a3(kn)3n^{3}+d(n+j)^{3}=a_{1}(kn+jk-1)^{3}+a_{2}(kn-k+1)^{3}+a_{3}(kn)^{3}

where a1=dj(1+k+jk)k2(jk1)(jk+k2),a2=dj(k+1)k2(jk+k2),a3=1k3(1d(jk+k1)(k1)(jk1))a_{1}=\frac{dj(-1+k+jk)}{k^{2}(jk-1)(jk+k-2)},a_{2}=\frac{dj}{(k+1)k^{2}(jk+k-2)},a_{3}=\frac{1}{k^{3}}(1-\frac{d(jk+k-1)}{(k-1)(jk-1)})

This can be easily verified, and holds for all d,j,kd,j,k. We also observe that if dd and jj are fixed positive numbers, for a large enough value of kk each aia_{i} is strictly positive.

Now we proceed to the proof, which essentially consists of applying algebraic operations to this equation to end up with a 2-transform on the left hand side, and a 3-transform (or higher) on the right hand side. Then applying Thm 5.1 lets us conclude both sides of the equation are in the bottom degree for cubics.

Let a(pn+r)3+b(pn+s)3a(pn+r)^{3}+b(pn+s)^{3} be a 2-transform of n3n^{3}. Assume WLOG s>rs>r.

We have two cases.

Case 1: r>0r>0

If r >> 0, in the previous equation let d=bad=\frac{b}{a}, let j=srj=s-r, and choose an integer k large enough so each aia_{i} is positive.Then on both sides of the equation replace nn with (n+r)(n+r) to obtain:

(n+r)3+d(n+j+r)3=a1(kn+rk+jk1)3+a2(kn+rkk+1)3+a3(kn+rk)3(n+r)^{3}+d(n+j+r)^{3}=a_{1}(kn+rk+jk-1)^{3}+a_{2}(kn+rk-k+1)^{3}+a_{3}(kn+rk)^{3}

Now we use that r+j=sr+j=s, and also replace nn with pnpn:

(pn+r)3+d(pn+s)3=a1(pkn+sk1)3+a2(pkn+(r1)k+1)3+a3(pkn+rk)3(pn+r)^{3}+d(pn+s)^{3}=a_{1}(pkn+sk-1)^{3}+a_{2}(pkn+(r-1)k+1)^{3}+a_{3}(pkn+rk)^{3}

Multiplying through by aa gives us what we want on the left hand side:

a(pn+r)3+b(pn+s)3=aa1(pkn+sk1)3+aa2(pkn+(r1)k+1)3+aa3(pkn+rk)3a(pn+r)^{3}+b(pn+s)^{3}=aa_{1}(pkn+sk-1)^{3}+aa_{2}(pkn+(r-1)k+1)^{3}+aa_{3}(pkn+rk)^{3}

Now let’s look at the right hand side. Since p>sp>s and s>1s>1, pk>sk1>0pk>sk-1>0. Since r>0r>0 and rr is an integer, r1r\geq 1, so pk>(r1)k+1>0pk>(r-1)k+1>0. Finally, p>rp>r so pk>rk>0pk>rk>0.

Then we can rewrite the right hand side as the following: i=0pk1ci(pkn+i)3\sum_{i=0}^{pk-1}c_{i}(pkn+i)^{3}, where ci=0c_{i}=0 for all i except for csk1=aa1,c(r1)k+1=aa2,crk=aa3c_{sk-1}=aa_{1},c_{(r-1)k+1}=aa_{2},c_{rk}=aa_{3}. Since aa and the aia_{i}’s are all positive, this is 3-transform of n3n^{3}. Therefore it is in the bottom degree, and so is the left hand side.

Case 2: r=0r=0

This will proceed similarly to the first case, except at the end we get a transformation of (nk)3(n-k)^{3}. Again let d=bad=\frac{b}{a}, j=sj=s, and choose an integer k large enough so each aia_{i} is positive. We have:

n3+d(n+s)3=a1(kn+sk1)3+a2(knk+1)3+a3(kn)3n^{3}+d(n+s)^{3}=a_{1}(kn+sk-1)^{3}+a_{2}(kn-k+1)^{3}+a_{3}(kn)^{3}

Now rewrite the right hand side as follows:

n3+d(n+s)3=a1(kn+(sk+k1)k)3+a2(kn+1k)3+a3(kn+kk)3n^{3}+d(n+s)^{3}=a_{1}(kn+(sk+k-1)-k)^{3}+a_{2}(kn+1-k)^{3}+a_{3}(kn+k-k)^{3}

Multiply both sides by aa and replace nn with pnpn to obtain the desired left hand side:

a(pn)3+b(pn+s)3=aa1(pkn+(s+1)k1k)3+aa2(pkn+1k)3+aa3(pkn+kk)3a(pn)^{3}+b(pn+s)^{3}=aa_{1}(pkn+(s+1)k-1-k)^{3}+aa_{2}(pkn+1-k)^{3}+aa_{3}(pkn+k-k)^{3}

Again we examine the right hand side. Since p>sp>s and both are integers, ps+1p\geq s+1 and thus pk>(s+1)k1>0pk>(s+1)k-1>0, and we also have pk>1>0pk>1>0, pk>k>0pk>k>0. So we can rewrite the right hand side into the following sum: i=0pk1ci(pkn+ik)3\sum_{i=0}^{pk-1}c_{i}(pkn+i-k)^{3}, where ci=0c_{i}=0 for all i except for c(s+1)k1=aa1,c1=aa2,ck=aa3c_{(s+1)k-1}=aa_{1},c_{1}=aa_{2},c_{k}=aa_{3}. Since aa and the aia_{i}’s are all positive, this is a 3-transform of (nk)3(n-k)^{3}. Therefore it is in the bottom degree by Theorem 5.2, and so is the left hand side. ∎

Now we have shown that everything except the 1-transforms is equal to the bottom degree for cubics. Fortunately, the 1-transforms are not in the bottom degree, and we will show that their degrees form an interesting structure. First we prove that they are not in the bottom degree.

Theorem 5.3.

Every 1-transform of n3n^{3} is strictly above the bottom degree for cubics.

Proof.

We will proceed by contradiction. Assume that there is some 1-transform of n3n^{3} which is equal to the bottom degree. From the definition of the weight product, all 1-transforms of n3n^{3} are of the form (an+b)3(an+b)^{3}, with a,b,b<a,a0a,b\in\mathbb{N},b<a,a\neq 0. To say that such a function is in the bottom degree is to say that all other cubic polynomials ff satisfy f(an+b)3\langle f\rangle\geq\langle(an+b)^{3}\rangle. In particular, we can choose f to be a 3-transform of n3n^{3}, say f=αn3f=\alpha\otimes n^{3} for some weight α\alpha with three nonzero entries.

This means that we have Sn0((an+b)3)=βSm0(αn3)S^{n_{0}}((an+b)^{3})=\beta\otimes S^{m_{0}}(\alpha\otimes n^{3}).

A bit of algebra gives us that βSm0(αn3)=Sm(βαn3)=Sm(γn3)\beta\otimes S^{m_{0}}(\alpha\otimes n^{3})=S^{m}(\beta\otimes\alpha\otimes n^{3})=S^{m}(\gamma\otimes n^{3}), where m=m0|β|m=m_{0}*|\beta| and γ=βα\gamma=\beta\otimes\alpha is the composition of weights (Lemma 12, [3]). The key thing to note here is that because α\alpha had three nonzero entries, γ\gamma also must have at least three nonzero entries (The composition of a kk-transform with an ll-transform is a klkl-transform). Let |γ|=j|\gamma|=j.

Therefore we have:

Sn0((an+b)3)=Sm(γn3)S^{n_{0}}((an+b)^{3})=S^{m}(\gamma\otimes n^{3})

and thus

(an+b+an0)3=i=0j1ci(jn+i+jm)3(an+b+an_{0})^{3}=\sum_{i=0}^{j-1}c_{i}(jn+i+jm)^{3}

where at least 3 of the cisc_{i}^{\prime}s are nonzero. We can simplify this equation with the goal of getting just n3n^{3} on the left hand side. First we multiply both sides by j3j^{3}, as well as multiplying the right hand side by a3a3\frac{a^{3}}{a^{3}} and distributing the top a3a^{3} to the inside of the parentheses. This gives:

(ajn+jb+ajn0)3=i=0j1j3a3ci(ajn+ai+ajm)3(ajn+jb+ajn_{0})^{3}=\sum_{i=0}^{j-1}\frac{j^{3}}{a^{3}}c_{i}(ajn+ai+ajm)^{3}

We can replace nn with naj\frac{n}{aj} to obtain:

(n+jb+ajn0)3=i=0j1j3a3ci(n+ai+ajm)3(n+jb+ajn_{0})^{3}=\sum_{i=0}^{j-1}\frac{j^{3}}{a^{3}}c_{i}(n+ai+ajm)^{3}

Now replace nn with njbajn0n-jb-ajn_{0}:

n3=i=0j1j3a3ci(n+ai+ajmjbajn0)3n^{3}=\sum_{i=0}^{j-1}\frac{j^{3}}{a^{3}}c_{i}(n+ai+ajm-jb-ajn_{0})^{3}

Replace nn with ajnajn, and then divide both sides by a3j3a^{3}j^{3} (this leaves the left hand side unchanged):

n3=i=0j11a6ci(ajn+ai+ajmjbajn0)3n^{3}=\sum_{i=0}^{j-1}\frac{1}{a^{6}}c_{i}(ajn+ai+ajm-jb-ajn_{0})^{3}

Finally, define cak=cka6c^{\prime}_{ak}=\frac{c_{k}}{a^{6}} for all k<jk<j and 0 otherwise, and let ajmjbajn0=tajm-jb-ajn_{0}=t.

Then we have:

n3=i=0aj1ci(ajn+i+t)3n^{3}=\sum_{i=0}^{aj-1}c^{\prime}_{i}(ajn+i+t)^{3}

Now the right hand side is clearly equal to a weight transformation of (n+t)3(n+t)^{3}, and since at least three of the cic_{i}’s were nonzero, at least three of the cic^{\prime}_{i}’s will be nonzero. This would mean that n3n^{3} is equal to a function which is in the bottom degree for cubics, by Thm 5.1. However, it has been shown that this is not the case in (Theorem 23, [4]), therefore this contradiction means that (an+b)3\langle(an+b)^{3}\rangle is not equal to the bottom degree. ∎

So now we have shown that all of the 1-transforms of n3n^{3} are strictly above the bottom degree. We still need to consider how they relate to each other. The following lemma will give us a step in that direction by narrowing down the amount of 1-transforms we need to consider.

Lemma 5.4.

Suppose aa is relatively prime to bb. Then (an+b)3(an+1)3\langle(an+b)^{3}\rangle\equiv\langle(an+1)^{3}\rangle.

Proof.

We prove this via double inequality. One direction is trivial:

(an+b)3(abn+b)3=b3(an+1)3(an+1)3\langle(an+b)^{3}\rangle\geq\langle(abn+b)^{3}\rangle=\langle b^{3}(an+1)^{3}\rangle\equiv\langle(an+1)^{3}\rangle

The other direction requires a bit more work. First, we prove the following: For all i>1Ni>1\in{N}, (an+bi)3(an+b)3\langle(an+b^{i})^{3}\rangle\geq\langle(an+b)^{3}\rangle

Proof: (an+bi)3(abi1n+bi)3=b3(i1)(an+b)3(an+b)3\langle(an+b^{i})^{3}\rangle\geq\langle(ab^{i-1}n+b^{i})^{3}\rangle=\langle b^{3(i-1)}(an+b)^{3}\rangle\equiv\langle(an+b)^{3}\rangle

Now we want to show that for some ii, (an+1)3(an+bi)3\langle(an+1)^{3}\rangle\equiv\langle(an+b^{i})^{3}\rangle. Since bb is relatively prime to aa, it is invertible in the multiplicative group a\mathbb{Z}^{*}_{a}, which means for some ii, bi1b^{i}\equiv 1 mod aa. So there exists some number mm with am+1=biam+1=b^{i}. Then we have:

(an+1)3(a(n+m)+1)3=(an+am+1)3=(an+bi)3\langle(an+1)^{3}\rangle\equiv\langle(a(n+m)+1)^{3}\rangle=\langle(an+am+1)^{3}\rangle=\langle(an+b^{i})^{3}\rangle

Since we proved that (an+bi)3(an+b)3\langle(an+b^{i})^{3}\rangle\geq\langle(an+b)^{3}\rangle earlier, this completes the proof. ∎

Note that in the case that a,ba,b are not relatively prime, we can simply divide out by their common factor and reduce. For example, (4n+2)3=8(2n+1)3(2n+1)3\langle(4n+2)^{3}\rangle=\langle 8(2n+1)^{3}\rangle\equiv\langle(2n+1)^{3}\rangle.

This means that the degrees of all 1-transforms can be represented by some (an+1)3(an+1)^{3}. The only question now is whether or not some (or all) of these degrees are equal, and how they compare to each other. The following theorem answers this question definitively.

Theorem 5.5.

(an+1)3(bn+1)3\langle(an+1)^{3}\rangle\geq\langle(bn+1)^{3}\rangle if and only if a|ba|b.

Proof.

The reverse implication is relatively straightforward: if a|ba|b, then let b=acb=ac and we have (an+1)3(acn+1)3(bn+1)3\langle(an+1)^{3}\rangle\geq\langle(acn+1)^{3}\rangle\equiv\langle(bn+1)^{3}\rangle.

Now for the forward implication. The key observation is that if (an+1)3(bn+1)3\langle(an+1)^{3}\rangle\geq\langle(bn+1)^{3}\rangle, this means that there exists n0,m0n_{0},m_{0}, and a weight α\alpha such that Sn0(bn+1)3=Sm0(α(an+1)3)S^{n_{0}}(bn+1)^{3}=S^{m_{0}}(\alpha\otimes(an+1)^{3}), and this weight α\alpha must have only one nonzero entry. Otherwise, the right hand side of the previous equation would be an mm-transform of n3n^{3} for m2m\geq 2, which (by Thms 5.1 and 5.2) would put it in the bottom degree. ((an+1)3(an+1)^{3} is a 1-transform of n3n^{3}, and an mm-transform of a 1-transform is also an mm-transform of the original function, via weight product composition.) To say that α\alpha must have only one nonzero entry means that α(an+1)3\alpha\otimes(an+1)^{3} has the form (a(cn+d)+1)3=(acn+ad+1)3(a(cn+d)+1)^{3}=(acn+ad+1)^{3}. Plugging this into the above equation means that

(bn+1+bn0)3=(acn+ad+1+acm0)3(bn+1+bn_{0})^{3}=(acn+ad+1+acm_{0})^{3}

This means that ac=bac=b and therefore a|ba|b. ∎

This theorem also gives us an interesting corollary that is somewhat counterintuitive.

Corollary 5.6.

Suppose fg\langle f\rangle\equiv\langle g\rangle, and let α\alpha be a weight. Then it is possible that αfαg\langle\alpha\otimes f\rangle\not\equiv\langle\alpha\otimes g\rangle.

Proof.

Let f=n3f=n^{3}, g=(n+1)3g=(n+1)^{3}, α=1,0,0\alpha=\langle 1,0,0\rangle. Then αf=(2n)3n3\langle\alpha\otimes f\rangle=\langle(2n)^{3}\rangle\equiv\langle n^{3}\rangle, but αg=(2n+1)3\alpha\otimes g=(2n+1)^{3}, and n3>(2n+1)3\langle n^{3}\rangle>\langle(2n+1)^{3}\rangle by the theorem.

6 Conclusion

What Theorem 5.5 tells us is that the structure of the degrees of 1-transforms of n3n^{3} is that of a divisibility lattice, with n3\langle n^{3}\rangle at the top, (pn+1)3\langle(pn+1)^{3}\rangle in the second row for all primes pp, etc, and below this lattice is the bottom degree. The results of Theorems 4.5, 5.1, and 5.2 tell us that these are the only degrees that are below the degree of n3n^{3}, and thus the structure of all transducer degrees below n3n^{3} is given by the following picture (we remove the notation p(n)\langle p(n)\rangle used for degrees to make the image cleaner):

\Treen3n^{3}(2n+1)3(2n+1)^{3}(4n+1)3(4n+1)^{3}(6n+1)3(6n+1)^{3}(3n+1)3(3n+1)^{3}\cdots(9n+1)3(9n+1)^{3}(pn+1)3\cdots(pn+1)^{3}\cdots(2pn+1)3\cdots(2pn+1)^{3}\cdots(3pn+1)3\cdots(3pn+1)^{3}\cdots(qpn+1)3\cdots(qpn+1)^{3}\cdots
\Treep3(n)p_{3}(n)

p3(n)\langle p_{3}(n)\rangle = the bottom degree for cubics (includes all 2-transforms of n3n^{3} and all kk-transforms of (n+t)3(n+t)^{3} for all k3k\geq 3 and all tt\in\mathbb{N})

Therefore all degrees below n3n^{3} have been classified. It remains an open question whether or not there are other cubic polynomials that are either above n3n^{3} or incomparable with n3n^{3}. The classification of higher order polynomial degrees is also unknown, but many of the methods used in this paper would be useful to answer such questions.

References

  • [1] Jörg Endrullis, Dimitri Hendriks and Jan Willem Klop “Degrees of Streams” Proceedings of the Leiden Numeration Conference 2010 In Journal of Integers 11B.A6, 2011, pp. 1–40
  • [2] Jörg Endrullis, Clemens Grabmayer, Dimitri Hendriks and Hans Zantema “The Degree of Squares is an Atom” In Proc. Conf. on Combinatorics on Words (WORDS 2015) 9304, LNCS Springer, 2015, pp. 109–121 DOI: 10.1007/978-3-319-23660-5“˙10
  • [3] Jörg Endrullis, Juhani Karhumäki, Jan Willem Klop and Aleksi Saarela “Degrees of Infinite Words, Polynomials and Atoms” In Proc. Conf. on Developments in Language Theory (DLT 2016) 9840, LNCS Springer, 2016, pp. 164–176 DOI: 10.1007/978-3-662-53132-7“˙14
  • [4] Jörg Endrullis, Juhani Karhumäki, Jan Willem Klop and Aleksi Saarela “Degrees of Infinite Words, Polynomials and Atoms” In International Journal of Foundations of Computer Science 29.5, 2018, pp. 825–843 DOI: 10.1142/S0129054118420066
  • [5] Jörg Endrullis, Jan Willem Klop and Rena Bakhshi “Transducer Degrees: Atoms, Infima and Suprema” In Acta Informatica 57.3-5, 2020, pp. 727–758 DOI: 10.1007/s00236-019-00353-7