This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Limit theorems for toral translations

Dmitry Dolgopyat  and  Bassam Fayad Dmitry Dolgopyat Department of Mathematics University of Maryland College Park MD 20814 USA dmitry@math.umd.edu Bassam Fayad IMJ-PRG, CNRS, Paris, France bassam.fayad@imj-prg.fr
The authors thank Mariusz Lemańczyk, Jens Marklof, Jean Paul Thouvenot, and Ilya Vinogradov for useful discussions. The research of DD was partially supported by the NSF. The research of BF was partially supported by ANR-11-BS01-0004.

1. Introduction

One of the surprising discoveries of dynamical systems theory is that many deterministic systems with non-zero Lyapunov exponents satisfy the same limit theorems as the sums of independent random variables. Much less is known for the zero exponent case where only a few examples have been analyzed. In this survey we consider the extreme case of toral translations where each map not only has zero exponents but is actually an isometry. These systems were studied extensively due to their relations to number theory, to the theory of integrable systems and to geometry. Surprisingly many natural questions are still open. We review known results as well as the methods to obtain them and present a list of open problems. Given a vast amount of work on this subject, it is impossible to provide a comprehensive treatment in this short survey. Therefore we treat the topics closest to our research interests in more detail while some other subjects are mentioned only briefly. Still we hope to provide the reader with the flavor of the subject and introduce some tools useful in the study of toral translations, most notably, various renormalization techniques.

Let X=𝕋d,X={\mathbb{T}}^{d}, μ\mu be the Haar measure on XX and Tα(x)=x+α.T_{\alpha}(x)=x+\alpha.

The most basic question in smooth ergodic theory is the behavior of ergodic sums. Given a map TT and a zero mean observable A()A(\cdot) let

(1) AN(x)=n=0N1A(Tnx)A_{N}(x)=\sum_{n=0}^{N-1}A(T^{n}x)

If there is no ambiguity, we may write ANA_{N} for AN(x)A_{N}(x). Conversely we may use the notation AN(α,x)A_{N}(\alpha,x) to indicate that the underlying map is the translation of vector α\alpha. The uniform distribution of the orbit of xx by TT is characterized by the convergence to 0 of AN(x)/NA_{N}(x)/N. In the case of toral translations TαT_{\alpha} with irrational frequency vector α\alpha the uniform distribution holds for all points x.x. The study of the ergodic sums is then useful to quantify the rate of uniform distribution of the Kronecker sequence nαn\alpha mod 1 as we will see in Section 3 where discrepancy functions are discussed. The question about the distribution of ergodic sums is analogous to the the Central Limit Theorem in probability theory. One can also consider analogues of other classical probabilistic results. In this survey we treat two such questions. In Section 4 we consider so called Poisson regime where (1) is replaced by n=0N1χ𝒞N(Tαnx)\sum_{n=0}^{N-1}\chi_{{\mathcal{C}}_{N}}(T_{\alpha}^{n}x) and the sets 𝒞N{\mathcal{C}}_{N} are scaled in such a way that only finite number of terms are non-zero for typical x.x. Such sums appear in several questions in mathematical physics, including quantum chaos [91] and Boltzmann-Grad limit of several mechanical systems [93]. They also describe the resonances in the study of ergodic sums for toral translations as we will see in Section 5. In Section 6 we consider Borel-Cantelli type questions where one takes a sequence of shrinking sets and studies a number of times a typical orbit hits whose sets. These questions are intimately related to some classical problems in the theory of Diophantine approximations.

The ergodic sums above toral translations also appear in natural dynamical systems such as skew products, cylindrical cascades and special flows. Discrete time systems related to ergodic sums over translations are treated in Section 7 while flows are treated in Section 8. These systems give additional motivation to study the ergodic sums (1) for smooth functions having singularities of various types: power, fractional power, logarithmic… Ergodic sums for functions with singularities are discussed in Section 2. Finally in Section 9 we present the results related to action of several translations at the same time.

Notations.

We say that a vector α=(α1,,αd)d\alpha=(\alpha_{1},\ldots,\alpha_{d})\in\mathbb{R}^{d} is irrational if {1,α1,,αd}\{1,\alpha_{1},\ldots,\alpha_{d}\} are linearly independent over \mathbb{Q}.

For xdx\in\mathbb{R}^{d}, we use the notation {x}:=(x1,,xd) mod (1)\{x\}:=(x_{1},\ldots,x_{d})\text{ mod }(1). We denote by x\|x\| the closest signed distance of some xx\in\mathbb{R} to the integers.

Assuming that dd\in\mathbb{N} is fixed, for σ>0\sigma>0 we denote by 𝒟(σ)d{\mathcal{D}}(\sigma)\subset\mathbb{R}^{d} the set of Diophantine vectors with exponent σ\sigma, that is

(2) 𝒟(σ)={α:Ckd0,m|(k,α)m|C|k|dσ}{\mathcal{D}}(\sigma)=\{\alpha:\exists C\;\;\forall k\in{\mathbb{Z}}^{d}-0,m\in{\mathbb{Z}}\quad|(k,\alpha)-m|\geq C|k|^{-d-\sigma}\}

Let us recall that 𝒟(σ){\mathcal{D}}(\sigma) has a full measure if σ>0\sigma>0, while 𝒟(0){\mathcal{D}}(0) is an uncountable set of zero measure and 𝒟(σ){\mathcal{D}}(\sigma) is empty for σ<0\sigma<0. The set 𝒟(0){\mathcal{D}}(0) is called the set of constant type vector or badly approximable vectors. An irrational vector α\alpha that is not Diophantine for any σ>0\sigma>0 is called Liouville.

We denote by {\mathfrak{C}} the standard Cauchy random variable with density 1π(1+x2).\frac{1}{\pi(1+x^{2})}. The normal random variable with zero mean and variance D2D^{2} will be denoted by 𝔑(D2).{\mathfrak{N}}(D^{2}). Thus 𝔑(D2){\mathfrak{N}}(D^{2}) has density 12πDex2/2D2.\frac{1}{2\pi D}e^{-x^{2}/2D^{2}}. We will write simply 𝔑{\mathfrak{N}} for 𝔑(1).{\mathfrak{N}}(1). Next, 𝔓(X,μ){\mathfrak{P}}(X,\mu) will denote the Poisson process on XX with measure μ\mu (we refer the reader to Section 5.1 for the definition and basic properties of Poisson processes).

2. Ergodic sums of smooth functions with singularities

2.1. Smooth observables

For toral translations, the ergodic sums of smooth observables are well understood. Namely if AA is sufficiently smooth with zero mean then for almost all α,\alpha, AA is a coboundary, that is, there exists B(α,x)B(\alpha,x) such that

(3) A(x)=B(x+α,α)B(x,α).A(x)=B(x+\alpha,\alpha)-B(x,\alpha).

Namely if A(x)=k0ake2πi(k,x)A(x)=\sum_{k\neq 0}a_{k}e^{2\pi i(k,x)} then

B(α,x)=k0bke2πi(k,x) where bk=akei2π(k,α)1.B(\alpha,x)=\sum_{k\neq 0}b_{k}e^{2\pi i(k,x)}\text{ where }b_{k}=\frac{a_{k}}{e^{i2\pi(k,\alpha)}-1}.

The above series converges in L2L^{2} provided α𝒟(σ)\alpha\in{\mathcal{D}}(\sigma) and Aσ={A:k|ak|k|(σ+d)|2<}.A\in{\mathcal{H}}^{\sigma}=\{A:\sum_{k}|a_{k}|k|^{(\sigma+d)}|^{2}<\infty\}. Note that (3) implies that

AN(x,α)=B(x+Nα,α)B(x,α)A_{N}(x,\alpha)=B(x+N\alpha,\alpha)-B(x,\alpha)

giving a complete description of the behavior of ergodic sums for almost all α.\alpha. In particular we have

Corollary 1.

If α\alpha is uniformly distributed on 𝕋d{\mathbb{T}}^{d} then AN(x)A_{N}(x) has a limiting distribution as N,N\to\infty, namely

ANB(y,α)B(x,α)A_{N}\Rightarrow B(y,\alpha)-B(x,\alpha)

where (y,α)(y,\alpha) is uniformly distributed on 𝕋d×𝕋d.{\mathbb{T}}^{d}\times{\mathbb{T}}^{d}.

Proof.

We need to show that as NN\to\infty the random vector (α,Nα)(\alpha,N\alpha) converge to a vector with coordinates independent random variables uniformly distributed on 𝕋d×𝕋d.{\mathbb{T}}^{d}\times{\mathbb{T}}^{d}. To this end it suffices to check that if ϕ(x,y)\phi(x,y) is a smooth function on 𝕋d×𝕋d{\mathbb{T}}^{d}\times{\mathbb{T}}^{d} then

limN𝕋dϕ(α,Nα)𝑑α=𝕋d×𝕋dϕ(α,β)𝑑α𝑑β\lim_{N\to\infty}\int_{{\mathbb{T}}^{d}}\phi(\alpha,N\alpha)d\alpha=\int_{{\mathbb{T}}^{d}\times{\mathbb{T}}^{d}}\phi(\alpha,\beta)d\alpha d\beta

but this is easily established by considering the Fourier series of ϕ.\phi.

We will see in Section 8 how our understanding of ergodic sums for smooth functions can be used to derive ergodic properties of area preserving flows on 𝕋2\mathbb{T}^{2} without fixed points.

On the other hand there are many open questions related to the case when the observable AA is not smooth enough for (3) to hold. Below we mention several classes of interesting observables.

2.2. Observables with singularities

Special flows above circle rotations and under ceiling functions that are smooth except for some singularities naturally appear in the study of conservative flows on surfaces with fixed points.

Another motivation for studying ergodic sums for functions with singularities is the case of meromorphic functions, whose sums appear in questions related to both number theory [48] and ergodic theory [106].

2.2.1. Observables with logarithmic singularities.

In the study of conservative flows on surfaces, non degenerate saddle singularities are responsible for logarithmic singularities of the ceiling function.

Ceiling functions with logarithmic singularities also appear in the study of multi-valued Hamiltonians on the two torus. In [3], Arnold investigated such flows and showed that the torus decomposes into cells that are filled up by periodic orbits and one open ergodic component. On this component, the flow can be represented as a special flow over an interval exchange map of the circle and under a ceiling function that is smooth except for some logarithmic singularities. The singularities can be asymmetric since the coefficient in front of the logarithm is twice as big on one side of the singularity as the one on the other side, due to the existence of homoclinic loops (see Figure 1).

Refer to caption
Figure 1. Multivalued Hamiltonian flow. Note that the orbits passing to the left of the saddle spend approximately twice longer time comparing to the orbits passing to the right of the saddle and starting at the same distance from the separatrix since they pass near the saddle twice.

More motivations for studying function with logarithmic singularities as well as some numerical results for rotation numbers of bounded type are presented in [69].

A natural question is to understand the fluctuations of the ergodic sums for these functions as the frequency α\alpha of the underlying rotation is random as well as the base point xx. Since Fourier coefficients of the symmetric logarithm function have the asymptotics similar to that of the indicator function of an interval one may expect that the results about the latter that we will discuss in Section 3 can be extended to the former.

Question 1.

Suppose that AA is smooth away from a finite set of points x1,x2xkx_{1},x_{2}\dots x_{k} and near xj,x_{j}, A(x)=aj±ln|xxj|+rj(x)A(x)=a_{j}^{\pm}\ln|x-x_{j}|+r_{j}(x) where - sign is taken if x<xj,x<x_{j}, ++ sign is taken if x>xjx>x_{j} and rjr_{j} are smooth functions. What can be said about the distribution of AN(α,x)/lnNA_{N}(\alpha,x)/\ln N as xx and α\alpha are random?

2.2.2. Observables with power like singularities.

When considering conservative flows on surfaces with degenerate saddles one is led to study the ergodic sums of observables with integrable power like singularities (more discussion of these flows will be given in Section 8). Special flows above irrational rotations of the circle under such ceiling functions are called Kocergin flows.

The study of ergodic sums for smooth ergodic flows with nondegenerate hyperbolic singular points on surfaces of genus p2p\geq 2 shows that these flows are in general not mixing (see Section 8). A contrario Kocergin showed that special flows above irrational rotations and under ceiling functions with integrable power like singularities are always mixing. This is due to the important deceleration next to the singularity that is responsible for a shear along orbits that separates the points sufficiently to produce mixing. In other words, the mixing is due to large oscillations of the ergodic sums. In this note we will be frequently interested in the distribution properties of these sums.

One may also consider the case of non-integrable power singularities since they naturally appear in problems of ergodic theory and number theory. The following result answers a question of [48].

Theorem 2.

([120]) If AA has one simple pole on 𝕋1{\mathbb{T}}^{1} and (α,x)(\alpha,x) is uniformly distributed on 𝕋2{\mathbb{T}}^{2} then ANN\frac{A_{N}}{N} has a limiting distribution as N.N\to\infty.

The function AA in Theorem 2 has a symmetric singularity of the form 1/x1/x that is the source of cancellations in the ergodic sums.

Question 2.

What happens for an asymmetric singularity of the type 1/|x|1/|x|?

Question 3.

What happens in the quenched setting where α\alpha is fixed?

We now present several generalizations of Theorem 2.

Theorem 3.

Let A=A~(x)+cχx<x0+c+χx>x0|xx0|aA=\tilde{A}(x)+\frac{c_{-}\chi_{x<x_{0}}+c^{+}\chi_{x>x_{0}}}{|x-x_{0}|^{a}} where A~\tilde{A} is smooth and a>1.a>1.

(a) If (α,x)(\alpha,x) is uniformly distributed on 𝕋2{\mathbb{T}}^{2} then ANNa\dfrac{A_{N}}{N^{a}} converges in distribution.

(b) For almost every xx fixed, if α\alpha is uniformly distributed on 𝕋{\mathbb{T}} then AN(α,x)Na\dfrac{A_{N}(\alpha,x)}{N^{a}} converge to the same limit as in part (a).

Theorem 4.

[87] If AA has zero mean and is smooth except for a singularity at 0 of type |x|a|x|^{-a}, a(0,1)a\in(0,1) then AN/NaA_{N}/N^{a} converges in distribution.

The proof of Theorem 4 is inspired by the proof of Theorem 10 of Section 3 which will be presented in Section 5.5.

Marklof proved in [92] that if α𝒟(σ)\alpha\in{\mathcal{D}}(\sigma) with σ<(1a)/a\sigma<(1-a)/a, then for AA as in Theorem 4 AN(α,α)/N0A_{N}(\alpha,\alpha)/N\to 0.

Question 4.

What happens for other angles α\alpha and other type of singularities, including the non integrable ones for which the ergodic theorem does not necessarily hold?

Another natural generalization of Theorem 2 is to consider meromorphic functions. Let AA be such a function with highest pole of order m.m. Thus AA can be written as

A(x)=j=1rcj(xxj)m+A~(x)A(x)=\sum_{j=1}^{r}\frac{c_{j}}{(x-x_{j})^{m}}+{\tilde{A}}(x)

where the highest pole of A~{\tilde{A}} has order at most m1.m-1.

Theorem 5.

(a) Let AA be fixed and let α\alpha be distributed according to a smooth density on 𝕋.\mathbb{T}. Then for any x𝕋x\in\mathbb{T}, AN(α,x)Nm\dfrac{A_{N}(\alpha,x)}{N^{m}} has a limiting distribution as N.N\to\infty.

(b) Let A~,c1,cr{\tilde{A}},c_{1},\dots c_{r} be fixed while (α,x,x1xr)(\alpha,x,x_{1}\dots x_{r}) are distributed according to a smooth density on on 𝕋r+2{\mathbb{T}}^{r+2} then AN(α,x)Nm\dfrac{A_{N}(\alpha,x)}{N^{m}} has a limiting distribution as N.N\to\infty.

(c) If (x1,x2xr)(x_{1},x_{2}\dots x_{r}) is a fixed irrational vector then for almost every x𝕋x\in\mathbb{T} the limit distribution in part (a) is the same as the limit distribution in part (b).

Proofs of Theorems 3 and 5 are sketched in Section 5.

It will be apparent from the proof of Theorem 5 that the limit distribution in part (a) is not the same for all x1,x2xj.x_{1},x_{2}\dots x_{j}. For example if xj=jx1x_{j}=jx_{1} we get an exceptional distribution since a close approach to x1x_{1} and x2x_{2} by the orbit of xx should be followed by a close approach to xjx_{j} for j3.j\geq 3. We will see that this phenomenon appears in many limit theorems (see e.g Theorem 9, Theorem 25 and Question 52, Theorem 38 and Question 38, as well as [93]).

Question 5.

What can be said about more general meromorphic functions such as sin2πx/(sin2πx+3cos2πy)\sin 2\pi x/(\sin 2\pi x+3\cos 2\pi y) on 𝕋d\mathbb{T}^{d} with d>1?d>1?

3. Ergodic sums of characteristic functions. Discrepancies

The case where A=χΩA=\chi_{\Omega} is a classical subject in number theory. Define the discrepancy function

DN(Ω,α,x)=n=0N1χΩ(x+nα)NVol(Ω)Vol(𝕋d).D_{N}(\Omega,\alpha,x)=\sum_{n=0}^{N-1}\chi_{\Omega}(x+n\alpha)-N\frac{\text{Vol}(\Omega)}{\text{Vol}({\mathbb{T}}^{d})}.

Uniform distribution of the sequence x+kαx+k\alpha on 𝕋d\mathbb{T}^{d} is equivalent to the fact that, for regular sets Ω,\Omega, DN(Ω,α,x)/N0D_{N}(\Omega,\alpha,x)/N\to 0 as NN\to\infty. A step further in the description of the uniform distribution is the study of the rate of convergence to 0 of DN(Ω,α,x)/ND_{N}(\Omega,\alpha,x)/N.

In d=1d=1 it is known that if α𝕋\alpha\in\mathbb{T}-\mathbb{Q} is fixed, the discrepancy DN(Ω,α,x)/ND_{N}(\Omega,\alpha,x)/N displays an oscillatory behavior according to the position of NN with respect to the denominators of the best rational approximations of α\alpha. A great deal of work in Diophantine approximation has been done on estimating the discrepancy function in relation with the arithmetic properties of α𝕋\alpha\in\mathbb{T}, and more generally for α𝕋d\alpha\in\mathbb{T}^{d}.

3.1. The maximal discrepancy

Let

(4) D¯N(α)=supΩ𝔹DN(Ω,α,0)\overline{D}_{N}(\alpha)=\sup_{\Omega\in\mathbb{B}}D_{N}(\Omega,\alpha,0)

where the supremum is taken over all sets Ω\Omega in some natural class of sets 𝔹\mathbb{B}, for example balls or boxes (product of intervals).

The case of (straight) boxes was extensively studied, and growth properties of the sequence D¯N(α)\overline{D}_{N}(\alpha) were obtained with a special emphasis on their relations with the Diophantine approximation properties of α.\alpha. In particular, following earlier advances of [75, 48, 99, 64, 117] and others, [8] proves

Theorem 6.

Let

D¯N(α)=supΩboxDN(Ω,α,0)\overline{D}_{N}(\alpha)=\sup_{\Omega-\text{box}}D_{N}(\Omega,\alpha,0)

Then for any positive increasing function ϕ\phi we have

(5) nϕ(n)1<D¯N(α)(lnN)dϕ(lnlnN) is bounded for almost every α𝕋d.\hskip 11.38092pt\sum_{n}\phi(n)^{-1}<\infty\iff\frac{\overline{D}_{N}(\alpha)}{(\ln N)^{d}\phi(\ln\ln N)}\begin{array}[]{l}\text{ is bounded for}\\ \text{ almost every }\alpha\in\mathbb{T}^{d}.\end{array}

In dimension d=1d=1, this result is the content of Khinchine theorems obtained in the early 1920’s [64], and it follows easily from well-known results from the metrical theory of continued fractions (see for example the introduction of [8]). The higher dimensional case is significantly more difficult and the cited bound was only obtained in the 1990s.

The bound in (5) focuses on how bad can the discrepancy become along a subsequence of NN, for a fixed α\alpha in a full measure set. In a sense, it deals with the worst case scenario and does not capture the oscillations of the discrepancy. On the other hand, the restriction on α\alpha is necessary, since given any εn0{\varepsilon}_{n}\to 0 it is easy to see that for α𝕋\alpha\in\mathbb{T} sufficiently Liouville, the discrepancy (relative to intervals) can be as bad as NnεnN_{n}{\varepsilon}_{n} along a suitable sequence NnN_{n} (large multiples of Liouville denominators).

For d=1d=1, it is not hard to see, using continued fractions, that for any α\alpha : lim supD¯N(α)lnN>0\limsup\frac{\overline{D}_{N}(\alpha)}{\ln N}>0, lim infD¯N(α)C\liminf\overline{D}_{N}(\alpha)\leq C; and for α𝒟(0)\alpha\in\mathcal{D}(0) lim supD¯N(α)lnN<+\limsup\frac{\overline{D}_{N}(\alpha)}{\ln N}<+\infty. The study of higher dimensional counterparts to these results raises several interesting questions.

Question 6.

Is it true that lim supD¯N(α)lndN>0\limsup\frac{\overline{D}_{N}(\alpha)}{\ln^{d}N}>0 for all α𝕋d\alpha\in{\mathbb{T}}^{d}?

Question 7.

Is it true that there exists α\alpha such that lim supD¯N(α)lndN<+\limsup\frac{\overline{D}_{N}(\alpha)}{\ln^{d}N}<+\infty?

Question 8.

What can one say about lim infD¯N(α)aN\liminf\frac{\overline{D}_{N}(\alpha)}{a_{N}} for a.e. α\alpha, where aNa_{N} is an adequately chosen normalization? for every α\alpha?

Question 9.

Same questions as Questions 68 when boxes are replaced by balls.

Question 10.

Same questions as Questions 68 for the isotropic discrepancy, when boxes are replaced by the class of all convex sets [79].

3.2. Limit laws for the discrepancy as α\alpha is random

In this survey, we will mostly concentrate on the distribution of the discrepancy function as α\alpha is random. The above discussion naturally raises the following question.

Question 11.

Let α\alpha be uniformly distributed on 𝕋d.\mathbb{T}^{d}. Is it true that D¯N(α)lndN\frac{\overline{D}_{N}(\alpha)}{\ln^{d}N} converges in distribution as NN\to\infty?

Why do we need to take α\alpha random? The answer is that for fixed α\alpha the discrepancy does not have a limit distribution, no water which normalization is chosen.

For example for d=1d=1 the Denjoy-Koksma inequality says that

|AqnqnA(x)𝑑x|2V|A_{q_{n}}-q_{n}\int A(x)dx|\leq 2V

where qnq_{n} is the nn-th partial convergent to α\alpha and VV denotes the total variation of A.A. In particular Dqn(I,α,x)D_{q_{n}}(I,\alpha,x) can take at most 3 values.

In higher dimensions one can show that if Ω\Omega is either a box or any other strictly convex set then for almost all α\alpha and almost all tori, when xx is random the variable

DN(Ω,d/L,α,)aN\frac{D_{N}(\Omega,{\mathbb{R}}^{d}/L,\alpha,\cdot)}{a_{N}}

does not converge to a non-trivial limiting distribution for any choice of aN=aN(α,L)a_{N}=a_{N}(\alpha,L) (see discussion in the introduction of [29]).

Question 12.

Is this true for all α,L\alpha,L?

Question 13.

Study the distributions which can appear as weak limits of DN(Ω,α,)aN\dfrac{D_{N}(\Omega,\alpha,\cdot)}{a_{N}}, in particular their relation with number theoretic properties of α.\alpha.

Let us consider the case d=1d=1 (so the sets of interest are intervals and we will write II instead of Ω\Omega.) It is easy to see that all limit distributions are atomic for all II iff α.\alpha\in{\mathbb{Q}}.

Question 14.

Is it true that all limit distributions are either atomic or Gaussian for almost all II iff α\alpha is of bounded type?

Evidence for the affirmative answer is contained in the following results.

Theorem 7.

([55]) If α\alpha\not\in{\mathbb{Q}} and I=[0,1/2]I=[0,1/2] then there is a sequence NjN_{j} such that DNj(I,α,)j\dfrac{D_{N_{j}}(I,\alpha,\cdot)}{j} converges to 𝔑.{\mathfrak{N}}.

Instead of considering subsequences, it is possible to randomize N.N.

Theorem 8.

Let α\alpha be a quadratic surd.

(a) ([10]) If (x,a,l)(x,a,l) is uniformly distributed on 𝕋3{\mathbb{T}}^{3} then D[aN]([0,l],α,x)lnN\dfrac{D_{[aN]}([0,l],\alpha,x)}{\sqrt{\ln N}} converges to 𝔑(σ2){\mathfrak{N}}(\sigma^{2}) for some σ20.\sigma^{2}\neq 0.

(b) ([11]) If MM is uniformly distributed on [1,N][1,N] and ll is rational then there are constants C(α,l),σ(α,l)C(\alpha,l),\sigma(\alpha,l) such that DM([0,l],α,0)C(α,l)lnNlnN\dfrac{D_{M}([0,l],\alpha,0)-C(\alpha,l)\ln N}{\sqrt{\ln N}} converges to 𝔑(σ2(α,l)).{\mathfrak{N}}(\sigma^{2}(\alpha,l)).

Note that even though we have normalized the discrepancy by subtracting the expected value an additional normalization is required in Theorem 8(b). The reason for this is explained at the end of Section 8.5.

So if one wants to have a unique limit distribution for all NN one needs to allow random α.\alpha.

The case when d=1d=1 was studied by Kesten. Define

V(u,v,w)=k=1sin(2πu)sin(2πv)sin(2πw)k2.V(u,v,w)=\sum_{k=1}^{\infty}\frac{\sin(2\pi u)\sin(2\pi v)\sin(2\pi w)}{k^{2}}.

If (r,q)(r,q) are positive integers let

θ(r,q)=Card(j:0jq1:gcd(j,r,q)=1)Card(j,k:0j,kq1:gcd(j,k,q)=1).\theta(r,q)=\frac{{\mathrm{Card}}(j:0\leq j\leq q-1:gcd(j,r,q)=1)}{{\mathrm{Card}}(j,k:0\leq j,k\leq q-1:gcd(j,k,q)=1)}.

Finally let

c(r)={π312[r=0q1θ(p,q)0101V(u,rpq,v)𝑑u𝑑v]1if r=pq and gcd(p,q)=1π312[010101V(u,r,v)𝑑u𝑑r𝑑v]1if r is irrational.c(r)=\begin{cases}\frac{\pi^{3}}{12}\left[\sum_{r=0}^{q-1}\theta(p,q)\int_{0}^{1}\int_{0}^{1}V(u,\frac{rp}{q},v)dudv\right]^{-1}&\text{if }r=\frac{p}{q}\text{ and }gcd(p,q)=1\\ \frac{\pi^{3}}{12}\left[\int_{0}^{1}\int_{0}^{1}\int_{0}^{1}V(u,r,v)dudrdv\right]^{-1}&\text{if }r\text{ is irrational.}\end{cases}
Theorem 9.

([61, 62]) If (α,x)(\alpha,x) is uniformly distributed on 𝕋2{\mathbb{T}}^{2} then DN([0,l],α,x)c(l)lnN\frac{D_{N}([0,l],\alpha,x)}{c(l)\ln N} converges to the Cauchy distribution .{\mathfrak{C}}.

Note that the normalizing factor is discontinuous as a function of the length of the interval at rational values.

A natural question is to extend Theorem 9 to higher dimensions. The first issue is to decide which sets Ω\Omega to consider instead of intervals. It appears that a quite flexible assumption is that Ω\Omega is semialgebraic, that is, it is defined by a finite number of algebraic inequalities.

Question 15.

Suppose that Ω\Omega is semialgebraic then there is a sequence aN=aN(Ω)a_{N}=a_{N}(\Omega) such that for a random translation of a random torus DN(Ω,d/L,α,x)aN\dfrac{D_{N}(\Omega,{\mathbb{R}}^{d}/L,\alpha,x)}{a_{N}} converges in distribution as N.N\to\infty.

By random translation of a random torus, we mean a translation of random angle α\alpha on a torus d/L{\mathbb{R}}^{d}/L where L=AdL=A{\mathbb{Z}}^{d} and the triple (α,x,A)(\alpha,x,A) has a smooth density on 𝕋d×𝕋d×GL(,d){\mathbb{T}}^{d}\times{\mathbb{T}}^{d}\times\text{GL}(\mathbb{R},d). Notice that comparing to Kesten’s result of Theorem 9, Question 15 allows for additional randomness, namely, the torus is random. In particular, for d=1d=1, the study of the discrepancy of visits to [0,l][0,l] on the torus /{\mathbb{R}}/{\mathbb{Z}} is equivalent to the study of the discrepancy of visits to [0,1][0,1] on the torus /(l1).{\mathbb{R}}/(l^{-1}{\mathbb{Z}}). Thus the purpose of the extra randomness is to avoid the irregular dependence on parameters observed in Theorem 9 (cf. also [109, 110]).

So far Question 15 has been answered for two classes of sets which are natural counterparts to intervals in higher dimensions : strictly convex sets and (tilted) boxes.

Given a convex body Ω\Omega, we consider the family Ωr\Omega_{r} of bodies obtained from Ω\Omega by rescaling it with a ratio r>0r>0 (we apply to Ω\Omega the homothety centered at the origin with scale rr). We suppose r<r0r<r_{0} so that the rescaled bodies can fit inside the unit cube of d\mathbb{R}^{d}. We define

(6) DN(Ω,r,α,x)=n=0N1χΩr(x+nα)NVol(Ωr)D_{N}(\Omega,r,\alpha,x)=\sum_{n=0}^{N-1}\chi_{\Omega_{r}}(x+n\alpha)-N{\rm Vol}({\Omega_{r}})
Theorem 10.

([28]) If (r,α,x)(r,\alpha,x) is uniformly distributed on X=[a,b]×𝕋d×𝕋dX=[a,b]\times\mathbb{T}^{d}\times\mathbb{T}^{d} then DN(Ω,r,α,x)r(d1)/2N(d1)/2d\frac{D_{N}(\Omega,r,\alpha,x)}{r^{(d-1)/2}N^{(d-1)/2d}} has a limit distribution as N.N\to\infty.

The form of the limiting distribution is given in Theorem 18 in Section 5.

In the case of boxes we recover the same limit distribution as in Kesten but with a higher power of the logarithm in the normalization.

Theorem 11.

([29]) In the context of Question 15, if Ω\Omega is a box, then DNclndN\dfrac{D_{N}}{c\ln^{d}N} converges to {\mathfrak{C}} as N.N\to\infty.

Alternatively, one can consider gilded boxes, namely: for u=(u1,,ud)u=(u_{1},\ldots,u_{d}) with 0<ui<1/20<u_{i}<1/2 for every ii, we define a cube on the dd-torus by Cu=[u1,u1]×[ud,ud]C_{u}=[-u_{1},u_{1}]\times\ldots[-u_{d},u_{d}]. Let η>0\eta>0 and MCuMC_{u} be the image of CuC_{u} by a matrix MSL(d,)M\in{\rm SL}(d,\mathbb{R}) such that

M=(aij)Gη={|ai,i1|<η, for every i and |ai,j|<η for every ji}.M=(a_{ij})\in G_{\eta}=\{|a_{i,i}-1|<\eta,\text{ for every }i\text{ and }|a_{i,j}|<\eta\text{ for every }j\neq i\}.

For a point x𝕋dx\in\mathbb{T}^{d} and a translation frequency vector α𝕋d\alpha\in\mathbb{T}^{d} we denote ξ=(u,M,α,x)\xi=(u,M,\alpha,x) and define the following discrepancy function

DN(ξ)=#{1mN:(x+mα)mod 1MCu}2d(Πiui)N.D_{N}(\xi)=\#\{1\leq m\leq N:(x+m\alpha){\rm\ mod\ }1\in MC_{u}\}-2^{d}\left(\Pi_{i}u_{i}\right)N.

Fix dd segments [vi,wi][v_{i},w_{i}] such that 0<vi<wi<1/2i=1,,d0<v_{i}<w_{i}<1/2\forall i=1,\ldots,d. Let

(7) X=(u,α,x,(ai,j))[v1,w1]×[vd,wd]×𝕋2d×GηX=(u,\alpha,x,(a_{i,j}))\in[v_{1},w_{1}]\times\dots[v_{d},w_{d}]\times\mathbb{T}^{2d}\times G_{\eta}

We denote by {\mathbb{P}} the normalized restriction of the Lebesgue ×\times Haar measure on XX. Then, the precise statement of Theorem 11 is

Theorem 12.

([29]) Let ρ=1d!(2π)2d+2.\rho=\frac{1}{d!}\left(\frac{2}{\pi}\right)^{2d+2}. If ξ\xi is distributed according to λ\lambda then DN(ξ)ρ(lnN)d\frac{D_{N}(\xi)}{\rho(\ln N)^{d}} converges to {\mathfrak{C}} as N.N\to\infty.

Question 16.

Are Theorems 1012 valid if

(a) The lattice LL is fixed; (b) xx is fixed?

Question 17.

Describe large deviations for DN.D_{N}. That is, given bNaNb_{N}\gg a_{N} where aNa_{N} is the same as in Question 15, study the asymptotics of (DNbN).{\mathbb{P}}(D_{N}\geq b_{N}). One can study this question in the annealed setting when all variables are random or in the quenched setting where some of them are fixed.

Question 18.

Does a local limit theorem hold? That is, is it true that given a finite interval JJ we have

limNaN(DNJ)=c|J|?\lim_{N\to\infty}a_{N}{\mathbb{P}}(D_{N}\in J)=c|J|?

4. Poisson regime

The results presented in the last section deal with the so called CLT regime. This is the regime when, since the target set Ω\Omega is macroscopic (having volume of order 11), if TT was sufficiently mixing, one would get the Central Limit Theorem for the ergodic sums of χΩ\chi_{\Omega}. In this section we discuss Poisson (microscopic) regime, that is, we let Ω=ΩN\Omega=\Omega_{N} shrink so that 𝔼(DN(ΩN,α,x)){\mathbb{E}}(D_{N}(\Omega_{N},\alpha,x)) is constant. In this case, the sum in the discrepancy consists of a large number of terms each one of which vanishes with probability close to 1 so that typically only finitely many terms are non-zero.

Theorem 13.

([88]) Suppose that Ω\Omega is a bounded set whose boundary has zero measure.

If (α,x)(\alpha,x) is uniformly distributed on 𝕋d×𝕋d{\mathbb{T}}^{d}\times{\mathbb{T}}^{d} then both DN(N1/dΩ,α,x)D_{N}(N^{-1/d}\Omega,\alpha,x) and DN(N1/dΩ,α,0)D_{N}(N^{-1/d}\Omega,\alpha,0) converge in distribution.

Note that in this case the result is less sensitive to the shape of Ω\Omega than in the case of sets of unit size.

We will see later (Theorem 16 in Section 5) that one can also handle several sets at the same time.

Corollary 14.

If (α,x)(\alpha,x) is uniformly distributed on 𝕋d×𝕋d{\mathbb{T}}^{d}\times{\mathbb{T}}^{d} then the following random variables have limit distributions

(a) N1/dmin0n<Nd(x+nα,x¯)\displaystyle N^{1/d}\min_{0\leq n<N}d(x+n\alpha,\bar{x}) where x¯\bar{x} is a given point in 𝕋d;\mathbb{T}^{d};

(b) N2/dmin0n<N[A(x+nα)A(x¯)]\displaystyle N^{2/d}\min_{0\leq n<N}[A(x+n\alpha)-A(\bar{x})] where AA is a Morse function with minimum at x¯.\bar{x}.

Proof.

To prove (a) note that N1/dmin0n<Nd(x+nα,x¯)s\displaystyle N^{1/d}\min_{0\leq n<N}d(x+n\alpha,\bar{x})\leq s iff the number of points of the orbit of xx of length NN inside B(x¯,sN1/d)B(\bar{x},sN^{-1/d}) is zero.

To prove (b) note that if AA is a Morse function and x is close to x¯\bar{x} then A(x)A(x¯)+(D2A)(x¯)(xx¯,xx¯).A(x)\approx A(\bar{x})+(D^{2}A)(\bar{x})(x-\bar{x},x-\bar{x}).

There are two natural ways to extend this result.

Question 19.

If S𝕋dS\subset\mathbb{T}^{d} is an analytic submanifold of codimension qq find the limit distribution of N1/qmin0n<Nd(x+nα,S).\displaystyle N^{1/q}\min_{0\leq n<N}d(x+n\alpha,S).

Question 20.

Given a typical analytic function AA find a limit distribution of N1dmin0n<N|A(x+nα)|.\displaystyle N^{\frac{1}{d}}\min_{0\leq n<N}|A(x+n\alpha)|.

As we shall see in Section 5.4 this question is closely related to Question 5.

5. Outlines of proofs

5.1. Poisson processes

In this section we recall some facts about the Poisson processes referring the reader to [93, Section 11] or [65] for more details. The next section contains preliminaries from homogenous dynamics.

Recall that a random variable NN has Poisson distribution with parameter λ\lambda if (N=k)=eλλkk!.{\mathbb{P}}(N=k)=e^{-\lambda}\frac{\lambda_{k}}{k!}. Now an easy combinatorics shows the following facts

(I) If N1,N2NmN_{1},N_{2}\dots N_{m} are independent random variables and NjN_{j} have Poisson distribution with parameters λj\lambda_{j}, then N=j=1mNjN=\sum_{j=1}^{m}N_{j} has Poisson distribution with parameter j=1mλj.\sum_{j=1}^{m}\lambda_{j}.

(II) Conversely, take NN points distributed according to a Poisson distribution with parameter λ\lambda and color each point independently with one of mm colors where color jj is chosen with probability pj.p_{j}. Let NjN_{j} be the number of points of color j.j. Then NjN_{j} are independent and NjN_{j} has Poisson distribution with parameter λj=pjλ.\lambda_{j}=p_{j}\lambda.

Now let (Ω,μ)(\Omega,\mu) be a measure space. By a Poisson process on this space we mean a random point process on XX such that if Ω1,Ω2Ωm\Omega_{1},\Omega_{2}\dots\Omega_{m} are disjoint sets and NjN_{j} is the number of points in Ωj\Omega_{j} then NjN_{j} are independent Poisson random variables with parameters μ(Ωj)\mu(\Omega_{j}) (note that this definition is consistent due to (I)). We will write {xj}𝔓((X,μ))\{x_{j}\}\sim{\mathfrak{P}}((X,\mu)) to indicate that {xj}\{x_{j}\} is a Poisson process with parameters (X,μ).(X,\mu). If (X,μ)=(,cLeb)(X,\mu)=(\mathbb{R},c{\mathrm{Leb}}) we shall say that XX is a Poisson process with intensity c.c. The following properties of the Poisson process are straightforward consequences of (I) and (II) above.

Proposition 1.

(a) If {xj}𝔓(X,μ)\{x_{j}^{\prime}\}\sim{\mathfrak{P}}(X,\mu^{\prime}) and {xj′′}𝔓(X,μ′′)\{x_{j}^{\prime\prime}\}\sim{\mathfrak{P}}(X,\mu^{\prime\prime}) are independent then {xj}{xj′′}𝔓(X,μ+μ′′).\{x_{j}^{\prime}\}\cup\{x_{j}^{\prime\prime}\}\sim{\mathfrak{P}}(X,\mu^{\prime}+\mu^{\prime\prime}).

(b) If {xj}𝔓(X,μ)\{x_{j}\}\sim{\mathfrak{P}}(X,\mu) and f:XYf:X\to Y is a measurable map then {f(xj)}𝔓(Y,f1μ).\{f(x_{j})\}\sim{\mathfrak{P}}(Y,f^{-1}\mu).

(c) Let X=Y×Z,X=Y\times Z, μ=ν×λ\mu=\nu\times\lambda where λ\lambda is a probability measure on Z.Z. Then {(yj,zj)}𝔓(X,μ)\{(y_{j},z_{j})\}\sim{\mathfrak{P}}(X,\mu) iff {yj}𝔓(Y,ν)\{y_{j}\}\sim{\mathfrak{P}}(Y,\nu) and zjz_{j} are random variables independent from {yj}\{y_{j}\} and each other and distributed according to λ.\lambda.

Next recall [40, Chapter XVII] that the Cauchy distribution is unique (up to scaling) symmetric distribution such that if Z,ZZ,Z^{\prime} and Z′′Z^{\prime\prime} are independent random variables with that distribution then Z+Z′′Z^{\prime}+Z^{\prime\prime} has the same distribution as 2Z.2Z. We have the following representation of the Cauchy distribution.

Proposition 2.

(a) If {xj}\{x_{j}\} is a Poisson process with constant intensity then j1xj\sum_{j}\frac{1}{x_{j}} has Cauchy distribution. (the sum is understood in the sense of principle value).

(b) If {xj}\{x_{j}\} is a Poisson distribution with constant intensity and ξj\xi_{j} are random variables with finite expectation independent from {xj}\{x_{j}\} and from each other then

(8) jξjxj\sum_{j}\frac{\xi_{j}}{x_{j}}

has Cauchy distribution.

To see part (a) let {xj},{xj′′}\{x_{j}^{\prime}\},\{x_{j}^{\prime\prime}\} and {xj}\{x_{j}\} are independent Poisson processes with intensity c.c. Then

1xj+1xj′′=y{xj}{xj′′}1y\sum\frac{1}{x_{j}^{\prime}}+\sum\frac{1}{x_{j}^{\prime\prime}}=\sum_{y\in\{x_{j}^{\prime}\}\cup\{x_{j}^{\prime\prime}\}}\frac{1}{y}

and by Proposition 1 (a) and (b) both {xj}{xj′′}\{x_{j}^{\prime}\}\cup\{x_{j}^{\prime\prime}\} and {xj2}\{\frac{x_{j}}{2}\} are Poisson processes with intensity 2c.2c.

To see part (b) note that by Proposition 1(b) and (c), {xjξj}\{\frac{x_{j}}{\xi_{j}}\} is a Poisson process with constant intensity.

5.2. Uniform distribution on the space of lattices

In order to describe ideas of the proofs from Sections 2, 3, and 4 we will first go over some preliminaries. By a random dd-dimensional lattice (centered at 0) we mean a lattice L=QdL=Q\mathbb{Z}^{d} where QQ is distributed according to Haar measure on G=SLd()/SLd().G=SL_{d}(\mathbb{R})/SL_{d}(\mathbb{Z}).

By a random dd-dimensional affine lattice we mean an affine lattice L=Qd+bL=Q\mathbb{Z}^{d}+b where (Q,b)(Q,b) is distributed according to a Haar measure on G¯=(SLd()d)/(SLd()d).\bar{G}=(SL_{d}(\mathbb{R})\ltimes\mathbb{R}^{d})/(SL_{d}(\mathbb{Z})\ltimes\mathbb{Z}^{d}). Here G¯\bar{G} is equipped with the multiplication rule (A,a)(B,b)=(AB,a+Ab)(A,a)(B,b)=(AB,a+Ab).

We denote by gtg_{t} the diagonal action on GG given by

gt=(et/d000et/d000et)g^{t}=\left(\begin{array}[]{llll}e^{t/d}&\dots&0&0\\ &&&\\ 0&\dots&e^{t/d}&0\\ 0&\dots&0&e^{-t}\end{array}\right)

and for αd1\alpha\in\mathbb{R}^{d-1} we denote by Λα\Lambda_{\alpha} the horocyclic action

(9) Λα=(10α101αd1001).\Lambda_{\alpha}=\left(\begin{array}[]{llll}1&\dots&0&\alpha_{1}\\ &&&\\ 0&\dots&1&\alpha_{d-1}\\ 0&\dots&0&1\end{array}\right).

The action of gtg^{t} on the space of affine lattices GG is partially hyperbolic and unstable manifolds are orbits of Λα\Lambda_{\alpha} where αd1\alpha\in\mathbb{R}^{d-1}.

Similarly the action by gt:=(gt,0){g}_{t}:=(g_{t},0), defined on the space of affine lattices is partially hyperbolic and unstable manifolds are orbits of (Λα,x¯)(\Lambda_{\alpha},\bar{x}) where (α,x)(d1)2(\alpha,x)\in(\mathbb{R}^{d-1})^{2} and

x¯=(x1xd10).\bar{x}=\left(\begin{array}[]{l}x_{1}\\ {}\\ x_{d-1}\\ 0\end{array}\right).

For convenience, here and below we will use the notation x¯=(x1,,xd1,0)\bar{x}=(x_{1},\ldots,x_{d-1},0) for the column vector xd1x\in{\mathbb{R}}^{d-1}.

We can also equip SLd()(d)rSL_{d}(\mathbb{R})\ltimes(\mathbb{R}^{d})^{r} with the multiplication rule

(10) (A,a1,,ar)(B,b1,,br)=(AB,a1+Ab1,,ar+Abr),(A,a_{1},\dots,a_{r})(B,b_{1},\dots,b_{r})=(AB,a_{1}+Ab_{1},\dots,a_{r}+Ab_{r}),

and consider the space of periodic configurations in dd-dimensional space G^=SLd()(d)r/SLd()(d)r\hat{G}=SL_{d}(\mathbb{R})\ltimes(\mathbb{R}^{d})^{r}/SL_{d}(\mathbb{Z})\ltimes(\mathbb{Z}^{d})^{r}.

The action of gt:=(gt,0,,0)g_{t}:=(g^{t},0,\ldots,0) on G^\hat{G} is partially hyperbolic and unstable manifolds are orbits of (Λα,x¯1,,x¯r)(\Lambda_{\alpha},\bar{x}_{1},\ldots,\bar{x}_{r}) where α,x¯jd1\alpha,\bar{x}_{j}\in\mathbb{R}^{d-1}.

We will denote these unstable manifolds by n+(α)n_{+}(\alpha) or n+(α,x¯)n_{+}(\alpha,\bar{x}) or n+(α,x¯1,,x¯r)n_{+}(\alpha,\bar{x}_{1},\ldots,\bar{x}_{r}). Note also that n+(α,x¯)n_{+}(\alpha,\bar{x}) or n+(α,x¯1,,x¯r)n_{+}(\alpha,\bar{x}_{1},\ldots,\bar{x}_{r}) for fixed x¯,x¯1,,x¯r\bar{x},\bar{x}_{1},\ldots,\bar{x}_{r} form positive codimension manifolds inside the full unstable leaves of the action of gtg_{t}.

We will often use the uniform distribution of the images of unstable manifolds for partially hyperbolic flows (see e.g. [33]) to assert that gt(Λα)g_{t}(\Lambda_{\alpha}) or gt(Λα,x¯)g_{t}(\Lambda_{\alpha},\bar{x}) or gt(Λα,x¯1,,x¯r)g_{t}(\Lambda_{\alpha},\bar{x}_{1},\ldots,\bar{x}_{r}) becomes uniformly distributed in the corresponding lattice spaces according to their Haar measures as α,x¯,x¯1,,x¯r\alpha,\bar{x},\bar{x}_{1},\ldots,\bar{x}_{r} are independent and distributed according to any absolutely continuous measure on d1\mathbb{R}^{d-1}. In fact, if the original measure has smooth density, then one has exponential estimate for the rate of equidistribution (cf. [66]). The explicit decay estimates play an important role in proving limit theorems by martingale methods. For example, such estimates are helpful in proving Theorem 9 in Section 3 and Theorems 26, 27 in Section 6.

Below we shall also encounter a more delicate situation when all or some of x¯,x¯j\bar{x},\bar{x}_{j} are fixed so we have to deal with positive codimension manifolds inside the full unstable horocycles. In this case one has to use Ratner classification theory for unipotent actions. Examples of unipotent actions are n+()n_{+}(\cdot) or n+(,x¯)n_{+}(\cdot,\bar{x}) or n+(α,)n_{+}(\alpha,\cdot). The computations of the limiting distribution of the translates of unipotent orbits proceeds in two steps (cf. [93]). For several results described in the previous sections we need the limit distribution of gtΛαwg_{t}\Lambda_{\alpha}w inside X=𝐆/𝚪X={\mathbf{G}}/{\boldsymbol{\Gamma}} where XX can be any of the sets G,G¯,G^G,\bar{G},\hat{G} described above and w𝐆.w\in{\mathbf{G}}. In fact, the identity gtΛαw=w(w1gtΛαw)g_{t}\Lambda_{\alpha}w=w(w^{-1}g_{t}\Lambda_{\alpha}w) allows us to assume that w=idw=id at the cost of replacing the action of SLd()SL_{d}(\mathbb{R}) by right multiplication by a twisted action ϕw(M)u=w1Mwu.\phi_{w}(M)u=w^{-1}Mwu. So we are interested in ϕw(gtΛα)Id\phi_{w}(g_{t}\Lambda_{\alpha})\text{Id} for some fixed wXw\in X. The first step in the analysis is to use Ratner Orbit Closure Theorem [112] to find a closed connected subgroup, that depends on ww, 𝐇𝐆{\mathbf{H}}\subset{\mathbf{G}} such that ϕw(SLd())Γ¯=𝐇Γ\overline{\phi_{w}(SL_{d}(\mathbb{R}))\Gamma}={\mathbf{H}}\Gamma and 𝐇Γ{\mathbf{H}}\cap\Gamma is a lattice in 𝐇.{\mathbf{H}}. The second step is to use Ratner Measure Classification Theorem [111] to conclude that the sets in question are uniformly distributed in 𝐇Γ/Γ.{\mathbf{H}}\Gamma/\Gamma. Namely, we have the following result (see [118, Theorem 1.4] or [93, Theorem 18.1]).

Theorem 15.

(a) For any bounded piecewise continuous functions f:Xf:X\to\mathbb{R} and h:d1h:\mathbb{R}^{d-1}\to\mathbb{R} the following holds

(11) limtd1f(φw(gtΛα))h(α)𝑑α=Xf𝑑μ𝐇d1h(α)𝑑α\lim_{t\to\infty}\int_{\mathbb{R}^{d-1}}f\left(\varphi_{w}(g_{t}\Lambda_{\alpha})\right)h(\alpha)d\alpha=\int_{X}fd\mu_{{\mathbf{H}}}\mathbb{\int}_{\mathbb{R}^{d-1}}h(\alpha)d\alpha

where μ𝐇\mu_{\mathbf{H}} denotes the Haar measure on 𝐇Γ/Γ.{\mathbf{H}}\Gamma/\Gamma.

(b) In particular, if ϕ(SLd())Γ\phi(SL_{d}(\mathbb{R}))\Gamma is dense in XX then

limtd1f(φw(gtΛα))h(α)𝑑α=Xf𝑑μ𝐆d1h(α)𝑑α\lim_{t\to\infty}\int_{\mathbb{R}^{d-1}}f\left(\varphi_{w}(g_{t}\Lambda_{\alpha})\right)h(\alpha)d\alpha=\int_{X}fd\mu_{{\mathbf{G}}}\mathbb{\int}_{\mathbb{R}^{d-1}}h(\alpha)d\alpha

where μ𝐆\mu_{\mathbf{G}} denotes the Haar measure on X.X.

To apply this Theorem one needs to compute 𝐇=𝐇(w).{\mathbf{H}}={\mathbf{H}}(w). Here we provide an example of such computation based on [93, Sections 17-19], [89, Theorem 5.7], [95, Sections 2 and 4] and [32, Section 3].

Proposition 3.

Suppose that d=2d=2 and w=Λ(I,x¯1x¯r).w=\Lambda(I,\bar{x}_{1}\dots\bar{x}_{r}).

(a) If the vector (x1,,xr)(x_{1},\dots,x_{r}) is irrational then 𝐇(w)=SL2()(2)r.{\mathbf{H}}(w)=SL_{2}(\mathbb{R})\ltimes(\mathbb{R}^{2})^{r}.

(b) If the vector (x1,,xk)(x_{1},\dots,x_{k}) is irrational and for j>kj>k

xj=qj+i=1kqijxjx_{j}=q_{j}+\sum_{i=1}^{k}q_{ij}x_{j}

where qjq_{j} and qijq_{ij} are rational numbers then 𝐇(w){\mathbf{H}}(w) is isomorphic to SL2()(2)k.SL_{2}(\mathbb{R})\ltimes(\mathbb{R}^{2})^{k}.

Proof.

(a) Denote

(M,0)=(M,(00),,(00)),Ux=(I,x¯)=((1001),x¯1,,x¯r).(M,0)=\left(M,\left(\begin{array}[]{l}0\\ 0\end{array}\right),\ldots,\left(\begin{array}[]{l}0\\ 0\end{array}\right)\right),\quad U_{x}=(I,\bar{x})=\left(\left(\begin{array}[]{ll}1&0\\ 0&1\end{array}\right),\bar{x}_{1},\ldots,\bar{x}_{r}\right).

We need to show that Ux1(M,0)Ux(γ,n)U_{x}^{-1}(M,0)U_{x}(\gamma,n) is dense in SLd()(2)rSL_{d}(\mathbb{R})\ltimes(\mathbb{R}^{2})^{r} as MM, γ,\gamma, n=(n1,,nr)n=(n_{1},\ldots,n_{r}) vary in SL(2,){\rm SL}(2,\mathbb{R}), SL(2,){\rm SL}(2,\mathbb{Z}), and (2)r(\mathbb{Z}^{2})^{r} respectively. It is of course sufficient to prove the density of

(M,0)Ux(γ,n)=(Mγ,Mx¯1+Mn1,,Mx¯r+Mnr)(M,0)U_{x}(\gamma,n)=(M\gamma,M\bar{x}_{1}+Mn_{1},\ldots,M\bar{x}_{r}+Mn_{r})

which in turn follows from the density in (2)r(\mathbb{R}^{2})^{r} of

(γ1(x¯1+n1),,γ1(x¯r+nr)).\left(\gamma^{-1}(\bar{x}_{1}+n_{1}),\ldots,\gamma^{-1}(\bar{x}_{r}+n_{r})\right).

To prove this last claim fix ϵ>0\epsilon>0 and any v(2)r.v\in(\mathbb{R}^{2})^{r}. Let z=(x1,,xr)z=(x_{1},\ldots,x_{r}). Since {1,x1xr}\{1,x_{1}\dots x_{r}\} are linearly independent over \mathbb{Q}, the TzT_{z} orbit of 0 is dense in 𝕋r\mathbb{T}^{r} and hence there exists aa\in\mathbb{N} and a vector m1rm_{1}\in\mathbb{Z}^{r} such that

|axivi,1mi,1|<ϵ.|ax_{i}-v_{i,1}-m_{i,1}|<\epsilon.

Since the TazT_{az} orbit of zz is dense in 𝕋r\mathbb{T}^{r} there exists cc\in\mathbb{N} such that

c1 mod a and |cxivi,2mi,2|<ϵ.c\equiv 1\text{ mod }a\text{ and }|cx_{i}-v_{i,2}-m_{i,2}|<\epsilon.

Since ac=1a\wedge c=1 we can find b,db,d\in\mathbb{Z} such that adbc=1.ad-bc=1. Let γ1=(abcd)\gamma^{-1}=\left(\begin{array}[]{ll}a&b\\ c&d\end{array}\right) and ni=γmin_{i}=-\gamma m_{i} so that |(γ1(x¯i+ni))jvi,j|<ε\left|\left(\gamma^{-1}(\bar{x}_{i}+n_{i})\right)_{j}-v_{i,j}\right|<{\varepsilon} for every i=1,,ri=1,\ldots,r and j=1,2j=1,2. This finishes the proof of density completing the proof of part (a).

(b) Suppose first that qjq_{j} and qijq_{ij} are integers. In this case a direct inspection shows that ϕ(SL2())\phi(SL_{2}(\mathbb{R})) is contained in the orbit of H=SL2()×VH=SL_{2}(\mathbb{R})\times V where

V={(z1,z2,,zr):zj=qj+iqijzi}V=\{(z_{1},z_{2},\dots,z_{r}):z_{j}=q_{j}+\sum_{i}q_{ij}z_{i}\}

and that the orbit of HH is closed. Hence 𝐇(w)H.{\mathbf{H}}(w)\subset H. To prove the opposite inclusion it suffices to show that dim(H)=dim(𝐇).\dim(H)=\dim({\mathbf{H}}). To this end we note that since the action of SL2()SL_{2}(\mathbb{R}) is a skew product, 𝐇{\mathbf{H}} projects to SL2().SL_{2}(\mathbb{R}). On the other hand the argument of part (a) shows that the closure of ϕ(SL2())\phi(SL_{2}(\mathbb{R})) contains the elements of the form (Id,v)(Id,v) with vV.v\in V. This proves the result in case qiq_{i} and qijq_{ij} are integers.

In the general case where qj=pjQq_{j}=\frac{p_{j}}{Q} and qij=pijQq_{ij}=\frac{p_{ij}}{Q} where Q,pjQ,p_{j} and pijp_{ij} are integers, the foregoing argument shows that

ϕw(SL2())(SL2()(/Q)2r)¯=H(SL2()(/Q)2r).\overline{\phi_{w}(SL_{2}(\mathbb{R}))(SL_{2}(\mathbb{Z})\ltimes(\mathbb{Z}/Q)^{2r})}=H(SL_{2}(\mathbb{Z})\ltimes(\mathbb{Z}/Q)^{2r}).

Accordingly, the orbit of IdId is contained in a finite union of HH-orbits and intersects one of these orbits by the set of measure at least (1/Q)r.(1/Q)^{r}. Again the dimensional considerations imply that 𝐇(w)=H.{\mathbf{H}}(w)=H.

We are now ready to explain some of the ideas behind the proofs of the Theorems of Sections 2, 3 and 4 following [28, 29]. Applications to similar techniques to the related problems could be found for example in [12, 32, 33, 66, 67, 91, 92]. We shall see later that the same approach can be used to prove several other limit theorems.

5.3. The Poisson regime

Theorem 13 is a consequence of the following more general result.

Theorem 16.

([88, 94]) (a) If (α,x)(\alpha,x) is uniformly distributed on 𝕋d×𝕋d{\mathbb{T}}^{d}\times{\mathbb{T}}^{d} then N1/d{x+nα}N^{1/d}\{x+n\alpha\} converges in distribution to

{Xd such that for some Y[0,1] the point (X,Y)L}\{X\in\mathbb{R}^{d}\text{ such that for some }Y\in[0,1]\text{ the point }(X,Y)\in L\}

where Ld+1L\in{\mathbb{R}}^{d+1} is a random affine lattice.

(b) The same result holds if xx is a fixed irrational vector and α\alpha is uniformly distributed on 𝕋d.{\mathbb{T}}^{d}.

(c) If α\alpha is uniformly distributed on 𝕋d{\mathbb{T}}^{d} then N1/d{nα}N^{1/d}\{n\alpha\} converges in distribution to

{Xd such that for some Y[0,1] the point (X,Y)L}\{X\in\mathbb{R}^{d}\text{ such that for some }Y\in[0,1]\text{ the point }(X,Y)\in L\}

where Ld+1L\in{\mathbb{R}}^{d+1} is a random lattice centered at 0.

Here the convergence in, say part (a), means the following. Take a collection of sets Ω1,Ω2Ωrd\Omega_{1},\Omega_{2}\dots\Omega_{r}\subset\mathbb{R}^{d} whose boundary has zero measure and let 𝒩j(α,x,N)=Card(0nN1:N1/d{x+nα}Ωj.{\mathcal{N}}_{j}(\alpha,x,N)={\mathrm{Card}}(0\leq n\leq N-1:N^{1/d}\{x+n\alpha\}\in\Omega_{j}. Then for each l1lml_{1}\dots l_{m}

(12) limN(𝒩1(α,x,N)=l1,,𝒩r(α,x,N)=lr)=\lim_{N\to\infty}{\mathbb{P}}({\mathcal{N}}_{1}(\alpha,x,N)=l_{1},\dots,{\mathcal{N}}_{r}(\alpha,x,N)=l_{r})=
μG(L:Card(L(Ω1×[0,1]))=l1,,Card(L(Ωr×[0,1]))=lr)\mu_{G}(L:{\mathrm{Card}}(L\cap(\Omega_{1}\times[0,1]))=l_{1},\dots,{\mathrm{Card}}(L\cap(\Omega_{r}\times[0,1]))=l_{r})

where μG\mu_{G} is the Haar measure on G=SLd()G=SL_{d}(\mathbb{R}), or G=SLd()dG=SL_{d}(\mathbb{R})\ltimes\mathbb{R}^{d} respectively.

The sets appearing in Theorem 16 are called cut-and-project sets. We refer the reader to Section 9.2 of the present paper as well as to [93, Section 16] for more discussion of these objects.

Remark.

Random (quasi)-lattices provide important examples of random point processes in the Euclidean space having a large symmetry group. This high symmetry explains why they appear as limit processes in several limit theorems (see discussion in [93, Section 20]). Another point process with large symmetry group is a Poisson process discussed in Section 5.1. Poisson processes will appear in Theorem 19 below.

A variation on Theorem 16 is the following.

Let G=SL2()(2)rG=SL_{2}(\mathbb{R})\ltimes(\mathbb{R}^{2})^{r} equipped with the multiplication rule defined in Section 5.2 and consider X=G/ΓX=G/\Gamma, Γ=SL2()(2)r\Gamma=SL_{2}(\mathbb{Z})\ltimes(\mathbb{Z}^{2})^{r}.

Theorem 17.

Assume (α,z1zr)[0,1]r+1(\alpha,z_{1}\dots z_{r})\in[0,1]^{r+1}. For any collection of sets Ωi,j\Omega_{i,j}\subset\mathbb{R}, i=1,,ri=1,\ldots,r and j=1,,Mj=1,\ldots,M whose boundary has zero measure let

𝒩i,j(α,N)=Card(0nN1:N{zi+nα}Ωi,j).{\mathcal{N}}_{i,j}(\alpha,N)={\mathrm{Card}}\left(0\leq n\leq N-1:N\{z_{i}+n\alpha\}\in\Omega_{i,j}\right).

(a) If (α,z1zr)(\alpha,z_{1}\dots z_{r}) are uniformly distributed on [0,1]r+1[0,1]^{r+1} or if α\alpha is uniformly distributed on [0,1][0,1] and (z1,,zr)(z_{1},\ldots,z_{r}) is a fixed irrational vector then for each li,jl_{i,j}

(13) limN(𝒩i,j(α,N)=li,j,i,j)=μG((L,a1,,ar)G:Card(L+ai(Ωi,j×[0,1]))=li,j,i,j)\lim_{N\to\infty}{\mathbb{P}}\left({\mathcal{N}}_{i,j}(\alpha,N)=l_{i,j},\forall i,j\right)\\ =\mu_{G}\left((L,a_{1},\ldots,a_{r})\in G:{\mathrm{Card}}(L+a_{i}\cap(\Omega_{i,j}\times[0,1]))=l_{i,j},\forall i,j\right)

where we use the notation {\mathbb{P}} for Haar measure on 𝕋\mathbb{T} (in the case of fixed vector (z1,,zr)(z_{1},\ldots,z_{r})) as well as on 𝕋r+1\mathbb{T}^{r+1} (in the case of random vector (z1,,zr)(z_{1},\ldots,z_{r})), and μG\mu_{G} is the Haar measure on GG.

(b) For arbitrary (z1,,zr)(z_{1},\dots,z_{r}) there is a subgroup 𝐇G{\mathbf{H}}\subset G such that for each li,jl_{i,j}

(14) limN(𝒩i,j(α,N)=li,j,i,j)=μ𝐇((L,a1,,ar)G:Card(L+ai(Ωi,j×[0,1]))=li,j,i,j)\lim_{N\to\infty}{\mathbb{P}}\left({\mathcal{N}}_{i,j}(\alpha,N)=l_{i,j},\forall i,j\right)\\ =\mu_{\mathbf{H}}\left((L,a_{1},\ldots,a_{r})\in G:{\mathrm{Card}}(L+a_{i}\cap(\Omega_{i,j}\times[0,1]))=l_{i,j},\forall i,j\right)

where μ𝐇\mu_{\mathbf{H}} denotes the Haar measure on the orbit of 𝐇.{\mathbf{H}}.

Proof of Theorem 16.

(a) We provide a sketch of proof referring the reader to [93, Section 13] for more details.

Fix a collection of sets Ω1,Ω2Ωrd\Omega_{1},\Omega_{2}\dots\Omega_{r}\subset\mathbb{R}^{d} and l1,,lrl_{1},\ldots,l_{r}\in\mathbb{N}. We want to prove (12).

Consider the following functions on the space of affine lattices G¯=(SLd+1()d+1)/(SLd+1()d+1)\bar{G}=(SL_{d+1}(\mathbb{R})\ltimes\mathbb{R}^{d+1})/(SL_{d+1}(\mathbb{Z})\ltimes\mathbb{Z}^{d+1})

fj(L)=eLχΩj×[0,1](e)f_{j}(L)=\sum_{e\in L}\chi_{\Omega_{j}\times[0,1]}(e)

and let 𝒜={L:fj(L)=lj}.{\mathcal{A}}=\{L:f_{j}(L)=l_{j}\}. By definition, the right hand side of (12) is μG¯(L:L𝒜)\mu_{\bar{G}}(L:L\in{\mathcal{A}}).

On the other hand, the Dani correspondence principle states that

(15) 𝒩1(α,x,N)=l1,,𝒩r(α,x,N)=lr iff glnN(Λαd+1+x¯)𝒜{\mathcal{N}}_{1}(\alpha,x,N)=l_{1},\dots,{\mathcal{N}}_{r}(\alpha,x,N)=l_{r}\text{ iff }g^{\ln N}(\Lambda_{\alpha}\mathbb{Z}^{d+1}+\bar{x})\in{\mathcal{A}}

where gtg_{t} is the diagonal action (et/d,,et/d,et)(e^{t/d},\ldots,e^{t/d},e^{-t}) and x¯=(x1,,xd,0)\bar{x}=(x_{1},\ldots,x_{d},0) are defined as in Section 5.2. To see this, fix jj and suppose that {x+nα}N1/dΩj\{x+n\alpha\}\in N^{-1/d}\Omega_{j} for some n[0,N]n\in[0,N]. Then

{x+nα}=(x1+nα1+m1(n),,xd+nαd+md(n))\{x+n\alpha\}=(x_{1}+n\alpha_{1}+m_{1}(n),\dots,x_{d}+n\alpha_{d}+m_{d}(n))

with mi(n)m_{i}(n) uniquely defined so that xi+nαi+mi(1/2,1/2]x_{i}+n\alpha_{i}+m_{i}\in(-1/2,1/2], and the vector v=(m1(n),,md(n),n)v=(m_{1}(n),\ldots,m_{d}(n),n) is such that

χΩj×[0,1](glnN(Λαd+1+x¯)v)=1.\chi_{\Omega_{j}\times[0,1]}\left(g^{\ln N}(\Lambda_{\alpha}\mathbb{Z}^{d+1}+\bar{x})v\right)=1.

The converse is similarly true, namely that any vector that counts in the right hand side of (15) corresponds uniquely to an nn that counts in the left hand side visits.

Now (15), and thus (12), follow if we prove that

(16) limN((α,x)𝕋d×𝕋d:glnN(Λαd+1+x¯)𝒜)=μG(LG¯:L𝒜)\lim_{N\to\infty}{\mathbb{P}}\left((\alpha,x)\in\mathbb{T}^{d}\times\mathbb{T}^{d}:g^{\ln N}(\Lambda_{\alpha}\mathbb{Z}^{d+1}+\bar{x})\in{\mathcal{A}}\right)=\\ \mu_{G}(L\in\bar{G}:L\in{\mathcal{A}})

Finally, (16) holds due to the uniform distribution of the images of unstable manifolds n+(α,x)n_{+}(\alpha,x) for partially hyperbolic flows.

(b) Following the same arguments as above, we see that in order to prove Theorem 16(b) we need to show that (glnN,0).(Λα,x¯)(g^{\ln N},0).(\Lambda_{\alpha},\bar{x}) becomes equidisitributed with respect to Haar measure on

G=(SLd+1()d+1)/(SLd+1()d+1)G=(SL_{d+1}(\mathbb{R})\ltimes\mathbb{R}^{d+1})/(SL_{d+1}(\mathbb{Z})\ltimes\mathbb{Z}^{d+1})

if α\alpha is random and xx is a fixed irrational vector. This can be derived from Theorem 15(b) using a generalization of Proposition 3(a).

The argument for part (c) is the same as in part (a) but we use the space of lattices rather than the space of affine lattices. ∎

Proof of Theorem 17.

As in the proof of Theorem 16 we use Dani’s correspondence principle to identify the left hand side in (13) with

(Card((glnNΛα+glnNz¯i)(Ωi,j×[0,1]))=li,j,i,j){\mathbb{P}}\left({\mathrm{Card}}((g_{\ln N}\Lambda_{\alpha}+g_{\ln N}\bar{z}_{i})\cap(\Omega_{i,j}\times[0,1]))=l_{i,j},\forall i,j\right)

where z¯i=(zi0)\bar{z}_{i}=\left(\begin{array}[]{l}z_{i}\\ 0\end{array}\right).

Now (13) follows if we have that (glnN,0).(Λα,z¯1,,z¯r))(g_{\ln N},0).(\Lambda_{\alpha},\bar{z}_{1},\ldots,\bar{z}_{r})) is distributed according to the Haar measure in X=G/ΓX=G/\Gamma with G=SL2()(2)rG=SL_{2}(\mathbb{R})\ltimes(\mathbb{R}^{2})^{r} and Γ=SL2()(2)r\Gamma=SL_{2}(\mathbb{Z})\ltimes(\mathbb{Z}^{2})^{r}. But this last statement follows from Theorem 15(b) and Proposition 3(a).

Likewise (14) follows from Theorem 15(a). ∎

5.4. Application of the Poisson regime theorems to the ergodic sums of smooth functions with singularities

The microscopic or Poisson regime theorems are useful to treat the ergodic sums of smooth functions with singularities since the main contribution to these sums come from the visits to small neighborhoods of the singularity.

Proof of Theorem 3.

Due to Corollary 1 we may assume that A~=0.\tilde{A}=0.

Let RR be a large number and denote by SNS_{N}^{\prime} the sum of terms with |x+nαx0|>R/N|x+n\alpha-x_{0}|>R/N and by SN′′S_{N}^{\prime\prime} the sum of terms with |x+nαx0|<R/N.|x+n\alpha-x_{0}|<R/N. Then

𝔼(|SN/Na|)CNa𝔼(|ξ|aχ|ξ|>R/N)=O(R1a).{\mathbb{E}}(|S_{N}^{\prime}/N^{a}|)\leq\frac{C}{N^{a}}{\mathbb{E}}(|\xi|^{-a}\chi_{|\xi|>R/N})=O(R^{1-a}).

On the other hand, by Theorem 16(a) if (α,x)(\alpha,x) is random SN′′Na\dfrac{S_{N}^{\prime\prime}}{N^{a}} converges as NN\to\infty to

(X,Y)L:Y[0,1],|X|<RcχX<0+c+χX>0|X|a,\sum_{(X,Y)\in L:Y\in[0,1],|X|<R}\frac{c_{-}\chi_{X<0}+c_{+}\chi_{X>0}}{|X|^{a}},

where LL is a random affine lattice in 2\mathbb{R}^{2}. Letting RR\to\infty we get

SNNa(X,Y)L:Y[0,1]cχX<0+c+χX>0|X|a.\frac{S_{N}}{N^{a}}\Rightarrow\sum_{(X,Y)\in L:Y\in[0,1]}\frac{c_{-}\chi_{X<0}+c_{+}\chi_{X>0}}{|X|^{a}}.

The case of fixed irrational xx is dealt with similarly using Theorem 16(b). ∎

Sketch of proof of Theorem 5.

Consider first the case when the highest pole has order m>1.m>1. Then the argument given in the proof of Theorem 3 shows that for large R,R, AN/NmA_{N}/N^{m} can be well approximated by

(17) ANNmj=1r|x+nαxj|<R/Ncj(N(x+nαxj))m\frac{A_{N}}{N^{m}}\sim\sum_{j=1}^{r}\sum_{|x+n\alpha-x_{j}|<R/N}\frac{c_{j}}{(N(x+n\alpha-x_{j}))^{m}}

where x1xrx_{1}\dots x_{r} are all poles of order mm and c1crc_{1}\dots c_{r} are the corresponding Laurent coefficients.

We use Theorem 17 to analyze this sum. Namely, let G=SL2()(2)rG=SL_{2}(\mathbb{R})\ltimes(\mathbb{R}^{2})^{r} with the multiplication rule defined in Section 5.2 and consider X=G/ΓX=G/\Gamma, Γ=SL2()(2)r\Gamma=SL_{2}(\mathbb{Z})\ltimes(\mathbb{Z}^{2})^{r}. Consider the functions Φ:G/Γ\Phi:G/\Gamma\to\mathbb{R} given by

(18) ΦR(A,a1,ar)=j=1reA2+ajcjx(e)mχ[R,R]×[0,1](e)\Phi_{R}(A,a_{1},\dots a_{r})=\sum_{j=1}^{r}\sum_{e\in A\mathbb{Z}^{2}+a_{j}}\frac{c_{j}}{x(e)^{m}}\chi_{[-R,R]\times[0,1]}(e)

which as RR\to\infty will be distributed as

(19) Φ(A,a1ar)=j=1reA2+ajcjx(e)mχ[0,1](y(e)).\Phi(A,a_{1}\dots a_{r})=\sum_{j=1}^{r}\sum_{e\in A\mathbb{Z}^{2}+a_{j}}\frac{c_{j}}{x(e)^{m}}\chi_{[0,1]}(y(e)).

Now Theorem 5 follows from Theorem 17. Namely part (a) of Theorem 5 follows from Theorem 17(a). To get part (c) we let zj=xxjz_{j}=x-x_{j}, we observe that for almost every xx the vector (z1,,zr)(z_{1},\ldots,z_{r}) is irrational. Hence Theorem 17 applies and gives us that (17) when α𝕋\alpha\in\mathbb{T} is random has the same distribution as (18) as (A,a1ar)(A,a_{1}\dots a_{r}) is random in GG. Finally Theorem 5(b) follows from Theorem 17 (b).

The proof in case when all poles are simple is the same except that the proof that AN/NA_{N}/N is well approximated by (17) is more involved since one cannot use L1L^{1} bounds. We refer to [120] for more details. ∎

5.5. Limit laws for discrepancies.

The proofs of Theorems 1012 use a similar strategy as Theorems 3 and 5 of first localizing the important terms and then reducing their contribution to lattice counting problems. However the analysis in that case is more complicated, in particular, because the argument is carried over in the set of frequencies of the Fourier series of the discrepancy rather than in the phase space.

Let us describe the main steps in the proof of Theorem 10. Consider first the case where Ω\Omega is centrally symmetric. We start with Fourier series of the discrepancy

DN(Ω,α,x)=r(d1)/2kd0ck(r)cos(2π(k,x)+π(N1)(k,α))sin(πN(k,α))sin(π(k,α))D_{N}(\Omega,\alpha,x)=\\ r^{(d-1)/2}\sum_{k\in\mathbb{Z}^{d}-0}c_{k}(r)\frac{\cos(2\pi(k,x)+\pi(N-1)(k,\alpha))\sin(\pi N(k,\alpha))}{\sin(\pi(k,\alpha))}

where r(d1)/2ck(r)r^{(d-1)/2}c_{k}(r) are Fourier coefficients of χrΩ\chi_{r\Omega} which have the following asymptotics for large kk (see [50])

ck(r)1π|k|(d+1)/2K1/2(k/|k|)sin(2π(rP(k)d18)).c_{k}(r)\approx\frac{1}{\pi|k|^{(d+1)/2}}K^{-1/2}(k/|k|)\sin\left(2\pi\left(rP(k)-\frac{d-1}{8}\right)\right).

Here K(ξ)K(\xi) is the Gaussian curvature of Ω\partial\Omega at the point where the normal to Ω\partial\Omega is equal to ξ\xi and P(t)=supxΩ(x,t).P(t)=\sup_{x\in\Omega}(x,t).

The proof consists of three steps. First, using elementary manipulations with Fourier series one shows that the main contribution to the discrepancy comes from kk satisfying

(20) εN1/d<|k|<ε1N1/d{\varepsilon}N^{1/d}<|k|<{\varepsilon}^{-1}N^{1/d}
(21) k(d+1)/2|{(k,α)}|<1εN(d1)/2d.k^{(d+1)/2}|\{(k,\alpha)\}|<\frac{1}{{\varepsilon}N^{(d-1)/2d}}.

To understand the above conditions note that (20) and (21) imply that {(k,α)}\{(k,\alpha)\} is of order 1/N1/N so the sum n=0N1ei2π(k,(x+nα))\sum_{n=0}^{N-1}e^{i2\pi(k,(x+n\alpha))} is of order N,N, that is, it is as large as possible. Next the number of terms with |k|N1/d|k|\ll N^{1/d} is too small (much smaller than NN) so for typical α\alpha, we have that |{k,α)}|1/N|\{k,\alpha)\}|\gg 1/N for such kk ensuring the cancelations in ergodic sums. For the higher modes kN1/dk\gg N^{1/d}, using L2L^{2} norms and the decay of ckc_{k} one sees that their contribution is negligible

The second step consists in showing using the same argument as in the proof of Theorem 16 that if (α,x)(\alpha,x) is uniformly distributed on 𝕋d×𝕋d\mathbb{T}^{d}\times\mathbb{T}^{d} then the distribution of

(kN1/d,(k,α)|k|(d+1)/2N(d1)/2d)\left(\frac{k}{N^{1/d}},(k,\alpha)|k|^{(d+1)/2}N^{(d-1)/2d}\right)

converges as NN\to\infty to the distribution of

(X(e),Z(e)|X(e)|(d+1)/2)eL(X(e),Z(e)|X(e)|^{(d+1)/2})_{e\in L}

where LL is a random lattice in d+1\mathbb{R}^{d+1} centered at 0.0.

Finally the last step in the proof of Theorem 10 is to show that if we take prime kk satisfying (20) and (21) then the phases (k,x),(k,x), N(k,α)N(k,\alpha) and rP(k)rP(k) are asymptotically independent of each other and of the numerators. For (k,x)(k,x) and rP(k)rP(k) the independence comes from the fact that (20) and (21) do not involve xx or rr, while N(k,α)N(k,\alpha) has wide oscillation due to the large prefactor N.N.

The argument for non-symmetric bodies is similar except that the asymptotics of their Fourier coefficients is slightly more complicated.

The foregoing discussion explains the form of the limit distribution which we now present. Let 2,d{\mathcal{M}}_{2,d} be the space of quadruples (L,θ,b,b)(L,\theta,b,b^{\prime}) where LXL\in X, the space of lattices in d+1\mathbb{R}^{d+1}, and (θ,b,b)(\theta,b,b^{\prime}) are elements of 𝕋X\mathbb{T}^{X} satisfying the conditions

θe1+e2=θe1+θe2,bme=mbe and bme=mbe.\theta_{e_{1}+e_{2}}=\theta_{e_{1}}+\theta_{e_{2}},\quad b_{me}=mb_{e}\text{ and }b^{\prime}_{me}=mb^{\prime}_{e}.

Let d{\mathcal{M}}_{d} be the subset of M2,dM_{2,d} defined by the condition b=b.b=b^{\prime}. Consider the following function on 2,d{\mathcal{M}}_{2,d}

(22) Ω(L,θ,b,b)=1π2eLκ(e,θ,b,b)sin(πZ(e))|X(e)|d+12Z(e){\mathcal{L}}_{\Omega}(L,\theta,b,b^{\prime})=\frac{1}{\pi^{2}}\sum_{e\in L}\kappa(e,\theta,b,b^{\prime})\frac{\sin(\pi Z(e))}{|X(e)|^{\frac{d+1}{2}}Z(e)}

with

(23) κ(e,θ,b,b)=K12(X(e)/|X(e)|)sin(2π(be+θe(d1)/8))+K12(X(e)/|X(e)|)sin(2π(beθe(d1)/8)).\kappa(e,\theta,b,b^{\prime})=K^{-\frac{1}{2}}(X(e)/|X(e)|)\sin(2\pi(b_{e}+\theta_{e}-(d-1)/8))\\ +K^{-\frac{1}{2}}(-X(e)/|X(e)|)\sin(2\pi(b^{\prime}_{e}-\theta_{e}-(d-1)/8)).

It is shown in [28] that this sum converges almost everywhere on 2,d{\mathcal{M}}_{2,d} and d.{\mathcal{M}}_{d}. Now the limit distribution in Theorem 10 can be described as follows

Theorem 18.

(a) If Ω\Omega is symmetric then DN(Ω,r,α,x)N(d1)/2dr(d1)/2\dfrac{D_{N}(\Omega,r,\alpha,x)}{N^{(d-1)/2d}r^{(d-1)/2}} converges to Ω(L,θ,b,b){\mathcal{L}}_{\Omega}(L,\theta,b,b^{\prime}) where (L,θ,b,b)(L,\theta,b,b^{\prime}) is uniformly distributed on d.{\mathcal{M}}_{d}.

(b) If Ω\Omega is non-symmetric then DN(Ω,r,α,x)N(d1)/2dr(d1)/2\dfrac{D_{N}(\Omega,r,\alpha,x)}{N^{(d-1)/2d}r^{(d-1)/2}} converges to Ω(L,θ,b,b){\mathcal{L}}_{\Omega}(L,\theta,b,b^{\prime}) where (L,θ,b,b)(L,\theta,b,b^{\prime}) is uniformly distributed on 2,d.{\mathcal{M}}_{2,d}.

Question 21.

Study the properties of the limiting distribution in Theorem 18, in particular its tail behavior.

The next question is inspired by Theorem 49 from Section 9.

Question 22.

Consider the case where Ω\Omega is the standard ball. Thus in (23) K1.K\equiv 1. Study the limit distribution of {\mathcal{L}} when the dimension of the torus d.d\to\infty.

Next we describe the idea of the proof of Theorem 12. Note that (8) looks similar to (22). The main ingredient in the proof of Theorem 12 involves a result on the distribution of small divisors of multiplicative form Π|ki|(k,α)\Pi|k_{i}|\|(k,\alpha)\|. Namely, a harmonic analysis of the discrepancy’s Fourier series related to boxes allows to bound the frequencies that have essential contributions to the discrepancy and show that they must be resonant with α.\alpha. The main step is then to establish a Poisson limit theorem for the distribution of small denominators and the corresponding numerators. With the notation introduced before the statement of Theorem 12 let k¯i=ai,1k1++ai,dkd.\bar{k}_{i}=a_{i,1}k_{1}+\dots+a_{i,d}k_{d}. Then we have

Theorem 19.

([29]) Let ξX\xi\in X be distributed according to the normalized Lebesgue measure λ.\lambda. Then as NN\to\infty the point process

{((lnN)dΠik¯i(k,α),N(k,α)mod(2),{k¯1u1},{k¯dud},)}kZ(ξ,N)\left\{\left((\ln N)^{d}\Pi_{i}\bar{k}_{i}\|(k,\alpha)\|,N(k,\alpha){\rm mod}(2),\{\bar{k}_{1}u_{1}\},\dots\{\bar{k}_{d}u_{d}\},\right)\right\}_{k\in Z(\xi,N)}

where

Z(ξ,N)={kd:|k¯i|1,|Πik¯i|<N,k¯1>0,|Πik¯i|(k,α)1ε(lnN)d,msuchthatk1kdm=1and(k,α)=(k,α)+m}Z(\xi,N)=\left\{k\in\mathbb{Z}^{d}:|\bar{k}_{i}|\geq 1,|\Pi_{i}\bar{k}_{i}|<N,\bar{k}_{1}>0,\right.\\ \left.|\Pi_{i}\bar{k}_{i}|\|(k,\alpha)\|\leq\frac{1}{{\varepsilon}(\ln N)^{d}},\right.\\ \left.\exists m\in\mathbb{Z}{\rm\ such\ that\ }k_{1}\wedge\ldots\wedge k_{d}\wedge m=1{\ and\ }\|(k,\alpha)\|=(k,\alpha)+m\right\}

converges to a Poisson process on ×/(2)×(/)d\mathbb{R}\times\mathbb{R}/(2\mathbb{Z})\times(\mathbb{R}/\mathbb{Z})^{d} with intensity 2d1𝐜1/d!.2^{d-1}{\mathbf{c}}_{1}/d!.

Comparing this result with the proof of Theorem 10 discussed above we see that Theorem 19 comprises analogies of both step 2 and 3 in the former proof. Namely, it shows both that the small denominators contributing most to the discrepancy have asymptotically Poisson distribution and that the numerators are asymptotically independent of the denominators (cf. Proposition 1(c)).

We note that Theorem 19 is interesting in its own right since it describes the number of solutions to Diophantine inequalities

Πi|k¯i|(k,α)<clndN,|k¯i|>1,i|k¯i|<N.\Pi_{i}|\bar{k}_{i}|\|(k,\alpha)\|<\frac{c}{\ln^{d}N},\quad|\bar{k}_{i}|>1,\quad\prod_{i}|\bar{k}_{i}|<N.
Question 23.

What happens if in Theorem 19 lndN\ln^{d}N is replaced by lnaN\ln^{a}N with a(0,d)a\in(0,d)?

Question 24.

Is Theorem 19 still valid if the distribution of ξ\xi is concentrated on a submanifold of X?X? For example, one can take α=(s,s2).\alpha=(s,s^{2}).

A special case of Question 24 is when the matrix (ai,j)(a_{i,j}) is fixed equal to Identity. This case is directly related to Question 16(a).

The proof of Theorem 19 proceeds by martingale approach (see [26, 27]) which requires good mixing properties in the future conditioned to the past. In the present setting, to apply this method it suffices to prove that most orbits of certain unipotent subgroups are equidisitributed at a polynomial rate. Under the conditions of Theorem 19 one can assume (after an easy reduction) that the initial point has smooth density with respect to Haar measure. Then the required equidistribution follows easily form polynomial mixing of the unipotent flows. In the setting of Question 24 (as well as Question 53 in Section 9) the initial point is chosen from a positive codimension submanifold so one cannot use the mixing argument. The problem of estimating the rate of equidistribution for unipotent orbits starting from submanifolds interpolates between the problem of taking a random initial condition with smooth density which is solved and the problem of taking fixed initial condition which seems very hard.

6. Shrinking targets

Another classical result in probability theory is the Borel-Cantelli Lemma which says that if AjA_{j} are independent sets and j(Aj)=\sum_{j}{\mathbb{P}}(A_{j})=\infty then {\mathbb{P}}-almost every point belongs to infinitely many sets. A yet stronger conclusion is given by the strong Borel-Cantell Lemma claiming that the number of AjA_{j} which happen up to time NN is asymptotic to j=1N(Aj).\sum_{j=1}^{N}{\mathbb{P}}(A_{j}). In the context of ergodic dynamical systems (T,X,μ)(T,X,\mu), the law of large numbers is reflected in the Birkhoff theorem of almost sure converge in average of the ergodic means associated to a measurable observable, for example the characteristic function of a measurable set AXA\subset X. In a similar fashion one can study the so called dynamical Borel-Cantelli properties of the system (X,T,μ)(X,T,\mu) by considering instead of a fixed stet AA a sequence of ”target” sets AjXA_{j}\in X such that μ(Aj)=\sum\mu(A_{j})=\infty. We then say that the dynamical Borel-Cantelli property is satisfied by {Aj}\{A_{j}\} if for almost every xx, Tj(x)T^{j}(x) belongs to AjA_{j} for infinitely many jj.

In the context of a dynamical system (T,X,μ)(T,X,\mu) on a metric space XX it is natural to assume that the sets in question have nice geometric structure, since it is always possible for any dynamical system (with a non atomic invariant measure) to construct sets with divergent sum of measures that are missed after a certain iterate by the orbits of almost every point [21, Proposition 1.6]. The simplest assumption is that the sets be balls. The dynamical Borel-Cantelli property for balls is a common feature for deterministic systems displaying hyperbolicity features (see [51, 108, 27] and references therein).

Due to strong correlations among iterates of a toral translation the dynamical Borel-Cantelli properties are more delicate in the quasi-periodic context.

6.1. Dynamical Borel-Cantelli lemmas for translations.

For toral translations one needs also to assume that the sets are nested since otherwise one can take Aj(A0+jα)A_{j}\subset(A_{0}+j\alpha) for some fixed set A0A_{0} ensuring that the points from the compliment of A0A_{0} do not visit any AjA_{j} at time jj. This motivates the following definition (see [51, 21, 38]).

Given T:(X,μ)(X,μ)T:(X,\mu)\to(X,\mu) let VN(x,y)=n=1NχB(y,rn)(Tnx).V_{N}(x,y)=\sum_{n=1}^{N}\chi_{B(y,r_{n})}(T^{n}x). We say that TT has the shrinking target property (STP) if for any y,y, {rn}\{r_{n}\} such that nμ(B(y,rn))=\sum_{n}\mu(B(y,r_{n}))=\infty, it holds that VN(x,y)V_{N}(x,y)\to\infty for almost all x,x, i.e. the targets sequence (B(y,rn))(B(y,r_{n})) satisfies the Borel-Cantelli property for TT. We say that TT has the monotone shrinking target property (MSTP) if for any y,{rn}y,\{r_{n}\} such that nμ(B(rn))=\sum_{n}\mu(B(r_{n}))=\infty and rnr_{n} is non-increasing VN(x,y)V_{N}(x,y)\to\infty for almost all x.x.

In the case of translations, we can always assume without loss of generality that y=0y=0 (replace xx by xyx-y). We then use the notation VN(x)V_{N}(x) for VN(x,y)V_{N}(x,y). We also use the notation B(r)B(r) for the ball B(0,r)B(0,r). Another interesting choice is to take y=xy=x in which case we study the rate of return rather than the rate of approach to 0.0. Note that if VN(x,x)V_{N}(x,x) does not depend on xx and so the number of close returns depends only on α.\alpha. We shall write UN(α)=n=0N1χB(rn)(Tn0).U_{N}(\alpha)=\sum_{n=0}^{N-1}\chi_{B(r_{n})}(T^{n}0).

The following is a straightforward consequence of the fact that toral translations are isometries.

Theorem 20.

([38]) Toral translations do not have STP.

It turns out that the following Diophantine condition is relevant to this problem. Let

(24) 𝒟(σ)={α:k0,maxi[1,d]kαiC|k|(1+σ)/d}.{\mathcal{D}}^{*}(\sigma)=\{\alpha:\forall k\in{\mathbb{Z}}-0,\max_{i\in[1,d]}\|k\alpha_{i}\|\geq C|k|^{-(1+\sigma)/d}\}.
Theorem 21.

([80]) A toral translation TαT_{\alpha} has the MSTP iff α𝒟(0).\alpha\in{\mathcal{D}}^{*}(0).

A simple proof of Theorem 21 can be found in [38]. Recall that 𝒟(0){\mathcal{D}}^{*}(0) has zero Lebesgue measure. Hence, the latter result shows that one has to further restrict the targets if one wants that typical translations display the dynamical Borel-Cantelli property relative to these targets.

One possible restriction on the targets is to impose a certain growth rate on the sum of their volumes. This actually allows to further distinguish among distinct Diophantine classes as it is shown in the following result. We say that TT has s-(M)STP if for any {rn}\{r_{n}\} such that nrnds=\sum_{n}r_{n}^{ds}=\infty (and rnr_{n} is non-increasing) VN(x)V_{N}(x)\to\infty for almost all x.x. We then have the following.

Theorem 22.

([124])

a) If α𝒟(sdd)\alpha\not\in{\mathcal{D}}^{*}(sd-d), then the toral translation TαT_{\alpha} does not have the s-MSTP.

b) A circle rotation TαT_{\alpha} has the s-MSTP iff α𝒟(s1).\alpha\in{\mathcal{D}}^{*}(s-1).

Question 25.

Is this true that the toral translation TαT_{\alpha} has the s-MSTP iff α𝒟(sdd)?\alpha\in{\mathcal{D}}^{*}(sd-d)?

Another possible direction is to study specific sequences, asking for example that rn=cnγ,r_{n}=cn^{-\gamma}, or that nrndnr_{n}^{d} be decreasing, in which case the sequence rnr_{n} is coined a Khinchin sequence. The case rn=cn1/dr_{n}=cn^{-1/d} in dimension dd is very particular, but important. Indeed a vector α𝕋d\alpha\in\mathbb{T}^{d} is said to be badly approximable if for some c>0c>0, the sequence limNUN(α,{cn1/d})<.\lim_{N\to\infty}U_{N}(\alpha,\{cn^{-1/d}\})<\infty. It is known that the set of badly approximable vectors has zero measure. By contrast, vectors α\alpha such that limNUN(α,{cn(1/d+ε)})=\lim_{N\to\infty}U_{N}(\alpha,\{cn^{-(1/d+{\varepsilon})}\})=\infty for some ε>0{\varepsilon}>0 are called very well approximated, or VWA. The obvious direction of the Borel-Cantelli lemma implies that almost every α𝕋d\alpha\in\mathbb{T}^{d} is not very well approximated (cf. [19, Chap. VII]). The latter facts are particular cases of a more general result, the Khintchine–Groshev theorem on Diophantine approximation which gives a very detailed description of the sequences such that UN(x,{rn})U_{N}(x,\{r_{n}\}) diverges for almost all α.\alpha. We refer the reader to [13] for a nice discussion of that theorem and its extensions, and to Section 9.1 below.

Khinchin sequences also display BC property much more likely than general sequences. For example, compare Theorem 23(b) below with Theorem 21 which shows that the set of vectors having mSTP has zero measure.

If a shrinking target property holds it is natural to investigate the asymptotics of the number of target hits. This makes the following definition natural. We say that a given sequence of targets {An}\{A_{n}\} is sBC or strong Borel-Cantelli for (T,X,μ)(T,X,\mu) if for almost every xx

limNn=1NχAn(Tnx)n=1Nμ(An)=1.\lim_{N\to\infty}\frac{\sum_{n=1}^{N}\chi_{A_{n}}(T^{n}x)}{\sum_{n=1}^{N}\mu(A_{n})}=1.
Theorem 23.

[20] (a) For every α𝕋\alpha\in\mathbb{T} such that its convergents satisfy anCn76a_{n}\leq Cn^{\frac{7}{6}} the sequence {B(cn)}\{B({\frac{c}{n}})\} is sBC for Tα.T_{\alpha}.

(b) For almost every α𝕋,\alpha\in\mathbb{T}, any Khinchin sequence is sBC for Tα.T_{\alpha}.

(c) For any α𝒟(1)\alpha\in{\mathcal{D}}(1), and any decreasing sequence {rn}\{r_{n}\} such that rn=,\sum r_{n}=\infty, {B(rn)}\{B(r_{n})\} is sBC for TαT_{\alpha}.

Observe that the condition in (a) has full measure. On the other hand, it is not hard to see that if an(α)n2+εa_{n}(\alpha)\sim n^{2+{\varepsilon}} for every nn then the sequence (B(1n))(B({\frac{1}{n}})) does not have the sBC for TαT_{\alpha}. Indeed, if

x[kqn12nqn,kqn+12nqn]x\in\left[\frac{k}{q_{n}}-\frac{1}{2nq_{n}},\frac{k}{q_{n}}+\frac{1}{2nq_{n}}\right]

then since qnα1qn+12n2+εqn\|q_{n}\alpha\|\leq\frac{1}{q_{n+1}}\leq\frac{2}{n^{2+{\varepsilon}}q_{n}} and lnqnCnlnn\ln q_{n}\leq Cn\ln n

l=qnn1+ε/2qnχB(1n)(x+lα)n1+ε/2l=1n1+ε/2qn1n.\sum_{l=q_{n}}^{n^{1+{\varepsilon}/2}q_{n}}\chi_{B(\frac{1}{n})}(x+l\alpha)\geq n^{1+{\varepsilon}/2}\gg\sum_{l=1}^{n^{1+{\varepsilon}/2}q_{n}}\frac{1}{n}.

But it is easy to see that a.e. xx belongs to infinitely many intervals of the form [kqn12nqn,kqn+12nqn][\frac{k}{q_{n}}-\frac{1}{2nq_{n}},\frac{k}{q_{n}}+\frac{1}{2nq_{n}}].

In higher dimensions, it was proved in [117] that

Theorem 24.

If nrnd=\sum_{n}r_{n}^{d}=\infty then for almost every vector α𝕋d\alpha\in\mathbb{T}^{d}, the sequence (B(rn))(B({r_{n}})) is sBC for the translation Tα.T_{\alpha}.

6.2. On the distribution of hits.

Theorems 23 and 24 motivate the study of the error terms

ΔN(c,α,x)=VN(α,x)n=1NVol(Brn) and Δ¯N(c,α)=UN(α)n=1NVol(Brn).\Delta_{N}(c,\alpha,x)=V_{N}(\alpha,x)-\sum_{n=1}^{N}{\mathrm{Vol}}(B_{r_{n}})\text{ and }{\bar{\Delta}}_{N}(c,\alpha)=U_{N}(\alpha)-\sum_{n=1}^{N}{\mathrm{Vol}}(B_{r_{n}}).

One can for example try to give lower and upper asymptotic bounds on the growth of ΔN\Delta_{N} as a function of the arithmetic properties of α\alpha in the spirit of Kintchine-Beck Theorem 6 and Questions 68. Here we will be interested in the distribution of ΔN(c,α,x)\Delta_{N}(c,\alpha,x) after adequate normalization when α\alpha or xx or (α,x)(\alpha,x) are random.

Theorem 25.

([9, 90]) Let rn=cn1/d.r_{n}=cn^{-1/d}. Suppose that xx is uniformly distributed on 𝕋d.\mathbb{T}^{d}. For any c>0c>0, if α𝒟(0)\alpha\in{\mathcal{D}}^{*}(0), there is a constant KK such that all limit points of ΔN(c,α,x)lnN\dfrac{\Delta_{N}(c,\alpha,x)}{\sqrt{\ln N}} are 𝔑(σ2){\mathfrak{N}}(\sigma^{2}) with σ2K.\sigma^{2}\leq K.

In the case of random (α,x)(\alpha,x) we have

Theorem 26.

Let rn=cn1/d.r_{n}=cn^{-1/d}. ([30]) There is Σ(c,d)>0\Sigma(c,d)>0 such that if (α,x)(\alpha,x) is uniformly distributed on 𝕋d×𝕋d{\mathbb{T}}^{d}\times{\mathbb{T}}^{d} then ΔN(c,α,x)lnN\dfrac{\Delta_{N}(c,\alpha,x)}{\sqrt{\ln N}} converges to 𝔑(Σ(c,d)).{\mathfrak{N}}(\Sigma(c,d)).

There is an analogous statement for the return times.

Theorem 27.

([107, 114, 30]) Let rn=cn1/d.r_{n}=cn^{-1/d}. There is Σ¯(c,d)>0\bar{\Sigma}(c,d)>0 such that if α\alpha is uniformly distributed on 𝕋d{\mathbb{T}}^{d} then Δ¯N(c,α)bN\dfrac{{\bar{\Delta}}_{N}(c,\alpha)}{\sqrt{b_{N}}} converges in distribution to 𝔑(Σ¯(c,d)){\mathfrak{N}}(\bar{\Sigma}(c,d)) where

bN={lnNlnlnN if d=1lnN if d2.b_{N}=\begin{cases}\ln N\ln\ln N&\text{ if }d=1\\ \ln N&\text{ if }d\geq 2.\end{cases}

The case d=1d=1 was obtained in [107, Theorem 3.1.1 on page 44] (see also [114]), based on the metric theory of the continued fractions. In fact, one can handle more general sequences. Namely, let ϕ(k)\phi(k) satisfy the following conditions

  • (i)

    ϕ(k)0,\phi(k)\searrow 0, but kϕ(k)=+,\sum_{k}\phi(k)=+\infty,

  • (ii)

    There exists 0<δ<1/20<\delta<1/2 such that k=1nϕ(k)kδCk=1nϕ(k)\sum_{k=1}^{n}\frac{\phi(k)}{k^{\delta}}\leq C\sqrt{\sum_{k=1}^{n}\phi(k)}

  • (iii)

    k=1nϕ2(k)Ck=1nϕ(k).\sum_{k=1}^{n}\phi^{2}(k)\leq C\sqrt{\sum_{k=1}^{n}\phi(k)}.

Theorem 28.

([44]) If rn=ϕ(lnn)nr_{n}=\frac{\phi(\ln n)}{n} and α\alpha is uniformly distributed on 𝕋\mathbb{T} then Δ¯N(c,α)F(n)lnF(n)\dfrac{{\bar{\Delta}}_{N}(c,\alpha)}{\sqrt{F(n)\ln F(n)}} converges in distribution to 𝔑(𝚺(c)){\mathfrak{N}}({\boldsymbol{\Sigma}}(c)) where F(n)=k=1nϕ(lnk)k.F(n)=\sum_{k=1}^{n}\frac{\phi(\ln k)}{k}.

The higher dimensional case is obtained via ergodic theory of homogeneous flows and martingale methods in [30].

Question 26.

Study the limiting distribution of UNU_{N} and VNV_{N} in case rn=cnγr_{n}=\frac{c}{n^{\gamma}} with γ<1d.\gamma<\frac{1}{d}.

Question 27.

Do Theorems 24, 26 and 27 hold when the random vector α\alpha is taken from a proper submanifold of 𝕋d,\mathbb{T}^{d}, for example α=(s,s2,,sd).\alpha=(s,s^{2},\dots,s^{d}).

One motivation for this question comes from Diophantine approximation on manifolds (see [13] and references wherein), another is multidimensional extension of Kesten Theorem (cf. Question 24).

6.3. Proofs outlines.

First we sketch a proof of Theorem 24 in case rn=cn1/d.r_{n}=cn^{-1/d}. Consider the number Nm(α,x)N_{m}(\alpha,x) of solutions to

x+nαB(0,cn1/d),em<n<em+1.x+n\alpha\in B(0,cn^{-1/d}),\quad e^{m}<n<e^{m+1}.

The argument used to prove Theorem 16 shows that

Nm(α,x)=f(gm(Λαd+1+x))N_{m}(\alpha,x)=f(g_{m}(\Lambda_{\alpha}\mathbb{Z}^{d+1}+x))

where ff is the function on the space of affine lattices given by

f(L)=vLχB(0,c)×[1,e](v).f(L)=\sum_{v\in L}\chi_{B(0,c)\times[1,e]}(v).

Thus

(25) m=1lnNNm(α,x)m=1lnNf(gm(Λαd+1+x))\sum_{m=1}^{\ln N}N_{m}(\alpha,x)\sim\sum_{m=1}^{\ln N}f(g_{m}(\Lambda_{\alpha}\mathbb{Z}^{d+1}+x))

and Theorem 24 for rn=cn1/dr_{n}=cn^{-1/d} reduces to the study of ergodic sums (25) under the assumption that the initial condition has a density on n+(α,x)n_{+}(\alpha,x). In fact, a standard argument allows to reduce the problem to the case when the initial condition has density on the space of lattices. Namely, it is not difficult to check that the ergodic sums of ff do not change much if we move in the stable or neutral direction in the space of lattices. After this reduction, the sBC property follows from the Ergodic Theorem.

The relation (25) also allows to reduce Theorem 26 to a Central Limit Theorem for ergodic sums of gmg_{m} which can be proven, for example, by a martingale argument (see [81]. We refer the reader to [26] for a nice introduction to the martingale approach to limit theorems for dynamical systems.)

The proof of Theorem 27 is similar but one needs to work with lattices centered at 0 rather than affine lattices.

In particular, the non-standard normalization in case d=1d=1 is explained by the fact that ff in this case is not in L2L^{2} and the main contribution comes from the region where ff is large (in fact, the analysis is similar to [46, Section 4]).

7. Skew products. Random walks.

7.1. Basic properties.

The properties of ergodic sums along toral translations are crucial to the study of some classes of dynamical systems, such as skew products or special flows. In this section we consider the skew products. Special flows are the subject of Section 8.

Skew products above TαT_{\alpha} will be denoted Sα,A:𝕋d×𝕋r𝕋d×𝕋rS_{\alpha,A}:\mathbb{T}^{d}\times\mathbb{T}^{r}\to\mathbb{T}^{d}\times\mathbb{T}^{r} They are given by Sα,A(x,y)=(x+α,y+A(x) mod 1)S_{\alpha,A}(x,y)=(x+\alpha,y+A(x)\text{ mod }1). Cylindrical cascades above TαT_{\alpha} will be denoted Wα,A:𝕋d×r𝕋d×r.W_{\alpha,A}:\mathbb{T}^{d}\times\mathbb{R}^{r}\to\mathbb{T}^{d}\times\mathbb{R}^{r}. They are given by Wα,A(x,y)=(x+α,y+A(x))W_{\alpha,A}(x,y)=(x+\alpha,y+A(x)). Note that

Wα,AN(x,y)=(x+Nα,y+AN(x))W_{\alpha,A}^{N}(x,y)=(x+N\alpha,y+A_{N}(x))

(the same formula holds for Sα,AS_{\alpha,A} but the second coordinate has to be taken mod 1). If AA takes integer values then Wα,AW_{\alpha,A} preserves 𝕋d×r\mathbb{T}^{d}\times\mathbb{Z}^{r} and it is natural to restrict the dynamics to this subset. Thus cylindrical cascades define random walks on r\mathbb{R}^{r} or r\mathbb{Z}^{r} driven by the translation TαT_{\alpha}.

If α\alpha is Diophantine and AA is smooth then the so called linear cohomological equation similar to (3)

(26) A(x)𝕋dA(u)𝑑u=B(x+α)+B(x)A(x)-\int_{\mathbb{T}^{d}}A(u)du=-B(x+\alpha)+B(x)

has a smooth solution BB, thus Sα,AS_{\alpha,A} and Wα,AW_{\alpha,A} are respectively smoothly conjugated to the translations Sα,𝕋dAS_{\alpha,\int_{\mathbb{T}^{d}}A} and Wα,𝕋dAW_{\alpha,\int_{\mathbb{T}^{d}}A} via the conjugacy (x,y)(x,yB(x))(x,y)\mapsto(x,y-B(x)).

Hence the ergodic properties of the skew products and the cascades with smooth AA are interesting to study only in the Liouville case. The following is a convenient ergodicity criterion for skew products.

Proposition 4.

[78] Sα,AS_{\alpha,A} is ergodic iff for any λr{0}\lambda\in\mathbb{Z}^{r}-\{0\}, λ,A\langle\lambda,A\rangle is not a measurable multiplicative coboundary above TαT_{\alpha}, that is, iff there does not exist λr{0}\lambda\in\mathbb{Z}^{r}-\{0\} and a measurable solution ψ:𝕋d\psi:\mathbb{T}^{d}\to\mathbb{C} to

(27) ei2πλ,A(x)=ψ(x+α)/ψ(x).e^{i2\pi\langle\lambda,A(x)\rangle}=\psi(x+\alpha)/\psi(x).

This ergodicity criterion can be simply derived from the observation that the spaces VλV_{\lambda} of functions of the form

(28) ϕ(x)ei2πλ,y\phi(x)e^{i2\pi\langle\lambda,y\rangle}

are invariant under Sα,AS_{\alpha,A}. It then follows that the existence or nonexistence of an invariant function φ\varphi by Sα,AS_{\alpha,A} is determined by the existence or nonexistence of a solution to (27). We refer the reader to Section 8 for further discussion concerning (27).

When AA is not a linear coboundary, i.e. (26) does not have a solution, it is very likely and often easy to prove that (27) does not have a solution either. For example, it suffices to show that the sums ANnA_{N_{n}} do not concentrate on a subgroup of lower dimension for a sequence NnN_{n} such that TαNnId.T_{\alpha}^{N_{n}}\to{\rm Id}. Indeed, if a solution to (27) exists then |ψ||\psi| is constant by ergodicity of the base translation. Therefore by Lebesgue Dominated Convergence Theorem

limn𝕋dei2πλ,ANn(x)𝑑x=limn𝕋dψ(x+Nnα)/ψ(x)𝑑x=1\lim_{n\to\infty}\int_{\mathbb{T}^{d}}e^{i2\pi\langle\lambda,A_{N_{n}}(x)\rangle}dx=\lim_{n\to\infty}\int_{\mathbb{T}^{d}}\psi(x+N_{n}\alpha)/\psi(x)dx=1

which means that ANn(x)A_{N_{n}}(x) is concentrated near the set

{ur:λ,u}.\{u\in\mathbb{R}^{r}:\langle\lambda,u\rangle\in\mathbb{Z}\}.

In particular it was shown, in [35], that for every Liouville translation vector αd,\alpha\in\mathbb{R}^{d}, the generic smooth function AA does not admit a solution to (27) for any λd{0}\lambda\in\mathbb{R}^{d}-\{0\}. Hence the generic smooth skew product above a Liouville translation is ergodic (cf. Section 7.3 and Theorem 42 in Section 8).

It is known that ergodic skew products Sα,AS_{\alpha,A} are actually uniquely ergodic (see [100]). On the other hand, skew products above translations are never weak mixing since they have the translation as a factor. However, the same ideas as the ones used to prove ergodicity of the skew products often prove that all eigenfunctions come from that factor (see [45, 42, 58, 59, 128]).

If one considers skew products on 𝕋×𝕋\mathbb{T}\times\mathbb{T} with smooth increasing functions on (0,1)(0,1) having a jump discontinuity at 0 then the corresponding skew product will even be mixing in the fibers, that is, the correlations between functions that depend only on the fiber coordinate tends to 0. A classical example is given by the skew shift (x,y)(x+α,y+x)(x,y)\mapsto(x+\alpha,y+x). The mixing in the fibers can be easily derived from the invariance of VλV_{\lambda} defined by (28) and the fact that, by the Ergodic Theorem, Anx=(Ax)n+.\frac{\partial A_{n}}{\partial x}=\left(\frac{\partial A}{\partial x}\right)_{n}\to+\infty. A similar phenomenon can occurs for analytic skew products that are homotopic to identity but over higher dimensional tori 𝕋d×𝕋(x,y)(x+α,y+ϕ(x))\mathbb{T}^{d}\times\mathbb{T}\ni(x,y)\mapsto(x+\alpha,y+\phi(x)), with α\alpha and ϕ\phi as in Theorem 47 below (see [37]). This mechanism can also be used to establish ergodicity of cylindrical cascades (see [102]). A fast decay of correlations in the fibers can be responsible for the existence of non trivial invariant distributions for these skew products similarly to what occurs for the skew shift (x,y)(x+α,x+y)(x,y)\mapsto(x+\alpha,x+y) (see [60]).

The deviations of ergodic sums for skew products, that is the behavior of the sums

n=0N1B(Sα,An(x,z))N𝕋d𝕋rB(x,z)𝑑x𝑑z\sum_{n=0}^{N-1}B(S_{\alpha,A}^{n}(x,z))-N\int_{\mathbb{T}^{d}}\int_{\mathbb{T}^{r}}B(x,z)dxdz

is poorly understood. The only cases where some results are available have significant extra symmetry [60, 91, 41].

7.2. Recurrence.

Our next topic are cylindrical cascades. As it was mentioned above they are sometimes called deterministic random walks. So the first question one can ask is if the walk is recurrent (that is, ANA_{N} returns to some bounded region infinitely many times) or transient. We will assume in this section that AA has zero mean since otherwise Wα,AW_{\alpha,A} is transient by the ergodic theorem. If r=1r=1 this condition is also sufficient. In fact, the next result is valid for skew products over arbitrary ergodic transformations (in fact, there is a multidimensional version of this result, see Theorem 32).

Theorem 29.

([5]) If r=1,r=1, AA is integrable and has zero mean then WW is recurrent.

7.2.1. Recurrence and the Denjoy Koksma Property.

Next we note that if the base dimension d=1d=1 and AA has bounded variation then WW is recurrent for all rr and for all α\alpha\in{\mathbb{R}}-{\mathbb{Q}} due to the Denjoy-Koksma inequality stating that

(29) maxx𝕋|Aqnqn𝕋A(y)𝑑y|2V\max_{x\in\mathbb{T}}|A_{q_{n}}-q_{n}\int_{\mathbb{T}}A(y)dy|\leq 2V

for every denominator of the convergence of α\alpha, where VV is the total variation of AA.

More generally we say that AA (not necessarily of zero mean) has the Denjoy-Koksma property (DKP) if there exist constants C,δ>0C,\delta>0 and a sequence nkn_{k}\to\infty such that

(30) (|Anknk𝕋dA(y)𝑑y|C)δ.{\mathbb{P}}(|A_{n_{k}}-n_{k}\int_{\mathbb{T}^{d}}A(y)dy|\leq C)\geq\delta.

We say that AA has the strong Denjoy-Koksma property (sDKP) if (30) holds with δ=1.\delta=1.

Note that if DKP holds and AA has zero mean then the set of points where liminfn|An|C\lim\inf_{n\to\infty}|A_{n}|\leq C has positive measure and so by ergodicity of the base map Wα,AW_{\alpha,A} is recurrent.

Later, we will also see how the DKP can be very helpful in proving ergodicity of the cylindrical cascades as well as weak mixing of special flows.

The situation with DKP for translations on higher dimensional tori is delicate. Of course it holds for almost all α\alpha and for every smooth function by the existence of smooth solutions to the linear cohomological equation (3). But the DKP also holds above most translations even from a topological point of view.

Theorem 30.

([36]) There is a residual set of vectors in αd\alpha\in\mathbb{R}^{d} such that DKP holds above TαT_{\alpha} for every function that is of class C4.C^{4}.

In fact, it is non-trivial to construct rotation vectors and smooth functions that do not have the DKP. The first construction is due to Yoccoz and it actually provides examples of non recurrent analytic cascades.

Theorem 31.

([129, Appendix]) For d=2d=2 there exists an uncountable dense set YY of translation vectors and a real analytic function 𝒜:𝕋2{\mathcal{A}}:\mathbb{T}^{2}\to{\mathbb{C}} with zero mean such that WW is not recurrent.

Denote the translation vector by (α,α′′).(\alpha^{\prime},\alpha^{\prime\prime}). The main ingredient in the construction of [129] is that the denominators, qnq_{n}^{\prime} and qn′′q^{\prime\prime}_{n} of the convergents of α\alpha^{\prime} and α′′\alpha^{\prime\prime} are alternated, and more precisely, they are such that the sequence qn,qn′′,qn+1,qn+1′′...q_{n}^{\prime},q^{\prime\prime}_{n},q_{n+1}^{\prime},q^{\prime\prime}_{n+1}... increases exponentially. We will see later that the same construction can be used to create examples of mixing special flows with an analytic ceiling function.

Let YY be the set of couples (α,α′′)22(\alpha^{\prime},\alpha^{\prime\prime})\in{{\mathbb{R}}^{2}-\mathbb{Q}}^{2}, whose sequences of best approximations qnq_{n}^{\prime} and qn′′q^{\prime\prime}_{n} satisfy, for any nn0(α,α′′)n\geq n_{0}(\alpha^{\prime},\alpha^{\prime\prime})

qn′′e3qn,qn+1e3qn′′.q^{\prime\prime}_{n}\geq e^{3q^{\prime}_{n}},\quad q^{\prime}_{n+1}\geq e^{3q^{\prime\prime}_{n}}.

Then [129] constructs a real analytic function 𝒜:𝕋2{\mathcal{A}}:\mathbb{T}^{2}\to{\mathbb{C}} with zero integral such that for almost every (x,y)𝕋2(x,y)\in\mathbb{T}^{2} |An(x,y)||A_{n}(x,y)|\to\infty, hence Wα,𝒜W_{\alpha,{\mathcal{A}}} is not recurrent. Note that the set YY as defined above is uncountable and dense in 2\mathbb{R}^{2}.

7.2.2. Indicator functions.

Now we specify the study of Wα,AW_{\alpha,A} to the case where

A=(χΩjVol(Ωj))j=1,,r where ΩjX=𝕋d are regular sets.A=(\chi_{\Omega_{j}}-{\rm Vol}(\Omega_{j}))_{j=1,\ldots,r}\text{ where }\Omega_{j}\subset X=\mathbb{T}^{d}\text{ are regular sets.}

If d>1d>1 then the DKP does not seem to be well adapted for proving recurrence in this case (see Questions 610).

Question 28.

Show that DKP does not hold when d>1d>1 and A=(χΩjVol(Ωj))j=1,,rA=(\chi_{\Omega_{j}}-{\rm Vol}(\Omega_{j}))_{j=1,\ldots,r} and the ΩjX\Omega_{j}\subset X are balls or boxes.

There is however another criterion for recurrence which is valid for arbitrary skew products.

Theorem 32.

Given a sequence δn=o(n1/r)\delta_{n}=o(n^{1/r}) the following holds.

(a) ([22]) Consider the map T:XXT:X\to X preserving a measure μ.\mu. Let W(x,y)=(Tx,y+A(x)).W(x,y)=(Tx,y+A(x)). If there exists a sequence knk_{n} such that limnμ(x:Akn(x)δn)=1\displaystyle\lim_{n\to\infty}\mu(x:A_{k_{n}}(x)\leq\delta_{n})=1 then WW is recurrent.

(b) Consider a parametric family of maps Tα:XX,T_{\alpha}:X\to X, α𝔄.\alpha\in\mathfrak{A}. Assume that TαT_{\alpha} preserves a measure μα.\mu_{\alpha}. Let (α,x)(\alpha,x) be distributed according to a measure 𝛌{\boldsymbol{\lambda}} on 𝔄×X\mathfrak{A}\times X such that d𝛌=dν(α)dμα(x)d{\boldsymbol{\lambda}}=d\nu(\alpha)d\mu_{\alpha}(x) for some measure ν\nu on 𝔄.\mathfrak{A}. If n=0N1A(Tαnx)δN\dfrac{\sum_{n=0}^{N-1}A(T_{\alpha}^{n}x)}{\delta_{N}} has a limiting distribution as NN\to\infty then Wα,AW_{\alpha,A} is recurrent for ν\nu-almost all α.\alpha.

Note that TT is not required to be ergodic. On the other hand if TT is ergodic, r=1r=1 and AA has zero mean, then by the Ergodic Theorem μ(|An/n|>ε)0\mu(|A_{n}/n|>{\varepsilon})\to 0 for any ε{\varepsilon} so one can take kn=nk_{n}=n and δn=εnn\delta_{n}={\varepsilon}_{n}n where εn0{\varepsilon}_{n}\to 0 sufficiently slowly. Therefore Theorem 32 implies Theorem 29.

Proof.

(a) Suppose BB is a wondering set (that is, WkBW^{k}B are disjoint) of positive measure which is contained in {|z|<C}.\{|z|<C\}. Let

Bn={(x,z)B:Akn(x)δn}.B_{n}=\{(x,z)\in B:A_{k_{n}}(x)\leq\delta_{n}\}.

Then μ(Bn)μ(B)\mu(B_{n})\to\mu(B) as nn\to\infty so for large nn

μ(1inWki(Bki))nμ(B)2.\mu(\cup_{1\leq i\leq n}W^{k_{i}}(B_{k_{i}}))\geq n\frac{\mu(B)}{2}.

On the other hand, by assumption Wki(B)Ei:={y2C+δi}EnW^{k_{i}}(B)\subset E_{i}:=\{y\leq 2C+\delta_{i}\}\subset E_{n} if i[1,n].i\in[1,n]. Hence μ(1inWki(Bki))δnr=o(n)\mu(\cup_{1\leq i\leq n}W^{k_{i}}(B_{k_{i}}))\leq\delta_{n}^{r}=o(n), a contradiction.

(b) follows from (a) applied to the map 𝒯:(𝔄×X)×r{\mathcal{T}}:(\mathfrak{A}\times X)\times\mathbb{R}^{r} given by 𝒯(α,x,y)=(α,Wα,A(x,y)).{\mathcal{T}}(\alpha,x,y)=(\alpha,W_{\alpha,A}(x,y)).

Combining Theorems 10 and 32(b) we obtain

Corollary 33.

If {Ωj}j=1,,r\{\Omega_{j}\}_{j=1,\ldots,r} are real analytic and strictly convex and (d1)2d<1r\frac{(d-1)}{2d}<\frac{1}{r} then WW is recurrent for almost all α.\alpha.

Note that the proof of Theorem 32 is not constructive.

Question 29.

(a) Construct α\alpha and {Ωj}j=1,,r\{\Omega_{j}\}_{j=1,\ldots,r} for which the corresponding WW is non recurrent.

(b) Find explicit arithmetic conditions which imply recurrence.

Theorem 34.

([22]) (a) If {Ωj}j=1,,r\{\Omega_{j}\}_{j=1,\ldots,r} are polyhedra then WW is recurrent for almost all α.\alpha.

(b) There are polyhedra {Ωj}j=1,,r\{\Omega_{j}\}_{j=1,\ldots,r} and α\alpha in 𝕋2{\mathbb{T}}^{2} such that WW is transient.

Here part (a) follows from Theorem 32 and a control on the growth of the ergodic sums. Namely it is proven in [22] that given any polyhedron Ω𝕋d\Omega\subset\mathbb{T}^{d} then for any γ>0\gamma>0, for almost every αd\alpha\in\mathbb{R}^{d}, we have that An2=O(nγ)\|A_{n}\|_{2}=O(n^{\gamma}) where A=χΩVol(Ω),A=\chi_{\Omega}-{\rm Vol}(\Omega), the sums are considered above the translation TαT_{\alpha} and the L2L_{2} norm is considered with respect to the Haar measure on 𝕋d\mathbb{T}^{d}. In the case of boxes, the latter naturally follows from the power log control given by Beck’s Theorem (see Section 3.1).

The proof of part (b) proceeds by extending the method of [129] discussed in Section 7.2.1 to the case of indicator functions.

Question 30.

Is it true that for a generic choice of Ωj\Omega_{j} as in Question 33, WW is transient for almost all α\alpha when (d1)2d>1r\frac{(d-1)}{2d}>\frac{1}{r}?

An affirmative answer to Question 18 (Local Limit Theorem) would give evidence that Question 18 may be true due to Borel-Cantelli Lemma. (More precisely, to answer Question 30 we need a joint Local Limit Theorem for ergodic sums of indicators of several sets.)

Question 31.

Let α\alpha be as in Theorem 34 (a) or Question 33. Does there exist xx such that limNAN(x)=\lim_{N\to\infty}||A_{N}(x)||=\infty?

Note that this is only possible if d>1d>1 due to the Denjoy-Koksma inequality. On the other hand in any dimension one can have orbits which stay in a half space. Such orbits have been studied extensively (see [103] and the references wherein).

Another case where recurrence is not easy to establish is that of skew products over circle rotations with functions having a singularity such as the examples discussed in Section 2. We will come back to this question in the next section.

7.3. Ergodicity.

Next we discuss the ergodicity of cylindrical cascades. Here one has to overcome both problems of recurrence discussed in Section 7.2 and issues of non-arithmeticity appearing in the study of ergodicity of Sα,A.S_{\alpha,A}.

The ergodicity of Wα,AW_{\alpha,A} is usually established using the fact that the sums ANnA_{N_{n}} are increasingly well distributed on r\mathbb{R}^{r} when considered above any small scale balls in the base and for some rigidity sequence NnN_{n}, i.e. such that Nnα0\|N_{n}\alpha\|\to 0. More precisely, usual methods of proving their ergodicity take into consideration a sequence of distributions

(31) (Ank)(μ),k1\left(A_{n_{k}}\right)_{\ast}(\mu),\;k\geq 1

along some rigidity sequence {nk}\{n_{k}\} as probability measures on ¯r{\bar{\mathbb{R}}}^{r} where ¯\bar{\mathbb{R}} is the one-point compactification of \mathbb{R}. As shown in [85] each point in the topological support of a limit measure of (31) is a so called essential value for Wα,A.W_{\alpha,A}. Following [115] ara\in\mathbb{R}^{r} is called an essential value of AA if for each B𝕋dB\in\mathbb{T}^{d} of positive measure, for each ϵ>0\epsilon>0 there exists NN\in\mathbb{Z} such that

μ(BTNB[|AN()a|<ϵ])>0.\mu(B\cap T^{-N}B\cap[|A_{N}(\cdot)-a|<\epsilon])>0.

Denote by E(A)E(A) the set of essential values of AA. Then the essential value criterion states as follows

Theorem 35.

([115],[1])

(a) E(A)E(A) is a closed subgroup of r\mathbb{R}^{r}.

(b) E(A)=rE(A)=\mathbb{R}^{r} iff Wα,AW_{\alpha,A} is ergodic.

(c) If AA is integer valued and E(A)=rE(A)=\mathbb{Z}^{r} then Wα,AW_{\alpha,A} is ergodic on 𝕋d×r.{\mathbb{T}}^{d}\times\mathbb{Z}^{r}.

Hence if the supports of the probability measures in (31) are increasingly dense on r\mathbb{R}^{r} then Wα,AW_{\alpha,A} is ergodic.

The case where d=r=1d=r=1 is the most studied although there are still some open questions in this context. For d=r=1d=r=1 ergodicity is often proved using the Denjoy Koksma Property. Indeed, if AA is not cohomologous to a constant then ANNAA_{N}-N\int A are not bounded. Let qnq_{n} be a best denominator for the base rotation. Pick KnK_{n} which is large but not too large. Then KqnKq_{n} is still a rigidity time for the translation but AKqnA_{Kq_{n}} have sufficiently large albeit controlled oscillations to yield that a given value aa in the fibers is indeed an essential value.

This method is well adapted to AA whose Fourier transform satisfies A^(n)=O(1/|n|)\hat{A}(n)=\mbox{O}(1/|n|), since they display a DKP (see [84]). Example of such functions are functions of bounded variation and functions smooth everywhere except for a log symmetric singularity.

Ergodicity also holds in general for characteristic functions of intervals.

Theorem 36.

(a) [35] If α\alpha is Liouville, there is a residual set of smooth functions AA with zero integral such that the skew product Wα,AW_{\alpha,A} is ergodic.

(b) ([43]) If AA has a symmetric logarithmic singularity then Wα,AW_{\alpha,A} is ergodic for all irrational α.\alpha.

(c) ([24]) If A=χ[0,1/2]χ[1/2,1]A=\chi_{[0,1/2]}-\chi_{[1/2,1]} and α\alpha is irrational then Wα,AW_{\alpha,A} is ergodic on 𝕋1×.\mathbb{T}^{1}\times\mathbb{Z}.

(d) ([97]) If A=χ[0,β]βA=\chi_{[0,\beta]}-\beta then Wα,AW_{\alpha,A} is ergodic iff 1,α1,\alpha and β\beta are rationally independent.

(e) ([102]) If AA is piecewise absolutely continuous, 𝕋1A(x)𝑑x=0,\int_{\mathbb{T}^{1}}A(x)dx=0, AA^{\prime} is Riemann integrable and 𝕋1A(x)𝑑x0\int_{\mathbb{T}^{1}}A^{\prime}(x)dx\neq 0 then Wα,AW_{\alpha,A} is ergodic for all α.\alpha\in\mathbb{R}-\mathbb{Q}.

(f) ([23]) If A:𝕋r=(A1,,Ar)A:\mathbb{T}\to\mathbb{R}^{r}=(A_{1},\ldots,A_{r}) with Aj=cj,iχIj,iβjA_{j}=\sum c_{j,i}\chi_{I_{j,i}}-\beta_{j} with Ij,iI_{j,i} a finite family of intervals, cj,ic_{j,i}\in\mathbb{Z} and βj\beta_{j} is such that 𝕋Aj(x)𝑑x=0\int_{\mathbb{T}}A_{j}(x)dx=0 and if the sequence ({qnβ1},,{qnβr})(\{q_{n}\beta_{1}\},\ldots,\{q_{n}\beta_{r}\}) is equidisitributed on 𝕋r\mathbb{T}^{r} as nn\to\infty, where qnq_{n} is the sequence of denominators of α\alpha, then Wα,AW_{\alpha,A} is ergodic. In the case r=1r=1, it is sufficient to ask that ({qnβ})(\{q_{n}\beta\}) has infinitely many accumulation points, then Wα,AW_{\alpha,A} is ergodic.

For further results on the ergodicity of cascades defined over circle rotations with step functions as in (f), we refer to the recent work [25].

The proofs of (a) and (b) are based on DKP and progressive divergence of the sums as explained above. (c)–(e) are treated differently since the ergodic sums take discrete values. For example, the proof of (e) in the case r=1r=1 is based on the fact that AqnA_{q_{n}} is bounded by DKP and then the hypothesis on {qnβ}\{q_{n}\beta\} implies that the set of essential values is not discrete, hence it is all of \mathbb{R}, and the ergodicity follows.

The cases of slower decay of the Fourier coefficients of A{A} are more difficult to handle. We have nevertheless a positive result in the particular situation of log singularities.

Theorem 37.

[39] If AA has (asymmetric) logarithmic singularity then Wα,AW_{\alpha,A} is ergodic for almost every α.\alpha.

The delicate point in Theorem 37 is that the DKP does not hold. Indeed, it was shown in [119] that the special flow above TαT_{\alpha} and under a function that has asymmetric log singularity is mixing for a.e. α\alpha. But, as we will see in the next section, mixing of the special flow is not compatible with the DKP. A contrario special flows under functions with symmetric logarithmic singularities are not mixing [72, 84] because of the DKP.

In the proof of Theorem 37, one first shows that the DKP (30) holds if the constant δ\delta is replaced by a sequence δn\delta_{n} which decays sufficiently slowly and then uses this to push through the standard techniques under appropriate arithmetic conditions.

The case of general angles for the base rotation or the case of stronger singularities are harder and all questions are still open.

Question 32.

Are there examples of ergodic cylindrical cascades with smooth functions having power like singularities?

Conversely, we may ask the following

Question 33.

Are there examples of non ergodic cylindrical cascades with smooth functions having non symmetric logarithmic or (integrable) power singularities?

The study of ergodicity when d>1d>1 and r>1r>1 is more tricky essentially because of the absence of DKP.

For smooth observable, only the Liouville frequencies are interesting. The ergodic sums above such frequencies tend to stretch at least along a subsequence of integers. And this stretch usually occurs gradually and independently in all the coordinates of AA hence a positive answer to the following question is expected.

Question 34.

Show that for any Liouville vector α\alpha, there is a residual set of smooth functions AA with zero integral such that the skew product Wα,AW_{\alpha,A} is ergodic.

As we discussed in the proof of Theorem 37, the cylindrical cascade on 𝕋×\mathbb{T}\times\mathbb{R} with a function AA having an asymmetric logarithmic singularity is ergodic for almost every α\alpha although the ergodic sums ANA_{N} above TαT_{\alpha} concentrate at infinity as NN\to\infty. The slow divergence of these sums that compare to lnN\ln N (see Question 1) plays a role in the proof of ergodicity. The logarithmic control of the discrepancy relative to a polyhedron (see Theorems 6, 11 and 34) motivates the following question.

Question 35.

Is it true that for (almost) every polyhedra Ωj𝕋d\Omega_{j}\subset\mathbb{T}^{d}, j=1,,rj=1,\ldots,r, and A=(χΩjVol(Ωj))j=1,,rA=(\chi_{\Omega_{j}}-{\rm Vol}(\Omega_{j}))_{j=1,\ldots,r}, the cascades Wα,AW_{\alpha,A} are ergodic?

We note that the answer is unknown even for boxes with d=2d=2 and r=1r=1.

7.4. Rate of recurrence.

Section 7.3 described several situations where Wα,AW_{\alpha,A} is ergodic. However for infinite measure preserving transformations the (ratio) ergodic theorem does not specify the growth of ergodic sums. Rather it shows that for any L1L^{1} functions B1(x,y),B2(x,y)B_{1}(x,y),B_{2}(x,y) with B2>0B_{2}>0 we have

(32) n=0N1B1(Wα,An(x,y))n=0N1B2(Wα,An(x,y))B1(x,y)𝑑x𝑑yB2(x,y)𝑑x𝑑y.\frac{\sum_{n=0}^{N-1}B_{1}(W_{\alpha,A}^{n}(x,y))}{\sum_{n=0}^{N-1}B_{2}(W_{\alpha,A}^{n}(x,y))}\to\frac{\iint B_{1}(x,y)dxdy}{\iint B_{2}(x,y)dxdy}.

In fact ([1]) there is no sequence aNa_{N} such that

(33) n=0N1B1(Wα,An(x,y))aN\frac{\sum_{n=0}^{N-1}B_{1}(W_{\alpha,A}^{n}(x,y))}{a_{N}}

converges to 1 almost surely. On the other hand, one can try to find aNa_{N} such that (33) converges in distribution. By (32) it suffices to do it for one fixed function B.B. For example one can take B=χB(0,1).B=\chi_{B(0,1)}. This motivates the following question.

Question 36.

Let α\alpha be as in Theorem 34 (a) or Question 33. How often is WNR||W^{N}||\leq R?

So far this question has been answered only in a special case. Namely, let d=r=1,d=r=1, ZN(x)=n=0N1[χ[0,1/2](x+nα)1/2].Z_{N}(x)=\sum_{n=0}^{N-1}\left[\chi_{[0,1/2]}(x+n\alpha)-1/2\right]. Denote 𝕃N=Card(nN:Zn=0).{\mathbb{L}}_{N}=\text{Card}(n\leq N:Z_{n}=0).

Theorem 38.

[6] If α\alpha is a quadratic surd then there exists a constant c=c(α)c=c(\alpha) such that lnNcN𝕃N\frac{\sqrt{\ln N}}{cN}{\mathbb{L}}_{N} converges to e𝔑2/2.e^{-{\mathfrak{N}}^{2}/2}.

Similar results have been previously obtained by Ledrappier-Sarig for abelian covers of compact hyperbolic surfaces ([82]). The fact that the correct normalization is N/lnNN/\sqrt{\ln N} was established in [2].

Question 37.

Extend Theorem 38 to the case when 1/21/2 is replaced by

(a) any rational number;

(b) any irrational number, (in which case one needs to replace {AN=0}\{A_{N}=0\} by {|AN|1}\{|A_{N}|\leq 1\}).

Question 38.

What happens for typical α?\alpha?

Note that in contrast to Theorem 38, 𝕃NNlnN{\mathbb{L}}_{N}\sim\dfrac{N}{\ln N} (rather than 𝕃NNlnN{\mathbb{L}}_{N}\sim\dfrac{N}{\sqrt{\ln N}}) is expected in view of Kesten’s Theorem 9. Ideas of the proof of Theorem 38 will be described in Section 8.5.

8. Special flows.

8.1. Ergodic integrals.

In this section we consider special flows above TαT_{\alpha} which will be denoted Tα,At.T^{t}_{\alpha,A}. Here A()>0A(\cdot)>0 is called the ceiling function and the flow is given by

𝕋d×/\displaystyle\mathbb{T}^{d}\times\mathbb{R}/\sim \displaystyle\rightarrow 𝕋d×/\displaystyle\mathbb{T}^{d}\times\mathbb{R}/\sim
(x,s)\displaystyle(x,s) \displaystyle\rightarrow (x,s+t),\displaystyle(x,s+t),

where \sim is the identification

(34) (x,s+A(x))(Tα(x),s).(x,s+A(x))\sim(T_{\alpha}(x),s).

Equivalently the flow is defined for t+s0t+s\geq 0 by

Tt(x,s)=(x+nα,t+sAn(x))T^{t}(x,s)=(x+n\alpha,t+s-A_{n}(x))

where nn is the unique integer such that

(35) An(x)t+s<An+1(x).A_{n}(x)\leq t+s<A_{n+1}(x).

Since TαT_{\alpha} preserves a unique probability measure μ\mu then the special flow will preserve a unique probability measure that is the normalized product measure of μ\mu on the base and the Lebesgue measure on the fibers.

Special flows above ergodic maps are always ergodic for the product measure constructed as above. The interesting feature of special flows is that they can be more ”chaotic” then the base map, displaying properties such as weak mixing or mixing even if the base map does not have them. Actually any map of a very wide class of zero entropy measure theoretic transformations, so called Loosely Bernoulli maps, are isomorphic to sections of special flows above any irrational rotation of the circle with a continuous ceiling function (see [98]).

If A=βA=\beta is constant then Tα,AT_{\alpha,A} is the linear flow on 𝕋d+1\mathbb{T}^{d+1} with frequency vector (α,β).(\alpha,\beta). Thus special flows Tα,AtT^{t}_{\alpha,A} can be viewed as time changes of translation flows on 𝕋d+1.\mathbb{T}^{d+1}. In particular, if we consider the linear flow on 𝕋d+1\mathbb{T}^{d+1} and multiply the velocity vector by a smooth non-zero function ϕ\phi we get a special flow with a smooth ceiling function A.A.

8.2. Smooth time change.

We recall that a translation flow frequency vdv\in\mathbb{R}^{d} is said to be Diophantine if there exists σ,τ>0\sigma,\tau>0 such that (k,v)C|k|σ||(k,v)||\geq C|k|^{-\sigma} for every kdk\in\mathbb{Z}^{d}. Hence a translation vector (1,v)d+1(1,v)\in\mathbb{R}^{d+1} is Diophantine (homogeneous Diophantine or Diophantine in the sense of flows) if and only if vv is a Diophantine vector in the sense of (2).

Theorem 39.

[76] Smooth non vanishing time changes of translation flows with a Diophantine frequency vector are smoothly conjugated to translation flows.

Proof.

Let vv be a constant vector field on 𝕋d+1\mathbb{T}^{d+1}. We suppose WLOG that v=(1,α)v=(1,\alpha). Let u(x)u(x) be a smooth function on the torus and x˙=u(x)v\dot{x}=u(x)v. Then, making a change of variables y=Tvϕ(x)(x)y=T^{\phi(x)}_{v}(x) we obtain the equation y˙=(ϕ+vϕ)(y)u(y)v.\dot{y}=(\phi+\partial_{v}\phi)(y)u(y)v. The equation for yy is linear if ϕ+vϕ=cu.\phi+\partial_{v}\phi=\dfrac{c}{u}. Passing to Fourier series, this equation can be solved if c=ϕ(x)𝑑x(dxu(x))1c=\int\phi(x)dx\left(\int\dfrac{dx}{u(x)}\right)^{-1} and vv is such that |1+(k,v)||C|k|σ|1+(k,v)||\geq C|k|^{-\sigma} for every kd+1k\in\mathbb{Z}^{d+1} which is equivalent to α\alpha Diophantine as in (2).

One can also see this fact at the level of the special flow Tα,AtT^{t}_{\alpha,A} associated to x˙=u(x)v\dot{x}=u(x)v. Namely, making a change of variables (y,s)=Tα,AB(x,t)(x,t)(y,s)=T_{\alpha,A}^{B(x,t)}(x,t) transforms Tα,AT_{\alpha,A} to Tα,DT_{\alpha,D} with

D(x)=A(x)+B(x+α,0)B(x,0)D(x)=A(x)+B(x+\alpha,0)-B(x,0)

so one can make the LHS constant provided α\alpha is Diophantine. Finally, the similarity between linear and nonlinear flows in the Diophantine case is also reflected in (35) since for Diophantine vectors α\alpha An=nA(x)𝑑x+O(1).A_{n}=n\int A(x)dx+O(1).

An interesting question is that of deviations of ergodic sums above time changed linear flows. In fact, the case of linear flows is already non trivial and can be studied by the methods described in Section 5.5. More precisely, as for translations the interesting case occurs when the function under consideration has singularities, for example, for indicator functions.

Namely, given a set Ω\Omega let

(36) 𝐃Ω(r,v,x,T)=0TχΩr(Tvtx)𝑑tTVol(Ωr){\mathbf{D}}_{\Omega}(r,v,x,T)=\int_{0}^{T}\chi_{\Omega_{r}}(T_{v}^{t}x)dt-T\text{Vol}({\Omega_{r}})

where TvtT_{v}^{t} denotes the linear flow with velocity v.v.

We assume that (x,v,r)(x,v,r) are distributed according to a smooth density.

Theorem 40.

([28, 29]). Suppose that Ω\Omega is analytic and strictly convex.

(a) If d=2d=2 then 𝐃Ω(r,v,x,T){\mathbf{D}}_{\Omega}(r,v,x,T) converges in distribution.

(b) If d=3d=3 then 𝐃Ω(r,v,x,T)lnT\frac{{\mathbf{D}}_{\Omega}(r,v,x,T)}{\ln T} converges to a Cauchy distribution.

(c) If d4d\geq 4 then 𝐃Ω(r,v,x,T)rd12Td32(d1)\frac{{\mathbf{D}}_{\Omega}(r,v,x,T)}{r^{\frac{d-1}{2}}T^{\frac{d-3}{2(d-1)}}} has limiting distribution as T.T\to\infty.

(d) For any dd\in\mathbb{N}, if Ω\Omega is a box then 𝐃Ω(r,v,x,T){\mathbf{D}}_{\Omega}(r,v,x,T) converges in distribution.

The proof of Theorem 40 is similar to the proofs of Theorems 10 and 11 and Corollary 1.

Corollary 41.

Theorem 40 remains valid for time changes Tu(x)vT_{u(x)v} where u(x)u(x) is a fixed smooth positive function and vv is random as in Theorem 40.

Proof.

To fix our ideas let us consider the case where Ω\Omega is analytic and strictly convex. Note that Tuvtx=Tvτ(x,t)T_{uv}^{t}x=T_{v}^{\tau(x,t)} where the by the above discussion the function τ\tau satisfies

τ(t,x)=at+ε(t,x,v) where a=(dxu(x))1\tau(t,x)=at+{\varepsilon}(t,x,v)\text{ where }a=\left(\int\frac{dx}{u(x)}\right)^{-1}

and ε(t,x,v){\varepsilon}(t,x,v) is bounded for almost all vv uniformly in xx and t.t. Accordingly it suffices to see how much time is spend inside Ωr\Omega_{r} for the linear segment of length at.at.

Next if the linear flow stays inside Ωr\Omega_{r} during the time [t1,t2][t_{1},t_{2}] then the time spend in Ωr\Omega_{r} by the orbit of TuvT_{uv} equals to t1t2dtu(x(t)).\int_{t_{1}}^{t_{2}}\frac{dt}{u(x(t))}. Thus we need to control the following integral for linear flow

𝐃~Ω(r,v,x,T)=0TχΩr(Tvtx)u(Tvtx)𝑑tTΩrdxu(x).\tilde{{\mathbf{D}}}_{\Omega}(r,v,x,T)=\int_{0}^{T}\frac{\chi_{\Omega_{r}}(T_{v}^{t}x)}{u(T_{v}^{t}x)}dt-T\int_{\Omega_{r}}\dfrac{dx}{u(x)}.

However the Fourier transform of χΩr(x)u(x)\frac{\chi_{\Omega_{r}}(x)}{u(x)} has a similar asymptotics at infinity as the Fourier transform of χΩr(x)\chi_{\Omega_{r}}(x) (see [123]) so the proof of the Corollary proceeds in the same way as the proof of Theorem 40 in [28]. ∎

Up to now, we were interested in smooth time change of linear flows with typical frequencies. We will further discuss smooth time changes for special frequencies in Section 8.4 devoted to mixing properties.

8.3. Time change with singularities.

If the time changing function of an irrational flow has zeroes then the ceiling function of the corresponding special flow has poles. In this case the smooth invariant measure is infinite. In the case of a unique singularity, we have that the time changed flow is uniquely ergodic with the Dirac mass at the singularity the unique invariant probability measure:

Proposition 5.

Consider a flow TtT^{t} given by a smooth time change of an irrational linear flow obtained by multiplying the constant vector field by a function which is smooth and non zero everywhere except for one point x0x_{0}, then for any continuous function bb and any xx

limt1t0tb(Tux)𝑑u=b(x0).\lim_{t\to\infty}\frac{1}{t}\int_{0}^{t}b(T^{u}x)du=b(x_{0}).
Proof.

To simplify the notation we assume that the time change preserves the orientation of the flow. We use the representation as a special flow Tα,AtT^{t}_{\alpha,A} with AA having a pole.

It suffices to prove this statement in case bb equals to 0 in a small neighborhood of x0.x_{0}. In that case we have

(37) 0tb(Tα,Au(x,s))𝑑u=Bn(t)(x)+O(1)\int_{0}^{t}b(T^{u}_{\alpha,A}(x,s))du=B_{n(t)}(x)+O(1)

where B(x)=0A(x)b(x,s)𝑑sB(x)=\int_{0}^{A(x)}b(x,s)ds and n(t)n(t) is defined by (35). If bb vanishes in a small neighborhood of x0x_{0} then BB is bounded and so |Bn(t)|Cn(t).|B_{n(t)}|\leq Cn(t). Therefore it suffices to show that n(t)t0\frac{n(t)}{t}\to 0 which is equivalent to Ann.\frac{A_{n}}{n}\to\infty. Let A~\tilde{A} be a continuous function which is less or equal to AA everywhere. Then

liminfAnnlimnA~nn=A~(x)𝑑x.\lim\inf\frac{A_{n}}{n}\geq\lim_{n\to\infty}\frac{\tilde{A}_{n}}{n}=\int\tilde{A}(x)dx.

Since A(x)𝑑x=\int A(x)dx=\infty we can make A~(x)𝑑x\int\tilde{A}(x)dx as large as possible proving our claim. ∎

Question 39.

In the setting of Proposition 5 describe the deviations of ergodic integrals from b(x0).b(x_{0}).

Question 40.

Consider the case where the time change has finite number of zeroes x1,x2,,xm.x_{1},x_{2},\dots,x_{m}. In that case all limit measures are of the form j=1mpjδxj.\sum_{j=1}^{m}p_{j}\delta_{x_{j}}. Which pjp_{j} describe the behavior of Lebesgue-typical points?

In view of the relation (37) these questions are intimately related to Theorems 3 and 5 and Questions 2, 4 and 5 from Section 2.

Refer to caption
Figure 2. Kocergin Flow is topologically equivalent to the area preserving flow shown on Figure 1 with separatrix loop removed. The rest point is responsible for the shear along the orbits.

If one is interested in flows with singularities preserving a finite non-atomic measure then the simplest example can be obtained by plugging (by smooth surgery) in the phase space of the minimal two dimensional linear flow an isolated singularity coming from a Hamiltonian flow in 2\mathbb{R}^{2} (see Figure 2). The so called Kochergin flows thus obtained preserve besides the Dirac measure at the singularity a measure that is equivalent to Lebesgue measure [71]. As it was explained in Section 2 Kochergin flows model smooth area preserving flows on 𝕋2.\mathbb{T}^{2}. These flows still have 𝕋\mathbb{T} as a global section with a minimal rotation for the return map, but the slowing down near the fixed point produces a singularity for the return time function above the last point where the section intersects the incoming separatrix of the fixed point. The strength of the singularity depends on how abruptly the linear flow is slowed down in the neighborhood of the fixed point. A mild slowing down, or mild shear, is typically represented by the logarithm while stronger singularities such as xa,a(0,1)x^{-a},a\in(0,1) are also possible. Powerlike singularities appear naturally in the study of area preserving flows with degenerate fixed points. We shall see below that dynamical properties of the special flows are quite different for logarithmic and power like singularities.

Question 41.

What can be said about the deviations of the ergodic sums above Kocergin flows?

8.4. Mixing properties.

We give first a classical criterion for weak mixing of special flows. Its proof is similar to the proof of the ergodicity criterion for skew products given by Proposition 4.

Proposition 6.

([126]) Tα,AT_{\alpha,A} is weak mixing iff for any λ\lambda\in\mathbb{R}^{*}, there are no measurable solutions to the multiplicative cohomological equation

(38) ei2πλA(x)=ψ(x+α)/ψ(x).e^{i2\pi\lambda A(x)}=\psi(x+\alpha)/\psi(x).

Indeed if h(x,t)h(x,t) is the eigenfunction when for almost all xx the function h(x,t)eλth(x,t)e^{-\lambda t} takes the same value ψ(x)\psi(x) for almost almost all t.t. Then (38) follows from the identification (34).

Theorem 42.

([35]) If the vector αd\alpha\in{\mathbb{R}}^{d} is not β\beta-Diophantine then there exists a dense GδG_{\delta} for the Cβ+dC^{\beta+d} topology, of functions φCβ+d(𝕋d,+),\varphi\in C^{\beta+d}({\mathbb{T}}^{d},{\mathbb{R}}^{*}_{+}), such that the special flow constructed over TαT_{\alpha} with the ceiling function φ\varphi is weak mixing.

This result is optimal since smooth time changes of linear flows with Diophantine vectors α,\alpha, are smoothly conjugated to the linear flow and, hence, are not weak mixing.

Mixing of special flows is more delicate to establish since one needs to have uniform distribution on increasingly large scales in +\mathbb{R}^{+} of the sums ANA_{N} for all integers NN\to\infty, and this above arbitrarily small sets of the base space. Indeed mixing of special flows above non mixing base dynamics is in general proved as follows: if the ergodic sums ANA_{N} become as NN\to\infty uniformly stretched (well distributed inside large intervals of +\mathbb{R}_{+}) above small sets, the image by the special flow at a large time TT of these small sets decomposes into long strips that are well distributed in the fibers due to uniform stretch and well distributed in projection on the base because of ergodicity of the base dynamics (see Figure 3).

Refer to caption
Figure 3. Mixing mechanism for special flows: the image of a rectangle is a union of long narrow strips which fill densely the phase space.

The delicate point however is to have uniform stretch for all integers NN\to\infty. In particular the following result has been essentially proven in [70].

Theorem 43.

If AA has DKP then Tα,AtT_{\alpha,A}^{t} is not mixing.

Proof.

If AA has the DKP then there is a set Ω\Omega of positive measure on which (30) holds for positive density of nk.n_{k}. By passing to a subsequence we can find a set II of positive measure, a sequence {tk}\{t_{k}\} and a vector β\beta such that on Ω\Omega |Anktk|<C|A_{n_{k}}-t_{k}|<C and αnkβ.\alpha n_{k}\to\beta. Pick a small η.\eta.

Ωi=0tηTα,At[I×{0}],Ωf=0|t|C+ηTα,At[(I+β)×{0}].\Omega_{i}=\cup_{0\leq t\leq\eta}T_{\alpha,A}^{t}[I\times\{0\}],\quad\Omega_{f}=\cup_{0\leq|t|\leq C+\eta}T_{\alpha,A}^{t}[(I+\beta)\times\{0\}].

By decreasing II if necessary we obtain that those sets have measures strictly between 0 and 1.1. On the other hand it is not difficult to see from the definition of the special flow that μ(Tα,AtkΩiΩf)μ(Ωi)\mu(T_{\alpha,A}^{t_{k}}\Omega_{i}\cap\Omega_{f})\to\mu(\Omega_{i}) contradicting the mixing property. ∎

In particular the flows with ceiling functions AA of bounded variation or functions with symmetric log singularities are not mixing.

In fact, since the sDKP holds for any minimal circle diffeomorphism, it follows from (35) and (37) that any smooth flow on 𝕋2\mathbb{T}^{2} without cycles or fixed points is not topologically mixing. We leave this as an exercise for the reader.

The first positive result about mixing of special flows is obtained in [71].

Theorem 44.

If α\alpha\in\mathbb{R}-\mathbb{Q} and AA has (integrable) power singularities then Tα,AT_{\alpha,A} is mixing.

The reason why the case of power singularities is easier than the logarithmic case (corresponding to non-degenerate flows on 𝕋2\mathbb{T}^{2}) is the following. The standard approach for obtaining the stretching of ergodic sums is to control Anx=(Ax)n\frac{\partial A_{n}}{\partial x}=\left(\frac{\partial A}{\partial x}\right)_{n} For AA as in theorem 44, Ax\frac{\partial A}{\partial x} has singularities of the type xax^{-a} with a>1.a>1. In this case the main contribution to ergodic sums comes from the closest encounter with the singularity (cf. Theorem 3) making the control of the stretch easier. Moreover, the strength of the singularity allows to obtain speed of mixing estimates.

Theorem 45.

([34]) If α\alpha is Diophantine and AA has a (integrable) power singularity then Tα,AtT_{\alpha,A}^{t} is power mixing.

More precisely, there exists a constant β=β(α)\beta=\beta(\alpha) such that if R1,R2R_{1},R_{2} are rectangles in 𝕋×\mathbb{T}\times\mathbb{R} then

(39) |μ(R1TtR2)μ(R1)μ(R2)|Ctβ.\left|\mu(R_{1}\cap T^{t}R_{2})-\mu(R_{1})\mu(R_{2})\right|\leq Ct^{-\beta}.

The exponent β\beta in [34] seems to be non optimal.

Question 42.

For α\alpha Diophantine find the asymptotics of the LHS of (39).

It is interesting to surpass the threshold β=1/2\beta=1/2. In particular, one would like to answer the following question.

Question 43.

[83] Can a smooth area preserving flow on 𝕋2\mathbb{T}^{2} have Lebesgue spectrum?

On the other hand for logarithmic singularities there might be cancelations in ergodic sums of Ax,\frac{\partial A}{\partial x}, making the question of mixing more tricky.

Theorem 46.

Let AA be as in Question 1.

(a) ([72]) If jaj+=jaj\sum_{j}a_{j}^{+}=\sum_{j}a_{j}^{-} then Tα,AtT_{\alpha,A}^{t} is not mixing for any α.\alpha\in\mathbb{R}-\mathbb{Q}.

(b) ([119, 73]) If jaj+jaj\sum_{j}a_{j}^{+}\neq\sum_{j}a_{j}^{-} then Tα,AtT_{\alpha,A}^{t} is mixing for almost every α.\alpha\in\mathbb{R}-\mathbb{Q}.

(c) ([73]) If aj+aja_{j}^{+}-a_{j}^{-} has the same sign for all jj then Tα,AtT_{\alpha,A}^{t} is mixing for each α.\alpha\in\mathbb{R}-\mathbb{Q}.

Question 44.

([74]) Does the condition that jaj+jaj\sum_{j}a_{j}^{+}\neq\sum_{j}a_{j}^{-} imply Tα,AtT_{\alpha,A}^{t} is mixing for every α\alpha\in\mathbb{R}-\mathbb{Q}?

Question 45.

([74]) Under the conditions of Theorems 44 and 46 is Tα,AtT_{\alpha,A}^{t} mixing of all orders?

In higher dimensions much less is known. Note that for smooth ceiling functions Theorems 30, 39 and 43 precludes mixing for a set of rotation vectors of full measure that also contains a residual set.

The following was shown in [36]. Recall the definition of the set YY used in Theorem 31. Define the following real analytic complex valued function on 𝕋2{\mathbb{T}}^{2}:

𝒜(x,y)=(k=2ei2πkxek+k=2ei2πkyek).{\mathcal{A}}(x,y)=\left(\sum_{k=2}^{\infty}{e^{i2\pi kx}\over e^{k}}+\sum_{k=2}^{\infty}{e^{i2\pi ky}\over e^{k}}\right).
Theorem 47.

For any (α,α′′)Y(\alpha^{\prime},\alpha^{\prime\prime})\in Y, the special flow constructed over the translation Tα,α′′T_{\alpha^{\prime},\alpha^{\prime\prime}} on 𝕋2{\mathbb{T}}^{2}, with the ceiling function 1+Re𝒜1+\mathrm{Re}{\mathcal{A}} is mixing.

Because of the disposition of the best approximations of α\alpha^{\prime} and α′′\alpha^{\prime\prime} the ergodic sums φm\varphi_{m} of the function φ\varphi, for any mm sufficiently large, will be always stretching (i.e. have big derivatives), in one or in the other of the two directions, xx or yy, depending on whether mm is far from {qn}\{q^{\prime}_{n}\} or far from {qn′′}\{q^{\prime\prime}_{n}\}. And this stretch will increase when mm goes to infinity. So when time goes from 0 to tt, tt large, the image of a small typical interval JJ from the basis 𝕋2{\mathbb{T}}^{2} (depending on tt the intervals should be taken along the xx or the yy axis) will be more and more distorted and stretched in the fibers’ direction, until the image of JJ at time tt will consist of a lot of almost vertical curves whose projection on the basis lies along a piece of a trajectory under the translation Tα,α′′T_{\alpha^{\prime},\alpha^{\prime\prime}}. By unique ergodicity these projections become more and more uniformly distributed, and so will Tt(J)T^{t}(J). For each tt, and except for increasingly small subsets of it (as function of tt), we will be able to cover the basis with such “typical” intervals. Besides, what is true for JJ on the basis is true for Ts(J)T^{s}(J) at any height ss on the fibers. So applying Fubini Theorem in two directions, first along the other direction on the basis (for a time tt all typical intervals are in the same direction), and second along the fibers, we will obtain the asymptotic uniform distribution of any measurable subset, which is, by definition, the mixing property.

Question 46.

Are the flows obtained in Theorem 47 mixing of all orders?

Question 47.

For which vectors αd\alpha\in\mathbb{R}^{d}, there exist special flows above TαT_{\alpha} with smooth functions AA such that Tα,AtT^{t}_{\alpha,A} is mixing?

The foregoing discussion demonstrates that both ergodicity of cylindrical cascades and mixing of special flows require a detailed analysis of ergodic sums (1). However, the estimates needed in those two cases are quite different and somewhat conflicting. Namely, for ergodicity we need to bound from below the probability that ergodic sums hit certain intervals, while for mixing one needs to rule out too much concentration. For this reason it is difficult to construct functions AA such that Wα,AW_{\alpha,A} is ergodic while Tα,c+AtT_{\alpha,c+A}^{t} is mixing. In fact, so far this has only been achieved for smooth functions with asymmetric logarithmic singularities. However, it seems that in higher dimensions there is more flexibility so such examples should be more common.

Question 48.

Is it true that for (almost) every polyhedron Ω𝕋d\Omega\in\mathbb{T}^{d}, d2d\geq 2, and almost every a>0a>0, and almost every α𝕋d\alpha\in\mathbb{T}^{d}, the special flow above α\alpha and under the function a+χΩa+\chi_{\Omega} is mixing?

Note that a positive answer to both this question and Question 35 will give a large class of interesting examples where ergodicity of Wα,AW_{\alpha,A} and mixing for Tα,c+AT_{\alpha,c+A} (for any cc such that c+A>0c+A>0) hold simultaneously.

Question 49.

Answer Questions 35 and 48 in the case Ω\Omega is a strictly convex analytic set.

8.5. An application.

Here we show how the geometry of special flows above cylindrical cascades can be used to study the ergodic sums.

Proof of Theorem 38..
Refer to caption
Figure 4. Staircase surfaces. The sides marked by the same symbol are identified.

The proof uses the properties of the staircase surface StSt shown on Figure 4. The staircase is an infinite pile of 2×12\times 1 rectangles so that the left bottom corner of the next rectangle is attached to the center of the top of the previous one. The sides which are differ by two units in either horizontal or vertical direction are identified. We number all the rectangles from -\infty to ++\infty as shown on Figure 4. There is a translational symmetry given by G(x,y)=(x+1,y+1)G(x,y)=(x+1,y+1) and St/GSt/G is a torus. We shall use coordinates p¯=(p,z)\bar{p}=(p,z) on the staircase where pp are coordinates on the torus which is the identified with rectangle zero and zz\in\mathbb{Z} is the index of rectangle. Thus we have (p,z)=Gz(p,0).(p,z)=G^{z}(p,0).

The key step in the proof is an observation of [53] that StSt is a Veech surface. Namely, given ASL2()A\in SL_{2}({\mathbb{Z}}) such that AImod2A\equiv I\mod 2 there exists unique automorphism ϕA\phi_{A} of StSt which commutes with G,G, fixes the singularities of St,St, has derivative AA at the non-singular points and has drift 0.0. That is, in our coordinates

(40) ϕ(p,z)=(Ap,z+τ(p))\phi(p,z)=(Ap,z+\tau(p))

and the drift condition means that 𝕋2τ(p)𝑑p=0.\int_{\mathbb{T}^{2}}\tau(p)dp=0.

Refer to caption
Figure 5. Poincare map for a linear flow on the staircase. Orbits starting from [1/2,1][1/2,1] go up while orbits starting from [0,1/2)[0,1/2) have to go down due to the gluing conditions.

Consider the linear flow on StSt with slope θ\theta which is locally given by Tt(x,y)=(x+tcosθ,y+tsinθ).T^{t}(x,y)=(x+t\cos\theta,y+t\sin\theta). Let Π\Pi be the union of the top sides of the rectangles in St.St. We identify Π\Pi with 𝕋×\mathbb{T}\times\mathbb{Z} using the map η:𝕋×Π\eta:\mathbb{T}\times\mathbb{Z}\to\Pi such that η(x,z)\eta(x,z) is the point on the top side of rectangle zz at the distance 2x2x from the left corner. It is easy to check (see Figure 5) that under this identification the Poincare map for TtT^{t} takes form

(x,z)=(x+α,z+χ[1/2,1](x)χ[0,1/2)(z)) where α=tanθ+12.(x,z)=(x+\alpha,z+\chi_{[1/2,1]}(x)-\chi_{[0,1/2)}(z))\text{ where }\alpha=\frac{\tan\theta+1}{2}.

Now suppose that α\alpha and hence tanθ\tan\theta is a quadratic surd. By Lagrange theorem there is ASL2()A\in SL_{2}({\mathbb{Z}}) such that A(cosθsinθ)=λ(cosθsinθ).A\left(\begin{array}[]{c}\cos\theta\\ \sin\theta\end{array}\right)=\lambda\left(\begin{array}[]{c}\cos\theta\\ \sin\theta\end{array}\right). By replacing AA by AkA^{k} for a suitable (positive or negative) kk we may assume that AImod2A\equiv I\mod 2 and that λ<1.\lambda<1. Let ΓN(x)\Gamma_{N}(x) be the ray starting from η(x,0)\eta(x,0) having slope θ\theta and length Nsinθ.\frac{N}{\sin\theta}.

𝕃N=mes(p¯ΓN(x):z(p¯)=0)length(ΓN(x)=mes(q¯Γ~(x):z(ϕAm)=0)length(Γ~){\mathbb{L}}_{N}=\frac{{\rm mes}(\bar{p}\in\Gamma_{N}(x):z(\bar{p})=0)}{{\text{length}}(\Gamma_{N}(x)}=\frac{{\rm mes}(\bar{q}\in{\tilde{\Gamma}}(x):z(\phi_{A}^{-m})=0)}{{\text{length}}({\tilde{\Gamma}})}
=x(z(ϕAmq¯)=0)={\mathbb{P}}_{x}(z(\phi_{A}^{-m}\bar{q})=0)

where Γ~=ϕAmΓN(x)\tilde{\Gamma}=\phi_{A}^{m}\Gamma_{N}(x) and x{\mathbb{P}}_{x} is computed under the assumption that q¯\bar{q} is uniformly distributed on Γ~.{\tilde{\Gamma}}. Choose mm to be the smallest number such that length(ϕAmΓN(x))=λmNsinθ1.{\text{length}}(\phi^{m}_{A}\Gamma_{N}(x))=\lambda^{m}\frac{N}{\sin\theta}\leq 1. Note that mlnNlnλ.m\approx\frac{\ln N}{\ln\lambda}. By our choice of m,m, Γ~{\tilde{\Gamma}} is either contained in a single rectangle or intersects two of them. Let us consider the first case, the second one is similar. So we assume that Γ~{\tilde{\Gamma}} is in the rectangle with index aa so that q¯=(q,a).\bar{q}=(q,a). Due to (40) z(ϕmq¯)=aj=1mτ(ϕAjq).z(\phi^{-m}\bar{q})=a-\sum_{j=1}^{m}\tau(\phi_{A}^{-j}q). Thus

x(z(ϕAmq¯)=0)=x(j=1mτ(ϕAjq=a)).{\mathbb{P}}_{x}\left(z(\phi_{A}^{-m}\bar{q})=0\right)={\mathbb{P}}_{x}\left(\sum_{j=1}^{m}\tau(\phi_{A}^{-j}q=a)\right).

Now we apply the Local Limit Theorem for linear toral automorphisms (see [101, Section 4] or [46]) which says that there is a constant σ2\sigma^{2}

x(j=1nτ(ϕAjq))12πmσea2/2σ2m.{\mathbb{P}}_{x}\left(\sum_{j=1}^{n}\tau(\phi_{A}^{-j}q)\right)\approx\frac{1}{\sqrt{2\pi m}\sigma}e^{-a^{2}/2\sigma^{2}m}.

It remains to note that

a(x)=j=0m1τ(ϕAjη(x,0))a(x)=\sum_{j=0}^{m-1}\tau(\phi_{A}^{j}\eta(x,0))

so applying the Central Limit Theorem for linear toral automorphisms we see that if xx is uniformly distributed on 𝕋1\mathbb{T}^{1} then a(x)m\frac{a(x)}{\sqrt{m}} is approximately normal with zero mean and variance σ2.\sigma^{2}.

Next we discuss the proof of Theorem 8(b) in case l=12.l=\frac{1}{2}. The proof proceeds the same way as the proof of Theorem 38 with the following changes.

(I) Instead of estimating the probability that z(ϕAmq¯)=0z(\phi^{-m}_{A}\bar{q})=0 we need to estimate the probability that z(ϕAmq¯)z(\phi^{-m}_{A}\bar{q}) belongs to an interval of length m\sqrt{m} so we use the Central Limit Theorem instead of the Local Limit Theorem.

(II) Instead of taking xx random we take xx fixed at the origin. Note that the origin is fixed by AA so τ(ϕAm(0,0))=Cm.\tau(\phi^{m}_{A}(0,0))=Cm. (More precisely τ\tau is multivalued at the origin since it belong to several rectangles so by τ(ϕAm(0,0))\tau(\phi^{m}_{A}(0,0)) we mean the limit of τ(ϕAm(p¯))\tau(\phi^{m}_{A}(\bar{p})) as p¯\bar{p} approaches the origin inside ΓN(0).\Gamma_{N}(0).)

9. Higher dimensional actions

Question 50.

Generalize the results presented in Sections 2-8 to higher dimensional actions.

The orbits of commuting shifts Tnx=x+j=1qnjαjT^{n}x=x+\sum_{j=1}^{q}n_{j}\alpha_{j} are much less studied than their one-dimensional counterparts. We expect that some of the results of Sections 2-8 admit straightforward extensions while in other cases significant new ideas will be necessary. Below we discuss two areas of research where multidimensional actions appear naturally.

9.1. Linear forms.

Statements about orbits of a single translation can be interpreted as results about joint distribution of fractional part of inhomogenuous linear forms of one variable evaluated over \mathbb{Z}. From the point of view of Number Theory it is natural to study linear forms of several variables evaluated over d\mathbb{Z}^{d}. Let

li(n)=xi+j=1qαijnj,i=1d.l_{i}(n)=x_{i}+\sum_{j=1}^{q}\alpha_{ij}n_{j},\quad i=1\dots d.

Thus it is of interest to study the discrepancy

𝔻N(Ω,α,x)=Card(0nj<N,j=1,,q:({l1(n)},{ld(n)})Ω)NqVol(Ω).\mathbb{D}_{N}(\Omega,\alpha,x)=\\ {\mathrm{Card}}(0\leq n_{j}<N,j=1,\ldots,q:(\{l_{1}(n)\},\dots\{l_{d}(n)\})\in\Omega)-N^{q}\text{Vol}(\Omega).

The latter problem is a classical subject in Number Theory, and there are several important results related to it. In particular, the Poisson regime is well understood ([88]). The following result generalizes Theorem 16 and can be proven by a similar argument.

Theorem 48.

Let (α,x)(\alpha,x) be uniformly distributed on 𝕋d(q+1).\mathbb{T}^{d(q+1)}. Then, for any cube Σq\Sigma\subset\mathbb{R}^{q}, the distribution of

Card(n:nNΣ and ({l1(n)},{ld(n)})Nq/dΩ){\mathrm{Card}}(n:\frac{n}{N}\in\Sigma\text{ and }(\{l_{1}(n)\},\dots\{l_{d}(n)\})\in N^{-q/d}\Omega)

converges as NN\to\infty to

𝒩(Ω,Σ):=Card(eL,e=(x,y):x(e)Ω,y(e)Σ){\mathcal{N}}(\Omega,\Sigma):={\mathrm{Card}}(e\in L,e=(x,y):x(e)\in\Omega,y(e)\in\Sigma)

where LL is a random affine lattice in d+q.\mathbb{R}^{d+q}.

Thus the Poisson regime for the rotations exhibits more regular behavior comparing to standard Poisson processes. However then the number of rotations becomes large the limiting distribution approaches the Poisson. Namely, the following is the special case of the result proven in [125].

Theorem 49.

If Σq\Sigma_{q} are unit cubes in q\mathbb{R}^{q} then Ω𝒩(Ω,Σq)\Omega\to{\mathcal{N}}(\Omega,\Sigma_{q}) converges as qq\to\infty to the Poisson measure 𝛍(Ω)=Card(𝔓Ω){\boldsymbol{\mu}}(\Omega)={\mathrm{Card}}(\mathfrak{P}\cap\Omega) where 𝔓\mathfrak{P} is a Poisson process on d\mathbb{R}^{d} with constant intensity.

Next we present extensions of Theorems 25, 24, 26 and 27 to the context of homogeneous and inhomogeneous linear forms. Let again li(n)=xi+j=1qαijnj,l_{i}(n)=x_{i}+\sum_{j=1}^{q}\alpha_{ij}n_{j}, i=1d.i=1\dots d. Consider

VN(α,x,c)=Card(0ni<N:({l1(n)},{ld(n)})B(c|n|q/d)).V_{N}(\alpha,x,c)={\mathrm{Card}}(0\leq n_{i}<N:(\{l_{1}(n)\},\dots\{l_{d}(n)\})\in B(c|n|^{-q/d})).

More generally given a function ψ:++\psi:{\mathbb{R}}^{+}\to{\mathbb{R}}^{+} define

VNψ(α,x)=Card(0ni<N:({l1(n)},{ld(n)})B(ψ(|q|)).V^{\psi}_{N}(\alpha,x)={\mathrm{Card}}(0\leq n_{i}<N:(\{l_{1}(n)\},\dots\{l_{d}(n)\})\in B(\psi(|q|)).

We also let UN(α,c)=VN(0,α,c)U_{N}(\alpha,c)=V_{N}(0,\alpha,c) and UNψ(α)=VNψ(0,α)U_{N}^{\psi}(\alpha)=V_{N}^{\psi}(0,\alpha) be the quantities measuring the rate of recurrence.

In particular we call the matrix α\alpha badly approximable if there exists c>0c>0 such that for, VN(0,α,c)V_{N}(0,\alpha,c) is bounded. On the other hand, if UNψ(α)U_{N}^{\psi}(\alpha)\to\infty where ψ(r)=r(d/q+ε)\psi(r)=r^{-(d/q+{\varepsilon})} then α\alpha is called very well approximable (VWA).

The following result is known as Khinchine–Groshev Theorem. Almost sure is considered relative to Lebesgue measure on the space of matrices α𝕋dq\alpha\in\mathbb{T}^{dq}.

Theorem 50.

[64, 47, 31, 116, 14, 7] (a) If q|ψd(|n|)<\sum_{\mathbb{Z}^{q}}|\psi^{d}(|n|)<\infty then UNψU_{N}^{\psi} is bounded almost surely.

(b) If qψd(|n|)=+\sum_{\mathbb{Z}^{q}}\psi^{d}(|n|)=+\infty and either ψ\psi is decreasing or dq>1dq>1 then limNUNψ(α)=+\lim_{N\to\infty}U_{N}^{\psi}(\alpha)=+\infty almost surely.

(c) For d=q=1d=q=1 there exists ψ\psi such that nZψ(|n|)=+\sum_{n\in Z}\psi(|n|)=+\infty but UNψU_{N}^{\psi} is bounded almost surely.

(d) If ψ\psi is decreasing and nZdψd(|n|)=+\sum_{n\in Z^{d}}\psi^{d}(|n|)=+\infty then VNψ(α,x)V_{N}^{\psi}(\alpha,x)\to\infty almost surely.

In particular, both badly approximable and very well approximable α\alphas have zero measure.

When the number of hits is infinite, it is natural to consider the question of the sBC property.

Theorem 51.

[116] (a) For almost all α\alpha

UNψ(α)=𝔼(UNψ)+O(Γ(N)ln3Γ(N))U_{N}^{\psi}(\alpha)={\mathbb{E}}(U_{N}^{\psi})+O\left(\sqrt{\Gamma(N)\ln^{3}\Gamma(N)}\right)

where

Γ(N)=|n|Nψ(|n|)dD(gcd(n1nq))\Gamma(N)=\sum_{|n|\leq N}\psi(|n|)^{d}D(gcd(n_{1}\dots n_{q}))

and DD denotes the number of divisors.

(b) Γ(N)C𝔼(UNψ)\Gamma(N)\leq C{\mathbb{E}}(U_{N}^{\psi}) if either q>3q>3 or q=2q=2 and nψ2(n)n\psi^{2}(n) is decreasing.

(c) If q=1q=1 and ψ(n)\psi(n) is decreasing then for each δ\delta

UNψ(α)=𝔼(UNψ)+O(Γ~(N)𝔼(UNψ)ln2+δ(𝔼(UNψ)))U_{N}^{\psi}(\alpha)={\mathbb{E}}(U_{N}^{\psi})+O\left(\sqrt{{\tilde{\Gamma}}(N){\mathbb{E}}(U_{N}^{\psi})}\ln^{2+\delta}({\mathbb{E}}(U_{N}^{\psi}))\right)

where

Γ~(N)=n=1Nψ(n)n.{\tilde{\Gamma}}(N)=\sum_{n=1}^{N}\frac{\psi(n)}{n}.
Question 51.

Does a similar formula as that of Theorem 51 hold for VψV^{\psi}?

Some partial results are obtained in [117].

It follows from the same arguments as the proof of Theorem 24 sketched in Section 6.3 that the sBC property holds for ψ(r)=r(d/q)\psi(r)=r^{-(d/q)} for almost every (α,x)(\alpha,x), that is

limNVN(α,x)𝔼(VN(α,x))=1.\lim_{N\to\infty}\frac{V_{N}(\alpha,x)}{{\mathbb{E}}(V_{N}(\alpha,x))}=1.

For badly approximable α\alpha we have the following.

Theorem 52.

[90] Let xx be uniformly distributed on 𝕋d.\mathbb{T}^{d}. If α\alpha is badly approximable, there exists a constant KK such that all limit points of VN𝔼(VN)lnN\frac{V_{N}-{\mathbb{E}}(V_{N})}{\sqrt{\ln N}} are normal random variables with zero mean and variance σ2\sigma^{2} where 0σ2K.0\leq\sigma^{2}\leq K.

Question 52.

(a) Show that there exist a constant σ¯2>0\bar{\sigma}^{2}>0 such that for almost all α\alpha VN𝔼(VN)lnN\dfrac{V_{N}-{\mathbb{E}}(V_{N})}{\sqrt{\ln N}} converges to 𝔑(σ¯2).{\mathfrak{N}}(\bar{\sigma}^{2}).

(b) Does there exist α\alpha such that liminfNVN𝔼(VN)lnN=0\lim\inf_{N\to\infty}\dfrac{V_{N}-{\mathbb{E}}(V_{N})}{\sqrt{\ln N}}=0 (that is, lnN\sqrt{\ln N} is not a correct normalization for such α\alpha)?

For random α\alpha we have the following.

Theorem 53.

([30]) There exists σ\sigma such that If α1,,αr\alpha_{1},\ldots,\alpha_{r} and x1,,xdx_{1},\ldots,x_{d} are randomly distributed on 𝕋dr+d\mathbb{T}^{dr+d} then VN𝔼(VN)lnN\frac{V_{N}-{\mathbb{E}}(V_{N})}{\sqrt{\ln N}} converges in distribution to a normal random variables with zero mean and variance σ2\sigma^{2}. A similar convergence holds if d+r>2d+r>2, (x1,,xr)=(0,,0)(x_{1},\ldots,x_{r})=(0,\ldots,0) and only the αi\alpha_{i}’s are random.

Still there are many open questions. We provide several examples.

Question 53.

Extend Theorems 10 and 11 to the case q>1.q>1.

We note that in the case of Theorem 11, even the case d=1d=1 seems quite difficult. One can attack this question using the method of [29] but it runs into the problem of lack of parameters described after Question 24.

Question 54.

Let l,l^:d,l,{\hat{l}}:\mathbb{R}^{d}\to\mathbb{R}, be linear forms with random coefficients, Q:dQ:\mathbb{R}^{d}\to\mathbb{R} be a positive definite quadratic form. Investigate limit theorems, after adequate renormalization, for the number of solutions to

(a) {l(n)}Q(n)c,|n|N;\{l(n)\}Q(n)\leq c,|n|\leq N;

(b) {l(n)}|l^(n)|c,|n|N;\{l(n)\}|{\hat{l}}(n)|\leq c,|n|\leq N;

(c) |l(n)Q(n)|c,|n|N;|l(n)Q(n)|\leq c,|n|\leq N;

(d) |l(n)l^(n)|<c,|n|N.|l(n){\hat{l}}(n)|<c,|n|\leq N.

While (a) and (b) have obvious interpretation as shrinking target problems for toral translations, such interpretation for (c) and (d) is less straightforward. Consider for example (c). Let l(n)=j=1qαjnj.l(n)=\sum_{j=1}^{q}\alpha_{j}n_{j}. Dividing the distribution of α\alpha into thin slices we may assume that αd\alpha_{d} is almost constant. If αda\alpha_{d}\approx a then we can compare our problem with |(j=1q1α~jnj)+nq|Q(n)<c~|\left(\sum_{j=1}^{q-1}{\tilde{\alpha}}_{j}n_{j}\right)+n_{q}|Q(n)<{\tilde{c}} where α~j=αj/a,c~=c/a.{\tilde{\alpha}}_{j}=\alpha_{j}/a,{\tilde{c}}=c/a. Since |l(n)||l(n)| should be small we must have |(j=1q1α~jnj)+nq|={j=1qα~j}|\left(\sum_{j=1}^{q-1}{\tilde{\alpha}}_{j}n_{j}\right)+n_{q}|=\{\sum_{j=1}^{q}{\tilde{\alpha}}_{j}\} in which case Q(n1,,nq1,nq)Q(n_{1},\dots,n_{q-1},n_{q}) is well approximated by

Q(n1,,nq1,j=1q1α~jnj)Q(n_{1},\dots,n_{q-1},-\sum_{j=1}^{q-1}{\tilde{\alpha}}_{j}n_{j})

so we have a shrinking target problem in lower dimensions. In fact as we saw in Section 5 typically the proof proceeds in the opposite direction by getting rid of fractional part at the expense of increasing dimension since problems (c) and (d) have more symmetry and so should be easier to analyze.

We note that part (d) deals with degenerate quadratic form. The case of non-degenerate forms is discussed in [32, Sections 5 and 6].

9.2. Cut-and-project sets.

Cut-and-project sets are used in physics literature to model quasicrystals. To define them we need the following data: a lattice in d,\mathbb{R}^{d}, a decomposition d=E1E2\mathbb{R}^{d}=E_{1}\oplus E_{2} and a compact set (a window) 𝒲E2.{\mathcal{W}}\subset E_{2}. Let P1P_{1} and P2P_{2} be the projections to E1E_{1} and E2E_{2} respectively. The cut-and-project set is defined by

𝒫={P1(e),eL and P2(e)𝒲}.{\mathcal{P}}=\{P_{1}(e),e\in L\text{ and }P_{2}(e)\in{\mathcal{W}}\}.

We suppose in the following discussion that

E1+L¯=d and LE2=.\overline{E_{1}+L}=\mathbb{R}^{d}\text{ and }L\cap E_{2}=\emptyset.

Then 𝒫{\mathcal{P}} is a discrete subset of E1E_{1} sharing many properties of lattices but having a more complicated structure. Note that the limiting distributions in Theorems 16 and 48 are described in terms of cut-and-project sets. We refer the reader to [93, Sections 16 and 17] for more discussion of cut-and-project set. Here we only mention the fact that such sets have asymptotic density. Let 𝒫R={t𝒫:|t|R}.{\mathcal{P}}_{R}=\{t\in{\mathcal{P}}:|t|\leq R\}.

Theorem 54.

Suppose that 𝒲{\mathcal{W}} is an open subset of E2E_{2} with a piecewise smooth boundary. Then

limRCard(𝒫R)Vol(B(0,R))=Vol(𝒲)covol(L)VoldVolE1VolE2.\lim_{R\to\infty}\frac{{\mathrm{Card}}({\mathcal{P}}_{R})}{{\mathrm{Vol}}(B(0,R))}=\frac{{\mathrm{Vol}}({\mathcal{W}})}{\mathrm{covol}(L)}\frac{{\mathrm{Vol}}_{\mathbb{R}^{d}}}{{\mathrm{Vol}}_{E_{1}}{\mathrm{Vol}}_{E_{2}}}.
Proof.

(Following [52]). Note that t𝒫t\in{\mathcal{P}} iff there exists eLe\in L such that t+e𝒲,-t+e\in{\mathcal{W}}, that is t𝒲 mod L.-t\in{\mathcal{W}}\text{ mod }L. Consider the action of E1E_{1} on d/L\mathbb{R}^{d}/L given by Tt(x)=x+t.T^{t}(x)=x+t. Then 𝒫R{\mathcal{P}}_{R} counts the number of intersections of the orbit of the origin of size RR with 𝒲.{\mathcal{W}}. Pick a small δ\delta and let 𝒲δ={𝒲+t,|t|δ}.{\mathcal{W}}_{\delta}=\{{\mathcal{W}}+t,|t|\leq\delta\}. Then 𝒲δ{\mathcal{W}}_{\delta} is a subset of d/L\mathbb{R}^{d}/L and for small δ\delta

(41) Vol(𝒲δ)=Vol(𝒲)Vol(B(0,δ))VoldVolE1VolE2.{\mathrm{Vol}}({\mathcal{W}}_{\delta})={\mathrm{Vol}}({\mathcal{W}}){\mathrm{Vol}}(B(0,\delta))\dfrac{{\mathrm{Vol}}_{\mathbb{R}^{d}}}{{\mathrm{Vol}}_{E_{1}}{\mathrm{Vol}}_{E_{2}}}.

Next,

(42) |t|<Rχ𝒲δ(Tt0)𝑑t=Vol(B(0,δ))Card(𝒫R)+O(Rq1)\int_{|t|<R}\chi_{{\mathcal{W}}_{\delta}}(T^{t}0)dt={\mathrm{Vol}}(B(0,\delta)){\mathrm{Card}}({\mathcal{P}}_{R})+O(R^{q-1})

where q=dim(E1)q=\dim(E_{1}) and the second term represents boundary contribution. On the other hand by unique ergodicity of TtT^{t}

(43) |t|<Rχ𝒲δ(Tt0)𝑑t=Vol(B(0,R))Vol(𝒲δ)covol(L)+o(Rq).\int_{|t|<R}\chi_{{\mathcal{W}}_{\delta}}(T^{t}0)dt={\mathrm{Vol}}(B(0,R))\frac{{\mathrm{Vol}}({\mathcal{W}}_{\delta})}{\mathrm{covol}(L)}+o(R^{q}).

Combining (41), (42) and (43) we get the result. ∎

Question 55.

Describe the error term in the asymptotics of Theorem 54.

If q=1q=1 then the error term in (42) is negligible and so (42) can be used to describe the deviations (see [28] for the case where 𝒲{\mathcal{W}} is convex). If q>1q>1 more work is needed to control both the LHS and the RHS of (42).

While the methods of [28, 29] deal with the case where dim(E1)\dim(E_{1}) is as small as possible, the most classical case is the opposite one when dim(E2)\dim(E_{2}) is as large as possible, that is, studying lattice points in large regions. Here we can not attempt to survey this enormous topic, so we refer the reader to the specialized literature on the subject ([56, 57, 77]. We just mention that limit theorems similar in spirit to the results discussed in this paper are obtained in [49, 16, 17, 104]. More generally, instead of considering large balls one can count the number of lattice points in RDRD where DD is a fixed regular set. As in Section 2 the order of the error term is sensitive to the geometry of DD (see e.g. [15, 78, 86, 96, 105, 109, 110, 121, 122] and references wherein). In fact, one can also consider the varying shapes RDRRD_{R} which includes both the Poisson regime where Vol(RDR){\mathrm{Vol}}(RD_{R}) does not grow (see [18] and references wherein) and the intermediate regime where Vol(RDR){\mathrm{Vol}}(RD_{R}) grows but at the rate slower than RdR^{d} (see [54, 127]).

This motivates the following question

Question 56.

Extend the results of the above mentioned papers to cut-and-project sets.

References

  • [1] Aaronson J. An Introduction to Infinite Ergodic Theory, Math. Surveys and Monographs 50, Amer. Math. Soc. 1997.
  • [2] Aaronson J., Keane M. The visits to zero of some deterministic random walks, Proc. London Math. Soc. 44 (1982) 535–553.
  • [3] Arnold V. I. Topological and ergodic properties of closed 11-forms with incommensurable periods, Funkts. Anal. Prilozhen. 25 (1991) 1–12.
  • [4] Athreya J. Logarithm laws and shrinking target properties, Proceedings–Math. Sciences 119 (2009) 541–557.
  • [5] Atkinson G. Recurrence of co-cycles and random walks, J. London Math. Soc. 13 (1976) 486–488.
  • [6] Avila A., Dolgopyat D., Duriev E., Sarig O. The visits to zero of a random walk driven by an irrational rotation, to appear in Israel Math. J.
  • [7] Badziahin D., Beresnevich V., Velani S. Inhomogeneous theory of dual Diophantine approximation on manifolds, Adv. Math. 232 (2013) 1–35.
  • [8] Beck J. Probabilistic Diophantine approximation. I. Kronecker sequences, Ann. of Math. 140 (1994) 449–502.
  • [9] Beck J., Diophantine approximation and quadratic fields in Number theory (Eger, 1996), (edited by K. Gyory, A. Petho, V. T. Sos) de Gruyter, Berlin, 1998, pp. 55–93.
  • [10] Beck J., From probabilistic Diophantine approximation to quadratic fields, Random and quasi-random point sets, 1 48, Springer Lecture Notes in Statist., 138 (1997) 1–48.
  • [11] Beck J., Randomness of the square root of 2 and the giant leap, Period. Math. Hungar. Part I: 60 (2010) 137–242; Part II: 62 (2011) 127–246.
  • [12] Bekka M. B., Mayer M. Ergodic theory and topological dynamics of group actions on homogeneous spaces,London Math. Soc. Lect. Note Ser. 269 Cambridge University Press, Cambridge, 2000. x+200 pp.
  • [13] V. Beresnevich, V. Bernik, D. Kleinbock, G. A. Margulis, Metric Diophantine approximation: The Khintchine–Groshev theorem for non-degenerate manifolds, Moscow Math. J. 2 (2002), 203–225.
  • [14] Beresnevich V., Velani S. Classical metric Diophantine approximation revisited: the Khintchine-Groshev theorem, IMRN (2010) no. 1, 69–86.
  • [15] Bleher P. On the distribution of the number of lattice points inside a family of convex ovals, Duke Math. J. 67 (1992) 461–481.
  • [16] Bleher P., Bourgain, J. Distribution of the error term for the number of lattice points inside a shifted ball, Progr. Math. 138 (1996) 141–153 Birkhauser Boston.
  • [17] Bleher P. M., Cheng Z., Dyson F. J., Lebowitz J. L. Distribution of the error term for the number of lattice points inside a shifted circle, Comm. Math. Phys. 154 (1993) 433–469.
  • [18] Bleher P. M., Lebowitz J. L. Variance of number of lattice points in random narrow elliptic strip, Ann. Inst. H. Poincare Probab. Statist. 31 (1995) 27–58.
  • [19] J. W. S. Cassels, An introduction to Diophantine approximation, Cambridge Tracts in Math., vol. 45, Cambridge Univ. Press, Cambridge, 1957.
  • [20] Chaika J, Constantine D. Quantitative Shrinking Target Properties for rotations, interval exchanges and billiards in rational polygons, arXiv preprint 1201.0941.
  • [21] Chernov N., Kleinbock D. Dynamical Borel-Cantelli lemmas for Gibbs measures, Israel J. Math. 122 (2001) 1–27.
  • [22] Chevallier N., Conze J.–P. Examples of recurrent or transient stationary walks in d\mathbb{R}^{d} over a rotation of 𝕋2,\mathbb{T}^{2}, Contemp. Math 485 (2009) 71–84.
  • [23] Conze J.-P., Recurrence, ergodicity and invariant measures. for cocycles over a rotation, Contemp. Math. 485 (2009) 45–70.
  • [24] Conze J.-P., Keane, M. Ergodicité d’un flot cylindrique, Seminaire de Prob.-I, Univ. Rennes, Rennes, 1976, Exp. No. 5, 7 pp.
  • [25] Conze J.-P., Piekniewska A., On multiple ergodicity of affine cocycles over irrational rotations, arXiv:1209.3798
  • [26] de Simoi J., Liverani C. The martingale approach after Varadhan and Dolgopyat, this volume.
  • [27] Dolgopyat D. Limit theorems for partially hyperbolic systems, Trans. AMS 356 (2004) 1637–1689.
  • [28] Dolgopyat D., Fayad B. Deviations of ergodic sums for toral translation-I: Convex bodies, GAFA 24 (2014) 85–115.
  • [29] Dolgopyat D., Fayad B. Deviations of ergodic sums for toral translations-II: Boxes, arXiv preprint 1211.4323.
  • [30] Dolgopyat D., Fayad B., Vinogradov I., Central limit theorems for simultaneous Diophantine approximations, in preparation.
  • [31] Duffin R. J., Schaeffer A.C. Khintchine’s problem in metric Diophantine approximation, Duke Math. J. 8 (1941) 243–255.
  • [32] Eskin A. Unipotent flows and applications. Homogeneous flows, moduli spaces and arithmetic, Clay Math. Proc. 10 (2010) 71–129, AMS, Providence, RI.
  • [33] Eskin A., McMullen C. Mixing, counting, and equidistribution in Lie groups, Duke Math. J. 71 (1993) 181–209.
  • [34] Fayad B. Polynomial decay of correlations for a class of smooth flows on the two torus, Bull. SMF 129 (2001) 487–503.
  • [35] Fayad B. Weak mixing for reparametrized linear flows on the torus, Erg. Th., Dynam. Sys. 22 (2002) 187–201.
  • [36] Fayad B. Analytic mixing reparametrizations of irrational flows, Erg. Th., Dynam. Sys. 22 (2002) 437–468.
  • [37] Fayad B. Skew products over translations on 𝕋d{\mathbb{T}}^{d}, d2d\geq 2, Proc. Amer. Math. Soc. 130 (2002) 103–109.
  • [38] Fayad B. Mixing in the absence of the shrinking target property, Bull. London Math. Soc. 38 (2006) 829–838.
  • [39] Fayad B., Lemańczyk M. On the ergodicity of cylindrical transformations given by the logarithm, Mosc. Math. J. 6 (2006) 657–672, 771–772.
  • [40] Feller W. An introduction to probability theory and its applications, Vol. II 2d ed. John Wiley & Sons, New York-London-Sydney, 1971, 669 pp.
  • [41] Flaminio L., Forni G. Equidistribution of nilflows and applications to theta sums, Erg. Th. Dyn. Sys. 26 (2006) 409–433.
  • [42] Fra̧czek K. Some examples of cocycles with simple continuous singular spectrum, Studia Math. 146 (2001) 1–13.
  • [43] Fra̧czek K., Lemańczyk M. On symmetric logarithm and some old examples in smooth ergodic theory, Fundamenta Math. 180 (2003), 241–255.
  • [44] Fuchs M. On a problem of W. J. LeVeque concerning metric Diophantine approximation, Trans. AMS 355 (2003) 1787–1801.
  • [45] Gabriel P., Lemańczyk M., Liardet P. Ensemble d’invariants pour les produits crois s de Anzai, Mem. SMF 47 (1991) 102 pp.
  • [46] Gouezel S. Limit theorems in dynamical systems using the spectral method, this volume.
  • [47] Groshev A. A theorem on a system of linear forms, Doklady Akademii Nauk SSSR 19 (1938) 151–152.
  • [48] Hardy G. H., Littlewood J. E. Some problems of diophantine approximation: a series of cosecants, Bull. Calcutta Math. Soc. 20 (1930) 251–266.
  • [49] Heath-Brown D. R. The distribution and moments of the error term in the Dirichlet divisor problem, Acta Arith. 60 (1992) 389–415.
  • [50] Herz C. S. Fourier transforms related to convex sets, Ann. Math. 75 (1962) 81–92.
  • [51] Hill B., Velani S. The ergodic theory of shrinking targets, Invent. Math. 119 (1995) 175–198.
  • [52] Hof A. Uniform distribution and the projection method, Fields Inst. Monogr. 10 (1998) 201–206, AMS, Providence, RI.
  • [53] Hooper P., Hubert P. Weiss B. Dynamics on the infinite staircase, Discr., Cont. Dyn. Syst. A 33 (2013) 4341–4347.
  • [54] Hughes C. P., Rudnick Z. On the distribution of lattice points in thin annuli, Int. Math. Res. Not. 13 (2004) 637–658.
  • [55] Huveneers F. Subdiffusive behavior generated by irrational rotations, Erg Th. Dyn. Sys. 29 (2009) 1217–1233.
  • [56] Huxley M. N. Area, lattice points, and exponential sums, LMS Monographs 13 (1996) 494 pp., Oxford Univ. Press, New York, 1996.
  • [57] Ivic A., Kratzel E., Kuhleitner M., Nowak W. G. Lattice points in large regions and related arithmetic functions: recent developments in a very classic topic, Schriften der Wissenschaftlichen Gesellschaft an der Johann Wolfgang Goethe-Universit t Frankfurt am Main 20 (2006) 89–128.
  • [58] Iwanik A. Anzai skew products with Lebesgue component of infinite multiplicity, Bull. London Math. Soc. 29 (1997) 195–199.
  • [59] Iwanik A., Lemańczyk M., Rudolph D. Absolutely continuous cocycles over irrational rotations, Israel J. Math. 83 (1993) 73–95.
  • [60] A.B. Katok, Combinatorial constructions in Ergodic Theory and Dynamics, University Lecture Series, 30, 2003.
  • [61] Kesten H. Uniform distribution mod 1{\rm mod}\,1, Ann. of Math. 71 (1960) 445–471.
  • [62] Kesten H. Uniform distribution mod1{\rm mod}1, II, Acta Arith. 7 (1961/1962) 355–380.
  • [63] Keynes H. B., Newton D. The structure of ergodic measures for compact group extensions, Israel J. Math. 18 (1974) 363–389.
  • [64] A. Khintchine Ein Satz ber Kettenbr che, mit arithmetischen Anwendungen, Mathematische Zeitschrift 18 (1923) 289–306.
  • [65] Kingman J. F. C. Poisson processes, Oxford Studies in Probability 3 (1993) viii+104 pp, Oxford University Press, New York.
  • [66] Kleinbock D. Y., Margulis G. A. Bounded orbits of nonquasiunipotent flows on homogeneous spaces, AMS Transl. 171 (1996) 141–172.
  • [67] Kleinbock D. Y., Margulis G. A. Flows on homogeneous spaces and Diophantine approximation on manifolds, Ann. of Math. 148 (1998) 339–360.
  • [68] Knill O. Selfsimilarity in the Birkhoff sum of the cotangent function, arXiv preprint 1206.5458.
  • [69] Knill O., Tangerman F. Self-similarity and growth in Birkhoff sums for the golden rotation, Nonlinearity 24 (2011) 3115–3127.
  • [70] Kocergin A. V. The absence of mixing in special flows over a rotation of the circle and in flows on a two-dimensional torus, Dokl. Akad. Nauk SSSR, 205 (1972) 515–518.
  • [71] Kocergin A. V. Mixing in special flows over a rearrangement of segments and in smooth flows on surfaces, Mat. Sbornik 96 (1975) 471–502, 504.
  • [72] Kocergin A. V. Nondegenerate saddles, and the absence of mixing, Mat. Zametki 19 (1976) 453–468.
  • [73] Kochergin A. V. Nondegenerate fixed points and mixing in flows on a two-dimensional torus I: Sb. Math. 194 (2003) 1195–1224; II: Sb. Math. 195 (2004) 317–346.
  • [74] Kocergin A. V. Causes of stretching of Birkhoff sums and mixing in flows on surfaces, MSRI Publ. 54 (2007) 129–144, Cambridge Univ. Press, Cambridge.
  • [75] J. F. Koksma, Diophantische Approximationen, Ergebnisse der Mathematik, Band IV, Heft 4, Berlin, 1936.
  • [76] A. N. Kolmogorov, On dynamical systems with an integral invariant on the torus, Doklady Akad. Nauk SSSR, 93 (1953) 763–766.
  • [77] Kratzel E. Lattice points Math., Appl. 33 (1988) 320 pp., Kluwer Dordrecht.
  • [78] Kratzel E., Nowak W. G. The lattice discrepancy of certain three-dimensional bodies, Monatsh. Math. 163 (2011) 149–174.
  • [79] L. Kuipers, H. Niederreiter Uniform Distribution of Sequences, Dover Books on Math., 2006, 416pp.
  • [80] Kurzweil, J. On the metric theory of inhomogeneous diophantine approximations, Studia Math. 15 (1955) 84–112.
  • [81] Le Borgne S. Principes d’invariance pour les flots diagonaux sur SL(d,R)/SL(d,Z), Ann. Inst. H. Poincar Prob. Stat. 38 (2002) 581–612.
  • [82] Ledrappier F., Sarig O. Fluctuations of ergodic sums for horocycle flows on d{\mathbb{Z}}^{d}-covers of finite volume surfaces, Disc. Cont. Dyn. Syst. 22 (2008) 247–325.
  • [83] Lemańczyk M., Spectral Theory of Dynamical Systems Encyclopedia of Complexity and Systems Science, Springer, (2009).
  • [84] Lemańczyk M. Sur l’absence de m lange pour des flots sp ciaux au-dessus d’une rotation irrationnelle, Colloq. Math. 84/85 (2000) 29–41.
  • [85] Lemańczyk M., Parreau F., Volný D. Ergodic properties of real cocycles and pseudo-homogenous Banach spaces, Trans. AMS 348 (1996) 4919–4938.
  • [86] Levin M. B., On the Gaussian limiting distribution of lattice points in a parallelepiped, arXiv preprint 1307.2076.
  • [87] Liu L., Limit laws of quasi-periodic ergodic sums for observables with singularities, in preparation.
  • [88] Marklof J. The nn-point correlations between values of a linear form, Erg. Th., Dynam. Sys. 20 (2000) 1127–1172.
  • [89] Marklof J. Pair correlation densities of inhomogeneous quadratic forms, Ann. Math. 158 (2003) 419–471.
  • [90] Marklof J. An application of Bakhtin Central Limit Theorem for Hyperbolic sequences to Beck’s probabilistic Diophantine approximation, preprint (2003).
  • [91] Marklof J. Energy level statistics, lattice point problems and almost modular functions, in Frontiers in Number Theory, Physics and Geometry, v. 1, pp. 163–181 (Les Houches, 9-21 March 2003), Springer.
  • [92] Marklof J. Distribution modulo one and Ratner’s theorem, in Equidistribution in number theory, an introduction, NATO Sci. Ser. II Math. Phys. Chem., 237 (2007) 217–244, Springer, Dordrecht.
  • [93] Marklof J. Kinetic limits of dynamical systems, this volume.
  • [94] Marklof J., Strömbergsson A. The distribution of free path lengths in the periodic Lorentz gas and related lattice point problems, Annals of Mathematics 172 (2010) 1949–2033.
  • [95] Marklof J., Strömbergsson A. Free path lengths in quasicrystals, Comm. Math. Phys. 330 (2014) 723–755.
  • [96] Muller W., Nowak W.G. On lattice points in planar domains, Math. J. Okayama Univ. 27 (1985) 173–184.
  • [97] Oren I. Ergodicity of cylinder flows arising from irregularities of distribution, Israel J. Math. 44 (1983) 127–138.
  • [98] Ornstein D. S., Rudolph D. J., Weiss B. Equivalence of measure preserving transformations, Mem. Amer. Math. Soc. 37 (1982) no. 262, xii+116 pp.
  • [99] Ostrowski A. Bemerkungen zur Theorie der Diophantischen Approximationen, Abhandlungen aus dem Math. Seminar Univ. Hamburg 1 (1922) 77–98.
  • [100] W. Parry, Introduction to ergodic theory, Topics in ergodic theory, Cambridge Tracts in Math. 75 Cambridge Univ. Press, Cambridge-New York, 1981. x+110 pp.
  • [101] Parry W., Pollicott M. Zeta functions and the periodic orbit structure of hyperbolic dynamics, Asterisque 187–188 (1990) 268 pp.
  • [102] Pask D. A. Skew products over the irrational rotation, Israel J. Math. 69 (1990) 65–74.
  • [103] Peres Yu., Ralston D. Heaviness in toral rotations, Israel J. Math. 189 (2012), 337–346.
  • [104] Peter M. Almost periodicity and the remainder in the ellipsoid problem, Michigan Math. J. 49 (2001) 331–351.
  • [105] Peter M. Lattice points in convex bodies with planar points on the boundary, Monatsh. Math. 135 (2002) 37–57.
  • [106] K. Peterson, On a series of cosecants related to a problem in ergodic theory, Compositio Mathematica, 26 (1973), 313–317.
  • [107] W. Philipp, Mixing Sequences of Random Variables and Probabilistic Number Theory, AMS Memoirs 114, 1971.
  • [108] W. Philipp, Some metrical theorems in number theory, Paci c J. Math. 20 (1967), p. 109–127.
  • [109] Randol B. On the Fourier transform of the indicator function of a planar set, Trans. Amer. Math. Soc. 139 (1969) 271–278.
  • [110] Randol B. On the asymptotic behavior of the Fourier transform of the indicator function of a convex set, Trans. AMS 139 (1969) 279–285.
  • [111] Ratner M. On Raghunathan’s measure conjecture, Ann. of Math. 134 (1991) 545–607.
  • [112] Ratner M. Raghunathan’s topological conjecture and distributions of unipotent flows, Duke Math. J. 63 (1991) 235–280.
  • [113] Roth K. F. On irregularities of distribution, Mathematika 1 (1954) 73–79.
  • [114] Samur J., A functional central limit theorem in Diophantine approximation, Proc. Amer. Math. Soc. 111 (1991) 901–911.
  • [115] Schmidt K. Cocycles of Ergodic Transformation Groups, Lect. Notes in Math. Vol. 1, Mac Millan Co. India, 1977.
  • [116] Schmidt W. M. A metrical theorem in diophantine approximation, Canadian J. Math. 12 (1960) 619–631.
  • [117] Schmidt W. Metrical theorems on fractional parts of sequences, Trans. Amer. Math. Soc. 110 (1964) 493–518.
  • [118] Shah N. A. Limit distributions of expanding translates of certain orbits on homogeneous spaces, Proc. Indian Acad. Sci. Math. Sci. 106 (1996) 105–125.
  • [119] Sinai, Ya. G., Khanin K. M. Mixing of some classes of special flows over rotations of the circle, Funct. Anal. Appl. 26 (1992) 155–169.
  • [120] Sinai Ya. G., Ulcigrai C. A limit theorem for Birkhoff sums of non-integrable functions over rotations, Contemp. Math. 469 (2008) 317–340.
  • [121] Skriganov M. M. Ergodic theory on SL(n), Diophantine approximations and anomalies in the lattice point problem, Invent. Math. 132 (1998) 1–72.
  • [122] Skriganov M. M., Starkov A. N. On logarithmically small errors in the lattice point problem, Erg. Th. Dynam. Sys. 20 (2000) 1469–1476.
  • [123] Stein E. M., Weiss G. Introduction to Fourier analysis on Euclidean spaces, Princeton Math. Ser. 32 (1971) 297pp, Princeton Univ. Press, Princeton, N.J.
  • [124] Tseng J. On circle rotations and the shrinking target properties, Discrete Contin. Dyn. Syst. 20 (2008) 1111–1122.
  • [125] Vinogradov I. Limiting distribution of visits of sereval rotations to shrinking intervals, arXiv preprint 1005.1622
  • [126] Von Neumann J., Zur Operatorenmethode in der klassischen Mechanik, Ann. of Math. (2) 33 (1932), 587–642.
  • [127] Wigman I. The distribution of lattice points in elliptic annuli, Q. J. Math. 57 (2006) 395–423, 58 (2007) 279–280.
  • [128] Wysokinska M. A class of real cocycles over an irrational rotation for which Rokhlin cocycle extensions have Lebesgue component in the spectrum, Topol. Methods Nonlinear Anal. 24 (2004) 387–407.
  • [129] Yoccoz J.-C. Petits diviseurs en dimension 1, Asterisque 231 (1995).