Moderate deviation principles for kernel estimator of invariant density in bifurcating Markov chains models.

S. Valère Bitseki Penda S. Valère Bitseki Penda, IMB, CNRS-UMR 5584, Université Bourgogne Franche-Comté, 9 avenue Alain Savary, 21078 Dijon Cedex, France. simeon-valere.bitseki-penda@u-bourgogne.fr

Abstract.

Bitseki and Delmas (2021) have studied recently the central limit theorem for kernel estimator of invariant density in bifurcating Markov chains models. We complete their work by proving a moderate deviation principle for this estimator. Unlike the work of Bitseki and Gorgui (2021), it is interesting to see that the distinction of the two regimes disappears and that we are able to get moderate deviation principle for large values of the ergodic rate. It is also interesting and surprising to see that for moderate deviation principle, the ergodic rate begins to have an impact on the choice of the bandwidth for values smaller than in the context of central limit theorem studied by Bitseki and Delmas (2021).

Keywords: Bifurcating Markov chains, bifurcating auto-regressive process, binary trees, density estimation.

Mathematics Subject Classification (2020): 62G05, 62F12, 60F10, 60J80,

1. Introduction

The study of bifurcating Markov chains (BMCs, for short) models has taken a special place in the literature these last years due to their links with the study of the cell dynamics (see for e.g. [6, 10, 13, 16, 17]). The first model of BMC, named “symmetric” bifurcating autoregressive process (BAR, for short) were introduced by Cowan and Staudte [9] in order to understand the cell division mechanisms of Escherichia Coli (E. Coli, for short). E. Coli is a rod shaped bacterium which reproduces by dividing in two, thus producing two new cells. One of type 1 which has the old end of the mother and the other of type 0 which has the new end of the mother. The age of a cell is thus given by the age of its old pole in the sense of the number of divisions from which this pole exists. This cell division mechanism raises several questions, among other that of the symmetry of the division. In order to give a rigorous answer to this question, Guyon [16] has developed and studied the theory of BMCs. We note that to the best of our knowledge, the term BMC appears for the first time in the work of [1]. In particular, Guyon has studied an extension of the model introduced by Cowan and Staudte, named “asymmetric” BAR. In the conclusion of his study, Guyon concludes that aging has an impact on cell reproduction. We note that an extension of the model proposed by Guyon, named nonlinear BAR (NBAR, for short) were studied by Bitseki and Olivier in [6]. Another question of interest related to cell division is estimating the division rate at which cells divide. This question has been tackled recently in the work of Doumic & al. [13] and Hoffman & Marguet [17]. In all the previous work, the behaviour and the definition of parameters of interest are associated with the density of the invariant probability of an auxiliary Markov chain (see below for a precise definition). The estimation of this invariant density has recently been the subject of several studies. One can cite [5, 8] where adaptive methods have been proposed for the estimation of this invariant density. More recently, Bitseki and Delmas [2] have studied central limit theorem for kernel estimators of this invariant density. Our main objective in this paper is to complete the previous study by establishing a moderate deviation principle for these kernel estimators. Before going any further, let us recall the definition of the main concepts that we will use and study.

2. The model of bifurcating Markov chain and definition of the estimators

2.1. The regular binary tree associated to BMC models

We denote by $\mathbb{N}$ (resp. $\mathbb{N}^{*}$ ) the space of (resp. positive) natural integers. We set $\mathbb{T}_{0}=\mathbb{G}_{0}=\{\emptyset\}$ , $\mathbb{G}_{k}=\{0,1\}^{k}$ and $\mathbb{T}_{k}=\bigcup_{0\leq r\leq k}\mathbb{G}_{r}$ for $k\in{\mathbb{N}}^{*}$ , and $\mathbb{T}=\bigcup_{r\in{\mathbb{N}}}\mathbb{G}_{r}$ . The set $\mathbb{G}_{k}$ corresponds to the $k$ -th generation, $\mathbb{T}_{k}$ to the tree up to the $k$ -th generation, and $\mathbb{T}$ the complete binary tree. One can see that the genealogy of the cells is entirely described by $\mathbb{T}$ (each vertex of the tree designates an individual). For $i\in\mathbb{T}$ , we denote by $|i|$ the generation of $i$ ( $|i|=k$ if and only if $i\in\mathbb{G}_{k}$ ) and $iA=\{ij;j\in A\}$ for $A\subset\mathbb{T}$ , where $ij$ is the concatenation of the two sequences $i,j\in\mathbb{T}$ , with the convention that $\emptyset i=i\emptyset=i$ . For $A\subset\mathbb{T}$ , we denote by $|A|$ the number of elements of $A$ . Note that for all $n\in\mathbb{N},$ $|\mathbb{G}_{n}|=2^{n}$ and $|\mathbb{T}_{n}|=2^{n+1}-1.$

2.2. The probability kernels associated to BMC models

For our convenience, we set $S=\mathbb{R}^{d}$ , $d\geq 1$ and $S$ is equipped with the Borel sigma-algebra ${\mathscr{S}}$ . For any $q\in\mathbb{N}^{*}$ , we denote by ${\mathcal{B}}(S^{q})$ (resp. ${\mathcal{B}}_{b}(S^{q})$ , resp. ${\mathcal{C}}_{b}(S^{q})$ ) the space of (resp. bounded, resp. bounded continuous ) $\mathbb{R}\text{-}$ valued measurable functions defined on $S^{q}$ . For all $q\in\mathbb{N}^{*}$ , we set ${\mathscr{S}}^{\otimes q}={\mathscr{S}}\otimes\ldots\otimes{\mathscr{S}}$ . Let ${\mathcal{P}}$ be a probability kernel on $(S,{\mathscr{S}}^{\otimes 2})$ , that is: ${\mathcal{P}}(\cdot,A)$ is measurable for all $A\in{\mathscr{S}}^{\otimes 2}$ , and ${\mathcal{P}}(x,\cdot)$ is a probability measure on $(S^{2},{\mathscr{S}}^{\otimes 2})$ for all $x\in S$ . For any $g\in{\mathcal{B}}_{b}(S^{3})$ and $h\in{\mathcal{B}}_{b}(S^{2})$ , we set for $x\in S$ :

(1)

({\mathcal{P}}g)(x)=\int_{S^{2}}g(x,y,z)\;{\mathcal{P}}(x,{\rm d}y,{\rm d}z)\quad\text{and}\quad({\mathcal{P}}h)(x)=\int_{S^{2}}h(y,z)\;{\mathcal{P}}(x,{\rm d}y,{\rm d}z).

We define $({\mathcal{P}}g)$ (resp. $({\mathcal{P}}h)$ ), or simply $Pg$ for $g\in{\mathcal{B}}(S^{3})$ (resp. ${\mathcal{P}}h$ for $h\in{\mathcal{B}}(S^{2})$ ), as soon as the corresponding integral (1) is well defined, and we have that ${\mathcal{P}}g$ and ${\mathcal{P}}h$ belong to ${\mathcal{B}}(S)$ . We denote by ${\mathcal{P}}_{0}$ , ${\mathcal{P}}_{1}$ and ${\mathcal{Q}}$ respectively the first and the second marginal of ${\mathcal{P}}$ , and the mean of ${\mathcal{P}}_{0}$ and ${\mathcal{P}}_{1}$ , that is, for all $x\in S$ and $B\in\mathcal{S}$

{\mathcal{P}}_{0}(x,B)={\mathcal{P}}(x,B\times S),\quad{\mathcal{P}}_{1}(x,B)={\mathcal{P}}(x,S\times B)\quad\text{ and}\quad{\mathcal{Q}}=\frac{({\mathcal{P}}_{0}+{\mathcal{P}}_{1})}{2}.

Now let us give a precise definition of bifurcating Markov chain.

Definition 2.1 (Bifurcating Markov Chains, see [16, 2]).

We say a stochastic process indexed by $\mathbb{T}$ , $X=(X_{i},i\in\mathbb{T})$ , is a bifurcating Markov chain (BMC) on a measurable space $(S,{\mathscr{S}})$ with initial probability distribution $\nu$ on $(S,{\mathscr{S}})$ and probability kernel ${\mathcal{P}}$ on $S\times{\mathscr{S}}^{\otimes 2}$ if:

-

(Initial distribution.) The random variable $X_{\emptyset}$ is distributed as $\nu$ .

(Branching Markov property.) For any sequence $(g_{i},i\in\mathbb{T})$ of functions belonging to ${\mathcal{B}}_{b}(S^{3})$ and for all $k\geq 0$ , we have

{\mathbb{E}}\Big{[}\prod_{i\in\mathbb{G}_{k}}g_{i}(X_{i},X_{i0},X_{i1})|\sigma(X_{j};j\in\mathbb{T}_{k})\Big{]}=\prod_{i\in\mathbb{G}_{k}}{\mathcal{P}}g_{i}(X_{i}).

Following [16], we introduce an auxiliary Markov chain $Y=(Y_{n},n\in{\mathbb{N}})$ on $(S,{\mathscr{S}})$ with $Y_{0}=X_{1}$ and transition probability ${\mathcal{Q}}$ . The chain $(Y_{n},n\in\mathbb{N})$ corresponds to a random lineage taken in the population. We shall write ${\mathbb{E}}_{x}$ when $X_{\emptyset}=x$ (i.e. the initial distribution $\nu$ is the Dirac mass at $x\in S$ ). We will assume that the Markov chain $Y$ is ergodic and we denote by $\mu$ its invariant probability measure. Asymptotic and non-asymptotic behaviour of BMCs are strongly related to the knowledge of $\mu$ . In particular, Guyon has proved that if $Y$ is ergodic, then for all $f\in{\mathcal{C}}_{b}(S)$ ,

|{\mathbb{A}}_{n}|^{-1}\sum_{u\in{\mathbb{A}}_{n}}f(X_{u})\underset{n\rightarrow+\infty}{\xrightarrow{\hskip 21.33955pt}}\langle\mu,f\rangle\quad\text{in probability,}\quad\text{where ${\mathbb{A}}_{n}\in\{\mathbb{G}_{n},\mathbb{T}_{n}\}.$}

But in most cases, the invariant probability $\mu$ is unknown, so its estimation from the data is of great interest. For that purpose, we do the following assumption.

Assumption 2.2.

The transition kernel ${\mathcal{P}}$ has a density, still denoted by ${\mathcal{P}}$ , with respect to the Lebesgue measure.

Remark 2.3.

Assumption 2.2 implies that the transition kernel ${\mathcal{Q}}$ has a density, still denoted by ${\mathcal{Q}}$ , with respect to the Lebesgue measure. More precisely, we have ${\mathcal{Q}}(x,y)=2^{-1}\int_{S}({\mathcal{P}}(x,y,z)+{\mathcal{P}}(x,z,y))dz.$ This implies in particular that the invariant probability $\mu$ has a density, still denoted by $\mu$ , with respect to the Lebesgue measure (for more details, we refer for e.g. to [14], chap 6).

2.3. Kernel estimator of the invariant density $\mu$

Recall that ${\mathbb{A}}_{n}\in\{\mathbb{G}_{n},\mathbb{T}_{n}\}$ and $S=\mathbb{R}^{d}$ , $d\geq 1$ . Assume we observe $\mathbb{X}^{n}=(X_{u},u\in{\mathbb{A}}_{n})$ . Let $(h_{n},n\in\mathbb{N})$ be a sequence of positive numbers which converges to $0$ as $n$ goes to infinity. We will simply write $h$ for $h_{n}$ if there is no ambiguity. Let the kernel function $K:S\rightarrow\mathbb{R}$ such that $\int_{S}K(x)dx=1.$ Then, for all $x\in S,$ we propose to estimate $\mu(x)$ by

(2)

\widehat{\mu}_{{\mathbb{A}}_{n}}(x)=|{\mathbb{A}}_{n}|^{-1}h_{n}^{-d/2}\sum_{u\in{\mathbb{A}}_{n}}K_{h_{n}}(x-X_{u}),

where $K_{h_{n}}(\cdot)=h_{n}^{-d/2}K(h_{n}\cdot).$ These estimators are strongly inspired from [18, 21, 22]. They have been studied in [13, 8] (non asymptotic studies) and in [2] (central limit theorem).

2.4. Moderate deviation principle and related topics

Our aim is to study moderate deviation principles for the estimators defined in (2). Before we proceed, let us introduce the notion of moderate deviation principle. We give the definition in a general setting. Let $(Z_{n})_{n\geq 0}$ be a sequence of random variables with values in $S$ endowed with its Borel $\sigma$ -field ${\mathscr{S}}$ and let $(s_{n})_{n\geq 0}$ be a positive sequence that converges to $+\infty$ . We assume that $Z_{n}/s_{n}$ converges in probability to 0 and that $Z_{n}/\sqrt{s_{n}}$ converges in distribution to a centered Gaussian law. Let $I:S\rightarrow\mathbb{R}^{+}$ be a lower semicontinuous function, that is for all $c>0$ the sub-level set $\{x\in S,I(x)\leq c\}$ is a closed set. Such a function $I$ is called rate function and it is called good rate function if all its sub-level sets are compact sets. Let $(b_{n})_{n\geq 0}$ be a positive sequence such that $b_{n}\rightarrow+\infty$ and $b_{n}/\sqrt{s_{n}}\rightarrow 0$ as $n$ goes to $+\infty$ .

Definition 2.4 (Moderate deviation principle, MDP).

We say that $Z_{n}/(b_{n}\sqrt{s_{n}})$ satisfies a moderate deviation principle on $S$ with speed $b_{n}^{2}$ and rate function $I$ if, for any $A\in{\mathscr{S}}$ ,

\displaystyle-\inf_{x\in\mathring{A}}I(x)\leq\liminf_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\big{(}\frac{Z_{n}}{b_{n}\sqrt{s_{n}}}\in A\big{)}\leq\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\big{(}\frac{Z_{n}}{b_{n}\sqrt{s_{n}}}\in A\big{)}\leq-\inf_{x\in\bar{A}}I(x),

where $\mathring{A}$ and $\bar{A}$ denote respectively the interior and the closure of $A$ .

The following two concepts are closely related to the theory of MDP: super-exponential convergence and exponential equivalence. Let $(Z_{n},n\in\mathbb{N})$ , $(W_{n},n\in\mathbb{N})$ be sequences of random variables and $Z$ a random variable with value in a metric space $(S,d)$ .

Definition 2.5 (Super-exponential convergence).

We say that $(Z_{n})_{n\geq 0}$ converges $(b_{n}^{2})\text{-}$ super-exponentially fast in probability to $Z$ and we note $Z_{n}\xRightarrow[b_{n}^{2}]{\rm superexp}Z$ if, for all $\delta>0$ ,

\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\big{(}d(Z_{n},Z)>\delta\big{)}=-\infty.

Definition 2.6 (Exponential equivalence, see [11], Chap 4).

We say that $(Z_{n})_{n\geq 0}$ and $(W_{n})_{n\geq 0}$ are $(b_{n}^{2})_{n\geq 0}$ -exponentially equivalent and we note $Z_{n}\mathrel{\underset{b_{n}^{2}}{\overset{{\rm superexp}}{\scalebox{2.0}[1.0]{$\sim$}}}}W_{n}$ if for any $\delta>0$ ,

\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\big{(}d(Z_{n},W_{n})>\delta\big{)}=-\infty.

Remark 2.7.

Note that for a determininistic sequence that converges to some limit $\ell$ , it also converges $(b_{n}^{2})\text{-}$ superexponentially fast to $\ell$ for any rate $b_{n}$ . We also note that if $(Z_{n})_{n\geq 0}$ and $(W_{n})_{n\geq 0}$ are $(b_{n}^{2})_{n\geq 0}$ -exponentially equivalent and if $(Z_{n})_{n\geq 0}$ satisfies a MDP, then $(W_{n})_{n\geq 0}$ satisfies the same MDP (for more details, see for e.g [11], Chap 4).

The following result give a sufficient condition for super-exponential convergence of a sequence of random variables.

Remark 2.8.

We assume that $(S,d)$ is a metric space. Let $\left(Z_{n}\right)_{n\in\mathbb{N}}$ be a sequence of random variables with values in $S$ , $Z$ a random variable with values in $S$ . So if $d(Z_{n},Z)$ is upper-bounded by a deterministic sequence which converges to $0$ , then, for all sequence $(b_{n},n\in\mathbb{N})$ converging to $+\infty$ , $Z_{n}\xRightarrow[b_{n}^{2}]{\rm superexp}Z$ .

The moderate deviation principle has been proved in the i.i.d. setting for kernel density estimator, see for e.g. Gao [15], Mokkadem & al. [20]. We refer also to [19] where Mokkaddem and Pelletier have constructed confidence bands for probability densities based on moderate deviation principles. In this paper, we will establish moderate deviation principle for $\widehat{\mu}_{{\mathbb{A}}_{n}}(x)$ following the martingale approach developed in [2]. We will need the following assumption.

Assumption 2.9.

There exists a positive real number $M$ and $\alpha\in(0,1)$ such that for all $f\in{\mathcal{B}}_{b}(S)$ :

(3)

|{\mathcal{Q}}^{n}f-\langle\mu,f\rangle|\leq M\,\alpha^{n}\|f\|_{\infty}\quad\text{for all $n\in{\mathbb{N}}$.}

Remark 2.10.

Assumption 2.9 is for example satisfy for nonlinear bifurcating autoregressive process under mild hypotheses on the autoregression functions (see [7] Lemma 9 for more details).

The others assumptions we will need are based on the following bias-variance type decomposition of the estimator $\widehat{\mu}_{{\mathbb{A}}_{n}}(x)$ :

(4)

\widehat{\mu}_{{\mathbb{A}}_{n}}(x)-\mu(x)=B_{h_{n}}(x)+V_{{\mathbb{A}}_{n},h_{n}}(x),

where for $h>0$ and ${\mathbb{A}}\subset\mathbb{T}$ finite:

B_{h}(x)=h^{-d/2}K_{h}\star\mu(x)-\mu(x)\quad\text{and}\quad V_{{\mathbb{A}},h}(x)=|{\mathbb{A}}|^{-1}h^{-d/2}\sum_{u\in{\mathbb{A}}}\Big{(}K_{h}(x-X_{u})-K_{h}\star\mu(x)\Big{)},

and for $h>0$ and $u\in\mathbb{T}$ , we set:

K_{h}\star\mu(x)=\mathbb{E}_{\mu}[K_{h}(x-X_{u})]=\int_{S}K_{h}(x-y)\mu(y)\,dy.

To study the variance term $V_{{\mathbb{A}}_{n},h_{n}}(x)$ , we will introduce a more general sequence of functions (see Section 3.2).

The following assumptions on the kernel, the bandwidth and the regularity of the unknown density function are usual. Recall $S={\mathbb{R}}^{d}$ with $d\geq 1$ .

Assumption 2.11 (Regularity of the kernel function and the bandwidth).

(i)

The kernel function $K\in{\mathcal{B}}(S)$ satisfies:

\mathop{\parallel\!K\!\parallel}\nolimits_{\infty}<+\infty,\,\,\mathop{\parallel\!K\!\parallel}\nolimits_{1}<+\infty,\,\,\mathop{\parallel\!K\!\parallel}\nolimits_{2}<+\infty,\,\,\int_{S}\!K(x)\,dx=1\quad\text{and}\quad\lim_{|x|\rightarrow+\infty}|x|K(x)=0.

(ii)

There exists $\gamma\in(0,1/d)$ such that the bandwidth $(h_{n},n\in\mathbb{N})$ are defined by $h_{n}=2^{-n\gamma}$ .

Assumption 2.12 (Further regularity on the density $\mu$ , the kernel function and the bandwidths).

Suppose that there exists an invariant probability measure $\mu$ of ${\mathcal{Q}}$ and that Assumptions 2.2 and 2.11 hold. We assume there exists $s>0$ such that the following hold:

(i)

The density $\mu$ belongs to the (isotropic) Hölder class of order $(s,\ldots,s)\in\mathbb{R}^{d}$ : The density $\mu$ admits partial derivatives with respect to $x_{j}$ , for all $j\in\{1,\ldots d\}$ , up to the order $\lfloor s\rfloor$ and there exists a finite constant $L>0$ such that for all $x=(x_{1},\ldots,x_{d}),\in\mathbb{R}^{d}$ , $t\in{\mathbb{R}}$ and $j\in\{1,\ldots,d\}$ :

\left|\frac{\partial^{\lfloor s\rfloor}\mu}{\partial x_{j}^{\lfloor s\rfloor}}(x_{-j},t)-\frac{\partial^{\lfloor s\rfloor}\mu}{\partial x_{j}^{\lfloor s\rfloor}}(x)\right|\leq L|x_{j}-t|^{\{s\}},

where $(x_{-j},t)$ denotes the vector $x$ where we have replaced the $j^{th}$ coordinate $x_{j}$ by $t$ , with the convention ${\partial^{0}\mu}/{\partial x_{j}^{0}}=\mu$ .

(ii)

The kernel $K$ is of order $(\lfloor s\rfloor,\ldots,\lfloor s\rfloor)\in\mathbb{N}^{d}$ : We have $\int_{\mathbb{R}^{d}}|x|^{s}K(x)\,dx<\infty$ and $\int_{\mathbb{R}}x^{k}_{j}\,K(x)\,dx_{j}=0$ for all $k\in\{1,\ldots,\lfloor s\rfloor\}$ and $j\in\{1,\ldots,d\}$ .

For $\alpha>1/2$ , we shall also assume the following.

Assumption 2.13.

Keeping the same notations as in (ii) of Assumption 2.11, we further assume that Assumption 2.9 holds with

(5)

\lim_{n\rightarrow+\infty}(2^{1-d\gamma}\alpha)^{n}=0.

Remark 2.14.

As consequence of Assumption 2.13 and (ii) of Assumption 2.11, for moderate deviation principle, the ergodicity rate $\alpha$ begins to have an impact on the choice of the bandwidth for $\alpha>1/2.$ This is out of step with the central limit theorem where the ergodicity rate $\alpha$ begins to have an impact on the choice of the bandwidth for $\alpha>1/\sqrt{2}$ (see [2] for more details).

In the sequel, we will consider the positive sequence $(b_{n},n\in\mathbb{N})$ such that:

(6)

\lim_{n\rightarrow+\infty}b_{n}=+\infty;\quad\lim_{n\rightarrow+\infty}\frac{n^{3/2}\,b_{n}}{\sqrt{|\mathbb{G}_{n}|h_{n}^{d}}}=0;\quad\lim_{n\rightarrow+\infty}\frac{b_{n}}{\sqrt{|\mathbb{G}_{n}|h_{n}^{2s+d}}}=+\infty,

where $s$ is the regularity parameter given in Assumption 2.12.

The paper is organised as follows. In Section 3.1 we state the main result for the moderate deviation principles of the estimators $\widehat{\mu}_{{\mathbb{A}}_{n}}(x)$ for $x$ in the set continuity of $\mu$ and ${\mathbb{A}}_{n}\in\{\mathbb{T}_{n},\mathbb{G}_{n}\}$ . In Section 3.2, directly linked to the study of variance term $V_{{\mathbb{A}},h}(x)$ defined in (4), we study the moderate deviation principle for general additive functionals of BMCs. Sections 4 and 5 are devoted to the proofs of results. In Section 6, we recall some useful results.

3. Main result

3.1. Moderate deviation principle for $\widehat{\mu}_{{\mathbb{A}}_{n}}$

First, we state a strong consistency result for the estimators $\widehat{\mu}_{{\mathbb{A}}_{n}}(x)$ for $x$ in the set of continuity of $\mu$ . Its proof is given in Section 4.1.

Lemma 3.1.

Let $X$ be a BMC with kernel ${\mathcal{P}}$ and initial distribution $\nu$ such that Assumptions 2.9, 2.11 and 2.12 hold. Furthermore, if $\alpha>1/2$ then assume that Assumption 2.13 holds. Let $(b_{n},n\in\mathbb{N})$ be a positive sequence with satisfies (6). Then, for all $x$ in the set of continuity of $\mu$ and ${\mathbb{A}}_{n}\in\{\mathbb{T}_{n},\mathbb{G}_{n}\}$ we have $\widehat{\mu}_{{\mathbb{A}}_{n}}(x)\xRightarrow[b_{n}^{2}]{\rm superexp}\mu(x).$

The main result of this Section is the following theorem which state the moderate deviation principle for $\widehat{\mu}_{{\mathbb{A}}_{n}}(x)-\mu(x)$ for $x$ in the set of continuity of the function $\mu$ .

Theorem 3.2.

Under the hypothesis of Lemma 3.1, for all $x$ in the set of continuity of $\mu$ and ${\mathbb{A}}_{n}\in\{\mathbb{T}_{n},\mathbb{G}_{n}\}$ , $b_{n}^{-1}\sqrt{|{\mathbb{A}}_{n}|h_{n}^{d}}(\widehat{\mu}_{{\mathbb{A}}_{n}}(x)-\mu(x))$ satisfies a moderate deviation principle on $\mathbb{R}$ with speed $b_{n}^{2}$ and rate function $I$ defined by: $I(y)=y^{2}/(2\|K\|_{2}^{2}\mu(x))$ for all $y\in\mathbb{R}$ , that is, for any $A\subset\mathbb{R}$ ,

	$\displaystyle-\inf_{y\in\mathring{A}}I(y)\leq\liminf_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}$	$\displaystyle\log\mathbb{P}\big{(}b_{n}^{-1}\sqrt{\|{\mathbb{A}}_{n}\|h_{n}^{d}}(\widehat{\mu}_{{\mathbb{A}}_{n}}(x)-\mu(x))\in A\big{)}$
		$\displaystyle\leq\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\big{(}b_{n}^{-1}\sqrt{\|{\mathbb{A}}_{n}\|h_{n}^{d}}(\widehat{\mu}_{{\mathbb{A}}_{n}}(x)-\mu(x))\in A\big{)}\leq-\inf_{y\in\bar{A}}I(y),$

where $\mathring{A}$ and $\bar{A}$ denote respectively the interior and the closure of $A$ .

In order to obtain confidence intervals for $\mu(x)$ , it would be interesting to replace $\mu(x)$ in the expression of the rate function $I(\cdot)$ by an estimator. In that direction, we have the following. Let ${\mathbb{A}}_{n}^{*}\in\{\mathbb{G}_{n},\mathbb{T}_{n}\}.$ Obviously, ${\mathbb{A}}_{n}^{*}$ and ${\mathbb{A}}_{n}$ can be the same. We consider the estimator $\widehat{\mu}_{{\mathbb{A}}_{n}^{*}}(x)$ of $\mu(x)$ defined with ${\mathbb{A}}_{n}^{*}$ instead of ${\mathbb{A}}_{n}$ . Let $(\varpi_{n},n\in\mathbb{N})$ be a sequence of real numbers such that $\varpi_{n}\rightarrow 0$ as $n\rightarrow+\infty.$ Then, we have the following result which the proof is given in Section 4.3.

Theorem 3.3.

Under the hypothesis of Lemma 3.1, for all $x$ in the set of continuity of $\mu$ and ${\mathbb{A}}_{n},{\mathbb{A}}_{n}^{*}\in\{\mathbb{T}_{n},\mathbb{G}_{n}\}$ , $b_{n}^{-1}(\|K\|_{2}\sqrt{\widehat{\mu}_{{\mathbb{A}}_{n}^{*}}(x)}\vee\varpi_{n})^{-1}\sqrt{|{\mathbb{A}}_{n}|h_{n}^{d}}(\widehat{\mu}_{{\mathbb{A}}_{n}}(x)-\mu(x))$ satisfies a moderate deviation principle on $\mathbb{R}$ with speed $b_{n}^{2}$ and rate function $I^{\prime}$ defined by: $I^{\prime}(y)=y^{2}/2$ for all $y\in\mathbb{R}$ .

In particular, using the contraction principle (see for e.g Dembo and Zeitouni [11], Chap 4), we have the following corollary of Theorem 3.3.

Corollary 3.4.

Under the hypothesis of Theorem 3.3, we have the following convergence for $x$ in the set of continuity of $\mu$ and ${\mathbb{A}}_{n},{\mathbb{A}}^{*}\in\{\mathbb{T}_{n},\mathbb{G}_{n}\}:$

\lim_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\Big{(}b_{n}^{-1}\,\Big{(}\|K\|_{2}\sqrt{\widehat{\mu}_{{\mathbb{A}}_{n}^{*}}(x)}\vee\varpi_{n}\Big{)}^{-1}\,\sqrt{|{\mathbb{A}}_{n}|h_{n}^{d}}\Big{|}\big{(}\widehat{\mu}_{{\mathbb{A}}_{n}}(x)-\mu(x)\big{)}\Big{|}>\delta\Big{)}=-\frac{\delta^{2}}{2}\quad\forall\delta>0.

Remark 3.5.

Corollary 3.4 yields a simple confidence interval for $\mu(x)$ , of decreasing size $b_{n}/\sqrt{|{\mathbb{A}}_{n}|\,h_{n}^{d}}$ and with level asymptotically close to $1-\exp(-(b_{n}^{2}\,\delta^{2})/2).$

Using the structure of the asymptotic variance $\sigma^{2}$ in (7), we can prove the following multidimensional result which the proof is given in Section 4.4

Corollary 3.6.

Under the hypothesis of Theorem 3.2, we have, for $x$ in the set of continuity of $\mu$ and for all $k\geq 0$ , $b_{n}^{-1}\Big{(}|\mathbb{G}_{n}|^{1/2}h_{n}^{1/2}\big{(}\widehat{\mu}_{\mathbb{G}_{n}}(x)-\mu(x)\big{)},\ldots,|\mathbb{G}_{n-k}|^{1/2}h_{n-k}^{1/2}\big{(}\widehat{\mu}_{\mathbb{G}_{n-k}}(x)-\mu(x)\big{)}\Big{)}^{t}$ satisfies a moderate deviation principle on $\mathbb{R}^{k+1}$ with speed $b_{n}^{2}$ and good rate function $J_{x}:\mathbb{R}^{k+1}\rightarrow\mathbb{R}$ defined by

J_{x}(\boldsymbol{z})=\big{(}2\,\|K\|_{2}^{2}\,\mu(x)\big{)}^{-1}\boldsymbol{z}^{t}\Gamma^{-1}\boldsymbol{z}\,,\quad\boldsymbol{z}\in\mathbb{R}^{k+1},

with $\Gamma=diag(2^{0},\ldots,2^{k})$ , where $diag(\cdot)$ denotes the diagonal matrix and $\boldsymbol{z}^{t}$ stands for the transpose of vector $\boldsymbol{z}$ .

Remark 3.7.

We deduce from Corollary 3.6 that the estimators $|\mathbb{G}_{n-\ell}|^{1/2}h_{n-\ell}^{d/2}(\widehat{\mu}_{\mathbb{G}_{n-\ell}}(x)-\mu(x))$ are asymptotically independent in the sense of moderate deviation for $\ell\in\{0,\ldots,k\}$ and for any $k\in\mathbb{N}.$

3.2. Moderate deviation principle for additive functionals of BMCs

In order to study the variance term $V_{{\mathbb{A}}_{n},h_{n}}(x)$ , we give here a moderate deviation principle for a general additive functionals of BMCs. For that purpose, we introduce the following assumption.

Assumption 3.8.

For $n\in\mathbb{N}$ , let ${\mathfrak{f}}_{n}=(f_{\ell,n},n\geq\ell\geq 0)$ be a sequence of functions defined on $S$ such that $f_{\ell,n}=0$ if $\ell>n$ and there exists $\gamma\in(0,1/d)$ such that:

(i)

$\sup_{0\leq\ell\leq n}\{2^{-d\gamma n/2}\|f_{\ell,n}\|_{\infty};\,2^{d\gamma n/2}\|{\mathcal{Q}}f_{\ell,n}\|_{\infty};\,\|{\mathcal{Q}}(f_{\ell,n}^{2})\|_{\infty};\,2^{d\gamma n}\|{\mathcal{P}}(f_{\ell,n}\otimes^{2})\|_{\infty}\}<+\infty.$
(ii)

$\sup_{0\leq\ell\leq n}\{2^{d\gamma n/2}\langle\mu,|f_{\ell,n}|\rangle;\,\langle\mu,f_{\ell,n}^{2}\rangle\}<+\infty.$

(iii)

The following limit exists and is finite:

(7)

\sigma^{2}=\lim_{n\rightarrow+\infty}\sum_{\ell=0}^{n}2^{-\ell}\mathop{\parallel\!f_{\ell,n}\!\parallel}\nolimits_{L^{2}(\mu)}^{2}<+\infty.

We will use the following notations. For a finite set ${\mathbb{A}}\subset\mathbb{T}$ and a function $f\in{\mathcal{B}}(S)$ , we set:

M_{\mathbb{A}}(f)=\sum_{i\in{\mathbb{A}}}f(X_{i}).

In this paper, we are interested in the cases ${\mathbb{A}}=\mathbb{G}_{n}$ and ${\mathbb{A}}=\mathbb{T}_{n}$ , that is the $n\text{-}$ th generation and the first $n$ generation of the tree. Recall $\mu$ the invariant probability of ${\mathcal{Q}}$ , transition probability of the auxiliary Markov chain $(Y_{n},n\in\mathbb{N})$ . For $f\in L^{1}(\mu)$ , we set:

\tilde{f}=f-\langle\mu,f\rangle.

Recall the sequence ${\mathfrak{f}}_{n}$ defined in Assumption 3.8. For $n\in\mathbb{N}$ , we set:

(8)

N_{n,\emptyset}({\mathfrak{f}}_{n})=|\mathbb{G}_{n}|^{-1/2}\sum_{\ell=0}^{n}M_{\mathbb{G}_{n-\ell}}(\tilde{f}_{\ell,n}).

The notation $N_{n,\emptyset}$ means that we consider the average from the root $\emptyset$ to the $n\text{-}$ th generation.

Remark 3.9.

The definition of $N_{n,\emptyset}({\mathfrak{f}}_{n})$ in (8) is mainly motivated by the decomposition (4). It will allow us to threat the variance term of the estimator $\widehat{\mu}_{{\mathbb{A}}_{n}}(x)$ defined in (2). Instead, for $n\in\mathbb{N}$ , we set $f_{n}^{x}(\cdot)=K_{h_{n}}(x-\cdot)$ . Then, we consider the sequences of functions $(f_{\ell,n}^{\text{id}},\,n\geq\ell\geq 0)$ and $(f^{0}_{\ell,n},\,n\geq\ell\geq 0)$ defined by:

(9)

f_{\ell,n}^{\text{id}}=f^{x}_{n}\quad\text{and}\quad f^{0}_{\ell,n}=f^{x}_{n}{\bf 1}_{\{\ell=0\}}.

It is not difficult to check that under Assumption 2.11, the sequence $(f_{\ell,n}^{\text{id}},\,n\geq\ell\geq 0)$ and $(f^{0}_{\ell,n},\,n\geq\ell\geq 0)$ defined in (9) satisfy Assumption 3.8. In particular, let $x$ be in the set of continuity of $\mu$ . Thanks to Lemma 6.3, we have:

(10)

\lim_{n\rightarrow+\infty}\mathop{\parallel\!f^{x}_{n}\!\parallel}\nolimits_{L^{2}(\mu)}^{2}=\lim_{n\rightarrow+\infty}\langle\mu,(f^{x}_{n})^{2}\rangle=\mu(x)\mathop{\parallel\!K\!\parallel}\nolimits_{2}^{2}.

If ${\mathbb{A}}_{n}=\mathbb{G}_{n}$ , it suffices to consider the sequence ${\mathfrak{f}}_{n}=(f_{\ell,n},0\leq\ell\leq n)$ with $f_{\ell,n}=f_{\ell,n}^{0}$ and in that case, using (10), the asymptotic variance defined in (7) is given by $\sigma^{2}=\|K\|_{2}^{2}\,\mu(x)$ . If ${\mathbb{A}}_{n}=\mathbb{T}_{n}$ , it suffices to consider the sequence ${\mathfrak{f}}_{n}=(f_{\ell,n},0\leq\ell\leq n)$ with $f_{\ell,n}=f_{\ell,n}^{id}$ and in that case, using (10), the asymptotic variance defined in (7) is given by $\sigma^{2}=2\|K\|_{2}^{2}\,\mu(x)$ .

For our convenience, we assume that the quantity $\gamma$ which appears in Assumptions 2.11 and 3.8 is the same. The main result of this section is the following.

Theorem 3.10.

Let $X$ be a BMC with kernel ${\mathcal{P}}$ and initial distribution $\nu$ such that Assumptions 2.9, 2.11 and 3.8 hold. Furthermore, if $\alpha>1/2$ then assume that Assumption 2.13 holds. Let $(b_{n},n\in\mathbb{N})$ be a positive sequence with satisfies (6). Then $b_{n}^{-1}N_{n,\emptyset}({\mathfrak{f}}_{n})$ satisfies a moderate deviation principle on $\mathbb{R}$ with speed $b_{n}^{2}$ and rate function $I$ defined by: $I(x)=x^{2}/(2\sigma^{2})$ for all $x\in\mathbb{R}$ , with the finite variance $\sigma^{2}$ defined in (7).

Remark 3.11.

In particular, using the contraction principle (see for e.g Dembo and Zeitouni [11], Chap 4), Theorem 3.10 implies that

\lim_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\left(\left|b_{n}^{-1}N_{n,\emptyset}({\mathfrak{f}}_{n})\right|>\delta\right)=-I(\delta)\quad\forall\delta>0.

Remark 3.12.

Unlike the results of Bitseki and Gorgui [4], one can note that the different regimes disappear in Theorem 3.10. Moreover, we are able here to give the fluctuations if $2\alpha^{2}>1$ which is not the case in [4].

4. Proof of Lemma 3.1, Theorems 3.2 and 3.3 and Corollary 3.6

We will denote by $C$ any unimportant finite constant which may vary from line to line (in particular $C$ does not depend on $n\in{\mathbb{N}}$ ).

4.1. Proof of Lemma 3.1

We begin the proof with ${\mathbb{A}}_{n}=\mathbb{T}_{n}.$ Recall the decomposition (4) with $\mathbb{T}_{n}$ instead of ${\mathbb{A}}$ . Using Lemma 6.3, we have $\lim_{n\rightarrow+\infty}|B_{h_{n}}(x)|=0.$ From Remark 2.7, this implies that $B_{h_{n}}(x)\xRightarrow[b_{n}^{2}]{\rm superexp}0.$ Next, we set $f_{n}(\cdot)=K_{h_{n}}(x-\cdot)$ in such a way that we have

|\mathbb{T}_{n}|^{-1}h_{n}^{d-/2}\sum_{u\in\mathbb{T}_{n}}\Big{(}K_{h}(x-X_{u})-K_{h}\star\mu(x)\Big{)}=|\mathbb{T}_{n}|^{-1}h_{n}^{d-/2}\sum_{\ell=0}^{n}M_{\mathbb{G}_{\ell}}(\tilde{f}_{n}).

Following line by line the proof of (32) (where we take $f_{\ell,n}=f_{n}$ for all $\ell\leq n$ ), we get

\mathbb{P}\Big{(}|\mathbb{T}_{n}|^{-1}h_{n}^{d-/2}\Big{|}\sum_{\ell=0}^{n}M_{\mathbb{G}_{\ell}}(\tilde{f}_{n})\Big{|}>\delta\Big{)}\leq 2\exp\Big{(}\frac{3\delta}{c_{1}+c_{2}\delta}\Big{)}\exp\Big{(}-\frac{3\delta^{2}|\mathbb{T}_{n}|h_{n}^{d}}{c_{1}+c_{2}\delta}\Big{)}.

Taking the $\log$ , dividing by $b_{n}^{2}$ and letting $n$ goes to the infinity in the latter inequality, we get

|\mathbb{T}_{n}|^{-1}h_{n}^{d-/2}\sum_{u\in\mathbb{T}_{n}}\Big{(}K_{h}(x-X_{u})-K_{h}\star\mu(x)\Big{)}\xRightarrow[b_{n}^{2}]{\rm superexp}0.

It then follows from the decomposition (4) that $\widehat{\mu}_{\mathbb{T}_{n}}(x)\xRightarrow[b_{n}^{2}]{\rm superexp}\mu(x).$ We similarly get the result for ${\mathbb{A}}_{n}=\mathbb{G}_{n}$ and this ends the proof of the lemma.

4.2. Proof of Theorem 3.2

We begin the proof with ${\mathbb{A}}_{n}=\mathbb{T}_{n}$ . We have the following decomposition:

b_{n}^{-1}\sqrt{|\mathbb{T}_{n}|h_{n}^{d}}\big{(}\widehat{\mu}_{\mathbb{T}_{n}}(x)-\mu(x)\big{)}=\sqrt{\frac{|\mathbb{G}_{n}|}{|\mathbb{T}_{n}|}}b_{n}^{-1}N_{n,\emptyset}({\mathfrak{f}}_{n})+\frac{\sqrt{|\mathbb{T}_{n}|h_{n}^{d}}}{b_{n}}B_{h_{n}}(x),

where ${\mathfrak{f}}_{n}=(f_{\ell,n},\,n\geq\ell\geq 0)$ with the functions $f_{\ell,n}=f_{\ell,n}^{\text{id}}$ defined in (9) for $n\geq\ell\geq 0$ and $f_{\ell,n}=0$ otherwise; $N_{n,\emptyset}({\mathfrak{f}}_{n})$ is defined in (8) and the bias term $B_{h_{n}}(x)$ is defined in (4). Thanks to Theorem 3.10 applied to the sequence $(f_{\ell,n}^{id},n\geq\ell\geq 0)$ and using that $\lim_{n\rightarrow+\infty}|\mathbb{G}_{n}|/|\mathbb{T}_{n}|=1/2,$ we get that $\sqrt{|\mathbb{G}_{n}||\mathbb{T}_{n}|^{-1}}b_{n}^{-1}N_{n,\emptyset}({\mathfrak{f}}_{n})$ satisfies a moderate deviation principle in $\mathbb{R}$ with speed $b_{n}^{2}$ and rate function $I$ defined by: $I(y)=y^{2}/(2\|K\|_{2}^{2}\,\mu(x))$ for all $y\in\mathbb{R}.$ To complete the proof of Theorem 3.2, it suffices to prove that

(11)

\lim_{n\rightarrow+\infty}\frac{\sqrt{|\mathbb{T}_{n}|h_{n}^{d}}}{b_{n}}B_{h_{n}}(x)=0.

Next, using that

\mu(x-h_{n}y)-\mu(x)=\sum_{j=1}^{d}(\mu(x_{1}-h_{n}y_{1},\ldots,x_{j}-h_{n}y_{j},x_{j+1},\ldots,x_{d})\\ -\mu(x_{1}-h_{n}y_{1},\ldots,x_{j-1}-h_{n}y_{j-1},x_{j},x_{j+1},\ldots,x_{d})),

the Taylor expansion and Assumption 2.12, we get that, for some finite constant $C>0$ ,

	$\displaystyle\|\mathbb{T}_{n}\|^{1/2}h_{n}^{d/2}B_{h_{n}}(x)$	$\displaystyle=\sqrt{\|\mathbb{T}_{n}\|h_{n}^{d}}\,\,\Big{\|}\int_{\mathbb{R}^{d}}h_{n}^{-d}K(h_{n}^{-1}(x-y))\mu(y)dy-\mu(x)\Big{\|}$
		$\displaystyle=\sqrt{\|\mathbb{T}_{n}\|h_{n}^{d}}\,\,\Big{\|}\int_{\mathbb{R}^{d}}K(y)(\mu(x-h_{n}y)-\mu(x))\,dy\Big{\|}$
		$\displaystyle\leq C\sqrt{\|\mathbb{T}_{n}\|h_{n}^{d}}\,\,\sum_{j=1}^{d}\,\,\int_{\mathbb{R}^{d}}K(y)\frac{(h_{n}\|y_{j}\|)^{s}}{\lfloor s\rfloor!}dy$
		$\displaystyle\leq C\sqrt{\|\mathbb{T}_{n}\|h_{n}^{2s+d}}.$

Now, (11) follows using the latter inequality and (6). This ends the proof of Theorem 3.2 for ${\mathbb{A}}_{n}=\mathbb{T}_{n}$ . The proof is similar for ${\mathbb{A}}_{n}=\mathbb{G}_{n}$ using $f_{\ell,n}=f^{0}_{\ell,n}.$

4.3. Proof of Theorem 3.3

. We begin the proof with ${\mathbb{A}}_{n}=\mathbb{T}_{n}$ . We have the following decomposition:

(12)

\frac{b_{n}^{-1}\sqrt{|\mathbb{T}_{n}|h_{n}^{d}}(\widehat{\mu}_{\mathbb{T}_{n}}(x)-\mu(x))}{\|K\|_{2}\sqrt{\widehat{\mu}_{{\mathbb{A}}_{n}^{*}}(x)}\vee\varpi_{n}}=T_{1}(n)+T_{2}(n)

where

	$\displaystyle T_{1}(n)=(\\|K\\|_{2}\sqrt{\mu(x)}b_{n})^{-1}\sqrt{\|\mathbb{T}_{n}\|h_{n}^{d}}\Big{(}\widehat{\mu}_{\mathbb{T}_{n}}(x)-\mu(x)\Big{)};$
	$\displaystyle T_{2}(n)=\Big{(}\frac{1}{\\|K\\|_{2}\sqrt{\widehat{\mu}_{{\mathbb{A}}_{n}^{*}}(x)}\vee\varpi_{n}}-\frac{1}{\\|K\\|_{2}\sqrt{\mu(x)}}\Big{)}b_{n}^{-1}\sqrt{\|\mathbb{T}_{n}\|h_{n}^{d}}\Big{(}\widehat{\mu}_{\mathbb{T}_{n}}(x)-\mu(x)\Big{)}.$

First, we prove that

(13)

T_{2}(n)\xRightarrow[b_{n}^{2}]{\rm superexp}0.

Let $\delta>0.$ For all $r>0$ , we have

\mathbb{P}\big{(}|T_{2}(n)|>\delta\big{)}\leq\mathbb{P}\big{(}\big{|}b_{n}^{-1}\sqrt{|\mathbb{T}_{n}|h_{n}^{d}}\big{(}\widehat{\mu}_{\mathbb{T}_{n}}(x)-\mu(x)\big{)}\big{|}>\delta/r\big{)}\\ +\mathbb{P}\big{(}\big{|}\frac{1}{\|K\|_{2}\sqrt{\widehat{\mu}_{{\mathbb{A}}_{n}^{*}}(x)}\vee\varpi_{n}}-\frac{1}{\|K\|_{2}\sqrt{\mu(x)}}\big{|}>r\big{)}.

This implies that (see for e.g [11], Lemma 1.2.15)

(14)

\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\big{(}|T_{2}(n)|>\delta\big{)}\leq\max\Big{\{}\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\big{(}\big{|}b_{n}^{-1}\sqrt{|\mathbb{T}_{n}|h_{n}^{d}}\big{(}\widehat{\mu}_{\mathbb{T}_{n}}(x)-\mu(x)\big{)}\big{|}>\delta/r\big{)};\\ \limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\big{(}\big{|}\frac{1}{\|K\|_{2}\sqrt{\widehat{\mu}_{{\mathbb{A}}_{n}^{*}}(x)}\vee\varpi_{n}}-\frac{1}{\|K\|_{2}\sqrt{\mu(x)}}\big{|}>r\big{)}\Big{\}}.

Using Theorem 3.2 and the contraction principle, we have

(15)

\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\big{(}\big{|}b_{n}^{-1}\sqrt{|\mathbb{T}_{n}|h_{n}^{d}}\big{(}\widehat{\mu}_{\mathbb{T}_{n}}(x)-\mu(x)\big{)}\big{|}>\delta/r\big{)}=-\frac{\delta^{2}}{2\|K\|_{2}\mu(x)r^{2}}.

Following the step 1 of the proof of Theorem 6 in [7] and using Lemma 3.1, we can prove that

\|K\|_{2}^{2}\,\widehat{\mu}_{{\mathbb{A}}_{n}^{*}}(x)\vee\varpi_{n}^{2}\xRightarrow[b_{n}^{2}]{\rm superexp}\|K\|_{2}^{2}\,\mu(x).

Using Lemma B.2 in [3], the latter convergence implies that

(16)

\frac{1}{\|K\|_{2}\sqrt{\widehat{\mu}_{{\mathbb{A}}_{n}^{*}}(x)}\vee\varpi_{n}}\xRightarrow[b_{n}^{2}]{\rm superexp}\frac{1}{\|K\|_{2}\sqrt{\mu(x)}}.

Using (14), (15) and (16), we get

\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\big{(}|T_{2}(n)|>\delta\big{)}\leq-\frac{\delta^{2}}{2\|K\|_{2}\mu(x)r^{2}}.

Since $r$ can be taken arbitrarily close to $0$ , we get (13) and using (12), this implies that

(17)

\frac{b_{n}^{-1}\sqrt{|\mathbb{T}_{n}|h_{n}^{d}}(\widehat{\mu}_{\mathbb{T}_{n}}(x)-\mu(x))}{\|K\|_{2}\sqrt{\widehat{\mu}_{{\mathbb{A}}_{n}^{*}}(x)}\vee\varpi_{n}}\mathrel{\underset{b_{n}^{2}}{\overset{{\rm superexp}}{\scalebox{2.0}[1.0]{$\sim$}}}}T_{1}(n).

Using Theorem 3.2 and the contraction principle, we get that $T_{1}(n)$ satisfies a moderate deviation principle on $\mathbb{R}$ with speed $b_{n}^{2}$ and rate function $I^{\prime}$ defined by: $I^{\prime}(y)=y^{2}/2$ for all $y\in\mathbb{R}$ . Using (17) and Remark 2.7, we get the result of Theorem 3.3.

4.4. Proof of Corollary 3.6

Let $\boldsymbol{a}=(a_{0},\ldots,a_{k})^{t}\in\mathbb{R}^{k+1}$ . Let $n>k.$ We consider the sequence ${\mathfrak{f}}_{n}=(f_{\ell,n},n\geq\ell\geq 0)$ defined by $f_{\ell,n}=2^{\ell/2}\,a_{\ell}\,K_{h_{n-\ell}}(x-\cdot)$ for all $\ell\in\{0,\ldots,k\}$ and $f_{\ell,n}=0$ otherwise. We easily check that ${\mathfrak{f}}_{n}$ satisfies Assumptions 3.8. In particular, the asymptotic variance defined in (7) is given by $\sigma^{2}=\big{(}\sum_{\ell=0}^{k}2^{\ell}a_{\ell}^{2}\big{)}\|K\|_{2}^{2}\,\mu(x).$ Observe that the linear combinaison $M_{n}(\boldsymbol{a})$ , with coefficients $\boldsymbol{a}=(a_{0},\ldots,a_{k})^{t}\in\mathbb{R}^{k+1},$ of the estimators $|\mathbb{G}_{n-\ell}|^{1/2}h_{n-\ell}^{d/2}(\widehat{\mu}_{\mathbb{G}_{n-\ell}}(x)-\mu(x))$ , $\ell\in\{0,\ldots,k\}$ has the following decomposition:

(18)

M_{n}(\boldsymbol{a})=N_{n,\emptyset}({\mathfrak{f}}_{n})+\sum_{\ell=0}^{k}a_{\ell}\big{(}|\mathbb{G}_{n-\ell}|\,h_{n-\ell}^{d}\big{)}^{1/2}B_{h_{n-\ell}}(x),

where $N_{n,\emptyset}({\mathfrak{f}}_{n})$ is defined in (8) and the $B_{h_{n-\ell}}(x)$ , $\ell\in\{0,\ldots,k\}$ , are defined in (4). Applying Theorem 3.10, we get that $b_{n}^{-1}\,N_{n,\emptyset}({\mathfrak{f}}_{n})$ satisfies a moderate deviation principle on $\mathbb{R}$ with speed $b_{n}^{2}$ and rate function $I_{x,\boldsymbol{a}}:\mathbb{R}\rightarrow\mathbb{R}$ defined by

(19)

I_{x,\boldsymbol{a}}(y)=\frac{y^{2}}{2\,\big{(}\sum_{\ell=0}^{k}2^{\ell}a_{\ell}^{2}\big{)}\|K\|_{2}^{2}\,\mu(x)},\quad y\in\mathbb{R}.

Using (11), we have that

\lim_{n\rightarrow+\infty}\frac{1}{b_{n}}\sum_{\ell=0}^{k}a_{\ell}\big{(}|\mathbb{G}_{n-\ell}|\,h_{n-\ell}^{d}\big{)}^{1/2}B_{h_{n-\ell}}(x)=0.

Using Remark 2.7, this implies that

(20)

\frac{1}{b_{n}}\sum_{\ell=0}^{k}a_{\ell}\big{(}|\mathbb{G}_{n-\ell}|\,h_{n-\ell}^{d}\big{)}^{1/2}B_{h_{n-\ell}}(x)\xRightarrow[b_{n}^{2}]{\rm superexp}0.

Using (18) and (20) we get that $b_{n}^{-1}M_{n}(\boldsymbol{a})$ and $b_{n}^{-1}N_{n,\emptyset}({\mathfrak{f}}_{n})$ satisfy the same moderate deviation principle. We then conclude that $b_{n}^{-1}M_{n}(\boldsymbol{a})$ satisfies a moderate deviation principle on $\mathbb{R}$ with speed $b_{n}^{2}$ and rate function $I_{x,\boldsymbol{a}}$ defined in (19). Since this is true for all vector $\boldsymbol{a}\in\mathbb{R}^{k+1}$ , that is for all the linear combinaisons of the estimators $|\mathbb{G}_{n-\ell}|^{1/2}h_{n-\ell}^{d/2}(\widehat{\mu}_{\mathbb{G}_{n-\ell}}(x)-\mu(x))$ , $\ell\in\{0,\ldots,k\}$ , we get the result of Corollary 3.6.

5. Proof of Theorem 3.10

We begin with some notations. We will denote by $C$ any unimportant finite constant which may vary from line to line (in particular $C$ does not depend on $n\in{\mathbb{N}}$ nor on the considered sequence of functions ${\mathfrak{f}}_{n}=(f_{\ell,n},n\geq\ell\geq 0)$ ). Let $(p_{n},n\in{\mathbb{N}})$ be a non-decreasing sequence of elements of ${\mathbb{N}}^{*}$ such that

\lim_{n\rightarrow+\infty}p_{n}^{3}\,b_{n}^{2}\,|\mathbb{G}_{n-p_{n}}|^{-1}=0.

When there is no ambiguity, we write $p$ for $p_{n}$ .

Let $i,j\in\mathbb{T}$ . We write $i\preccurlyeq j$ if $j\in i\mathbb{T}$ . We denote by $i\wedge j$ the most recent common ancestor of $i$ and $j$ , which is defined as the only $u\in\mathbb{T}$ such that if $v\in\mathbb{T}$ and $v\preccurlyeq i$ , $v\preccurlyeq j$ then $v\preccurlyeq u$ . We also define the lexicographic order $i\leq j$ if either $i\preccurlyeq j$ or $v0\preccurlyeq i$ and $v1\preccurlyeq j$ for $v=i\wedge j$ . Let $X=(X_{i},i\in\mathbb{T})$ be a $BMC$ with kernel ${\mathcal{P}}$ and initial measure $\nu$ . For $i\in\mathbb{T}$ , we define the $\sigma$ -field:

{\mathcal{F}}_{i}=\{X_{u};u\in\mathbb{T}\text{ such that $u\leq i$}\}.

By construction, the $\sigma$ -fields $({\mathcal{F}}_{i};\,i\in\mathbb{T})$ are nested as ${\mathcal{F}}_{i}\subset{\mathcal{F}}_{j}$ for $i\leq j$ .

We define for $n\in{\mathbb{N}}$ , $i\in\mathbb{G}_{n-p_{n}}$ and ${\mathfrak{f}}_{n}$ the martingale increments:

(21)

\Delta_{n,i}({\mathfrak{f}}_{n})=N_{n,i}({\mathfrak{f}}_{n})-{\mathbb{E}}\left[N_{n,i}({\mathfrak{f}}_{n})|\,{\mathcal{F}}_{i}\right]\quad\text{and}\quad\Delta_{n}({\mathfrak{f}}_{n})=\sum_{i\in\mathbb{G}_{n-p_{n}}}\Delta_{n,i}({\mathfrak{f}}_{n}),

where

(22)

N_{n,i}({\mathfrak{f}}_{n})=|\mathbb{G}_{n}|^{-1/2}\sum_{\ell=0}^{p}M_{i\mathbb{G}_{p-\ell}}(\tilde{f}_{\ell,n})\quad\text{and}\quad i\mathbb{G}_{p-\ell}=\{ij,j\in\mathbb{G}_{p-\ell}\}.

We have:

\sum_{i\in\mathbb{G}_{n-p_{n}}}N_{n,i}({\mathfrak{f}}_{n})=|\mathbb{G}_{n}|^{-1/2}\sum_{\ell=0}^{p_{n}}M_{\mathbb{G}_{n-\ell}}(\tilde{f}_{\ell,n})=|\mathbb{G}_{n}|^{-1/2}\sum_{k=n-p_{n}}^{n}M_{\mathbb{G}_{k}}(\tilde{f}_{n-k,n}).

Using the branching Markov property, we get for $i\in\mathbb{G}_{n-p_{n}}$ :

(23)

{\mathbb{E}}\left[N_{n,i}({\mathfrak{f}}_{n})|\,{\mathcal{F}}_{i}\right]={\mathbb{E}}\left[N_{n,i}({\mathfrak{f}}_{n})|\,X_{i}\right]=|\mathbb{G}_{n}|^{-1/2}\sum_{\ell=0}^{p_{n}}{\mathbb{E}}_{X_{i}}\left[M_{\mathbb{G}_{p_{n}-\ell}}(\tilde{f}_{\ell,n})\right].

We have the following decomposition:

(24)

N_{n,\emptyset}({\mathfrak{f}}_{n})=\Delta_{n}({\mathfrak{f}}_{n})+R_{0}(n)+R_{1}(n),

where $\Delta_{n}({\mathfrak{f}})$ is defined in (21) and:

R_{0}(n)=|\mathbb{G}_{n}|^{-1/2}\,\sum_{k=0}^{n-p_{n}-1}M_{\mathbb{G}_{k}}(\tilde{f}_{n-k,n})\quad\text{and}\quad R_{1}(n)=\sum_{i\in\mathbb{G}_{n-p_{n}}}{\mathbb{E}}\left[N_{n,i}({\mathfrak{f}}_{n})|\,{\mathcal{F}}_{i}\right].

From (24), our goals will be achieved if we prove the following:

(25)		$\displaystyle b_{n}^{-1}R_{0}(n)\xRightarrow[b_{n}^{2}]{\rm superexp}0;$
(26)		$\displaystyle b_{n}^{-1}R_{1}(n)\xRightarrow[b_{n}^{2}]{\rm superexp}0;$
(27)		$\displaystyle b_{n}^{-1}\Delta_{n}({\mathfrak{f}})\quad\text{satisfies a MDP on $S$ with speed $b_{n}^{2}$ and rate function $I$.}$

Note that (25) and (26) mean that $R_{0}(n)$ and $R_{1}(n)$ are negligible in the sense of moderate deviations in such a way that using (24) and Remark 2.7, $N_{n,\emptyset}({\mathfrak{f}})$ and $\Delta_{n}({\mathfrak{f}})$ satisfy the same moderate deviation principle. To prove (27), the main method we will use is the moderate deviations for martingale (see [12] for more details).

In the sequel, the sequence $(2^{-\gamma n},n\in\mathbb{N})$ which appears in Assumption 3.8 will be denoted $(h_{n},n\in\mathbb{N})$ in such a way that we have $2^{-d\gamma n/2}=h_{n}^{d/2}.$ We have the following result.

Lemma 5.1.

Under the assumptions of Theorem 3.10, we have $b_{n}^{-1}R_{0}(n)\xRightarrow[b_{n}^{2}]{\rm superexp}0.$

Proof.

Let $\delta>0$ . Using the Chernoff bound, we have, for all $\lambda>0$ ,

(28)

\mathbb{P}\Big{(}b_{n}^{-1}R_{0}(n)>\delta\Big{)}\leq\exp\Big{(}-\lambda b_{n}|\mathbb{G}_{n}|^{1/2}\delta\Big{)}\mathbb{E}\Big{[}\exp\Big{(}\lambda\sum_{\ell=0}^{n-p-1}M_{\mathbb{G}_{\ell}}(\tilde{f}_{n-\ell,n})\Big{)}\Big{]}.

For all $k\in\{1,\ldots,n-p\}$ and for $u\in\mathbb{T}$ , we set

g_{p,k}=\sum_{r=0}^{k-1}2^{r}{\mathcal{Q}}^{r}\tilde{f}_{p+k-r,n}\quad\text{and}\quad Z_{p,k}(u)=g_{p,k}(X_{u0})+g_{p,k}(X_{u1})-2{\mathcal{Q}}g_{p,k}(X_{u}).

Then, using recursively the fact that

\sum_{u\in\mathbb{G}_{\ell}}f(X_{u})=\sum_{u\in\mathbb{G}_{\ell-1}}(f(X_{u0})+f(X_{u1})-2{\mathcal{Q}}f(X_{u}))+\sum_{u\in\mathbb{G}_{\ell-1}}2{\mathcal{Q}}f(X_{u}),

for all $\ell\geq 1$ and for some function $f$ , we get

\mathbb{E}\Big{[}\exp\Big{(}\lambda\sum_{\ell=0}^{n-p-1}M_{\mathbb{G}_{\ell}}(\tilde{f}_{n-\ell,n})\Big{)}\Big{]}=\mathbb{E}\Big{[}\exp\Big{(}\lambda g_{p,n-p}(X_{\emptyset})\Big{)}\,\prod_{k=1}^{n-p-1}\exp\Big{(}\lambda\sum_{u\in\mathbb{G}_{n-p-k-1}}Z_{p,k}(u)\Big{)}\Big{]}.

For all $m\in\{1,\ldots,n-p-1\}$ , we set

\mathbb{I}_{m}=\mathbb{E}\Big{[}\exp(\lambda g_{p,n-p}(X_{\emptyset}))\,\prod_{k=m}^{n-p-1}\exp(\lambda\sum_{u\in\mathbb{G}_{n-p-k-1}}Z_{p,k}(u))\Big{]}.

Using the branching Markov property, we get the following decomposition:

\mathbb{I}_{m}=\mathbb{E}\Big{[}\exp(\lambda g_{p,n-p}(X_{\emptyset}))\,\mathbb{J}_{m}\,\prod_{k=m+1}^{n-p-1}\exp(\lambda\sum_{u\in\mathbb{G}_{n-p-k-1}}Z_{p,k}(u))\Big{]},

with

\mathbb{J}_{m}=\prod_{u\in\mathbb{G}_{n-p-m-1}}\mathbb{E}_{X_{u}}\Big{[}\exp(\lambda Z_{p,m}(u))\Big{]}.

For all $u\in\mathbb{G}_{n-p-m-1}$ , we will upper bound the quantity $\mathbb{E}_{X_{u}}[\exp(\lambda Z_{p,m}(u))]$ and then $\mathbb{J}_{m}$ . We claim that:

(29)

|Z_{p,m}(u)|\,\leq\,M\,=\,C\,h_{n}^{-d/2};

(30)

\mathbb{E}_{X_{u}}[Z_{p,m}(u)^{2}]\,\leq\,\sigma_{m}^{2}\,=\,C+\,C\,h_{n}^{d}\,\Big{(}\sum_{r=0}^{m-1}(2\alpha)^{r-1}\Big{)}^{2}{\bf 1}_{\{m>1\}}.

For that purpose, we plan to use the bound

(31)

\mathbb{E}\big{[}\exp(\lambda Z)\big{]}\leq\exp\Big{(}\frac{\lambda^{2}\sigma^{2}}{2(1-\lambda M/3)}\Big{)}

valid for any $\lambda\in(0,3/M)$ , any random variable $Z$ such that $|Z|\leq M$ , $\mathbb{E}[Z]=0$ and $\mathbb{E}[Z^{2}]\leq\sigma^{2}$ . For all $u\in\mathbb{G}_{n-p-m-1}$ and for all $\lambda\in(0,Ch_{n}^{-d/2}/3)$ we get, using (29)-(31),

\mathbb{E}_{X_{u}}\Big{[}\exp(\lambda Z_{p,m}(u))\Big{]}\leq\exp\Big{(}\frac{\lambda^{2}\sigma_{m}^{2}}{2(1-\lambda M/3)}\Big{)}.

For all $m\in\{1,\ldots,n-p-1\}$ , the latter inequality implies that

\mathbb{J}_{m}\leq\exp\Big{(}\frac{\lambda^{2}\sigma_{m}^{2}|\mathbb{G}_{n-p-m-1}|}{2(1-\lambda M/3)}\Big{)}\quad\text{and}\quad\mathbb{I}_{m}\leq\exp\Big{(}\frac{\lambda^{2}\sigma_{m}^{2}|\mathbb{G}_{n-p-m-1}|}{2(1-\lambda M/3)}\Big{)}\mathbb{I}_{m+1}.

Recall that $\mathbb{I}_{1}=\mathbb{E}\big{[}\exp\big{(}\lambda\sum_{\ell=0}^{n-p-1}M_{\mathbb{G}_{\ell}}(\tilde{f}_{n-\ell,n})\big{)}\big{]}.$ By recurrence, we get

\mathbb{E}\Big{[}\exp\Big{(}\lambda\sum_{\ell=0}^{n-p-1}M_{\mathbb{G}_{\ell}}(\tilde{f}_{n-\ell,n})\Big{)}\Big{]}=\mathbb{I}_{1}\leq\exp\Big{(}\frac{\lambda^{2}\sum_{m=1}^{n-p-1}\sigma_{m}^{2}|\mathbb{G}_{n-p-m-1}|}{2(1-\lambda M/3)}\Big{)}\mathbb{E}\Big{[}\exp\Big{(}\lambda g_{p,n-p}(X_{\emptyset})\Big{)}\Big{]}.

Using $(i)$ and $(ii)$ of Assumption 3.8 and (3), we have

\displaystyle|g_{p,n-p}|\leq|\tilde{f}_{n,n}|+\sum_{r=1}^{n-p-1}2^{r}|{\mathcal{Q}}^{r-1}({\mathcal{Q}}\tilde{f}_{n-r,n})|\leq Ch_{n}^{-d/2}+Ch_{n}^{d/2}\sum_{r=1}^{n-p-1}(2\alpha)^{r-1}.

This implies that

\mathbb{E}\Big{[}\exp\Big{(}\lambda\sum_{\ell=0}^{n-p-1}M_{\mathbb{G}_{\ell}}(\tilde{f}_{n-\ell,n})\Big{)}\Big{]}\leq\exp\Big{(}\frac{\lambda^{2}\sum_{m=1}^{n-p-1}\sigma_{m}^{2}|\mathbb{G}_{n-p-m-1}|}{2(1-\lambda M/3)}\Big{)}\\ \times\,\exp\Big{(}\lambda\,C\,h_{n}^{-d/2}\,+\,\lambda\,C\,h_{n}^{d/2}\,\sum_{r=0}^{n-p-2}(2\alpha)^{r}\Big{)}.

Distinguishing the cases $2\alpha\leq 1$ , $1/2<2\alpha\leq\sqrt{2}$ and $2\alpha>\sqrt{2}$ and using (5) for $2\alpha>1$ , we get

\mathbb{E}\Big{[}\exp\Big{(}\lambda\sum_{\ell=0}^{n-p-1}M_{\mathbb{G}_{\ell}}(\tilde{f}_{n-\ell,n})\Big{)}\Big{]}\leq\exp\Big{(}\frac{c_{1}\lambda^{2}|\mathbb{G}_{n-p}|}{2(1-c_{2}\lambda h_{n}^{-d/2}/3)}\Big{)}\exp\Big{(}c_{3}\lambda h_{n}^{-d/2}\Big{)},

where $c_{1}$ , $c_{2}$ and $c_{3}$ are some positive constants. The latter inequality and (28) imply that

\mathbb{P}\Big{(}b_{n}^{-1}R_{0}(n)>\delta\Big{)}\leq\exp\Big{(}-\lambda b_{n}|\mathbb{G}_{n}|^{1/2}\delta+\frac{c_{1}\lambda^{2}|\mathbb{G}_{n-p}|}{2(1-c_{2}\lambda h_{n}^{-d/2}/3)}\Big{)}\,\exp\Big{(}c_{3}\lambda h_{n}^{-d/2}\Big{)}.

Taking¹¹1In fact, we use the following. For $\alpha,\beta,\gamma>0$ and $h(x)=-\alpha x+\frac{\beta x^{2}}{2(1-\gamma x)}$ we have $h(x^{*})=\frac{-\alpha^{2}}{2(\beta+\alpha\gamma)}$ for the choice $x^{*}=\frac{\alpha}{2\alpha\gamma+\beta}\in(0,1/\gamma).$

\lambda=\frac{3\,b_{n}\,|\mathbb{G}_{n}|^{1/2}\,\delta}{2\,c_{2}\,b_{n}\,|\mathbb{G}_{n}|^{1/2}\,h_{n}^{-d/2}\,\delta+3\,c_{1}\,|\mathbb{G}_{n-p}|},

we are led to

\mathbb{P}\Big{(}b_{n}^{-1}R_{0}(n)>\delta\Big{)}\leq C\,\exp\Big{(}-\,\frac{3\,\delta^{2}\,b_{n}^{2}\,|\mathbb{G}_{n}|}{2(c_{2}\,\delta\,b_{n}\,|\mathbb{G}_{n}|^{1/2}\,h_{n}^{-d/2}+3\,c_{1}\,|\mathbb{G}_{n-p}|)}\Big{)}.

Since we can do the same thing for $-{\mathfrak{f}}_{n}$ instead of ${\mathfrak{f}}_{n}$ , we get that

(32)

\mathbb{P}\Big{(}b_{n}^{-1}|R_{0}(n)|>\delta\Big{)}\leq 2\,C\,\exp\Big{(}-\,\frac{3\,\delta^{2}\,b_{n}^{2}\,|\mathbb{G}_{n}|}{2(c_{2}\,\delta\,b_{n}\,|\mathbb{G}_{n}|^{1/2}\,h_{n}^{-d/2}+3\,c_{1}\,|\mathbb{G}_{n-p}|)}\Big{)}.

Finally, in the latter inequality, taking the $\log$ , dividing by $b_{n}^{2}$ and letting $n$ goes to infinity, we get the result of Lemma 5.1. Now, to end the proof, we will prove (29) and (30).

Proof of (29)

Using Assumption 2.9, $(i)$ and $(ii)$ of Assumption 3.8 and Assumption 2.13, we get

	$\displaystyle\|Z_{p,m}(u)\|$	$\displaystyle\leq C\,\\|\tilde{f}_{p+1,n}\\|_{\infty}+C(1+2\alpha)(\sum_{r=1}^{m-1}(2\alpha)^{r-1}\\|{\mathcal{Q}}f_{p+m-r,n}\\|_{\infty}){\bf 1}_{\{m>1\}}$
		$\displaystyle\leq C\,h_{n}^{-d/2}\,+\,C\,h_{n}^{d/2}\,\sum_{r=0}^{m-1}(2\alpha)^{r}\,\leq\,C\,h_{n}^{-d/2}.$

Proof of (30)

Using the branching Markov property for the second inequality, Assumption 2.9 for the fourth inequality and $(i)$ and $(ii)$ of Assumption 3.8 for the last inequality, we get

	$\displaystyle\mathbb{E}_{X_{u}}[Z_{p,m}(u)^{2}]$	$\displaystyle\leq\mathbb{E}_{X_{u}}[(g_{p,m}(X_{u0})+g_{p,m}(X_{u1}))^{2}]\leq C{\mathcal{Q}}(g_{p,m}^{2})(X_{u})$
		$\displaystyle\leq C{\mathcal{Q}}(\tilde{f}_{p+1,n}^{2})(X_{u})+C{\mathcal{Q}}\Big{(}\Big{(}\sum_{r=1}^{m-1}2^{r}{\mathcal{Q}}^{r-1}({\mathcal{Q}}\tilde{f}_{p+m-r,n})\Big{)}^{2}\Big{)}(X_{u})\,{\bf 1}_{\{m>1\}}$
		$\displaystyle\leq C\\|{\mathcal{Q}}\tilde{f}_{p+1,n}^{2}\\|_{\infty}+\Big{(}\sum_{r=1}^{m-1}(2\alpha)^{r-1}\\|{\mathcal{Q}}f_{p+m-r,n}\\|_{\infty}\Big{)}^{2}{\bf 1}_{\{m>1\}}$
		$\displaystyle\leq\,C\,+\,C\,h_{n}^{d}\,\Big{(}\sum_{r=0}^{m-1}(2\alpha)^{r}\Big{)}^{2}{\bf 1}_{\{m>1\}}.$

∎

Next, we have the following result.

Lemma 5.2.

Under the assumptions of Theorem 3.10, we have $b_{n}^{-1}R_{1}(n)\xRightarrow[b_{n}^{2}]{\rm superexp}0.$

Proof.

We have, using (23) and (49),

(33)

R_{1}(n)=|\mathbb{G}_{n}|^{-1/2}M_{\mathbb{G}_{n-p}}(g_{p,n})\quad\text{where}\quad g_{p,n}=\sum_{\ell=0}^{p}2^{p-\ell}{\mathcal{Q}}^{p-\ell}\tilde{f}_{\ell,n}.

We follow the same arguments that in the proof of Lemma 5.1. For all $m\in\{1,\ldots,n-p\}$ and for all $u\in\mathbb{T}$ , we set

Z_{p,m}(u)=2^{m-1}{\mathcal{Q}}^{m-1}g_{p}(X_{u0})+2^{m-1}{\mathcal{Q}}^{m-1}g_{p}(X_{u1})-2^{m}{\mathcal{Q}}^{m}g_{p}(X_{u}).

We also consider the following quantities for $m\in\{1,\ldots,n-p\}$ and $\lambda>0$ :

	$\displaystyle\mathbb{I}_{m}=\mathbb{E}\Big{[}\exp\Big{(}\lambda 2^{n-p}{\mathcal{Q}}^{n-p}g_{p,n}(X_{\emptyset})\Big{)}\prod_{k=m}^{n-p}\exp\Big{(}\lambda\sum_{u\in\mathbb{G}_{n-p-k}}Z_{p,k}(u)\Big{)}\Big{]}\quad\text{and}$
	$\displaystyle\mathbb{J}_{m}=\prod_{u\in\mathbb{G}_{n-p-m}}\mathbb{E}_{X_{u}}\Big{[}\exp\Big{(}\lambda Z_{p,m}(u)\Big{)}\Big{]}.$

Note that using the branching Markov property, we have

(34)

\mathbb{I}_{m}=\mathbb{E}\Big{[}\exp\Big{(}\lambda 2^{n-p}{\mathcal{Q}}^{n-p}g_{p,n}(X_{\emptyset})\Big{)}\prod_{k=m+1}^{n-p}\exp\Big{(}\lambda\sum_{u\in\mathbb{G}_{n-p-k}}Z_{p,k}(u)\Big{)}\mathbb{J}_{m}\Big{]}.

As for (29)-(30), for all $m\in\{1,\ldots,n-p\}$ and $u\in\mathbb{G}_{n-p-m}$ , one can prove that

(35)

|Z_{p,m}(u)|\leq M=Ch^{-d/2}\quad\text{and}\quad\mathbb{E}_{X_{u}}\Big{[}Z_{p,m}(u)^{2}\Big{]}\leq\sigma_{m}^{2}=C{\bf 1}_{\{m=1\}}+Ch_{n}^{d}\Big{(}\sum_{\ell=0}^{p}(2\alpha)^{p+m-\ell-2}\Big{)}^{2}.

Using (31) and (35), we have, for all $u\in\mathbb{G}_{n-p-m}$ and for all $\lambda\in(0,Ch^{-d/2}/3)$ ,

\mathbb{E}_{X_{u}}\Big{[}\exp(\lambda Z_{p,m}(u))\Big{]}\leq\exp\Big{(}\frac{\lambda^{2}\sigma_{m}^{2}}{2(1-\lambda M/3)}\Big{)}.

The latter inequality and (34) imply that

\mathbb{I}_{m}\leq\exp\Big{(}\frac{\lambda^{2}\sigma_{m}^{2}|\mathbb{G}_{n-p-m}|}{2(1-\lambda M/3)}\Big{)}\mathbb{I}_{m+1}.

By recurrence, this implies that

(36)

\mathbb{I}_{1}\leq\exp\Big{(}\frac{\lambda^{2}\sum_{m=1}^{n-p}\sigma_{m}^{2}|\mathbb{G}_{n-p-m}|}{2(1-\lambda M/3)}\Big{)}\,\mathbb{E}\Big{[}\exp\Big{(}\lambda 2^{n-p}{\mathcal{Q}}^{n-p}g_{p,n}(X_{\emptyset})\Big{)}\Big{]}.

Using $(i)$ and $(ii)$ of Assumption 3.8 and Assumption 2.9, we get

(37)

|g_{p,n}|\leq Ch_{n}^{d/2}\sum_{\ell=0}^{p}(2\alpha)^{n-\ell}.

From (36), (37) and according to the value of $\alpha$ , we have, for some positive constants $c_{1}$ , $c_{2}$ and $c_{3}$ (recall the definition of $M$ and $\sigma_{m}^{2}$ given in (35)):

	$\displaystyle\mathbb{I}_{1}\leq C\exp\Big{(}\frac{\lambda^{2}c_{1}\|\mathbb{G}_{n-p}\|}{2(1-\lambda c_{2}h_{n}^{-d/2}/3)}\Big{)}\quad\hskip 176.407pt\text{if $2\alpha\leq 1;$}$
	$\displaystyle\mathbb{I}_{1}\leq\exp\Big{(}\lambda c_{3}(2\alpha)^{n}h_{n}^{d/2}\Big{)}\,\exp\Big{(}\frac{\lambda^{2}c_{1}\|\mathbb{G}_{n-p}\|(1+(2\alpha)^{2p}h_{n}^{d})}{2(1-\lambda c_{2}h_{n}^{-d/2}/3)}\Big{)}\quad\hskip 62.59596pt\text{if $1<2\alpha\leq\sqrt{2};$}$
	$\displaystyle\mathbb{I}_{1}\leq\exp\Big{(}\lambda c_{3}(2\alpha)^{n}h_{n}^{d/2}\Big{)}\,\exp\Big{(}\frac{\lambda^{2}c_{1}\|\mathbb{G}_{n-p}\|(1+(2\alpha)^{2p}h_{n}^{d}+2^{p}(2\alpha^{2})^{n}h_{n}^{d})}{2(1-\lambda c_{2}h_{n}^{-d/2}/3)}\Big{)}\quad\text{if $2\alpha>\sqrt{2}.$}$

Recall that $\mathbb{I}_{1}=\mathbb{E}[\exp(\lambda M_{\mathbb{G}_{n-p}}(g_{p,n}))]$ . Using the Chernoff bound and (33), we have for all $\lambda\in(0,Ch_{n}^{-d/2}/3)$ and for all $\delta>0$ ,

\mathbb{P}\Big{(}b_{n}^{-1}R_{1}(n)>\delta\Big{)}\leq\exp\Big{(}-\lambda b_{n}|\mathbb{G}_{n}|^{1/2}\delta\Big{)}\,\mathbb{I}_{1}.

Taking

\lambda=\begin{cases}\frac{3b_{n}|\mathbb{G}_{n}|^{1/2}\delta}{2c_{2}b_{n}|\mathbb{G}_{n}|^{1/2}h_{n}^{-d/2}\delta\,+\,3c_{1}|\mathbb{G}_{n-p}|}&\text{if $2\alpha\leq 1$}\\ \frac{3b_{n}|\mathbb{G}_{n}|^{1/2}\delta}{2c_{2}b_{n}|\mathbb{G}_{n}|^{1/2}h_{n}^{-d/2}\delta\,+\,3c_{1}|\mathbb{G}_{n-p}|(1+(2\alpha)^{2p}h_{n}^{d})}&\text{if $1<2\alpha\leq\sqrt{2}$}\\ \frac{3b_{n}|\mathbb{G}_{n}|^{1/2}\delta}{2c_{2}b_{n}|\mathbb{G}_{n}|^{1/2}h_{n}^{-d/2}\delta\,+\,3c_{1}|\mathbb{G}_{n-p}|(1+(2\alpha)^{2p}h_{n}^{d}+2^{p}(2\alpha^{2})^{n}h_{n}^{d})}&\text{if $1<2\alpha\leq\sqrt{2},$}\end{cases}

and since we can do the same things for $-{\mathfrak{f}}_{n}$ instead of ${\mathfrak{f}}_{n}$ , we get,

if $2\alpha\leq 1:$

\mathbb{P}\Big{(}b_{n}^{-1}|R_{1}(n)|>\delta\Big{)}\leq C\exp\Big{(}-\frac{3b_{n}^{2}|\mathbb{G}_{n}|\delta^{2}}{2(c_{2}b_{n}|\mathbb{G}_{n}|^{1/2}h_{n}^{-d/2}\delta\,+\,3c_{1}|\mathbb{G}_{n-p}|)}\Big{)};

if $1<2\alpha\leq\sqrt{2}:$

	$\displaystyle\mathbb{P}\Big{(}b_{n}^{-1}\|R_{1}(n)\|>\delta\Big{)}$	$\displaystyle\leq 2\exp\Big{(}\frac{c_{3}(2\alpha)^{n}h_{n}^{d/2}b_{n}\|\mathbb{G}_{n}\|^{1/2}}{2c_{2}b_{n}\|\mathbb{G}_{n}\|^{1/2}h_{n}^{-d/2}\delta\,+\,3c_{1}\|\mathbb{G}_{n-p}\|(1+(2\alpha)^{2p}h_{n}^{d})}\Big{)}$
		$\displaystyle\hskip 28.45274pt\times\exp\Big{(}-\frac{3b_{n}^{2}\|\mathbb{G}_{n}\|\delta^{2}}{2(c_{2}b_{n}\|\mathbb{G}_{n}\|^{1/2}h_{n}^{-d/2}\delta\,+\,3c_{1}\|\mathbb{G}_{n-p}\|(1+(2\alpha)^{2p}h_{n}^{d}))}\Big{)};$

if $2\alpha>\sqrt{2}:$

	$\displaystyle\mathbb{P}\Big{(}b_{n}^{-1}\|R_{1}(n)\|>\delta\Big{)}$	$\displaystyle\leq 2\exp\Big{(}\frac{c_{3}(2\alpha)^{n}h_{n}^{d/2}b_{n}\|\mathbb{G}_{n}\|^{1/2}}{2c_{2}b_{n}\|\mathbb{G}_{n}\|^{1/2}h_{n}^{-d/2}\delta\,+\,3c_{1}\|\mathbb{G}_{n-p}\|(1+(2\alpha)^{2p}h_{n}^{d}+2^{p}(2\alpha^{2})^{n}h_{n}^{d})}\Big{)}$
		$\displaystyle\hskip 7.11317pt\times\exp\Big{(}-\frac{3b_{n}^{2}\|\mathbb{G}_{n}\|\delta^{2}}{2(c_{2}b_{n}\|\mathbb{G}_{n}\|^{1/2}h_{n}^{-d/2}\delta\,+\,3c_{1}\|\mathbb{G}_{n-p}\|(1+(2\alpha)^{2p}h_{n}^{d}+2^{p}(2\alpha^{2})^{n}h_{n}^{d}))}\Big{)}.$

Finally, applying the $\log$ to each of these last three inequalities, dividing by $b_{n}^{2}$ , letting $n$ goes to infinity and using (6) and Assumption 2.13, we get the result of Lemma 5.2. ∎

From (24), Lemmas 5.1 and 5.2, we have

(38)

b_{n}^{-1}N_{n,\emptyset}({\mathfrak{f}}_{n})\mathrel{\underset{b_{n}^{2}}{\overset{{\rm superexp}}{\scalebox{2.0}[1.0]{$\sim$}}}}b_{n}^{-1}\Delta_{n}({\mathfrak{f}}_{n}).

As a consequence, using Remark 2.7, $b_{n}^{-1}N_{n,\emptyset}({\mathfrak{f}}_{n})$ and $b_{n}^{-1}\Delta_{n}({\mathfrak{f}}_{n})$ satisfy the same moderate deviation principle.

We now study the martingale part $\Delta_{n}({\mathfrak{f}}_{n})$ of the decomposition (24). The bracket $V(n)$ of $\Delta_{n}({\mathfrak{f}}_{n})$ is defined by:

V(n)=\sum_{i\in\mathbb{G}_{n-p_{n}}}{\mathbb{E}}\left[\Delta_{n,i}({\mathfrak{f}}_{n})^{2}|{\mathcal{F}}_{i}\right].

Using (22) and (21), we write:

(39)

V(n)=|\mathbb{G}_{n}|^{-1}\sum_{i\in\mathbb{G}_{n-p_{n}}}{\mathbb{E}}_{X_{i}}\left[\left(\sum_{\ell=0}^{p_{n}}M_{\mathbb{G}_{p_{n}-\ell}}(\tilde{f}_{\ell,n})\right)^{2}\right]-R_{2}(n)=V_{1}(n)+2V_{2}(n)-R_{2}(n),

with:

	$\displaystyle V_{1}(n)$	$\displaystyle=\|\mathbb{G}_{n}\|^{-1}\sum_{i\in\mathbb{G}_{n-p_{n}}}\sum_{\ell=0}^{p_{n}}{\mathbb{E}}_{X_{i}}\left[M_{\mathbb{G}_{p_{n}-\ell}}(\tilde{f}_{\ell,n})^{2}\right],$
	$\displaystyle V_{2}(n)$	$\displaystyle=\|\mathbb{G}_{n}\|^{-1}\sum_{i\in\mathbb{G}_{n-p_{n}}}\sum_{0\leq\ell<k\leq p_{n}}{\mathbb{E}}_{X_{i}}\left[M_{\mathbb{G}_{p_{n}-\ell}}(\tilde{f}_{\ell,n})M_{\mathbb{G}_{p_{n}-k}}(\tilde{f}_{k,n})\right],$
	$\displaystyle R_{2}(n)$	$\displaystyle=\sum_{i\in\mathbb{G}_{n-p_{n}}}{\mathbb{E}}\left[N_{n,i}({\mathfrak{f}}_{n})\|X_{i}\right]^{2}.$

We have the following result.

Lemma 5.3.

Under the Assumptions of Theorem 3.10, we have $R_{2}(n)\xRightarrow[b_{n}^{2}]{\rm superexp}0.$

Proof.

Using the branching Markov property, we have

R_{2}(n)=|\mathbb{G}_{n}|^{-1}M_{\mathbb{G}_{n-p}}(g_{p})\quad\text{with}\quad g_{p}=\Big{(}\sum_{\ell=0}^{p}2^{p-\ell}{\mathcal{Q}}^{p-\ell}\tilde{f}_{\ell,n}\Big{)}^{2}.

Using Assumption 2.9 and $(i)$ and $(ii)$ of Assumption 3.8, we get

	$\displaystyle\\|g_{p}\\|_{\infty}$	$\displaystyle\leq C\\|\tilde{f}_{p,n}\\|^{2}_{\infty}+C\\|(\sum_{\ell=0}^{p-1}2^{p-\ell}{\mathcal{Q}}^{p-\ell}\tilde{f}_{\ell,n})^{2}\\|_{\infty}$
		$\displaystyle\leq Ch_{n}^{-d}+C\Big{(}\sum_{\ell=0}^{p-1}(2\alpha)^{p-\ell}h_{n}^{d/2}\Big{)}^{2}$
		$\displaystyle\leq Ch_{n}^{-d}{\bf 1}_{\{2\alpha\leq 1\}}+C(h_{n}^{-d}+h_{n}^{d}(2\alpha)^{2p}){\bf 1}_{\{2\alpha>1\}}.$

This implies that

(40)

R_{2}(n)\leq C|\mathbb{G}_{n}|^{-1}h_{n}^{-d}\,{\bf 1}_{\{2\alpha\leq 1\}}+C(|\mathbb{G}_{n}|^{-1}h_{n}^{-d}+(2\alpha^{2})^{p}h_{n}^{d}|\mathbb{G}_{n-p}|^{-1})\,{\bf 1}_{\{2\alpha>1\}}.

Recall that $h_{n}=2^{-n\gamma}$ with $\gamma\in(0,1/d)$ . Using Assumption 2.13, we conclude from (40) that $R_{2}(n)$ is bounded by a deterministic sequence which converge to 0. As a consequence, using Remark 2.8, we get the result of Lemma 5.3. ∎

Recall $\sigma^{2}$ given in (7). We have the following result.

Lemma 5.4.

Under the Assumptions of Theorem 3.10, we have $V_{1}(n)\xRightarrow[b_{n}^{2}]{\rm superexp}\sigma^{2}.$

Proof.

We have the following decomposition which is a consequence of (50):

V_{1}(n)=V_{3}(n)+V_{4}(n),

with

	$\displaystyle V_{3}(n)$	$\displaystyle=\|\mathbb{G}_{n}\|^{-1}\sum_{i\in\mathbb{G}_{n-p}}\sum_{\ell=0}^{p}2^{p-\ell}\,{\mathcal{Q}}^{p-\ell}(\tilde{f}_{\ell,n}^{2})(X_{i}),$
	$\displaystyle V_{4}(n)$	$\displaystyle=\|\mathbb{G}_{n}\|^{-1}\sum_{i\in\mathbb{G}_{n-p}}\sum_{\ell=0}^{p-1}\,\sum_{k=0}^{p-\ell-1}2^{p-\ell+k}\,{\mathcal{Q}}^{p-1-(\ell+k)}\left({\mathcal{P}}\left({\mathcal{Q}}^{k}\tilde{f}_{\ell,n}\otimes^{2}\right)\right)(X_{i}).$

Now, the result of Lemma 5.4 is a direct consequence of the following:

(41)			$\displaystyle V_{3}(n)\xRightarrow[b_{n}^{2}]{\rm superexp}\sigma^{2};$
(42)			$\displaystyle V_{4}(n)\xRightarrow[b_{n}^{2}]{\rm superexp}0.$

To end the proof, we will now prove (41) and (42).

Proof of (41)

Set

g_{p,n}=\sum_{\ell=0}^{p}2^{-\ell}{\mathcal{Q}}^{p-\ell}(\tilde{f}^{2}_{\ell,n}-\langle\mu,\tilde{f}^{2}_{\ell,n}\rangle)\quad\text{and}\quad H_{3}^{[n]}({\mathfrak{f}}_{n})=\sum_{\ell=0}^{p}2^{-\ell}\langle\mu,\tilde{f}_{\ell,n}^{2}\rangle.

Following the same arguments that in the proof of Lemmas 5.1 and 5.2, we get after studious calculations:

if $2\alpha\leq 1,$

	$\displaystyle\mathbb{P}\Big{(}\|V_{3}(n)-H_{3}^{[n]}\|>\delta\Big{)}=\mathbb{P}\Big{(}\|\mathbb{G}_{n-p}\|^{-1}\|M_{\mathbb{G}_{n-p}}(g_{p,n})\|>\delta\Big{)}$
	$\displaystyle\leq C\exp\Big{(}\frac{Cp\,\delta}{C\delta h_{n}^{-d}+3(p^{2}2^{-p}+2^{-p}h_{n}^{-d})}\Big{)}\exp\Big{(}-\frac{3\delta^{2}\|\mathbb{G}_{n}\|}{2(C\delta h_{n}^{-d}+3(p^{2}2^{-p}+2^{-p}h_{n}^{-d}))}\Big{)};$

if $1<2\alpha\leq\sqrt{2},$

	$\displaystyle\mathbb{P}\Big{(}\|V_{3}(n)-H_{3}^{[n]}\|>\delta\Big{)}=\mathbb{P}\Big{(}\|\mathbb{G}_{n-p}\|^{-1}\|M_{\mathbb{G}_{n-p}}(g_{p,n})\|>\delta\Big{)}$
	$\displaystyle\leq\exp\Big{(}\frac{c_{3}(2\alpha)^{n}h_{n}^{d}\delta}{c_{2}\delta+3c_{1}((2\alpha^{2})^{p}h_{n}^{d}+2^{-p})}\Big{)}\exp\Big{(}-\frac{3\delta^{2}\|\mathbb{G}_{n}\|h_{n}^{d}}{2(c_{2}\delta+3c_{1}((2\alpha^{2})^{p}h_{n}^{d}+2^{-p}))}\Big{)};$

if $2\alpha>\sqrt{2},$

	$\displaystyle\mathbb{P}\Big{(}\|V_{3}(n)-H_{3}^{[n]}\|>\delta\Big{)}=\mathbb{P}\Big{(}\|\mathbb{G}_{n-p}\|^{-1}\|M_{\mathbb{G}_{n-p}}(g_{p,n})\|>\delta\Big{)}$
	$\displaystyle\leq\exp\Big{(}\frac{c_{3}(2\alpha)^{n}h_{n}^{d}\delta}{c_{2}\delta+3c_{1}((2\alpha^{2})^{n}h_{n}^{d}+2^{-p})}\Big{)}\exp\Big{(}-\frac{3\delta^{2}\|\mathbb{G}_{n}\|h_{n}^{d}}{2(c_{2}\delta+3c_{1}((2\alpha^{2})^{n}h_{n}^{d}+2^{-p}))}\Big{)};$

Taking the $\log$ , dividing by $b_{n}^{2}$ , letting $n$ goes to the infinity and using (6) and Assumption 2.13, we get

\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\Big{(}|V_{3}(n)-H_{3}^{[n]}|>\delta\Big{)}=-\infty.

Next, using $(iii)$ of Assumption 3.8, we get $\lim_{n\rightarrow+\infty}H_{3}^{[n]}({\mathfrak{f}}_{n})=\sigma^{2}.$ This ends the proof of (41) since $(H_{3}^{[n]}({\mathfrak{f}}_{n}))$ is a deterministic sequence.

Proof of (42)

We set

h_{\ell,k}^{(n)}=2^{k-\ell}\,{\mathcal{Q}}^{p-1-(\ell+k)}\big{(}{\mathcal{P}}\big{(}{\mathcal{Q}}^{k}\tilde{f}_{\ell,n}\otimes^{2}\big{)}\big{)}\quad\text{and}\quad H_{4,n}=\sum_{\ell=0}^{p-1}\sum_{k=0}^{p-\ell-1}h_{\ell,k}^{(n)}

in such a that $V_{4}(n)=|\mathbb{G}_{n-p}|^{-1}M_{\mathbb{G}_{n-p}}(H_{4,n})$ . Using, (3) and $(i)$ and $(ii)$ of Assumption 3.8, we get

|h_{\ell,k}^{(n)}|\leq 2^{k-\ell}{\mathcal{P}}(|{\mathcal{Q}}^{k}\tilde{f}_{\ell,n}|\otimes^{2})\leq C2^{k-\ell}h_{n}^{d}\alpha^{2k}.

This implies that $|H_{4,n}|\leq c_{n}$ and then that $|V_{4}(n)|\leq c_{n}$ , where the sequence $(c_{n},n\in\mathbb{N})$ is defined by

c_{n}=Ch_{n}^{d}{\bf 1}_{\{2\alpha^{2}\leq 1\}}+Ch_{n}^{d}(2\alpha^{2})^{p}{\bf 1}_{\{2\alpha^{2}>1\}}

Using (5) and the fact that $(h_{n},n\in\mathbb{N})$ converges to 0, we get that the sequence $(c_{n},n\in\mathbb{N})$ converges to $0$ . Thus, we have that $V_{4}(n)$ is bounded by a deterministic sequence which converges to $0$ . Then (42) follows using Remark 2.8. ∎

Lemma 5.5.

Under the Assumptions of Theorem 3.10, we have $V_{2}(n)\xRightarrow[b_{n}^{2}]{\rm superexp}0.$

Proof.

Using (51), we get:

V_{2}(n)=V_{5}(n)+V_{6}(n),

with

	$\displaystyle V_{5}(n)$	$\displaystyle=\|\mathbb{G}_{n}\|^{-1}\sum_{i\in\mathbb{G}_{n-p}}\sum_{0\leq\ell<k\leq p}2^{p-\ell}{\mathcal{Q}}^{p-k}\big{(}\tilde{f}_{k,n}{\mathcal{Q}}^{k-\ell}\tilde{f}_{\ell,n}\big{)}(X_{i}),$
	$\displaystyle V_{6}(n)$	$\displaystyle=\|\mathbb{G}_{n}\|^{-1}\sum_{i\in\mathbb{G}_{n-p}}\sum_{0\leq\ell<k<p}\sum_{r=0}^{p-k-1}2^{p-\ell+r}\,{\mathcal{Q}}^{p-1-(r+k)}\big{(}{\mathcal{P}}\big{(}{\mathcal{Q}}^{r}\tilde{f}_{k,n}\otimes_{\rm sym}{\mathcal{Q}}^{k-\ell+r}\tilde{f}_{\ell,n}\big{)}\big{)}(X_{i}).$

First, we set

h_{k,\ell,r}^{(n)}=2^{r-\ell}\,{\mathcal{Q}}^{p-1-(r+k)}\big{(}{\mathcal{P}}\big{(}{\mathcal{Q}}^{r}\tilde{f}_{k,n}\otimes_{\rm sym}{\mathcal{Q}}^{k-\ell+r}\tilde{f}_{\ell,n}\big{)}\big{)}\quad\text{and}\quad H_{6,n}=\sum_{0\leq\ell<k<p}\sum_{r=0}^{p-k-1}h_{k,\ell,r}^{(n)}

in such a way that $V_{6}(n)=|\mathbb{G}_{n-p}|^{-1}M_{\mathbb{G}_{n-p}}(H_{6,n}).$ Using, (3) and $(i)$ and $(ii)$ of Assumption 3.8, we get

|h_{k,\ell,r}^{(n)}|\leq Ch_{n}^{d}(2\alpha^{2})^{r}\alpha^{k-\ell}.

This implies that $|H_{6,n}|\leq c_{n}$ and then that $V_{6}(n)\leq c_{n}$ , where the sequence $(c_{n},n\in\mathbb{N})$ is defined by

c_{n}=Ch_{n}^{d}{\bf 1}_{\{2\alpha^{2}\leq 1\}}+C(2\alpha^{2})^{p}h_{n}^{d}{\bf 1}_{\{2\alpha^{2}>1\}}.

Since the sequence $(c_{n},n\in\mathbb{N})$ is deterministic and converges to 0, it follows, using Remark 2.8, that

V_{6}(n)\xRightarrow[b_{n}^{2}]{\rm superexp}0.

Next, for the term $V_{5}(n)$ , we have for all $k>\ell$ :

	$\displaystyle\big{\|}2^{-\ell}{\mathcal{Q}}^{p-k}\big{(}\tilde{f}_{k,n}{\mathcal{Q}}^{k-\ell}\tilde{f}_{\ell,n}\big{)}\big{\|}$	$\displaystyle\leq 2^{-\ell}{\mathcal{Q}}^{p-k}\big{(}\|\tilde{f}_{k,n}\|\|{\mathcal{Q}}^{k-\ell}\tilde{f}_{\ell,n}\|\big{)}\leq C2^{-\ell}h_{n}^{d/2}\alpha^{k-\ell}{\mathcal{Q}}^{p-k}(\|\tilde{f}_{k,n}\|)$
		$\displaystyle\leq C\alpha^{p}(2\alpha)^{-\ell}{\bf 1}_{\{k=p\}}+Ch_{n}^{d}(2\alpha)^{-\ell}\alpha^{k}{\bf 1}_{\{k\leq p-1\}},$

where we used (3) for the second inequality and $(i)$ and $(ii)$ of Assumption 3.8 for the second and the last inequality. Using the latter inequality in $V_{5}(n)$ , we get

|V_{5}(n)|\leq C\big{(}2^{-p}{\bf 1}_{\{2\alpha<1\}}+\alpha^{p}{\bf 1}_{\{2\alpha\geq 1\}}+h_{n}^{d}\big{)}.

We thus have that $V_{5}(n)$ is bounded by a deterministic sequence which converges to $0$ . It then follows from Remark 2.8 that

V_{5}(n)\xRightarrow[b_{n}^{2}]{\rm superexp}0.

From the foregoing, we get the result of Lemma since $V_{2}(n)=V_{5}(n)+V_{6}(n).$ ∎

As a direct consequence of (39) and the Lemmas 5.3, 5.4 and 5.5, we have the following result.

Lemma 5.6.

Under the Assumptions of Theorem 3.10, we have $V(n)\xRightarrow[b_{n}^{2}]{\rm superexp}\sigma^{2}.$

We now study the 4th-order exponential moment condition. We stress that this condition imply in particular the exponential Lindeberg condition (condition (C3) in Proposition 6.1). We have the following result.

Lemma 5.7.

Under the Assumptions of Theorem 3.10, we have

\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\Big{(}b_{n}^{2}\sum_{i\in\mathbb{G}_{n-p}}\mathbb{E}[\Delta_{n,i}({\mathfrak{f}}_{n})^{4}|{\mathcal{F}}_{i}]>\delta\Big{)}=-\infty\quad\forall\delta>0.

Proof.

For all $i\in\mathbb{G}_{n-p}$ , we have

(43)

\mathbb{E}\left[\Delta_{n,i}({\mathfrak{f}}_{n})^{4}|{\mathcal{F}}_{i}\right]\leq 16(p+1)^{3}2^{-2n}\sum_{\ell=0}^{p}\mathbb{E}_{X_{i}}\left[M_{\mathbb{G}_{p-\ell}}(\tilde{f}_{\ell,n})^{4}\right],

where we have used the definition of $\Delta_{n,i}({\mathfrak{f}}_{n})$ , the inequality $(\sum_{k=0}^{r}a_{k})^{4}\leq(r+1)^{3}\sum_{k=0}^{r}a_{k}^{4}$ and the branching Markov property. Using (43), we get

(44)

b_{n}^{2}\sum_{i\in\mathbb{G}_{n-p}}\mathbb{E}[\Delta_{n,i}({\mathfrak{f}}_{n})^{4}|{\mathcal{F}}_{i}]\leq Cb_{n}^{2}p^{3}2^{-2n}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}h_{n,\ell}(X_{i}),

where $h_{n,\ell}(x)=\mathbb{E}_{x}[M_{\mathbb{G}_{p-\ell}}(\tilde{f}_{\ell,n})^{4}].$ We will now prove that the right hand side of (44) converges superexponentially to $0$ at the speed $b_{n}^{2}$ , that is

\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\Big{(}Cb_{n}^{2}p^{3}2^{-2n}\Big{|}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}h_{n,\ell}(X_{i})\Big{|}>\delta\Big{)}=-\infty.

For that purpose, we will treat the case $\ell=p$ , $\ell=p-1$ and finally the case $\ell\in\{0,\ldots,p-2\}$ . First, we treat the case $\ell=p.$ Set $g_{p,n}=\tilde{f}_{p,n}^{4}.$ We have

(45)

b_{n}^{2}p^{3}2^{-2n}\sum_{i\in\mathbb{G}_{n-p}}h_{n,p}(X_{i})=b_{n}^{2}p^{3}2^{-2n}\sum_{i\in\mathbb{G}_{n-p}}\tilde{g}_{p,n}(X_{i})+b_{n}^{2}p^{3}2^{-2n}\,|\mathbb{G}_{n-p}|\langle\mu,g_{p,n}\rangle.

Since $b_{n}^{2}p^{3}2^{-2n}|\mathbb{G}_{n-p}|\langle\mu,g_{p,n}\rangle\leq p^{3}2^{-p}\,b_{n}^{2}\,(|\mathbb{G}_{n}|h_{n}^{d})^{-1}\rightarrow 0$ as $n\rightarrow 0$ , it suffices to prove that the first term of the right hand side in (45) converges superexponentially to $0$ at the speed $b_{n}^{2}$ , that is, for all $\delta>0,$

(46)

\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\Big{(}b_{n}^{2}p^{3}2^{-2n}\,|\sum_{i\in\mathbb{G}_{n-p}}\tilde{g}_{p,n}(X_{i})|>\delta\Big{)}=-\infty.

As in the proof of Lemma 5.2, we can prove that

\mathbb{P}\Big{(}b_{n}^{2}p^{3}2^{-2n}\,|\sum_{i\in\mathbb{G}_{n-p}}\tilde{g}_{p,n}(X_{i})|>\delta\Big{)}\leq C\exp\Big{(}-\frac{\delta^{2}|\mathbb{G}_{n}|^{2}h_{n}^{2d}}{Cp^{3}b_{n}^{2}(\delta+Cp^{3}b_{n}^{2}(|\mathbb{G}_{n+p}|h_{n}^{d})^{-1})}\Big{)}.

Taking the $\log$ and dividing by $b_{n}^{2}$ , we get (46).

Next, for $\ell\in\{0,\ldots,p-1\}$ , we plan to prove that the quantity $b_{n}^{2}p^{3}2^{-2n}\sum_{\ell=0}^{p-1}\sum_{i\in\mathbb{G}_{n-p}}h_{n,\ell}(X_{i})$ is bounded by a deterministic sequence which converges to $0$ . First, for $\ell=p-1$ , using the branching Markov property, $(i)$ and $(ii)$ of Assumption 3.8, we have, for all $i\in\mathbb{G}_{n-p,}$

h_{n,p-1}(X_{i})=\mathbb{E}_{X_{i}}[M_{\mathbb{G}_{1}}(\tilde{f}_{p-1,n})^{4}]\leq C{\mathcal{Q}}(\tilde{f}_{p-1,n}^{4})\leq Ch_{n}^{-d}.

Using (6), this implies that

b_{n}^{2}\,p^{3}\,2^{-2n}\sum_{i\in\mathbb{G}_{n-p}}h_{n,p-1}(X_{i})\leq C\,b_{n}^{2}\,2^{-p}\,p^{3}(|\mathbb{G}_{n}|h_{n}^{d})^{-1}\rightarrow 0\quad\text{as $n\rightarrow+\infty.$}

Now we consider the case $\ell\in\{0,\ldots,p-2\}$ . From Lemma 6.4 with $f$ replaced by $\tilde{f}_{\ell,n}$ and $\nu$ by the Dirac mass at $X_{i}$ ( $\delta_{X_{i}}$ ), we have

(47)

b_{n}^{2}\,p^{3}\,2^{-2n}\sum_{\ell=0}^{p-2}\sum_{i\in\mathbb{G}_{n-p}}h_{n,\ell}(X_{i})\leq b_{n}^{2}\,|\mathbb{G}_{n}|^{-2}\,p^{3}\,\sum_{\ell=0}^{p-2}\sum_{i\in\mathbb{G}_{n-p}}\sum_{j=1}^{9}|\psi_{j,p-\ell}|(X_{i}).

For all $j\in\{1,\ldots,9\}$ , we will upper bound each term of the right hand side in (47) by a deterministic sequence which converges to $0$ .

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{1,p-\ell}|(X_{i})$

Using $(i)$ of Assumption 3.8, we have

|\psi_{1,p-\ell}|\leq C2^{p-\ell}{\mathcal{Q}}^{p-\ell}(f_{\ell,n}^{4})\leq C2^{p-\ell}h_{n}^{-d}.

Using (6), this implies that

b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{1,p-\ell}|(X_{i})\leq Cb_{n}^{2}p^{3}(|\mathbb{G}_{n}|h_{n}^{d})^{-1}\rightarrow 0\quad\text{as $n\rightarrow+\infty.$}

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-3}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{2,p-\ell}|(X_{i})$

Using Assumption 2.9 and $(i)$ and $(ii)$ of Assumption 3.8 for the second inequality, we get

	$\displaystyle\|\psi_{2,p-\ell}\|$	$\displaystyle\leq C2^{2(p-\ell)}\sum_{k=0}^{p-\ell-1}2^{-k}{\mathcal{Q}}^{k}{\mathcal{P}}(\|{\mathcal{Q}}^{p-k-1-\ell}(\tilde{f}_{\ell,n}^{3})\|\otimes_{\rm sym}\|{\mathcal{Q}}^{p-\ell-k-2}({\mathcal{Q}}\tilde{f}_{\ell,n})\|)$
		$\displaystyle\leq C2^{2(p-\ell)}\sum_{k=0}^{p-\ell-1}2^{-k}\alpha^{p-\ell-k}\,\leq\,C2^{p-\ell}\big{(}{\bf 1}_{\{2\alpha<1\}}+(p-\ell){\bf 1}_{\{2\alpha=1\}}+(2\alpha)^{p-\ell}{\bf 1}_{\{2\alpha>1\}}\big{)}.$

Using (6) and (5), this implies that

b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{2,p-\ell}|(X_{i})\leq Cb_{n}^{2}|\mathbb{G}_{n}|^{-1}\big{(}p^{4}{\bf 1}_{\{2\alpha\leq 1\}}+(2\alpha)^{p}{\bf 1}_{\{2\alpha>1\}}\big{)}\rightarrow 0\quad\text{as $n\rightarrow+\infty.$}

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{3,p-\ell}|(X_{i})$

Using $(i)$ and $(ii)$ of Assumption 3.8 for the second inequality, we get

|\psi_{3,p-\ell}|\leq 2^{2(p-\ell)}\sum_{k=0}^{p-\ell-1}2^{-k}{\mathcal{Q}}^{k}{\mathcal{P}}({\mathcal{Q}}^{p-\ell-k-1}(\tilde{f}_{\ell,n}^{2})\otimes^{2})\leq C2^{2(p-\ell)}\sum_{k=0}^{p-\ell-1}2^{-k}\leq C2^{2(p-\ell)}.

Using (6), this implies that

b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{3,p-\ell}|(X_{i})\leq C\,b_{n}^{2}\,p^{3}\,2^{-n+p}\rightarrow 0\quad\text{as $n\rightarrow+\infty.$}

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{4,p-\ell}|(X_{i})$

Using Assumption (2.9) and $(i)$ and $(ii)$ of Assumption 3.8 for the second inequality, we get

|\psi_{4,p-\ell}|\leq C2^{4(p-\ell)}{\mathcal{P}}\big{(}{\mathcal{P}}\big{(}|{\mathcal{Q}}^{p-\ell-2}\tilde{f}_{\ell,n}|\otimes^{2}\big{)}\otimes^{2}\big{)}\leq C2^{4(p-\ell)}\alpha^{4(p-\ell-2)}h_{n}^{2d}.

Using (6) and (5), this implies that

b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{4,p-\ell}|(X_{i})\\ \leq C\,(b_{n}^{2}\,p^{4}\,2^{-n-p}h_{n}^{2d}{\bf 1}_{\{2\alpha^{2}\leq 1\}}+b_{n}^{2}p^{3}2^{-n+p}(2\alpha^{2})^{2p}h_{n}^{2d}{\bf 1}_{\{2\alpha^{2}>1\}})\rightarrow 0\quad\text{as $n\rightarrow+\infty.$}

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-3}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{5,p-\ell}|(X_{i})$

Using Assumption (2.9) and $(i)$ and $(ii)$ of Assumption 3.8 for the second inequality, we get

	$\displaystyle\|\psi_{5,p-\ell}\|$	$\displaystyle\leq C\,2^{4(p-\ell)}\sum_{k=2}^{p-\ell-1}\sum_{r=0}^{k-1}2^{-2k-r}{\mathcal{Q}}^{r}{\mathcal{P}}\big{(}{\mathcal{Q}}^{k-r-1}\big{(}{\mathcal{P}}\big{(}\|{\mathcal{Q}}^{p-\ell-k-1}\tilde{f}_{\ell,n}\|\otimes^{2}\big{)}\big{)}\otimes^{2}\big{)}$
		$\displaystyle\leq C\,2^{4(p-\ell)}\sum_{k=2}^{p-\ell-1}\sum_{r=0}^{k-1}2^{-2k-r}h_{n}^{2d}\alpha^{4(p-\ell-k)}$
		$\displaystyle\leq C\,h_{n}^{2d}\,2^{2(p-\ell)}\big{(}{\bf 1}_{\{2\alpha^{2}<1\}}+(p-\ell){\bf 1}_{\{2\alpha^{2}=1\}}+(2\alpha^{2})^{2(p-\ell)}{\bf 1}_{\{2\alpha^{2}>1\}}\big{)}.$

Using (6) and (5), this implies that

b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{5,p-\ell}|(X_{i})\\ \leq C\,(b_{n}^{2}\,p^{4}\,2^{-n+p}h_{n}^{2d}{\bf 1}_{\{2\alpha^{2}\leq 1\}}+b_{n}^{2}p^{3}2^{-n+p}(2\alpha^{2})^{2p}h_{n}^{2d}{\bf 1}_{\{2\alpha^{2}>1\}})\rightarrow 0\quad\text{as $n\rightarrow+\infty.$}

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{6,p-\ell}|(X_{i})$

Using Assumption (2.9) and $(i)$ and $(ii)$ of Assumption 3.8 for the second inequality, we get

	$\displaystyle\|\psi_{6,n}\|$	$\displaystyle\leq C\,2^{3(p-\ell)}\sum_{k=1}^{p-\ell-1}\sum_{r=0}^{k-1}2^{-k-r}{\mathcal{Q}}^{r}{\mathcal{P}}\big{(}{\mathcal{Q}}^{k-r-1}{\mathcal{P}}\big{(}\|{\mathcal{Q}}^{p-\ell-k-1}\tilde{f}_{\ell,n}\|\otimes^{2}\big{)}\otimes_{\rm sym}{\mathcal{Q}}^{p-\ell-r-1}(\tilde{f}^{2}_{\ell,n})\big{)}$
		$\displaystyle\leq C\,2^{3(p-\ell)}\sum_{k=1}^{p-\ell-1}\sum_{r=0}^{k-1}2^{-k-r}h_{n}^{d}\alpha^{2(p-\ell-k)}$
		$\displaystyle\leq C\,h_{n}^{d}\,2^{2(p-\ell)}\big{(}{\bf 1}_{\{2\alpha^{2}<1\}}\,+\,(p-\ell)\,{\bf 1}_{\{2\alpha^{2}=1\}}\,+\,(2\alpha^{2})^{p-\ell}{\bf 1}_{\{2\alpha^{2}>1\}}\big{)}.$

Using (6) and (5), this implies that

b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{6,p-\ell}|(X_{i})\\ \leq C\,(b_{n}^{2}\,p^{4}\,2^{-n+p}h_{n}^{d}{\bf 1}_{\{2\alpha^{2}\leq 1\}}+b_{n}^{2}p^{3}2^{-n+p}(2\alpha^{2})^{2p}h_{n}^{d}{\bf 1}_{\{2\alpha^{2}>1\}})\rightarrow 0\quad\text{as $n\rightarrow+\infty.$}

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{7,p-\ell}|(X_{i})$

In the same way as for $\psi_{6,p-\ell}$ , we have

b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{7,p-\ell}|(X_{i})\\ \leq C\,(b_{n}^{2}\,p^{4}\,2^{-n}h_{n}^{d}{\bf 1}_{\{2\alpha\leq 1\}}+b_{n}^{2}p^{3}2^{-n+p}(2\alpha^{2})^{2p}h_{n}^{d}{\bf 1}_{\{2\alpha>1\}})\rightarrow 0\quad\text{as $n\rightarrow+\infty.$}

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{8,p-\ell}|(X_{i})$

Using Assumption (2.9) and $(i)$ and $(ii)$ of Assumption 3.8 for the second inequality, we get

	$\displaystyle\|\psi_{8,p-\ell}\|$	$\displaystyle\leq C\,2^{4(p-\ell)}\,\sum_{k=2}^{p-\ell-1}\sum_{r=1}^{k-1}\sum_{j=0}^{r-1}2^{-k-r-j}{\mathcal{Q}}^{j}{\mathcal{P}}\big{(}{\mathcal{Q}}^{r-j-1}{\mathcal{P}}\big{(}\|{\mathcal{Q}}^{p-\ell-1-r}\tilde{f}_{\ell,n}\|\otimes^{2}\big{)}$
		$\displaystyle\hskip 142.26378pt\otimes_{\rm sym}{\mathcal{Q}}^{k-j-1}{\mathcal{P}}\big{(}\|{\mathcal{Q}}^{p-\ell-1-k}\tilde{f}_{\ell,n}\|\otimes^{2}\big{)}\big{)}$
		$\displaystyle\leq C\,2^{4(p-\ell)}\,\sum_{k=2}^{p-\ell-1}\sum_{r=1}^{k-1}\sum_{j=0}^{r-1}2^{-k-r-j}h_{n}^{2d}\alpha^{4(p-\ell)-2r-2k}$
		$\displaystyle\leq C\,h_{n}^{2d}\,2^{2(p-\ell)}\big{(}{\bf 1}_{\{2\alpha^{2}<1\}}\,+\,(p-\ell)^{2}\,{\bf 1}_{\{2\alpha^{2}=1\}}\,+\,(2\alpha^{2})^{2(p-\ell)}\,{\bf 1}_{\{2\alpha^{2}>1\}}\big{)}.$

Using (6) and (5), this implies that

b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{8,p-\ell}|(X_{i})\\ \leq C\,(b_{n}^{2}\,p^{5}\,2^{-n+p}h_{n}^{2d}{\bf 1}_{\{2\alpha^{2}\leq 1\}}+b_{n}^{2}p^{3}2^{-n+p}(2\alpha^{2})^{2p}h_{n}^{2d}{\bf 1}_{\{2\alpha^{2}>1\}})\rightarrow 0\quad\text{as $n\rightarrow+\infty.$}

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{9,p-\ell}|(X_{i})$

In the same way as for $\psi_{8,p-\ell}$ , we have

b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{9,p-\ell}|(X_{i})\\ \leq C\,(b_{n}^{2}\,p^{5}\,2^{-n+p}h_{n}^{2d}{\bf 1}_{\{2\alpha^{2}\leq 1\}}+b_{n}^{2}p^{3}2^{-n+p}(2\alpha^{2})^{2p}h_{n}^{2d}{\bf 1}_{\{2\alpha^{2}>1\}})\rightarrow 0\quad\text{as $n\rightarrow+\infty.$}

Putting together all the upper bounds for $\ell\in\{0,\ldots,p-1\}$ and using (43) and (47), we deduce that $b_{n}^{2}\,p^{3}\,2^{-2n}\sum_{\ell=0}^{p-1}\sum_{i\in\mathbb{G}_{n-p}}h_{n,\ell}(X_{i})$ is bounded by a deterministic sequence which converges to 0. As a consequence, it follows, using Remark 2.8, that

\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\Big{(}b_{n}^{2}\,p^{3}\,2^{-2n}\sum_{\ell=0}^{p-1}\sum_{i\in\mathbb{G}_{n-p}}h_{n,\ell}(X_{i})>\delta\Big{)}=-\infty.

Finally, using (43), (44), (46), we get

\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\mathbb{P}\Big{(}\sum_{i\in\mathbb{G}_{n-p}}\mathbb{E}[\Delta_{n,i}({\mathfrak{f}}_{n})^{4}|{\mathcal{F}}_{i}]>\frac{\delta}{b_{n}^{2}}\Big{)}=-\infty\quad\forall\delta>0.

∎

For Chen-Ledoux type condition, we have the following result.

Lemma 5.8.

Under the assumptions of Theorem 3.10, we have

\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\Big{(}|\mathbb{G}_{n}|\sup_{i\in\mathbb{G}_{n-p}}\mathbb{P}_{{\mathcal{F}}_{i}}\Big{(}|\Delta_{n,i}({\mathfrak{f}}_{n})|>b_{n}\Big{)}\Big{)}=-\infty.

Proof.

For all $i\in\mathbb{G}_{n-p}$ , using (21) we have

(48)

\mathbb{P}_{{\mathcal{F}}_{i}}\left(|\Delta_{n,i}({\mathfrak{f}})|>b_{n}\sqrt{n}\right)\leq\mathbb{P}_{{\mathcal{F}}_{i}}\left(|N_{n,i}({\mathfrak{f}})|>b_{n}\sqrt{n}/2\right)+\mathbb{P}_{{\mathcal{F}}_{i}}\left(|\mathbb{E}_{X_{i}}\left[N_{n,i}({\mathfrak{f}})\right]|>b_{n}\sqrt{n}/2\right),

with $N_{n,i}({\mathfrak{f}})$ defined in (22). Following the proof of (32), we get

	$\displaystyle\mathbb{P}_{{\mathcal{F}}_{i}}\Big{(}\|N_{n,i}({\mathfrak{f}})\|>\frac{b_{n}\sqrt{n}}{2}\Big{)}$	$\displaystyle=\mathbb{P}_{X_{i}}\Big{(}\|\sum_{\ell=0}^{p}M_{\mathbb{G}_{p-\ell}}(\tilde{f}_{\ell},n)\|>\frac{b_{n}\sqrt{\|\mathbb{G}_{n}\|}}{2}\Big{)}$
		$\displaystyle\leq C\exp\Big{(}-\frac{b_{n}^{2}\|\mathbb{G}_{n}\|}{2(Cb_{n}\|\mathbb{G}_{n}\|^{1/2}h_{n}^{-d/2}+6C\|\mathbb{G}_{p}\|)}\Big{)}.$

Next, for

\lambda=\frac{b_{n}\sqrt{|\mathbb{G}_{n}|}}{2(c_{2}h_{n}^{-d/2}b_{n}\sqrt{n}+3c_{1}|\mathbb{G}_{p}|)},

we have

	$\displaystyle\mathbb{P}_{{\mathcal{F}}_{i}}\Big{(}\mathbb{E}_{X_{i}}\left[N_{n,i}({\mathfrak{f}})\right]>\frac{b_{n}\sqrt{n}}{2}\Big{)}$	$\displaystyle=\mathbb{P}_{X_{i}}\Big{(}\sum_{\ell=0}^{p}2^{p-\ell}{\mathcal{Q}}^{p-\ell}(\tilde{f}_{\ell})(X_{i})>\frac{b_{n}\sqrt{n\|\mathbb{G}_{n}\|}}{2}\Big{)}$
		$\displaystyle\leq\exp\Big{(}-\frac{\lambda b_{n}\sqrt{n\|\mathbb{G}_{n}\|}}{2}\Big{)}\mathbb{E}_{X_{i}}\Big{[}\exp\Big{(}\lambda\sum_{\ell=0}^{p}2^{p-\ell}{\mathcal{Q}}^{p-\ell}(\tilde{f}_{\ell})(X_{i})\Big{)}\Big{]}$
		$\displaystyle\leq C\exp\Big{(}-\frac{b_{n}^{2}\|\mathbb{G}_{n}\|}{2(Cb_{n}\|\mathbb{G}_{n}\|^{1/2}h_{n}^{-d/2}+6C\|\mathbb{G}_{p}\|)}\Big{)},$

where we used (49) and the branching Markov property for the first equality, Chernoff bound for the first inequality and (3) for the last inequality. Doing the same thing for $-{\mathfrak{f}}$ instead of ${\mathfrak{f}},$ we get

\mathbb{P}_{{\mathcal{F}}_{i}}\Big{(}|\mathbb{E}_{X_{i}}\left[N_{n,i}({\mathfrak{f}})\right]|>\frac{b_{n}\sqrt{n}}{2}\Big{)}\leq 2\,C\exp\Big{(}-\frac{b_{n}^{2}|\mathbb{G}_{n}|}{2(Cb_{n}|\mathbb{G}_{n}|^{1/2}h_{n}^{-d/2}+6C|\mathbb{G}_{p}|)}\Big{)}.

From the foregoing, we get, using (48),

|\mathbb{G}_{n}|\sup_{i\in\mathbb{G}_{n-p}}\mathbb{P}_{{\mathcal{F}}_{i}}\left(|\Delta_{n,i}({\mathfrak{f}})|>b_{n}\sqrt{n}\right)\leq C\,|\mathbb{G}_{n}|\,\exp\Big{(}-\frac{b_{n}^{2}|\mathbb{G}_{n}|}{2(Cb_{n}|\mathbb{G}_{n}|^{1/2}h_{n}^{-d/2}+6C|\mathbb{G}_{p}|)}\Big{)}.

Finally, taking the $\log$ and dividing by $b_{n}^{2}$ in the latter inequality, we get the result of Lemma 5.8. ∎

We can now use Proposition 6.1 to deduce from Lemmas 5.6, 5.7 and 5.8 that $\Delta_{n}({\mathfrak{f}}_{n})$ satisfies a moderate deviation principle with speed $b_{n}^{2}$ and rate function $I$ defined by: $I(x)=x^{2}/(2\sigma^{2})$ for all $x\in\mathbb{R}$ , with the finite variance $\sigma^{2}$ defined in (7). Using (38) and Remark 2.7, we then deduce Theorem 3.10.

6. Appendix

We recall here a simplified version of Theorem 1 in [12]. We consider the real martingale $(M_{n},n\in\mathbb{N})$ with respect to the filtration $({\mathcal{H}}_{n},n\in\mathbb{N})$ and we denote $(\langle M\rangle_{n},n\in\mathbb{N})$ its bracket.

Proposition 6.1.

Let $(b_{n})$ a sequence satisfying

b_{n}\quad\text{is increasing},\quad b_{n}\longrightarrow+\infty,\quad\frac{b_{n}}{\sqrt{n}}\longrightarrow 0,

such that $c(n):=\sqrt{n}/b_{n}$ is non-decreasing, and define the reciprocal function $c^{-1}(t)$ by

c^{-1}(t):=\inf\{n\in\mathbb{N}:c(n)\geq t\}.

Under the following conditions:

(C1)

there exists $Q\in\mathbb{R}_{+}^{*}$ such that for all $\delta>0$ ,

$\displaystyle\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\left(\mathbb{P}\left(\left|\frac{\langle M\rangle_{n}}{n}-Q\right|>\delta\right)\right)=-\infty,$
(C2)

$\displaystyle\limsup\limits_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\left(n\quad\underset{1\leq k\leq c^{-1}(b_{n+1})}{\rm ess\,sup}\mathbb{P}(|M_{k}-M_{k-1}|>b_{n}\sqrt{n}\Big{|}\mathcal{H}_{k-1})\right)=-\infty,$
(C3)

for all $a>0$ and for all $\delta>0$ ,

$\displaystyle\limsup_{n\rightarrow+\infty}\frac{1}{b_{n}^{2}}\log\left(\mathbb{P}\left(\frac{1}{n}\sum\limits_{k=1}^{n}\mathbb{E}\left(|M_{k}-M_{k-1}|^{2}\mathbf{1}_{\{|M_{k}-M_{k-1}|\geq a\frac{\sqrt{n}}{b_{n}}\}}\Big{|}\mathcal{H}_{k-1}\right)>\delta\right)\right)=-\infty,$

$\left(M_{n}/(b_{n}\sqrt{n})\right)_{n\in{\mathbb{N}}}$ satisfies the MDP on $\mathbb{R}$ with the speed $b_{n}^{2}/n$ and rate function $\displaystyle I(x)=\frac{x^{2}}{2Q}.$

We have the following many-to-one formulas. Ideas of the proofs can be found in [16] and [3].

Lemma 6.2.

Let $f,g\in{\mathcal{B}}(S)$ , $x\in S$ and $n\geq m\geq 0$ . Assuming that all the quantities below are well defined, we have:

(49)	$\displaystyle{\mathbb{E}}_{x}\left[M_{\mathbb{G}_{n}}(f)\right]$	$\displaystyle=\|\mathbb{G}_{n}\|\,{\mathcal{Q}}^{n}f(x)=2^{n}\,{\mathcal{Q}}^{n}f(x),$
(50)	$\displaystyle{\mathbb{E}}_{x}\left[M_{\mathbb{G}_{n}}(f)^{2}\right]$	$\displaystyle=2^{n}\,{\mathcal{Q}}^{n}(f^{2})(x)+\sum_{k=0}^{n-1}2^{n+k}\,{\mathcal{Q}}^{n-k-1}\left({\mathcal{P}}\left({\mathcal{Q}}^{k}f\otimes{\mathcal{Q}}^{k}f\right)\right)(x),$
(51)	$\displaystyle{\mathbb{E}}_{x}\left[M_{\mathbb{G}_{n}}(f)M_{\mathbb{G}_{m}}(g)\right]$	$\displaystyle=2^{n}{\mathcal{Q}}^{m}\left(g{\mathcal{Q}}^{n-m}f\right)(x)$
	$\displaystyle\hskip 56.9055pt+\sum_{k=0}^{m-1}2^{n+k}\,{\mathcal{Q}}^{m-k-1}\left({\mathcal{P}}\left({\mathcal{Q}}^{k}g\otimes_{\rm sym}{\mathcal{Q}}^{n-m+k}f\right)\right)(x).$

We recall the following result due to Bochner (see [21, Theorem 1A] which can be easily extended to any dimension $d\geq 1$ ).

Lemma 6.3.

Let $(h_{n},n\in\mathbb{N})$ be a sequence of positive numbers converging to $0$ as $n$ goes to infinity. Let $g:\mathbb{R}^{d}\rightarrow\mathbb{R}$ be a measurable function such that $\int_{\mathbb{R}^{d}}|g(x)|dx<+\infty$ . Let $f:\mathbb{R}^{d}\rightarrow\mathbb{R}$ be a measurable function such that $\mathop{\parallel\!f\!\parallel}\nolimits_{\infty}<+\infty$ , $\int_{{\mathbb{R}}^{d}}|f(y)|\,dy<+\infty$ and $\lim_{|x|\rightarrow+\infty}|x|f(x)=0$ . Define

g_{n}(x)=h_{n}^{-d}\int_{\mathbb{R}^{d}}f(h_{n}^{-1}(x-y))g(y)dy.

Then, we have at every point $x$ of continuity of $g$ ,

\lim_{n\rightarrow+\infty}g_{n}(x)=g(x)\int_{\mathbb{R}}f(y)dy.

We also give some bounds on ${\mathbb{E}}_{x}\left[M_{\mathbb{G}_{n}}(f)^{4}\right]$ , see the proof of Theorem 2.1 in [3]. We will use the notation:

g\otimes^{2}=g\otimes g.

Lemma 6.4.

There exists a finite constant $C$ such that for all $f\in{\mathcal{B}}(S)$ , $n\in{\mathbb{N}}$ and $\nu$ a probability measure on $S$ , assuming that all the quantities below are well defined, there exist functions $\psi_{j,n}$ for $1\leq j\leq 9$ such that:

{\mathbb{E}}_{\nu}\left[M_{\mathbb{G}_{n}}(f)^{4}\right]=\sum_{j=1}^{9}\langle\nu,\psi_{j,n}\rangle,

and, with $h_{k}={\mathcal{Q}}^{k-1}(f)$ and (notice that either $|\psi_{j}|$ or $|\langle\nu,\psi_{j}\rangle|$ is bounded), writing $\nu g=\langle\nu,g\rangle$ :

	$\displaystyle\|\psi_{1,n}\|$	$\displaystyle\leq C\,2^{n}{\mathcal{Q}}^{n}(f^{4}),$
	$\displaystyle\|\nu\psi_{2,n}\|$	$\displaystyle\leq C\,2^{2n}\,\sum_{k=0}^{n-1}2^{-k}\|\nu{\mathcal{Q}}^{k}{\mathcal{P}}\left({\mathcal{Q}}^{n-k-1}(f^{3})\otimes_{\rm sym}h_{n-k}\right)\|,$
	$\displaystyle\|\psi_{3,n}\|$	$\displaystyle\leq C2^{2n}\sum_{k=0}^{n-1}2^{-k}\,{\mathcal{Q}}^{k}{\mathcal{P}}\left({\mathcal{Q}}^{n-k-1}(f^{2})\otimes^{2}\right),$
	$\displaystyle\|\psi_{4,n}\|$	$\displaystyle\leq C\,2^{4n}\,{\mathcal{P}}\left(\|{\mathcal{P}}(h_{n-1}\otimes^{2})\otimes^{2}\|\right),$
	$\displaystyle\|\psi_{5,n}\|$	$\displaystyle\leq C\,2^{4n}\,\sum_{k=2}^{n-1}\sum_{r=0}^{k-1}2^{-2k-r}{\mathcal{Q}}^{r}{\mathcal{P}}\left({\mathcal{Q}}^{k-r-1}\|{\mathcal{P}}(h_{n-k}\otimes^{2})\|\otimes^{2}\right),$
	$\displaystyle\|\psi_{6,n}\|$	$\displaystyle\leq C\,2^{3n}\,\sum_{k=1}^{n-1}\sum_{r=0}^{k-1}2^{-k-r}{\mathcal{Q}}^{r}\|{\mathcal{P}}\left({\mathcal{Q}}^{k-r-1}{\mathcal{P}}\left(h_{n-k}\otimes^{2}\right)\otimes_{\rm sym}{\mathcal{Q}}^{n-r-1}(f^{2})\right)\|,$
	$\displaystyle\|\nu\psi_{7,n}\|$	$\displaystyle\leq C\,2^{3n}\,\sum_{k=1}^{n-1}\sum_{r=0}^{k-1}2^{-k-r}\|\nu{\mathcal{Q}}^{r}{\mathcal{P}}\left({\mathcal{Q}}^{k-r-1}{\mathcal{P}}\left(h_{n-k}\otimes_{\rm sym}{\mathcal{Q}}^{n-k-1}(f^{2})\right)\otimes_{\rm sym}h_{n-r}\right)\|,$
	$\displaystyle\|\psi_{8,n}\|$	$\displaystyle\leq C\,2^{4n}\,\sum_{k=2}^{n-1}\sum_{r=1}^{k-1}\sum_{j=0}^{r-1}2^{-k-r-j}{\mathcal{Q}}^{j}{\mathcal{P}}\left(\|{\mathcal{Q}}^{r-j-1}{\mathcal{P}}\left(h_{n-r}\otimes^{2}\right)\|\otimes_{\rm sym}\|{\mathcal{Q}}^{k-j-1}{\mathcal{P}}\left(h_{n-k}\otimes^{2}\right)\|\right),$
	$\displaystyle\|\psi_{9,n}\|$	$\displaystyle\leq C\,2^{4n}\,\sum_{k=2}^{n-1}\sum_{r=1}^{k-1}\sum_{j=0}^{r-1}2^{-k-r-j}{\mathcal{Q}}^{j}\|{\mathcal{P}}\left({\mathcal{Q}}^{r-j-1}\|{\mathcal{P}}\left(h_{n-r}\otimes_{\rm sym}{\mathcal{Q}}^{k-r-1}{\mathcal{P}}\left(h_{n-k}\otimes^{2}\right)\right)\otimes_{\rm sym}h_{n-j}\right)\|.$

References

[1] I. V. Basawa and J. Zhou. Non-Gaussian bifurcating models and quasi-likelihood estimation. Adv. in Appl. Probab., 41(A):55–64, 2004.
[2] S. V. Bitseki Penda and J.-F. Delmas. Central limit theorem for kernel estimator of invariant density in bifurcating markov chains models. arXiv preprint arXiv:2106.08626, 2021.
[3] S. V. Bitseki Penda, H. Djellout, and A. Guillin. Deviation inequalities, moderate deviations and some limit theorems for bifurcating Markov chains with application. Ann. Appl. Probab., 24(1):235–291, 2014.
[4] S. V. Bitseki Penda and G. Gackou. Moderate deviation principles for bifurcating markov chains: case of functions dependent of one variable. arXiv e-prints, pages arXiv–2105, 2021.
[5] S. V. Bitseki Penda, M. Hoffmann, and A. Olivier. Adaptive estimation for bifurcating Markov chains. Bernoulli, 23(4B):3598–3637, 2017.
[6] S. V. Bitseki Penda and A. Olivier. Autoregressive functions estimation in nonlinear bifurcating autoregressive models. Stat. Inference Stoch. Process., 20(2):179–210, 2017.
[7] S. V. Bitseki Penda and A. Olivier. Moderate deviation principle in nonlinear bifurcating autoregressive models. Statistics & Probability Letters, 138:20–26, 2018.
[8] S. V. Bitseki Penda and A. Roche. Local bandwidth selection for kernel density estimation in a bifurcating Markov chain model. Journal of Nonparametric Statistics, 0(0):1–28, 2020.
[9] R. Cowan and R. Staudte. The bifurcating autoregression model in cell lineage studies. Biometrics, 42(4):769–783, December 1986.
[10] J.-F. Delmas and L. Marsalle. Detection of cellular aging in a Galton-Watson process. Stochastic Process. Appl., 120(12):2495–2519, 2010.
[11] A. Dembo and O. Zeitouni. Large Deviations Techniques and Applications. Applications of mathematics. Springer, 1998.
[12] H. Djellout. Moderate deviations for martingale differences and applications to o-mixing sequences. Stochastics and stochastics reports, 73(1):37–64, 2002.
[13] M. Doumic, M. Hoffmann, N. Krell, and L. Robert. Statistical estimation of a growth-fragmentation model observed on a genealogical tree. Bernoulli, 21(3):1760–1799, 2015.
[14] M. Duflo. Random iterative models, volume 34. Springer Science & Business Media, 2013.
[15] F. Gao. Moderate deviations and large deviations for kernel density estimators. Journal of Theoretical Probability, 16(2):401–418, 2003.
[16] J. Guyon. Limit theorems for bifurcating Markov chains. Application to the detection of cellular aging. Ann. Appl. Probab., 17(5-6):1538–1569, 2007.
[17] M. Hoffmann and A. Marguet. Statistical estimation in a randomly structured branching population. Stochastic Process. Appl., 129(12):5236–5277, 2019.
[18] E. Masry. Recursive probability density estimation for weakly dependent stationary processes. IEEE Transactions on Information Theory, 32(2):254–267, 1986.
[19] A. Mokkadem and M. Pelletier. Confidence bands for densities, logarithmic point of view. Alea, 2:231–266, 2006.
[20] A. Mokkadem, M. Pelletier, and J. Worms. Large and moderate deviations principles for kernel estimation of a multivariate density and its partial derivatives. Australian & New Zealand Journal of Statistics, 47(4):489–502, 2005.
[21] E. Parzen. On estimation of a probability density function and mode. The Annals of Mathematical Statistics, 33(3):1065–1076, 1962.
[22] G. G. Roussas. Nonparametric estimation in Markov processes. Annals of the Institute of Statistical Mathematics, 21(1):73–87, 1969.

	$\displaystyle\|\mathbb{T}_{n}\|^{1/2}h_{n}^{d/2}B_{h_{n}}(x)$	$\displaystyle=\sqrt{\|\mathbb{T}_{n}\|h_{n}^{d}}\,\,\Big{\|}\int_{\mathbb{R}^{d}}h_{n}^{-d}K(h_{n}^{-1}(x-y))\mu(y)dy-\mu(x)\Big{\|}$
		$\displaystyle=\sqrt{\|\mathbb{T}_{n}\|h_{n}^{d}}\,\,\Big{\|}\int_{\mathbb{R}^{d}}K(y)(\mu(x-h_{n}y)-\mu(x))\,dy\Big{\|}$
		$\displaystyle\leq C\sqrt{\|\mathbb{T}_{n}\|h_{n}^{d}}\,\,\sum_{j=1}^{d}\,\,\int_{\mathbb{R}^{d}}K(y)\frac{(h_{n}\|y_{j}\|)^{s}}{\lfloor s\rfloor!}dy$
		$\displaystyle\leq C\sqrt{\|\mathbb{T}_{n}\|h_{n}^{2s+d}}.$

	$\displaystyle\|\psi_{8,p-\ell}\|$	$\displaystyle\leq C\,2^{4(p-\ell)}\,\sum_{k=2}^{p-\ell-1}\sum_{r=1}^{k-1}\sum_{j=0}^{r-1}2^{-k-r-j}{\mathcal{Q}}^{j}{\mathcal{P}}\big{(}{\mathcal{Q}}^{r-j-1}{\mathcal{P}}\big{(}\|{\mathcal{Q}}^{p-\ell-1-r}\tilde{f}_{\ell,n}\|\otimes^{2}\big{)}$
		$\displaystyle\hskip 142.26378pt\otimes_{\rm sym}{\mathcal{Q}}^{k-j-1}{\mathcal{P}}\big{(}\|{\mathcal{Q}}^{p-\ell-1-k}\tilde{f}_{\ell,n}\|\otimes^{2}\big{)}\big{)}$
		$\displaystyle\leq C\,2^{4(p-\ell)}\,\sum_{k=2}^{p-\ell-1}\sum_{r=1}^{k-1}\sum_{j=0}^{r-1}2^{-k-r-j}h_{n}^{2d}\alpha^{4(p-\ell)-2r-2k}$
		$\displaystyle\leq C\,h_{n}^{2d}\,2^{2(p-\ell)}\big{(}{\bf 1}_{\{2\alpha^{2}<1\}}\,+\,(p-\ell)^{2}\,{\bf 1}_{\{2\alpha^{2}=1\}}\,+\,(2\alpha^{2})^{2(p-\ell)}\,{\bf 1}_{\{2\alpha^{2}>1\}}\big{)}.$

	$\displaystyle\mathbb{P}_{{\mathcal{F}}_{i}}\Big{(}\mathbb{E}_{X_{i}}\left[N_{n,i}({\mathfrak{f}})\right]>\frac{b_{n}\sqrt{n}}{2}\Big{)}$	$\displaystyle=\mathbb{P}_{X_{i}}\Big{(}\sum_{\ell=0}^{p}2^{p-\ell}{\mathcal{Q}}^{p-\ell}(\tilde{f}_{\ell})(X_{i})>\frac{b_{n}\sqrt{n\|\mathbb{G}_{n}\|}}{2}\Big{)}$
		$\displaystyle\leq\exp\Big{(}-\frac{\lambda b_{n}\sqrt{n\|\mathbb{G}_{n}\|}}{2}\Big{)}\mathbb{E}_{X_{i}}\Big{[}\exp\Big{(}\lambda\sum_{\ell=0}^{p}2^{p-\ell}{\mathcal{Q}}^{p-\ell}(\tilde{f}_{\ell})(X_{i})\Big{)}\Big{]}$
		$\displaystyle\leq C\exp\Big{(}-\frac{b_{n}^{2}\|\mathbb{G}_{n}\|}{2(Cb_{n}\|\mathbb{G}_{n}\|^{1/2}h_{n}^{-d/2}+6C\|\mathbb{G}_{p}\|)}\Big{)},$

	$\displaystyle\|\psi_{1,n}\|$	$\displaystyle\leq C\,2^{n}{\mathcal{Q}}^{n}(f^{4}),$
	$\displaystyle\|\nu\psi_{2,n}\|$	$\displaystyle\leq C\,2^{2n}\,\sum_{k=0}^{n-1}2^{-k}\|\nu{\mathcal{Q}}^{k}{\mathcal{P}}\left({\mathcal{Q}}^{n-k-1}(f^{3})\otimes_{\rm sym}h_{n-k}\right)\|,$
	$\displaystyle\|\psi_{3,n}\|$	$\displaystyle\leq C2^{2n}\sum_{k=0}^{n-1}2^{-k}\,{\mathcal{Q}}^{k}{\mathcal{P}}\left({\mathcal{Q}}^{n-k-1}(f^{2})\otimes^{2}\right),$
	$\displaystyle\|\psi_{4,n}\|$	$\displaystyle\leq C\,2^{4n}\,{\mathcal{P}}\left(\|{\mathcal{P}}(h_{n-1}\otimes^{2})\otimes^{2}\|\right),$
	$\displaystyle\|\psi_{5,n}\|$	$\displaystyle\leq C\,2^{4n}\,\sum_{k=2}^{n-1}\sum_{r=0}^{k-1}2^{-2k-r}{\mathcal{Q}}^{r}{\mathcal{P}}\left({\mathcal{Q}}^{k-r-1}\|{\mathcal{P}}(h_{n-k}\otimes^{2})\|\otimes^{2}\right),$
	$\displaystyle\|\psi_{6,n}\|$	$\displaystyle\leq C\,2^{3n}\,\sum_{k=1}^{n-1}\sum_{r=0}^{k-1}2^{-k-r}{\mathcal{Q}}^{r}\|{\mathcal{P}}\left({\mathcal{Q}}^{k-r-1}{\mathcal{P}}\left(h_{n-k}\otimes^{2}\right)\otimes_{\rm sym}{\mathcal{Q}}^{n-r-1}(f^{2})\right)\|,$
	$\displaystyle\|\nu\psi_{7,n}\|$	$\displaystyle\leq C\,2^{3n}\,\sum_{k=1}^{n-1}\sum_{r=0}^{k-1}2^{-k-r}\|\nu{\mathcal{Q}}^{r}{\mathcal{P}}\left({\mathcal{Q}}^{k-r-1}{\mathcal{P}}\left(h_{n-k}\otimes_{\rm sym}{\mathcal{Q}}^{n-k-1}(f^{2})\right)\otimes_{\rm sym}h_{n-r}\right)\|,$
	$\displaystyle\|\psi_{8,n}\|$	$\displaystyle\leq C\,2^{4n}\,\sum_{k=2}^{n-1}\sum_{r=1}^{k-1}\sum_{j=0}^{r-1}2^{-k-r-j}{\mathcal{Q}}^{j}{\mathcal{P}}\left(\|{\mathcal{Q}}^{r-j-1}{\mathcal{P}}\left(h_{n-r}\otimes^{2}\right)\|\otimes_{\rm sym}\|{\mathcal{Q}}^{k-j-1}{\mathcal{P}}\left(h_{n-k}\otimes^{2}\right)\|\right),$
	$\displaystyle\|\psi_{9,n}\|$	$\displaystyle\leq C\,2^{4n}\,\sum_{k=2}^{n-1}\sum_{r=1}^{k-1}\sum_{j=0}^{r-1}2^{-k-r-j}{\mathcal{Q}}^{j}\|{\mathcal{P}}\left({\mathcal{Q}}^{r-j-1}\|{\mathcal{P}}\left(h_{n-r}\otimes_{\rm sym}{\mathcal{Q}}^{k-r-1}{\mathcal{P}}\left(h_{n-k}\otimes^{2}\right)\right)\otimes_{\rm sym}h_{n-j}\right)\|.$

Moderate deviation principles for kernel estimator of invariant density in bifurcating Markov chains models.

Abstract.

1. Introduction

2. The model of bifurcating Markov chain and definition of the estimators

2.1. The regular binary tree associated to BMC models

2.2. The probability kernels associated to BMC models

Definition 2.1 (Bifurcating Markov Chains, see [16, 2]).

Assumption 2.2.

Remark 2.3.

2.3. Kernel estimator of the invariant density μ\mu

2.4. Moderate deviation principle and related topics

Definition 2.4 (Moderate deviation principle, MDP).

Definition 2.5 (Super-exponential convergence).

Definition 2.6 (Exponential equivalence, see [11], Chap 4).

Remark 2.7.

Remark 2.8.

Assumption 2.9.

Remark 2.10.

Assumption 2.11 (Regularity of the kernel function and the bandwidth).

Assumption 2.12 (Further regularity on the density μ\mu, the kernel function and the bandwidths).

Assumption 2.13.

Remark 2.14.

3. Main result

3.1. Moderate deviation principle for μ^𝔸n\widehat{\mu}_{{\mathbb{A}}_{n}}

Lemma 3.1.

Theorem 3.2.

Theorem 3.3.

Corollary 3.4.

Remark 3.5.

Corollary 3.6.

Remark 3.7.

3.2. Moderate deviation principle for additive functionals of BMCs

Assumption 3.8.

Remark 3.9.

Theorem 3.10.

Remark 3.11.

Remark 3.12.

4. Proof of Lemma 3.1, Theorems 3.2 and 3.3 and Corollary 3.6

4.1. Proof of Lemma 3.1

4.2. Proof of Theorem 3.2

4.3. Proof of Theorem 3.3

4.4. Proof of Corollary 3.6

5. Proof of Theorem 3.10

Lemma 5.1.

Proof.

Proof of (29)

Proof of (30)

Lemma 5.2.

Proof.

Lemma 5.3.

Proof.

Lemma 5.4.

Proof.

Proof of (41)

Proof of (42)

Lemma 5.5.

Proof.

Lemma 5.6.

Lemma 5.7.

Proof.

Upper bound of bn2​|𝔾n|−2​p3​∑ℓ=0p∑i∈𝔾n−p|ψ1,p−ℓ|​(Xi)b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{1,p-\ell}|(X_{i})

Upper bound of bn2​|𝔾n|−3​p3​∑ℓ=0p∑i∈𝔾n−p|ψ2,p−ℓ|​(Xi)b_{n}^{2}|\mathbb{G}_{n}|^{-3}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{2,p-\ell}|(X_{i})

Upper bound of bn2​|𝔾n|−2​p3​∑ℓ=0p∑i∈𝔾n−p|ψ3,p−ℓ|​(Xi)b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{3,p-\ell}|(X_{i})

Upper bound of bn2​|𝔾n|−2​p3​∑ℓ=0p∑i∈𝔾n−p|ψ4,p−ℓ|​(Xi)b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{4,p-\ell}|(X_{i})

Upper bound of bn2​|𝔾n|−3​p3​∑ℓ=0p∑i∈𝔾n−p|ψ5,p−ℓ|​(Xi)b_{n}^{2}|\mathbb{G}_{n}|^{-3}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{5,p-\ell}|(X_{i})

Upper bound of bn2​|𝔾n|−2​p3​∑ℓ=0p∑i∈𝔾n−p|ψ6,p−ℓ|​(Xi)b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{6,p-\ell}|(X_{i})

Upper bound of bn2​|𝔾n|−2​p3​∑ℓ=0p∑i∈𝔾n−p|ψ7,p−ℓ|​(Xi)b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{7,p-\ell}|(X_{i})

Upper bound of bn2​|𝔾n|−2​p3​∑ℓ=0p∑i∈𝔾n−p|ψ8,p−ℓ|​(Xi)b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{8,p-\ell}|(X_{i})

Upper bound of bn2​|𝔾n|−2​p3​∑ℓ=0p∑i∈𝔾n−p|ψ9,p−ℓ|​(Xi)b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{9,p-\ell}|(X_{i})

Lemma 5.8.

Proof.

6. Appendix

Proposition 6.1.

Lemma 6.2.

Lemma 6.3.

Lemma 6.4.

References

2.3. Kernel estimator of the invariant density $\mu$

Assumption 2.12 (Further regularity on the density $\mu$ , the kernel function and the bandwidths).

3.1. Moderate deviation principle for $\widehat{\mu}_{{\mathbb{A}}_{n}}$

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{1,p-\ell}|(X_{i})$

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-3}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{2,p-\ell}|(X_{i})$

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{3,p-\ell}|(X_{i})$

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{4,p-\ell}|(X_{i})$

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-3}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{5,p-\ell}|(X_{i})$

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{6,p-\ell}|(X_{i})$

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{7,p-\ell}|(X_{i})$

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{8,p-\ell}|(X_{i})$

Upper bound of $b_{n}^{2}|\mathbb{G}_{n}|^{-2}p^{3}\sum_{\ell=0}^{p}\sum_{i\in\mathbb{G}_{n-p}}|\psi_{9,p-\ell}|(X_{i})$