
Compound Poisson particle approximation for McKean-Vlasov SDEs

Xicheng Zhang, School of Mathematics and Statistics, Beijing Institute of Technology, Beijing 100081, China
Email: XichengZhang@gmail.com
Abstract.

We present a comprehensive discretization scheme for linear and nonlinear stochastic differential equations (SDEs) driven by either Brownian motions or $\alpha$-stable processes. Our approach utilizes compound Poisson particle approximations, allowing for simultaneous discretization of both the time and space variables in McKean-Vlasov SDEs. Notably, the approximating processes can be represented as Markov chains with values on a lattice. Importantly, we demonstrate the propagation of chaos under relatively mild assumptions on the coefficients, including those with polynomial growth. This result establishes the convergence of the particle approximations towards the true solutions of the McKean-Vlasov SDEs. Since we impose only moment conditions on the intensity measure of the compound Poisson processes, our approximation is universal. In the case of ordinary differential equations (ODEs), we investigate the scenario where the drift term satisfies a one-sided Lipschitz assumption, and prove the optimal convergence rate for Filippov solutions in this setting. Additionally, we establish a functional central limit theorem (CLT) for the approximation of ODEs and show the convergence of invariant measures for linear SDEs. As a practical application, we construct a compound Poisson approximation for the 2D Navier-Stokes equations on the torus and demonstrate the optimal convergence rate.


Keywords: Compound Poisson approximation, McKean-Vlasov stochastic differential equation, Invariant measure, Navier-Stokes equation, Central limit theorem.

AMS 2010 Mathematics Subject Classification: 65C35, 60H10, 35Q30

This work is partially supported by the NNSFC grant of China (No. 12131019) and the German Research Foundation (DFG) through the Collaborative Research Centre (CRC) 1283 “Taming uncertainty and profiting from randomness and low regularity in analysis, stochastics and their applications”.

1. Introduction

Let $\sigma:\mathbb{R}_+\times\mathbb{R}^d\times\mathbb{R}^d\to\mathbb{R}^d\otimes\mathbb{R}^d$ and $b:\mathbb{R}_+\times\mathbb{R}^d\times\mathbb{R}^d\to\mathbb{R}^d$ be two Borel measurable functions. Throughout this paper, for a probability measure $\mu$ over $\mathbb{R}^d$, we write

\sigma[t,x,\mu]:=\int_{\mathbb{R}^d}\sigma(t,x,y)\,\mu(\mathrm{d}y),\qquad b[t,x,\mu]:=\int_{\mathbb{R}^d}b(t,x,y)\,\mu(\mathrm{d}y).

Fix $\alpha\in(0,2]$ and consider the following McKean-Vlasov SDE, or distribution-dependent SDE (abbreviated as DDSDE):

\mathrm{d}X_t=\sigma[t,X_t,\mu_t]\,\mathrm{d}L^{(\alpha)}_t+b[t,X_t,\mu_t]\,\mathrm{d}t, \qquad (1.1)

where $\mu_t=\mu_{X_t}$ denotes the probability distribution of $X_t$, $L^{(2)}_t=W_t$ stands for a $d$-dimensional standard Brownian motion, and for $\alpha\in(0,2)$, $L^{(\alpha)}_t$ is a symmetric and rotationally invariant $\alpha$-stable process with infinitesimal generator $\Delta^{\alpha/2}$ (the usual fractional Laplacian).

In the literature, DDSDE (1.1) is also regarded as a nonlinear SDE, due to the dependence of its coefficients on the distribution of the solution. By applying Itô’s formula, $\mu_t$ solves the following nonlinear Fokker-Planck equation in the distributional sense:

\partial_t\mu_t=\mathscr{L}^*_{t,\mu_t}\mu_t,

where $\mathscr{L}^*_{t,\mu}$ is the adjoint operator of the (local/nonlocal) generator of SDE (1.1): for $\alpha=2$,

\mathscr{L}_{t,\mu}f(x):=\tfrac{1}{2}\mathrm{tr}\big(\sigma[t,x,\mu]\sigma^*[t,x,\mu]\cdot\nabla^2 f(x)\big)+b[t,x,\mu]\cdot\nabla f(x),

and for $\alpha\in(0,2)$,

\mathscr{L}_{t,\mu}f(x):=\mathrm{p.v.}\int_{\mathbb{R}^d}\frac{f(x+\sigma[t,x,\mu]z)-f(x)}{|z|^{d+\alpha}}\,\mathrm{d}z+b[t,x,\mu]\cdot\nabla f(x),

where $\sigma^*$ stands for the transpose of the matrix $\sigma$ and p.v. for the Cauchy principal value.

The study of the nonlinear SDE (1.1) driven by Brownian motions was initiated in the seminal work of McKean [31], which established a natural connection between nonlinear Markov processes and nonlinear parabolic equations. Since then, the McKean-Vlasov SDE has evolved into a fundamental mathematical framework, offering a powerful tool for analyzing complex systems comprising a large number of interacting particles. The McKean-Vlasov SDE describes the dynamics of a single particle influenced by the collective behavior of the entire system. Its applications have expanded across various fields, including statistical physics, stochastic analysis, economics, and biology. Through the study of the McKean-Vlasov SDE, researchers have gained significant understanding of diverse phenomena, ranging from the behavior of particles in statistical mechanics to intricate dynamics in economic and biological systems. Its utility extends beyond theoretical investigations, playing a vital role in the development of numerical methods, data analysis techniques, and decision-making models. For a more comprehensive overview and references, the survey paper [6] provides valuable insights into the McKean-Vlasov SDE and its wide-ranging applications.

When $b$ and $\sigma$ satisfy the following Lipschitz assumption

\|\sigma(t,x,y)-\sigma(t,x',y')\|+|b(t,x,y)-b(t,x',y')|\leqslant\kappa(|x-x'|+|y-y'|), \qquad (1.2)

it is well known that for any initial value $X_0$, there is a unique strong solution to DDSDE (1.1) (see [40], [6]). From the perspective of Monte Carlo simulation and practical applications, the McKean-Vlasov SDE (1.1) is often approximated by an interacting particle system. In the case of Brownian motions ($\alpha=2$), the approximation takes the following form: for fixed $N\in\mathbb{N}$, let $\mathbf{X}^N:=(X^{N,1},\cdots,X^{N,N})$ solve the following SDE in $\mathbb{R}^{Nd}$:

\mathrm{d}X^{N,i}_t=\sigma[t,X^{N,i}_t,\mu_{\mathbf{X}^N_t}]\,\mathrm{d}W^i_t+b[t,X^{N,i}_t,\mu_{\mathbf{X}^N_t}]\,\mathrm{d}t, \qquad (1.3)

where $\{W^i,\,i=1,2,\cdots\}$ is a sequence of i.i.d. Brownian motions, and for a point $\mathbf{x}=(x^1,\cdots,x^N)\in(\mathbb{R}^d)^N$, the empirical measure of $\mathbf{x}$ is defined by

\mu_{\mathbf{x}}(\mathrm{d}z):=\frac{1}{N}\sum_{i=1}^N\delta_{x^i}(\mathrm{d}z)\in\mathcal{P}(\mathbb{R}^d),

where $\delta_{x^i}$ is the usual Dirac measure concentrated at the point $x^i$. Under the Lipschitz assumption (1.2), it is well known that for any $T>0$ there is a constant $C>0$ such that for all $N\in\mathbb{N}$,

\sup_{i=1,\cdots,N}\mathbb{E}\Big(\sup_{t\in[0,T]}|X^{N,i}_t-\bar{X}^i_t|^2\Big)\leqslant\frac{C}{N},

where $\bar{X}^i_t$ solves SDE (1.1) driven by the Brownian motion $W^i$. Since the $\bar{X}^i$, $i\in\mathbb{N}$, are independent, the above estimate indicates that the particles become statistically independent as $N\to\infty$. This property is commonly referred to as the propagation of chaos (see [40], [6]). Furthermore, the fluctuation

\eta^N_t:=\sqrt{N}(\mu_{\mathbf{X}^N_t}-\mu_{X_t})

converges weakly to an Ornstein-Uhlenbeck process (cf. [13]). However, for numerical simulation purposes, it is still necessary to discretize the particle system (1.3) in the time direction, for instance by the explicit or implicit Euler scheme (see [25]).

The objective of this paper is to present a comprehensive discretization scheme for DDSDE (1.1). Our approximation SDE is driven by compound Poisson processes and possesses the advantage of being easily simulated on a computer. Moreover, our proposed scheme not only allows for efficient numerical simulation of the DDSDE but also provides lattice approximations for the equation.

1.1. Poisson processes approximation for ODEs

Numerical methods for ordinary differential equations (ODEs) encompass well-established techniques such as Euler’s method, the Runge-Kutta methods, and more advanced methods like the Adams-Bashforth methods and the backward differentiation formulas. These methods enable us to approximate the solution of an ODE over a given interval by evaluating the function at discrete points. In this work, we aim to develop a stochastic approximation method tailored for rough ODEs, which exhibit irregular behavior or involve coefficients that are not smooth.

Let us consider the classical ordinary differential equation (ODE)

\dot{X}_t=b(t,X_t),\qquad X_0=x\in\mathbb{R}^d. \qquad (1.4)

Suppose that the time-dependent vector field $b:\mathbb{R}_+\times\mathbb{R}^d\to\mathbb{R}^d$ satisfies the one-sided Lipschitz condition

\langle x-y,\,b(s,x)-b(s,y)\rangle\leqslant\kappa|x-y|^2\ \text{ for a.e. }(s,x,y)\in\mathbb{R}_+\times\mathbb{R}^d\times\mathbb{R}^d, \qquad (1.5)

and the linear growth assumption

|b(s,x)|\leqslant\kappa(1+|x|). \qquad (1.6)

Note that under (1.5), $b$ need not even be continuous. By smooth approximation, it is easy to see that, in the sense of distributions, (1.5) is equivalent to (see [5, Lemma 2.2])

\mathrm{Sym}(\nabla b):=\tfrac{\nabla b+(\nabla b)^*}{2}\leqslant\kappa\,\mathbb{I}, \qquad (1.7)

where $\mathbb{I}$ stands for the identity matrix. In particular, if $f:\mathbb{R}^d\to\mathbb{R}$ is a semiconvex function, that is, the Hessian matrix $\nabla^2 f$ has a lower bound in the distributional sense, then (1.5) and (1.7) hold for $b=-\nabla f$.
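For instance (a toy example of ours, not taken from the paper), the cubic drift $b(x)=-x^3$ is not globally Lipschitz, yet satisfies (1.5) with $\kappa=0$, since $\langle x-y,b(x)-b(y)\rangle=-(x-y)^2(x^2+xy+y^2)\leqslant 0$. A quick numerical check:

```python
import numpy as np

# Toy example (ours, not from the paper): b(x) = -x^3 is not globally
# Lipschitz, yet satisfies the one-sided condition (1.5) with kappa = 0,
# since <x - y, b(x) - b(y)> = -(x - y)^2 (x^2 + x y + y^2) <= 0.
def b(x):
    return -x ** 3

rng = np.random.default_rng(0)
x, y = rng.normal(size=1000), rng.normal(size=1000)
lhs = (x - y) * (b(x) - b(y))      # <x - y, b(x) - b(y)> in dimension d = 1
assert np.all(lhs <= 1e-12)        # (1.5) holds with kappa = 0
```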

When $b$ is Lipschitz continuous in $x$, it is well known that the flow $\{X_t(x),x\in\mathbb{R}^d\}_{t\geqslant 0}$ associated with ODE (1.4) is closely related to the linear transport equation

\partial_t u+b(t,x)\cdot\nabla u=0 \qquad (1.8)

and the dual continuity equation

\partial_t f+\mathrm{div}(b(t,x)f)=0. \qquad (1.9)

In [11], DiPerna and Lions established a well-posedness theory for ODE (1.4) for Lebesgue-almost every starting point $x$ by studying renormalized solutions of the linear transport equation (1.8) for $b$ with $\mathbb{W}^{1,p}$-regularity and bounded divergence, where $\mathbb{W}^{1,p}$ is the usual first-order Sobolev space and $p\geqslant 1$. Subsequently, Ambrosio [1] extended the DiPerna-Lions theory to the case $b\in BV_{loc}$ with $\mathrm{div}\,b\in L^1$ by studying the continuity equation (1.9) and using deep results from geometric measure theory. Note that these results do not apply to vector fields $b$ satisfying only the one-sided Lipschitz condition (1.5).

On the other hand, under conditions (1.5) and (1.6), the ODE (1.4) can be uniquely solved in the sense of Filippov [14], yielding a solution family $\{X_t(x),x\in\mathbb{R}^d\}_{t\geqslant 0}$ that forms a Lipschitz flow in $(t,x)$ (see Theorem 2.8 below). In a recent study, Lions and Seeger [28] investigated the relationship between the solvability of (1.8) and (1.9) and that of ODE (1.4) when $b$ satisfies (1.5) and (1.6). Condition (1.5) arises naturally in fluid dynamics (cf. [5] and [28]), optimal control theory, and viability theory (cf. [2]). From the standpoint of practical applications, it is desirable to construct an easily implementable numerical scheme. However, the direct Euler scheme is not suitable for solving ODE (1.4) when $b$ merely satisfies condition (1.5) or $\mathbb{W}^{1,p}$-regularity conditions. Our objective in the following discussion is to develop a direct discretization scheme that is well suited to these cases.

For given $\varepsilon\in(0,1)$, let $(\mathcal{N}^\varepsilon_t)_{t\geqslant 0}$ be a Poisson process with intensity $1/\varepsilon$ (see (2.1) below for a precise definition). We consider the following simple SDE driven only by the Poisson process $\mathcal{N}^\varepsilon$:

X^\varepsilon_t=x+\varepsilon\int^t_0 b(s,X^\varepsilon_{s-})\,\mathrm{d}\mathcal{N}^\varepsilon_s, \qquad (1.10)

where $X^\varepsilon_{s-}$ stands for the left-hand limit. Since the Poisson process $\mathcal{N}^\varepsilon_s$ jumps only at exponentially distributed waiting times, the above SDE is always solvable as long as the coefficient $b$ takes finite values. Under (1.5) and (1.6), we show the following convergence: for any $T>0$,

\mathbb{E}\Big(\sup_{t\in[0,T]}|X^\varepsilon_t-X_t|^2\Big)\leqslant C\varepsilon,\qquad \varepsilon\in(0,1),

where $X$ is the unique Filippov solution of ODE (1.4) and $C=C(\kappa,d,T)>0$ (see Theorem 2.11). Furthermore, in the sense of DiPerna and Lions (cf. [11] and [9]), we establish the convergence of $X^\varepsilon$ in probability to the exact solution under certain $\mathbb{W}^{1,p}$-assumptions on $b$ (see Corollary 2.14). This convergence result is particularly significant because it allows us to construct Monte Carlo approximations for the first-order partial differential equations (PDEs) (1.8) and (1.9). In fact, in Subsection 2.4 we study particle approximations for distribution-dependent ODEs, which are closely related to nonlinear PDEs.
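To illustrate the scheme (1.10), here is a minimal simulation for a toy drift of our own choosing, $b(s,x)=-x$, whose exact flow is $X_t=x\mathrm{e}^{-t}$; the waiting times between jumps of $\mathcal{N}^\varepsilon$ are i.i.d. exponential with mean $\varepsilon$:

```python
import numpy as np

# A sketch of scheme (1.10) for the toy drift b(s, x) = -x (our own
# choice); the exact Filippov (here: classical) solution is X_t = x e^{-t},
# and the strong error should be of order sqrt(eps) by the bound above.
rng = np.random.default_rng(4)
eps, T, x0 = 1e-3, 1.0, 1.0

x, t = x0, 0.0
while True:
    t += eps * rng.exponential()     # jump times of N^eps have Exp(mean eps) gaps
    if t > T:
        break
    x = x + eps * (-x)               # jump of size eps * b at each jump time

print(x, np.exp(-T))                 # approximate vs. exact value at time T
```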

One important aspect to highlight is that, unlike the classical Euler scheme, our proposed scheme does not rely on any continuity assumption in the time variable $t$. In fact, for any $f\in L^2([0,1])$ and $\varepsilon\in(0,1)$, we have

\mathbb{E}\left|\varepsilon\int^1_0 f(s)\,\mathrm{d}\mathcal{N}^\varepsilon_s-\int^1_0 f(s)\,\mathrm{d}s\right|^2=\varepsilon\int^1_0|f(s)|^2\,\mathrm{d}s.
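This identity is simply the Itô isometry for the compensated Poisson integral: writing $\widetilde{\mathcal{N}}^\varepsilon_s:=\mathcal{N}^\varepsilon_s-s/\varepsilon$ (as in (2.1) below),

```latex
\varepsilon\int_0^1 f(s)\,\mathrm{d}\mathcal{N}^\varepsilon_s-\int_0^1 f(s)\,\mathrm{d}s
 =\varepsilon\int_0^1 f(s)\,\mathrm{d}\widetilde{\mathcal{N}}^\varepsilon_s,
\qquad
\mathbb{E}\Big(\varepsilon\int_0^1 f(s)\,\mathrm{d}\widetilde{\mathcal{N}}^\varepsilon_s\Big)^2
 =\varepsilon^2\int_0^1|f(s)|^2\,\frac{\mathrm{d}s}{\varepsilon}
 =\varepsilon\int_0^1|f(s)|^2\,\mathrm{d}s,
```

since the predictable quadratic variation of $\widetilde{\mathcal{N}}^\varepsilon$ is $\langle\widetilde{\mathcal{N}}^\varepsilon\rangle_t=t/\varepsilon$.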

We complement the theoretical analysis with numerical experiments to showcase the scheme’s performance, as illustrated in Remark 2.3.

1.2. Compound Poisson approximation for SDEs

Now we consider the classical stochastic differential equation driven by an $\alpha$-stable process: for $\alpha\in(0,2]$,

\mathrm{d}X_t=\sigma(t,X_t)\,\mathrm{d}L^{(\alpha)}_t+b(t,X_t)\,\mathrm{d}t,\qquad X_0=x. \qquad (1.11)

The traditional Euler scheme, also known as the Euler-Maruyama scheme, for SDE (1.11) and its variants has been extensively studied in the literature from both theoretical and numerical perspectives. When the coefficients $b$ and $\sigma$ are globally Lipschitz continuous, it is well known that the explicit Euler-Maruyama algorithm for SDEs driven by Brownian motions exhibits a strong convergence rate of $\frac{1}{2}$ and a weak convergence rate of $1$ (see [4], [19]).

In the case where the drift satisfies certain monotonicity conditions and the diffusion coefficient is locally Lipschitz, Gyöngy [15] proved almost sure convergence and convergence in probability of the Euler-Maruyama scheme (see Krylov’s earlier work [26]). However, Hutzenthaler, Jentzen, and Kloeden [21] provided examples illustrating the divergence of the absolute moments of Euler’s approximations at a finite time. In other words, it is not possible to establish strong convergence of the Euler scheme in the $L^p$-sense for SDEs with drift terms exhibiting super-linear growth. To overcome this issue, Hutzenthaler, Jentzen, and Kloeden [22] introduced a tamed Euler scheme, in which the drift term is modified to be bounded. This modification allows them to demonstrate strong convergence in the $L^p$-sense, with rate $\frac{1}{2}$, to the exact solution of the SDE, assuming the drift coefficient is globally one-sided Lipschitz continuous. Subsequently, Sabanis [35] improved upon the tamed scheme of [22] to cover more general cases and provided simpler proofs of the strong convergence.

On the other hand, there is also a considerable body of literature addressing Euler approximations for SDEs with irregular coefficients, such as Hölder and even singular drifts (see [3], [33], [39], and references therein). However, to the best of our knowledge, there are relatively few results concerning the Euler scheme for SDEs driven by $\alpha$-stable processes under non-Lipschitz conditions (with the exception of [32], [27], which focus on the additive noise case).

Our goal is to develop a unified compound Poisson approximation scheme for the SDE (1.11), driven either by purely jumping $\alpha$-stable processes or by Brownian motions. To achieve this, let $(\xi_n)_{n\in\mathbb{N}}$ be a sequence of independent and identically distributed random variables taking values in $\mathbb{Z}^d$, such that for any lattice point $z\in\mathbb{Z}^d$,

\mathbb{P}(\xi_n=z)=\begin{cases}(2d)^{-1},& |z|=1,\ \alpha=2,\\ c_0|z|^{-d-\alpha},& z\neq 0,\ \alpha\in(0,2),\end{cases} \qquad (1.12)

where $c_0=(\sum_{0\neq z\in\mathbb{Z}^d}|z|^{-d-\alpha})^{-1}$ is a normalizing constant. Let $\xi_0=0$. We define a $\mathbb{Z}^d$-valued compound Poisson process $H^\varepsilon$ by

H^\varepsilon_t:=\sum_{n\leqslant\mathcal{N}^\varepsilon_t}\xi_n,\qquad t\geqslant 0, \qquad (1.13)

where $(\mathcal{N}^\varepsilon_t)_{t\geqslant 0}$ is a Poisson process with intensity $1/\varepsilon$. Let $\mathcal{H}^\varepsilon$ be the associated Poisson random measure, i.e., for $t>0$ and $E\in\mathscr{B}(\mathbb{R}^d)$,

\mathcal{H}^\varepsilon([0,t],E):=\sum_{s\leqslant t}\mathbbm{1}_E(\Delta H^\varepsilon_s)=\sum_{n\leqslant\mathcal{N}^\varepsilon_t}\mathbbm{1}_E(\xi_n).
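For $\alpha\in(0,2)$, the jump law (1.12) can be sampled by tabulating the weights on a truncated lattice; a minimal one-dimensional sketch (the truncation level $M$ is our own illustrative parameter, justified by the tail mass beyond $M$ decaying like $M^{-\alpha}$):

```python
import numpy as np

# A sketch of sampling the jump law (1.12) for alpha in (0, 2) and d = 1,
# truncating the lattice to 1 <= |z| <= M (M is our own truncation choice;
# the neglected tail mass decays like M^{-alpha}).
alpha, M = 1.0, 10_000
z = np.arange(1, M + 1)
w = z.astype(float) ** (-1 - alpha)                 # |z|^{-d-alpha} with d = 1
p = np.concatenate([w[::-1], w]) / (2 * w.sum())    # symmetric law, c_0 via normalization
support = np.concatenate([-z[::-1], z])             # {-M, ..., -1, 1, ..., M}

rng = np.random.default_rng(5)
xi = rng.choice(support, size=10_000, p=p)
# The law is symmetric and heavy-tailed: large |xi| occur with polynomial probability.
print((np.abs(xi) > 10).mean())
```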

Consider the following SDE driven by the compound Poisson process $\mathcal{H}^\varepsilon$:

X^\varepsilon_t=x+\int^t_0\int_{\mathbb{R}^d}\Big(\varepsilon^{\frac{1}{\alpha}}\sigma(s,X^\varepsilon_{s-})z+\varepsilon b(s,X^\varepsilon_{s-})\Big)\,\mathcal{H}^\varepsilon(\mathrm{d}s,\mathrm{d}z), \qquad (1.14)

where the integral is a finite sum, since the compound Poisson process jumps only at exponentially distributed waiting times. Let $S^\varepsilon_n$ be the $n$-th jump time of $\mathcal{N}^\varepsilon_t$. It is easy to see that (see Lemma 3.4)

X^\varepsilon_t=x+\sum_{n\leqslant\mathcal{N}^\varepsilon_t}\Big(\varepsilon^{\frac{1}{\alpha}}\sigma(S^\varepsilon_n,X^\varepsilon_{S^\varepsilon_{n-1}})\xi_n+\varepsilon b(S^\varepsilon_n,X^\varepsilon_{S^\varepsilon_{n-1}})\Big).
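For illustration, a minimal simulation of this recursion for a toy scalar SDE of our own choosing ($\alpha=2$, $\sigma\equiv 1$, $b(x)=-x$, i.e. the Ornstein-Uhlenbeck equation $\mathrm{d}X_t=-X_t\,\mathrm{d}t+\mathrm{d}W_t$, whose marginal at time $T$ has mean $x_0\mathrm{e}^{-T}$ and variance $(1-\mathrm{e}^{-2T})/2$):

```python
import numpy as np

# A minimal run of the compound Poisson scheme (1.14) for the toy SDE
# dX_t = -X_t dt + dW_t (alpha = 2, sigma = 1, b(x) = -x in d = 1; the
# concrete coefficients are our own choice for illustration).
rng = np.random.default_rng(1)
eps, T, x0 = 0.01, 1.0, 1.0

def simulate_path():
    t, x = 0.0, x0
    while True:
        t += eps * rng.exponential()       # waiting times of N^eps have mean eps
        if t > T:
            return x
        xi = rng.choice([-1.0, 1.0])       # jump law (1.12) for alpha = 2, d = 1
        x = x + np.sqrt(eps) * xi + eps * (-x)   # one step of the recursion above

paths = np.array([simulate_path() for _ in range(2000)])
# For the OU process: E X_T = x0 e^{-T}, Var X_T = (1 - e^{-2T}) / 2.
print(paths.mean(), paths.var())
```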

Indeed, it is possible to choose different independent Poisson processes for the drift and diffusion coefficients in the compound Poisson approximation scheme. However, it is worth noting that doing so would increase the computational time required for simulations. By using the same compound Poisson process for both coefficients, the computational efficiency can be improved as the generation of random numbers for the Poisson process is shared between the drift and diffusion terms.

We note that the problem of approximating continuous diffusions by jump processes has been studied in [23, p. 558, Theorem 4.21] under rather abstract conditions. However, from a numerical approximation or algorithmic standpoint, the explicit procedure (1.14) does not seem to have been thoroughly investigated. In this paper, we establish the weak convergence of $X^\varepsilon$ to $X$ in the space $\mathbb{D}(\mathbb{R}^d)$ of all càdlàg functions under weak assumptions. Notably, these assumptions allow for coefficients with polynomial growth. Furthermore, under nondegeneracy and additive-noise assumptions, as well as Hölder continuity assumptions on the drift, we establish the following weak convergence rate: for some $\beta=\beta(\alpha)\in(0,1)$ and any $T>0$ and $t\in[0,T]$,

|\mathbb{E}\varphi(X^\varepsilon_t)-\mathbb{E}\varphi(X_t)|\leqslant C\|\varphi\|_{C^1_b}\,\varepsilon^\beta.

It is worth mentioning that when $b=0$ and $\sigma$ is the identity matrix, the convergence of $X^\varepsilon$ to $X$ corresponds to the classical Donsker invariance principle. Additionally, when the drift $b$ satisfies certain dissipativity assumptions, we show the weak convergence of the invariant measure $\mu^\varepsilon$ of SDE (1.14) to the invariant measure $\mu$ of SDE (1.11), provided the latter is unique.

As an application, we consider a probabilistic approximation, discretized in the time direction, of the 2D Navier-Stokes equations (NSEs) on the torus. Specifically, for fixed $T>0$, we focus on the vorticity form of the backward 2D Navier-Stokes equations on the torus:

\partial_s w+\nu\Delta w+u\cdot\nabla w=0,\qquad w(T)=w_0=\mathrm{curl}\,\varphi,\qquad u=K_2*w,

where $\varphi:\mathbb{T}^2\to\mathbb{R}^2$ is a smooth divergence-free vector field on the torus, and $K_2$ represents the Biot-Savart law (as described in (4.8) below). The stochastic Lagrangian particle method for NSEs has been studied previously in [8] and [43]. In this paper, we propose a discretized version of the NSEs, defined as follows: for $\varepsilon\in(0,1)$, let $X^\varepsilon_{s,t}$ solve the following stochastic system:

\left\{\begin{aligned} X^\varepsilon_{s,t}(x)&=x+\varepsilon\int^t_s u_\varepsilon(r,X^\varepsilon_{s,r-}(x))\,\mathrm{d}\mathcal{N}^\varepsilon_r+\sqrt{\varepsilon\nu}\,(H^\varepsilon_t-H^\varepsilon_s),\\ w_\varepsilon(s,x)&=\mathbb{E}w_0(X^\varepsilon_{s,T}(x)),\qquad u_\varepsilon=K_2*w_\varepsilon,\qquad 0\leqslant s\leqslant t\leqslant T,\end{aligned}\right. \qquad (1.15)

where $H^\varepsilon_t$ is defined in (1.13). We establish that there exists a constant $C>0$ such that for all $s\in[0,T]$ and $\varepsilon\in(0,1)$,

\|u_\varepsilon(s)-u(s)\|_\infty\leqslant C\varepsilon.

The scheme (1.15) thus provides a novel and promising approach for simulating the 2D NSEs by Monte Carlo methods.

1.3. Compound Poisson particle approximation for DDSDEs

Motivated by the aforementioned scheme, we can develop a compound Poisson particle approximation for the nonlinear SDE (1.1). Fix $N\in\mathbb{N}$. Let $(\mathcal{N}^{N,i})_{i=1,\cdots,N}$ be a sequence of i.i.d. Poisson processes with intensity $N$ and $(\xi^{N,i}_n)_{n\in\mathbb{N},\,i=1,\cdots,N}$ i.i.d. $\mathbb{Z}^d$-valued random variables with common distribution (1.12). For $i=1,\cdots,N$, define

H^{N,i}_t:=\Big(\xi^{N,i}_1+\cdots+\xi^{N,i}_{\mathcal{N}^{N,i}_t}\Big)\mathbbm{1}_{\mathcal{N}^{N,i}_t\geqslant 1}.

Then $(H^{N,i})_{i=1,\cdots,N}$ is a sequence of i.i.d. compound Poisson processes. Let $\mathcal{H}^{N,i}$ be the associated Poisson random measure, that is,

\mathcal{H}^{N,i}([0,t],E):=\sum_{s\leqslant t}\mathbbm{1}_E(\Delta H^{N,i}_s)=\sum_{n\leqslant\mathcal{N}^{N,i}_t}\mathbbm{1}_E(\xi^{N,i}_n),\qquad E\in\mathscr{B}(\mathbb{R}^d).

Let $(X^{N,i}_0)_{i=1,\cdots,N}$ be a sequence of symmetric random variables and let $\mathbf{X}^N_t=(X^{N,i}_t)_{i=1,\cdots,N}$ solve the following interacting particle system driven by $\mathcal{H}^{N,i}$:

X^{N,i}_t=X^{N,i}_0+\int^t_0\int_{\mathbb{R}^d}\Big(N^{-\frac{1}{\alpha}}\sigma\big[s,X^{N,i}_{s-},\mu_{\mathbf{X}^N_{s-}}\big]z+N^{-1}b\big[s,X^{N,i}_{s-},\mu_{\mathbf{X}^N_{s-}}\big]\Big)\,\mathcal{H}^{N,i}(\mathrm{d}s,\mathrm{d}z). \qquad (1.16)

Under suitable assumptions on $\sigma$, $b$, and $\mathbf{X}^N_0$, we will show that for any $k\in\mathbb{N}$,

\mathbb{P}\circ(X^{N,1}_\cdot,\cdots,X^{N,k}_\cdot)^{-1}\to\mathbb{P}^{\otimes k}_0\quad\text{as }N\to\infty, \qquad (1.17)

where $\mathbb{P}_0$ represents the law of the solution of DDSDE (1.1) on the space of càdlàg functions, and $\mathbb{P}^{\otimes k}_0$ denotes the $k$-fold product measure induced by $\mathbb{P}_0$. Here, we have chosen $\varepsilon=1/N$ in (1.14). In contrast to the traditional particle approximation (1.3), the stochastic particle system (1.16) is fully discretized and can easily be simulated on a computer. The convergence result (1.17) can be interpreted as propagation of chaos in the sense of Kac [24]. Furthermore, in the case of additive noise, we also establish a quantitative convergence rate with respect to the Wasserstein metric $\mathcal{W}_1$ under Lipschitz conditions.
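To make the fully discrete nature of (1.16) concrete, here is a minimal sketch for a toy instance of our own choosing ($d=1$, $\alpha=2$, $\sigma\equiv 1$, and the hypothetical mean-field drift $b(x,y)=y-x$, so that $b[s,x,\mu]=\mathrm{mean}(\mu)-x$); the superposition of the $N$ independent clocks of intensity $N$ is simulated as a single event stream of total rate $N^2$, each event assigned to a uniformly chosen particle:

```python
import numpy as np

# A sketch (our own toy instance, not the paper's verbatim algorithm) of the
# fully discrete particle system (1.16) with alpha = 2, d = 1, sigma = 1 and
# mean-field drift b(x, y) = y - x, i.e. b[s, x, mu] = mean(mu) - x.
rng = np.random.default_rng(2)
N, T = 200, 1.0
X = rng.normal(size=N)                       # initial particle positions

t = 0.0
while True:
    # superposition of N i.i.d. Poisson clocks of intensity N: total rate N^2
    t += rng.exponential(1.0 / (N * N))
    if t > T:
        break
    i = rng.integers(N)                      # the clock that rang
    xi = rng.choice([-1.0, 1.0])             # jump law (1.12) for alpha = 2, d = 1
    X[i] = X[i] + xi / np.sqrt(N) + (X.mean() - X[i]) / N

print(X.mean(), X.var())
```

Each event updates a single particle by $N^{-1/2}\xi+N^{-1}b[\cdot]$, which is exactly the jump in (1.16) with $\varepsilon=1/N$.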

1.4. Organization of the paper and notations

This paper is structured as follows:

In Section 2, we introduce the Poisson process approximation for ordinary differential equations (ODEs). We investigate the case where the vector field $b$ is bounded and Lipschitz continuous and establish the optimal convergence rate in both the strong and weak senses. Additionally, we present a functional central limit theorem in this setting. Furthermore, we consider the case where $b$ satisfies the one-sided Lipschitz condition (not necessarily continuous), allowing for linear growth, and demonstrate the $L^p$-strong convergence of $X^\varepsilon$ to the unique Filippov solution. When the vector field $b$ belongs to the first-order Sobolev space $\mathbb{W}^{1,p}$ and has bounded divergence, we also show the convergence in probability of $X^\varepsilon$ to $X$. Moreover, we explore particle approximation methods for nonlinear ODEs.

In Section 3, we focus on the compound Poisson approximation for stochastic differential equations (SDEs), in a framework more general than the one described in (1.14) above. Under relatively weak assumptions, we establish the weak convergence of $X^\varepsilon$, the convergence of invariant measures, as well as the weak convergence rate.

In Section 4, we concentrate on the 2D Navier-Stokes/Euler equations on the torus and propose a novel compound Poisson approximation scheme for these equations.

In Section 5, we specifically examine the compound Poisson particle approximation for DDSDEs driven by either $\alpha$-stable processes or Brownian motions. Notably, in the Brownian diffusion case we allow the interaction kernel to exhibit linear growth. In the additive noise case, we establish the convergence rate in terms of the $\mathcal{W}_1$ metric.

In the Appendix, we provide a summary of the relevant notions and facts about martingale solutions that are utilized throughout the paper.

Throughout this paper, we use $C$, with or without subscripts, to denote constants whose values may change from line to line. We also use $:=$ to indicate a definition and set

a\wedge b:=\min(a,b),\qquad a\vee b:=\max(a,b).

By $A\lesssim_C B$, or simply $A\lesssim B$, we mean that $A\leqslant CB$ for some constant $C\geqslant 1$. For the readers’ convenience, we collect some frequently used notations below.

  • $\mathcal{P}(E)$: The space of all probability measures over a Polish space $E$.

  • $\mathscr{B}(E)$: The Borel $\sigma$-algebra of a Polish space $E$.

  • $\Rightarrow$: Weak convergence of probability measures or random variables.

  • $\mathbb{D}=\mathbb{D}(\mathbb{R}^d)$: The space of all càdlàg functions from $[0,\infty)$ to $\mathbb{R}^d$.

  • $\Delta f_s:=f_s-f_{s-}$: The jump of $f\in\mathbb{D}$ at time $s$.

  • $\mathscr{T}_T$: The set of all bounded stopping times.

  • $C^\beta_b$: The usual Hölder space of order $\beta$.

  • $B_R$: The ball in $\mathbb{R}^d$ with radius $R$ and center $0$.

2. Poisson process approximation for ODEs

In this section, we focus on the simple Poisson approximation for ODEs. A distinguishing feature of our approach is that we do not make any regularity assumptions on the time variable. Moreover, we allow the coefficient to satisfy only the one-sided Lipschitz condition (1.5). The convergence analysis relies on straightforward stochastic calculus involving Poisson processes.

Let $(T_k)_{k\in\mathbb{N}}$ be a sequence of i.i.d. random variables on some probability space $(\Omega,\mathcal{F},\mathbb{P})$ with common exponential distribution of parameter $1$, i.e.,

\mathbb{P}(T_k\geqslant t)=\mathrm{e}^{-t},\qquad t\geqslant 0,\ k=1,2,\cdots.

Let $S_0\equiv 0$, and for $n\geqslant 1$ define

S_n:=S_{n-1}+T_n,

and for $t\geqslant 0$,

\mathcal{N}_t:=\max\{n:S_n\leqslant t\}.

Then $\mathcal{N}_t$ is a standard Poisson process with intensity $1$. In particular, $S_n$ is the $n$-th jump time of $\mathcal{N}_t$.

Figure 1. Standard Poisson process
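The construction above translates directly into a sampler; a minimal sketch (our own illustration):

```python
import numpy as np

# Sample a standard Poisson variable exactly as constructed above: S_n is
# the partial sum of i.i.d. Exp(1) waiting times T_k, and N_t = max{n : S_n <= t}.
rng = np.random.default_rng(3)

def poisson_count(t, rng):
    n, s = 0, 0.0
    while True:
        s += rng.exponential(1.0)   # T_k ~ Exp(1)
        if s > t:
            return n                # N_t counts the S_n that fall in [0, t]
        n += 1

samples = np.array([poisson_count(5.0, rng) for _ in range(4000)])
# Check E N_t = t and Var N_t = t (up to Monte Carlo error):
print(samples.mean(), samples.var())
```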

Note that

\mathbb{E}T_k=1,\qquad \mathbb{E}\mathcal{N}_t=t,\qquad \mathbb{E}(\mathcal{N}_t-t)^2=t.

For given $\varepsilon>0$, we introduce

\mathcal{N}^\varepsilon_t:=\mathcal{N}_{t/\varepsilon},\qquad \widetilde{\mathcal{N}}^\varepsilon_t:=\mathcal{N}_{t/\varepsilon}-t/\varepsilon. \qquad (2.1)

Then $\mathcal{N}^\varepsilon_t$ is a Poisson process with intensity $1/\varepsilon$. In this paper, we choose a sub-$\sigma$-field $\mathcal{F}_0\subset\mathcal{F}$ which is independent of $(T_k)_{k\in\mathbb{N}}$ and therefore independent of $(\mathcal{N}^\varepsilon_t)_{t\geqslant 0}$. We assume that $\mathcal{F}_0$ is sufficiently rich so that for any $\mu\in\mathcal{P}(\mathbb{R}^d)$ there exists an $\mathcal{F}_0$-measurable random variable $X_0$ such that $\mathbb{P}\circ X_0^{-1}=\mu$. In particular, if we introduce the filtration

\mathcal{F}^\varepsilon_t:=\mathcal{F}_0\vee\sigma\{\mathcal{N}^\varepsilon_s:s\leqslant t\},\qquad t\geqslant 0,

then one can verify that $\widetilde{\mathcal{N}}^\varepsilon_t$ is an $\mathcal{F}^\varepsilon_t$-martingale.

In the following, we will utilize an SDE driven by the Poisson process $\mathcal{N}^\varepsilon_t$ to construct a discrete approximation for ODEs. We will demonstrate the convergence of this approximation under various assumptions and establish certain functional central limit theorems.

2.1. Classical solutions for ODEs with Lipschitz coefficients

In this section, we begin by considering the case where the vector fields are bounded and Lipschitz. We demonstrate the optimal rates of strong and weak convergence for the Poisson process approximation as introduced in the introduction. Additionally, we establish a central limit theorem for this approximation scheme.

Let b:+×ddb:{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}\to{\mathbb{R}}^{d} be a measurable vector field. Suppose that

b+b<,\displaystyle\|b\|_{\infty}+\|\nabla b\|_{\infty}<\infty, (2.2)

where \|\cdot\|_{\infty} is the usual LL^{\infty}-norm in +×d{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}. For any 0{\mathcal{F}}_{0}-measurable initial value X0X_{0}, by the Cauchy-Lipschitz theorem, there is a unique global solution XtX_{t} to the following ODE:

Xt=X0+0tb(s,Xs)ds.\displaystyle X_{t}=X_{0}+\int^{t}_{0}b(s,X_{s}){\mathord{{\rm d}}}s. (2.3)

Let Xt(x)X_{t}(x) be the unique solution starting from xdx\in{\mathbb{R}}^{d}. Then

Xt=Xt(x)|x=X0.X_{t}=X_{t}(x)|_{x=X_{0}}.

Now we consider the following SDE driven by the Poisson process 𝒩ε{\mathcal{N}}^{\varepsilon}:

Xtε=X0+0tεb(s,Xsε)d𝒩sε.\displaystyle X^{\varepsilon}_{t}=X_{0}+\int^{t}_{0}\varepsilon b(s,X^{\varepsilon}_{s-}){\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s}. (2.4)

Since s𝒩sεs\mapsto{\mathcal{N}}^{\varepsilon}_{s} is a step function (see Figure 1), it is easy to see that

Xtε=X0+εstb(s,Xsε)Δ𝒩sε=X0+εn=1b(Snε,XSn1εε)𝟙Snεt,\displaystyle X^{\varepsilon}_{t}=X_{0}+\varepsilon\sum_{s\leqslant t}b(s,X^{\varepsilon}_{s-})\Delta{\mathcal{N}}^{\varepsilon}_{s}=X_{0}+\varepsilon\sum_{n=1}^{\infty}b(S^{\varepsilon}_{n},X^{\varepsilon}_{S^{\varepsilon}_{n-1}}){\mathbbm{1}}_{S^{\varepsilon}_{n}\leqslant t},

where Δ𝒩sε:=𝒩sε𝒩sε\Delta{\mathcal{N}}^{\varepsilon}_{s}:={\mathcal{N}}^{\varepsilon}_{s}-{\mathcal{N}}^{\varepsilon}_{s-} and Snε:=εSnS^{\varepsilon}_{n}:=\varepsilon S_{n}. In particular,

XtεXtε=εb(t,Xtε)Δ𝒩tε\displaystyle X^{\varepsilon}_{t}-X^{\varepsilon}_{t-}=\varepsilon b(t,X^{\varepsilon}_{t-})\Delta{\mathcal{N}}^{\varepsilon}_{t} (2.5)

and

Xtε=X0+0tb(s,Xsε)ds+0tεb(s,Xsε)d𝒩~sε,\displaystyle X^{\varepsilon}_{t}=X_{0}+\int^{t}_{0}b(s,X^{\varepsilon}_{s}){\mathord{{\rm d}}}s+\int^{t}_{0}\varepsilon b(s,X^{\varepsilon}_{s-}){\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s}, (2.6)

where we have used that b(s,Xsε)=b(s,Xsε)b(s,X^{\varepsilon}_{s})=b(s,X^{\varepsilon}_{s-}) except at countably many points ss. It is worth noting that the solvability of the SDE (2.4) does not require any regularity assumptions on bb, and the second integral term is a martingale. In a sense, we can view (2.4) as an Euler scheme with random step sizes. Furthermore, let Xtε(x)X^{\varepsilon}_{t}(x) be the unique solution of (2.4) starting from xx. Then

Xtε=Xtε(x)|x=X0.X^{\varepsilon}_{t}=X^{\varepsilon}_{t}(x)|_{x=X_{0}}.

Hence, if X00X_{0}\in\mathcal{F}_{0} has a density, then for each t>0t>0, XtεX^{\varepsilon}_{t} also possesses a density.
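Concretely, the scheme (2.4) is straightforward to implement: one draws an exponential waiting time of mean ε and then moves the current state by ε b(s, x). The sketch below is our own illustration, not the paper's code; the drift b(s,x) = −x and all parameter values are chosen by us for testing, since the exact solution X_t = x_0 e^{−t} is then explicit.

```python
import math
import random

def poisson_euler(b, x0, T, eps, seed=0):
    """Simulate the random-step Euler scheme (2.4): at each jump time of
    the Poisson process N^eps, move the state by eps * b(s, x)."""
    rng = random.Random(seed)
    s, x = 0.0, x0
    while True:
        s += eps * rng.expovariate(1.0)   # next jump time S_n^eps = eps * S_n
        if s > T:
            return x
        x += eps * b(s, x)                # jump of size eps * b(s, X_{s-})

# drift b(s, x) = -x, so X_t = x0 * exp(-t)
xT = poisson_euler(lambda s, x: -x, x0=1.0, T=1.0, eps=1e-5)
print(xT, math.exp(-1.0))                 # strong error is of order sqrt(eps)
```

The observed error at time T is of the order √ε, in line with Theorem 2.1(i) below.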

First of all, we show the following simple approximation result.

Theorem 2.1.
  1. (i)

    (Strong Convergence) Under (2.2), for any T>0T>0, we have

    𝔼(supt[0,T]|XtεXt|2)4e2bTb2Tε,ε(0,1).{\mathbb{E}}\left(\sup_{t\in[0,T]}|X^{\varepsilon}_{t}-X_{t}|^{2}\right)\leqslant 4\mathrm{e}^{2\|\nabla b\|_{\infty}T}\|b\|^{2}_{\infty}T\varepsilon,\ \ \varepsilon\in(0,1).
  2. (ii)

    (Weak Convergence) Under (2.2) and 2b<\|\nabla^{2}b\|_{\infty}<\infty, for any T>0T>0, there is a constant C=C(T,bCb2)>0C=C(T,\|b\|_{C^{2}_{b}})>0 such that for any ff with fCb1<\|\nabla f\|_{C^{1}_{b}}<\infty and t[0,T]t\in[0,T],

    |𝔼f(Xtε)𝔼f(Xt)|CfCb1ε,ε(0,1).|{\mathbb{E}}f(X^{\varepsilon}_{t})-{\mathbb{E}}f(X_{t})|\leqslant C\|\nabla f\|_{C^{1}_{b}}\varepsilon,\ \ \varepsilon\in(0,1).
Proof.

(i) Noting that by (2.6) and (2.3),

XtεXt\displaystyle X^{\varepsilon}_{t}-X_{t} =0t(b(s,Xsε)b(s,Xs))ds+0tεb(s,Xsε)d𝒩~sε,\displaystyle=\int^{t}_{0}(b(s,X^{\varepsilon}_{s})-b(s,X_{s})){\mathord{{\rm d}}}s+\int^{t}_{0}\varepsilon b(s,X^{\varepsilon}_{s-}){\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s},

we have

|XtεXt|\displaystyle|X^{\varepsilon}_{t}-X_{t}| |0t(b(s,Xsε)b(s,Xs))ds|+|0tεb(s,Xsε)d𝒩~sε|\displaystyle\leqslant\left|\int^{t}_{0}(b(s,X^{\varepsilon}_{s})-b(s,X_{s})){\mathord{{\rm d}}}s\right|+\left|\int^{t}_{0}\varepsilon b(s,X^{\varepsilon}_{s-}){\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s}\right|
b0t|XsεXs|ds+|0tεb(s,Xsε)d𝒩~sε|.\displaystyle\leqslant\|\nabla b\|_{\infty}\int^{t}_{0}|X^{\varepsilon}_{s}-X_{s}|{\mathord{{\rm d}}}s+\left|\int^{t}_{0}\varepsilon b(s,X^{\varepsilon}_{s-}){\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s}\right|.

Hence, by Gronwall’s inequality and Doob’s maximal inequality,

𝔼(supt[0,T]|XtεXt|2)\displaystyle{\mathbb{E}}\left(\sup_{t\in[0,T]}|X^{\varepsilon}_{t}-X_{t}|^{2}\right) e2bT𝔼(supt[0,T]|0tεb(s,Xsε)d𝒩~sε|2)\displaystyle\leqslant\mathrm{e}^{2\|\nabla b\|_{\infty}T}{\mathbb{E}}\left(\sup_{t\in[0,T]}\left|\int^{t}_{0}\varepsilon b(s,X^{\varepsilon}_{s-}){\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s}\right|^{2}\right)
4e2bT𝔼|0Tεb(s,Xsε)d𝒩~sε|2\displaystyle\leqslant 4\mathrm{e}^{2\|\nabla b\|_{\infty}T}{\mathbb{E}}\left|\int^{T}_{0}\varepsilon b(s,X^{\varepsilon}_{s-}){\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s}\right|^{2}
=4e2bT𝔼(0T|εb(s,Xsε)|2d(sε))\displaystyle=4\mathrm{e}^{2\|\nabla b\|_{\infty}T}{\mathbb{E}}\left(\int^{T}_{0}|\varepsilon b(s,X^{\varepsilon}_{s})|^{2}{\mathord{{\rm d}}}{\big{(}}\tfrac{s}{\varepsilon}{\big{)}}\right)
4e2bTb2Tε.\displaystyle\leqslant 4\mathrm{e}^{2\|\nabla b\|_{\infty}T}\|b\|^{2}_{\infty}T\varepsilon.

(ii) Fix t>0t>0 and fCb2(d)f\in C^{2}_{b}({\mathbb{R}}^{d}). Let u(s,x)u(s,x) solve the backward transport equation:

su+bu=0,u(t,x)=f(x).\displaystyle\partial_{s}u+b\cdot\nabla u=0,\ \ u(t,x)=f(x). (2.7)

In fact, the unique solution of the above transport equation is given by

u(s,x)=f(Xs,t(x)),u(s,x)=f(X_{s,t}(x)),

where Xs,t(x)X_{s,t}(x) solves the following ODE:

Xs,t(x)=x+stb(r,Xs,r(x))dr.X_{s,t}(x)=x+\int^{t}_{s}b(r,X_{s,r}(x)){\mathord{{\rm d}}}r.

Since b,f\nabla b,\nabla f, 2b,2fL\nabla^{2}b,\nabla^{2}f\in L^{\infty}, by the chain rule, it is easy to derive that

2u(s,)\displaystyle\|\nabla^{2}u(s,\cdot)\|_{\infty} 2fXs,t2+f2Xs,t\displaystyle\leqslant\|\nabla^{2}f\|_{\infty}\|\nabla X_{s,t}\|_{\infty}^{2}+\|\nabla f\|_{\infty}\|\nabla^{2}X_{s,t}\|_{\infty}
e4b(ts)(2f+f2b),\displaystyle\leqslant\mathrm{e}^{4\|\nabla b\|_{\infty}(t-s)}\Big{(}\|\nabla^{2}f\|_{\infty}+\|\nabla f\|_{\infty}\|\nabla^{2}b\|_{\infty}\Big{)},

and for the solution XtX_{t} of (2.3),

f(Xt)=u(t,Xt)=u(0,X0)+0t(su+bu)(s,Xs)ds=u(0,X0).\displaystyle f(X_{t})=u(t,X_{t})=u(0,X_{0})+\int^{t}_{0}(\partial_{s}u+b\cdot\nabla u)(s,X_{s}){\mathord{{\rm d}}}s=u(0,X_{0}). (2.8)

Moreover, by Itô’s formula we have

𝔼f(Xtε)=𝔼u(t,Xtε)=𝔼u(0,X0)+𝔼0t[su(s,Xsε)+u(s,Xsε+εb(s,Xsε))u(s,Xsε)ε]ds.\displaystyle{\mathbb{E}}f(X^{\varepsilon}_{t})={\mathbb{E}}u(t,X^{\varepsilon}_{t})={\mathbb{E}}u(0,X_{0})+{\mathbb{E}}\int^{t}_{0}\left[\partial_{s}u(s,X^{\varepsilon}_{s})+\frac{u(s,X^{\varepsilon}_{s}+\varepsilon b(s,X^{\varepsilon}_{s}))-u(s,X^{\varepsilon}_{s})}{\varepsilon}\right]{\mathord{{\rm d}}}s.

Hence, by (2.7) and (2.8),

|𝔼f(Xtε)𝔼f(Xt)|\displaystyle|{\mathbb{E}}f(X^{\varepsilon}_{t})-{\mathbb{E}}f(X_{t})| =|𝔼0tb(s,Xsε)01(u(s,Xsε+θεb(s,Xsε))u(s,Xsε))dθds|\displaystyle=\left|{\mathbb{E}}\int^{t}_{0}b(s,X^{\varepsilon}_{s})\cdot\int^{1}_{0}\Big{(}\nabla u(s,X^{\varepsilon}_{s}+\theta\varepsilon b(s,X^{\varepsilon}_{s}))-\nabla u(s,X^{\varepsilon}_{s})\Big{)}{\mathord{{\rm d}}}\theta{\mathord{{\rm d}}}s\right|
b22uε01θdθb2e4bt(2f+f2b)ε2.\displaystyle\leqslant\|b\|^{2}_{\infty}\|\nabla^{2}u\|_{\infty}\varepsilon\int^{1}_{0}\theta{\mathord{{\rm d}}}\theta\leqslant\|b\|^{2}_{\infty}\mathrm{e}^{4\|\nabla b\|_{\infty}t}\Big{(}\|\nabla^{2}f\|_{\infty}+\|\nabla f\|_{\infty}\|\nabla^{2}b\|_{\infty}\Big{)}\tfrac{\varepsilon}{2}.

The proof is complete. ∎

Remark 2.2.

Note that the rate of weak convergence is better than the rate of strong convergence in the Poisson process approximation. The order of convergence, in both the strong and the weak sense, is the same as for the classical Euler approximation of SDEs (see [25]).

Remark 2.3.

Consider a locally square integrable measurable function ff. For ε>0\varepsilon>0, let us define

Ifε(t):=ε0tf(s)d𝒩sε=εstf(s)Δ𝒩sε.I^{\varepsilon}_{f}(t):=\varepsilon\int^{t}_{0}f(s){\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s}=\varepsilon\sum_{s\leqslant t}f(s)\Delta{\mathcal{N}}^{\varepsilon}_{s}.

By applying Doob’s maximal inequality, we obtain

𝔼[supt[0,T]|Ifε(t)0tf(s)ds|2]4ε0T|f(s)|2ds.{\mathbb{E}}\left[\sup_{t\in[0,T]}\left|I^{\varepsilon}_{f}(t)-\int^{t}_{0}f(s){\mathord{{\rm d}}}s\right|^{2}\right]\leqslant 4\varepsilon\int^{T}_{0}|f(s)|^{2}{\mathord{{\rm d}}}s.

It is worth noting that the calculation of Ifε(t)I^{\varepsilon}_{f}(t) can be easily implemented on a computer, where the step size is randomly chosen according to the exponential distribution. As a result, we can utilize the Monte Carlo method to theoretically compute the integral 0Tf(s)ds\int_{0}^{T}f(s)\mathrm{d}s. To illustrate the effectiveness of our scheme, we provide an example involving a highly oscillatory function:

f(s):=(12([200s]%2))100,s[0,1],f(s):=(1-2*([200*s]\%2))*100,\ \ s\in[0,1],

where [a][a] stands for the integer part of aa, and n%2n\%2 equals 11 or 0 according to whether nn is odd or even. Note that t0tf(s)ds=:F(t)t\mapsto\int^{t}_{0}f(s){\mathord{{\rm d}}}s=:F(t) oscillates between 0 and 0.50.5. We plot FF using both Euler’s scheme and the Poisson approximation scheme, as depicted in Figure 2. From the graph, we observe that Euler’s scheme is unstable due to the regular choice of partition points, whereas the Poisson scheme remains stable, with partition points chosen at random.
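A minimal version of the Poisson scheme for this integral can be sampled in vectorized form: conditionally on the number of jumps of 𝒩^ε on [0,T], the jump times are i.i.d. uniform on [0,T], so ε Σ_{S_n^ε ≤ T} f(S_n^ε) can be generated directly. The snippet below is our own sketch (the choice ε = 10^{-6} is ours) for the oscillatory f above, whose exact integral is F(1) = 0.

```python
import numpy as np

def f(s):
    # the highly oscillatory integrand from the remark: f = +/-100
    return (1 - 2 * (np.floor(200 * s) % 2)) * 100

def poisson_integral(f, T, eps, rng):
    """Approximate the integral of f over [0, T] by eps times the sum of f
    over the jump times of a Poisson process with intensity 1/eps."""
    n_jumps = rng.poisson(T / eps)
    jump_times = rng.uniform(0.0, T, size=n_jumps)  # order is irrelevant for the sum
    return eps * f(jump_times).sum()

rng = np.random.default_rng(1)
approx = poisson_integral(f, T=1.0, eps=1e-6, rng=rng)
print(approx)   # F(1) = 0; the L^2 error is of order sqrt(eps * int_0^1 f^2) = 0.1
```

By the Doob estimate above, the L² error is at most 2(ε∫₀¹|f|²)^{1/2}, i.e. about 0.1 for this ε.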

Figure 2. Comparison between Euler scheme and Poisson scheme

Next, we investigate the asymptotic distribution of the following deviation as ε0\varepsilon\to 0,

Ztε:=XtεXtε.Z^{\varepsilon}_{t}:=\frac{X^{\varepsilon}_{t}-X_{t}}{\sqrt{\varepsilon}}.

By (2.4) and (2.6), it is easy to see that

Ztε=0tεb(s,Xsε)d𝒩~sε+0tb(s,Xsε)b(s,Xs)εds=0tεb(s,Xsε)d𝒩~sε+0tZsεBsεds,\displaystyle\begin{split}Z_{t}^{\varepsilon}&=\int^{t}_{0}\sqrt{\varepsilon}b(s,X^{\varepsilon}_{s-}){\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s}+\int^{t}_{0}\frac{b(s,X^{\varepsilon}_{s})-b(s,X_{s})}{\sqrt{\varepsilon}}{\mathord{{\rm d}}}s\\ &=\int^{t}_{0}\sqrt{\varepsilon}b(s,X^{\varepsilon}_{s-}){\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s}+\int^{t}_{0}Z^{\varepsilon}_{s}B^{\varepsilon}_{s}{\mathord{{\rm d}}}s,\end{split} (2.9)

where

Bsε:=01b(s,θXsε+(1θ)Xs)dθ=01b(s,Xs+εθZsε)dθ.B^{\varepsilon}_{s}:=\int^{1}_{0}\nabla b(s,\theta X^{\varepsilon}_{s}+(1-\theta)X_{s}){\mathord{{\rm d}}}\theta=\int^{1}_{0}\nabla b(s,X_{s}+\sqrt{\varepsilon}\theta Z^{\varepsilon}_{s}){\mathord{{\rm d}}}\theta.

Note that as ε0\varepsilon\to 0,

𝔼eiξε𝒩~tε=exp{tε1(eiξε1)iξt/ε}eξ2t/2.{\mathbb{E}}\mathrm{e}^{{\rm i}\xi\sqrt{\varepsilon}\widetilde{\mathcal{N}}^{\varepsilon}_{t}}=\exp\big{\{}t\varepsilon^{-1}(\mathrm{e}^{{\rm i}\xi\sqrt{\varepsilon}}-1)-{\rm i}\xi t/\sqrt{\varepsilon}\big{\}}\to\mathrm{e}^{-\xi^{2}t/2}.

This implies that ε𝒩~tε\sqrt{\varepsilon}\widetilde{\mathcal{N}}^{\varepsilon}_{t} weakly converges to a one-dimensional standard Brownian motion WtW_{t}. Therefore, we formally have ZεZZ^{\varepsilon}\Rightarrow Z, where ZZ solves the following linear SDE:

Zt=0tb(s,Xs)dWs+0tZsb(s,Xs)ds.\displaystyle Z_{t}=\int^{t}_{0}b(s,X_{s}){\mathord{{\rm d}}}W_{s}+\int^{t}_{0}Z_{s}\cdot\nabla b(s,X_{s}){\mathord{{\rm d}}}s. (2.10)

Clearly, ZtZ_{t} is an OU process and its infinitesimal generator is given by

sf(z)=12tr((bb)(s,Xs)2f(z))+zb(s,Xs),f(z).\displaystyle{\mathscr{L}}_{s}f(z)=\tfrac{1}{2}\mathrm{tr}{\big{(}}(b\otimes b)(s,X_{s})\cdot\nabla^{2}f(z){\big{)}}+\langle z\cdot\nabla b(s,X_{s}),\nabla f(z)\rangle. (2.11)
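The characteristic function limit used above is easy to verify numerically. The following sketch is ours; the choices ξ = 1 and t = 1 are for illustration only. It evaluates the closed-form characteristic function of √ε Ñ^ε_t for decreasing ε and compares it with the Gaussian limit e^{−ξ²t/2}.

```python
import cmath
import math

def char_fn(xi, t, eps):
    """Characteristic function of sqrt(eps) * (N_{t/eps} - t/eps),
    i.e. exp{ t/eps * (e^{i xi sqrt(eps)} - 1) - i xi t / sqrt(eps) }."""
    return cmath.exp(t / eps * (cmath.exp(1j * xi * math.sqrt(eps)) - 1)
                     - 1j * xi * t / math.sqrt(eps))

xi, t = 1.0, 1.0
gauss = math.exp(-xi**2 * t / 2)          # the Gaussian limit e^{-xi^2 t / 2}
for eps in (1e-2, 1e-4, 1e-6):
    print(eps, abs(char_fn(xi, t, eps) - gauss))   # error shrinks with eps
```

The error decreases with ε, consistent with the weak convergence of √ε Ñ^ε to a standard Brownian motion.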
Proposition 2.4.

Let {\mathscr{L}} be given in (2.11) with bb being a bounded Lipschitz vector field. For any (s,z)+×d(s,z)\in{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}, there is a unique martingale solution sz(){\mathbb{P}}\in{\mathcal{M}}^{z}_{s}({\mathscr{L}}) in the sense of Definition 6.2 in the appendix. Moreover, {\mathbb{P}} concentrates on the space of continuous functions.

Proof.

Since the diffusion coefficient does not depend on zz and the drift is linear in zz, it is easy to see that for any (s,z)+×d(s,z)\in{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}, there is a unique martingale solution s,zsz(){\mathbb{P}}_{s,z}\in{\mathcal{M}}^{z}_{s}({\mathscr{L}}). Moreover, by Proposition 6.3 in the appendix, s,z{\mathbb{P}}_{s,z} concentrates on the space of continuous functions. ∎

Now we show the following functional CLT about the above ZεZ^{\varepsilon}.

Theorem 2.5.

Suppose that bb is bounded and Lipschitz continuous. Let 00(){\mathbb{P}}\in{\mathcal{M}}^{0}_{0}({\mathscr{L}}) be the unique martingale solution associated with {\mathscr{L}} starting from 0 at time 0. Let ε{\mathbb{P}}_{\varepsilon} be the law of Zε:=XεXεZ^{\varepsilon}:=\frac{X^{\varepsilon}-X}{\sqrt{\varepsilon}} in the space 𝔻{\mathbb{D}} of càdlàg functions, where XεX^{\varepsilon} is the unique solution of SDE (2.4) with the same fixed initial value X0=x0X_{0}=x_{0} as XX. Then we have

ε in 𝒫(𝔻).{\mathbb{P}}_{\varepsilon}\Rightarrow{\mathbb{P}}\mbox{ in ${\mathcal{P}}({\mathbb{D}})$}.
Proof.

First of all, for any fCb2(d)f\in C^{2}_{b}({\mathbb{R}}^{d}), by (2.9) and Itô’s formula, we have

f(Ztε)=f(0)+0t[Asεf(Zsε)+(ZsεBsε)f(Zsε)]ds+Mtε,\displaystyle f(Z^{\varepsilon}_{t})=f(0)+\int^{t}_{0}\Big{[}A^{\varepsilon}_{s}f(Z^{\varepsilon}_{s})+(Z^{\varepsilon}_{s}B^{\varepsilon}_{s})\cdot\nabla f(Z^{\varepsilon}_{s})\Big{]}{\mathord{{\rm d}}}s+M^{\varepsilon}_{t},

where Mtε:=0t(f(Zsε+εb(s,Xsε))f(Zsε))d𝒩~sεM^{\varepsilon}_{t}:=\int^{t}_{0}\big{(}f(Z^{\varepsilon}_{s-}+\sqrt{\varepsilon}b(s,X^{\varepsilon}_{s-}))-f(Z^{\varepsilon}_{s-})\big{)}{\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s} is a martingale and

Asεf(z)=f(z+εb(s,Xs))f(z)εb(s,Xs)f(z)ε.\displaystyle A^{\varepsilon}_{s}f(z)=\frac{f(z+\sqrt{\varepsilon}b(s,X_{s}))-f(z)-\sqrt{\varepsilon}b(s,X_{s})\cdot\nabla f(z)}{\varepsilon}.

Therefore, the infinitesimal generator of ZtεZ^{\varepsilon}_{t} is given by

s(ε)f(z):=Asεf(z)+(zBsε)f(z).{\mathscr{L}}^{(\varepsilon)}_{s}f(z):=A^{\varepsilon}_{s}f(z)+(zB^{\varepsilon}_{s})\cdot\nabla f(z).

From the very definition, it is easy to see that for any s,R>0s,R>0,

limε0sup|z|R|s(ε)f(z)sf(z)|=0.\displaystyle\lim_{\varepsilon\to 0}\sup_{|z|\leqslant R}|{\mathscr{L}}^{(\varepsilon)}_{s}f(z)-{\mathscr{L}}_{s}f(z)|=0. (2.12)

In fact, noting that by Taylor’s expansion,

Asεf(z)=01θ01tr((bb)(s,Xs)2f(z+θθεb(s,Xs)))dθdθ,A^{\varepsilon}_{s}f(z)=\int^{1}_{0}\theta\int^{1}_{0}\mathrm{tr}\Big{(}(b\otimes b)(s,X_{s})\cdot\nabla^{2}f\big{(}z+\theta\theta^{\prime}\sqrt{\varepsilon}b(s,X_{s})\big{)}\Big{)}{\mathord{{\rm d}}}\theta{\mathord{{\rm d}}}\theta^{\prime},

one sees that for each s>0s>0,

limε0sup|z|R|Asεf(z)12tr((bb)(s,Xs)2f(z))|=0.\displaystyle\lim_{\varepsilon\to 0}\sup_{|z|\leqslant R}\Big{|}A^{\varepsilon}_{s}f(z)-\tfrac{1}{2}\mathrm{tr}{\big{(}}(b\otimes b)(s,X_{s})\cdot\nabla^{2}f(z){\big{)}}\Big{|}=0.

Moreover, by the definition of BsεB^{\varepsilon}_{s}, we clearly have

limε0|Bsεb(s,Xs)|=0.\lim_{\varepsilon\to 0}|B^{\varepsilon}_{s}-\nabla b(s,X_{s})|=0.

Thus we have (2.12). On the other hand, by (2.9) and Gronwall’s lemma, it is easy to see that for some C>0C>0,

supε(0,1)𝔼(supt[0,T]|Ztε|2)C,\sup_{\varepsilon\in(0,1)}{\mathbb{E}}\left(\sup_{t\in[0,T]}|Z_{t}^{\varepsilon}|^{2}\right)\leqslant C,

and for any stopping time τ\tau and δ>0\delta>0,

supε(0,1)𝔼(supt[0,δ]|Zτ+tεZτε|2)Cδ.\displaystyle\sup_{\varepsilon\in(0,1)}{\mathbb{E}}\left(\sup_{t\in[0,\delta]}|Z^{\varepsilon}_{\tau+t}-Z^{\varepsilon}_{\tau}|^{2}\right)\leqslant C\delta.

Thus, by Aldous’ criterion (see [23, p. 356, Theorem 4.5]), (ε)ε(0,1)({\mathbb{P}}_{\varepsilon})_{\varepsilon\in(0,1)} is tight. Let 0{\mathbb{P}}_{0} be any accumulation point. By (2.12) and Theorem 6.4 in the appendix, 000(){\mathbb{P}}_{0}\in{\mathcal{M}}^{0}_{0}({\mathscr{L}}). By the uniqueness (see Proposition 2.4), one has 0={\mathbb{P}}_{0}={\mathbb{P}}. The proof is complete. ∎

Remark 2.6.

We emphasize that in the above theorem, the initial value is a nonrandom fixed point. We shall consider the general random initial value in Theorem 2.19 below.

2.2. Filippov solutions for ODEs with one-sided Lipschitz coefficients

In this section, our focus is on the Poisson process approximation for the ODE (2.3) with one-sided Lipschitz coefficients. We will explore the convergence properties and effectiveness of this approximation scheme in this setting.

  1. (Hb)

    We assume that for some κ>0\kappa>0 and all (s,x,y)+×d×d(s,x,y)\in{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}\times{\mathbb{R}}^{d},

    xy,b(s,x)b(s,y)κ|xy|2,|b(s,x)|κ(1+|x|).\langle x-y,b(s,x)-b(s,y)\rangle\leqslant\kappa|x-y|^{2},\ \ |b(s,x)|\leqslant\kappa(1+|x|).

Since xb(s,x)x\mapsto b(s,x) may fail to be continuous, assumption (Hb) does not guarantee the existence of a solution to the ODE (2.3) in the classical sense. In such cases, Filippov [14] introduced a notion of solution in the sense of differential inclusions, which yields a unique solution to the ODE (2.3). This notion is closely connected to the study of differential inclusions as discussed in [2].

To define Filippov solutions, we introduce the supporting function HbH_{b} of bb, defined by

Hb(t,x,w):=limδ0esssup|yx|δb(t,y),w,H_{b}(t,x,w):=\lim_{\delta\downarrow 0}{\rm ess}\!\!\!\!\!\sup_{|y-x|\leqslant\delta}\langle b(t,y),w\rangle,

where the essential supremum is taken with respect to the Lebesgue measure. The essential convex hull of bb is then given by

At,xb:={yd:y,wHb(t,x,w),wd}.A^{b}_{t,x}:=\{y\in{\mathbb{R}}^{d}:\langle y,w\rangle\leqslant H_{b}(t,x,w),w\in{\mathbb{R}}^{d}\}.

Note that At,xbA^{b}_{t,x} is a closed convex subset and Hb(t,x,)H_{b}(t,x,\cdot) is precisely the support function of At,xbA^{b}_{t,x}.

Definition 2.7.

We call an absolutely continuous curve (Xt)t0(X_{t})_{t\geqslant 0} in d{\mathbb{R}}^{d} a Filippov solution of ODE (2.3) starting from x0x_{0} if X0=x0X_{0}=x_{0} and for Lebesgue almost all t0t\geqslant 0,

X˙tAt,Xtb.\dot{X}_{t}\in A^{b}_{t,X_{t}}.

In [14], Filippov proved the following result (see also [20, Theorem 1.42]).

Theorem 2.8.

Under (Hb), for any starting point X0=x0X_{0}=x_{0}, there is a unique Filippov solution (Xt(x0))t0(X_{t}(x_{0}))_{t\geqslant 0} to ODE (2.3). Moreover, for any x0,x0dx_{0},x_{0}^{\prime}\in{\mathbb{R}}^{d} and t0t\geqslant 0,

|Xt(x0)Xt(x0)|e2κt|x0x0|.\displaystyle|X_{t}(x_{0})-X_{t}(x^{\prime}_{0})|\leqslant\mathrm{e}^{2\kappa t}|x_{0}-x_{0}^{\prime}|. (2.13)

Let bδ(t,x):=b(t,)ρδ(x)b_{\delta}(t,x):=b(t,\cdot)*\rho_{\delta}(x) be the mollifier approximation of bb, where ρδ(x)=δdρ(x/δ)\rho_{\delta}(x)=\delta^{-d}\rho(x/\delta) and ρ\rho is a smooth density function with compact support. Let Xδ(x0)X^{\delta}(x_{0}) be the unique solution of ODE (2.3) corresponding to bδb_{\delta} and starting from x0x_{0}. Then for any T>0T>0, we have

limδ0supt[0,T]|Xtδ(x0)Xt(x0)|=0.\displaystyle\lim_{\delta\to 0}\sup_{t\in[0,T]}|X^{\delta}_{t}(x_{0})-X_{t}(x_{0})|=0. (2.14)

The existence of a Filippov solution can be established through a compactness argument, while the uniqueness follows from the one-sided Lipschitz condition. Remarkably, under assumption (Hb) the Filippov solution of the ODE (2.3) coincides with the LpL^{p}-limit of XεX^{\varepsilon}. This result is particularly significant because it provides an explicit time discretization scheme for Filippov solutions. To prove it, we begin with a simple convergence estimate in the case where bb is continuous in xx.
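As a toy illustration of this point (our own example, not taken from the paper), consider the discontinuous drift b(x) = −sign(x), which satisfies (Hb) with κ = 1; the Filippov solution starting from x_0 = 1 is X_t = max(1 − t, 0). The scheme (2.4) can be applied to this b directly, without any mollification.

```python
import random

def sign(x):
    return (x > 0) - (x < 0)

def poisson_scheme(b, x0, T, eps, seed=0):
    """Random-step Euler scheme (2.4) for a (possibly discontinuous) drift b."""
    rng = random.Random(seed)
    s, x = 0.0, x0
    while True:
        s += eps * rng.expovariate(1.0)   # exponential step of mean eps
        if s > T:
            return x
        x += eps * b(x)

# Filippov solution from x0 = 1: X_t = max(1 - t, 0), so X_2 = 0.
x2 = poisson_scheme(lambda x: -sign(x), x0=1.0, T=2.0, eps=1e-3)
print(x2)   # chatters in a band of width about eps around 0 after hitting zero
```

Once the trajectory reaches zero it oscillates within a band of width of order ε, mirroring the differential inclusion Ẋ_t ∈ A^b_{t,X_t}.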

Lemma 2.9.

Let X0ε=ξp>1Lp(Ω,0,)X^{\varepsilon}_{0}=\xi\in\cap_{p>1}L^{p}(\Omega,{\mathcal{F}}_{0},{\mathbb{P}}). Suppose that (Hb) holds and that, for each t0t\geqslant 0, xb(t,x)x\mapsto b(t,x) is continuous. Then for any T>0T>0 and p1p\geqslant 1, there is a constant C=C(κ,d,T,p)>0C=C(\kappa,d,T,p)>0 such that

supε(0,1)𝔼(supt[0,T]|Xtε|p)C(1+𝔼|ξ|p),\displaystyle\sup_{\varepsilon\in(0,1)}{\mathbb{E}}\left(\sup_{t\in[0,T]}|X^{\varepsilon}_{t}|^{p}\right)\leqslant C(1+{\mathbb{E}}|\xi|^{p}), (2.15)

and for all ε(0,1)\varepsilon\in(0,1),

𝔼(supt[0,T]|XtεXt|2p)C(1+𝔼|ξ|2p)εp,\displaystyle{\mathbb{E}}\left(\sup_{t\in[0,T]}|X^{\varepsilon}_{t}-X_{t}|^{2p}\right)\leqslant C(1+{\mathbb{E}}|\xi|^{2p})\varepsilon^{p}, (2.16)

where XX is the unique solution of ODE (2.3) starting from ξ\xi.

Proof.

For p1p\geqslant 1, by Itô’s formula and |x+y|p|x|pp|y|(|x|+|y|)p1|x+y|^{p}-|x|^{p}\leqslant p|y|(|x|+|y|)^{p-1}, we have

|Xtε|p\displaystyle|X^{\varepsilon}_{t}|^{p} =|ξ|p+0t(|Xsε+εb(s,Xsε)|p|Xsε|p)d𝒩sε\displaystyle=|\xi|^{p}+\int^{t}_{0}(|X^{\varepsilon}_{s-}+\varepsilon b(s,X^{\varepsilon}_{s-})|^{p}-|X^{\varepsilon}_{s-}|^{p}){\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s}
|ξ|p+pε0t|b(s,Xsε)|(|Xsε|+ε|b(s,Xsε)|)p1d𝒩sε.\displaystyle\leqslant|\xi|^{p}+p\varepsilon\int^{t}_{0}|b(s,X^{\varepsilon}_{s-})|\big{(}|X^{\varepsilon}_{s-}|+\varepsilon|b(s,X^{\varepsilon}_{s-})|\big{)}^{p-1}{\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s}.

Hence, by the linear growth of bb in xx,

𝔼(sups[0,t]|Xsε|p)\displaystyle{\mathbb{E}}\left(\sup_{s\in[0,t]}|X^{\varepsilon}_{s}|^{p}\right) 𝔼|ξ|p+pε𝔼(0t|b(s,Xsε)|(|Xsε|+ε|b(s,Xsε)|)p1d𝒩sε)\displaystyle\leqslant{\mathbb{E}}|\xi|^{p}+p\varepsilon{\mathbb{E}}\left(\int^{t}_{0}|b(s,X^{\varepsilon}_{s})|\big{(}|X^{\varepsilon}_{s}|+\varepsilon|b(s,X^{\varepsilon}_{s})|\big{)}^{p-1}{\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s}\right)
=𝔼|ξ|p+p𝔼(0t|b(s,Xsε)|(|Xsε|+ε|b(s,Xsε)|)p1ds)\displaystyle={\mathbb{E}}|\xi|^{p}+p{\mathbb{E}}\left(\int^{t}_{0}|b(s,X^{\varepsilon}_{s})|\big{(}|X^{\varepsilon}_{s}|+\varepsilon|b(s,X^{\varepsilon}_{s})|\big{)}^{p-1}{\mathord{{\rm d}}}s\right)
𝔼|ξ|p+C𝔼(0t(1+|Xsε|p)ds),\displaystyle\leqslant{\mathbb{E}}|\xi|^{p}+C{\mathbb{E}}\left(\int^{t}_{0}(1+|X^{\varepsilon}_{s}|^{p}){\mathord{{\rm d}}}s\right),

which implies the first estimate by Gronwall’s inequality.

Next, we look at (2.16). Since b(t,x)b(t,x) is continuous in xx for each t>0t>0, it is well-known that there is a unique classical solution to ODE (2.3) under (Hb). Note that

Ztε:=XtεXt=0tεb(s,Xsε)d𝒩~sε+0t[b(s,Xsε)b(s,Xs)]ds.Z^{\varepsilon}_{t}:=X^{\varepsilon}_{t}-X_{t}=\int^{t}_{0}\varepsilon b(s,X^{\varepsilon}_{s-}){\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s}+\int^{t}_{0}\Big{[}b(s,X^{\varepsilon}_{s})-b(s,X_{s})\Big{]}{\mathord{{\rm d}}}s.

By Itô’s formula and (Hb), we have

|Ztε|2\displaystyle|Z^{\varepsilon}_{t}|^{2} =20tZsε,b(s,Xsε)b(s,Xs)ds+0t(|Zsε+εb(s,Xsε)|2|Zsε|2)d𝒩~sε\displaystyle=2\int^{t}_{0}\langle Z^{\varepsilon}_{s},b(s,X^{\varepsilon}_{s})-b(s,X_{s})\rangle{\mathord{{\rm d}}}s+\int^{t}_{0}(|Z^{\varepsilon}_{s-}+\varepsilon b(s,X^{\varepsilon}_{s-})|^{2}-|Z^{\varepsilon}_{s-}|^{2}){\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s}
+0t(|Zsε+εb(s,Xsε)|2|Zsε|22εb(s,Xsε),Zsε)d(sε)\displaystyle\quad+\int^{t}_{0}(|Z^{\varepsilon}_{s}+\varepsilon b(s,X^{\varepsilon}_{s})|^{2}-|Z^{\varepsilon}_{s}|^{2}-2\varepsilon\langle b(s,X^{\varepsilon}_{s}),Z^{\varepsilon}_{s}\rangle){\mathord{{\rm d}}}{\big{(}}\frac{s}{\varepsilon}{\big{)}}
2κ0t|Zsε|2ds+0t(2εb(s,Xsε),Zsε+ε2|b(s,Xsε)|2)d𝒩~sε+ε0t|b(s,Xsε)|2ds.\displaystyle\leqslant 2\kappa\int^{t}_{0}|Z^{\varepsilon}_{s}|^{2}{\mathord{{\rm d}}}s+\int^{t}_{0}\Big{(}2\varepsilon\langle b(s,X^{\varepsilon}_{s-}),Z^{\varepsilon}_{s-}\rangle+\varepsilon^{2}|b(s,X^{\varepsilon}_{s-})|^{2}\Big{)}{\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s}+\varepsilon\int^{t}_{0}|b(s,X^{\varepsilon}_{s})|^{2}{\mathord{{\rm d}}}s.

Hence, by Gronwall’s inequality, (2.15) and BDG’s inequality, we get for p2p\geqslant 2,

𝔼(sups[0,t]|Zsε|2p)\displaystyle{\mathbb{E}}\left(\sup_{s\in[0,t]}|Z^{\varepsilon}_{s}|^{2p}\right) 𝔼|0t(2εb(s,Xsε),Zsε+ε2|b(s,Xsε)|2)d𝒩~sε|p+(1+𝔼|ξ|2p)εp\displaystyle\lesssim{\mathbb{E}}\left|\int^{t}_{0}\Big{(}2\varepsilon\langle b(s,X^{\varepsilon}_{s-}),Z^{\varepsilon}_{s-}\rangle+\varepsilon^{2}|b(s,X^{\varepsilon}_{s-})|^{2}\Big{)}{\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s}\right|^{p}+(1+{\mathbb{E}}|\xi|^{2p})\varepsilon^{p}
𝔼(0t|2εb(s,Xsε),Zsε+ε2|b(s,Xsε)|2|2d(sε))p2\displaystyle\lesssim{\mathbb{E}}\left(\int^{t}_{0}\Big{|}2\varepsilon\langle b(s,X^{\varepsilon}_{s}),Z^{\varepsilon}_{s}\rangle+\varepsilon^{2}|b(s,X^{\varepsilon}_{s})|^{2}\Big{|}^{2}{\mathord{{\rm d}}}(\tfrac{s}{\varepsilon})\right)^{\frac{p}{2}}
+𝔼0t|2εb(s,Xsε),Zsε+ε2|b(s,Xsε)|2|pd(sε)+(1+𝔼|ξ|2p)εp\displaystyle+{\mathbb{E}}\int^{t}_{0}\Big{|}2\varepsilon\langle b(s,X^{\varepsilon}_{s}),Z^{\varepsilon}_{s}\rangle+\varepsilon^{2}|b(s,X^{\varepsilon}_{s})|^{2}\Big{|}^{p}{\mathord{{\rm d}}}(\tfrac{s}{\varepsilon})+(1+{\mathbb{E}}|\xi|^{2p})\varepsilon^{p}
𝔼(0t(|Zsε|4+ε2(1+|Xsε|4))ds)p2\displaystyle\lesssim{\mathbb{E}}\left(\int^{t}_{0}\Big{(}|Z^{\varepsilon}_{s}|^{4}+\varepsilon^{2}(1+|X^{\varepsilon}_{s}|^{4})\Big{)}{\mathord{{\rm d}}}s\right)^{\frac{p}{2}}
+𝔼0t(|Zsε|2p+εp(1+|Xsε|2p))ds+(1+𝔼|ξ|2p)εp\displaystyle+{\mathbb{E}}\int^{t}_{0}\Big{(}|Z^{\varepsilon}_{s}|^{2p}+\varepsilon^{p}(1+|X^{\varepsilon}_{s}|^{2p})\Big{)}{\mathord{{\rm d}}}s+(1+{\mathbb{E}}|\xi|^{2p})\varepsilon^{p}
0t𝔼|Zsε|2pds+(1+𝔼|ξ|2p)εp,\displaystyle\lesssim\int^{t}_{0}{\mathbb{E}}|Z^{\varepsilon}_{s}|^{2p}{\mathord{{\rm d}}}s+(1+{\mathbb{E}}|\xi|^{2p})\varepsilon^{p},

which in turn implies the desired estimate (2.16). ∎

Next we show the continuous dependence of XεX^{\varepsilon} with respect to bb and the initial values.

Lemma 2.10.

(i) Let X0ε=ξp>1Lp(Ω,0,)X^{\varepsilon}_{0}=\xi\in\cap_{p>1}L^{p}(\Omega,{\mathcal{F}}_{0},{\mathbb{P}}) and Xε,δX^{\varepsilon,\delta} be the solution of SDE (2.4) corresponding to bδb_{\delta}, where bδb_{\delta} is the smooth approximation of bb as in Theorem 2.8. Suppose that (Hb) holds and that ξ\xi has a density with respect to the Lebesgue measure. Then for any T>0T>0 and p1p\geqslant 1, there is a constant C=C(κ,d,T,p)>0C=C(\kappa,d,T,p)>0 such that for all ε(0,1)\varepsilon\in(0,1),

limδ0𝔼(supt[0,T]|Xtε,δXtε|2p)C(1+𝔼|ξ|2p)εp.\displaystyle\lim_{\delta\to 0}{\mathbb{E}}\left(\sup_{t\in[0,T]}|X^{\varepsilon,\delta}_{t}-X^{\varepsilon}_{t}|^{2p}\right)\leqslant C(1+{\mathbb{E}}|\xi|^{2p})\varepsilon^{p}. (2.17)

(ii) Let ξ,ξ~p>1Lp(Ω,0,)\xi,\widetilde{\xi}\in\cap_{p>1}L^{p}(\Omega,{\mathcal{F}}_{0},{\mathbb{P}}) and XεX^{\varepsilon}, X~ε\widetilde{X}^{\varepsilon} be the solutions of SDE (2.4) corresponding to the initial values ξ\xi and ξ~\widetilde{\xi}, respectively. Under (Hb), for any T>0T>0 and p1p\geqslant 1, there is a constant C=C(κ,d,T,p)>0C=C(\kappa,d,T,p)>0 such that for all ε(0,1)\varepsilon\in(0,1),

𝔼(supt[0,T]|XtεX~tε|2p)C𝔼|ξξ~|2p+C(1+𝔼|ξ|2p+𝔼|ξ~|2p)εp.\displaystyle{\mathbb{E}}\left(\sup_{t\in[0,T]}|X^{\varepsilon}_{t}-\tilde{X}^{\varepsilon}_{t}|^{2p}\right)\leqslant C{\mathbb{E}}|\xi-\widetilde{\xi}|^{2p}+C(1+{\mathbb{E}}|\xi|^{2p}+{\mathbb{E}}|\widetilde{\xi}|^{2p})\varepsilon^{p}. (2.18)
Proof.

We will only prove (i) since (ii) follows in the same manner. Note that

Zt:=Xtε,δXtε\displaystyle Z_{t}:=X^{\varepsilon,\delta}_{t}-X^{\varepsilon}_{t} =0tε(bδ(s,Xsε,δ)b(s,Xsε))d𝒩sε=0t[εBsε,δ+εgsδ(Xsε)]d𝒩sε,\displaystyle=\int^{t}_{0}\varepsilon(b_{\delta}(s,X^{\varepsilon,\delta}_{s-})-b(s,X^{\varepsilon}_{s-})){\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s}=\int^{t}_{0}\Big{[}\varepsilon B^{\varepsilon,\delta}_{s}+\varepsilon g^{\delta}_{s}(X^{\varepsilon}_{s-})\Big{]}{\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s},

where

Bsε,δ:=bδ(s,Xsε,δ)bδ(s,Xsε),gsδ(x):=(bδb)(s,x).B^{\varepsilon,\delta}_{s}:=b_{\delta}(s,X^{\varepsilon,\delta}_{s-})-b_{\delta}(s,X^{\varepsilon}_{s-}),\ \ g^{\delta}_{s}(x):=(b_{\delta}-b)(s,x).

By Itô’s formula and (Hb), we have

|Zt|2\displaystyle|Z_{t}|^{2} =0t(|Zs+εBsε,δ+εgsδ(Xsε)|2|Zs|2)d𝒩sε\displaystyle=\int^{t}_{0}(|Z_{s-}+\varepsilon B^{\varepsilon,\delta}_{s}+\varepsilon g^{\delta}_{s}(X^{\varepsilon}_{s-})|^{2}-|Z_{s-}|^{2}){\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s}
=0t(2εBsε,δ+gsδ(Xsε),Zs+ε2|Bsε,δ+gsδ(Xsε)|2)d𝒩sε\displaystyle=\int^{t}_{0}\Big{(}2\varepsilon\langle B^{\varepsilon,\delta}_{s}+g^{\delta}_{s}(X^{\varepsilon}_{s-}),Z_{s-}\rangle+\varepsilon^{2}|B^{\varepsilon,\delta}_{s}+g^{\delta}_{s}(X^{\varepsilon}_{s-})|^{2}\Big{)}{\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s}
0tε((2κ+1)|Zs|2+|gsδ(Xsε)|2+2ε(|Bsε,δ|2+|gsδ(Xsε)|2))d𝒩sε\displaystyle\leqslant\int^{t}_{0}\varepsilon\Big{(}(2\kappa+1)|Z_{s-}|^{2}+|g^{\delta}_{s}(X^{\varepsilon}_{s-})|^{2}+2\varepsilon(|B^{\varepsilon,\delta}_{s}|^{2}+|g^{\delta}_{s}(X^{\varepsilon}_{s-})|^{2})\Big{)}{\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s}
=0tε((2κ+1)|Zs|2+|gsδ(Xsε)|2+2ε(|Bsε,δ|2+|gsδ(Xsε)|2))d𝒩~sε\displaystyle=\int^{t}_{0}\varepsilon\Big{(}(2\kappa+1)|Z_{s-}|^{2}+|g^{\delta}_{s}(X^{\varepsilon}_{s-})|^{2}+2\varepsilon(|B^{\varepsilon,\delta}_{s}|^{2}+|g^{\delta}_{s}(X^{\varepsilon}_{s-})|^{2})\Big{)}{\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s}
+0t((2κ+1)|Zs|2+|gsδ(Xsε)|2+2ε(|Bsε,δ|2+|gsδ(Xsε)|2))ds.\displaystyle\quad+\int^{t}_{0}\Big{(}(2\kappa+1)|Z_{s}|^{2}+|g^{\delta}_{s}(X^{\varepsilon}_{s})|^{2}+2\varepsilon(|B^{\varepsilon,\delta}_{s}|^{2}+|g^{\delta}_{s}(X^{\varepsilon}_{s})|^{2})\Big{)}{\mathord{{\rm d}}}s.

Hence, for p2p\geqslant 2, by BDG’s inequality and (2.15), we have

𝔼(sups[0,t]|Zs|2p)\displaystyle{\mathbb{E}}\left(\sup_{s\in[0,t]}|Z_{s}|^{2p}\right) 𝔼[0tε((2κ+1)|Zs|2+|gsδ(Xsε)|2+2ε(|Bsε,δ|2+|gsδ(Xsε)|2))2ds]p/2\displaystyle\lesssim{\mathbb{E}}\left[\int^{t}_{0}\varepsilon\Big{(}(2\kappa+1)|Z_{s}|^{2}+|g^{\delta}_{s}(X^{\varepsilon}_{s})|^{2}+2\varepsilon(|B^{\varepsilon,\delta}_{s}|^{2}+|g^{\delta}_{s}(X^{\varepsilon}_{s})|^{2})\Big{)}^{2}{\mathord{{\rm d}}}s\right]^{p/2}
+𝔼[0tεp1((2κ+1)|Zs|2+|gsδ(Xsε)|2+2ε(|Bsε,δ|2+|gsδ(Xsε)|2))pds]\displaystyle\quad+{\mathbb{E}}\left[\int^{t}_{0}\varepsilon^{p-1}\Big{(}(2\kappa+1)|Z_{s}|^{2}+|g^{\delta}_{s}(X^{\varepsilon}_{s})|^{2}+2\varepsilon(|B^{\varepsilon,\delta}_{s}|^{2}+|g^{\delta}_{s}(X^{\varepsilon}_{s})|^{2})\Big{)}^{p}{\mathord{{\rm d}}}s\right]
+𝔼[0t((2κ+1)|Zs|2+|gsδ(Xsε)|2+2ε(|Bsε,δ|2+|gsδ(Xsε)|2))ds]p\displaystyle\quad+{\mathbb{E}}\left[\int^{t}_{0}\Big{(}(2\kappa+1)|Z_{s}|^{2}+|g^{\delta}_{s}(X^{\varepsilon}_{s})|^{2}+2\varepsilon(|B^{\varepsilon,\delta}_{s}|^{2}+|g^{\delta}_{s}(X^{\varepsilon}_{s})|^{2})\Big{)}{\mathord{{\rm d}}}s\right]^{p}
0t𝔼|Zs|2pds+0t𝔼|gsδ(Xsε)|2pds+(1+𝔼|ξ|2p)εp,\displaystyle\lesssim\int^{t}_{0}{\mathbb{E}}|Z_{s}|^{2p}{\mathord{{\rm d}}}s+\int^{t}_{0}{\mathbb{E}}|g^{\delta}_{s}(X^{\varepsilon}_{s})|^{2p}{\mathord{{\rm d}}}s+(1+{\mathbb{E}}|\xi|^{2p})\varepsilon^{p},

where in the last step we have used the linear growth of bb and estimate (2.15), and the implicit constant only depends on κ,d,T,p\kappa,d,T,p. By Gronwall’s inequality, we get

𝔼(supt[0,T]|Zt|2p)0T𝔼|gsδ(Xsε)|2pds+(1+𝔼|ξ|2p)εp.\displaystyle{\mathbb{E}}\left(\sup_{t\in[0,T]}|Z_{t}|^{2p}\right)\lesssim\int^{T}_{0}{\mathbb{E}}|g^{\delta}_{s}(X^{\varepsilon}_{s})|^{2p}{\mathord{{\rm d}}}s+(1+{\mathbb{E}}|\xi|^{2p})\varepsilon^{p}.

Since for fixed ε(0,1)\varepsilon\in(0,1) and s[0,T]s\in[0,T], the law of XsεX^{\varepsilon}_{s} is absolutely continuous with respect to the Lebesgue measure, by the dominated convergence theorem, we have

limδ00T𝔼|gsδ(Xsε)|2pds=0Tdlimδ0|bδb|2p(s,x)ρsε(x)dxds=0,\lim_{\delta\to 0}\int^{T}_{0}{\mathbb{E}}|g^{\delta}_{s}(X^{\varepsilon}_{s})|^{2p}{\mathord{{\rm d}}}s=\int^{T}_{0}\!\!\!\int_{{\mathbb{R}}^{d}}\lim_{\delta\to 0}|b_{\delta}-b|^{2p}(s,x)\rho^{\varepsilon}_{s}(x){\mathord{{\rm d}}}x{\mathord{{\rm d}}}s=0,

where ρsε(x)\rho^{\varepsilon}_{s}(x) is the density of XsεX^{\varepsilon}_{s}. Thus we obtain the limit (2.17). ∎

Now we can show the following main result of this section.

Theorem 2.11.

Let ξp>1Lp(Ω,0,)\xi\in\cap_{p>1}L^{p}(\Omega,{\mathcal{F}}_{0},{\mathbb{P}}) and (Xt)t0(X_{t})_{t\geqslant 0} be the unique Filippov solution of ODE (2.3) with X0=ξX_{0}=\xi. Then for any T>0T>0 and p1p\geqslant 1, there is a constant C=C(κ,d,T,p)>0C=C(\kappa,d,T,p)>0 such that for all ε(0,1)\varepsilon\in(0,1),

𝔼(supt[0,T]|XtεXt|2p)C(1+𝔼|ξ|2p)εp.\displaystyle{\mathbb{E}}\left(\sup_{t\in[0,T]}|X^{\varepsilon}_{t}-X_{t}|^{2p}\right)\leqslant C(1+{\mathbb{E}}|\xi|^{2p})\varepsilon^{p}. (2.19)
Proof.

We divide the proof into two steps.

(Step 1). In this step we assume that ξ\xi has a density. Let Xtε,δX^{\varepsilon,\delta}_{t} be the unique solution of ODE (2.4) corresponding to bδb_{\delta} and starting from ξ\xi. By (2.16), for any T>0T>0 and p1p\geqslant 1, there is a constant C=C(κ,d,T,p)>0C=C(\kappa,d,T,p)>0 such that for any ε(0,1)\varepsilon\in(0,1),

𝔼(supt[0,T]|Xtε,δXtδ|2p)C(1+𝔼|ξ|2p)εp.{\mathbb{E}}\left(\sup_{t\in[0,T]}|X^{\varepsilon,\delta}_{t}-X^{\delta}_{t}|^{2p}\right)\leqslant C(1+{\mathbb{E}}|\xi|^{2p})\varepsilon^{p}.

By (2.14), (2.17) and taking limits δ0\delta\to 0, we get (2.19).

(Step 2). Now let ξp>1Lp(Ω,0,)\xi\in\cap_{p>1}L^{p}(\Omega,{\mathcal{F}}_{0},{\mathbb{P}}) be general. Let η0\eta\in{\mathcal{F}}_{0} be a standard normal random variable independent of ξ\xi. Define

ξδ:=ξ+δη,δ>0.\xi_{\delta}:=\xi+\delta\eta,\ \ \delta>0.

Clearly, ξδp>1Lp(Ω,0,)\xi_{\delta}\in\cap_{p>1}L^{p}(\Omega,{\mathcal{F}}_{0},{\mathbb{P}}) has a density and for any p1p\geqslant 1,

𝔼|ξδ|pC(1+𝔼|ξ|p),limδ0𝔼|ξδξ|p=0.{\mathbb{E}}|\xi_{\delta}|^{p}\leqslant C(1+{\mathbb{E}}|\xi|^{p}),\ \ \lim_{\delta\to 0}{\mathbb{E}}|\xi_{\delta}-\xi|^{p}=0.

Let X~ε,δ\widetilde{X}^{\varepsilon,\delta} be the unique solution of ODE (2.4) with X~0ε,δ=ξδ\widetilde{X}^{\varepsilon,\delta}_{0}=\xi_{\delta} and X~δ\widetilde{X}^{\delta} be the unique Filippov solution of ODE (2.3) with X~0δ=ξδ\widetilde{X}^{\delta}_{0}=\xi_{\delta}. By what we have proved in Step 1, we have

𝔼(supt[0,T]|X~tε,δX~tδ|2p)C(1+𝔼|ξδ|2p)εpC(1+𝔼|ξ|2p)εp.{\mathbb{E}}\left(\sup_{t\in[0,T]}|\widetilde{X}^{\varepsilon,\delta}_{t}-\widetilde{X}^{\delta}_{t}|^{2p}\right)\leqslant C(1+{\mathbb{E}}|\xi_{\delta}|^{2p})\varepsilon^{p}\leqslant C(1+{\mathbb{E}}|\xi|^{2p})\varepsilon^{p}.

By (2.13) and (2.18), taking limits δ0\delta\to 0, we obtain (2.19). ∎

Remark 2.12.

Theorem 2.11 presents a specific discretized SDE approximation for the ODE (2.3) under the assumption of one-sided Lipschitz conditions. This result offers a practical and computationally efficient scheme for approximating the solutions of the ODE using SDEs.
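The scheme behind Theorem 2.11 is straightforward to implement. The following Python sketch is our own illustration (the drift b(x) = -sign(x) and all parameter values are illustrative choices, not from the paper): it simulates X^ε_t = ξ + ∫₀ᵗ ε b(X^ε_{s-}) d𝒩^ε_s by drawing the exponential waiting times of the Poisson process 𝒩^ε of rate 1/ε. From ξ = 1 the Filippov solution is X_t = max(1 - t, 0), so X_2 = 0, and (2.19) suggests an error of order √ε.

```python
import random

def poisson_ode(b, x0, T, eps, rng):
    """X_t = x0 + int_0^t eps * b(X_{s-}) dN^eps_s, N^eps Poisson with rate 1/eps."""
    t, x = 0.0, x0
    while True:
        t += rng.expovariate(1.0 / eps)  # inter-jump times ~ Exp(mean eps)
        if t > T:
            return x
        x += eps * b(x)                  # jump of size eps * b(X_{t-})

sign = lambda v: (v > 0) - (v < 0)
b = lambda v: -sign(v)  # one-sided Lipschitz (kappa = 0), discontinuous at 0

rng = random.Random(0)
# Filippov solution from x0 = 1: X_t = max(1 - t, 0), so X_2 = 0.
err = sum(abs(poisson_ode(b, 1.0, 2.0, 1e-3, rng)) for _ in range(200)) / 200
```

With ε = 10⁻³ the trajectory oscillates within a band of width about ε around 0 once it reaches the discontinuity, so the mean absolute error is far below the √ε-rate guaranteed by (2.19).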

2.3. DiPerna-Lions solutions for ODEs with 𝕎1,q{\mathbb{W}}^{1,q}-coefficients

In this section, we focus on the ODE in the sense of DiPerna-Lions. In this case, the coefficient is permitted to belong to the Sobolev space 𝕎1,q{\mathbb{W}}^{1,q}, but the initial value is assumed to possess a density. Specifically, we make the following assumption:

  1. (Hqb{}^{b}_{q})

    bb is bounded measurable, and for some q[1,]q\in[1,\infty] and each R>0R>0, there is a Borel measurable function fR(s,x)Llocq(+×d)f_{R}(s,x)\in L^{q}_{loc}({\mathbb{R}}_{+}\times{\mathbb{R}}^{d}) such that for Lebesgue almost all (s,x,y)+×BR×BR(s,x,y)\in{\mathbb{R}}_{+}\times B_{R}\times B_{R},

    xy,b(s,x)b(s,y)fR(s,y)|xy|2.\displaystyle\langle x-y,b(s,x)-b(s,y)\rangle\leqslant f_{R}(s,y)|x-y|^{2}. (2.20)

We first show the following result.

Theorem 2.13.

Let X00X_{0}\in{\mathcal{F}}_{0} with 𝔼|X0|<{\mathbb{E}}|X_{0}|<\infty. Suppose that (Hqb{}^{b}_{q}) holds, and ODE (2.3) admits a solution XtX_{t} with initial value X0X_{0} and XtX_{t} has a density ρt(x)Llocp(+×d)\rho_{t}(x)\in L^{p}_{loc}({\mathbb{R}}_{+}\times{\mathbb{R}}^{d}), where p=qq1p=\frac{q}{q-1}. Then for any T>0T>0, there is a constant CT>0C_{T}>0 such that for all R1R\geqslant 1 and ε,h(0,1)\varepsilon,h\in(0,1),

(supt[0,T]|XtεXt|h)CT(1+b2R2+fRLq([0,T]×BR)ρLp([0,T]×BR)+1log(1+h2/(36εb2))).{\mathbb{P}}\left(\sup_{t\in[0,T]}|X^{\varepsilon}_{t}-X_{t}|\geqslant h\right)\leqslant C_{T}\left(\frac{1+\|b\|^{2}_{\infty}}{R^{2}}+\frac{\|f_{R}\|_{L^{q}([0,T]\times B_{R})}\|\rho\|_{L^{p}([0,T]\times B_{R})}+1}{\log(1+h^{2}/(36\varepsilon\|b\|^{2}_{\infty}))}\right).
Proof.

We follow the proof in [34]. By (2.6) we have

Ztε:=XtεXt=0t(b(s,Xsε)b(s,Xs))ds+0tεb(s,Xsε)d𝒩~sε.Z^{\varepsilon}_{t}:=X^{\varepsilon}_{t}-X_{t}=\int^{t}_{0}{\big{(}}b(s,X^{\varepsilon}_{s})-b(s,X_{s}){\big{)}}{\mathord{{\rm d}}}s+\int^{t}_{0}\varepsilon b(s,X^{\varepsilon}_{s-}){\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s}.

Fix δ>0\delta>0. By applying Itô’s formula to function xlog(|x|2δ2+1)x\mapsto\log(\frac{|x|^{2}}{\delta^{2}}+1), we have

log(|Ztε|2δ2+1)\displaystyle\log\left(\frac{|Z^{\varepsilon}_{t}|^{2}}{\delta^{2}}+1\right) =20tZsε,b(s,Xsε)b(s,Xs)|Zsε|2+δ2ds+0tlog(|Zsε+εb(s,Xsε)|2+δ2|Zsε|2+δ2)d𝒩~sε\displaystyle=2\int^{t}_{0}\frac{\langle Z^{\varepsilon}_{s},b(s,X^{\varepsilon}_{s})-b(s,X_{s})\rangle}{|Z^{\varepsilon}_{s}|^{2}+\delta^{2}}{\mathord{{\rm d}}}s+\int^{t}_{0}\log\left(\frac{|Z^{\varepsilon}_{s-}+\varepsilon b(s,X^{\varepsilon}_{s-})|^{2}+\delta^{2}}{|Z^{\varepsilon}_{s-}|^{2}+\delta^{2}}\right){\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s}
+0t[log(|Zsε+εb(s,Xsε)|2+δ2|Zsε|2+δ2)2εb(s,Xsε),Zsε|Zsε|2+δ2]d(sε)\displaystyle\quad+\int^{t}_{0}\left[\log\left(\frac{|Z^{\varepsilon}_{s}+\varepsilon b(s,X^{\varepsilon}_{s})|^{2}+\delta^{2}}{|Z^{\varepsilon}_{s}|^{2}+\delta^{2}}\right)-\frac{2\varepsilon\langle b(s,X^{\varepsilon}_{s}),Z^{\varepsilon}_{s}\rangle}{|Z^{\varepsilon}_{s}|^{2}+\delta^{2}}\right]{\mathord{{\rm d}}}{\big{(}}\frac{s}{\varepsilon}{\big{)}}
=:I1(t)+I2(t)+I3(t).\displaystyle=:I_{1}(t)+I_{2}(t)+I_{3}(t).

For R>0R>0, define a stopping time

τR:=inf{t>0:|Xtε||Xt|R}.\tau_{R}:=\inf\{t>0:|X^{\varepsilon}_{t}|\vee|X_{t}|\geqslant R\}.

For I1(t)I_{1}(t), by the assumption we have

I1(tτR)20tfR(s,Xs)|Zsε|2|Zsε|2+δ2𝟙{|Xs|<R}ds20tfR(s,Xs)𝟙{|Xs|<R}ds.I_{1}(t\wedge\tau_{R})\leqslant 2\int^{t}_{0}\frac{f_{R}(s,X_{s})|Z^{\varepsilon}_{s}|^{2}}{|Z^{\varepsilon}_{s}|^{2}+\delta^{2}}{\mathbbm{1}}_{\{|X_{s}|<R\}}{\mathord{{\rm d}}}s\leqslant 2\int^{t}_{0}f_{R}(s,X_{s}){\mathbbm{1}}_{\{|X_{s}|<R\}}{\mathord{{\rm d}}}s.

For I2(t)I_{2}(t), by Doob’s maximal inequality, we have

𝔼(supt[0,T]|I2(t)|2)\displaystyle{\mathbb{E}}\left(\sup_{t\in[0,T]}|I_{2}(t)|^{2}\right) 4𝔼(0Tlog(|Zsε+εb(s,Xsε)|2+δ2|Zsε|2+δ2)d𝒩~sε)2\displaystyle\leqslant 4{\mathbb{E}}\left(\int^{T}_{0}\log\left(\frac{|Z^{\varepsilon}_{s-}+\varepsilon b(s,X^{\varepsilon}_{s-})|^{2}+\delta^{2}}{|Z^{\varepsilon}_{s-}|^{2}+\delta^{2}}\right){\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{\varepsilon}_{s}\right)^{2}
=4𝔼(0T|log(|Zsε+εb(s,Xsε)|2+δ2|Zsε|2+δ2)|2d(sε)).\displaystyle=4{\mathbb{E}}\left(\int^{T}_{0}\left|\log\left(\frac{|Z^{\varepsilon}_{s}+\varepsilon b(s,X^{\varepsilon}_{s})|^{2}+\delta^{2}}{|Z^{\varepsilon}_{s}|^{2}+\delta^{2}}\right)\right|^{2}{\mathord{{\rm d}}}{\big{(}}\frac{s}{\varepsilon}{\big{)}}\right).

Note that

|log(1+r)r|Cr2,r>12,\displaystyle|\log(1+r)-r|\leqslant Cr^{2},\ \ r>-\tfrac{1}{2}, (2.21)

and for Asε:=|Zsε+εb(s,Xsε)|2|Zsε|2|Zsε|2+δ2A^{\varepsilon}_{s}:=\frac{|Z^{\varepsilon}_{s}+\varepsilon b(s,X^{\varepsilon}_{s})|^{2}-|Z^{\varepsilon}_{s}|^{2}}{|Z^{\varepsilon}_{s}|^{2}+\delta^{2}} and δ>εb\delta>\varepsilon\|b\|_{\infty},

|Asε|2εbδ+ε2b2δ23εbδ.\displaystyle|A^{\varepsilon}_{s}|\leqslant 2\frac{\varepsilon\|b\|_{\infty}}{\delta}+\frac{\varepsilon^{2}\|b\|^{2}_{\infty}}{\delta^{2}}\leqslant 3\frac{\varepsilon\|b\|_{\infty}}{\delta}. (2.22)

In particular, we further have for δ6εb\delta\geqslant 6\varepsilon\|b\|_{\infty},

𝔼(supt[0,T]|I2(t)|2)40T𝔼|log(1+Asε)|2d(sε)0T𝔼(|Asε|2+|Asε|4)d(sε)εb2Tδ2.\displaystyle{\mathbb{E}}\left(\sup_{t\in[0,T]}|I_{2}(t)|^{2}\right)\leqslant 4\int^{T}_{0}{\mathbb{E}}\left|\log\left(1+A^{\varepsilon}_{s}\right)\right|^{2}{\mathord{{\rm d}}}{\big{(}}\frac{s}{\varepsilon}{\big{)}}\lesssim\int^{T}_{0}{\mathbb{E}}(|A^{\varepsilon}_{s}|^{2}+|A^{\varepsilon}_{s}|^{4}){\mathord{{\rm d}}}{\big{(}}\frac{s}{\varepsilon}{\big{)}}\lesssim\frac{\varepsilon\|b\|^{2}_{\infty}T}{\delta^{2}}.

Similarly, for I3(t)I_{3}(t), by (2.21) and (2.22), we have for δ6εb\delta\geqslant 6\varepsilon\|b\|_{\infty},

I3(t)\displaystyle I_{3}(t) =0t(log(1+Asε)Asε)d(sε)+ε0t|b(s,Xsε)|2|Zsε|2+δ2ds\displaystyle=\int^{t}_{0}\Big{(}\log\left(1+A^{\varepsilon}_{s}\right)-A^{\varepsilon}_{s}\Big{)}{\mathord{{\rm d}}}{\big{(}}\frac{s}{\varepsilon}{\big{)}}+\varepsilon\int^{t}_{0}\frac{|b(s,X^{\varepsilon}_{s})|^{2}}{|Z^{\varepsilon}_{s}|^{2}+\delta^{2}}{\mathord{{\rm d}}}s
0t|Asε|2d(sε)+εtb2δ2εb2tδ2.\displaystyle\lesssim\int^{t}_{0}|A^{\varepsilon}_{s}|^{2}{\mathord{{\rm d}}}{\big{(}}\frac{s}{\varepsilon}{\big{)}}+\frac{\varepsilon t\|b\|_{\infty}^{2}}{\delta^{2}}\lesssim\frac{\varepsilon\|b\|^{2}_{\infty}t}{\delta^{2}}.

Combining the above calculations, we obtain that for δ6εb\delta\geqslant 6\sqrt{\varepsilon}\|b\|_{\infty},

𝔼(supt[0,TτR]log(|Ztε|2δ2+1))\displaystyle{\mathbb{E}}\left(\sup_{t\in[0,T\wedge\tau_{R}]}\log\left(\frac{|Z^{\varepsilon}_{t}|^{2}}{\delta^{2}}+1\right)\right) 0T𝔼(fR(s,Xs)𝟙{|Xs|<R})ds+εbδ\displaystyle\lesssim\int^{T}_{0}{\mathbb{E}}\left(f_{R}(s,X_{s}){\mathbbm{1}}_{\{|X_{s}|<R\}}\right){\mathord{{\rm d}}}s+\frac{\sqrt{\varepsilon}\|b\|_{\infty}}{\delta}
=0T(BRfR(s,x)ρs(x)dx)ds+εbδ\displaystyle=\int^{T}_{0}\left(\int_{B_{R}}f_{R}(s,x)\rho_{s}(x){\mathord{{\rm d}}}x\right){\mathord{{\rm d}}}s+\frac{\sqrt{\varepsilon}\|b\|_{\infty}}{\delta}
fRLq([0,T]×BR)ρLp([0,T]×BR)+εbδ.\displaystyle\leqslant\|f_{R}\|_{L^{q}([0,T]\times B_{R})}\|\rho\|_{L^{p}([0,T]\times B_{R})}+\frac{\sqrt{\varepsilon}\|b\|_{\infty}}{\delta}.

Now for any h(0,1)h\in(0,1) and δ=6εb\delta=6\sqrt{\varepsilon}\|b\|_{\infty}, by Chebyshev’s inequality we have

(supt[0,TτR]|Ztε|>h)\displaystyle{\mathbb{P}}\left(\sup_{t\in[0,T\wedge\tau_{R}]}|Z^{\varepsilon}_{t}|>h\right) 𝔼(supt[0,TτR]log(|Ztε|2δ2+1))/log(1+(h/δ)2)\displaystyle\leqslant{\mathbb{E}}\left(\sup_{t\in[0,T\wedge\tau_{R}]}\log\left(\frac{|Z^{\varepsilon}_{t}|^{2}}{\delta^{2}}+1\right)\right)/\log(1+(h/\delta)^{2})
fRLq([0,T]×BR)ρLp([0,T]×BR)+1log(1+h2/(36εb2)).\lesssim\frac{\|f_{R}\|_{L^{q}([0,T]\times B_{R})}\|\rho\|_{L^{p}([0,T]\times B_{R})}+1}{\log(1+h^{2}/(36\varepsilon\|b\|^{2}_{\infty}))}. (2.23)

On the other hand, it is standard to show that

(τRT)𝔼(supt[0,T](|Xt|+|Xtε|)2)R2C(1+b2T2)R2,{\mathbb{P}}(\tau_{R}\leqslant T)\leqslant\frac{{\mathbb{E}}\left(\sup_{t\in[0,T]}(|X_{t}|+|X^{\varepsilon}_{t}|)^{2}\right)}{R^{2}}\leqslant\frac{C(1+\|b\|^{2}_{\infty}T^{2})}{R^{2}},

which together with (2.23) yields the desired estimate. ∎

As a consequence, we have

Corollary 2.14.

Assume that bLq(+×d)\nabla b\in L^{q}({\mathbb{R}}_{+}\times{\mathbb{R}}^{d}) for some q>dq>d and b,divbL(+×d)b,\mathord{{\rm div}}b\in L^{\infty}({\mathbb{R}}_{+}\times{\mathbb{R}}^{d}). Let p=qq1p=\frac{q}{q-1} and T>0T>0. For any X00X_{0}\in{\mathcal{F}}_{0} with density ρ0Lp(d)\rho_{0}\in L^{p}({\mathbb{R}}^{d}), there is a unique solution XtX_{t} to ODE (2.3) so that XtX_{t} admits a density ρt(x)L([0,T];Lp(d))\rho_{t}(x)\in L^{\infty}([0,T];L^{p}({\mathbb{R}}^{d})). Moreover, there is a constant CT>0C_{T}>0 such that for all ε,h(0,1)\varepsilon,h\in(0,1),

(supt[0,T]|XtεXt|h)CTbLq([0,T]×d)ρLp([0,T]×d)+1log(1+h2/(36εb2)).{\mathbb{P}}\left(\sup_{t\in[0,T]}|X^{\varepsilon}_{t}-X_{t}|\geqslant h\right)\leqslant C_{T}\frac{\|\nabla b\|_{L^{q}([0,T]\times{\mathbb{R}}^{d})}\|\rho\|_{L^{p}([0,T]\times{\mathbb{R}}^{d})}+1}{\log(1+h^{2}/(36\varepsilon\|b\|^{2}_{\infty}))}.
Proof.

Let r(d,q)r\in(d,q). By Morrey’s inequality (see [12, p143, Theorem 3]), there is a constant C=C(d,r)>0C=C(d,r)>0 such that for Lebesgue almost all x,ydx,y\in{\mathbb{R}}^{d},

|b(s,x)b(s,y)|C|xy|(1|B|xy||B|xy||b(s,y+z)|rdz)1rC|xy|(|b(s,)|r(y))1r,|b(s,x)-b(s,y)|\leqslant C|x-y|\left(\frac{1}{|B_{|x-y|}|}\int_{B_{|x-y|}}|\nabla b(s,y+z)|^{r}{\mathord{{\rm d}}}z\right)^{\frac{1}{r}}\leqslant C|x-y|({\mathcal{M}}|\nabla b(s,\cdot)|^{r}(y))^{\frac{1}{r}},

where

|b(s,)|r(y):=supt>01|Bt|Bt|b(s,y+z)|rdz.{\mathcal{M}}|\nabla b(s,\cdot)|^{r}(y):=\sup_{t>0}\frac{1}{|B_{t}|}\int_{B_{t}}|\nabla b(s,y+z)|^{r}{\mathord{{\rm d}}}z.

Hence, (2.20) holds with fR(s,y)=(|b(s,)|r(y))1/rf_{R}(s,y)=({\mathcal{M}}|\nabla b(s,\cdot)|^{r}(y))^{1/r} and by the LpL^{p}-boundedness of the maximal function (cf. [36]),

(|b|r)1/rLq([0,T]×d)CbLq([0,T]×d).\|({\mathcal{M}}|\nabla b|^{r})^{1/r}\|_{L^{q}([0,T]\times{\mathbb{R}}^{d})}\leqslant C\|\nabla b\|_{L^{q}([0,T]\times{\mathbb{R}}^{d})}.

By the DiPerna-Lions theory (see [11, Corollary II.1] and [1, 9]), for any X00X_{0}\in{\mathcal{F}}_{0} with a density ρ0Lp(d)\rho_{0}\in L^{p}({\mathbb{R}}^{d}), there is a unique solution XtX_{t} to ODE (2.3) with density ρt(x)L([0,T];Lp(d))\rho_{t}(x)\in L^{\infty}([0,T];L^{p}({\mathbb{R}}^{d})). Now letting RR\to\infty in Theorem 2.13, we obtain the desired estimate. ∎

Remark 2.15.

Corollary 2.14 provides a discretization approximation for ODEs with 𝕎1,q{\mathbb{W}}^{1,q}-coefficients. Let us consider the case where d=2d=2 and the vector field b(x)b(x) is defined as

b(x)=(x2,x1)/|x|αϕ(x),b(x)=(-x_{2},x_{1})/|x|^{\alpha}\phi(x),

where α<1\alpha<1 and ϕCc(2)\phi\in C^{\infty}_{c}(\mathbb{R}^{2}). One easily checks that divbL\mathord{{\rm div}}b\in L^{\infty} and bLq(2)\nabla b\in L^{q}(\mathbb{R}^{2}) for any q[1,2/α)q\in[1,2/\alpha). Note also that bb is Hölder continuous at the origin.
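For this vortex-type field, the compound Poisson scheme can be checked to nearly preserve the rotational invariance of the flow. The sketch below is our own illustration (taking α = 1/2 and φ ≡ 1 on the region visited by the trajectory): since b(x) is perpendicular to x, the exact flow preserves |x|, while each jump of the scheme increases |x|² only by ε²|b(x)|².

```python
import math
import random

ALPHA = 0.5  # illustrative choice, alpha < 1

def b(x):
    # b(x) = (-x2, x1)/|x|^alpha, with phi = 1 near the trajectory
    r = math.hypot(x[0], x[1])
    if r == 0.0:
        return (0.0, 0.0)
    c = r ** (-ALPHA)
    return (-x[1] * c, x[0] * c)

def poisson_flow(x0, T, eps, rng):
    t, x = 0.0, list(x0)
    while True:
        t += rng.expovariate(1.0 / eps)  # Poisson clock of rate 1/eps
        if t > T:
            return x
        bx = b(x)
        x[0] += eps * bx[0]
        x[1] += eps * bx[1]

rng = random.Random(1)
xT = poisson_flow((1.0, 0.0), 3.0, 1e-3, rng)
radius = math.hypot(xT[0], xT[1])  # the exact flow keeps this equal to 1
```

Over the horizon T = 3 with ε = 10⁻³ the roughly T/ε jumps inflate |x|² by about εT, so the radius stays within a fraction of a percent of 1.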

2.4. Particle approximation for DDODEs

In this section, we turn our attention to the study of nonlinear or distribution-dependent ODEs (DDODEs) and the corresponding interaction particle system. We establish the strong convergence of the particle approximation scheme, as well as a central limit theorem, similar to what was discussed earlier. It is important to note that our scheme is fully discretized, with the time scale chosen as ε=1/N\varepsilon=1/N. This choice allows for efficient numerical implementation and analysis of the particle system.

Let ϕ:+×d×dm\phi:{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}\times{\mathbb{R}}^{d}\to{\mathbb{R}}^{m} and F:+×d×mdF:{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}\times{\mathbb{R}}^{m}\to{\mathbb{R}}^{d} be Borel measurable functions. For a (sub)-probability measure μ\mu over d{\mathbb{R}}^{d}, we define

b(t,x,μ):=F(t,x,(ϕtμ)(x)),b(t,x,\mu):=F(t,x,(\phi_{t}\circledast\mu)(x)),

where

(ϕtμ)(x):=dϕt(x,y)μ(dy).(\phi_{t}\circledast\mu)(x):=\int_{{\mathbb{R}}^{d}}\phi_{t}(x,y)\mu({\mathord{{\rm d}}}y).

Now we consider the following DDODE:

Xt=X0+0tb(s,Xs,μXs)ds,\displaystyle X_{t}=X_{0}+\int^{t}_{0}b(s,X_{s},\mu_{X_{s}}){\mathord{{\rm d}}}s, (2.24)

where X0X_{0} is any random variable and μXs\mu_{X_{s}} stands for the distribution of XsX_{s}. Suppose that

{|F(t,x,r)F(t,x,r)|κ(|xx|+|rr|),|ϕ(t,x,y)ϕ(t,x,y)|κ(|xx|+|yy|)\displaystyle\left\{\begin{aligned} |F(t,x,r)-F(t,x^{\prime},r^{\prime})|&\leqslant\kappa(|x-x^{\prime}|+|r-r^{\prime}|),\\ |\phi(t,x,y)-\phi(t,x^{\prime},y^{\prime})|&\leqslant\kappa(|x-x^{\prime}|+|y-y^{\prime}|)\end{aligned}\right. (2.25)

and

|F(t,x,r)|+|ϕ(t,x,y)|κ.\displaystyle|F(t,x,r)|+|\phi(t,x,y)|\leqslant\kappa. (2.26)

Under the above conditions, it is well-known that DDODE (2.24) has a unique solution. In particular, μXt\mu_{X_{t}} solves the following nonlinear first order PDE in the distributional sense:

tμXt+div(b(t,,μXt)μXt)=0.\partial_{t}\mu_{X_{t}}+\mathord{{\rm div}}(b(t,\cdot,\mu_{X_{t}})\mu_{X_{t}})=0.
Remark 2.16.

If X0=xX_{0}=x is a fixed point, then μXs=δXs\mu_{X_{s}}=\delta_{X_{s}} is a Dirac measure and

b(s,Xs,μXs)=F(s,Xs,ϕs(Xs,Xs)).b(s,X_{s},\mu_{X_{s}})=F(s,X_{s},\phi_{s}(X_{s},X_{s})).

In this case, there is no interaction. Now, suppose that X0X_{0} has a density ρ0\rho_{0}, and let b(t,x,μ)=db(t,x,y)μ(dy)b(t,x,\mu)=\int_{\mathbb{R}^{d}}b(t,x,y)\mu(\mathrm{d}y). Then, XtX_{t} also has a density ρt(x)\rho_{t}(x), and in the distributional sense, we have

tρt(x)+div(ρt(x)db(t,x,y)ρt(y)dy)=0.\partial_{t}\rho_{t}(x)+\mathord{{\rm div}}\left(\rho_{t}(x)\int_{{\mathbb{R}}^{d}}b(t,x,y)\rho_{t}(y){\mathord{{\rm d}}}y\right)=0.

In particular, if we consider the case where b(t,x,y)=𝟙[0,)(xy)b(t,x,y)=-{\mathbbm{1}}_{[0,\infty)}(x-y), we obtain

tVt(x)=(Vt2(x))/2,\partial_{t}V_{t}(x)=(V^{2}_{t}(x))^{\prime}/2,

where Vt(x):=xρt(y)dyV_{t}(x):=\int_{-\infty}^{x}\rho_{t}(y){\mathord{{\rm d}}}y denotes the distribution function of XtX_{t} (here d=1d=1); this is the classical Burgers equation.
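In dimension one, the kernel b(t,x,y) = -1_{[0,∞)}(x-y) gives each particle of the associated N-particle system the velocity -(its rank)/N. A minimal Euler sketch of this rank-based system (our own illustration; the paper's scheme uses Poisson clocks rather than a deterministic time step):

```python
import random

def velocities(xs):
    # b(t, x, mu) = -mu((-inf, x]): minus the normalized rank of x
    n = len(xs)
    return [-sum(1 for y in xs if y <= x) / n for x in xs]

def euler(xs, T, dt):
    for _ in range(int(round(T / dt))):
        v = velocities(xs)
        xs = [x + dt * vi for x, vi in zip(xs, v)]
    return xs

rng = random.Random(2)
xs0 = sorted(rng.random() for _ in range(50))
xsT = euler(list(xs0), 0.5, 0.01)
# every particle drifts left with speed between 1/N and 1
moved_left = all(c < a for a, c in zip(xs0, xsT))
```

Each particle counts itself, so every velocity lies in [-1, -1/N]: all particles move left, by at most T, which is the transport-to-the-left behavior of the Burgers dynamics above.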

Now we construct the interaction particle approximation for DDODE (2.24). Let (𝒩k)k({\mathcal{N}}^{k})_{k\in{\mathbb{N}}} be a family of i.i.d. standard Poisson processes. Fix NN\in{\mathbb{N}}. For kk\in{\mathbb{N}}, define

𝒩tN,k:=𝒩Ntk,𝒩~tN,k:=𝒩NtkNt,t>0.{\mathcal{N}}^{N,k}_{t}:={\mathcal{N}}^{k}_{Nt},\ \ \widetilde{\mathcal{N}}^{N,k}_{t}:={\mathcal{N}}^{k}_{Nt}-Nt,\ \ t>0.

Let (X0i)i(X^{i}_{0})_{i\in{\mathbb{N}}} be a sequence of i.i.d. 0{\mathcal{F}}_{0}-measurable random variables with common distribution ν\nu. We consider the following interaction particle system driven by Poisson processes: for i=1,,N,i=1,\cdots,N,

XtN,i=X0i+1N0tb(s,XsN,i,μsN)d𝒩sN,i=XtN,i+1N2j=1Nb(t,XtN,i,XtN,j)Δ𝒩tN,i,X^{N,i}_{t}=X^{i}_{0}+\frac{1}{N}\int^{t}_{0}b(s,X^{N,i}_{s-},\mu^{N}_{s-}){\mathord{{\rm d}}}{\mathcal{N}}^{N,i}_{s}=X^{N,i}_{t-}+\frac{1}{N^{2}}\sum_{j=1}^{N}b(t,X^{N,i}_{t-},X^{N,j}_{t-})\Delta{\mathcal{N}}^{N,i}_{t},

where we have chosen ε=1/N\varepsilon=1/N in Poisson approximation (2.4), and

μsN:=1Nj=1NδXsN,j.\mu^{N}_{s}:=\frac{1}{N}\sum_{j=1}^{N}\delta_{X^{N,j}_{s}}.
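Since the N clocks 𝒩^{N,i} are independent Poisson processes of rate N, their superposition is a Poisson process of rate N², and the particle hit at each event is uniformly distributed. This gives a simple event-driven simulation. The sketch below is our own illustration with the drift b(t,x,μ) = ∫ sin(y-x) μ(dy) (an assumed example satisfying (2.25) and (2.26) with κ = 1), under which the particles contract toward one another while the empirical mean is a martingale.

```python
import math
import random

def drift(x, xs):
    # b(t, x, mu) = int sin(y - x) mu(dy): bounded and Lipschitz
    return sum(math.sin(y - x) for y in xs) / len(xs)

def particle_system(xs0, T, rng):
    """Each particle jumps at rate N by (1/N) b(X_i, mu^N), i.e. eps = 1/N."""
    xs = list(xs0)
    n = len(xs)
    t = 0.0
    while True:
        t += rng.expovariate(float(n * n))  # superposed clock of rate N^2
        if t > T:
            return xs
        i = rng.randrange(n)                # the clock that rang is uniform
        xs[i] += drift(xs[i], xs) / n

rng = random.Random(3)
xs0 = [rng.random() for _ in range(50)]
xsT = particle_system(xs0, 2.0, rng)
spread0, spreadT = max(xs0) - min(xs0), max(xsT) - min(xsT)
mean_shift = abs(sum(xsT) / 50 - sum(xs0) / 50)
```

By the antisymmetry of sin, the total drift of the empirical mean vanishes, so the mean only fluctuates at the martingale scale, while the spread of the cloud contracts roughly like e^{-t}.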

In order to show the convergence rate, we need the following simple lemma (see [40]).

Lemma 2.17.

Let 𝛏N:=(ξ1,,ξN)\boldsymbol{\xi}^{N}:=(\xi_{1},\cdots,\xi_{N}) be a sequence of i.i.d. d{\mathbb{R}}^{d}-valued random variables with common distribution μ\mu. Let μ𝛏N:=1Nj=1Nδξj\mu_{\boldsymbol{\xi}^{N}}:=\frac{1}{N}\sum_{j=1}^{N}\delta_{\xi_{j}} be the empirical measure of 𝛏N\boldsymbol{\xi}^{N}. Then there is a universal constant C>0C>0 such that for any nonnegative measurable function f(x,y):d×df(x,y):{\mathbb{R}}^{d}\times{\mathbb{R}}^{d}\to{\mathbb{R}} and μ¯𝒫(d)\bar{\mu}\in{\mathcal{P}}({\mathbb{R}}^{d}), and i=1,,Ni=1,\cdots,N,

𝔼|f(ξi,μ𝝃N)f(ξi,μ¯)|2Cd(f(x,μ)f(x,μ¯))2μ(dx)+1N2df(x,y)2μ(dx)μ(dy)+1Nd(df(x,y)μ¯(dy))2μ(dx)+1Ndf(x,x)2μ(dx).\displaystyle\begin{split}{\mathbb{E}}|f(\xi_{i},\mu_{\boldsymbol{\xi}^{N}})-f(\xi_{i},\bar{\mu})|^{2}&\lesssim_{C}\int_{{\mathbb{R}}^{d}}{\big{(}}f(x,\mu)-f(x,\bar{\mu}){\big{)}}^{2}\mu({\mathord{{\rm d}}}x)+\frac{1}{N}\int_{{\mathbb{R}}^{2d}}f(x,y)^{2}\mu({\mathord{{\rm d}}}x)\mu({\mathord{{\rm d}}}y)\\ &\quad+\frac{1}{N}\int_{{\mathbb{R}}^{d}}\left(\int_{{\mathbb{R}}^{d}}f(x,y)\bar{\mu}({\mathord{{\rm d}}}y)\right)^{2}\mu({\mathord{{\rm d}}}x)+\frac{1}{N}\int_{{\mathbb{R}}^{d}}f(x,x)^{2}\mu({\mathord{{\rm d}}}x).\end{split} (2.27)

In particular,

𝔼|f(ξi,μ𝝃N)f(ξi,μ)|2CN(2df(x,y)2μ(dx)μ(dy)+df(x,x)2μ(dx)).\displaystyle{\mathbb{E}}|f(\xi_{i},\mu_{\boldsymbol{\xi}^{N}})-f(\xi_{i},\mu)|^{2}\leqslant\frac{C}{N}\left(\int_{{\mathbb{R}}^{2d}}f(x,y)^{2}\mu({\mathord{{\rm d}}}x)\mu({\mathord{{\rm d}}}y)+\int_{{\mathbb{R}}^{d}}f(x,x)^{2}\mu({\mathord{{\rm d}}}x)\right). (2.28)
Proof.

By definition we have

𝔼|f(ξi,μ𝝃N)f(ξi,μ¯)|2=1N2j,k=1N𝔼[(f(ξi,ξj)f(ξi,μ¯))(f(ξi,ξk)f(ξi,μ¯))].\displaystyle{\mathbb{E}}|f(\xi_{i},\mu_{\boldsymbol{\xi}^{N}})-f(\xi_{i},\bar{\mu})|^{2}=\frac{1}{N^{2}}\sum_{j,k=1}^{N}{\mathbb{E}}\Big{[}{\big{(}}f(\xi_{i},\xi_{j})-f(\xi_{i},\bar{\mu}){\big{)}}{\big{(}}f(\xi_{i},\xi_{k})-f(\xi_{i},\bar{\mu}){\big{)}}\Big{]}.

Since for distinct indices i,j,ki,j,k, the random variables ξi,ξj,ξk\xi_{i},\xi_{j},\xi_{k} are independent with common distribution μ\mu, we have

𝔼[(f(ξi,ξj)f(ξi,μ¯))(f(ξi,ξk)f(ξi,μ¯))]\displaystyle{\mathbb{E}}\Big{[}{\big{(}}f(\xi_{i},\xi_{j})-f(\xi_{i},\bar{\mu}){\big{)}}{\big{(}}f(\xi_{i},\xi_{k})-f(\xi_{i},\bar{\mu}){\big{)}}\Big{]} =d(f(x,μ)f(x,μ¯))2μ(dx).\displaystyle=\int_{{\mathbb{R}}^{d}}{\big{(}}f(x,\mu)-f(x,\bar{\mu}){\big{)}}^{2}\mu({\mathord{{\rm d}}}x).

Thus,

𝔼|f(ξi,μ𝝃N)f(ξi,μ¯)|2\displaystyle{\mathbb{E}}|f(\xi_{i},\mu_{\boldsymbol{\xi}^{N}})-f(\xi_{i},\bar{\mu})|^{2} d(f(x,μ)f(x,μ¯))2μ(dx)+1N2j=1N𝔼[f(ξi,ξj)f(ξi,μ¯)]2\displaystyle\leqslant\int_{{\mathbb{R}}^{d}}{\big{(}}f(x,\mu)-f(x,\bar{\mu}){\big{)}}^{2}\mu({\mathord{{\rm d}}}x)+\frac{1}{N^{2}}\sum_{j=1}^{N}{\mathbb{E}}\Big{[}f(\xi_{i},\xi_{j})-f(\xi_{i},\bar{\mu})\Big{]}^{2}
+2N2j=1N𝔼[(f(ξi,ξj)f(ξi,μ¯))(f(ξi,ξi)f(ξi,μ¯))]\displaystyle\quad+\frac{2}{N^{2}}\sum_{j=1}^{N}{\mathbb{E}}\Big{[}{\big{(}}f(\xi_{i},\xi_{j})-f(\xi_{i},\bar{\mu}){\big{)}}{\big{(}}f(\xi_{i},\xi_{i})-f(\xi_{i},\bar{\mu}){\big{)}}\Big{]}
d(f(x,μ)f(x,μ¯))2μ(dx)+1N2j=1N𝔼|f(ξi,ξj)|2\displaystyle\lesssim\int_{{\mathbb{R}}^{d}}{\big{(}}f(x,\mu)-f(x,\bar{\mu}){\big{)}}^{2}\mu({\mathord{{\rm d}}}x)+\frac{1}{N^{2}}\sum_{j=1}^{N}{\mathbb{E}}|f(\xi_{i},\xi_{j})|^{2}
+1N𝔼(|f(ξi,μ¯)|2+|f(ξi,ξi)|2).\displaystyle\quad+\frac{1}{N}{\mathbb{E}}\Big{(}|f(\xi_{i},\bar{\mu})|^{2}+|f(\xi_{i},\xi_{i})|^{2}\Big{)}.

From this, we derive the desired estimate. ∎
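Estimate (2.28) is easy to probe by Monte Carlo. In the sketch below (our own check, not from the paper) we take f(x,y) = xy and μ = Uniform(0,1), so that f(x,μ) = x/2 exactly, and estimate the left-hand side of (2.28) for two sample sizes; the C/N rate predicts roughly a factor-four drop from N = 50 to N = 200.

```python
import random

def mse(N, trials, rng):
    # estimates E|f(xi_1, mu_xi^N) - f(xi_1, mu)|^2 for f(x, y) = x*y
    acc = 0.0
    for _ in range(trials):
        xs = [rng.random() for _ in range(N)]
        xi = xs[0]                    # the fixed index i = 1
        emp = xi * sum(xs) / N        # f(xi, empirical measure), diagonal included
        acc += (emp - xi * 0.5) ** 2  # f(xi, mu) = xi/2 for mu = U(0,1)
    return acc / trials

rng = random.Random(4)
e_small, e_large = mse(50, 4000, rng), mse(200, 4000, rng)
```

For this f one can even compute the constant: the error is approximately 1/(36N), and the estimated ratio e_small/e_large clusters around 4.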

Let X¯ti\bar{X}^{i}_{t} solve the following DDODE:

X¯ti=X0i+0tb(s,X¯si,μX¯si)ds,i=1,,N.\displaystyle\bar{X}^{i}_{t}=X^{i}_{0}+\int^{t}_{0}b(s,\bar{X}^{i}_{s},\mu_{\bar{X}^{i}_{s}}){\mathord{{\rm d}}}s,\ i=1,\cdots,N. (2.29)

Clearly, (X¯1,,X¯N)(\bar{X}^{1}_{\cdot},\cdots,\bar{X}^{N}_{\cdot}) are i.i.d. random processes. We present a simple result regarding the propagation of chaos, which is consistent with [40]. This result reflects the asymptotic independence of the particles as their number increases, and supports the validity and effectiveness of the approximation scheme.

Theorem 2.18.

Under (2.25) and (2.26), for any T>0T>0, there is a constant C=C(κ,T,d)>0C=C(\kappa,T,d)>0 independent of NN such that for all i=1,,Ni=1,\cdots,N,

𝔼(supt[0,T]|XtN,iX¯ti|2)CN.{\mathbb{E}}\left(\sup_{t\in[0,T]}|X^{N,i}_{t}-\bar{X}^{i}_{t}|^{2}\right)\leqslant\frac{C}{N}.
Proof.

Let μ¯tN:=1Nj=1NδX¯tj.\bar{\mu}^{N}_{t}:=\frac{1}{N}\sum_{j=1}^{N}\delta_{\bar{X}^{j}_{t}}. Note that

XtN,iX¯ti\displaystyle X^{N,i}_{t}-\bar{X}^{i}_{t} =1N0tb(s,XsN,i,μsN)d𝒩sN,i0tb(s,X¯si,μX¯si)ds\displaystyle=\frac{1}{N}\int^{t}_{0}b(s,X^{N,i}_{s-},\mu^{N}_{s-}){\mathord{{\rm d}}}{\mathcal{N}}^{N,i}_{s}-\int^{t}_{0}b(s,\bar{X}^{i}_{s},\mu_{\bar{X}^{i}_{s}}){\mathord{{\rm d}}}s
=1N0tb(s,XsN,i,μsN)d𝒩~sN,i+0t[b(s,XsN,i,μsN)b(s,X¯si,μ¯sN)]ds\displaystyle=\frac{1}{N}\int^{t}_{0}b(s,X^{N,i}_{s-},\mu^{N}_{s-}){\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{N,i}_{s}+\int^{t}_{0}\Big{[}b(s,X^{N,i}_{s},\mu^{N}_{s})-b(s,\bar{X}^{i}_{s},\bar{\mu}^{N}_{s})\Big{]}{\mathord{{\rm d}}}s
+0t[b(s,X¯si,μ¯sN)b(s,X¯si,μX¯si)]ds\displaystyle\qquad+\int^{t}_{0}\Big{[}b(s,\bar{X}^{i}_{s},\bar{\mu}^{N}_{s})-b(s,\bar{X}^{i}_{s},\mu_{\bar{X}^{i}_{s}})\Big{]}{\mathord{{\rm d}}}s
=:I1(t)+I2(t)+I3(t).\displaystyle=:I_{1}(t)+I_{2}(t)+I_{3}(t).

Below for a nonnegative function f(t)f(t), we write

f(t):=sups[0,t]f(s).f^{*}(t):=\sup_{s\in[0,t]}f(s).

For I1(t)I_{1}(t), by Doob’s maximal inequality we have

𝔼|I1(T)|2\displaystyle{\mathbb{E}}|I^{*}_{1}(T)|^{2} 1N2𝔼(supt[0,T]|0tb(s,XsN,i,ηsN)d𝒩~sN,i|2)\displaystyle\leqslant\frac{1}{N^{2}}{\mathbb{E}}\left(\sup_{t\in[0,T]}\left|\int^{t}_{0}b(s,X^{N,i}_{s-},\eta^{N}_{s-}){\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{N,i}_{s}\right|^{2}\right)
4N𝔼(0T|b(s,XsN,i,ηsN)|2ds)4b2TN.\displaystyle\leqslant\frac{4}{N}{\mathbb{E}}\left(\int^{T}_{0}|b(s,X^{N,i}_{s},\eta^{N}_{s})|^{2}{\mathord{{\rm d}}}s\right)\leqslant\frac{4\|b\|^{2}_{\infty}T}{N}.

For I2(t)I_{2}(t), by the Lipschitz assumptions (2.25), we have

𝔼|I2(t)|20t𝔼|XsN,iX¯si|2ds+0t𝔼(1Nj=1N|XsN,jX¯sj|)2ds.\displaystyle{\mathbb{E}}|I^{*}_{2}(t)|^{2}\lesssim\int^{t}_{0}{\mathbb{E}}|X^{N,i}_{s}-\bar{X}^{i}_{s}|^{2}{\mathord{{\rm d}}}s+\int^{t}_{0}{\mathbb{E}}\Big{(}\frac{1}{N}\sum_{j=1}^{N}|X^{N,j}_{s}-\bar{X}^{j}_{s}|\Big{)}^{2}{\mathord{{\rm d}}}s.

For I3(t)I_{3}(t), by (2.28) we have

𝔼|I3(T)|2CNb2.\displaystyle{\mathbb{E}}|I^{*}_{3}(T)|^{2}\leqslant\frac{C}{N}\|b\|_{\infty}^{2}.

Combining the above calculations, we obtain that for each i=1,,Ni=1,\cdots,N,

𝔼(sups[0,t]|XsN,iX¯si|2)1N+0t𝔼|XsN,iX¯si|2ds+0t𝔼(1Nj=1N|XsN,jX¯sj|)2ds.\displaystyle{\mathbb{E}}\left(\sup_{s\in[0,t]}|X^{N,i}_{s}-\bar{X}^{i}_{s}|^{2}\right)\lesssim\frac{1}{N}+\int^{t}_{0}{\mathbb{E}}|X^{N,i}_{s}-\bar{X}^{i}_{s}|^{2}{\mathord{{\rm d}}}s+\int^{t}_{0}{\mathbb{E}}\Big{(}\frac{1}{N}\sum_{j=1}^{N}|X^{N,j}_{s}-\bar{X}^{j}_{s}|\Big{)}^{2}{\mathord{{\rm d}}}s.

By Gronwall’s inequality, we get

𝔼(sups[0,t]|XsN,iX¯si|2)\displaystyle{\mathbb{E}}\left(\sup_{s\in[0,t]}|X^{N,i}_{s}-\bar{X}^{i}_{s}|^{2}\right) 1N+0t𝔼(1Nj=1N|XsN,jX¯sj|)2ds\displaystyle\lesssim\frac{1}{N}+\int^{t}_{0}{\mathbb{E}}\Big{(}\frac{1}{N}\sum_{j=1}^{N}|X^{N,j}_{s}-\bar{X}^{j}_{s}|\Big{)}^{2}{\mathord{{\rm d}}}s
1N+0t(1Nj=1NXsN,jX¯sjL2(Ω))2ds\displaystyle\lesssim\frac{1}{N}+\int^{t}_{0}\Big{(}\frac{1}{N}\sum_{j=1}^{N}\|X^{N,j}_{s}-\bar{X}^{j}_{s}\|_{L^{2}(\Omega)}\Big{)}^{2}{\mathord{{\rm d}}}s
1N+0tsupj=1,,NXsN,jX¯sjL2(Ω)2ds,\displaystyle\lesssim\frac{1}{N}+\int^{t}_{0}\sup_{j=1,\cdots,N}\|X^{N,j}_{s}-\bar{X}^{j}_{s}\|_{L^{2}(\Omega)}^{2}{\mathord{{\rm d}}}s,

where the implicit constant does not depend on ii. The desired estimate now follows by Gronwall’s inequality again. ∎
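Theorem 2.18 can be probed numerically by the synchronous coupling used in its proof: run the particle system and the limit dynamics from the same initial data and compare. In the sketch below (our own illustration) we take b(t,x,μ) = mean(μ) - x, which is Lipschitz but unbounded, so (2.26) is violated; this assumed drift is chosen only because the McKean-Vlasov limit is then explicit, X̄^i_t = 1/2 + (X^i_0 - 1/2)e^{-t} for X^i_0 ~ Uniform(0,1).

```python
import math
import random

def chaos_error(N, T, rng):
    """Mean squared distance between the Poisson particle system with
    b(t, x, mu) = mean(mu) - x and the coupled McKean-Vlasov solution."""
    x0 = [rng.random() for _ in range(N)]
    xs, s = list(x0), sum(x0)
    t = 0.0
    while True:
        t += rng.expovariate(float(N * N))  # superposition of N rate-N clocks
        if t > T:
            break
        i = rng.randrange(N)
        dx = (s / N - xs[i]) / N            # jump of size (1/N) b(X_i, mu^N)
        xs[i] += dx
        s += dx
    bar = [0.5 + (x - 0.5) * math.exp(-T) for x in x0]  # explicit limit
    return sum((a - c) ** 2 for a, c in zip(xs, bar)) / N

rng = random.Random(5)
err = chaos_error(100, 1.0, rng)  # Theorem 2.18 suggests err = O(1/N)
```

Maintaining the running sum s keeps each event O(1), so the cost is O(N²T) events overall; the observed mean squared error is of order 1/N, matching the theorem.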

Next we consider the asymptotic distribution of the following fluctuation:

ZtN:=1Ni=1N(XtN,iX¯ti).Z^{N}_{t}:=\frac{1}{\sqrt{N}}\sum_{i=1}^{N}(X^{N,i}_{t}-\bar{X}^{i}_{t}).

Note that

ZtN=1N3/2i=1N0tb[s,XsN,i,μsN]d𝒩~sN,i+1Ni=1N0t(b[s,XsN,i,μsN]b[s,X¯si,μX¯si])ds.Z^{N}_{t}=\frac{1}{N^{3/2}}\sum_{i=1}^{N}\int^{t}_{0}b[s,X^{N,i}_{s-},\mu^{N}_{s-}]{\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{N,i}_{s}+\frac{1}{\sqrt{N}}\sum_{i=1}^{N}\int^{t}_{0}\Big{(}b[s,X^{N,i}_{s},\mu^{N}_{s}]-b[s,\bar{X}^{i}_{s},\mu_{\bar{X}^{i}_{s}}]\Big{)}{\mathord{{\rm d}}}s.

Since bb is bounded, one sees that the martingale part converges to zero in L2L^{2}. Indeed,

1N3𝔼|i=1N0tb[s,XsN,i,μsN]d𝒩~sN,i|2\displaystyle\frac{1}{N^{3}}{\mathbb{E}}\left|\sum_{i=1}^{N}\int^{t}_{0}b[s,X^{N,i}_{s-},\mu^{N}_{s-}]{\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{N,i}_{s}\right|^{2} =1N2i=1N0t𝔼|b[s,XsN,i,μsN]|2dsb2tN.\displaystyle=\frac{1}{N^{2}}\sum_{i=1}^{N}\int^{t}_{0}{\mathbb{E}}|b[s,X^{N,i}_{s},\mu^{N}_{s}]|^{2}{\mathord{{\rm d}}}s\leqslant\frac{\|b\|^{2}_{\infty}t}{N}.

Therefore, it is not expected that ZtNZ^{N}_{t} converges to some non-degenerate Gaussian distribution. Moreover, let

atN:=1Ni=1N0t(b[s,XsN,i,μsN]b[s,X¯si,μX¯si])ds.a^{N}_{t}:=\frac{1}{\sqrt{N}}\sum_{i=1}^{N}\int^{t}_{0}\Big{(}b[s,X^{N,i}_{s},\mu^{N}_{s}]-b[s,\bar{X}^{i}_{s},\mu_{\bar{X}^{i}_{s}}]\Big{)}{\mathord{{\rm d}}}s.

By (2.25), (2.28) and Theorem 2.18, it is easy to see that for all t[0,T]t\in[0,T],

𝔼|atN|2\displaystyle{\mathbb{E}}|a^{N}_{t}|^{2} Ti=1N0T𝔼|b[s,XsN,i,μsN]b[s,X¯si,μX¯si]|2dsCT,\displaystyle\leqslant T\sum_{i=1}^{N}\int^{T}_{0}{\mathbb{E}}\Big{|}b[s,X^{N,i}_{s},\mu^{N}_{s}]-b[s,\bar{X}^{i}_{s},\mu_{\bar{X}^{i}_{s}}]\Big{|}^{2}{\mathord{{\rm d}}}s\leqslant C_{T},

where CTC_{T} does not depend on NN. We aim to show the following result about the fluctuation.

Theorem 2.19.

Suppose that (2.25) and (2.26) hold. Then as NN\to\infty,

N(ZtNatN)t0(Yt)t0,\sqrt{N}(Z^{N}_{t}-a^{N}_{t})_{t\geqslant 0}\Rightarrow(Y_{t})_{t\geqslant 0},

where Yt=0tb(s,Xs,μXs)dWsY_{t}=\int^{t}_{0}b{\big{(}}s,X_{s},\mu_{X_{s}}{\big{)}}{\mathord{{\rm d}}}W_{s} is a Gaussian martingale, and XtX_{t} solves the following DDODE:

Xt=X0+0tb(s,Xs,μXs)ds,\displaystyle X_{t}=X_{0}+\int^{t}_{0}b(s,X_{s},\mu_{X_{s}}){\mathord{{\rm d}}}s, (2.30)

where X0νX_{0}\sim\nu, with ν\nu the common distribution of X0N,iX^{N,i}_{0}, and WW is a one-dimensional standard Brownian motion.

Proof.

By definition it is easy to see that

YtN:=N(ZtNatN)=i=1N(XtN,iX0i0tb[s,XsN,i,μsN]ds)=i=1N0t𝒜sN,id𝒩~sN,i,\displaystyle Y^{N}_{t}:=\sqrt{N}(Z^{N}_{t}-a^{N}_{t})=\sum_{i=1}^{N}\left(X^{N,i}_{t}-X^{i}_{0}-\int^{t}_{0}b[s,X^{N,i}_{s},\mu^{N}_{s}]{\mathord{{\rm d}}}s\right)=\sum_{i=1}^{N}\int^{t}_{0}{\mathcal{A}}^{N,i}_{s}{\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{N,i}_{s}, (2.31)

where

𝒜sN,i:=b[s,XsN,i,μsN]/N.{\mathcal{A}}^{N,i}_{s}:=b[s,X^{N,i}_{s-},\mu^{N}_{s-}]/N.

For any stopping time τ\tau and δ>0\delta>0, by Doob’s maximal inequality we have

𝔼(supt[0,δ]|Yτ+tNYτN|2)\displaystyle{\mathbb{E}}\left(\sup_{t\in[0,\delta]}|Y^{N}_{\tau+t}-Y^{N}_{\tau}|^{2}\right) 4Ni=1N𝔼(ττ+δ|𝒜sN,i|2ds)4b2δ.\displaystyle\leqslant 4N\sum_{i=1}^{N}{\mathbb{E}}\left(\int^{\tau+\delta}_{\tau}|{\mathcal{A}}^{N,i}_{s}|^{2}{\mathord{{\rm d}}}s\right)\leqslant 4\|b\|_{\infty}^{2}\delta.

To prove the result, we consider an auxiliary process X~t\widetilde{X}_{t}, which satisfies (2.30) with starting point X~0\widetilde{X}_{0} independent of X0iX^{i}_{0}. Clearly, we also have

supt[0,δ]|X~τ+tX~τ|bδ.\sup_{t\in[0,\delta]}|\widetilde{X}_{\tau+t}-\widetilde{X}_{\tau}|\leqslant\|b\|_{\infty}\delta.

Thus by Aldous’ criterion (see [23, p356, Theorem 4.5]), the laws N{\mathbb{P}}_{N} of (X~,YN)(\widetilde{X}_{\cdot},Y^{N}_{\cdot}), NN\in{\mathbb{N}}, form a tight family in 𝔻(2d){\mathbb{D}}({\mathbb{R}}^{2d}). Without loss of generality, we assume that N{\mathbb{P}}_{N} weakly converges to {\mathbb{P}}_{\infty}. We show that {\mathbb{P}}_{\infty} is a martingale solution, starting from νδ0\nu\otimes\delta_{0} at time 0, of the following second order operator:

sf(x,y)=b[s,x,μs]xf(x,y)+12tr((bb)[s,x,μs]y2f(x,y)).{\mathscr{L}}_{s}f(x,y)=b[s,x,\mu_{s}]\cdot\nabla_{x}f(x,y)+\tfrac{1}{2}\mathrm{tr}{\big{(}}(b\otimes b)[s,x,\mu_{s}]\cdot\nabla^{2}_{y}f(x,y){\big{)}}.

For fCb2(2d)f\in C^{2}_{b}({\mathbb{R}}^{2d}), we need to show that for wt=(xt,yt)𝔻(2d)w_{t}=(x_{t},y_{t})\in{\mathbb{D}}({\mathbb{R}}^{2d}),

f(wt)f(w0)0tsf(ws)ds,f(w_{t})-f(w_{0})-\int^{t}_{0}{\mathscr{L}}_{s}f(w_{s}){\mathord{{\rm d}}}s,

is a {\mathbb{P}}_{\infty}-martingale. On one hand, let

:={i=1maihi(x)gi(y),hi,giCb2(d),ai,m}.{\mathscr{E}}:=\left\{\sum_{i=1}^{m}a_{i}h_{i}(x)g_{i}(y),\ h_{i},g_{i}\in C^{2}_{b}({\mathbb{R}}^{d}),a_{i}\in{\mathbb{R}},m\in{\mathbb{N}}\right\}.

Since {\mathscr{E}} is dense in Cb2(2d)C^{2}_{b}({\mathbb{R}}^{2d}), it suffices to consider f(x,y)=h(x)g(y)f(x,y)=h(x)g(y), where h,gCb2(d)h,g\in C^{2}_{b}({\mathbb{R}}^{d}). On the other hand, since X~\widetilde{X} solves ODE (2.30), we have

h(X~t)=h(X~0)+0tb[s,X~s,μs]h(X~s)ds.h(\widetilde{X}_{t})=h(\widetilde{X}_{0})+\int^{t}_{0}b[s,\widetilde{X}_{s},\mu_{s}]\cdot\nabla h(\widetilde{X}_{s}){\mathord{{\rm d}}}s.

Therefore, we only need to consider f(x,y)=g(y)f(x,y)=g(y). By (2.31) and Itô’s formula, we have

g(YtN)=g(0)+0t𝒜sNg(YsN)ds+MtN,g(Y^{N}_{t})=g(0)+\int^{t}_{0}{\mathscr{A}}^{N}_{s}g(Y^{N}_{s}){\mathord{{\rm d}}}s+M^{N}_{t},

where MtN:=i=1N0t(g(YsN+𝒜sN,i)g(YsN))d𝒩~sN,iM^{N}_{t}:=\sum_{i=1}^{N}\int^{t}_{0}\Big{(}g(Y^{N}_{s-}+{\mathcal{A}}^{N,i}_{s})-g(Y^{N}_{s-})\Big{)}{\mathord{{\rm d}}}\widetilde{\mathcal{N}}^{N,i}_{s} is a martingale, and

𝒜sNg(y):=Ni=1N(g(y+𝒜¯sN,i)g(y)𝒜¯sN,ig(y)),{\mathscr{A}}^{N}_{s}g(y):=N\sum_{i=1}^{N}\Big{(}g(y+\bar{\mathcal{A}}^{N,i}_{s})-g(y)-\bar{\mathcal{A}}^{N,i}_{s}\cdot\nabla g(y)\Big{)},

and

𝒜¯sN,i:=b[s,XsN,i,μsN]/N.\bar{\mathcal{A}}^{N,i}_{s}:=b[s,X^{N,i}_{s},\mu^{N}_{s}]/N.

Below, for simplicity of notations, we write

Bs(x,y):=(bb)(s,x,y),𝒜sg(y):=12tr(Bs[X~s,μs]2g(y)).B_{s}(x,y):=(b\otimes b)(s,x,y),\ \ {\mathscr{A}}_{s}g(y):=\tfrac{1}{2}\mathrm{tr}{\big{(}}B_{s}[\widetilde{X}_{s},\mu_{s}]\cdot\nabla^{2}g(y){\big{)}}.

By Theorem 6.4, it suffices to show

limN𝔼0T|𝔼𝒜sNg(YsN)𝔼𝒜sg(YsN)|ds=0.\lim_{N\to\infty}{\mathbb{E}}\int^{T}_{0}\Big{|}{\mathbb{E}}{\mathscr{A}}^{N}_{s}g(Y^{N}_{s})-{\mathbb{E}}{\mathscr{A}}_{s}g(Y^{N}_{s})\Big{|}{\mathord{{\rm d}}}s=0.

Observe that by Taylor’s expansion,

𝒜sNg(y)=1Ni=1Ntr(Bs[XsN,i,μsN]01θ01y2g(y+θθ𝒜¯sN,i)dθdθ).{\mathscr{A}}^{N}_{s}g(y)=\frac{1}{N}\sum_{i=1}^{N}\mathrm{tr}\left(B_{s}[X^{N,i}_{s},\mu^{N}_{s}]\cdot\int^{1}_{0}\theta\int^{1}_{0}\nabla^{2}_{y}g(y+\theta\theta^{\prime}\bar{\mathcal{A}}^{N,i}_{s}){\mathord{{\rm d}}}\theta{\mathord{{\rm d}}}\theta^{\prime}\right).

Let

𝒜¯sNg(y):=12Ni=1Ntr(Bs[X¯si,μs]2g(y))=12tr(Bs[μ𝐗¯sN,μs]2g(y)).\bar{\mathscr{A}}^{N}_{s}g(y):=\frac{1}{2N}\sum_{i=1}^{N}\mathrm{tr}\big{(}B_{s}[\bar{X}^{i}_{s},\mu_{s}]\cdot\nabla^{2}g(y)\big{)}=\tfrac{1}{2}\mathrm{tr}{\big{(}}B_{s}[\mu_{\bar{\mathbf{X}}^{N}_{s}},\mu_{s}]\cdot\nabla^{2}g(y){\big{)}}.

Then

𝒜sNg(y)𝒜sg(y)=𝒜sNg(y)𝒜¯sNg(y)+𝒜¯sNg(y)𝒜sg(y).{\mathscr{A}}^{N}_{s}g(y)-{\mathscr{A}}_{s}g(y)={\mathscr{A}}^{N}_{s}g(y)-\bar{\mathscr{A}}^{N}_{s}g(y)+\bar{\mathscr{A}}^{N}_{s}g(y)-{\mathscr{A}}_{s}g(y).

By Theorem 2.18, it is easy to see that

sups[0,T]𝔼𝒜sNg𝒜¯sNg2C/N.\sup_{s\in[0,T]}{\mathbb{E}}\|{\mathscr{A}}^{N}_{s}g-\bar{\mathscr{A}}^{N}_{s}g\|^{2}_{\infty}\leqslant C/N.

Moreover, since $\mu_{\bar{X}^{i}_{s}}=\mu_{s}$, by (2.28), we also have

\left|\mathbb{E}\int^{t}_{0}\big(\bar{\mathscr{A}}^{N}_{s}g(Y^{N}_{s})-\mathscr{A}_{s}g(Y^{N}_{s})\big)\,\mathrm{d}s\right|^{2}\leqslant t\int^{t}_{0}\left|\mathbb{E}\bar{\mathscr{A}}^{N}_{s}g(Y^{N}_{s})-\mathbb{E}\mathscr{A}_{s}g(Y^{N}_{s})\right|^{2}\mathrm{d}s
\leqslant\frac{t}{2}\int^{t}_{0}\left|\mathbb{E}\,\mathrm{tr}\big((B_{s}[\mu_{\bar{\mathbf{X}}^{N}_{s}},\mu_{s}]-\mathbb{E}B_{s}[\widetilde{X}_{s},\mu_{s}])\cdot\nabla^{2}g(Y^{N}_{s})\big)\right|^{2}\mathrm{d}s
\leqslant\frac{t}{2}\|\nabla^{2}g\|^{2}_{\infty}\int^{t}_{0}\mathbb{E}\left|B_{s}[\mu_{\bar{\mathbf{X}}^{N}_{s}},\mu_{s}]-\mathbb{E}B_{s}[\widetilde{X}_{s},\mu_{s}]\right|^{2}\mathrm{d}s\leqslant\frac{C}{N}.

Hence,

\mathbb{E}\left|\int^{t}_{0}(\mathscr{L}^{N}_{s}f-\mathscr{L}_{s}f)(\widetilde{X}_{s},Y^{N}_{s})\,\mathrm{d}s\right|^{2}\leqslant\frac{C}{N}.

Thus, by Theorem 6.4 in the appendix, we get $\mathbb{P}_{\infty}\in\mathcal{M}^{\nu\otimes\delta_{0}}_{0}(\mathscr{L})$ and conclude the proof. ∎

Remark 2.20.

By the above theorem, one sees that $(\sqrt{N}(Z^{N}_{t}-a^{N}_{t}))_{t\in[0,T]}$ converges weakly to a Gaussian martingale with covariance matrix $\int^{t}_{0}(b\otimes b)(s,X_{s},\mu_{s})\,\mathrm{d}s$.

3. Compound Poisson approximation for SDEs

The main objective of this section is to introduce a unified compound Poisson approximation for SDEs driven by either Brownian motions or $\alpha$-stable processes. This is accomplished by selecting different scaling parameters. We establish the convergence of the approximating SDEs under relatively mild assumptions, as demonstrated in Theorem 3.16. Furthermore, under more restrictive assumptions, we derive the convergence rate in Theorem 3.19. Additionally, we obtain the convergence of the invariant measures under dissipativity assumptions, as presented in Theorem 3.17. The convergence of the generators plays a pivotal role in our proofs. In essence, our results can be interpreted as a form of nonlinear central limit theorem. In the subsequent section, we will apply this framework to nonlinear partial differential equations (PDEs), with a specific focus on the 2D Navier-Stokes equations on the torus.

Let $(\xi_{n})_{n\in\mathbb{N}}$ be a sequence of i.i.d. $\mathbb{R}^{d}$-valued symmetric random variables with common distribution $\nu\in\mathcal{P}(\mathbb{R}^{d})$, and set $\xi_{0}=0$. For $\varepsilon>0$, we define a compound Poisson process $H^{\varepsilon}$ by

H^{\varepsilon}_{t}:=\sum_{n\leqslant\mathcal{N}^{\varepsilon}_{t}}\xi_{n},\quad t\geqslant 0, (3.1)

where $\mathcal{N}^{\varepsilon}_{t}$ is the Poisson process with intensity $1/\varepsilon$ (see (2.1)). Let $\mathcal{H}^{\varepsilon}$ be the associated Poisson random measure, i.e., for $t>0$ and $E\in\mathscr{B}(\mathbb{R}^{d})$,

\mathcal{H}^{\varepsilon}([0,t],E):=\sum_{s\leqslant t}\mathbbm{1}_{E}(\Delta H^{\varepsilon}_{s})=\sum_{n\leqslant\mathcal{N}^{\varepsilon}_{t}}\mathbbm{1}_{E}(\xi_{n}), (3.2)

where $\Delta H^{\varepsilon}_{s}:=H^{\varepsilon}_{s}-H^{\varepsilon}_{s-}$. More precisely, for a function $f(s,z):\mathbb{R}_{+}\times\mathbb{R}^{d}\to\mathbb{R}$,

\int^{t}_{0}f(s,z)\,\mathcal{H}^{\varepsilon}(\mathrm{d}s,\mathrm{d}z):=\sum_{s\leqslant t}f(s,\Delta H^{\varepsilon}_{s})\mathbbm{1}_{\{\Delta\mathcal{N}^{\varepsilon}_{s}=1\}}=\sum_{n\leqslant\mathcal{N}^{\varepsilon}_{t}}f(S^{\varepsilon}_{n},\xi_{n}), (3.3)

where $S^{\varepsilon}_{n}=\varepsilon S_{n}$ is the $n$-th jump time of $\mathcal{N}^{\varepsilon}_{t}$. Note that the compensator of $\mathcal{H}^{\varepsilon}$ is given by $\mathrm{d}t\,\nu(\mathrm{d}z)/\varepsilon$. We also write

\widetilde{\mathcal{H}}^{\varepsilon}([0,t],E):=\mathcal{H}^{\varepsilon}([0,t],E)-t\nu(E)/\varepsilon,

which is called the compensated Poisson random measure of $\mathcal{H}^{\varepsilon}$.
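The construction (3.1)-(3.3) is straightforward to simulate: jump times arrive at rate $1/\varepsilon$ and jump sizes are drawn i.i.d. from $\nu$. The following sketch generates one path of $H^{\varepsilon}$ on $[0,t]$; the function name `compound_poisson_path` and the Gaussian choice of $\nu$ are illustrative assumptions, not taken from the text.

```python
import numpy as np

def compound_poisson_path(t, eps, sample_xi, rng):
    """One path of H^eps on [0, t]: jump times S^eps_n = eps*(T_1+...+T_n)
    with T_i ~ Exp(1), and jump sizes xi_n drawn i.i.d. from nu via sample_xi."""
    times, jumps = [], []
    s = 0.0
    while True:
        s += eps * rng.exponential(1.0)  # next jump time of N^eps
        if s > t:
            break
        times.append(s)
        jumps.append(sample_xi(rng))
    H = np.cumsum(np.asarray(jumps), axis=0)  # H^eps evaluated at the jump times
    return np.asarray(times), np.asarray(jumps), H

rng = np.random.default_rng(0)
# illustrative choice: nu = standard Gaussian on R (symmetric, all moments finite)
times, jumps, H = compound_poisson_path(
    t=1.0, eps=0.01, sample_xi=lambda r: r.normal(), rng=rng)
```

With $t/\varepsilon=100$ jumps on average, $H^{\varepsilon}_{t}$ is the last entry of `H` (or $0$ if no jump occurs before $t$).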

Fix $\alpha>0$. We make the following assumptions on the probability measure $\nu$ above:

  1. (H$^{\alpha}_{\nu}$)

    $\nu$ is symmetric, i.e., $\nu(-\mathrm{d}z)=\nu(\mathrm{d}z)$. If $\alpha\geqslant 2$, we suppose that

    \nu(|z|^{\alpha}):=\int_{\mathbb{R}^{d}}|z|^{\alpha}\nu(\mathrm{d}z)<\infty.

    If $\alpha\in(0,2)$, we suppose that

    \sup_{\lambda\geqslant 1}\left[\lambda^{\alpha-2}\int_{|z|\leqslant\lambda}|z|^{2}\nu(\mathrm{d}z)+\lambda^{\alpha}\int_{|z|>\lambda}\nu(\mathrm{d}z)\right]<\infty, (3.4)

    and there is a Lévy measure $\nu_{0}$ and constants $\beta_{0}\in[0,1]$, $\beta_{1},c_{1},c_{2}>0$ such that for any measurable function $G:\mathbb{R}^{d}\to\mathbb{R}$ satisfying

    |G(z)|\leqslant c_{1}(|z|^{2}\wedge 1),\quad |G(z)-G(z^{\prime})|\leqslant c_{1}(|z-z^{\prime}|\wedge 1)^{\beta_{0}}, (3.5)

    it holds that

    \left|\int_{\mathbb{R}^{d}}G(z)\nu_{\varepsilon}(\mathrm{d}z)-\int_{\mathbb{R}^{d}}G(z)\nu_{0}(\mathrm{d}z)\right|\leqslant c_{1}c_{2}\varepsilon^{\beta_{1}},\quad\forall\varepsilon\in(0,1), (3.6)

    where

    \nu_{\varepsilon}(\mathrm{d}z):=\nu(\mathrm{d}z/\varepsilon^{1/\alpha})/\varepsilon. (3.7)
Remark 3.1.

If $\beta_{0}=0$ in (3.5), then (3.6) means that

\int_{\mathbb{R}^{d}}(|z|^{2}\wedge 1)\,|\nu_{\varepsilon}-\nu_{0}|(\mathrm{d}z)\leqslant c_{1}c_{2}\varepsilon^{\beta_{1}},

where $|\nu_{\varepsilon}-\nu_{0}|(\mathrm{d}z)$ stands for the total variation measure. Examples 1 and 2 below correspond to $\beta_{0}=0$ and $\beta_{1}=\frac{2}{\alpha}-1$. For $\beta_{0}>0$, condition (3.6) is used in Example 3 below.

In the following we provide several examples for $\alpha\in(0,2)$ to illustrate the above assumptions.

Example 1. Let $\nu(\mathrm{d}z)=c_{0}\mathbbm{1}_{\mathcal{C}\cap B^{c}_{1}}(z)|z|^{-d-\alpha}\mathrm{d}z$ with $\alpha\in(0,2)$, where $\mathcal{C}$ is a cone with vertex $0$ and $c_{0}$ is a normalizing constant so that $\nu(\mathbb{R}^{d})=1$. It is easy to see that (H$^{\alpha}_{\nu}$) holds with $\nu_{0}(\mathrm{d}z)=c_{0}\mathbbm{1}_{\mathcal{C}}(z)|z|^{-d-\alpha}\mathrm{d}z$ and $\beta_{0}=0$, $\beta_{1}=\frac{2}{\alpha}-1$. In this case $(\nu_{0}-\nu_{\varepsilon})(\mathrm{d}z)=c_{0}\mathbbm{1}_{\mathcal{C}\cap B_{\varepsilon^{1/\alpha}}}(z)|z|^{-d-\alpha}\mathrm{d}z$. In particular, if $\mathcal{C}=\mathbb{R}^{d}$, then up to a constant, $\nu_{0}$ is just the Lévy measure of a rotationally invariant symmetric $\alpha$-stable process.

Example 2. Let $\nu(\mathrm{d}z)=c_{0}\sum_{i=1}^{d}\mathbbm{1}_{|z_{i}|>1}|z_{i}|^{-1-\alpha}\mathrm{d}z_{i}\,\delta_{\{0\}}(\mathrm{d}z^{*}_{i})$ with $\alpha\in(0,2)$, where $c_{0}$ is a constant so that $\nu(\mathbb{R}^{d})=1$ and $z^{*}_{i}$ denotes the remaining variables except $z_{i}$. It is easy to see that (H$^{\alpha}_{\nu}$) holds with $\nu_{0}(\mathrm{d}z)=c_{0}\sum_{i=1}^{d}|z_{i}|^{-1-\alpha}\mathrm{d}z_{i}\,\delta_{\{0\}}(\mathrm{d}z^{*}_{i})$ and $\beta_{0}=0$, $\beta_{1}=\frac{2}{\alpha}-1$. In this case, $\nu_{0}$ is a cylindrical Lévy measure.

Example 3. Let $\nu(\mathrm{d}z)=c_{0}\sum_{k\in\mathbb{Z}\setminus\{0\}}|k|^{-1-\alpha}\delta_{k}(\mathrm{d}z)$ with $\alpha\in(0,2)$, where $c_{0}$ is a constant so that $\nu(\mathbb{R})=1$. First of all, it is easy to see that (3.4) holds. We now verify that (3.6) holds with $\nu_{0}(\mathrm{d}z)=c_{0}|z|^{-1-\alpha}\mathrm{d}z$ for $\beta_{0}\in(0,1]$ and $\beta_{1}<(1-\frac{\alpha}{2})\beta_{0}$. Note that

\int_{\mathbb{R}}G(z)\nu_{\varepsilon}(\mathrm{d}z)=c_{0}\sum_{k\in\mathbb{Z}\setminus\{0\}}\frac{G(k\varepsilon^{\frac{1}{\alpha}})}{\varepsilon|k|^{1+\alpha}}=c_{0}\int_{\mathbb{R}}\frac{G(z_{\varepsilon})}{|z_{\varepsilon}|^{1+\alpha}}\,\mathrm{d}z,

where $z_{\varepsilon}=\mathrm{sgn}(z)[|z|\varepsilon^{-\frac{1}{\alpha}}]\varepsilon^{\frac{1}{\alpha}}$, and $[a]$ denotes the integer part of a real number $a>0$. Here we have used the convention $\frac{0}{0}=0$. Thus,

\frac{1}{c_{0}}\left|\int_{\mathbb{R}}G(z)\nu_{\varepsilon}(\mathrm{d}z)-\int_{\mathbb{R}}G(z)\nu_{0}(\mathrm{d}z)\right|\leqslant\int_{|z|<2\varepsilon^{\frac{1}{\alpha}}}\left|\frac{G(z_{\varepsilon})}{|z_{\varepsilon}|^{1+\alpha}}-\frac{G(z)}{|z|^{1+\alpha}}\right|\mathrm{d}z
+\int_{|z|\geqslant 2\varepsilon^{\frac{1}{\alpha}}}|G(z_{\varepsilon})|\left|\frac{1}{|z_{\varepsilon}|^{1+\alpha}}-\frac{1}{|z|^{1+\alpha}}\right|\mathrm{d}z
+\int_{|z|\geqslant 2\varepsilon^{\frac{1}{\alpha}}}\frac{|G(z_{\varepsilon})-G(z)|}{|z|^{1+\alpha}}\,\mathrm{d}z
=:I_{1}+I_{2}+I_{3}.

For $I_{1}$, by (3.5) we clearly have

I_{1}\leqslant c_{1}\int_{|z|<2\varepsilon^{\frac{1}{\alpha}}}\Big(|z_{\varepsilon}|^{1-\alpha}+\frac{|z|^{2}}{|z|^{1+\alpha}}\Big)\mathrm{d}z\lesssim c_{1}\varepsilon^{\frac{2}{\alpha}-1}.

Since $|z_{\varepsilon}-z|\leqslant\varepsilon^{\frac{1}{\alpha}}$, we have for $|z|\geqslant 2\varepsilon^{\frac{1}{\alpha}}$,

|z|/2\leqslant|z_{\varepsilon}|\leqslant 2|z|,

and

\left|\frac{1}{|z_{\varepsilon}|^{1+\alpha}}-\frac{1}{|z|^{1+\alpha}}\right|\lesssim\frac{\varepsilon^{\frac{1}{\alpha}}}{|z|^{2+\alpha}}.

Hence,

I_{2}\lesssim c_{1}\varepsilon^{\frac{1}{\alpha}}\int_{|z|\geqslant 2\varepsilon^{\frac{1}{\alpha}}}\frac{|z|^{2}\wedge 1}{|z|^{2+\alpha}}\,\mathrm{d}z\lesssim\begin{cases}c_{1}\varepsilon^{\frac{2}{\alpha}-1},&\alpha\in(1,2),\\ c_{1}\varepsilon^{\frac{1}{\alpha}}|\log\varepsilon|,&\alpha=1,\\ c_{1}\varepsilon^{\frac{1}{\alpha}},&\alpha\in(0,1).\end{cases}

For $I_{3}$, noting that by (3.5),

|G(z_{\varepsilon})-G(z)|\leqslant c_{1}(|z|^{\alpha}\wedge 1)\,\varepsilon^{(1-\frac{\alpha}{2})\beta_{0}},

we have

I_{3}\lesssim c_{1}\varepsilon^{(1-\frac{\alpha}{2})\beta_{0}}\int_{|z|\geqslant 2\varepsilon^{\frac{1}{\alpha}}}\frac{|z|^{\alpha}\wedge 1}{|z|^{1+\alpha}}\,\mathrm{d}z\lesssim c_{1}\varepsilon^{(1-\frac{\alpha}{2})\beta_{0}}|\log\varepsilon|.

Combining the above calculations, we obtain (3.6) for any $\beta_{0}\in(0,1]$ and $\beta_{1}<(1-\frac{\alpha}{2})\beta_{0}$.

Remark 3.2.

For the above examples, one sees that for $\alpha\in(0,2)$,

\int_{\mathbb{R}^{d}}|z|^{\alpha}\nu(\mathrm{d}z)=\infty,\quad\int_{\mathbb{R}^{d}}|z|^{\beta}\nu(\mathrm{d}z)<\infty,\quad\beta\in[0,\alpha).

The following lemma is useful.

Lemma 3.3.

Under (H$^{\alpha}_{\nu}$), for $\alpha\in(0,2)$ and $\beta\in[0,\alpha)$, we have

\sup_{\lambda\geqslant 1,\,\varepsilon\in(0,1]}\left[\lambda^{\alpha-2}\int_{|z|\leqslant\lambda}|z|^{2}\nu_{\varepsilon}(\mathrm{d}z)+\lambda^{\alpha-\beta}\int_{|z|>\lambda}|z|^{\beta}\nu_{\varepsilon}(\mathrm{d}z)\right]<\infty, (3.8)

where $\nu_{\varepsilon}(\mathrm{d}z):=\nu(\mathrm{d}z/\varepsilon^{1/\alpha})/\varepsilon$.

Proof.

First of all, by (H$^{\alpha}_{\nu}$) we have

\int_{|z|>\lambda}|z|^{\beta}\nu(\mathrm{d}z)=\sum_{k=0}^{\infty}\int_{2^{k}\lambda\leqslant|z|<2^{k+1}\lambda}|z|^{\beta}\nu(\mathrm{d}z)\leqslant\sum_{k=0}^{\infty}2^{(k+1)\beta}\lambda^{\beta}\int_{2^{k}\lambda\leqslant|z|<2^{k+1}\lambda}\nu(\mathrm{d}z)
\leqslant\sum_{k=0}^{\infty}2^{(k+1)\beta}\lambda^{\beta}\,2^{-k\alpha}\lambda^{-\alpha}\leqslant C\lambda^{\beta-\alpha}.

The desired estimate follows by a change of variables. ∎

Now, we introduce a general approximating scheme for SDEs driven by either Brownian motions or $\alpha$-stable processes. Let $\sigma_{\varepsilon}(t,x,z):\mathbb{R}_{+}\times\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R}^{d}$ and $b_{\varepsilon}(t,x):\mathbb{R}_{+}\times\mathbb{R}^{d}\to\mathbb{R}^{d}$, where $\varepsilon\in(0,1]$, be two families of Borel measurable functions. Suppose that

\sigma_{\varepsilon}(t,x,-z)=-\sigma_{\varepsilon}(t,x,z).

Note that the above assumption implies that

\sigma_{\varepsilon}(t,x,0)\equiv 0.

Consider the following SDE driven by the compound Poisson process $\mathcal{H}^{\varepsilon}$:

\begin{split}X^{\varepsilon}_{t}&=X^{\varepsilon}_{0}+\int^{t}_{0}\int_{\mathbb{R}^{d}}\big(\sigma_{\varepsilon}(s,X^{\varepsilon}_{s-},z)+b_{\varepsilon}(s,X^{\varepsilon}_{s-})\big)\,\mathcal{H}^{\varepsilon}(\mathrm{d}s,\mathrm{d}z)\\
&=X^{\varepsilon}_{0}+\int^{t}_{0}\int_{\mathbb{R}^{d}}\sigma_{\varepsilon}(s,X^{\varepsilon}_{s-},z)\,\mathcal{H}^{\varepsilon}(\mathrm{d}s,\mathrm{d}z)+\int^{t}_{0}b_{\varepsilon}(s,X^{\varepsilon}_{s-})\,\mathrm{d}\mathcal{N}^{\varepsilon}_{s}\\
&=X^{\varepsilon}_{0}+\sum_{s\leqslant t}\big(\sigma_{\varepsilon}(s,X^{\varepsilon}_{s-},\Delta H^{\varepsilon}_{s})+b_{\varepsilon}(s,X^{\varepsilon}_{s-})\Delta\mathcal{N}^{\varepsilon}_{s}\big).\end{split} (3.9)

Note that $H^{\varepsilon}_{s}$ and $\mathcal{N}^{\varepsilon}_{s}$ jump simultaneously, that is, $\Delta H^{\varepsilon}_{s}\not=0$ if and only if $\Delta\mathcal{N}^{\varepsilon}_{s}=1$. In particular,

X^{\varepsilon}_{t}-X^{\varepsilon}_{t-}=\sigma_{\varepsilon}(t,X^{\varepsilon}_{t-},\Delta H^{\varepsilon}_{t})+b_{\varepsilon}(t,X^{\varepsilon}_{t-})\Delta\mathcal{N}^{\varepsilon}_{t}.

Moreover, by the symmetry of $\nu$ and $\sigma_{\varepsilon}(t,x,-z)=-\sigma_{\varepsilon}(t,x,z)$,

\int_{\mathbb{R}^{d}}\sigma_{\varepsilon}(s,X^{\varepsilon}_{s-},z)\,\nu(\mathrm{d}z)=0, (3.10)

and we can thus write SDE (3.9) in the following form:

X^{\varepsilon}_{t}=X^{\varepsilon}_{0}+\int^{t}_{0}b_{\varepsilon}(s,X^{\varepsilon}_{s})\,\mathrm{d}\big(\tfrac{s}{\varepsilon}\big)+\int^{t}_{0}\int_{\mathbb{R}^{d}}\big(\sigma_{\varepsilon}(s,X^{\varepsilon}_{s-},z)+b_{\varepsilon}(s,X^{\varepsilon}_{s-})\big)\,\widetilde{\mathcal{H}}^{\varepsilon}(\mathrm{d}s,\mathrm{d}z), (3.11)

where the last term is the stochastic integral with respect to the compensated Poisson random measure $\widetilde{\mathcal{H}}^{\varepsilon}$, which is a càdlàg local martingale.

Without any conditions on $\sigma_{\varepsilon}$ and $b_{\varepsilon}$, SDE (3.9) is always solvable, since the summation in (3.9) contains only finitely many terms and can be computed recursively. In fact, we have the following explicit construction of the solution of SDE (3.9).

Lemma 3.4.

Let $\Gamma^{\varepsilon}_{0}\equiv X^{\varepsilon}_{0}$. For $n=0,1,2,\cdots$, we define $\Gamma^{\varepsilon}_{n}$ recursively by

\Gamma^{\varepsilon}_{n+1}:=\Gamma^{\varepsilon}_{n}+\sigma_{\varepsilon}(S^{\varepsilon}_{n+1},\Gamma^{\varepsilon}_{n},\xi_{n+1})+b_{\varepsilon}(S^{\varepsilon}_{n+1},\Gamma^{\varepsilon}_{n}),

where $S^{\varepsilon}_{n}=\varepsilon S_{n}$. Then $(\Gamma^{\varepsilon}_{n})_{n\in\mathbb{N}}$ is a Markov chain, and for any $t\geqslant 0$,

X^{\varepsilon}_{t}=\Gamma^{\varepsilon}_{\mathcal{N}^{\varepsilon}_{t}}.
Proof.

This follows directly from definitions (3.9) and (3.3). ∎

Based on the above lemma, we have the following algorithm.

  1. (1)

    Fix a step size $\varepsilon\in(0,1)$ and an iteration number $N$.

  2. (2)

    Initialize $S^{\varepsilon}_{0}=0$ and $\Gamma^{\varepsilon}_{0}=X^{\varepsilon}_{0}$. Let $\nu\in\mathcal{P}(\mathbb{R}^{d})$ satisfy (H$^{\alpha}_{\nu}$).

  3. (3)

    Generate $N$ i.i.d. random variables $(T_{n})\sim{\rm Exp}(1)$ and $(\xi_{n})\sim\nu$.

  4. (4)

    For $n=0$ to $N-1$:
    $S^{\varepsilon}_{n+1}=S^{\varepsilon}_{n}+\varepsilon T_{n+1}$; $\Gamma^{\varepsilon}_{n+1}=\Gamma^{\varepsilon}_{n}+\sigma_{\varepsilon}(S^{\varepsilon}_{n+1},\Gamma^{\varepsilon}_{n},\xi_{n+1})+b_{\varepsilon}(S^{\varepsilon}_{n+1},\Gamma^{\varepsilon}_{n})$.

  5. (5)

    For given $t>0$, let $\mathcal{N}^{\varepsilon}_{t}:=\max\{n:S^{\varepsilon}_{n}\leqslant t\}$ and output $X^{\varepsilon}_{t}=\Gamma^{\varepsilon}_{\mathcal{N}^{\varepsilon}_{t}\wedge N}$.
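Steps (1)-(5) above translate directly into code. The sketch below implements the chain $\Gamma^{\varepsilon}_{n}$ and returns $X^{\varepsilon}_{t}=\Gamma^{\varepsilon}_{\mathcal{N}^{\varepsilon}_{t}\wedge N}$; the function name and the Ornstein-Uhlenbeck-type test coefficients $\sigma(t,x,z)=z$, $b(t,x)=-x$ (so $m=1$, $\alpha\geqslant 2$) are illustrative assumptions, not from the text.

```python
import numpy as np

def simulate_X_eps(t, eps, N, x0, sigma_eps, b_eps, sample_xi, rng):
    """Run the chain Gamma^eps_n of steps (1)-(5) and return X^eps_t."""
    S = 0.0
    Gamma = np.asarray(x0, dtype=float)
    for n in range(N):
        S += eps * rng.exponential(1.0)   # S^eps_{n+1} = S^eps_n + eps*T_{n+1}
        if S > t:                         # all jump times up to t consumed
            break
        Gamma = Gamma + sigma_eps(S, Gamma, sample_xi(rng)) + b_eps(S, Gamma)
    return Gamma

# illustrative test case for alpha >= 2, m = 1 (cf. (3.19)):
# sigma(t,x,z) = z  =>  sigma_eps = sqrt(eps)*z;  b(t,x) = -x  =>  b_eps = -eps*x
rng = np.random.default_rng(1)
eps = 1e-3
X1 = simulate_X_eps(
    t=1.0, eps=eps, N=5000, x0=np.array([1.0]),
    sigma_eps=lambda s, x, z: np.sqrt(eps) * z,
    b_eps=lambda s, x: -eps * x,
    sample_xi=lambda r: r.normal(size=1),
    rng=rng,
)
```

For these coefficients the scheme approximates the Ornstein-Uhlenbeck SDE $\mathrm{d}X_{t}=\mathrm{d}W_{t}-X_{t}\,\mathrm{d}t$.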

The following simple lemma provides a tail probability estimate for $\mathcal{N}^{\varepsilon}_{t}$, which tells us how to choose the value of $N$ in practice.

Lemma 3.5.

For any $n\in\mathbb{N}$, we have

\mathbb{P}\big(\mathcal{N}^{\varepsilon}_{t}\geqslant\tfrac{(\mathrm{e}-1)t}{\varepsilon}+n\big)\leqslant\mathrm{e}^{-n}.

Proof.

By Chebyshev’s exponential inequality and $\mathbb{E}\,\mathrm{e}^{\mathcal{N}^{\varepsilon}_{t}}=\mathrm{e}^{(\mathrm{e}-1)t/\varepsilon}$, we have

\mathbb{P}\big(\mathcal{N}^{\varepsilon}_{t}\geqslant\tfrac{(\mathrm{e}-1)t}{\varepsilon}+n\big)\leqslant\mathrm{e}^{-\frac{(\mathrm{e}-1)t}{\varepsilon}-n}\,\mathbb{E}\,\mathrm{e}^{\mathcal{N}^{\varepsilon}_{t}}=\mathrm{e}^{-n}. ∎
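In practice Lemma 3.5 gives a concrete recipe for the iteration number $N$ in the algorithm above: to guarantee $\mathbb{P}(\mathcal{N}^{\varepsilon}_{t}\geqslant N)\leqslant\delta$ it suffices to take $n=\lceil\log(1/\delta)\rceil$. A small sketch (the helper name `iterations_needed` is mine):

```python
import math

def iterations_needed(t, eps, delta):
    """An integer N of the form ceil((e-1)t/eps) + n such that, by Lemma 3.5,
    P(N^eps_t >= N) <= e^{-n} <= delta; take n = ceil(log(1/delta))."""
    n = math.ceil(math.log(1.0 / delta))
    return math.ceil((math.e - 1.0) * t / eps) + n

# e.g. t = 1, eps = 0.01: about t/eps = 100 jumps on average, and
# N = 186 already ensures a failure probability below 1e-6
N = iterations_needed(t=1.0, eps=0.01, delta=1e-6)
```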

Remark 3.6.

The sequence $(\Gamma^{\varepsilon}_{n})_{n\geqslant 0}$ forms a Markov chain with state space $\mathbb{R}^{d}$. The above lemmas provide a practical method for simulating $X^{\varepsilon}_{t}$ on a computer. It is important to note that approximating a diffusion process by a Markov chain is a well-established topic, as discussed in [38, Chapter 11.2]. Therein, the focus is on the time-homogeneous case, and piecewise linear interpolation is used for the approximation. In our approach, we embed the Markov chain into a continuous-time process via a Poisson process. It is crucial to highlight that $\Gamma^{\varepsilon}_{n}$ is not independent of $\mathcal{N}^{\varepsilon}_{t}$, due to the time-inhomogeneity of $\sigma$ and $b$. Our computations rely heavily on the calculus of stochastic integrals with jumps.

Note that for a bounded measurable function $f:\mathbb{R}^{d}\to\mathbb{R}$,

f(X^{\varepsilon}_{t})-f(X_{0})=\sum_{s\leqslant t}\big(f(X^{\varepsilon}_{s})-f(X^{\varepsilon}_{s-})\big)
=\sum_{s\leqslant t}\left(f\big(X^{\varepsilon}_{s-}+\sigma_{\varepsilon}(s,X^{\varepsilon}_{s-},\Delta H^{\varepsilon}_{s})+b_{\varepsilon}(s,X^{\varepsilon}_{s-})\Delta\mathcal{N}^{\varepsilon}_{s}\big)-f(X^{\varepsilon}_{s-})\right)
\stackrel{(3.3)}{=}\int^{t}_{0}\int_{\mathbb{R}^{d}}\left(f(X^{\varepsilon}_{s-}+\sigma_{\varepsilon}(s,X^{\varepsilon}_{s-},z)+b_{\varepsilon}(s,X^{\varepsilon}_{s-}))-f(X^{\varepsilon}_{s-})\right)\mathcal{H}^{\varepsilon}(\mathrm{d}s,\mathrm{d}z)
=\int^{t}_{0}\int_{\mathbb{R}^{d}}\frac{f(X^{\varepsilon}_{s}+\sigma_{\varepsilon}(s,X^{\varepsilon}_{s},z)+b_{\varepsilon}(s,X^{\varepsilon}_{s}))-f(X^{\varepsilon}_{s})}{\varepsilon}\,\nu(\mathrm{d}z)\,\mathrm{d}s+M^{\varepsilon}_{t}, (3.12)

where $M^{\varepsilon}_{t}$ is a martingale defined by

M^{\varepsilon}_{t}:=\int^{t}_{0}\int_{\mathbb{R}^{d}}\left(f\big(X^{\varepsilon}_{s-}+\sigma_{\varepsilon}(s,X^{\varepsilon}_{s-},z)+b_{\varepsilon}(s,X^{\varepsilon}_{s-})\big)-f(X^{\varepsilon}_{s-})\right)\widetilde{\mathcal{H}}^{\varepsilon}(\mathrm{d}s,\mathrm{d}z).

This is just Itô’s formula for jump processes. In particular,

\mathbb{E}f(X^{\varepsilon}_{t})-\mathbb{E}f(X_{0})=\mathbb{E}\left(\int^{t}_{0}\mathscr{L}^{(\varepsilon)}_{s}f(X^{\varepsilon}_{s})\,\mathrm{d}s\right),

where the infinitesimal generator $\mathscr{L}^{(\varepsilon)}_{s}$ of the Markov process $(X^{\varepsilon}_{t})_{t\geqslant 0}$ is given by

\mathscr{L}^{(\varepsilon)}_{s}f(x):=\int_{\mathbb{R}^{d}}\frac{f(x+\sigma_{\varepsilon}(s,x,z)+b_{\varepsilon}(s,x))-f(x)}{\varepsilon}\,\nu(\mathrm{d}z)=:\mathcal{A}^{(\varepsilon)}_{s}f(x)+\mathcal{B}^{(\varepsilon)}_{s}f(x)

with

\mathcal{A}^{(\varepsilon)}_{s}f(x):=\int_{\mathbb{R}^{d}}\frac{\mathcal{D}^{(\varepsilon)}_{s}f(x+\sigma_{\varepsilon}(s,x,z))-\mathcal{D}^{(\varepsilon)}_{s}f(x)}{\varepsilon}\,\nu(\mathrm{d}z)

and

\mathcal{B}^{(\varepsilon)}_{s}f(x):=\frac{\mathcal{D}^{(\varepsilon)}_{s}f(x)-f(x)}{\varepsilon},\quad \mathcal{D}^{(\varepsilon)}_{s}f(x):=f(x+b_{\varepsilon}(s,x)). (3.13)

By convention we have used that

\mathcal{D}^{(\varepsilon)}_{s}f(x+y)=f(x+y+b_{\varepsilon}(s,x)). (3.14)

Note that by the symmetry of $\nu$ and $\sigma_{\varepsilon}(t,x,-z)=-\sigma_{\varepsilon}(t,x,z)$,

\mathcal{A}^{(\varepsilon)}_{s}f(x)=\int_{\mathbb{R}^{d}}\frac{\mathcal{D}^{(\varepsilon)}_{s}f(x+\sigma_{\varepsilon}(s,x,z))+\mathcal{D}^{(\varepsilon)}_{s}f(x-\sigma_{\varepsilon}(s,x,z))-2\mathcal{D}^{(\varepsilon)}_{s}f(x)}{2\varepsilon}\,\nu(\mathrm{d}z). (3.15)

The concrete choices of $\sigma_{\varepsilon}$ (depending on $\alpha$) and $b_{\varepsilon}$ will be given in the following subsection.

3.1. Weak convergence of approximating SDEs

In this section, our aim is to construct appropriate functions $\sigma_{\varepsilon}$ and $b_{\varepsilon}$ such that the law of the approximating SDE converges to the law of the classical SDE driven by $\alpha$-stable processes or Brownian motions. The key aspect of our construction lies in demonstrating the convergence of the generators. It is important to note that the drift term is assumed to satisfy dissipativity conditions and can exhibit polynomial growth.

Let

\sigma:\mathbb{R}_{+}\times\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R}^{d},\quad b:\mathbb{R}_{+}\times\mathbb{R}^{d}\to\mathbb{R}^{d}

be two Borel measurable functions. We make the following assumptions on $\sigma$ and $b$:

  1. (H$^{\sigma}_{b}$)

    $\sigma(t,x,z)$ and $b(t,x)$ are locally bounded and continuous in $x$, and for some $\kappa_{0},\kappa_{1}>0$,

    \sigma(t,x,-z)=-\sigma(t,x,z),\quad |\sigma(t,x,z)|\leqslant(\kappa_{0}+\kappa_{1}|x|)|z|, (3.16)

    and for the same $\beta_{0}$ as in (3.5),

    |\sigma(t,x,z)-\sigma(t,x,z^{\prime})|\leqslant(\kappa_{0}+\kappa_{1}|x|)(|z-z^{\prime}|\wedge 1)^{\beta_{0}}, (3.17)

    and for some $m\geqslant 1$, $\kappa_{2},\kappa_{3},\kappa_{4}\geqslant 0$ and $\kappa_{5}<0$,

    |b(t,x)|\leqslant(\kappa_{2}(1+|x|))^{m},\quad \langle x,b(t,x)\rangle\leqslant\kappa_{3}+\kappa_{4}|x|^{2}+\kappa_{5}|x|^{m+1}. (3.18)

We introduce the coefficients of the approximating SDE (3.9) by

b_{\varepsilon}(t,x):=\frac{\varepsilon b(t,x)}{1+\sqrt{\varepsilon}|b(t,x)|^{1-\frac{1}{m}}},\quad \sigma_{\varepsilon}(t,x,z):=\begin{cases}\sqrt{\varepsilon}\,\sigma(t,x,z),&\alpha\geqslant 2,\\ \sigma(t,x,\varepsilon^{\frac{1}{\alpha}}z),&\alpha\in(0,2).\end{cases} (3.19)
Remark 3.7.

The purpose of introducing the function $b_{\varepsilon}$ is to ensure the dissipativity of the approximating SDEs, as demonstrated in Lemma 3.11 below. On the other hand, the introduction of $\sigma_{\varepsilon}$ with different scaling parameters for different values of $\alpha$ is aimed at ensuring the convergence of the generators, as shown in Lemma 3.9 below. It is worth noting that the drift term $b$ can exhibit polynomial growth, and in the case of linear growth (i.e., $m=1$), one can simply choose $b_{\varepsilon}(t,x)=\varepsilon b(t,x)$. Furthermore, by the definition of $b_{\varepsilon}$, it is evident that

|b_{\varepsilon}(t,x)|\leqslant(\varepsilon|b(t,x)|)\wedge(\sqrt{\varepsilon}|b(t,x)|^{\frac{1}{m}}). (3.20)
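The taming bound (3.20) is elementary to check numerically. The sketch below evaluates $|b_{\varepsilon}|$ as a function of $|b|$ and verifies both sides of the minimum; the helper name `b_eps_norm` and the test values are illustrative assumptions.

```python
import math

def b_eps_norm(eps, b_norm, m):
    """|b_eps(t,x)| from (3.19), written as a function of |b(t,x)|."""
    return eps * b_norm / (1.0 + math.sqrt(eps) * b_norm ** (1.0 - 1.0 / m))

# check (3.20): |b_eps| <= min(eps*|b|, sqrt(eps)*|b|^{1/m})
for eps in (0.5, 0.1, 1e-3):
    for b in (0.0, 1.0, 10.0, 1e4):
        for m in (1, 2, 5):
            v = b_eps_norm(eps, b, m)
            assert v <= eps * b + 1e-12
            assert v <= math.sqrt(eps) * b ** (1.0 / m) + 1e-12
```

In particular $|b_{\varepsilon}|/\varepsilon\to|b|$ pointwise as $\varepsilon\to 0$, while $|b_{\varepsilon}|$ itself stays of order $\sqrt{\varepsilon}\,|b|^{1/m}$ even for large $|b|$.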

In the next lemma we shall show that, as $\varepsilon\to 0$, $\mathscr{L}^{(\varepsilon)}_{s}f(x)$ converges to $\mathscr{L}^{(0)}_{s}f(x)$ with

\mathscr{L}^{(0)}_{s}f(x)=\mathcal{A}^{(0)}_{s}f(x)+b(s,x)\cdot\nabla f(x), (3.21)

where

\mathcal{A}^{(0)}_{s}f(x):=\begin{cases}\frac{1}{2}\mathrm{tr}\left(\int_{\mathbb{R}^{d}}\sigma(s,x,z)\otimes\sigma(s,x,z)\,\nu(\mathrm{d}z)\cdot\nabla^{2}f(x)\right),&\alpha\geqslant 2,\\[4pt] \int_{\mathbb{R}^{d}}\frac{f(x+\sigma(s,x,z))+f(x-\sigma(s,x,z))-2f(x)}{2}\,\nu_{0}(\mathrm{d}z),&\alpha\in(0,2).\end{cases}

This observation suggests that $X^{\varepsilon}_{\cdot}$ is expected to converge weakly to a solution of the following SDE:

\begin{cases}\mathrm{d}X_{t}=\sigma_{\nu}^{(2)}(t,X_{t})\,\mathrm{d}W_{t}+b(t,X_{t})\,\mathrm{d}t,&\alpha\geqslant 2,\\ \mathrm{d}X_{t}=\int_{\mathbb{R}^{d}}\sigma(t,X_{t-},z)\,\widetilde{\mathcal{H}}(\mathrm{d}t,\mathrm{d}z)+b(t,X_{t})\,\mathrm{d}t,&\alpha\in(0,2),\end{cases} (3.22)

where $W_{t}$ is a $d$-dimensional standard Brownian motion, and

\sigma_{\nu}^{(2)}(t,x):=\left(\int_{\mathbb{R}^{d}}\sigma(t,x,z)\otimes\sigma(t,x,z)\,\nu(\mathrm{d}z)\right)^{\frac{1}{2}},

and when $\alpha\in(0,2)$, for a $d$-dimensional symmetric Lévy process $L^{(\alpha)}_{t}$ with Lévy measure $\nu_{0}$,

\mathcal{H}([0,t]\times E):=\sum_{s\leqslant t}\mathbbm{1}_{E}(\Delta L^{(\alpha)}_{s}),\quad t\geqslant 0,\ E\in\mathscr{B}(\mathbb{R}^{d}),

and

\widetilde{\mathcal{H}}([0,t]\times E):=\mathcal{H}([0,t]\times E)-t\nu_{0}(E),\quad t\geqslant 0,\ E\in\mathscr{B}(\mathbb{R}^{d}). (3.23)
Remark 3.8.

Let $\alpha\geqslant 2$ and $\nu(\mathrm{d}z)=d^{-1}\sum_{i=1}^{d}\bar{\nu}(\mathrm{d}z_{i})\,\delta_{\{0\}}(\mathrm{d}z^{*}_{i})$, where $\bar{\nu}\in\mathcal{P}(\mathbb{R})$ satisfies $\int_{\mathbb{R}}|z|^{\alpha}\bar{\nu}(\mathrm{d}z)<\infty$, and $z^{*}_{i}$ represents the remaining variables except for $z_{i}$. Let $\sigma(t,x,z)=\sigma_{0}(t,x)z$, where $\sigma_{0}(t,x):\mathbb{R}_{+}\times\mathbb{R}^{d}\to\mathbb{R}^{d}\otimes\mathbb{R}^{d}$ is Borel measurable. In this case, we can take

\sigma_{\nu}^{(2)}(t,x)=\sigma_{0}(t,x)\sqrt{\bar{\nu}(|z|^{2})/d}.

Let $\{e_{i},i=1,\cdots,d\}$ be the canonical basis of $\mathbb{R}^{d}$. Suppose that $\sigma(t,x,z)=\sqrt{2d}\cdot z$, $b=0$ and

\nu(\mathrm{d}z)=\frac{1}{2d}\sum_{i=1}^{d}\Big(\delta_{e_{i}}(\mathrm{d}z)+\delta_{-e_{i}}(\mathrm{d}z)\Big).

Then $\mathcal{A}^{(\varepsilon)}_{s}f(x)=\Delta_{\varepsilon}f(x)=\sum_{i=1}^{d}\frac{f(x+\sqrt{2d\varepsilon}\,e_{i})+f(x-\sqrt{2d\varepsilon}\,e_{i})-2f(x)}{2d\varepsilon}$ is the standard discrete Laplacian.
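The last identity of Remark 3.8 can be checked directly: for this choice of $\sigma$, $b$ and $\nu$, the generator $\mathcal{A}^{(\varepsilon)}_{s}$ is the symmetric second difference on the lattice of mesh $\sqrt{2d\varepsilon}$, which is exact on quadratics. A quick sketch (the function name is mine):

```python
import numpy as np

def discrete_laplacian(f, x, eps):
    """Delta_eps f(x) = sum_i [f(x+h e_i)+f(x-h e_i)-2 f(x)]/(2 d eps), h = sqrt(2 d eps)."""
    d = len(x)
    h = np.sqrt(2.0 * d * eps)
    total = 0.0
    for i in range(d):
        e = np.zeros(d)
        e[i] = h
        total += f(x + e) + f(x - e) - 2.0 * f(x)
    return total / (2.0 * d * eps)

# on the quadratic f(x) = |x|^2 the second difference is exact: Delta f = 2d
x = np.array([0.3, -1.2, 0.5])
val = discrete_laplacian(lambda y: float(np.dot(y, y)), x, eps=0.01)  # equals 2d = 6 up to rounding
```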

The following lemma is crucial for taking limits.

Lemma 3.9.

Under (H$^{\alpha}_{\nu}$) and (H$^{\sigma}_{b}$), for any $R>0$, there is a constant $C_{R}>0$ such that for any $f\in C^{2}_{b}(\mathbb{R}^{d})$, and for all $\varepsilon\in(0,1)$, $s\geqslant 0$ and $|x|\leqslant R$,

\big|\mathscr{L}^{(\varepsilon)}_{s}f(x)-\mathscr{L}^{(0)}_{s}f(x)\big|\leqslant C_{R}\Big(o(\varepsilon)\mathbbm{1}_{\alpha=2}+\varepsilon^{\frac{(\alpha-2)\wedge 1}{2}}\|f\|_{C^{\alpha}_{b}}\mathbbm{1}_{\alpha>2}+\varepsilon^{\frac{2-\alpha}{2}\wedge\beta_{1}}\|f\|_{C^{2}_{b}}\mathbbm{1}_{\alpha<2}\Big),

where $\beta_{1}$ is from (H$^{\alpha}_{\nu}$). Moreover, if $b$ is bounded measurable and $\kappa_{1}=0$ in (H$^{\sigma}_{b}$), then the constant $C_{R}$ can be chosen independently of $R>0$.

Proof.

Below we drop the time variable for simplicity. Recalling (ε)f(x)=f(x+bε(x))f(x)ε{\mathcal{B}}^{(\varepsilon)}f(x)=\frac{f(x+b_{\varepsilon}(x))-f(x)}{\varepsilon}, by Taylor’s expansion and the definition (3.19), we have

|(ε)f(x)b(x)f(x)|\displaystyle|{\mathcal{B}}^{(\varepsilon)}f(x)-b(x)\cdot\nabla f(x)| |(ε)f(x)ε1bε(x)f(x)|+|(ε1bε(x)b(x))f(x)|\displaystyle\leqslant|{\mathcal{B}}^{(\varepsilon)}f(x)-\varepsilon^{-1}b_{\varepsilon}(x)\cdot\nabla f(x)|+|(\varepsilon^{-1}b_{\varepsilon}(x)-b(x))\cdot\nabla f(x)|
|bε(x)|01|f(x+θbε(x))f(x)|εdθ+|ε1bε(x)b(x)|f\displaystyle\leqslant|b_{\varepsilon}(x)|\int^{1}_{0}\frac{|\nabla f(x+\theta b_{\varepsilon}(x))-\nabla f(x)|}{\varepsilon}{\mathord{{\rm d}}}\theta+|\varepsilon^{-1}b_{\varepsilon}(x)-b(x)|\|\nabla f\|_{\infty}
ε1|bε(x)|22f+ε|b(x)|21m1+ε|b(x)|11mf\displaystyle\leqslant\varepsilon^{-1}|b_{\varepsilon}(x)|^{2}\|\nabla^{2}f\|_{\infty}+\frac{\sqrt{\varepsilon}|b(x)|^{2-\frac{1}{m}}}{1+\sqrt{\varepsilon}|b(x)|^{1-\frac{1}{m}}}\|\nabla f\|_{\infty}
ε|b(x)|22f+ε|b(x)|21mf\displaystyle\leqslant\varepsilon|b(x)|^{2}\|\nabla^{2}f\|_{\infty}+\sqrt{\varepsilon}|b(x)|^{2-\frac{1}{m}}\|\nabla f\|_{\infty}
Cε(1+|b(x)|2)fCb1.\displaystyle\leqslant C\sqrt{\varepsilon}\big{(}1+|b(x)|^{2}\big{)}\|\nabla f\|_{C^{1}_{b}}. (3.24)
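The $O(\sqrt{\varepsilon})$ rate in (3.24) can be observed numerically. The following one-dimensional Python sketch (illustrative only: the drift $b(x)=x^{3}$, i.e. $m=3$, and $f=\sin$ are our choices) evaluates $|{\mathcal{B}}^{(\varepsilon)}f(x)-b(x)f'(x)|$ for decreasing $\varepsilon$, with $b_{\varepsilon}$ as in (3.19):

```python
import math

def b_eps(b, m, x, eps):
    # b_eps(x) = eps * b(x) / (1 + sqrt(eps) * |b(x)|^(1 - 1/m)), cf. (3.19).
    bx = b(x)
    return eps * bx / (1.0 + math.sqrt(eps) * abs(bx) ** (1.0 - 1.0 / m))

def B_eps(f, b, m, x, eps):
    # One-dimensional version of B^(eps) f(x) = (f(x + b_eps(x)) - f(x)) / eps.
    return (f(x + b_eps(b, m, x, eps)) - f(x)) / eps

# Illustrative data: b(x) = x^3 grows polynomially with m = 3; f = sin.
b = lambda x: x ** 3
f, df = math.sin, math.cos
x = 0.8
errs = [abs(B_eps(f, b, 3, x, e) - b(x) * df(x)) for e in (1e-2, 1e-4, 1e-6)]
```

Each tenfold decrease of $\varepsilon$ above reduces the error by roughly a factor $\sqrt{100}=10$, consistent with the $\sqrt{\varepsilon}$ bound in (3.24).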

Next, by (3.14) and Taylor’s expansion again, we have

𝒟(ε)f(x+σε(x,z))+𝒟(ε)f(xσε(x,z))2𝒟(ε)f(x)\displaystyle{\mathcal{D}}^{(\varepsilon)}f(x+\sigma_{\varepsilon}(x,z))+{\mathcal{D}}^{(\varepsilon)}f(x-\sigma_{\varepsilon}(x,z))-2{\mathcal{D}}^{(\varepsilon)}f(x)
=σε(x,z)01[𝒟(ε)f(x+θσε(x,z))𝒟(ε)f(xθσε(x,z))]dθ\displaystyle\quad=\sigma_{\varepsilon}(x,z)\cdot\int^{1}_{0}\Big{[}{\mathcal{D}}^{(\varepsilon)}\nabla f(x+\theta\sigma_{\varepsilon}(x,z))-{\mathcal{D}}^{(\varepsilon)}\nabla f(x-\theta\sigma_{\varepsilon}(x,z))\Big{]}{\mathord{{\rm d}}}\theta
=01θ11[tr((σεσε)(x,z)𝒟(ε)2f(x+θθσε(x,z)))]dθdθ.\displaystyle\quad=\int^{1}_{0}\theta\int^{1}_{-1}\Big{[}\mathrm{tr}{\big{(}}(\sigma_{\varepsilon}\otimes\sigma_{\varepsilon})(x,z)\cdot{\mathcal{D}}^{(\varepsilon)}\nabla^{2}f(x+\theta^{\prime}\theta\sigma_{\varepsilon}(x,z)){\big{)}}\Big{]}{\mathord{{\rm d}}}\theta^{\prime}{\mathord{{\rm d}}}\theta. (3.25)

When α2\alpha\geqslant 2, recalling σε(x,z)=εσ(x,z)\sigma_{\varepsilon}(x,z)=\sqrt{\varepsilon}\sigma(x,z), by (3.15) and (3.25) we have

𝒜(ε)f(x)𝒜(0)f(x)=𝒜(ε)f(x)12dtr((σσ)(x,z)2f(x))ν(dz)\displaystyle{\mathcal{A}}^{(\varepsilon)}f(x)-{\mathcal{A}}^{(0)}f(x)={\mathcal{A}}^{(\varepsilon)}f(x)-\frac{1}{2}\int_{{\mathbb{R}}^{d}}\mathrm{tr}((\sigma\otimes\sigma)(x,z)\cdot\nabla^{2}f(x))\nu({\mathord{{\rm d}}}z)
=d01θ211[tr((σσ)(x,z)(𝒟(ε)2f(x+θθεσ(x,z))𝒟(ε)2f(x)))]dθdθν(dz)\displaystyle=\int_{{\mathbb{R}}^{d}}\int^{1}_{0}\frac{\theta}{2}\int^{1}_{-1}\Big{[}\mathrm{tr}{\big{(}}(\sigma\otimes\sigma)(x,z)\cdot{\big{(}}{\mathcal{D}}^{(\varepsilon)}\nabla^{2}f(x+\theta^{\prime}\theta\sqrt{\varepsilon}\sigma(x,z))-{\mathcal{D}}^{(\varepsilon)}\nabla^{2}f(x){\big{)}}{\big{)}}\Big{]}{\mathord{{\rm d}}}\theta^{\prime}{\mathord{{\rm d}}}\theta\nu({\mathord{{\rm d}}}z)
+12dtr((σσ)(x,z)(𝒟(ε)2f(x)2f(x)))ν(dz).\displaystyle\quad+\frac{1}{2}\int_{{\mathbb{R}}^{d}}\mathrm{tr}{\big{(}}(\sigma\otimes\sigma)(x,z)\cdot({\mathcal{D}}^{(\varepsilon)}\nabla^{2}f(x)-\nabla^{2}f(x)){\big{)}}\nu({\mathord{{\rm d}}}z).

Hence, recalling ${\mathcal{D}}^{(\varepsilon)}f(x)=f(x+b_{\varepsilon}(x))$, by (3.16), we have for $\alpha=2$,

sup|x|R|𝒜(ε)f(x)𝒜(0)f(x)|CRo(ε),\displaystyle\sup_{|x|\leqslant R}\big{|}{\mathcal{A}}^{(\varepsilon)}f(x)-{\mathcal{A}}^{(0)}f(x)\big{|}\leqslant C_{R}\,o(\varepsilon), (3.26)

and for α>2\alpha>2,

|𝒜(ε)f(x)𝒜(0)f(x)|\displaystyle\big{|}{\mathcal{A}}^{(\varepsilon)}f(x)-{\mathcal{A}}^{(0)}f(x)\big{|} CR(ε(α2)12ν(|z|α)+ε(α2)1ν(|z|2))2fCb(α2)1.\displaystyle\leqslant C_{R}\Big{(}\varepsilon^{\frac{(\alpha-2)\wedge 1}{2}}\nu(|z|^{\alpha})+\varepsilon^{(\alpha-2)\wedge 1}\nu(|z|^{2})\Big{)}\|\nabla^{2}f\|_{C^{(\alpha-2)\wedge 1}_{b}}. (3.27)

When α(0,2)\alpha\in(0,2), recalling σε(x,z)=σ(x,ε1αz)\sigma_{\varepsilon}(x,z)=\sigma(x,\varepsilon^{\frac{1}{\alpha}}z) and by (3.15) and the change of variables, we have

𝒜(ε)f(x)\displaystyle{\mathcal{A}}^{(\varepsilon)}f(x) =d𝒟(ε)f(x+σ(x,z))+𝒟(ε)f(xσ(x,z))2𝒟(ε)f(x)2νε(dz),\displaystyle=\int_{{\mathbb{R}}^{d}}\frac{{\mathcal{D}}^{(\varepsilon)}f(x+\sigma(x,z))+{\mathcal{D}}^{(\varepsilon)}f(x-\sigma(x,z))-2{\mathcal{D}}^{(\varepsilon)}f(x)}{2}\nu_{\varepsilon}({\mathord{{\rm d}}}z), (3.28)

where

νε(dz)=ν(dz/ε1/α)/ε.\nu_{\varepsilon}({\mathord{{\rm d}}}z)=\nu({\mathord{{\rm d}}}z/\varepsilon^{1/\alpha})/\varepsilon.

Hence, for fε:=𝒟(ε)fff_{\varepsilon}:={\mathcal{D}}^{(\varepsilon)}f-f, we have

\left|{\mathcal{A}}^{(\varepsilon)}f(x)-\int_{{\mathbb{R}}^{d}}\frac{f(x+\sigma(x,z))+f(x-\sigma(x,z))-2f(x)}{2}\nu_{0}({\mathord{{\rm d}}}z)\right|
|df(x+σ(x,z))+f(xσ(x,z))2f(x)2(νε(dz)ν0(dz))|\displaystyle\leqslant\left|\int_{{\mathbb{R}}^{d}}\frac{f(x+\sigma(x,z))+f(x-\sigma(x,z))-2f(x)}{2}(\nu_{\varepsilon}({\mathord{{\rm d}}}z)-\nu_{0}({\mathord{{\rm d}}}z))\right|
+|dfε(x+σ(x,z))+fε(xσ(x,z))2fε(x)2νε(dz)|=:I1(x)+I2(x).\displaystyle\quad+\left|\int_{{\mathbb{R}}^{d}}\frac{f_{\varepsilon}(x+\sigma(x,z))+f_{\varepsilon}(x-\sigma(x,z))-2f_{\varepsilon}(x)}{2}\nu_{\varepsilon}({\mathord{{\rm d}}}z)\right|=:I_{1}(x)+I_{2}(x).

For I1(x)I_{1}(x), set

Gx(z):=f(x+σ(x,z))+f(xσ(x,z))2f(x)2.G_{x}(z):=\frac{f(x+\sigma(x,z))+f(x-\sigma(x,z))-2f(x)}{2}.

Then by (3.25), we have

|G_{x}(z)|\leqslant\|\nabla^{2}f\|_{\infty}(\kappa_{0}+\kappa_{1}|x|)^{2}|z|^{2},

and

|G_{x}(z)-G_{x}(z^{\prime})|\leqslant 2\|\nabla f\|_{\infty}|\sigma(x,z)-\sigma(x,z^{\prime})|.

Thus by (3.17) and (3.6), we have

sup|x|RI1(x)CRfCb2εβ1.\displaystyle\sup_{|x|\leqslant R}I_{1}(x)\leqslant C_{R}\|f\|_{C^{2}_{b}}\varepsilon^{\beta_{1}}.

For I2(x)I_{2}(x), noting that by (3.25),

|fε(x+σ(x,z))+fε(xσ(x,z))2fε(x)|2f(κ0+κ1|x|)2|z|2|f_{\varepsilon}(x+\sigma(x,z))+f_{\varepsilon}(x-\sigma(x,z))-2f_{\varepsilon}(x)|\leqslant\|\nabla^{2}f\|_{\infty}(\kappa_{0}+\kappa_{1}|x|)^{2}|z|^{2}

and by (3.20),

|fε(x+σ(x,z))+fε(xσ(x,z))2fε(x)|4εf|b(x)|,|f_{\varepsilon}(x+\sigma(x,z))+f_{\varepsilon}(x-\sigma(x,z))-2f_{\varepsilon}(x)|\leqslant 4\varepsilon\|\nabla f\|_{\infty}|b(x)|,

we have

I2(x)2f(κ0+κ1|x|)2|z|ε12|z|2νε(dz)+4εf|b(x)||z|>ε12νε(dz).\displaystyle I_{2}(x)\leqslant\|\nabla^{2}f\|_{\infty}(\kappa_{0}+\kappa_{1}|x|)^{2}\int_{|z|\leqslant\varepsilon^{\frac{1}{2}}}|z|^{2}\nu_{\varepsilon}({\mathord{{\rm d}}}z)+4\varepsilon\|\nabla f\|_{\infty}|b(x)|\int_{|z|>\varepsilon^{\frac{1}{2}}}\nu_{\varepsilon}({\mathord{{\rm d}}}z).

Combining the above calculations and by (Hνα{}^{\alpha}_{\nu}) and Lemma 3.3, we obtain

sup|x|R|𝒜(ε)f(x)𝒜(0)f(x)|\displaystyle\sup_{|x|\leqslant R}\big{|}{\mathcal{A}}^{(\varepsilon)}f(x)-{\mathcal{A}}^{(0)}f(x)\big{|} CRfCb2(εβ1+ε1α2)2CRfCb2ε(1α2)β1,\displaystyle\leqslant C_{R}\|f\|_{C^{2}_{b}}(\varepsilon^{\beta_{1}}+\varepsilon^{1-\frac{\alpha}{2}})\leqslant 2C_{R}\|f\|_{C^{2}_{b}}\varepsilon^{(1-\frac{\alpha}{2})\wedge\beta_{1}}, (3.29)

which together with (3.24), (3.26) and (3.27) yields the desired estimate. If bb is bounded and κ1=0\kappa_{1}=0, that is, |σ(t,x,z)|κ0|z||\sigma(t,x,z)|\leqslant\kappa_{0}|z|, from the above proof, one sees that CRC_{R} is independent of RR. ∎

For β\beta\in{\mathbb{R}}, we define

Uβ(x):=(1+|x|2)β/2,xd.U_{\beta}(x):=(1+|x|^{2})^{\beta/2},\ x\in{\mathbb{R}}^{d}.

We need the following elementary Hölder estimate about UβU_{\beta}.

Lemma 3.10.

For any β(0,2]\beta\in(0,2], there is a constant C=C(β,d)>0C=C(\beta,d)>0 such that for all x,ydx,y\in{\mathbb{R}}^{d},

|Uβ(x+y)+Uβ(xy)2Uβ(x)|C|y|β.|U_{\beta}(x+y)+U_{\beta}(x-y)-2U_{\beta}(x)|\leqslant C|y|^{\beta}.
Proof.

For β(0,1]\beta\in(0,1], noting that

|Uβ(x+y)Uβ(x)||g(x+y)g(x)|β,|U_{\beta}(x+y)-U_{\beta}(x)|\leqslant|g(x+y)-g(x)|^{\beta},

where g(x):=(1+|x|2)1/2g(x):=(1+|x|^{2})^{1/2}, and by |g(x)|1|\nabla g(x)|\leqslant 1, we immediately have

|Uβ(x+y)+Uβ(xy)2Uβ(x)|\displaystyle|U_{\beta}(x+y)+U_{\beta}(x-y)-2U_{\beta}(x)| |Uβ(x+y)Uβ(x)|+|Uβ(xy)Uβ(x)|\displaystyle\leqslant|U_{\beta}(x+y)-U_{\beta}(x)|+|U_{\beta}(x-y)-U_{\beta}(x)|
|g(x+y)g(x)|β+|g(xy)g(x)|β2|y|β.\displaystyle\leqslant|g(x+y)-g(x)|^{\beta}+|g(x-y)-g(x)|^{\beta}\leqslant 2|y|^{\beta}.

For β(1,2]\beta\in(1,2], by Taylor’s expansion we have

Uβ(x+y)+Uβ(xy)2Uβ(x)=y01[Uβ(x+θy)Uβ(xθy)]dθ.U_{\beta}(x+y)+U_{\beta}(x-y)-2U_{\beta}(x)=y\cdot\int^{1}_{0}[\nabla U_{\beta}(x+\theta y)-\nabla U_{\beta}(x-\theta y)]{\mathord{{\rm d}}}\theta.

In view of Uβ(x)=βx(1+|x|2)β21\nabla U_{\beta}(x)=\beta x(1+|x|^{2})^{\frac{\beta}{2}-1}, it suffices to show

|x(1+|x|2)β21y(1+|y|2)β21|C|xy|β1,|x(1+|x|^{2})^{\frac{\beta}{2}-1}-y(1+|y|^{2})^{\frac{\beta}{2}-1}|\leqslant C|x-y|^{\beta-1},

which in turn reduces to showing that, for each $i=1,\cdots,d$,

|xi(1+|xi|2+|xi|2)β21yi(1+|yi|2+|yi|2)β21|C|xy|β1,|x_{i}(1+|x_{i}|^{2}+|x^{*}_{i}|^{2})^{\frac{\beta}{2}-1}-y_{i}(1+|y_{i}|^{2}+|y^{*}_{i}|^{2})^{\frac{\beta}{2}-1}|\leqslant C|x-y|^{\beta-1},

where xix^{*}_{i} stands for the remaining variables except xix_{i}. The above estimate can be derived as a consequence of the following two estimates: for any a>0a>0,

|x(a+|x|2)β21y(a+|y|2)β21|(ββ1|xy|)β1,x,y,|x(a+|x|^{2})^{\frac{\beta}{2}-1}-y(a+|y|^{2})^{\frac{\beta}{2}-1}|\leqslant{\big{(}}\tfrac{\beta}{\beta-1}|x-y|{\big{)}}^{\beta-1},\ \ x,y\in{\mathbb{R}},

and

|a(1+a2+|x|2)β21a(1+a2+|y|2)β21|2|xy|β1,x,yd1.|a(1+a^{2}+|x|^{2})^{\frac{\beta}{2}-1}-a(1+a^{2}+|y|^{2})^{\frac{\beta}{2}-1}|\leqslant 2|x-y|^{\beta-1},\ \ x,y\in{\mathbb{R}}^{d-1}.

Set

g1(x):=|x|1β1(a+|x|2)β22(β1),g2(x):=a1β1(1+a2+|x|2)β22(β1).g_{1}(x):=|x|^{\frac{1}{\beta-1}}(a+|x|^{2})^{\frac{\beta-2}{2(\beta-1)}},\ \ g_{2}(x):=a^{\frac{1}{\beta-1}}(1+a^{2}+|x|^{2})^{\frac{\beta-2}{2(\beta-1)}}.

For β(1,2]\beta\in(1,2], it is easy to see that

|g1(x)|ββ1,|g2(x)|1.|g^{\prime}_{1}(x)|\leqslant\tfrac{\beta}{\beta-1},\ \ |\nabla g_{2}(x)|\leqslant 1.

Hence, for xy0x\cdot y\geqslant 0,

|x(a+|x|2)β21y(a+|y|2)β21||g1(x)g1(y)|β1(ββ1|xy|)β1,|x(a+|x|^{2})^{\frac{\beta}{2}-1}-y(a+|y|^{2})^{\frac{\beta}{2}-1}|\leqslant|g_{1}(x)-g_{1}(y)|^{\beta-1}\leqslant{\big{(}}\tfrac{\beta}{\beta-1}|x-y|{\big{)}}^{\beta-1},

and for xy<0x\cdot y<0,

|x(a+|x|2)β21y(a+|y|2)β21||x|β1+|y|β12|xy|β1,|x(a+|x|^{2})^{\frac{\beta}{2}-1}-y(a+|y|^{2})^{\frac{\beta}{2}-1}|\leqslant|x|^{\beta-1}+|y|^{\beta-1}\leqslant 2|x-y|^{\beta-1},

and

|a(1+a2+|x|2)β21a(1+a2+|y|2)β21||g2(x)g2(y)|β1|xy|β1.|a(1+a^{2}+|x|^{2})^{\frac{\beta}{2}-1}-a(1+a^{2}+|y|^{2})^{\frac{\beta}{2}-1}|\leqslant|g_{2}(x)-g_{2}(y)|^{\beta-1}\leqslant|x-y|^{\beta-1}.

The proof is complete. ∎
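Lemma 3.10 can also be checked by sampling (a sketch of ours; the value $\beta=1.5$, the dimension and the sampling ranges are illustrative choices): the ratio of the second difference of $U_{\beta}$ to $|y|^{\beta}$ should stay bounded by a constant $C(\beta,d)$.

```python
import math, random

def U(beta, x):
    # U_beta(x) = (1 + |x|^2)^(beta/2)
    return (1.0 + sum(xi * xi for xi in x)) ** (beta / 2.0)

def second_diff(beta, x, y):
    # |U_beta(x+y) + U_beta(x-y) - 2 U_beta(x)|
    xp = [a + c for a, c in zip(x, y)]
    xm = [a - c for a, c in zip(x, y)]
    return abs(U(beta, xp) + U(beta, xm) - 2.0 * U(beta, x))

random.seed(0)
beta, d = 1.5, 3
ratios = []
for _ in range(1000):
    x = [random.uniform(-50, 50) for _ in range(d)]
    y = [random.uniform(-50, 50) for _ in range(d)]
    ynorm = math.sqrt(sum(yi * yi for yi in y))
    ratios.append(second_diff(beta, x, y) / ynorm ** beta)
worst = max(ratios)  # empirical bound for the constant C(beta, d) on this sample
```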

We need the following technical lemma.

Lemma 3.11.

Under (3.18), for κ6\kappa_{6}\in{\mathbb{R}} satisfying

κ4+κ5<κ6 if m=1 and κ6<0 if m>1,\displaystyle\kappa_{4}+\kappa_{5}<\kappa_{6}\mbox{ if $m=1$ and }\kappa_{6}<0\mbox{ if $m>1$}, (3.30)

there are ε0(0,1)\varepsilon_{0}\in(0,1) and C1>0C_{1}>0 such that for all ε(0,ε0)\varepsilon\in(0,\varepsilon_{0}) and (t,x)[0,)×d(t,x)\in[0,\infty)\times{\mathbb{R}}^{d},

ε1[x,bε(t,x)+|bε(t,x)|2]κ6|x|2+C1.\displaystyle\varepsilon^{-1}\big{[}\langle x,b_{\varepsilon}(t,x)\rangle+|b_{\varepsilon}(t,x)|^{2}\big{]}\leqslant\kappa_{6}|x|^{2}+C_{1}. (3.31)
Proof.

By (3.19) and (3.18) we have

ε1x,bε(t,x)=x,b(t,x)1+ε|b(t,x)|11mκ3+κ4|x|2+κ5|x|m+11+ε|b(t,x)|11m.\varepsilon^{-1}\langle x,b_{\varepsilon}(t,x)\rangle=\frac{\langle x,b(t,x)\rangle}{1+\sqrt{\varepsilon}|b(t,x)|^{1-\frac{1}{m}}}\leqslant\frac{\kappa_{3}+\kappa_{4}|x|^{2}+\kappa_{5}|x|^{m+1}}{1+\sqrt{\varepsilon}|b(t,x)|^{1-\frac{1}{m}}}.

When m=1m=1, by |bε(t,x)|ε|b(t,x)||b_{\varepsilon}(t,x)|\leqslant\varepsilon|b(t,x)| and (3.18), we have

ε1[x,bε(t,x)+|bε(t,x)|2]\displaystyle\varepsilon^{-1}\big{[}\langle x,b_{\varepsilon}(t,x)\rangle+|b_{\varepsilon}(t,x)|^{2}\big{]} κ3+(κ4+κ5)|x|21+ε+εκ22(1+|x|)2\displaystyle\leqslant\frac{\kappa_{3}+(\kappa_{4}+\kappa_{5})|x|^{2}}{1+\sqrt{\varepsilon}}+\varepsilon\kappa_{2}^{2}(1+|x|)^{2}
(κ4+κ51+ε+2εκ22)|x|2+κ31+ε+2εκ22.\displaystyle\leqslant\Big{(}\tfrac{\kappa_{4}+\kappa_{5}}{1+\sqrt{\varepsilon}}+2\varepsilon\kappa^{2}_{2}\Big{)}|x|^{2}+\tfrac{\kappa_{3}}{1+\sqrt{\varepsilon}}+2\varepsilon\kappa^{2}_{2}.

In particular, for given κ6>κ4+κ5\kappa_{6}>\kappa_{4}+\kappa_{5}, if ε0\varepsilon_{0} is small enough, then for some C1>0C_{1}>0 and all ε(0,ε0)\varepsilon\in(0,\varepsilon_{0}),

ε1[x,bε(t,x)+|bε(t,x)|2]κ6|x|2+C1.\varepsilon^{-1}\big{[}\langle x,b_{\varepsilon}(t,x)\rangle+|b_{\varepsilon}(t,x)|^{2}\big{]}\leqslant\kappa_{6}|x|^{2}+C_{1}.

When m>1m>1, for any K1K\geqslant 1, thanks to κ5<0\kappa_{5}<0, by Young’s inequality, there are constants ε0,C0(K)>0\varepsilon_{0},C_{0}(K)>0 such that for all ε(0,ε0)\varepsilon\in(0,\varepsilon_{0}),

κ5|x|m+11+ε|b(t,x)|11mκ5|x|m+11+ε(κ2(1+|x|))m1Kκ5|x|2+C0.\frac{\kappa_{5}|x|^{m+1}}{1+\sqrt{\varepsilon}|b(t,x)|^{1-\frac{1}{m}}}\leqslant\frac{\kappa_{5}|x|^{m+1}}{1+\sqrt{\varepsilon}(\kappa_{2}(1+|x|))^{m-1}}\leqslant K\kappa_{5}|x|^{2}+C_{0}.

Hence, by |bε(t,x)|2ε|b(t,x)|2mεκ22(1+|x|)2|b_{\varepsilon}(t,x)|^{2}\leqslant\varepsilon|b(t,x)|^{\frac{2}{m}}\leqslant\varepsilon\kappa^{2}_{2}(1+|x|)^{2},

ε1[x,bε(t,x)+|bε(t,x)|2]\displaystyle\varepsilon^{-1}\big{[}\langle x,b_{\varepsilon}(t,x)\rangle+|b_{\varepsilon}(t,x)|^{2}\big{]} κ3+κ4|x|2+Kκ5|x|2+C0+κ22(1+|x|)2\displaystyle\leqslant\kappa_{3}+\kappa_{4}|x|^{2}+K\kappa_{5}|x|^{2}+C_{0}+\kappa^{2}_{2}(1+|x|)^{2}
(κ4+2κ22+Kκ5)|x|2+C1(K),\displaystyle\leqslant(\kappa_{4}+2\kappa^{2}_{2}+K\kappa_{5})|x|^{2}+C_{1}(K),

which implies (3.31) by κ5<0\kappa_{5}<0 and choosing KK large enough. ∎
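For a concrete instance of (3.31), take $b(x)=-x^{3}$, so that $m=3$, $\kappa_{4}=0$ and $\kappa_{5}=-1<0$. The following sketch (ours; the grid and the admissible choice $\kappa_{6}=-1/2$ are illustrative) evaluates the left-hand side of (3.31) and records the constant $C_{1}$ needed on the sampled range:

```python
import math

def b_eps(x, eps, m):
    # b_eps = eps * b / (1 + sqrt(eps) * |b|^(1 - 1/m)), cf. (3.19),
    # with the illustrative dissipative drift b(x) = -x^3 (so m = 3).
    bx = -x ** 3
    return eps * bx / (1.0 + math.sqrt(eps) * abs(bx) ** (1.0 - 1.0 / m))

kappa6 = -0.5  # any negative constant is admissible when m > 1, cf. (3.30)
C1 = 0.0
for k in range(-400, 401):
    x = k / 20.0  # grid over [-20, 20]
    for eps in (1e-1, 1e-2, 1e-3):
        be = b_eps(x, eps, 3)
        lhs = (x * be + be * be) / eps
        C1 = max(C1, lhs - kappa6 * x * x)
# (3.31) holds on the sampled range with this (small) constant C1
```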

Now we show the following Lyapunov’s type estimate.

Lemma 3.12.

Under (Hνα{}^{\alpha}_{\nu}) and (Hbσ{}^{\sigma}_{b}), for any β(0,α)\beta\in(0,\alpha) and κ6\kappa_{6}\in{\mathbb{R}} satisfying (3.30), there are constants ε0(0,1)\varepsilon_{0}\in(0,1), C0=C0(β)>0C_{0}=C_{0}(\beta)>0, C1=C1(β,ν)>0C_{1}=C_{1}(\beta,\nu)>0 and C2>0C_{2}>0 such that for all ε(0,ε0)\varepsilon\in(0,\varepsilon_{0}), s0s\geqslant 0 and xdx\in{\mathbb{R}}^{d},

s(ε)Uβ(x)(C0κ6+C1(κ12α𝟙β(0,2)+κ1β𝟙β2))Uβ(x)+C2.\displaystyle{\mathscr{L}}^{(\varepsilon)}_{s}U_{\beta}(x)\leqslant{\big{(}}C_{0}\kappa_{6}+C_{1}(\kappa^{2\wedge\alpha}_{1}{\mathbbm{1}}_{\beta\in(0,2)}+\kappa_{1}^{\beta}{\mathbbm{1}}_{\beta\geqslant 2}){\big{)}}U_{\beta}(x)+C_{2}. (3.32)
Proof.

It suffices to prove the above estimate for large $|x|$. We divide the proof into three steps. For simplicity, we drop the time variable.

(Step 1). Note that

Uβ(x)=βxUβ2(x),\nabla U_{\beta}(x)=\beta xU_{\beta-2}(x),

and

2Uβ(x)=βUβ2(x)𝕀+β(β2)Uβ4(x)(xx).\displaystyle\nabla^{2}U_{\beta}(x)=\beta U_{\beta-2}(x){\mathbb{I}}+\beta(\beta-2)U_{\beta-4}(x)(x\otimes x). (3.33)

By (3.13) and (3.31), we have

(ε)Uβ(x)\displaystyle{\mathcal{B}}^{(\varepsilon)}U_{\beta}(x) =ε101bε(x),Uβ(x+θbε(x))dθ\displaystyle=\varepsilon^{-1}\int^{1}_{0}\langle b_{\varepsilon}(x),\nabla U_{\beta}(x+\theta b_{\varepsilon}(x))\rangle{\mathord{{\rm d}}}\theta
=ε1β01[bε(x),x+θ|bε(x)|2]Uβ2(x+θbε(x))dθ\displaystyle=\varepsilon^{-1}\beta\int^{1}_{0}\Big{[}\langle b_{\varepsilon}(x),x\rangle+\theta|b_{\varepsilon}(x)|^{2}\Big{]}U_{\beta-2}(x+\theta b_{\varepsilon}(x)){\mathord{{\rm d}}}\theta
β[κ6|x|2+C1]01Uβ2(x+θbε(x))dθ.\displaystyle\leqslant\beta\Big{[}\kappa_{6}|x|^{2}+C_{1}\Big{]}\int^{1}_{0}U_{\beta-2}(x+\theta b_{\varepsilon}(x)){\mathord{{\rm d}}}\theta.

We have the following estimate: there is an ε0>0\varepsilon_{0}>0 such that for any θ(0,1)\theta\in(0,1) and ε(0,ε0)\varepsilon\in(0,\varepsilon_{0}),

(1+|x|2)/21+|x+θbε(x)|22(1+|x|2).\displaystyle(1+|x|^{2})/2\leqslant 1+|x+\theta b_{\varepsilon}(x)|^{2}\leqslant 2(1+|x|^{2}). (3.34)

In fact, noting that by (3.20) and (3.18),

|bε(x)|ε|b(x)|1mεκ2(1+|x|),\displaystyle|b_{\varepsilon}(x)|\leqslant\sqrt{\varepsilon}|b(x)|^{\frac{1}{m}}\leqslant\sqrt{\varepsilon}\kappa_{2}(1+|x|), (3.35)

for ε<ε0\varepsilon<\varepsilon_{0} with ε0\varepsilon_{0} small enough, we have

1+|x+θbε(x)|21+(|x|+|bε(x)|)21+(|x|+εκ2(1+|x|))22(1+|x|2),1+|x+\theta b_{\varepsilon}(x)|^{2}\leqslant 1+(|x|+|b_{\varepsilon}(x)|)^{2}\leqslant 1+(|x|+\sqrt{\varepsilon}\kappa_{2}(1+|x|))^{2}\leqslant 2(1+|x|^{2}),

and for |x|>1|x|>1,

1+|x+θbε(x)|21+(|x||bε(x)|)21+(|x|εκ2(1+|x|))2(1+|x|2)/2,1+|x+\theta b_{\varepsilon}(x)|^{2}\geqslant 1+(|x|-|b_{\varepsilon}(x)|)^{2}\geqslant 1+(|x|-\sqrt{\varepsilon}\kappa_{2}(1+|x|))^{2}\geqslant(1+|x|^{2})/2,

and for |x|1|x|\leqslant 1,

1+|x+θbε(x)|21(1+|x|2)/2.1+|x+\theta b_{\varepsilon}(x)|^{2}\geqslant 1\geqslant(1+|x|^{2})/2.

Hence, we have (3.34). Thus, for β(0,α)\beta\in(0,\alpha),

(ε)Uβ(x){βκ6|x|2(1+|x|22)β21+C1(1+|x|22)β21,κ6>0,β2,βκ6|x|2(1+|x|22)β21+C1(2(1+|x|2))β21,κ6<0,β>2,βκ6|x|2(2(1+|x|2))β21+C1(2(1+|x|2))β21,κ6>0,β>2,βκ6|x|2(2(1+|x|2))β21+C1(1+|x|22)β21,κ6<0,β2,\displaystyle{\mathcal{B}}^{(\varepsilon)}U_{\beta}(x)\leqslant\left\{\begin{aligned} &\beta\kappa_{6}|x|^{2}\big{(}\tfrac{1+|x|^{2}}{2}\big{)}^{\frac{\beta}{2}-1}+C_{1}\big{(}\tfrac{1+|x|^{2}}{2}\big{)}^{\frac{\beta}{2}-1},&\kappa_{6}>0,\beta\leqslant 2,\\ &\beta\kappa_{6}|x|^{2}\big{(}\tfrac{1+|x|^{2}}{2}\big{)}^{\frac{\beta}{2}-1}+C_{1}\big{(}2(1+|x|^{2})\big{)}^{\frac{\beta}{2}-1},&\kappa_{6}<0,\beta>2,\\ &\beta\kappa_{6}|x|^{2}\big{(}2(1+|x|^{2})\big{)}^{\frac{\beta}{2}-1}+C_{1}\big{(}2(1+|x|^{2})\big{)}^{\frac{\beta}{2}-1},&\kappa_{6}>0,\beta>2,\\ &\beta\kappa_{6}|x|^{2}\big{(}2(1+|x|^{2})\big{)}^{\frac{\beta}{2}-1}+C_{1}\big{(}\tfrac{1+|x|^{2}}{2}\big{)}^{\frac{\beta}{2}-1},&\kappa_{6}<0,\beta\leqslant 2,\end{aligned}\right.

which implies by Young’s inequality that for some C0=C0(β)>0C_{0}=C_{0}(\beta)>0,

(ε)Uβ(x)C0κ6Uβ(x)+C.\displaystyle{\mathcal{B}}^{(\varepsilon)}U_{\beta}(x)\leqslant C_{0}\kappa_{6}U_{\beta}(x)+C. (3.36)

(Step 2). In the remaining steps we treat 𝒜(ε)Uβ(x){\mathcal{A}}^{(\varepsilon)}U_{\beta}(x). First of all, we consider the case of α2\alpha\geqslant 2 and β[2,α]\beta\in[2,\alpha]. By (3.15), (3.25) and σε(x,z)=εσ(x,z)\sigma_{\varepsilon}(x,z)=\sqrt{\varepsilon}\sigma(x,z), we have

𝒜(ε)Uβ(x)=12d01θ11tr((σσ)(x,z)𝒟(ε)2Uβ(x+θθεσ(x,z)))dθdθν(dz).{\mathcal{A}}^{(\varepsilon)}U_{\beta}(x)=\frac{1}{2}\int_{{\mathbb{R}}^{d}}\int^{1}_{0}\theta\int^{1}_{-1}\mathrm{tr}((\sigma\otimes\sigma)(x,z)\cdot{\mathcal{D}}^{(\varepsilon)}\nabla^{2}U_{\beta}(x+\theta\theta^{\prime}\sqrt{\varepsilon}\sigma(x,z))){\mathord{{\rm d}}}\theta^{\prime}{\mathord{{\rm d}}}\theta\nu({\mathord{{\rm d}}}z).

Since β2\beta\geqslant 2, by (3.33) and (3.35), we have for ε1/κ22\varepsilon\leqslant 1/\kappa_{2}^{2},

|𝒜(ε)Uβ(x)|\displaystyle|{\mathcal{A}}^{(\varepsilon)}U_{\beta}(x)| d|σ(x,z)|201θ11Uβ2(x+θθεσ(x,z)+bε(x))dθdθν(dz)\displaystyle\lesssim\int_{{\mathbb{R}}^{d}}|\sigma(x,z)|^{2}\int^{1}_{0}\theta\int^{1}_{-1}U_{\beta-2}(x+\theta\theta^{\prime}\sqrt{\varepsilon}\sigma(x,z)+b_{\varepsilon}(x)){\mathord{{\rm d}}}\theta^{\prime}{\mathord{{\rm d}}}\theta\nu({\mathord{{\rm d}}}z)
d|σ(x,z)|2(1+|x|β2+|σ(x,z)|β2+|bε(x)|β2)ν(dz)\displaystyle\lesssim\int_{{\mathbb{R}}^{d}}|\sigma(x,z)|^{2}(1+|x|^{\beta-2}+|\sigma(x,z)|^{\beta-2}+|b_{\varepsilon}(x)|^{\beta-2})\nu({\mathord{{\rm d}}}z)
d(|σ(x,z)|2(1+|x|β2)+|σ(x,z)|β)ν(dz).\displaystyle\lesssim\int_{{\mathbb{R}}^{d}}{\big{(}}|\sigma(x,z)|^{2}(1+|x|^{\beta-2})+|\sigma(x,z)|^{\beta}{\big{)}}\nu({\mathord{{\rm d}}}z).

By (3.16) and (Hνα{}^{\alpha}_{\nu}), we further have

|𝒜(ε)Uβ(x)|\displaystyle|{\mathcal{A}}^{(\varepsilon)}U_{\beta}(x)| d((κ0+κ1|x|)2(1+|x|β2)|z|2+(κ0+κ1|x|)β|z|β)ν(dz)\displaystyle\lesssim\int_{{\mathbb{R}}^{d}}{\big{(}}(\kappa_{0}+\kappa_{1}|x|)^{2}(1+|x|^{\beta-2})|z|^{2}+(\kappa_{0}+\kappa_{1}|x|)^{\beta}|z|^{\beta}{\big{)}}\nu({\mathord{{\rm d}}}z)
(1+κ1β|x|β)d(|z|2+|z|β)ν(dz)κ1βUβ(x)+C.\displaystyle\lesssim(1+\kappa_{1}^{\beta}|x|^{\beta})\int_{{\mathbb{R}}^{d}}(|z|^{2}+|z|^{\beta})\nu({\mathord{{\rm d}}}z)\lesssim\kappa_{1}^{\beta}U_{\beta}(x)+C.

(Step 3). Next we consider the case of β(0,2)\beta\in(0,2). Let κ1\kappa_{1} be the same as in (3.16) and write γ:=(4κ1)1ε12α\gamma:=(4\kappa_{1})^{-1}\varepsilon^{-\frac{1}{2\wedge\alpha}}. By (3.15) we have

𝒜(ε)Uβ(x)=J1(x)+J2(x),{\mathcal{A}}^{(\varepsilon)}U_{\beta}(x)=J_{1}(x)+J_{2}(x),

where

J1(x):=|z|<γ𝒟(ε)Uβ(x+σε(x,z))+𝒟(ε)Uβ(xσε(x,z))2𝒟(ε)Uβ(x)2εν(dz)J_{1}(x):=\int_{|z|<\gamma}\frac{{\mathcal{D}}^{(\varepsilon)}U_{\beta}(x+\sigma_{\varepsilon}(x,z))+{\mathcal{D}}^{(\varepsilon)}U_{\beta}(x-\sigma_{\varepsilon}(x,z))-2{\mathcal{D}}^{(\varepsilon)}U_{\beta}(x)}{2\varepsilon}\nu({\mathord{{\rm d}}}z)

and

J2(x):=|z|γ𝒟(ε)Uβ(x+σε(x,z))+𝒟(ε)Uβ(xσε(x,z))2𝒟(ε)Uβ(x)2εν(dz).J_{2}(x):=\int_{|z|\geqslant\gamma}\frac{{\mathcal{D}}^{(\varepsilon)}U_{\beta}(x+\sigma_{\varepsilon}(x,z))+{\mathcal{D}}^{(\varepsilon)}U_{\beta}(x-\sigma_{\varepsilon}(x,z))-2{\mathcal{D}}^{(\varepsilon)}U_{\beta}(x)}{2\varepsilon}\nu({\mathord{{\rm d}}}z).

For J1(x)J_{1}(x), by (3.25) and (3.33), we have

J1(x)\displaystyle J_{1}(x) =12ε|z|<γ01θ11tr((σεσε)(x,z),𝒟(ε)2Uβ(x+θθσε(x,z)))dθdθν(dz)\displaystyle=\frac{1}{2\varepsilon}\int_{|z|<\gamma}\int^{1}_{0}\theta\int^{1}_{-1}\mathrm{tr}{\big{(}}(\sigma_{\varepsilon}\otimes\sigma_{\varepsilon})(x,z),{\mathcal{D}}^{(\varepsilon)}\nabla^{2}U_{\beta}(x+\theta\theta^{\prime}\sigma_{\varepsilon}(x,z)){\big{)}}{\mathord{{\rm d}}}\theta^{\prime}{\mathord{{\rm d}}}\theta\nu({\mathord{{\rm d}}}z)
12ε|z|<γ01θ11β|σε(x,z)|2Uβ2(x+θθσε(x,z)+bε(x))dθdθν(dz),\displaystyle\leqslant\frac{1}{2\varepsilon}\int_{|z|<\gamma}\int^{1}_{0}\theta\int^{1}_{-1}\beta|\sigma_{\varepsilon}(x,z)|^{2}U_{\beta-2}(x+\theta\theta^{\prime}\sigma_{\varepsilon}(x,z)+b_{\varepsilon}(x)){\mathord{{\rm d}}}\theta^{\prime}{\mathord{{\rm d}}}\theta\nu({\mathord{{\rm d}}}z),

where we have used that for β(0,2)\beta\in(0,2),

β(β2)|σ(x,z),y|2Uβ4(y)0.\beta(\beta-2)|\langle\sigma(x,z),y\rangle|^{2}U_{\beta-4}(y)\leqslant 0.

For ε0\varepsilon_{0} small enough, and for |z|<γ=(4κ1)1ε12α|z|<\gamma=(4\kappa_{1})^{-1}\varepsilon^{-\frac{1}{2\wedge\alpha}}, ε(0,ε0)\varepsilon\in(0,\varepsilon_{0}) and θ(0,1),θ(1,1)\theta\in(0,1),\theta^{\prime}\in(-1,1),

|x+θθσε(x,z)+bε(x)|\displaystyle|x+\theta\theta^{\prime}\sigma_{\varepsilon}(x,z)+b_{\varepsilon}(x)| |x||σε(x,z)||bε(x)|\displaystyle\geqslant|x|-|\sigma_{\varepsilon}(x,z)|-|b_{\varepsilon}(x)|
(3.16)|x|(κ0+κ1|x|)ε12α|z||bε(x)|\displaystyle\!\!\!\!\stackrel{{\scriptstyle\eqref{CB00}}}{{\geqslant}}|x|-(\kappa_{0}+\kappa_{1}|x|)\varepsilon^{\frac{1}{2\wedge\alpha}}|z|-|b_{\varepsilon}(x)|
(3.35)|x|(κ0+κ1|x|)(4κ1)1εκ2(1+|x|)\displaystyle\!\!\!\!\stackrel{{\scriptstyle\eqref{BB-1}}}{{\geqslant}}|x|-(\kappa_{0}+\kappa_{1}|x|)(4\kappa_{1})^{-1}-\sqrt{\varepsilon}\kappa_{2}(1+|x|)
|x|/2C3.\displaystyle\geqslant|x|/2-C_{3}.

Thus for |x|>4C3|x|>4C_{3}, by (Hνα{}^{\alpha}_{\nu}),

J1(x)\displaystyle J_{1}(x) |z|<γβ|σε(x,z)|22ε(1+||x|2C3|2)β22ν(dz)\displaystyle\leqslant\int_{|z|<\gamma}\frac{\beta|\sigma_{\varepsilon}(x,z)|^{2}}{2\varepsilon}\Big{(}1+\big{|}\tfrac{|x|}{2}-C_{3}\big{|}^{2}\Big{)}^{\frac{\beta-2}{2}}\nu({\mathord{{\rm d}}}z)
|z|<γβ(κ0+κ1|x|)2ε22α|z|22ε(|x|216)β22ν(dz)\displaystyle\leqslant\int_{|z|<\gamma}\frac{\beta(\kappa_{0}+\kappa_{1}|x|)^{2}\varepsilon^{\frac{2}{2\wedge\alpha}}|z|^{2}}{2\varepsilon}\Big{(}\tfrac{|x|^{2}}{16}\Big{)}^{\frac{\beta-2}{2}}\nu({\mathord{{\rm d}}}z)
β(κ0+κ1|x|)2ε22α2ε(|x|216)β22γ2(2α)C1κ12α|x|β+C2.\displaystyle\lesssim\frac{\beta(\kappa_{0}+\kappa_{1}|x|)^{2}\varepsilon^{\frac{2}{2\wedge\alpha}}}{2\varepsilon}\Big{(}\tfrac{|x|^{2}}{16}\Big{)}^{\frac{\beta-2}{2}}\gamma^{2-(2\wedge\alpha)}\leqslant C_{1}\kappa_{1}^{2\wedge\alpha}|x|^{\beta}+C_{2}.

For J2(x)J_{2}(x), since β(0,2)\beta\in(0,2), by Lemma 3.10, (Hbσ{}^{\sigma}_{b}) and Lemma 3.3, we directly have

\begin{align*}
J_{2}(x)&\lesssim\int_{|z|\geqslant\gamma}|\sigma_{\varepsilon}(x,z)|^{\beta}\varepsilon^{-1}\nu({\mathord{{\rm d}}}z)\\
&\leqslant(\kappa_{0}+\kappa_{1}|x|)^{\beta}\varepsilon^{\frac{\beta}{2\wedge\alpha}-1}\int_{|z|\geqslant\gamma}|z|^{\beta}\nu({\mathord{{\rm d}}}z)\\
&\lesssim(\kappa_{0}+\kappa_{1}|x|)^{\beta}\varepsilon^{\frac{\beta}{2\wedge\alpha}-1}\gamma^{\beta-2\wedge\alpha}\leqslant C_{1}\kappa_{1}^{2\wedge\alpha}|x|^{\beta}+C_{2}.
\end{align*}

Hence, for |x|4C3|x|\geqslant 4C_{3},

𝒜(ε)Uβ(x)C1κ12αUβ(x)+C2.\displaystyle{\mathcal{A}}^{(\varepsilon)}U_{\beta}(x)\leqslant C_{1}\kappa_{1}^{2\wedge\alpha}U_{\beta}(x)+C_{2}. (3.37)

Combining (3.36) with the estimates of Steps 2 and 3 (in particular (3.37)) gives (3.32). The proof is complete. ∎

Remark 3.13.

From the above proofs, one sees that if |σ(t,x,z)|κ0|z||\sigma(t,x,z)|\leqslant\kappa_{0}|z|, then for any β(0,α)\beta\in(0,\alpha),

s(ε)Uβ(x)C0κ6Uβ(x)+C2,{\mathscr{L}}^{(\varepsilon)}_{s}U_{\beta}(x)\leqslant C_{0}\kappa_{6}U_{\beta}(x)+C_{2},

where κ6\kappa_{6} is given in (3.30).

As an easy corollary, we have

Corollary 3.14.

Under (Hνα{}^{\alpha}_{\nu}) and (Hbσ{}^{\sigma}_{b}), for any β(0,α)\beta\in(0,\alpha) and T>0T>0, it holds that for some C1>0C_{1}>0 depending on TT,

supε(0,1)𝔼(supt[0,T]|Xtε|β)C1(1+𝔼|X0|β),\displaystyle\sup_{\varepsilon\in(0,1)}{\mathbb{E}}\left(\sup_{t\in[0,T]}|X^{\varepsilon}_{t}|^{\beta}\right)\leqslant C_{1}(1+{\mathbb{E}}|X_{0}|^{\beta}), (3.38)

and for some C2>0C_{2}>0 independent of ε(0,1)\varepsilon\in(0,1) and t>0t>0,

𝔼Uβ(Xtε)eκ7t𝔼Uβ(X0)+C2(eκ7t1)/κ7,\displaystyle{\mathbb{E}}U_{\beta}(X^{\varepsilon}_{t})\leqslant\mathrm{e}^{\kappa_{7}t}{\mathbb{E}}U_{\beta}(X_{0})+C_{2}(\mathrm{e}^{\kappa_{7}t}-1)/\kappa_{7}, (3.39)

where κ7:=C0κ6+C1(κ12α𝟙β(0,2)+κ1β𝟙β2)\kappa_{7}:=C_{0}\kappa_{6}+C_{1}(\kappa^{2\wedge\alpha}_{1}{\mathbbm{1}}_{\beta\in(0,2)}+\kappa_{1}^{\beta}{\mathbbm{1}}_{\beta\geqslant 2})\in{\mathbb{R}} (see Lemma 3.12).

Proof.

By Itô’s formula and Lemma 3.12, we have

eκ7tUβ(Xtε)\displaystyle\mathrm{e}^{-\kappa_{7}t}U_{\beta}(X^{\varepsilon}_{t}) =Uβ(X0)+0teκ7s(s(ε)Uβκ7Uβ)(Xsε)ds+Mtε\displaystyle=U_{\beta}(X_{0})+\int^{t}_{0}\mathrm{e}^{-\kappa_{7}s}({\mathscr{L}}^{(\varepsilon)}_{s}U_{\beta}-\kappa_{7}U_{\beta})(X^{\varepsilon}_{s}){\mathord{{\rm d}}}s+M^{\varepsilon}_{t}
Uβ(X0)+C20teκ7sds+Mtε,\displaystyle\leqslant U_{\beta}(X_{0})+C_{2}\int^{t}_{0}\mathrm{e}^{-\kappa_{7}s}{\mathord{{\rm d}}}s+M^{\varepsilon}_{t}, (3.40)

where MtεM^{\varepsilon}_{t} is a local martingale given by

Mtε=0tdeκ7s(Uβ(Xsε+σε(s,Xsε,z)+bε(s,Xsε))Uβ(Xsε))~ε(ds,dz).M^{\varepsilon}_{t}=\int^{t}_{0}\int_{{\mathbb{R}}^{d}}\mathrm{e}^{-\kappa_{7}s}\left(U_{\beta}{\big{(}}X^{\varepsilon}_{s-}+\sigma_{\varepsilon}(s,X^{\varepsilon}_{s-},z)+b_{\varepsilon}(s,X^{\varepsilon}_{s-}){\big{)}}-U_{\beta}(X^{\varepsilon}_{s-})\right)\widetilde{\mathcal{H}}^{\varepsilon}({\mathord{{\rm d}}}s,{\mathord{{\rm d}}}z).

By applying the stochastic Gronwall lemma (see [41, Lemma 3.7]) and using the fact that $\beta$ can be chosen arbitrarily in the interval $(0,\alpha)$, we obtain (3.38). Moreover, for $R>0$, define

τRε:=inf{t>0:|Xtε|R}.\tau^{\varepsilon}_{R}:=\inf\big{\{}t>0:|X^{\varepsilon}_{t}|\geqslant R\big{\}}.

By the optional stopping theorem and taking expectations in (3.40), we also have

𝔼(eκ7tτRεUβ(XtτRεε))𝔼Uβ(X0)+C2(1𝔼eκ7tτRε)/κ7.{\mathbb{E}}\Big{(}\mathrm{e}^{-\kappa_{7}t\wedge\tau^{\varepsilon}_{R}}U_{\beta}(X^{\varepsilon}_{t\wedge\tau^{\varepsilon}_{R}})\Big{)}\leqslant{\mathbb{E}}U_{\beta}(X_{0})+C_{2}\big{(}1-{\mathbb{E}}\mathrm{e}^{-\kappa_{7}t\wedge\tau^{\varepsilon}_{R}}\big{)}/\kappa_{7}.

Letting RR\to\infty and by Fatou’s lemma, we obtain (3.39). ∎

For given T>0T>0, let 𝒯T{\mathscr{T}}_{T} be the set of all stopping times bounded by TT.

Lemma 3.15.

For any T,γ>0T,\gamma>0, it holds that

limδ0supε(0,1)supτ,η𝒯T,τητ+δ(|XηεXτε|>γ)=0.\displaystyle\lim_{\delta\to 0}\sup_{\varepsilon\in(0,1)}\sup_{\tau,\eta\in{\mathscr{T}}_{T},\tau\leqslant\eta\leqslant\tau+\delta}{\mathbb{P}}\left(|X^{\varepsilon}_{\eta}-X^{\varepsilon}_{\tau}|>\gamma\right)=0.
Proof.

Let τ,η𝒯T\tau,\eta\in{\mathscr{T}}_{T} with τητ+δ\tau\leqslant\eta\leqslant\tau+\delta. For any R>0R>0, define

ζR:=inf{t>0:|Xtε|>R},τR:=ζRτ,ηR:=ζRη.\zeta_{R}:=\inf\big{\{}t>0:|X^{\varepsilon}_{t}|>R\big{\}},\ \tau_{R}:=\zeta_{R}\wedge\tau,\ \eta_{R}:=\zeta_{R}\wedge\eta.

We prove the limit for $\alpha\in(0,2)$; the case $\alpha=2$ is easier. By (3.9), we can write

XηRεXτRε\displaystyle X^{\varepsilon}_{\eta_{R}}-X^{\varepsilon}_{\tau_{R}} =τRηRbε(s,Xsε)d𝒩sε+τRηR|z|<ε1ασε(s,Xsε,z)ε(ds,dz)\displaystyle=\int^{\eta_{R}}_{\tau_{R}}b_{\varepsilon}(s,X^{\varepsilon}_{s-}){\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s}+\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|<\varepsilon^{-\frac{1}{\alpha}}}\sigma_{\varepsilon}(s,X^{\varepsilon}_{s-},z){\mathcal{H}}^{\varepsilon}({\mathord{{\rm d}}}s,{\mathord{{\rm d}}}z)
+τRηR|z|>ε1ασε(s,Xsε,z)ε(ds,dz)=:I1+I2+I3.\displaystyle\quad+\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|>\varepsilon^{-\frac{1}{\alpha}}}\sigma_{\varepsilon}(s,X^{\varepsilon}_{s-},z){\mathcal{H}}^{\varepsilon}({\mathord{{\rm d}}}s,{\mathord{{\rm d}}}z)=:I_{1}+I_{2}+I_{3}.

For I1I_{1}, by (3.20) and (3.18), we have

𝔼|I1|ε𝔼(τRηR|b(s,Xsε)|d𝒩sε)=𝔼(τRηR|b(s,Xsε)|ds)CRδ.\displaystyle{\mathbb{E}}|I_{1}|\leqslant\varepsilon{\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}|b(s,X^{\varepsilon}_{s-})|{\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s}\right)={\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}|b(s,X^{\varepsilon}_{s})|{\mathord{{\rm d}}}s\right)\leqslant C_{R}\delta.

For I2I_{2}, by (3.10) and the isometry of stochastic integrals, we have

𝔼|I2|2\displaystyle{\mathbb{E}}|I_{2}|^{2} =𝔼|τRηR|z|<ε1ασε(s,Xsε,z)~ε(ds,dz)|2\displaystyle={\mathbb{E}}\left|\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|<\varepsilon^{-\frac{1}{\alpha}}}\sigma_{\varepsilon}(s,X^{\varepsilon}_{s-},z)\widetilde{\mathcal{H}}^{\varepsilon}({\mathord{{\rm d}}}s,{\mathord{{\rm d}}}z)\right|^{2}
=𝔼(τRηR|z|<ε1α|σε(s,Xsε,z)|2ν(dz)d(sε))\displaystyle={\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|<\varepsilon^{-\frac{1}{\alpha}}}|\sigma_{\varepsilon}(s,X^{\varepsilon}_{s},z)|^{2}\nu({\mathord{{\rm d}}}z){\mathord{{\rm d}}}{\big{(}}\tfrac{s}{\varepsilon}{\big{)}}\right)
=𝔼(τRηR|z|<1|σ(s,Xsε,z)|2νε(dz)ds)\displaystyle={\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|<1}|\sigma(s,X^{\varepsilon}_{s},z)|^{2}\nu_{\varepsilon}({\mathord{{\rm d}}}z){\mathord{{\rm d}}}s\right)
(κ0+κ1R)2(|z|<1|z|2νε(dz))δ(3.8)CRδ.\displaystyle\leqslant(\kappa_{0}+\kappa_{1}R)^{2}\left(\int_{|z|<1}|z|^{2}\nu_{\varepsilon}({\mathord{{\rm d}}}z)\right)\delta\stackrel{{\scriptstyle\eqref{VV9}}}{{\leqslant}}C_{R}\delta.

Fix β(0,α1)\beta\in(0,\alpha\wedge 1). For I3I_{3}, by |iai|βiaiβ|\sum_{i}a_{i}|^{\beta}\leqslant\sum_{i}a_{i}^{\beta} we have

𝔼|I3|β\displaystyle{\mathbb{E}}|I_{3}|^{\beta} 𝔼(τRηR|z|ε1α|σε(s,Xsε,z)|βε(ds,dz))\displaystyle\leqslant{\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|\geqslant\varepsilon^{-\frac{1}{\alpha}}}|\sigma_{\varepsilon}(s,X^{\varepsilon}_{s},z)|^{\beta}{\mathcal{H}}^{\varepsilon}({\mathord{{\rm d}}}s,{\mathord{{\rm d}}}z)\right)
=𝔼(τRηR|z|ε1α|σε(s,Xsε,z)|βν(dz)d(sε))\displaystyle={\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|\geqslant\varepsilon^{-\frac{1}{\alpha}}}|\sigma_{\varepsilon}(s,X^{\varepsilon}_{s},z)|^{\beta}\nu({\mathord{{\rm d}}}z){\mathord{{\rm d}}}{\big{(}}\tfrac{s}{\varepsilon}{\big{)}}\right)
=𝔼(τRηR|z|1|σ(s,Xsε,z)|βνε(dz)ds)\displaystyle={\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|\geqslant 1}|\sigma(s,X^{\varepsilon}_{s},z)|^{\beta}\nu_{\varepsilon}({\mathord{{\rm d}}}z){\mathord{{\rm d}}}s\right)
(κ0+κ1R)β(|z|1|z|βνε(dz))δ(3.8)CRδ.\displaystyle\leqslant(\kappa_{0}+\kappa_{1}R)^{\beta}\left(\int_{|z|\geqslant 1}|z|^{\beta}\nu_{\varepsilon}({\mathord{{\rm d}}}z)\right)\delta\stackrel{{\scriptstyle\eqref{VV9}}}{{\leqslant}}C_{R}\delta.

Hence, by Chebyshev’s inequality and (3.38),

(|XηεXτε|γ)\displaystyle{\mathbb{P}}(|X^{\varepsilon}_{\eta}-X^{\varepsilon}_{\tau}|\geqslant\gamma) (|XηRεXτRε|γ;ζR>T)+(ζRT)\displaystyle\leqslant{\mathbb{P}}(|X^{\varepsilon}_{\eta_{R}}-X^{\varepsilon}_{\tau_{R}}|\geqslant\gamma;\zeta_{R}>T)+{\mathbb{P}}(\zeta_{R}\leqslant T)
i=13(|Ii|γ3)+(supt[0,T]|Xtε|R)\displaystyle\leqslant\sum_{i=1}^{3}{\mathbb{P}}(|I_{i}|\geqslant\tfrac{\gamma}{3})+{\mathbb{P}}\left(\sup_{t\in[0,T]}|X^{\varepsilon}_{t}|\geqslant R\right)
3γ𝔼|I1|+(3γ)2𝔼|I2|2+(3γ)β𝔼|I3|β+CRβ\displaystyle\leqslant\tfrac{3}{\gamma}{\mathbb{E}}|I_{1}|+(\tfrac{3}{\gamma})^{2}{\mathbb{E}}|I_{2}|^{2}+(\tfrac{3}{\gamma})^{\beta}{\mathbb{E}}|I_{3}|^{\beta}+\tfrac{C}{R^{\beta}}
CR,γδ+C/Rβ,\displaystyle\leqslant C_{R,\gamma}\delta+C/R^{\beta},

which converges to zero by firstly letting δ0\delta\to 0 and then RR\to\infty. ∎

Let ε{\mathbb{Q}}_{\varepsilon} be the law of (Xtε)t0(X^{\varepsilon}_{t})_{t\geqslant 0} in 𝔻{\mathbb{D}}. Now we can show the following main result of this section.

Theorem 3.16.

Let $\mu_{\varepsilon}\in{\mathcal{P}}({\mathbb{R}}^{d})$ be the law of $X^{\varepsilon}_{0}$. Suppose that (H$^{\alpha}_{\nu}$) and (H$^{\sigma}_{b}$) hold, that $\mu_{\varepsilon}$ weakly converges to $\mu_{0}$ as $\varepsilon\downarrow 0$, and that there is a unique martingale solution ${\mathbb{Q}}$ associated with ${\mathscr{L}}^{(0)}$ starting from $\mu_{0}$ at time $0$. Then ${\mathbb{Q}}_{\varepsilon}$ weakly converges to ${\mathbb{Q}}$ as $\varepsilon\downarrow 0$. Moreover, if $\alpha\geqslant 2$, then ${\mathbb{Q}}$ concentrates on the space of all continuous functions.

Proof.

By Lemma 3.15 and Aldous’ criterion (see [23, p356, Theorem 4.5]), (ε)ε(0,1)({\mathbb{Q}}_{\varepsilon})_{\varepsilon\in(0,1)} is tight in 𝒫(𝔻){\mathcal{P}}({\mathbb{D}}). Let 0{\mathbb{Q}}_{0} be any accumulation point. By Lemma 3.9 and Theorem 6.4 in the appendix, one has 00μ0((0)){\mathbb{Q}}_{0}\in{\mathcal{M}}^{\mu_{0}}_{0}({\mathscr{L}}^{(0)}). By the uniqueness, we have 0={\mathbb{Q}}_{0}={\mathbb{Q}} and ε{\mathbb{Q}}_{\varepsilon} weakly converges to {\mathbb{Q}} as ε0\varepsilon\to 0. If α2\alpha\geqslant 2, then by Proposition 6.3, {\mathbb{Q}} concentrates on the space of all continuous functions. ∎

3.2. Convergence of invariant measures

In this section we show the following convergence of invariant measures under dissipativity assumptions.

Theorem 3.17.

Suppose that bb and σ\sigma do not depend on the time variable. Under (Hνα{}^{\alpha}_{\nu}) and (Hbσ{}^{\sigma}_{b}), if for some β(0,α)\beta\in(0,\alpha),

κ7(β):=C0κ6+C1(κ12α𝟙β(0,2)+κ1β𝟙β2)<0,\kappa_{7}(\beta):=C_{0}\kappa_{6}+C_{1}(\kappa^{2\wedge\alpha}_{1}{\mathbbm{1}}_{\beta\in(0,2)}+\kappa_{1}^{\beta}{\mathbbm{1}}_{\beta\geqslant 2})<0,

where the above constants appear in Lemma 3.12, then for each ε(0,1)\varepsilon\in(0,1), there is an invariant probability measure με\mu_{\varepsilon} associated with the semigroup Ptεf(x):=𝔼f(Xtε(x))P^{\varepsilon}_{t}f(x):={\mathbb{E}}f(X^{\varepsilon}_{t}(x)), where Xtε(x)X^{\varepsilon}_{t}(x) is the unique solution of SDE (3.9) starting from X0ε=xX^{\varepsilon}_{0}=x. Moreover, (με)ε(0,1)𝒫(d)(\mu_{\varepsilon})_{\varepsilon\in(0,1)}\subset{\mathcal{P}}({\mathbb{R}}^{d}) is tight and any accumulation point μ0\mu_{0} is a stationary distribution of SDE (3.22).

Proof.

Let β(0,α)\beta\in(0,\alpha). If κ7(β)<0\kappa_{7}(\beta)<0, then by (3.39), it is easy to see that

supεsupT11T0T𝔼|Xtε|βdt<.\displaystyle\sup_{\varepsilon}\sup_{T\geqslant 1}\frac{1}{T}\int^{T}_{0}{\mathbb{E}}|X^{\varepsilon}_{t}|^{\beta}{\mathord{{\rm d}}}t<\infty. (3.41)

For ε(0,1)\varepsilon\in(0,1) and T1T\geqslant 1, we define a probability measure over d{\mathbb{R}}^{d} by

με,T(A):=1T0T{XtεA}dt,A(d).\mu_{\varepsilon,T}(A):=\frac{1}{T}\int^{T}_{0}{\mathbb{P}}\{X^{\varepsilon}_{t}\in A\}{\mathord{{\rm d}}}t,\ \ A\in{\mathscr{B}}({\mathbb{R}}^{d}).

By (3.41), one sees that (με,T)T1(\mu_{\varepsilon,T})_{T\geqslant 1} is tight. Let με\mu_{\varepsilon} be any accumulation point of (με,T)T1(\mu_{\varepsilon,T})_{T\geqslant 1}. By the classical Krylov-Bogoliubov argument (cf. [10, Section 3.1]), one can verify that με\mu_{\varepsilon} is an invariant probability measure associated with the semigroup (Ptε)t0(P^{\varepsilon}_{t})_{t\geqslant 0}, and by (3.41),

supε(0,1)d|x|βμε(dx)<.\sup_{\varepsilon\in(0,1)}\int_{{\mathbb{R}}^{d}}|x|^{\beta}\mu_{\varepsilon}({\mathord{{\rm d}}}x)<\infty.

From this, by Prohorov’s theorem we derive that (με)ε(0,1)(\mu_{\varepsilon})_{\varepsilon\in(0,1)} is tight. Let μ0\mu_{0} be any accumulation point of (με)ε(0,1)(\mu_{\varepsilon})_{\varepsilon\in(0,1)}, and let εk0\varepsilon_{k}\downarrow 0 be a subsequence along which μεk\mu_{\varepsilon_{k}} weakly converges to μ0\mu_{0} as kk\to\infty. Let X0εkX^{\varepsilon_{k}}_{0} have the distribution μεk\mu_{\varepsilon_{k}} and XtεkX^{\varepsilon_{k}}_{t} be the unique solution of SDE (3.9). Since μεk𝒫(d)\mu_{\varepsilon_{k}}\in{\mathcal{P}}({\mathbb{R}}^{d}) is an invariant probability measure of SDE (3.9), we have for each t>0t>0 and fCb(d)f\in C_{b}({\mathbb{R}}^{d}),

μεk(f)=𝔼f(Xtεk).\mu_{\varepsilon_{k}}(f)={\mathbb{E}}f(X^{\varepsilon_{k}}_{t}).

By Theorem 3.16 and taking weak limits, we obtain

μ0(f)=𝔼f(wt),t>0,\mu_{0}(f)={\mathbb{E}}^{{\mathbb{Q}}}f(w_{t}),\ \ t>0,

where {\mathbb{Q}} is a martingale solution of SDE (3.22) with initial distribution μ0\mu_{0}. In other words, μ0\mu_{0} is a stationary distribution of {\mathbb{Q}}. ∎

Remark 3.18.

If SDE (3.22) has a unique stationary distribution μ\mu (or invariant probability measure), then μεμ\mu_{\varepsilon}\Rightarrow\mu as ε0\varepsilon\downarrow 0.

Example. Let α(0,2]\alpha\in(0,2] and consider the following SDE

dXt=σ(Xt)dLt(α)+b(Xt)dt,X0=x,\displaystyle{\mathord{{\rm d}}}X_{t}=\sigma(X_{t}){\mathord{{\rm d}}}L^{(\alpha)}_{t}+b(X_{t}){\mathord{{\rm d}}}t,\ X_{0}=x, (3.42)

where for α(0,2)\alpha\in(0,2), Lt(α)L^{(\alpha)}_{t} is a standard rotationally invariant and symmetric α\alpha-stable process, and for α=2\alpha=2, Lt(2)L^{(2)}_{t} is a dd-dimensional standard Brownian motion, σ:ddd\sigma:{\mathbb{R}}^{d}\to{\mathbb{R}}^{d}\otimes{\mathbb{R}}^{d} and b:ddb:{\mathbb{R}}^{d}\to{\mathbb{R}}^{d} are two locally Lipschitz continuous functions. Suppose that for some κ1κ0>0\kappa_{1}\geqslant\kappa_{0}>0,

κ0|ξ|2|σ(x)ξ|2κ1|ξ|2,\kappa_{0}|\xi|^{2}\leqslant|\sigma(x)\xi|^{2}\leqslant\kappa_{1}|\xi|^{2},

and for some m1m\geqslant 1 and κ2,κ3,κ4>0\kappa_{2},\kappa_{3},\kappa_{4}>0 and κ5<0\kappa_{5}<0 (with κ4+κ5<0\kappa_{4}+\kappa_{5}<0 in the case of m=1m=1),

|b(x)|(κ2(1+|x|))m,x,b(x)κ3+κ4|x|2+κ5|x|m+1.|b(x)|\leqslant(\kappa_{2}(1+|x|))^{m},\ \ \langle x,b(x)\rangle\leqslant\kappa_{3}+\kappa_{4}|x|^{2}+\kappa_{5}|x|^{m+1}.

It is well-known that SDE (3.42) has a unique invariant probability measure μ\mu (see [42]). If we consider the approximating SDE (3.9) with σε\sigma_{\varepsilon} and bεb_{\varepsilon} being defined by (3.19), then SDE (3.9) admits an invariant probability measure με\mu_{\varepsilon}, and by Theorem 3.17,

μεμ,ε0.\mu_{\varepsilon}\Rightarrow\mu,\ \ \varepsilon\downarrow 0.
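To illustrate Theorem 3.17 and this example numerically, the following is a minimal one-dimensional sketch. It assumes, purely for illustration, that the jump law of the compound Poisson process is standard Gaussian, that the drift truncation is dropped (so bε=b with b(x)=−x³, i.e. m=3 and κ5<0), and it uses the Markov-chain skeleton of the approximating SDE: at each Poisson jump the state moves by εb(X)+√εZ. The time average below mimics the occupation measures με,T used in the proof.

```python
import numpy as np

rng = np.random.default_rng(0)

eps = 1e-2            # discretization parameter
n_steps = 200_000     # jump-chain length, covering a time horizon eps * n_steps

def b(x):
    return -x**3      # dissipative drift: <x, b(x)> = -x^4, so kappa_5 < 0

# Markov-chain skeleton of the compound Poisson approximation:
# at each Poisson jump the state moves by eps*b(X) + sqrt(eps)*Z.
x = 0.0
samples = np.empty(n_steps)
for n in range(n_steps):
    x += eps * b(x) + np.sqrt(eps) * rng.standard_normal()
    samples[n] = x

# second moment of the time-averaged occupation measure (burn-in discarded);
# the stationary density of dX = -X^3 dt + dW is proportional to exp(-x^4/2),
# whose second moment is sqrt(2)*Gamma(3/4)/Gamma(1/4), roughly 0.48
m2 = np.mean(samples[n_steps // 10:] ** 2)
print(f"time-averaged second moment: {m2:.3f}")
```

For small ε this time average is close to the second moment of the unique invariant measure μ, in line with the convergence με⇒μ stated above.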

3.3. Rate of weak convergence

Now we aim to show the rate of weak convergence, as was done for ODEs (see Theorem 2.1). In this case, however, we will utilize the regularity estimate for the associated parabolic equation. To achieve this, we require the following stronger assumption:

  (H)

    Suppose that for some κ1>0\kappa_{1}>0 and all 0zd0\not=z\in{\mathbb{R}}^{d},

    b+b+σ(,z)/|z|+zσ+xσ(,z)/|z|κ1,\|b\|_{\infty}+\|\nabla b\|_{\infty}+\|\sigma(\cdot,z)/|z|\|_{\infty}+\|\nabla_{z}\sigma\|_{\infty}+\|\nabla_{x}\sigma(\cdot,z)/|z|\|_{\infty}\leqslant\kappa_{1},

    and for any φCb1\varphi\in C^{1}_{b} and t>0t>0, the following parabolic equation admits a solution uu,

    su+s(0)u=0,s[0,t),u(t,x)=φ(x),\partial_{s}u+{\mathscr{L}}^{(0)}_{s}u=0,\ s\in[0,t),\ u(t,x)=\varphi(x),

    with regularity estimate that for some γ>2\gamma>2 and β<1\beta<1,

    u(s,)φ,u(s,)CbγC(ts)βφCb1,s[0,t).\displaystyle\|u(s,\cdot)\|_{\infty}\leqslant\|\varphi\|_{\infty},\ \ \|u(s,\cdot)\|_{C^{\gamma}_{b}}\leqslant C(t-s)^{-\beta}\|\varphi\|_{C^{1}_{b}},\ \ s\in[0,t). (3.43)

We can show

Theorem 3.19.

Under (Hνα{}^{\alpha}_{\nu}) and (H), for any φCb1(d)\varphi\in C^{1}_{b}({\mathbb{R}}^{d}) and T>0T>0, there is a constant C>0C>0 such that for all t[0,T]t\in[0,T] and ε(0,1)\varepsilon\in(0,1),

|𝔼φ(Xtε)𝔼φ(Xt)|C(ε(α2)12𝟙α(2,γ]+ε2α2β1𝟙α<2)φCb1,\displaystyle|{\mathbb{E}}\varphi(X^{\varepsilon}_{t})-{\mathbb{E}}\varphi(X_{t})|\leqslant C\Big{(}\varepsilon^{\frac{(\alpha-2)\wedge 1}{2}}{\mathbbm{1}}_{\alpha\in(2,\gamma]}+\varepsilon^{\frac{2-\alpha}{2}\wedge\beta_{1}}{\mathbbm{1}}_{\alpha<2}\Big{)}\|\varphi\|_{C^{1}_{b}}, (3.44)

where β1\beta_{1} is from (Hνα{}^{\alpha}_{\nu}) and γ\gamma is from (H).

Proof.

Fix t>0t>0. Under (H), by Itô’s formula, we have

𝔼φ(Xtε)\displaystyle{\mathbb{E}}\varphi(X^{\varepsilon}_{t}) =𝔼u(t,Xtε)=𝔼u(0,X0)+𝔼0t(su+s(ε)u)(s,Xsε)ds\displaystyle={\mathbb{E}}u(t,X^{\varepsilon}_{t})={\mathbb{E}}u(0,X_{0})+{\mathbb{E}}\int^{t}_{0}(\partial_{s}u+{\mathscr{L}}^{(\varepsilon)}_{s}u)(s,X^{\varepsilon}_{s}){\mathord{{\rm d}}}s

and

𝔼φ(Xt)=𝔼u(t,Xt)=𝔼u(0,X0).{\mathbb{E}}\varphi(X_{t})={\mathbb{E}}u(t,X_{t})={\mathbb{E}}u(0,X_{0}).

Hence, by Lemma 3.15,

|𝔼φ(Xtε)𝔼φ(Xt)|\displaystyle|{\mathbb{E}}\varphi(X^{\varepsilon}_{t})-{\mathbb{E}}\varphi(X_{t})| =|𝔼0t(s(ε)us(0)u)(s,Xsε)ds|0ts(ε)u(s)s(0)u(s)ds\displaystyle=\left|{\mathbb{E}}\int^{t}_{0}({\mathscr{L}}^{(\varepsilon)}_{s}u-{\mathscr{L}}^{(0)}_{s}u)(s,X^{\varepsilon}_{s}){\mathord{{\rm d}}}s\right|\leqslant\int^{t}_{0}\|{\mathscr{L}}^{(\varepsilon)}_{s}u(s)-{\mathscr{L}}^{(0)}_{s}u(s)\|_{\infty}{\mathord{{\rm d}}}s
0t(ε(α2)12u(s)Cbα𝟙α(2,γ]+ε2α2β1u(s)Cb2𝟙α<2)ds\displaystyle\lesssim\int^{t}_{0}\Big{(}\varepsilon^{\frac{(\alpha-2)\wedge 1}{2}}\|u(s)\|_{C^{\alpha}_{b}}{\mathbbm{1}}_{\alpha\in(2,\gamma]}+\varepsilon^{\frac{2-\alpha}{2}\wedge\beta_{1}}\|u(s)\|_{C^{2}_{b}}{\mathbbm{1}}_{\alpha<2}\Big{)}{\mathord{{\rm d}}}s
(ε(α2)12𝟙α(2,γ]+ε2α2β1𝟙α<2)0t(ts)βds,\displaystyle\lesssim\Big{(}\varepsilon^{\frac{(\alpha-2)\wedge 1}{2}}{\mathbbm{1}}_{\alpha\in(2,\gamma]}+\varepsilon^{\frac{2-\alpha}{2}\wedge\beta_{1}}{\mathbbm{1}}_{\alpha<2}\Big{)}\int^{t}_{0}(t-s)^{-\beta}{\mathord{{\rm d}}}s,

which yields the desired estimate by β<1\beta<1. ∎

Remark 3.20.

Estimate (3.43) is the classical Schauder estimate, which is well studied in the partial differential equations (PDEs) literature, particularly for the case of continuous diffusion with α=2\alpha=2. In the case of α(1,2)\alpha\in(1,2), the estimate can be found in [17]. Here, we provide a brief proof for the additive noise case. We consider the following forward PDE:

tu=Δα/2u+bu,u(0)=φ,α(1,2].\partial_{t}u=\Delta^{\alpha/2}u+b\cdot\nabla u,\ \ u(0)=\varphi,\ \ \alpha\in(1,2].

Let PtP_{t} be the semigroup associated with Δα/2\Delta^{\alpha/2}, that is,

Ptφ(x)=𝔼φ(x+Lt(α)).P_{t}\varphi(x)={\mathbb{E}}\varphi(x+L^{(\alpha)}_{t}).

By Duhamel’s formula, we have

u(t,x)=Ptφ(x)+0tPts(bu)(s,x)ds.u(t,x)=P_{t}\varphi(x)+\int^{t}_{0}P_{t-s}(b\cdot\nabla u)(s,x){\mathord{{\rm d}}}s.

It is well-known that by the gradient estimate of heat kernels, for β,γ0\beta,\gamma\geqslant 0 (see [7] [17]),

PtφCbβ+γCtβαφCbγ,t>0.\|P_{t}\varphi\|_{C^{\beta+\gamma}_{b}}\leqslant Ct^{-\frac{\beta}{\alpha}}\|\varphi\|_{C^{\gamma}_{b}},\ \ t>0.

Hence, for β(2α,1]\beta\in(2-\alpha,1] and γ(2,α+β)\gamma\in(2,\alpha+\beta),

u(t)Cbγ\displaystyle\|u(t)\|_{C^{\gamma}_{b}} tγ1αφCb1+0t(ts)βγαb(s)u(s)Cbβds\displaystyle\lesssim t^{-\frac{\gamma-1}{\alpha}}\|\varphi\|_{C^{1}_{b}}+\int^{t}_{0}(t-s)^{\frac{\beta-\gamma}{\alpha}}\|b(s)\cdot\nabla u(s)\|_{C^{\beta}_{b}}{\mathord{{\rm d}}}s
tγ1αφCb1+0t(ts)βγαb(s)Cbβu(s)Cbβ+1ds.\displaystyle\lesssim t^{-\frac{\gamma-1}{\alpha}}\|\varphi\|_{C^{1}_{b}}+\int^{t}_{0}(t-s)^{\frac{\beta-\gamma}{\alpha}}\|b(s)\|_{C^{\beta}_{b}}\|u(s)\|_{C^{\beta+1}_{b}}{\mathord{{\rm d}}}s.

By Gronwall’s inequality of Volterra type, we obtain that for any γ(2,α+β)\gamma\in(2,\alpha+\beta),

u(t)Cbγtγ1αφCb1.\|u(t)\|_{C^{\gamma}_{b}}\lesssim t^{-\frac{\gamma-1}{\alpha}}\|\varphi\|_{C^{1}_{b}}.
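The Volterra-type Gronwall step can be spelled out briefly (a sketch, assuming the singular Gronwall lemma of Henry type). Since β+1 ≤ 2 < γ, one may bound ‖u(s)‖_{C^{β+1}_b} by ‖u(s)‖_{C^{γ}_b}; the kernel exponent obeys (β−γ)/α > −1 precisely because γ < α+β, and (γ−1)/α < 1 since β ≤ 1. Hence, with A(t):=‖u(t)‖_{C^{γ}_b},

```latex
A(t)\lesssim t^{-\frac{\gamma-1}{\alpha}}\|\varphi\|_{C^{1}_{b}}
+\int^{t}_{0}(t-s)^{\frac{\beta-\gamma}{\alpha}}A(s)\,{\mathord{{\rm d}}}s
\quad\Longrightarrow\quad
A(t)\lesssim t^{-\frac{\gamma-1}{\alpha}}\|\varphi\|_{C^{1}_{b}}.
```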

In this case, the weak convergence rate (3.44) holds for Hölder continuous drift bb.

4. Compound Poisson approximation for 2D-NSEs

In this section, we develop a discrete compound Poisson approximation for the 2D Navier-Stokes or Euler equations on the torus. We shall show the optimal rate of convergence for this approximation. Our scheme heavily relies on the stochastic Lagrangian particle representation of the NSEs, which has been previously studied in works such as [30], [8], and [43].

4.1. Diffeomorphism flow of SDEs driven by compound Poisson processes

In this subsection we show the diffeomorphism flow property of SDEs driven by compound Poisson processes and the connection with difference equations. More precisely, fix ε(0,1)\varepsilon\in(0,1) and let Xs,t(x)X_{s,t}(x) solve the following SDE:

Xs,t(x)\displaystyle X_{s,t}(x) =x+std(bε(r,Xs,r(x))+εz)ε(dr,dz),t>s0,\displaystyle=x+\int^{t}_{s}\int_{{\mathbb{R}}^{d}}\Big{(}b_{\varepsilon}(r,X_{s,r-}(x))+\sqrt{\varepsilon}z\Big{)}{\mathcal{H}}^{\varepsilon}({\mathord{{\rm d}}}r,{\mathord{{\rm d}}}z),\ \ t>s\geqslant 0,

where bε:+×ddb_{\varepsilon}:\mathbb{R}_{+}\times\mathbb{R}^{d}\to\mathbb{R}^{d} is a bounded continuous function, and ε\mathcal{H}^{\varepsilon} is defined as in (3.2). By definition, we can rephrase the above SDE as follows:

Xs,t(x)\displaystyle X_{s,t}(x) =x+r(s,t](bε(r,Xs,r(x))+εΔHr)𝟙Δ𝒩rε=1\displaystyle=x+\sum_{r\in(s,t]}\Big{(}b_{\varepsilon}(r,X_{s,r-}(x))+\sqrt{\varepsilon}\Delta H_{r}\Big{)}{\mathbbm{1}}_{\Delta{\mathcal{N}}^{\varepsilon}_{r}=1}
=x+stbε(r,Xs,r(x))d𝒩rε+ε(HtεHsε),\displaystyle=x+\int^{t}_{s}b_{\varepsilon}(r,X_{s,r-}(x)){\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{r}+\sqrt{\varepsilon}(H^{\varepsilon}_{t}-H^{\varepsilon}_{s}), (4.1)

where 𝒩rε{\mathcal{N}}^{\varepsilon}_{r} is defined by (2.1) and HtεH^{\varepsilon}_{t} is defined by (3.1). For given T>0T>0, bounded continuous functions φ:d\varphi:{\mathbb{R}}^{d}\to{\mathbb{R}} and f:+×df:{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}\to{\mathbb{R}}, define

u(s,x):=𝔼φ(Xs,T(x))+sT𝔼f(r,Xs,r(x))dr,s[0,T].u(s,x):={\mathbb{E}}\varphi(X_{s,T}(x))+\int^{T}_{s}{\mathbb{E}}f(r,X_{s,r}(x)){\mathord{{\rm d}}}r,\ s\in[0,T].

Since s𝒩sεs\mapsto{\mathcal{N}}^{\varepsilon}_{s} is stochastically continuous and bεb_{\varepsilon} is bi-continuous, by (4.1) and the dominated convergence theorem, it is easy to see that

(s,x)u(s,x)(s,x)\mapsto u(s,x) is bi-continuous on [0,T]×d[0,T]\times{\mathbb{R}}^{d}. (4.2)
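The dynamics (4.1) and the definition of uu are straightforward to simulate: the path is constant between the jump times of a Poisson process with rate 1/ε, and at each jump it moves by bε(r,X_{r−})+√εΔH_r. The following minimal sketch (with d=1 and f=0) assumes, purely for illustration, a standard Gaussian jump law and the bounded continuous drift bε(t,x)=ε sin x:

```python
import numpy as np

rng = np.random.default_rng(1)

eps, s, T = 0.05, 0.0, 1.0
b_eps = lambda t, x: eps * np.sin(x)   # illustrative bounded continuous drift
phi = np.tanh                          # terminal test function

def sample_X_sT(x):
    """One path of SDE (4.1): jumps of size b_eps(r, X_{r-}) + sqrt(eps)*Z
    at the jump times of a Poisson process with rate 1/eps."""
    t = s
    while True:
        t += rng.exponential(eps)      # waiting times are Exp(1/eps)
        if t > T:
            return x                   # the path is constant between jumps
        x = x + b_eps(t, x) + np.sqrt(eps) * rng.standard_normal()

# Monte Carlo estimate of u(s, x) = E phi(X_{s,T}(x))  (here f = 0)
x0 = 0.5
u_hat = np.mean([phi(sample_X_sT(x0)) for _ in range(5000)])
print(f"u({s}, {x0}) is approximately {u_hat:.3f}")
```

Replacing phi, b_eps, and the jump law recovers other instances of the scheme; the estimator converges at the usual Monte Carlo rate in the number of paths.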

The following lemma states that uu solves the backward Kolmogorov equation. Although the proof is standard, we provide a detailed one for the reader’s convenience.

Lemma 4.1.

For each xdx\in{\mathbb{R}}^{d}, the function su(s,x)s\mapsto u(s,x) is continuously differentiable, and

su(s,x)+s(ε)u(s,x)+f(s,x)=0,s[0,T],\displaystyle\partial_{s}u(s,x)+{\mathscr{L}}^{(\varepsilon)}_{s}u(s,x)+f(s,x)=0,\ \ s\in[0,T], (4.3)

where

s(ε)φ(x):=dφ(x+εz+bε(s,x))φ(x)εν(dz).{\mathscr{L}}^{(\varepsilon)}_{s}\varphi(x):=\int_{{\mathbb{R}}^{d}}\frac{\varphi(x+\sqrt{\varepsilon}z+b_{\varepsilon}(s,x))-\varphi(x)}{\varepsilon}\nu({\mathord{{\rm d}}}z).
Proof.

Fix s,h[0,T]s,h\in[0,T] with s+hTs+h\leqslant T. Note that by the flow property of Xs,t(x)X_{s,t}(x),

Xs,T(x)=Xs+h,TXs,s+h(x).X_{s,T}(x)=X_{s+h,T}\circ X_{s,s+h}(x).

This follows directly from the unique solvability of SDE (4.1). Since Xs+h,T()X_{s+h,T}(\cdot) and Xs,s+h()X_{s,s+h}(\cdot) are independent, by definition we have

u(s,x)\displaystyle u(s,x) =𝔼φ(Xs+h,TXs,s+h(x))+s+hT𝔼f(r,Xs+h,rXs,s+h(x))dr+ss+h𝔼f(r,Xs,r(x))dr\displaystyle={\mathbb{E}}\varphi(X_{s+h,T}\circ X_{s,s+h}(x))+\int^{T}_{s+h}{\mathbb{E}}f(r,X_{s+h,r}\circ X_{s,s+h}(x)){\mathord{{\rm d}}}r+\int^{s+h}_{s}{\mathbb{E}}f(r,X_{s,r}(x)){\mathord{{\rm d}}}r
=𝔼[𝔼φ(Xs+h,T(y))+s+hT𝔼f(r,Xs+h,r(y))dr]y=Xs,s+h(x)+ss+h𝔼f(r,Xs,r(x))dr\displaystyle={\mathbb{E}}\left[{\mathbb{E}}\varphi(X_{s+h,T}(y))+\int^{T}_{s+h}{\mathbb{E}}f(r,X_{s+h,r}(y)){\mathord{{\rm d}}}r\right]_{y=X_{s,s+h}(x)}+\int^{s+h}_{s}{\mathbb{E}}f(r,X_{s,r}(x)){\mathord{{\rm d}}}r
=𝔼u(s+h,Xs,s+h(x))+ss+h𝔼f(r,Xs,r(x))dr.\displaystyle={\mathbb{E}}u(s+h,X_{s,s+h}(x))+\int^{s+h}_{s}{\mathbb{E}}f(r,X_{s,r}(x)){\mathord{{\rm d}}}r.

Applying Itô’s formula to u(s+h,)u(s+h,\cdot), we have

𝔼u(s+h,Xs,s+h(x))=u(s+h,x)+ss+h𝔼r(ε)u(s+h,Xs,r(x))dr.{\mathbb{E}}u(s+h,X_{s,s+h}(x))=u(s+h,x)+\int^{s+h}_{s}{\mathbb{E}}{\mathscr{L}}^{(\varepsilon)}_{r}u(s+h,X_{s,r}(x)){\mathord{{\rm d}}}r.

Hence,

u(s+h,x)u(s,x)h=1hss+h(𝔼r(ε)u(s+h,Xs,r(x))+𝔼f(r,Xs,r(x)))dr.\displaystyle\frac{u(s+h,x)-u(s,x)}{h}=-\frac{1}{h}\int^{s+h}_{s}\Big{(}{\mathbb{E}}{\mathscr{L}}^{(\varepsilon)}_{r}u(s+h,X_{s,r}(x))+{\mathbb{E}}f(r,X_{s,r}(x))\Big{)}{\mathord{{\rm d}}}r.

By the dominated convergence theorem and (4.2), it is easy to see that

s+u(s,x)+s(ε)u(s,x)+f(s,x)=0,\partial^{+}_{s}u(s,x)+{\mathscr{L}}^{(\varepsilon)}_{s}u(s,x)+f(s,x)=0,

where s+\partial^{+}_{s} (resp. s\partial^{-}_{s}) stands for the right (resp. left) hand derivative. Similarly, we can show

su(s,x)+s(ε)u(s,x)+f(s,x)=0.\partial^{-}_{s}u(s,x)+{\mathscr{L}}^{(\varepsilon)}_{s}u(s,x)+f(s,x)=0.

Since (s,x)s(ε)u(s,x)+f(s,x)(s,x)\mapsto{\mathscr{L}}^{(\varepsilon)}_{s}u(s,x)+f(s,x) is continuous, we complete the proof. ∎

Remark 4.2.

The continuity of bεb_{\varepsilon} and ff in the time variable tt can be dropped by smooth approximation. In this case, (4.3) holds only for Lebesgue-almost all s[0,T]s\in[0,T].

Next, we investigate the C1C^{1}-diffeomorphism property of the mapping xXs,t(x)x\mapsto X_{s,t}(x). To ensure that this mapping is a homeomorphism, we impose a smallness condition on the gradient of bε(s,x)b_{\varepsilon}(s,x).

Theorem 4.3.

Suppose that (s,x)bε(s,x)(s,x)\mapsto\nabla b_{\varepsilon}(s,x) is continuous and for some κ>0\kappa>0,

|xbε(s,x)|κε1,divbε=0.\displaystyle|\nabla_{x}b_{\varepsilon}(s,x)|\leqslant\kappa\varepsilon\leqslant 1,\ \ \mathord{{\rm div}}b_{\varepsilon}=0. (4.4)

Then there is an ε0(0,1)\varepsilon_{0}\in(0,1) such that for all ε(0,ε0)\varepsilon\in(0,\varepsilon_{0}), {Xs,t(x),xd}0s<t\{X_{s,t}(x),x\in{\mathbb{R}}^{d}\}_{0\leqslant s<t} forms a C1C^{1}-diffeomorphism flow and for some constant C=C(d)>0C=C(d)>0,

𝔼det(Xs,t(x))+𝔼det(Xs,t(x))1eCκ2ε(ts).{\mathbb{E}}\det(\nabla X_{s,t}(x))+{\mathbb{E}}\det(\nabla X_{s,t}(x))^{-1}\leqslant\mathrm{e}^{C\kappa^{2}\varepsilon(t-s)}.
Proof.

Without loss of generality, we assume s=0s=0 and write Xt:=X0,t(x)X_{t}:=X_{0,t}(x). Let Jt:=XtJ_{t}:=\nabla X_{t}. By (4.1) we clearly have

Jt=𝕀+0tbε(s,Xs)Jsd𝒩sε,J_{t}={\mathbb{I}}+\int^{t}_{0}\nabla b_{\varepsilon}(s,X_{s-})J_{s-}{\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s},

and by Itô’s formula,

det(Jt)\displaystyle\det(J_{t}) =1+0t[det((𝕀+bε(s,Xs))Js)det(Js)]d𝒩sε\displaystyle=1+\int^{t}_{0}\Big{[}\det(({\mathbb{I}}+\nabla b_{\varepsilon}(s,X_{s-}))J_{s-})-\det(J_{s-})\Big{]}{\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s}
=1+0t[det(𝕀+bε(s,Xs))1]det(Js)d𝒩sε.\displaystyle=1+\int^{t}_{0}\Big{[}\det({\mathbb{I}}+\nabla b_{\varepsilon}(s,X_{s-}))-1\Big{]}\det(J_{s-}){\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s}. (4.5)

Note that for a matrix B=(bij)B=(b_{ij}) with |bij||b_{ij}|\leqslant\ell (see [45, Lemma 2.1]),

|det(𝕀+B)1trB|Cd2(1+)d2.|\det({\mathbb{I}}+B)-1-\mathrm{tr}B|\leqslant C_{d}\ell^{2}(1+\ell)^{d-2}.

By (4.4) we have

|det(𝕀+bε(s,Xs))1|Cκ2ε2,\big{|}\det({\mathbb{I}}+\nabla b_{\varepsilon}(s,X_{s-}))-1\big{|}\leqslant C\kappa^{2}\varepsilon^{2},

and there is an ε0\varepsilon_{0} small enough so that for all ε(0,ε0)\varepsilon\in(0,\varepsilon_{0}),

|det(𝕀+bε(s,Xs))11|Cκ2ε2.\big{|}\det({\mathbb{I}}+\nabla b_{\varepsilon}(s,X_{s-}))^{-1}-1\big{|}\leqslant C\kappa^{2}\varepsilon^{2}.

Thus, similarly to (4.5), we have

det(Jt)1=1+0t[det(𝕀+bε(s,Xs))11]det(Js)1d𝒩sε.\det(J_{t})^{-1}=1+\int^{t}_{0}\Big{[}\det({\mathbb{I}}+\nabla b_{\varepsilon}(s,X_{s-}))^{-1}-1\Big{]}\det(J_{s-})^{-1}{\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{s}.

Hence,

𝔼det(Jt)=1+𝔼0t[det(𝕀+bε(s,Xs))1]det(Js)d(sε)1+Cκ2ε0t𝔼det(Js)ds,\displaystyle{\mathbb{E}}\det(J_{t})=1+{\mathbb{E}}\int^{t}_{0}\Big{[}\det({\mathbb{I}}+\nabla b_{\varepsilon}(s,X_{s}))-1\Big{]}\det(J_{s}){\mathord{{\rm d}}}{\big{(}}\frac{s}{\varepsilon}{\big{)}}\leqslant 1+C\kappa^{2}\varepsilon\int^{t}_{0}{\mathbb{E}}\det(J_{s}){\mathord{{\rm d}}}s,

and also

𝔼det(Jt)11+Cκ2ε0t𝔼det(Js)1ds.\displaystyle{\mathbb{E}}\det(J_{t})^{-1}\leqslant 1+C\kappa^{2}\varepsilon\int^{t}_{0}{\mathbb{E}}\det(J_{s})^{-1}{\mathord{{\rm d}}}s.

By Gronwall’s inequality, we obtain the desired estimates. ∎
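The determinant inequality from [45, Lemma 2.1] that drives the Cκ²ε² bounds above is easy to check numerically: for B=ℓM with a fixed matrix M, the quantity |det(𝕀+B)−1−tr B| decays quadratically in ℓ. A minimal sketch (the matrix M below is arbitrary):

```python
import numpy as np

d = 3
# an arbitrary fixed matrix with entries of order one; B = l * M then has
# entries bounded by (a multiple of) l
M = np.array([[0.3, -0.7, 0.5],
              [0.9,  0.1, -0.4],
              [-0.6, 0.8, 0.2]])

def lhs(l):
    B = l * M
    return abs(np.linalg.det(np.eye(d) + B) - 1.0 - np.trace(B))

# |det(I+B) - 1 - tr(B)| <= C_d * l^2 * (1+l)^(d-2) predicts quadratic decay,
# so shrinking l by 10 should shrink the left-hand side by about 100
ratio = lhs(1e-2) / lhs(1e-3)
print(f"decay ratio: {ratio:.1f}")
```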

4.2. Compound Poisson approximation for 2D-NSEs

Fix T>0T>0. In this subsection we consider the following backward 2D-NSE on the torus 𝕋2=[π,π]2{\mathbb{T}}^{2}=[-\pi,\pi]^{2}:

su+νΔu+uu+p=0,divu=0,u(T)=φ,\displaystyle\partial_{s}u+\nu\Delta u+u\cdot\nabla u+\nabla p=0,\ \ \mathord{{\rm div}}u=0,\ \ u(T)=\varphi, (4.6)

where ν\nu stands for the viscosity constant, pp is the pressure, and φ:𝕋22\varphi:{\mathbb{T}}^{2}\to{\mathbb{R}}^{2} is a divergence-free smooth velocity field. Let w=curl(u)w={\rm curl}(u) be the curl of uu. Then ww solves the following vorticity equation

sw+νΔw+uw=0,w(T)=curl(φ)=:w0.\displaystyle\partial_{s}w+\nu\Delta w+u\cdot\nabla w=0,\ \ \ w(T)={\rm curl}(\varphi)=:w_{0}. (4.7)

If we assume

𝕋2u(x)dx=0,\int_{{\mathbb{T}}^{2}}u(x){\mathord{{\rm d}}}x=0,

then the velocity field uu can be uniquely recovered from vorticity ww by the Biot-Savart law:

u=K2w,u=K_{2}*w,

where K2K_{2} is the Biot-Savart kernel on the torus and takes the following form (see [30, (2.19)] and [37, p256, Theorem 2.17]):

K2(x):=(x2,x1)/(2π|x|2)+K0(x),K0C([π,π]2).\displaystyle K_{2}(x):=(-x_{2},x_{1})/(2\pi|x|^{2})+K_{0}(x),\ \ \ K_{0}\in C^{\infty}([-\pi,\pi]^{2}). (4.8)

Since K2L1(𝕋2)K_{2}\in L^{1}({\mathbb{T}}^{2}), we clearly have

K2wCw.\displaystyle\|K_{2}*w\|_{\infty}\leqslant C\|w\|_{\infty}. (4.9)

Let Xs,t(x)X_{s,t}(x) solve the following nonlinear SDE on the torus 𝕋2{\mathbb{T}}^{2}:

{Xs,t(x)=x+stu(r,Xs,r(x))dr+νWt,t[s,T],w(s,x)=𝔼w0(Xs,T(x)),u=K2w.\displaystyle\left\{\begin{aligned} X_{s,t}(x)&=x+\int^{t}_{s}u(r,X_{s,r}(x)){\mathord{{\rm d}}}r+\sqrt{\nu}W_{t},\ t\in[s,T],\\ w(s,x)&={\mathbb{E}}w_{0}(X_{s,T}(x)),\ \ u=K_{2}*w.\end{aligned}\right. (4.10)

It is well-known that there is a one-to-one correspondence between (4.6) and (4.7) (see [30] [8] [43]). Motivated by the approximation in Section 3, we may construct the compound Poisson approximation for system (4.10) as follows: for ε(0,1)\varepsilon\in(0,1),

{Xs,tε(x)=x+εstuε(r,Xs,rε(x))d𝒩rε+εν(HtεHsε),wε(s,x)=𝔼w0(Xs,Tε(x)),uε=K2wε,\displaystyle\left\{\begin{aligned} X^{\varepsilon}_{s,t}(x)&=x+\varepsilon\int^{t}_{s}u_{\varepsilon}(r,X^{\varepsilon}_{s,r-}(x)){\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{r}+\sqrt{\varepsilon\nu}(H^{\varepsilon}_{t}-H^{\varepsilon}_{s}),\\ w_{\varepsilon}(s,x)&={\mathbb{E}}w_{0}(X^{\varepsilon}_{s,T}(x)),\ \ u_{\varepsilon}=K_{2}*w_{\varepsilon},\end{aligned}\right. (4.11)

where HtεH^{\varepsilon}_{t} is a compound Poisson process defined in (1.13). By Lemma 4.1, wεw_{\varepsilon} solves the following nonlinear discrete difference equation:

swε+s(ε)wε=0,uε=K2wε,\partial_{s}w_{\varepsilon}+{\mathscr{L}}^{(\varepsilon)}_{s}w_{\varepsilon}=0,\ \ u_{\varepsilon}=K_{2}*w_{\varepsilon},

where

s(ε)f(x):=i=1,2f(x+ενei+εuε(s,x))+f(xενei+εuε(s,x))2f(x)2ε.{\mathscr{L}}^{(\varepsilon)}_{s}f(x):=\sum_{i=1,2}\frac{f(x+\sqrt{\varepsilon\nu}e_{i}+\varepsilon u_{\varepsilon}(s,x))+f(x-\sqrt{\varepsilon\nu}e_{i}+\varepsilon u_{\varepsilon}(s,x))-2f(x)}{2\varepsilon}.
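As a quick consistency check, the discrete operator ℒs(ε) above stabilizes as ε→0 (its Taylor expansion produces an advection-diffusion generator, with constants fixed by the jump law). The following sketch evaluates it on a smooth test function with a frozen velocity vector, both chosen arbitrarily:

```python
import numpy as np

nu = 0.7
u_vec = np.array([0.3, -0.2])                # frozen velocity (illustrative)
f = lambda x: np.sin(x[0]) * np.cos(x[1])    # smooth test function

def L_eps(x, eps):
    """The discrete difference operator displayed above, evaluated at x."""
    e = np.eye(2)
    shift = eps * u_vec
    return sum(
        (f(x + np.sqrt(eps * nu) * e[i] + shift)
         + f(x - np.sqrt(eps * nu) * e[i] + shift)
         - 2.0 * f(x)) / (2.0 * eps)
        for i in range(2)
    )

x = np.array([0.4, 1.1])
vals = [L_eps(x, eps) for eps in (1e-2, 1e-4, 1e-6)]
print(vals)
```

The successive values differ by O(ε), reflecting the first-order consistency of the scheme at smooth functions.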

The following Beale-Kato-Majda estimate for the Biot-Savart law on the torus is crucial for solving the stochastic system (4.11).

Lemma 4.4.

For any γ(0,1]\gamma\in(0,1], there is a constant C=C(γ)>0C=C(\gamma)>0 such that for any wCbγ(𝕋2)w\in C^{\gamma}_{b}({\mathbb{T}}^{2}),

(K2w)C(1+w(1+log(1+[w]γ))),\|\nabla(K_{2}*w)\|_{\infty}\leqslant C\big{(}1+\|w\|_{\infty}(1+\log(1+[w]_{\gamma}))\big{)},

where [w]γ:=supxy|w(x)w(y)||xy|γ[w]_{\gamma}:=\sup_{x\not=y}\frac{|w(x)-w(y)|}{|x-y|^{\gamma}}.

Proof.

Let H(x):=(x2,x1)/(2π|x|2)H(x):=(-x_{2},x_{1})/(2\pi|x|^{2}). By (4.8), it suffices to make an estimate for Hw\nabla H*w. For ε(0,1)\varepsilon\in(0,1), by definition and the cancellation property |y|=sH(y)dy=0\int_{|y|=s}\nabla H(y){\mathord{{\rm d}}}y=0, we have

Hw(x)=p.v.𝕋2H(y)w(xy)dy=Iε(x)+Jε(x),\displaystyle\nabla H*w(x)={\rm p.v.}\int_{{\mathbb{T}}^{2}}\nabla H(y)w(x-y){\mathord{{\rm d}}}y=I_{\varepsilon}(x)+J_{\varepsilon}(x),

where

Iε(x)\displaystyle I_{\varepsilon}(x) :=|y|εH(y)(w(xy)w(x))dy,\displaystyle:=\int_{|y|\leqslant\varepsilon}\nabla H(y)(w(x-y)-w(x)){\mathord{{\rm d}}}y,
Jε(x)\displaystyle J_{\varepsilon}(x) :=ε<|y|πH(y)w(xy)dy.\displaystyle:=\int_{\varepsilon<|y|\leqslant\pi}\nabla H(y)w(x-y){\mathord{{\rm d}}}y.

For IεI_{\varepsilon}, since |H(y)|4|y|2|\nabla H(y)|\leqslant 4|y|^{-2}, we have

Iε4[w]γ|y|ε|y|γ2dyC[w]γεγ.\|I_{\varepsilon}\|_{\infty}\leqslant 4[w]_{\gamma}\int_{|y|\leqslant\varepsilon}|y|^{\gamma-2}{\mathord{{\rm d}}}y\leqslant C[w]_{\gamma}\varepsilon^{\gamma}.

For JεJ_{\varepsilon}, we have

Jε4wε<|y|π|y|2dyCw(1+log1/ε).\|J_{\varepsilon}\|_{\infty}\leqslant 4\|w\|_{\infty}\int_{\varepsilon<|y|\leqslant\pi}|y|^{-2}{\mathord{{\rm d}}}y\leqslant C\|w\|_{\infty}(1+\log 1/\varepsilon).

Combining the above two estimates and choosing ε=([w]γ+1)1\varepsilon=([w]_{\gamma}+1)^{-1}, we obtain

∇HwC(1+w(1+log(1+[w]γ))).\|\nabla H*w\|_{\infty}\leqslant C\big{(}1+\|w\|_{\infty}(1+\log(1+[w]_{\gamma}))\big{)}.

The proof is complete. ∎

Remark 4.5.

In the whole space, the above estimates need to be modified as follows (see [30]):

uC(1+w(1+log(1+[w]γ+wp))),p[1,).\|\nabla u\|_{\infty}\leqslant C\big{(}1+\|w\|_{\infty}(1+\log(1+[w]_{\gamma}+\|w\|_{p}))\big{)},\ \ p\in[1,\infty).

The presence of wp\|w\|_{p} and the Jacobian determinant in Theorem 4.3, which depend on the bound of bε\nabla b_{\varepsilon}, pose challenges when solving the approximating equation (4.11) for NSEs on the entire space. This is why we consider NSEs on the torus instead.

Now we can establish the solvability of the stochastic system (4.11).

Theorem 4.6.

For any w0Cb1(𝕋2)w_{0}\in C^{1}_{b}({\mathbb{T}}^{2}), there is a unique solution Xs,tε(x)X^{\varepsilon}_{s,t}(x) to stochastic system (4.11) so that wεC([0,T];Cb1(𝕋2))w_{\varepsilon}\in C([0,T];C^{1}_{b}({\mathbb{T}}^{2})) and there is a constant C>0C>0 such that for all ε(0,1)\varepsilon\in(0,1) and s[0,T]s\in[0,T],

wε(s)C.\displaystyle\|\nabla w_{\varepsilon}(s)\|_{\infty}\leqslant C. (4.12)
Proof.

We use Picard’s iteration method. Let u0(t,x)=K2w0(x)u_{0}(t,x)=K_{2}*w_{0}(x). For nn\in{\mathbb{N}}, let Xs,tn(x)X^{n}_{s,t}(x) solve

Xs,tn(x)=x+εstun1(r,Xs,rn(x))d𝒩rε+εν(HtεHsε),t[s,T],\displaystyle X^{n}_{s,t}(x)=x+\varepsilon\int^{t}_{s}u_{n-1}(r,X^{n}_{s,r-}(x)){\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{r}+\sqrt{\varepsilon\nu}(H^{\varepsilon}_{t}-H^{\varepsilon}_{s}),\ t\in[s,T], (4.13)

and define recursively,

un(s,x):=K2wn(s,)(x),wn(s,x):=𝔼w0(Xs,Tn(x)).\displaystyle u_{n}(s,x):=K_{2}*w_{n}(s,\cdot)(x),\ \ w_{n}(s,x):={\mathbb{E}}w_{0}(X^{n}_{s,T}(x)). (4.14)

Clearly, we have unC([0,T];Cb1(𝕋2))u_{n}\in C([0,T];C^{1}_{b}({\mathbb{T}}^{2})) and

𝔼Xs,tn1+stun1(r)𝔼Xs,rndr.{\mathbb{E}}\|\nabla X^{n}_{s,t}\|_{\infty}\leqslant 1+\int^{t}_{s}\|\nabla u_{n-1}(r)\|_{\infty}{\mathbb{E}}\|\nabla X^{n}_{s,r}\|_{\infty}{\mathord{{\rm d}}}r.

By Gronwall’s inequality we get

𝔼Xs,TnesTun1(r)dr.{\mathbb{E}}\|\nabla X^{n}_{s,T}\|_{\infty}\leqslant\mathrm{e}^{\int^{T}_{s}\|\nabla u_{n-1}(r)\|_{\infty}{\mathord{{\rm d}}}r}.

Moreover, by (4.9) and Lemma 4.4 with γ=1\gamma=1, we have

un(s)+un(s)1+wn(s)(1+log(1+wn(s))),\|u_{n}(s)\|_{\infty}+\|\nabla u_{n}(s)\|_{\infty}\lesssim 1+\|w_{n}(s)\|_{\infty}(1+\log(1+\|\nabla w_{n}(s)\|_{\infty})),

and by definition (4.14),

wn(s)w0𝔼Xs,Tnw0esTun1(r)dr.\displaystyle\|\nabla w_{n}(s)\|_{\infty}\leqslant\|\nabla w_{0}\|_{\infty}{\mathbb{E}}\|\nabla X^{n}_{s,T}\|_{\infty}\leqslant\|\nabla w_{0}\|_{\infty}\mathrm{e}^{\int^{T}_{s}\|\nabla u_{n-1}(r)\|_{\infty}{\mathord{{\rm d}}}r}. (4.15)

Hence,

un(s)\displaystyle\|\nabla u_{n}(s)\|_{\infty} C1+w0(1+log(1+w0esTun1(r)dr))\displaystyle\lesssim_{C}1+\|w_{0}\|_{\infty}(1+\log(1+\|\nabla w_{0}\|_{\infty}\mathrm{e}^{\int^{T}_{s}\|\nabla u_{n-1}(r)\|_{\infty}{\mathord{{\rm d}}}r}))
C1+w0(1+log(1+w0)+sTun1(r)dr).\displaystyle\lesssim_{C}1+\|w_{0}\|_{\infty}\left(1+\log(1+\|\nabla w_{0}\|_{\infty})+\int^{T}_{s}\|\nabla u_{n-1}(r)\|_{\infty}{\mathord{{\rm d}}}r\right).

By Gronwall’s inequality again, we obtain

supnsups[0,T]un(s)C.\displaystyle\sup_{n}\sup_{s\in[0,T]}\|\nabla u_{n}(s)\|_{\infty}\leqslant C. (4.16)

On the other hand, by (4.13) we have

𝔼Xs,tnXs,tm\displaystyle{\mathbb{E}}\|X^{n}_{s,t}-X^{m}_{s,t}\|_{\infty} ε𝔼stun1(r,Xs,rn)un1(r,Xs,rm)d𝒩rε\displaystyle\leqslant\varepsilon{\mathbb{E}}\int^{t}_{s}\|u_{n-1}(r,X^{n}_{s,r-})-u_{n-1}(r,X^{m}_{s,r-})\|_{\infty}{\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{r}
+ε𝔼stun1(r)um1(r)d𝒩rε\displaystyle\quad+\varepsilon{\mathbb{E}}\int^{t}_{s}\|u_{n-1}(r)-u_{m-1}(r)\|_{\infty}{\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{r}
stun1(r)𝔼Xs,rnXs,rmdr+stun1(r)um1(r)dr,\displaystyle\leqslant\int^{t}_{s}\|\nabla u_{n-1}(r)\|_{\infty}{\mathbb{E}}\|X^{n}_{s,r}-X^{m}_{s,r}\|_{\infty}{\mathord{{\rm d}}}r+\int^{t}_{s}\|u_{n-1}(r)-u_{m-1}(r)\|_{\infty}{\mathord{{\rm d}}}r,

which together with (4.16) implies by Gronwall’s inequality that

supt[s,T]𝔼Xs,tnXs,tmCsTun1(r)um1(r)dr.\sup_{t\in[s,T]}{\mathbb{E}}\|X^{n}_{s,t}-X^{m}_{s,t}\|_{\infty}\leqslant C\int^{T}_{s}\|u_{n-1}(r)-u_{m-1}(r)\|_{\infty}{\mathord{{\rm d}}}r.

Thus, by (4.9) we get

wn(s)wm(s)\displaystyle\|w_{n}(s)-w_{m}(s)\|_{\infty} w0𝔼Xs,TnXs,Tm\displaystyle\leqslant\|\nabla w_{0}\|_{\infty}{\mathbb{E}}\|X^{n}_{s,T}-X^{m}_{s,T}\|_{\infty}
w0sTun1(r)um1(r)dr\displaystyle\lesssim\|\nabla w_{0}\|_{\infty}\int^{T}_{s}\|u_{n-1}(r)-u_{m-1}(r)\|_{\infty}{\mathord{{\rm d}}}r
w0sTwn1(r)wm1(r)dr.\displaystyle\lesssim\|\nabla w_{0}\|_{\infty}\int^{T}_{s}\|w_{n-1}(r)-w_{m-1}(r)\|_{\infty}{\mathord{{\rm d}}}r.

By Gronwall’s inequality again, we have

limn,msups[0,T]wn(s)wm(s)=0,\lim_{n,m\to\infty}\sup_{s\in[0,T]}\|w_{n}(s)-w_{m}(s)\|_{\infty}=0,

and also,

limn,msups[0,T]supt[s,T]𝔼Xs,tnXs,tm=0.\lim_{n,m\to\infty}\sup_{s\in[0,T]}\sup_{t\in[s,T]}{\mathbb{E}}\|X^{n}_{s,t}-X^{m}_{s,t}\|_{\infty}=0.

By taking limits for (4.13) and (4.14), we obtain the desired result. Moreover, estimate (4.12) follows by (4.15) and (4.16). ∎

Now we can show the following main result of this section.

Theorem 4.7.

Suppose that φC5(𝕋2;2)\varphi\in C^{5}({\mathbb{T}}^{2};{\mathbb{R}}^{2}) is divergence free and satisfies 𝕋2φ(x)dx=0\int_{{\mathbb{T}}^{2}}\varphi(x){\mathord{{\rm d}}}x=0. Let uC([0,T];C5(𝕋2;2))u\in C([0,T];C^{5}({\mathbb{T}}^{2};{\mathbb{R}}^{2})) be the unique solution of NSE (4.6). Then there is a constant C>0C>0 such that for all ε(0,1)\varepsilon\in(0,1),

sups[0,T]uε(s)u(s)Cε.\sup_{s\in[0,T]}\|u_{\varepsilon}(s)-u(s)\|_{\infty}\leqslant C\varepsilon.
Proof.

For x𝕋2x\in{\mathbb{T}}^{2}, let X~s,tε(x)\widetilde{X}^{\varepsilon}_{s,t}(x) solve the following SDE on the torus 𝕋2{\mathbb{T}}^{2},

X~s,tε(x)=x+εstu(r,X~s,rε(x))d𝒩rε+εν(HtεHsε),\displaystyle\widetilde{X}^{\varepsilon}_{s,t}(x)=x+\varepsilon\int^{t}_{s}u(r,\widetilde{X}^{\varepsilon}_{s,r-}(x)){\mathord{{\rm d}}}{\mathcal{N}}^{\varepsilon}_{r}+\sqrt{\varepsilon\nu}(H^{\varepsilon}_{t}-H^{\varepsilon}_{s}), (4.17)

where HtεH^{\varepsilon}_{t} is a compound Poisson process defined in (1.13). Since u(r,)u(r,\cdot) is a function on the torus 𝕋2{\mathbb{T}}^{2}, so that u(r,x+z)=u(r,x)u(r,x+z)=u(r,x) for any z2π2z\in 2\pi{\mathbb{Z}}^{2}, one sees that

X~s,tε(x+z)=X~s,tε(x)+z,z2π2.\widetilde{X}^{\varepsilon}_{s,t}(x+z)=\widetilde{X}^{\varepsilon}_{s,t}(x)+z,\ \ z\in 2\pi{\mathbb{Z}}^{2}.

Let w=curl(u)w={\rm curl}(u) and w0=curl(φ)w_{0}={\rm curl}(\varphi), and define

w~ε(s,x):=𝔼w0(X~s,Tε(x)).\widetilde{w}_{\varepsilon}(s,x):={\mathbb{E}}w_{0}(\widetilde{X}^{\varepsilon}_{s,T}(x)).

By (4.7) and Itô’s formula, we have

w~ε(s,x)\displaystyle\widetilde{w}_{\varepsilon}(s,x) =𝔼w(T,X~s,Tε(x))=w(s,x)+𝔼sT(sw+~r(ε)w)(r,X~s,rε(x))dr\displaystyle={\mathbb{E}}w(T,\widetilde{X}^{\varepsilon}_{s,T}(x))=w(s,x)+{\mathbb{E}}\int^{T}_{s}(\partial_{s}w+\widetilde{\mathscr{L}}^{(\varepsilon)}_{r}w)(r,\widetilde{X}^{\varepsilon}_{s,r}(x)){\mathord{{\rm d}}}r
=w(s,x)+𝔼sT(~r(ε)wνΔwuw)(r,X~s,rε(x))dr,\displaystyle=w(s,x)+{\mathbb{E}}\int^{T}_{s}(\widetilde{\mathscr{L}}^{(\varepsilon)}_{r}w-\nu\Delta w-u\cdot\nabla w)(r,\widetilde{X}^{\varepsilon}_{s,r}(x)){\mathord{{\rm d}}}r,

where ~s(ε)\widetilde{\mathscr{L}}^{(\varepsilon)}_{s} is the generator of SDE (4.17) and given by

~s(ε)f(x):=i=1,2f(x+ενei+εu(s,x))+f(xενei+εu(s,x))2f(x)2ε.\widetilde{\mathscr{L}}^{(\varepsilon)}_{s}f(x):=\sum_{i=1,2}\frac{f(x+\sqrt{\varepsilon\nu}e_{i}+\varepsilon u(s,x))+f(x-\sqrt{\varepsilon\nu}e_{i}+\varepsilon u(s,x))-2f(x)}{2\varepsilon}.

Hence,

w~ε(s)w(s)sT(~r(ε)wνΔwuw)(r)dr.\displaystyle\|\widetilde{w}_{\varepsilon}(s)-w(s)\|_{\infty}\leqslant\int^{T}_{s}\|(\widetilde{\mathscr{L}}^{(\varepsilon)}_{r}w-\nu\Delta w-u\cdot\nabla w)(r)\|_{\infty}{\mathord{{\rm d}}}r. (4.18)

Noting that for i=1,2i=1,2,

f(x+ενei+εu(s,x))+f(xενei+εu(s,x))2f(x+εu(s,x))2ενi2f(x)\displaystyle\frac{f(x+\sqrt{\varepsilon\nu}e_{i}+\varepsilon u(s,x))+f(x-\sqrt{\varepsilon\nu}e_{i}+\varepsilon u(s,x))-2f(x+\varepsilon u(s,x))}{2\varepsilon}-\nu\partial^{2}_{i}f(x)
=ν01θ211(i2f(x+θθενei+εu(s,x))i2f(x))dθdθ\displaystyle\qquad=\nu\int^{1}_{0}\frac{\theta}{2}\int^{1}_{-1}\Big{(}\partial^{2}_{i}f(x+\theta\theta^{\prime}\sqrt{\varepsilon\nu}e_{i}+\varepsilon u(s,x))-\partial^{2}_{i}f(x)\Big{)}{\mathord{{\rm d}}}\theta^{\prime}{\mathord{{\rm d}}}\theta
=ν01θ211θθεν01i3f(x+θθθ′′ενei+θ′′εu(s,x))dθ′′dθdθ\displaystyle\qquad=\nu\int^{1}_{0}\frac{\theta}{2}\int^{1}_{-1}\theta\theta^{\prime}\sqrt{\varepsilon\nu}\int^{1}_{0}\partial^{3}_{i}f(x+\theta\theta^{\prime}\theta^{\prime\prime}\sqrt{\varepsilon\nu}e_{i}+\theta^{\prime\prime}\varepsilon u(s,x)){\mathord{{\rm d}}}\theta^{\prime\prime}{\mathord{{\rm d}}}\theta^{\prime}{\mathord{{\rm d}}}\theta
=εν301θ2211θ01(i3f(x+θθθ′′ενei+θ′′εu(s,x))i3f(x))dθ′′dθdθ\displaystyle\qquad=\sqrt{\varepsilon\nu^{3}}\int^{1}_{0}\frac{\theta^{2}}{2}\int^{1}_{-1}\theta^{\prime}\int^{1}_{0}\Big{(}\partial^{3}_{i}f(x+\theta\theta^{\prime}\theta^{\prime\prime}\sqrt{\varepsilon\nu}e_{i}+\theta^{\prime\prime}\varepsilon u(s,x))-\partial^{3}_{i}f(x)\Big{)}{\mathord{{\rm d}}}\theta^{\prime\prime}{\mathord{{\rm d}}}\theta^{\prime}{\mathord{{\rm d}}}\theta

and

f(x+εu(s,x))f(x)εu(s,x)f(x)=u(s,x)01(f(x+θεu(s,x))f(x))dθ,\displaystyle\frac{f(x+\varepsilon u(s,x))-f(x)}{\varepsilon}-u(s,x)\cdot\nabla f(x)=u(s,x)\cdot\int^{1}_{0}(\nabla f(x+\theta\varepsilon u(s,x))-\nabla f(x)){\mathord{{\rm d}}}\theta,

we have

~s(ε)fνΔfuf(εν2+ε3ν3u)4f+εu22f.\|\widetilde{\mathscr{L}}^{(\varepsilon)}_{s}f-\nu\Delta f-u\cdot\nabla f\|_{\infty}\lesssim(\varepsilon\nu^{2}+\sqrt{\varepsilon^{3}\nu^{3}}\|u\|_{\infty})\|\nabla^{4}f\|_{\infty}+\varepsilon\|u\|^{2}_{\infty}\|\nabla^{2}f\|_{\infty}.

Substituting this into (4.18), we obtain that for all ε,ν(0,1]\varepsilon,\nu\in(0,1],

w~ε(s)w(s)εsT(1+u(r)2)w(r)Cb4dr.\displaystyle\|\widetilde{w}_{\varepsilon}(s)-w(s)\|_{\infty}\lesssim\varepsilon\int^{T}_{s}(1+\|u(r)\|^{2}_{\infty})\|w(r)\|_{C^{4}_{b}}{\mathord{{\rm d}}}r. (4.19)

On the other hand, by (4.17) and (4.11), we have

𝔼X~s,tεXs,tε\displaystyle{\mathbb{E}}\|\widetilde{X}^{\varepsilon}_{s,t}-X^{\varepsilon}_{s,t}\|_{\infty} ε𝔼(stu(r,X~s,rε)uε(r,Xs,rε)d𝒩rε)\displaystyle\leqslant\varepsilon{\mathbb{E}}\left(\int^{t}_{s}\big{\|}u(r,\widetilde{X}^{\varepsilon}_{s,r-})-u_{\varepsilon}(r,X^{\varepsilon}_{s,r-})\big{\|}_{\infty}{\mathord{{\rm d}}}{\mathcal{N}}_{r}^{\varepsilon}\right)
=𝔼stu(r,X~s,rε)uε(r,Xs,rε)dr\displaystyle={\mathbb{E}}\int^{t}_{s}\big{\|}u(r,\widetilde{X}^{\varepsilon}_{s,r})-u_{\varepsilon}(r,X^{\varepsilon}_{s,r})\big{\|}_{\infty}{\mathord{{\rm d}}}r
st𝔼uε(r,X~s,rε)uε(r,Xs,rε)dr+stu(r)uε(r)dr\displaystyle\leqslant\int^{t}_{s}{\mathbb{E}}\big{\|}u_{\varepsilon}(r,\widetilde{X}^{\varepsilon}_{s,r})-u_{\varepsilon}(r,X^{\varepsilon}_{s,r})\big{\|}_{\infty}{\mathord{{\rm d}}}r+\int^{t}_{s}\|u(r)-u_{\varepsilon}(r)\|_{\infty}{\mathord{{\rm d}}}r
stuε𝔼X~s,rεXs,rεdr+stu(r)uε(r)dr,\displaystyle\leqslant\int^{t}_{s}\|\nabla u_{\varepsilon}\|_{\infty}{\mathbb{E}}\|\widetilde{X}^{\varepsilon}_{s,r}-X^{\varepsilon}_{s,r}\|_{\infty}{\mathord{{\rm d}}}r+\int^{t}_{s}\|u(r)-u_{\varepsilon}(r)\|_{\infty}{\mathord{{\rm d}}}r,

which implies by Gronwall’s inequality that

𝔼X~s,Tε()Xs,Tε()sTu(r)uε(r)dr,{\mathbb{E}}\|\widetilde{X}^{\varepsilon}_{s,T}(\cdot)-X^{\varepsilon}_{s,T}(\cdot)\|_{\infty}\lesssim\int^{T}_{s}\|u(r)-u_{\varepsilon}(r)\|_{\infty}{\mathord{{\rm d}}}r,

and

w~ε(s)wε(s)w0𝔼X~s,Tε()Xs,Tε()sTu(r)uε(r)dr.\displaystyle\|\widetilde{w}_{\varepsilon}(s)-w_{\varepsilon}(s)\|_{\infty}\leqslant\|\nabla w_{0}\|_{\infty}{\mathbb{E}}\|\widetilde{X}^{\varepsilon}_{s,T}(\cdot)-X^{\varepsilon}_{s,T}(\cdot)\|_{\infty}\lesssim\int^{T}_{s}\|u(r)-u_{\varepsilon}(r)\|_{\infty}{\mathord{{\rm d}}}r.

Combining this with (4.19) and (4.9) yields that

wε(s)w(s)ε+sTu(r)uε(r)drε+sTw(r)wε(r)dr.\|w_{\varepsilon}(s)-w(s)\|_{\infty}\lesssim\varepsilon+\int^{T}_{s}\|u(r)-u_{\varepsilon}(r)\|_{\infty}{\mathord{{\rm d}}}r\lesssim\varepsilon+\int^{T}_{s}\|w(r)-w_{\varepsilon}(r)\|_{\infty}{\mathord{{\rm d}}}r.

By Gronwall’s inequality and (4.9), we obtain the desired estimate. ∎

Remark 4.8.

In addition to the 2D-Navier-Stokes equations on the torus, one can also construct a compound Poisson approximation for the 3D-Navier-Stokes equations on the torus; this will be the focus of future work. We anticipate that similar short-time convergence results can be obtained in that case as well, following the methodology of [43].

5. Propagation of chaos for the particle approximation of DDSDEs

In this section, we investigate the propagation of chaos for the interacting particle approximation of McKean-Vlasov SDEs driven by either Brownian motions or α\alpha-stable processes. Propagation of chaos refers to the convergence of the particle system to the solution of the McKean-Vlasov SDE as the number of particles tends to infinity, and it provides a direct full discretization scheme for nonlinear SDEs, allowing for efficient numerical simulation.

Fix an NN\in{\mathbb{N}} and a symmetric probability measure ν𝒫(d)\nu\in{\mathcal{P}}({\mathbb{R}}^{d}). Let (𝒩N,i)i=1,,N({\mathcal{N}}^{N,i})_{i=1,\cdots,N} be a sequence of i.i.d. Poisson processes with intensity NN, and let (ξnN,i)n,i=1,,N(\xi^{N,i}_{n})_{n\in{\mathbb{N}},i=1,\cdots,N} be i.i.d. d{\mathbb{R}}^{d}-valued random variables with common distribution ν\nu. Define for i=1,,Ni=1,\cdots,N,

HtN,i:=(ξ1N,i++ξ𝒩tN,iN,i)𝟙𝒩tN,i1.H^{N,i}_{t}:=\Big{(}\xi^{N,i}_{1}+\cdots+\xi^{N,i}_{{\mathcal{N}}^{N,i}_{t}}\Big{)}{\mathbbm{1}}_{{\mathcal{N}}^{N,i}_{t}\geqslant 1}.

Then (HN,i)i=1,,N(H^{N,i})_{i=1,\cdots,N} is a sequence of i.i.d. compound Poisson processes with intensity Ndtν(dz)N{\mathord{{\rm d}}}t\nu({\mathord{{\rm d}}}z). Let N,i{\mathcal{H}}^{N,i} be the associated Poisson random measure, that is,

N,i([0,t],E):=st𝟙E(ΔHsN,i)=n𝒩tN,i𝟙E(ξnN,i),E(d),{\mathcal{H}}^{N,i}([0,t],E):=\sum_{s\leqslant t}{\mathbbm{1}}_{E}(\Delta H^{N,i}_{s})=\sum_{n\leqslant{\mathcal{N}}^{N,i}_{t}}{\mathbbm{1}}_{E}(\xi^{N,i}_{n}),\ \ E\in{\mathscr{B}}({\mathbb{R}}^{d}),

and ~N,i\widetilde{\mathcal{H}}^{N,i} the compensated Poisson random measure, that is,

~N,i(dt,dz):=N,i(dt,dz)Ndtν(dz).\widetilde{\mathcal{H}}^{N,i}({\mathord{{\rm d}}}t,{\mathord{{\rm d}}}z):={\mathcal{H}}^{N,i}({\mathord{{\rm d}}}t,{\mathord{{\rm d}}}z)-N{\mathord{{\rm d}}}t\nu({\mathord{{\rm d}}}z).
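For intuition, the driving noises can be sampled exactly: 𝒩^{N,i}_t is Poisson with mean Nt, and H^{N,i}_t is a sum of that many i.i.d. draws from ν. The following Python sketch is illustrative only (not part of the paper) and takes ν to be the standard Gaussian on ℝ^d, which is symmetric as required:

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_compound_poisson(N, t, d, num_paths):
    """Sample H^{N,i}_t for i = 1..num_paths: i.i.d. compound Poisson
    variables with jump intensity N and jump law nu (here: N(0, I_d))."""
    counts = rng.poisson(N * t, size=num_paths)   # jump counts N^{N,i}_t
    H = np.zeros((num_paths, d))
    for i, n in enumerate(counts):
        if n >= 1:                                # the indicator 1_{N^{N,i}_t >= 1}
            H[i] = rng.normal(size=(n, d)).sum(axis=0)
    return H

# Sanity check: with nu symmetric, E H^{N,i}_t = 0, and each coordinate
# has variance N * t * (second moment of nu), which equals N * t here.
H = sample_compound_poisson(N=50, t=1.0, d=2, num_paths=20000)
print(H.mean(axis=0), H.var(axis=0))
```

Replacing `rng.normal` by a sampler for any symmetric ν gives the general construction.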

For a point 𝐱=(x1,,xN)(d)N{\mathbf{x}}=(x^{1},\cdots,x^{N})\in({\mathbb{R}}^{d})^{N}, the empirical measure of 𝐱{\mathbf{x}} is defined by

μ𝐱(dz):=1Ni=1Nδxi(dz)𝒫(d),\mu_{{\mathbf{x}}}({\mathord{{\rm d}}}z):=\frac{1}{N}\sum_{i=1}^{N}\delta_{x^{i}}({\mathord{{\rm d}}}z)\in{\mathcal{P}}({\mathbb{R}}^{d}),

where δxi\delta_{x^{i}} is the usual Dirac measure concentrated at point xix^{i}. Let

σN(t,x,y,z):+×d×d×dd,bN(t,x,y):+×d×dd\sigma_{N}(t,x,y,z):{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}\times{\mathbb{R}}^{d}\times{\mathbb{R}}^{d}\to{\mathbb{R}}^{d},\ b_{N}(t,x,y):{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}\times{\mathbb{R}}^{d}\to{\mathbb{R}}^{d}

be two Borel measurable functions. Suppose that

σN(t,x,y,z)=σN(t,x,y,z).\sigma_{N}(t,x,y,-z)=-\sigma_{N}(t,x,y,z).

For a probability measure μ𝒫(d)\mu\in{\mathcal{P}}({\mathbb{R}}^{d}), we write

σN[t,x,μ,z]:=dσN(t,x,y,z)μ(dy),bN[t,x,μ]:=dbN(t,x,y)μ(dy).\sigma_{N}[t,x,\mu,z]:=\int_{{\mathbb{R}}^{d}}\sigma_{N}(t,x,y,z)\mu({\mathord{{\rm d}}}y),\ \ b_{N}[t,x,\mu]:=\int_{{\mathbb{R}}^{d}}b_{N}(t,x,y)\mu({\mathord{{\rm d}}}y).

Let 𝐗tN=(XtN,i)i=1,,N{\mathbf{X}}^{N}_{t}=(X^{N,i}_{t})_{i=1,\cdots,N} solve the following interaction particle system driven by N,i{\mathcal{H}}^{N,i}:

XtN,i=X0N,i+0td(σN[s,XsN,i,μ𝐗sN,z]+bN[s,XsN,i,μ𝐗sN])N,i(ds,dz)=XtN,i+1Nj=1N(σN(t,XtN,i,XtN,j,ΔHtN,i)+bN(t,XtN,i,XtN,j)Δ𝒩tN,i),\displaystyle\begin{split}X^{N,i}_{t}&=X^{N,i}_{0}+\int^{t}_{0}\int_{{\mathbb{R}}^{d}}\left(\sigma_{N}{\Big{[}}s,X^{N,i}_{s-},\mu_{{\mathbf{X}}^{N}_{s-}},z{\Big{]}}+b_{N}{\Big{[}}s,X^{N,i}_{s-},\mu_{{\mathbf{X}}^{N}_{s-}}{\Big{]}}\right){\mathcal{H}}^{N,i}({\mathord{{\rm d}}}s,{\mathord{{\rm d}}}z)\\ &=X^{N,i}_{t-}+\frac{1}{N}\sum_{j=1}^{N}\left(\sigma_{N}{\big{(}}t,X^{N,i}_{t-},X^{N,j}_{t-},\Delta H^{N,i}_{t}{\big{)}}+b_{N}{\big{(}}t,X^{N,i}_{t-},X^{N,j}_{t-}{\big{)}}\Delta{\mathcal{N}}^{N,i}_{t}\right),\end{split} (5.1)

where 𝐗0N{\mathbf{X}}^{N}_{0} is a symmetric 0{\mathcal{F}}_{0}-measurable random variable. For a function f:df:{\mathbb{R}}^{d}\to{\mathbb{R}}, by Itô’s formula (see (3.12)), we have

f(XtN,i)=f(X0N,i)+0tμ𝐗sNNf(s,XsN,i)ds+0tdΘμ𝐗sNNf(s,XsN,i,z)~N,i(ds,dz),\displaystyle f(X^{N,i}_{t})=f(X^{N,i}_{0})+\int^{t}_{0}{\mathscr{L}}^{N}_{\mu_{{\mathbf{X}}^{N}_{s}}}f(s,X^{N,i}_{s}){\mathord{{\rm d}}}s+\int^{t}_{0}\!\!\int_{{\mathbb{R}}^{d}}\Theta^{N}_{\mu_{{\mathbf{X}}^{N}_{s-}}}f(s,X^{N,i}_{s-},z)\widetilde{\mathcal{H}}^{N,i}({\mathord{{\rm d}}}s,{\mathord{{\rm d}}}z), (5.2)

where for a probability measure μ𝒫(d)\mu\in{\mathcal{P}}({\mathbb{R}}^{d}),

μNf(t,x):=t,μNf(x):=Nd(f(x+σN[t,x,μ,z]+bN[t,x,μ])f(x))ν(dz),\displaystyle{\mathscr{L}}^{N}_{\mu}f(t,x):={\mathscr{L}}^{N}_{t,\mu}f(x):=N\int_{{\mathbb{R}}^{d}}\Big{(}f{\big{(}}x+\sigma_{N}[t,x,\mu,z]+b_{N}[t,x,\mu]{\big{)}}-f(x)\Big{)}\nu({\mathord{{\rm d}}}z), (5.3)

and

ΘμNf(t,x,z):=Θt,μNf(x,z):=f(x+σN[t,x,μ,z]+bN[t,x,μ])f(x).\displaystyle\Theta^{N}_{\mu}f(t,x,z):=\Theta^{N}_{t,\mu}f(x,z):=f{\big{(}}x+\sigma_{N}[t,x,\mu,z]+b_{N}[t,x,\mu]{\big{)}}-f(x). (5.4)

As in Section 2, we write

μNf(t,x)=𝒜μNf(t,x)+μNf(t,x),{\mathscr{L}}^{N}_{\mu}f(t,x)={\mathcal{A}}^{N}_{\mu}f(t,x)+{\mathcal{B}}^{N}_{\mu}f(t,x),

where

𝒜μNf(t,x):=Nd(𝒟μNf(t,x+σN[t,x,μ,z])𝒟μNf(t,x))ν(dz),{\mathcal{A}}^{N}_{\mu}f(t,x):=N\int_{{\mathbb{R}}^{d}}\Big{(}{\mathcal{D}}^{N}_{\mu}f{\big{(}}t,x+\sigma_{N}[t,x,\mu,z]{\big{)}}-{\mathcal{D}}^{N}_{\mu}f(t,x)\Big{)}\nu({\mathord{{\rm d}}}z),

and

μNf(t,x):=N(𝒟μNf(t,x)f(x)),𝒟μNf(t,x):=f(x+bN[t,x,μ]).{\mathcal{B}}^{N}_{\mu}f(t,x):=N({\mathcal{D}}^{N}_{\mu}f(t,x)-f(x)),\ \ {\mathcal{D}}^{N}_{\mu}f(t,x):=f{\big{(}}x+b_{N}[t,x,\mu]{\big{)}}.

Note that by the symmetry of ν\nu and the oddness σN(t,x,y,z)=σN(t,x,y,z)\sigma_{N}(t,x,y,-z)=-\sigma_{N}(t,x,y,z),

𝒜μNf(t,x):=d𝒟μNf(t,x+σN[t,x,μ,z])+𝒟μNf(t,xσN[t,x,μ,z])2𝒟μNf(t,x)2N1ν(dz).{\mathcal{A}}^{N}_{\mu}f(t,x):=\int_{{\mathbb{R}}^{d}}\frac{{\mathcal{D}}^{N}_{\mu}f{\big{(}}t,x+\sigma_{N}[t,x,\mu,z]{\big{)}}+{\mathcal{D}}^{N}_{\mu}f{\big{(}}t,x-\sigma_{N}[t,x,\mu,z]{\big{)}}-2{\mathcal{D}}^{N}_{\mu}f(t,x)}{2N^{-1}}\nu({\mathord{{\rm d}}}z).

We shall give precise choices of σN\sigma_{N} and bNb_{N} below in different cases.
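Between jumps the particles stand still; at a jump time of 𝒩^{N,i}, the i-th particle is updated through the empirical-measure averages σ_N[·,μ,·] and b_N[·,μ], as in the second line of (5.1). A hedged Python sketch of this event-driven dynamics follows (the coefficients below are arbitrary placeholders satisfying the oddness of σ_N in z, not the paper's specific choices):

```python
import numpy as np

rng = np.random.default_rng(1)

def sigma_N(t, x, y, z):
    return 0.1 * z               # odd in z, as required above

def b_N(t, x, y):
    return 0.05 * (y - x)        # a toy attractive interaction

def simulate(N, T, d=1):
    """Event-driven simulation of the particle system (5.1).

    Each particle carries an exponential clock of rate N (the N independent
    Poisson processes N^{N,i}); at a ring of clock i, particle i jumps by
    the empirical-measure averages of sigma_N and b_N, with z drawn from nu
    (here: standard Gaussian, for illustration)."""
    X = rng.normal(size=(N, d))              # initial configuration X_0^N
    t = 0.0
    while True:
        t += rng.exponential(1.0 / (N * N))  # superposition of N rate-N clocks
        if t > T:
            break
        i = rng.integers(N)                  # which clock rang (uniform)
        z = rng.normal(size=d)               # jump Delta H^{N,i}_t
        drift = np.mean([b_N(t, X[i], X[j]) for j in range(N)], axis=0)
        jump = np.mean([sigma_N(t, X[i], X[j], z) for j in range(N)], axis=0)
        X[i] = X[i] + jump + drift
    return X

X = simulate(N=20, T=1.0)
print(X.shape)
```

Since the configuration only changes at jump times, the resulting process is a pure-jump Markov chain, in line with the lattice-valued Markov chain representation mentioned in the abstract.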

5.1. Fractional diffusion with bounded interaction kernel

In this section we fix α(0,2)\alpha\in(0,2) and let

σ(t,x,y,z):+×d×d×dd,b(t,x,y):+×d×dd\sigma(t,x,y,z):{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}\times{\mathbb{R}}^{d}\times{\mathbb{R}}^{d}\to{\mathbb{R}}^{d},\ b(t,x,y):{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}\times{\mathbb{R}}^{d}\to{\mathbb{R}}^{d}

be two Borel measurable functions. We make the following assumptions:

  1. (𝐇ν,ασ,b)\rm({\bf H}^{\sigma,b}_{\nu,\alpha})

    In addition to (Hνα{}^{\alpha}_{\nu}) with α(0,2)\alpha\in(0,2), we suppose that σ\sigma and bb are continuous in (x,y)(x,y), and

    σ(t,x,y,z)=σ(t,x,y,z),|σ(t,x,y,z)|(κ0+κ1|x|)|z|,\sigma(t,x,y,-z)=-\sigma(t,x,y,z),\ \ |\sigma(t,x,y,z)|\leqslant(\kappa_{0}+\kappa_{1}|x|)|z|,

    and for the same β0\beta_{0} as in (3.5),

    |σ(t,x,y,z)σ(t,x,y,z)|(κ0+κ1|x|)(|zz|1)β0,|\sigma(t,x,y,z)-\sigma(t,x,y,z^{\prime})|\leqslant(\kappa_{0}+\kappa_{1}|x|)(|z-z^{\prime}|\wedge 1)^{\beta_{0}},

    where κ0,κ1>0\kappa_{0},\kappa_{1}>0. Moreover, for some m1m\geqslant 1 and κ2>0\kappa_{2}>0,

    |b(t,x,y)|(κ2(1+|x|))m,\displaystyle|b(t,x,y)|\leqslant{\big{(}}\kappa_{2}(1+|x|){\big{)}}^{m}, (5.5)

    and for some κ3,κ40\kappa_{3},\kappa_{4}\geqslant 0 and κ5<0\kappa_{5}<0,

    x,b(t,x,y)κ3+κ4|x|2+κ5|x|m+1.\displaystyle\langle x,b(t,x,y)\rangle\leqslant\kappa_{3}+\kappa_{4}|x|^{2}+\kappa_{5}|x|^{m+1}. (5.6)

In the above assumptions, we have assumed boundedness of the coefficients with respect to the variable yy, which imposes a restriction on the interaction kernel. In the next subsection, we relax this assumption and consider unbounded kernels. We now introduce the approximation coefficients σN\sigma_{N} and bNb_{N}, defined as in (3.19):

σN(t,x,y,z):=σ(t,x,y,N1αz),bN(t,x,y):=b(t,x,y)N+N|b(t,x,y)|11m,\displaystyle\sigma_{N}(t,x,y,z):=\sigma{\big{(}}t,x,y,N^{-\frac{1}{\alpha}}z{\big{)}},\ \ b_{N}(t,x,y):=\frac{b(t,x,y)}{N+\sqrt{N}|b(t,x,y)|^{1-\frac{1}{m}}}, (5.7)
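The taming in (5.7) keeps the drift increments of order N^{-1/2} while N b_N recovers b in the limit. A quick numerical check of these two properties (a scalar toy example with an arbitrary value of b; not from the paper):

```python
def b_tamed(b, N, m):
    """Scalar version of b_N in (5.7): b / (N + sqrt(N) * |b|^(1 - 1/m))."""
    return b / (N + N ** 0.5 * abs(b) ** (1.0 - 1.0 / m))

b, m = 7.0, 3
for N in (10, 100, 10000):
    bN = b_tamed(b, N, m)
    # the bound |b_N| <= N^{-1/2} |b|^{1/m} used in (5.11)
    assert abs(bN) <= N ** -0.5 * abs(b) ** (1.0 / m)
    print(N, N * bN)             # N * b_N approaches b as N grows

assert abs(10000 * b_tamed(b, 10000, m) - b) < 0.3
```

This illustrates why the particle system is well defined for coefficients of polynomial growth: a single jump can never move a particle by more than O(N^{-1/2}(1 + |x|)).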

and also define for t0t\geqslant 0 and μ𝒫(d)\mu\in{\mathcal{P}}({\mathbb{R}}^{d}),

μf(t,x):=t,μf(x):=𝒜μf(t,x)+b[t,x,μ]f(x),\displaystyle{\mathscr{L}}^{\infty}_{\mu}f(t,x):={\mathscr{L}}^{\infty}_{t,\mu}f(x):={\mathcal{A}}^{\infty}_{\mu}f(t,x)+b[t,x,\mu]\cdot\nabla f(x), (5.8)

where

𝒜μf(t,x):=df(x+σ[t,x,μ,z])+f(xσ[t,x,μ,z])2f(x)2ν0(dz),{\mathcal{A}}^{\infty}_{\mu}f(t,x):=\int_{{\mathbb{R}}^{d}}\frac{f(x+\sigma[t,x,\mu,z])+f(x-\sigma[t,x,\mu,z])-2f(x)}{2}\nu_{0}({\mathord{{\rm d}}}z),

and ν0\nu_{0} is the Lévy measure from (Hνα{}^{\alpha}_{\nu}). We consider the following McKean-Vlasov SDE:

dXt=dσ[t,Xt,μXt,z]~(dt,dz)+b[t,Xt,μXt]dt,\displaystyle{\mathord{{\rm d}}}X_{t}=\int_{{\mathbb{R}}^{d}}\sigma\big{[}t,X_{t-},\mu_{X_{t-}},z\big{]}\widetilde{\mathcal{H}}({\mathord{{\rm d}}}t,{\mathord{{\rm d}}}z)+b[t,X_{t},\mu_{X_{t}}]{\mathord{{\rm d}}}t, (5.9)

where ~\widetilde{\mathcal{H}} is defined as in (3.23) and μXt\mu_{X_{t}} is the law of XtX_{t}. By Itô’s formula, the nonlinear time-inhomogeneous infinitesimal generator of XtX_{t} is given by t,μXt{\mathscr{L}}^{\infty}_{t,\mu_{X_{t}}}.

The following lemma is the analogue of Lemma 3.9.

Lemma 5.1.

Under (𝐇ν,ασ,b)\rm({\bf H}^{\sigma,b}_{\nu,\alpha}), where α(0,2)\alpha\in(0,2), for any R>0R>0, there is a constant CR>0C_{R}>0 such that for any fCb2(d)f\in C^{2}_{b}({\mathbb{R}}^{d}) and NN\in{\mathbb{N}},

supt0sup|x|Rsupμ𝒫(d)|t,μNf(x)t,μf(x)|CRN2α2β1fCb2,\sup_{t\geqslant 0}\sup_{|x|\leqslant R}\sup_{\mu\in{\mathcal{P}}({\mathbb{R}}^{d})}\big{|}{\mathscr{L}}^{N}_{t,\mu}f(x)-{\mathscr{L}}^{\infty}_{t,\mu}f(x)\big{|}\leqslant C_{R}N^{-\frac{2-\alpha}{2}\wedge\beta_{1}}\|f\|_{C^{2}_{b}},

where β1\beta_{1} is from (Hνα{}^{\alpha}_{\nu}). Moreover, if bb is bounded and measurable and κ1=0\kappa_{1}=0, then CRC_{R} can be chosen independent of R>0R>0.

Proof.

Below we drop the time variable for simplicity. Recall that

μNf(x)=N(f(x+bN[x,μ])f(x)).{\mathcal{B}}^{N}_{\mu}f(x)=N(f(x+b_{N}[x,\mu])-f(x)).

By Taylor’s expansion and the definition (5.7), we have

|μNf(x)b[x,μ]f(x)|\displaystyle|{\mathcal{B}}^{N}_{\mu}f(x)-b[x,\mu]\cdot\nabla f(x)| |μNf(x)NbN[x,μ]f(x)|+|(NbN[x,μ]b[x,μ])f(x)|\displaystyle\leqslant|{\mathcal{B}}^{N}_{\mu}f(x)-Nb_{N}[x,\mu]\cdot\nabla f(x)|+|(Nb_{N}[x,\mu]-b[x,\mu])\cdot\nabla f(x)|
N|bN[x,μ]|01(|f(x+θbN[x,μ])f(x)|)dθ\displaystyle\leqslant N|b_{N}[x,\mu]|\int^{1}_{0}(|\nabla f(x+\theta b_{N}[x,\mu])-\nabla f(x)|){\mathord{{\rm d}}}\theta
+|NbN[x,μ]b[x,μ]|f\displaystyle\quad+|Nb_{N}[x,\mu]-b[x,\mu]|\cdot\|\nabla f\|_{\infty}
N|bN[x,μ]|22f+dN|b(x,y)|21mN+N|b(x,y)|11mμ(dy)f\displaystyle\leqslant N|b_{N}[x,\mu]|^{2}\|\nabla^{2}f\|_{\infty}+\int_{{\mathbb{R}}^{d}}\frac{\sqrt{N}|b(x,y)|^{2-\frac{1}{m}}}{N+\sqrt{N}|b(x,y)|^{1-\frac{1}{m}}}\mu({\mathord{{\rm d}}}y)\|\nabla f\|_{\infty}
d(|b(x,y)|2N+|b(x,y)|21mN)μ(dy)fCb1.\displaystyle\leqslant\int_{{\mathbb{R}}^{d}}\left(\frac{|b(x,y)|^{2}}{N}+\frac{|b(x,y)|^{2-\frac{1}{m}}}{\sqrt{N}}\right)\mu({\mathord{{\rm d}}}y)\|\nabla f\|_{C^{1}_{b}}.

Under (5.5), we clearly have

sup|x|Rsupμ𝒫(d)|μNf(x)b[x,μ]f(x)|CRN12fCb1.\sup_{|x|\leqslant R}\sup_{\mu\in{\mathcal{P}}({\mathbb{R}}^{d})}|{\mathcal{B}}^{N}_{\mu}f(x)-b[x,\mu]\cdot\nabla f(x)|\leqslant C_{R}N^{-\frac{1}{2}}\|\nabla f\|_{C^{1}_{b}}.

Moreover, as in (3.29) we also have

sup|x|Rsupμ𝒫(d)|𝒜μNf(x)𝒜μf(x)|CRN(1α2)β1fCb2.\sup_{|x|\leqslant R}\sup_{\mu\in{\mathcal{P}}({\mathbb{R}}^{d})}\big{|}{\mathcal{A}}^{N}_{\mu}f(x)-{\mathcal{A}}^{\infty}_{\mu}f(x)\big{|}\leqslant C_{R}N^{-(1-\frac{\alpha}{2})\wedge\beta_{1}}\|f\|_{C^{2}_{b}}.

Combining the above two estimates, we obtain the desired estimate. ∎

The following lemma is similar to Lemma 3.12.

Lemma 5.2.

Under (𝐇ν,ασ,b)\rm({\bf H}^{\sigma,b}_{\nu,\alpha}), where α(0,2)\alpha\in(0,2), for any β(0,α)\beta\in(0,\alpha), there are constants N0N_{0}\in{\mathbb{N}}, C0=C0(β)>0C_{0}=C_{0}(\beta)>0, C1=C1(β,ν)>0C_{1}=C_{1}(\beta,\nu)>0 and C2>0C_{2}>0 such that for all NN0N\geqslant N_{0}, μ𝒫(d)\mu\in{\mathcal{P}}({\mathbb{R}}^{d}) and t0t\geqslant 0, xdx\in{\mathbb{R}}^{d},

t,μNUβ(x)(C0κ6+C1κ1α)Uβ(x)+C2,{\mathscr{L}}^{N}_{t,\mu}U_{\beta}(x)\leqslant(C_{0}\kappa_{6}+C_{1}\kappa^{\alpha}_{1})U_{\beta}(x)+C_{2},

where Uβ(x)=(1+|x|2)β/2U_{\beta}(x)=(1+|x|^{2})^{\beta/2} and κ6\kappa_{6} is given in (3.30).

Proof.

For simplicity we drop the time variable. For μNUβ(x){\mathcal{B}}^{N}_{\mu}U_{\beta}(x), by Taylor’s expansion we have

μNUβ(x)\displaystyle{\mathcal{B}}^{N}_{\mu}U_{\beta}(x) =N01bN[x,μ],Uβ(x+θbN[x,μ])dθ\displaystyle=N\int^{1}_{0}\langle b_{N}[x,\mu],\nabla U_{\beta}(x+\theta b_{N}[x,\mu])\rangle{\mathord{{\rm d}}}\theta
=βN01bN[x,μ],x+θbN[x,μ]Uβ2(x+θbN[x,μ])dθ.\displaystyle=\beta N\int^{1}_{0}\langle b_{N}[x,\mu],x+\theta b_{N}[x,\mu]\rangle U_{\beta-2}(x+\theta b_{N}[x,\mu]){\mathord{{\rm d}}}\theta. (5.10)

By (5.5) and (5.6), for any ε0>0\varepsilon_{0}>0, there is N0N_{0} large enough such that for all NN0N\geqslant N_{0},

|bN(x,y)|N1/2|b(x,y)|1mN1/2κ2(1+|x|)ε0(1+|x|),\displaystyle|b_{N}(x,y)|\leqslant N^{-1/2}|b(x,y)|^{\frac{1}{m}}\leqslant N^{-1/2}\kappa_{2}(1+|x|)\leqslant\varepsilon_{0}(1+|x|), (5.11)

and as in Lemma 3.11, for the κ6\kappa_{6} given in (3.30),

Nx,bN(x,y)+N|bN(x,y)|2κ6|x|2+C1.N\langle x,b_{N}(x,y)\rangle+N|b_{N}(x,y)|^{2}\leqslant\kappa_{6}|x|^{2}+C_{1}.

Thus, for all μ𝒫(d)\mu\in{\mathcal{P}}({\mathbb{R}}^{d}) and θ(0,1)\theta\in(0,1),

NbN[x,μ],x+θbN[x,μ]Nd(bN(x,y),x+|bN(x,y)|2)μ(dy)κ6|x|2+C1\displaystyle N\langle b_{N}[x,\mu],x+\theta b_{N}[x,\mu]\rangle\leqslant N\int_{{\mathbb{R}}^{d}}{\big{(}}\langle b_{N}(x,y),x\rangle+|b_{N}(x,y)|^{2}{\big{)}}\mu({\mathord{{\rm d}}}y)\leqslant\kappa_{6}|x|^{2}+C_{1}

and

(1+|x|2)/21+|x+θbN[x,μ]|22(1+|x|2).(1+|x|^{2})/2\leqslant 1+|x+\theta b_{N}[x,\mu]|^{2}\leqslant 2(1+|x|^{2}).

Hence, as in (3.36), we have

μNUβ(x)C0κ6Uβ(x)+C.\displaystyle{\mathcal{B}}^{N}_{\mu}U_{\beta}(x)\leqslant C_{0}\kappa_{6}U_{\beta}(x)+C.

For 𝒜μNUβ(x){\mathcal{A}}^{N}_{\mu}U_{\beta}(x), as in (3.37) we also have

|𝒜μNUβ(x)|C0κ1αUβ(x)+C.\displaystyle|{\mathcal{A}}^{N}_{\mu}U_{\beta}(x)|\leqslant C_{0}\kappa_{1}^{\alpha}U_{\beta}(x)+C.

Combining the above two estimates, we obtain the desired estimate. ∎

By the above Lyapunov estimate and Itô’s formula, the following corollary is proved in the same way as Corollary 3.14; we omit the details.

Corollary 5.3.

Under (𝐇ν,ασ,b)\rm({\bf H}^{\sigma,b}_{\nu,\alpha}), for any β(0,α)\beta\in(0,\alpha) and T>0T>0, there is a constant C>0C>0 such that

supi=1,,N𝔼(supt[0,T]Uβ(XtN,i))C(1+𝔼Uβ(X0)),\displaystyle\sup_{i=1,\cdots,N}{\mathbb{E}}\left(\sup_{t\in[0,T]}U_{\beta}(X^{N,i}_{t})\right)\leqslant C(1+{\mathbb{E}}U_{\beta}(X_{0})), (5.12)

where Uβ(x)=(1+|x|2)β/2U_{\beta}(x)=(1+|x|^{2})^{\beta/2}. Moreover, there is a constant C2>0C_{2}>0 such that for all t>0t>0,

𝔼Uβ(XtN,i)eκ7t𝔼Uβ(X0)+C2(eκ7t1)/κ7,\displaystyle{\mathbb{E}}U_{\beta}(X^{N,i}_{t})\leqslant\mathrm{e}^{\kappa_{7}t}{\mathbb{E}}U_{\beta}(X_{0})+C_{2}(\mathrm{e}^{\kappa_{7}t}-1)/\kappa_{7}, (5.13)

where κ7:=C0κ6+C1κ1α\kappa_{7}:=C_{0}\kappa_{6}+C_{1}\kappa^{\alpha}_{1}\in{\mathbb{R}} (see Lemma 5.2).

The following lemma is similar to Lemma 3.15.

Lemma 5.4.

Under (𝐇ν,ασ,b)\rm({\bf H}^{\sigma,b}_{\nu,\alpha}), for any T,γ>0T,\gamma>0, it holds that

limδ0supNsupτητ+δT(|XηN,1XτN,1|γ)=0.\displaystyle\lim_{\delta\to 0}\sup_{N}\sup_{\tau\leqslant\eta\leqslant\tau+\delta\leqslant T}{\mathbb{P}}\Big{(}|X^{N,1}_{\eta}-X^{N,1}_{\tau}|\geqslant\gamma\Big{)}=0. (5.14)
Proof.

Let τ,η𝒯T\tau,\eta\in{\mathscr{T}}_{T} with τητ+δ\tau\leqslant\eta\leqslant\tau+\delta. For fixed R>0R>0, define

ζR:=inf{t>0:|XtN,1|>R},τR:=ζRτ,ηR:=ζRη.\zeta_{R}:=\inf\left\{t>0:|X^{N,1}_{t}|>R\right\},\ \ \tau_{R}:=\zeta_{R}\wedge\tau,\ \eta_{R}:=\zeta_{R}\wedge\eta.

By (5.1), we can write

XηRN,1XτRN,1\displaystyle X^{N,1}_{\eta_{R}}-X^{N,1}_{\tau_{R}} =τRηRbN[s,XsN,1,μ𝐗sN]d𝒩sN,1+τRηR|z|<N1ασN[s,XsN,1,μ𝐗sN,z]N,1(ds,dz)\displaystyle=\int^{\eta_{R}}_{\tau_{R}}b_{N}{\Big{[}}s,X^{N,1}_{s-},\mu_{{\mathbf{X}}^{N}_{s-}}{\Big{]}}{\mathord{{\rm d}}}{\mathcal{N}}^{N,1}_{s}+\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|<N^{\frac{1}{\alpha}}}\sigma_{N}{\Big{[}}s,X^{N,1}_{s-},\mu_{{\mathbf{X}}^{N}_{s-}},z{\Big{]}}{\mathcal{H}}^{N,1}({\mathord{{\rm d}}}s,{\mathord{{\rm d}}}z)
+τRηR|z|>N1ασN[s,XsN,1,μ𝐗sN,z]N,1(ds,dz)=:I1+I2+I3.\displaystyle\quad+\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|>N^{\frac{1}{\alpha}}}\sigma_{N}{\Big{[}}s,X^{N,1}_{s-},\mu_{{\mathbf{X}}^{N}_{s-}},z{\Big{]}}{\mathcal{H}}^{N,1}({\mathord{{\rm d}}}s,{\mathord{{\rm d}}}z)=:I_{1}+I_{2}+I_{3}.

For I1I_{1}, by (5.5) and ηRτRδ\eta_{R}-\tau_{R}\leqslant\delta, we have

𝔼|I1|\displaystyle{\mathbb{E}}|I_{1}| 1N𝔼(τRηR|b[s,XsN,1,μ𝐗sN]|d𝒩sN,1)\displaystyle\leqslant\frac{1}{N}{\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}\Big{|}b{\Big{[}}s,X^{N,1}_{s-},\mu_{{\mathbf{X}}^{N}_{s-}}{\Big{]}}\Big{|}{\mathord{{\rm d}}}{\mathcal{N}}^{N,1}_{s}\right)
=𝔼(τRηR|b[s,XsN,1,μ𝐗sN]|ds)CRδ.\displaystyle={\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}\Big{|}b{\Big{[}}s,X^{N,1}_{s},\mu_{{\mathbf{X}}^{N}_{s}}{\Big{]}}\Big{|}{\mathord{{\rm d}}}s\right)\leqslant C_{R}\delta.

For I2I_{2}, by (3.10) and the isometry of stochastic integrals, we have

𝔼|I2|2\displaystyle{\mathbb{E}}|I_{2}|^{2} =𝔼|τRηR|z|<N1ασN[s,XsN,1,μ𝐗sN,z]~N,1(ds,dz)|2\displaystyle={\mathbb{E}}\left|\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|<N^{\frac{1}{\alpha}}}\sigma_{N}{\Big{[}}s,X^{N,1}_{s-},\mu_{{\mathbf{X}}^{N}_{s-}},z{\Big{]}}\widetilde{\mathcal{H}}^{N,1}({\mathord{{\rm d}}}s,{\mathord{{\rm d}}}z)\right|^{2}
=𝔼(τRηR|z|<N1α|σN[s,XsN,1,μ𝐗sN,z]|2ν(dz)d(Ns)).\displaystyle={\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|<N^{\frac{1}{\alpha}}}\Big{|}\sigma_{N}{\Big{[}}s,X^{N,1}_{s},\mu_{{\mathbf{X}}^{N}_{s-}},z{\Big{]}}\Big{|}^{2}\nu({\mathord{{\rm d}}}z){\mathord{{\rm d}}}(Ns)\right).

Let νN(dz)=Nν(N1/αdz).\nu_{N}({\mathord{{\rm d}}}z)=N\nu(N^{1/\alpha}{\mathord{{\rm d}}}z). By the change of variables, we further have

𝔼|I2|2\displaystyle{\mathbb{E}}|I_{2}|^{2} =𝔼(τRηR|z|<1|σ[s,XsN,1,μ𝐗sN,z]|2νN(dz)ds)\displaystyle={\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|<1}\Big{|}\sigma{\Big{[}}s,X^{N,1}_{s},\mu_{{\mathbf{X}}^{N}_{s-}},z{\Big{]}}\Big{|}^{2}\nu_{N}({\mathord{{\rm d}}}z){\mathord{{\rm d}}}s\right)
(κ0+κ1R)2(|z|<1|z|2νN(dz))δ(3.8)CRδ.\displaystyle\leqslant(\kappa_{0}+\kappa_{1}R)^{2}\left(\int_{|z|<1}|z|^{2}\nu_{N}({\mathord{{\rm d}}}z)\right)\delta\stackrel{{\scriptstyle\eqref{VV9}}}{{\leqslant}}C_{R}\delta.

For I3I_{3}, let β(0,α1)\beta\in(0,\alpha\wedge 1). By |iai|βi|ai|β|\sum_{i}a_{i}|^{\beta}\leqslant\sum_{i}|a_{i}|^{\beta}, we have

𝔼|I3|β\displaystyle{\mathbb{E}}|I_{3}|^{\beta} 𝔼(τRηR|z|N1α|σN[s,XsN,1,μ𝐗sN,z]|βN,1(ds,dz))\displaystyle\leqslant{\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|\geqslant N^{\frac{1}{\alpha}}}\Big{|}\sigma_{N}{\Big{[}}s,X^{N,1}_{s},\mu_{{\mathbf{X}}^{N}_{s-}},z{\Big{]}}\Big{|}^{\beta}{\mathcal{H}}^{N,1}({\mathord{{\rm d}}}s,{\mathord{{\rm d}}}z)\right)
=𝔼(τRηR|z|N1α|σN[s,XsN,1,μ𝐗sN,z]|βν(dz)d(Ns))\displaystyle={\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|\geqslant N^{\frac{1}{\alpha}}}\Big{|}\sigma_{N}{\Big{[}}s,X^{N,1}_{s},\mu_{{\mathbf{X}}^{N}_{s-}},z{\Big{]}}\Big{|}^{\beta}\nu({\mathord{{\rm d}}}z){\mathord{{\rm d}}}(Ns)\right)
=𝔼(τRηR|z|1|σ[s,XsN,1,μ𝐗sN,z]|βνN(dz)ds)\displaystyle={\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{|z|\geqslant 1}\Big{|}\sigma{\Big{[}}s,X^{N,1}_{s},\mu_{{\mathbf{X}}^{N}_{s-}},z{\Big{]}}\Big{|}^{\beta}\nu_{N}({\mathord{{\rm d}}}z){\mathord{{\rm d}}}s\right)
(κ0+κ1R)β(|z|1|z|βνN(dz))δ(3.8)CRδ.\displaystyle\leqslant(\kappa_{0}+\kappa_{1}R)^{\beta}\left(\int_{|z|\geqslant 1}|z|^{\beta}\nu_{N}({\mathord{{\rm d}}}z)\right)\delta\stackrel{{\scriptstyle\eqref{VV9}}}{{\leqslant}}C_{R}\delta.

Hence, by Chebyshev’s inequality and (5.12),

(|XηN,1XτN,1|γ)\displaystyle{\mathbb{P}}(|X^{N,1}_{\eta}-X^{N,1}_{\tau}|\geqslant\gamma) (|XηRN,1XτRN,1|γ;ζR>T)+(ζRT)\displaystyle\leqslant{\mathbb{P}}(|X^{N,1}_{\eta_{R}}-X^{N,1}_{\tau_{R}}|\geqslant\gamma;\zeta_{R}>T)+{\mathbb{P}}(\zeta_{R}\leqslant T)
i=13(|Ii|γ3)+(supt[0,T]|XtN,1|R)\displaystyle\leqslant\sum_{i=1}^{3}{\mathbb{P}}(|I_{i}|\geqslant\tfrac{\gamma}{3})+{\mathbb{P}}\left(\sup_{t\in[0,T]}|X^{N,1}_{t}|\geqslant R\right)
3γ𝔼|I1|+(3γ)2𝔼|I2|2+(3γ)β𝔼|I3|β+CRβ\displaystyle\leqslant\tfrac{3}{\gamma}{\mathbb{E}}|I_{1}|+(\tfrac{3}{\gamma})^{2}{\mathbb{E}}|I_{2}|^{2}+(\tfrac{3}{\gamma})^{\beta}{\mathbb{E}}|I_{3}|^{\beta}+\tfrac{C}{R^{\beta}}
CR,γδ+C/Rβ,\displaystyle\leqslant C_{R,\gamma}\delta+C/R^{\beta},

which converges to zero by firstly letting δ0\delta\to 0 and then RR\to\infty. ∎

We can now state the main result of this subsection concerning the propagation of chaos.

Theorem 5.5.

Let μ0𝒫(d)\mu_{0}\in{\mathcal{P}}({\mathbb{R}}^{d}) and NN\in{\mathbb{N}}. Suppose that for any kNk\leqslant N,

(X0N,1,,X0N,k)1μ0k,N,\displaystyle{\mathbb{P}}\circ{\big{(}}X^{N,1}_{0},\cdots,X^{N,k}_{0}{\big{)}}^{-1}\to\mu_{0}^{\otimes k},\ \ N\to\infty, (5.15)

and DDSDE (5.9) admits a unique martingale solution 00μ0(){\mathbb{P}}_{0}\in{\mathcal{M}}^{\mu_{0}}_{0}({\mathscr{L}}^{\infty}) with initial distribution μ0\mu_{0} in the sense of Definition 6.2 in appendix. Then under (𝐇ν,ασ,b)\rm({\bf H}^{\sigma,b}_{\nu,\alpha}), for any kNk\leqslant N,

(XN,1,,XN,k)10k,N.\displaystyle{\mathbb{P}}\circ{\big{(}}X^{N,1}_{\cdot},\cdots,X^{N,k}_{\cdot}{\big{)}}^{-1}\to{\mathbb{P}}_{0}^{\otimes k},\ \ N\to\infty. (5.16)
Proof.

We use the classical martingale method (see [18]). Consider the following random measure with values in 𝒫(𝔻){\mathcal{P}}({\mathbb{D}}),

ωΠN(ω,dw):=1Ni=1NδXN,i(ω)(dw)𝒫(𝔻).\omega\to\Pi_{N}(\omega,{\mathord{{\rm d}}}w):=\frac{1}{N}\sum_{i=1}^{N}\delta_{X^{N,i}_{\cdot}(\omega)}({\mathord{{\rm d}}}w)\in{\mathcal{P}}({\mathbb{D}}).

By Lemma 5.4, Aldous’ criterion (see [23]) and [40, (ii) of Proposition 2.2], the law of ΠN\Pi_{N} in 𝒫(𝒫(𝔻)){\mathcal{P}}({\mathcal{P}}({\mathbb{D}})) is tight. Without loss of generality (passing to a subsequence if necessary), we assume that the law of ΠN\Pi_{N} weakly converges to some Π𝒫(𝒫(𝔻))\Pi_{\infty}\in{\mathcal{P}}({\mathcal{P}}({\mathbb{D}})). Our aim below is to show that Π\Pi_{\infty} is a Dirac measure, i.e.,

Π(dη)=δ0(dη),\Pi_{\infty}({\mathord{{\rm d}}}\eta)=\delta_{{\mathbb{P}}_{0}}({\mathord{{\rm d}}}\eta),

where 00μ0(){\mathbb{P}}_{0}\in{\mathcal{M}}_{0}^{\mu_{0}}({\mathscr{L}}^{\infty}) is the unique martingale solution of DDSDE (5.9). If we can show the above assertion, then by [40, (i) of Proposition 2.2], we conclude (5.16).

Let fCc2(d)f\in C^{2}_{c}({\mathbb{R}}^{d}). For given η𝒫(𝔻)\eta\in{\mathcal{P}}({\mathbb{D}}), we define a functional Mηf(t,)M^{f}_{\eta}(t,\cdot) on 𝔻{\mathbb{D}} by

Mηf(t,w):=f(wt)f(w0)0ts,ηf(ws)ds,t0,w𝔻,M^{f}_{\eta}(t,w):=f(w_{t})-f(w_{0})-\int^{t}_{0}{\mathscr{L}}^{\infty}_{s,\eta}f(w_{s}){\mathord{{\rm d}}}s,\ t\geqslant 0,\ w\in{\mathbb{D}},

where s,η{\mathscr{L}}^{\infty}_{s,\eta} is defined by (5.8) with μ=ηs\mu=\eta_{s}. Fix nn\in{\mathbb{N}} and sts\leqslant t. For given gCc(nd)g\in C_{c}({\mathbb{R}}^{nd}) and 0s1<<sns0\leqslant s_{1}<\cdots<s_{n}\leqslant s, we also introduce a functional Ξfg\Xi^{g}_{f} over 𝒫(𝔻){\mathcal{P}}({\mathbb{D}}) by

Ξfg(η):=𝔻(Mηf(t,w)Mηf(s,w))g(ws1,,wsn)η(dw).\Xi^{g}_{f}(\eta):=\int_{{\mathbb{D}}}{\big{(}}M^{f}_{\eta}(t,w)-M^{f}_{\eta}(s,w){\big{)}}g(w_{s_{1}},\cdots,w_{s_{n}})\eta({\mathord{{\rm d}}}w).

By definition we have

Ξfg(η)=𝔻(f(wt)f(ws)str,ηf(wr)dr)g(ws1,,wsn)η(dw)\Xi^{g}_{f}(\eta)=\int_{{\mathbb{D}}}\left(f(w_{t})-f(w_{s})-\int^{t}_{s}{\mathscr{L}}^{\infty}_{r,\eta}f(w_{r}){\mathord{{\rm d}}}r\right)g(w_{s_{1}},\cdots,w_{s_{n}})\eta({\mathord{{\rm d}}}w)

and

Ξfg(ΠN)=1Ni=1N[(f(XtN,i)f(XsN,i)str,ΠNf(XrN,i)dr)g(Xs1N,i,,XsnN,i)].\displaystyle\Xi^{g}_{f}(\Pi_{N})=\frac{1}{N}\sum_{i=1}^{N}\left[\left(f(X^{N,i}_{t})-f(X^{N,i}_{s})-\int^{t}_{s}{\mathscr{L}}^{\infty}_{r,\Pi_{N}}f(X^{N,i}_{r}){\mathord{{\rm d}}}r\right)g{\big{(}}X^{N,i}_{s_{1}},\cdots,X^{N,i}_{s_{n}}{\big{)}}\right]. (5.17)

By definition (5.8) and (𝐇ν,ασ,b)\rm({\bf H}^{\sigma,b}_{\nu,\alpha}), it is easy to see that

ηΞfg(η)\eta\mapsto\Xi^{g}_{f}(\eta) is bounded and continuous on 𝒫(𝔻){\mathcal{P}}({\mathbb{D}}).

Hence, by the weak convergence of ΠN\Pi_{N} to Π\Pi_{\infty},

limN𝔼|Ξfg(ΠN)|=𝒫(𝔻)|Ξfg(η)|Π(dη).\displaystyle\lim_{N\to\infty}{\mathbb{E}}|\Xi^{g}_{f}(\Pi_{N})|=\int_{{\mathcal{P}}({\mathbb{D}})}|\Xi^{g}_{f}(\eta)|\Pi_{\infty}({\mathord{{\rm d}}}\eta). (5.18)

On the other hand, let

Ξ~fg:=1Ni=1N[(f(XtN,i)f(XsN,i)str,ΠNNf(XrN,i)dr)g(Xs1N,i,,XsnN,i)],\displaystyle\widetilde{\Xi}^{g}_{f}:=\frac{1}{N}\sum_{i=1}^{N}\left[\left(f(X^{N,i}_{t})-f(X^{N,i}_{s})-\int^{t}_{s}{\mathscr{L}}^{N}_{r,\Pi_{N}}f(X^{N,i}_{r}){\mathord{{\rm d}}}r\right)g{\big{(}}X^{N,i}_{s_{1}},\cdots,X^{N,i}_{s_{n}}{\big{)}}\right], (5.19)

where ΠNN{\mathscr{L}}^{N}_{\Pi_{N}} is defined by (5.3). By Itô’s formula (5.2), we have

Ξ~fg=1Ni=1N[(stdΘr,ΠNNf(XrN,i,z)~N,i(dr,dz))g(Xs1N,i,,XsnN,i)],\widetilde{\Xi}^{g}_{f}=\frac{1}{N}\sum_{i=1}^{N}\left[\left(\int^{t}_{s}\!\!\int_{{\mathbb{R}}^{d}}\Theta^{N}_{r,\Pi_{N}}f(X^{N,i}_{r-},z)\widetilde{\mathcal{H}}^{N,i}({\mathord{{\rm d}}}r,{\mathord{{\rm d}}}z)\right)g{\big{(}}X^{N,i}_{s_{1}},\cdots,X^{N,i}_{s_{n}}{\big{)}}\right],

where Θr,ΠNNf\Theta^{N}_{r,\Pi_{N}}f is defined by (5.4). By the isometry of stochastic integrals,

𝔼|Ξ~fg|2\displaystyle{\mathbb{E}}|\widetilde{\Xi}^{g}_{f}|^{2} g2N𝔼(i=1Nstd|Θr,ΠNNf(XrN,i,z)|2ν(dz)ds).\displaystyle\leqslant\frac{\|g\|^{2}_{\infty}}{N}{\mathbb{E}}\left(\sum_{i=1}^{N}\int^{t}_{s}\!\!\int_{{\mathbb{R}}^{d}}|\Theta^{N}_{r,\Pi_{N}}f(X^{N,i}_{r},z)|^{2}\nu({\mathord{{\rm d}}}z){\mathord{{\rm d}}}s\right).

Let β(0,α2)\beta\in(0,\frac{\alpha}{2}). Noting that by (5.4), (5.11) and |σ(r,x,y,z)|(κ0+κ1|x|)|z||\sigma(r,x,y,z)|\leqslant(\kappa_{0}+\kappa_{1}|x|)|z|,

|Θr,ηNf(x,z)|\displaystyle|\Theta^{N}_{r,\eta}f(x,z)| (|bN[r,x,ηr]|β+|σN[r,x,ηr,z]|β)fCbβ\displaystyle\leqslant(|b_{N}[r,x,\eta_{r-}]|^{\beta}+|\sigma_{N}[r,x,\eta_{r-},z]|^{\beta})\|f\|_{C^{\beta}_{b}}
(1+|x|βNβ/2+(1+|x|β)|z|βNβ/α)fCbβ,\displaystyle\lesssim{\big{(}}\tfrac{1+|x|^{\beta}}{N^{\beta/2}}+(1+|x|^{\beta})\tfrac{|z|^{\beta}}{N^{\beta/\alpha}}{\big{)}}\|f\|_{C^{\beta}_{b}},

by Lemma 3.3 and (5.12), we have

𝔼|Ξ~fg|2g2fCbβ2N1+β𝔼(i=1Nst(1+|XrN,i|2β)dr)1Nβ,\displaystyle{\mathbb{E}}|\widetilde{\Xi}^{g}_{f}|^{2}\lesssim\frac{\|g\|^{2}_{\infty}\|f\|_{C^{\beta}_{b}}^{2}}{N^{1+\beta}}{\mathbb{E}}\left(\sum_{i=1}^{N}\int^{t}_{s}{\big{(}}1+|X^{N,i}_{r}|^{2\beta}{\big{)}}{\mathord{{\rm d}}}r\right)\lesssim\frac{1}{N^{\beta}}, (5.20)

where the implicit constant does not depend on NN.

Claim: The following limit holds:

limN𝔼|Ξfg(ΠN)Ξ~fg|=0.\displaystyle\lim_{N\to\infty}{\mathbb{E}}|\Xi^{g}_{f}(\Pi_{N})-\widetilde{\Xi}^{g}_{f}|=0. (5.21)

Indeed, by definition (5.17), (5.19) and Lemma 5.1, for any R1R\geqslant 1, we have

𝔼|Ξfg(ΠN)Ξ~fg|\displaystyle{\mathbb{E}}|\Xi^{g}_{f}(\Pi_{N})-\widetilde{\Xi}^{g}_{f}| 1Ni=1N𝔼(st|r,ΠNfr,ΠNNf|(XrN,i)dr)g\displaystyle\leqslant\frac{1}{N}\sum_{i=1}^{N}{\mathbb{E}}\left(\int^{t}_{s}|{\mathscr{L}}^{\infty}_{r,\Pi_{N}}f-{\mathscr{L}}^{N}_{r,\Pi_{N}}f|(X^{N,i}_{r}){\mathord{{\rm d}}}r\right)\|g\|_{\infty}
suprtsup|x|Rsupμ𝒫(d)|r,μf(x)r,μNf(x)|g\displaystyle\leqslant\sup_{r\leqslant t}\sup_{|x|\leqslant R}\sup_{\mu\in{\mathcal{P}}({\mathbb{R}}^{d})}|{\mathscr{L}}^{\infty}_{r,\mu}f(x)-{\mathscr{L}}^{N}_{r,\mu}f(x)|\cdot\|g\|_{\infty}
+suprt(supμ𝒫(d)(r,μf+r,μNf)supi(|XrN,i|R))g\displaystyle+\sup_{r\leqslant t}\Big{(}\sup_{\mu\in{\mathcal{P}}({\mathbb{R}}^{d})}{\big{(}}\|{\mathscr{L}}^{\infty}_{r,\mu}f\|_{\infty}+\|{\mathscr{L}}^{N}_{r,\mu}f\|_{\infty}{\big{)}}\sup_{i}{\mathbb{P}}(|X^{N,i}_{r}|\geqslant R)\Big{)}\|g\|_{\infty}
CRN(2α2)β1+Csupisuprt𝔼|XrN,i|β/Rβ,\displaystyle\leqslant C_{R}N^{-(\frac{2-\alpha}{2})\wedge\beta_{1}}+C\sup_{i}\sup_{r\leqslant t}{\mathbb{E}}|X^{N,i}_{r}|^{\beta}/R^{\beta},

which yields (5.21) by (5.12).

Combining (5.18), (5.20) and (5.21) we obtain that for each fCc2(d)f\in C^{2}_{c}({\mathbb{R}}^{d}) and nn\in{\mathbb{N}}, gCc(nd)g\in C_{c}({\mathbb{R}}^{nd}),

𝒫(𝔻)|Ξfg(η)|Π(dη)=0Ξfg(η)=0 for Π-a.s. η𝒫(𝔻).\int_{{\mathcal{P}}({\mathbb{D}})}|\Xi^{g}_{f}(\eta)|\Pi_{\infty}({\mathord{{\rm d}}}\eta)=0\Rightarrow\Xi^{g}_{f}(\eta)=0\mbox{ for $\Pi_{\infty}$-a.s. $\eta\in{\mathcal{P}}({\mathbb{D}})$}.

Since Cc2(d)C^{2}_{c}({\mathbb{R}}^{d}) and Cc(nd)C_{c}({\mathbb{R}}^{nd}) are separable, one can find a common Π\Pi_{\infty}-null set 𝒬𝒫(𝔻){\mathcal{Q}}\subset{\mathcal{P}}({\mathbb{D}}) such that for all η𝒬\eta\notin{\mathcal{Q}} and for all 0s<tT0\leqslant s<t\leqslant T, fCc2(d)f\in C_{c}^{2}({\mathbb{R}}^{d}) and nn\in{\mathbb{N}}, gCc(nd)g\in C_{c}({\mathbb{R}}^{nd}), 0s1<<sns0\leqslant s_{1}<\cdots<s_{n}\leqslant s,

Ξfg(η)=𝔻(Mηf(t,w)Mηf(s,w))g(ws1,,wsn)η(dw)=0.\Xi^{g}_{f}(\eta)=\int_{{\mathbb{D}}}{\big{(}}M^{f}_{\eta}(t,w)-M^{f}_{\eta}(s,w){\big{)}}g(w_{s_{1}},\cdots,w_{s_{n}})\eta({\mathord{{\rm d}}}w)=0.

Moreover, by (5.15), we also have

Π{η𝒫(𝔻):η0=μ0}=1.\Pi_{\infty}\{\eta\in{\mathcal{P}}({\mathbb{D}}):\eta_{0}=\mu_{0}\}=1.

Thus by the definition of 0μ0(){\mathcal{M}}^{\mu_{0}}_{0}({\mathscr{L}}^{\infty}) (see Definition 6.2 in appendix), for Π\Pi_{\infty}-almost all η𝒫(𝔻)\eta\in{\mathcal{P}}({\mathbb{D}}),

η0μ0().\eta\in{\mathcal{M}}^{\mu_{0}}_{0}({\mathscr{L}}^{\infty}).

Since {\mathcal{M}}^{\mu_{0}}_{0}({\mathscr{L}}^{\infty}) contains only one point by uniqueness, all the points \eta\notin{\mathcal{Q}} coincide with {\mathbb{P}}_{0}. Hence, \Pi_{N} converges weakly to the one-point measure \delta_{{\mathbb{P}}_{0}}. The proof is complete. ∎

Remark 5.6.

For each {\mathbf{x}}=(x^{1},\cdots,x^{N})\in{\mathbb{R}}^{Nd}, let {\mathbf{X}}^{N}_{t}={\mathbf{X}}^{N}_{t}({\mathbf{x}}) be the unique solution of SDE (5.1) with starting point {\mathbf{x}}. Suppose that \kappa_{7}<0 (see (5.13)). Then for each N\in{\mathbb{N}}, the semigroup P^{N}_{t}f({\mathbf{x}}):={\mathbb{E}}f({\mathbf{X}}^{N}_{t}({\mathbf{x}})) admits an invariant probability measure \mu_{N}({\mathord{{\rm d}}}{\mathbf{x}}), which is symmetric in the sense that

\mu_{N}({\mathord{{\rm d}}}\pi_{N}({\mathbf{x}}))=\mu_{N}({\mathord{{\rm d}}}{\mathbf{x}})\ \mbox{ for any permutation $\pi_{N}({\mathbf{x}})$ of ${\mathbf{x}}=(x^{1},\cdots,x^{N})$}.

Indeed, by (5.13), for any β(0,α)\beta\in(0,\alpha), we have

supNsupi=1,,NsupT>01T0T𝔼|XtN,i|βdt<.\displaystyle\sup_{N}\sup_{i=1,\cdots,N}\sup_{T>0}\frac{1}{T}\int^{T}_{0}{\mathbb{E}}|X^{N,i}_{t}|^{\beta}{\mathord{{\rm d}}}t<\infty. (5.22)

Now we define a probability measure μN,T\mu_{N,T} over Nd{\mathbb{R}}^{Nd} by

μN,T(A):=1T0T(𝐗tNA)dt,A(Nd).\mu_{N,T}(A):=\frac{1}{T}\int^{T}_{0}{\mathbb{P}}({\mathbf{X}}^{N}_{t}\in A){\mathord{{\rm d}}}t,\ \ A\in{\mathscr{B}}({\mathbb{R}}^{Nd}).

By (5.22), the family of probability measures {μN,T,T1}\{\mu_{N,T},T\geqslant 1\} is tight. By the classical Krylov-Bogoliubov argument (cf. [10, Section 3.1]), any accumulation point μN\mu_{N} of {μN,T,T1}\{\mu_{N,T},T\geqslant 1\} is an invariant probability measure of PtNP^{N}_{t}, that is, for any nonnegative measurable function ff on Nd{\mathbb{R}}^{Nd},

Ndf(𝐱)μN(d𝐱)=NdPtNf(𝐱)μN(d𝐱),t>0.\int_{{\mathbb{R}}^{Nd}}f({\mathbf{x}})\mu_{N}({\mathord{{\rm d}}}{\mathbf{x}})=\int_{{\mathbb{R}}^{Nd}}P^{N}_{t}f({\mathbf{x}})\mu_{N}({\mathord{{\rm d}}}{\mathbf{x}}),\ \ t>0.

The symmetry of μN\mu_{N} follows from the symmetry of 𝐗tN{\mathbf{X}}^{N}_{t}. Moreover, by (5.22) one sees that

supNd|x|βμN(1)(dx)<,\sup_{N}\int_{{\mathbb{R}}^{d}}|x|^{\beta}\mu^{(1)}_{N}({\mathord{{\rm d}}}x)<\infty,

where μN(1)\mu^{(1)}_{N} is the 11-marginal distribution of μN\mu_{N}.
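The Krylov-Bogoliubov time-averaging used above can be illustrated numerically. The following is a minimal sketch, assuming a one-dimensional dissipative toy SDE dX = -X dt + dW discretized by Euler-Maruyama in place of the particle semigroup P_t^N; the function name, step size and horizon are illustrative choices, not part of the scheme of this paper.

```python
import numpy as np

def ou_time_average(T=50.0, dt=0.01, beta=2, x0=0.0, seed=0):
    """Occupation average (1/T) \int_0^T |X_t|^beta dt for the dissipative
    toy SDE dX = -X dt + dW, discretized by Euler-Maruyama.  By the
    Krylov-Bogoliubov argument this time average approximates the
    corresponding moment of the invariant measure (E|X|^2 = 1/2 here)."""
    rng = np.random.default_rng(seed)
    n = int(T / dt)
    x, acc = x0, 0.0
    for _ in range(n):
        acc += abs(x) ** beta * dt
        x += -x * dt + np.sqrt(dt) * rng.standard_normal()
    return acc / T
```

For beta = 2 the returned value should fluctuate around the stationary second moment 1/2, mirroring the tightness bound (5.22) that makes the family of time-averaged measures converge.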

Note that the existence of invariant probability measures for DDSDE (5.9) has been investigated in [16] under dissipativity assumptions. However, an open question remains regarding the conditions under which any accumulation point of \{\mu^{(1)}_{N},N\in{\mathbb{N}}\} becomes an invariant probability measure of DDSDE (5.9). This question is closely connected to the problem of propagation of chaos in uniform time, as discussed in [29]. In future research, we plan to address this question and explore the assumptions on the coefficients that lead to convergence of empirical measures and the emergence of invariant probability measures for DDSDE (5.9). Such investigations will contribute to a deeper understanding of the dynamics and statistical properties of DDSDEs and their particle approximations.

5.2. Brownian diffusion with unbounded interaction kernel

In the previous section, we focused on interaction terms that are bounded in the second variable yy, which excluded unbounded interaction kernels such as b(x,y)=b¯(xy)b(x,y)=\bar{b}(x-y), where b¯\bar{b} exhibits linear growth. In this section, we address the case of unbounded interaction kernels in the context of Brownian diffusion. Our results provide insights into the behavior of DDSDEs with unbounded interaction kernels and broaden the applicability of compound Poisson approximations in modeling and numerical simulations.

Fix α>2\alpha>2. We make the following assumptions about σ\sigma and bb:

  1. (𝐇~ν,ασ,b)\rm({\bf\widetilde{H}}^{\sigma,b}_{\nu,\alpha})

    We suppose that (Hνα{}^{\alpha}_{\nu}) holds, and σ\sigma and bb are continuous in (x,y)(x,y), and for some κ0,κ10\kappa_{0},\kappa_{1}\geqslant 0,

    σ(t,x,y,z)=σ(t,x,y,z),|σ(t,x,y,z)|(κ0+κ1(|x|+|y|))|z|.\displaystyle\sigma(t,x,y,-z)=-\sigma(t,x,y,z),\ \ |\sigma(t,x,y,z)|\leqslant(\kappa_{0}+\kappa_{1}(|x|+|y|))|z|. (5.23)

    Suppose that

    b(t,x,y)=b1(t,x)+b2(t,x,y),b(t,x,y)=b_{1}(t,x)+b_{2}(t,x,y),

    where for some m1m\geqslant 1 and κ2>0\kappa_{2}>0,

    |b1(t,x)|(κ2(1+|x|))m,\displaystyle|b_{1}(t,x)|\leqslant{\big{(}}\kappa_{2}(1+|x|){\big{)}}^{m}, (5.24)

    and for some κ3,κ40\kappa_{3},\kappa_{4}\geqslant 0 and κ5<0\kappa_{5}<0,

    x,b1(t,x)κ3+κ4|x|2+κ5|x|m+1,\displaystyle\langle x,b_{1}(t,x)\rangle\leqslant\kappa_{3}+\kappa_{4}|x|^{2}+\kappa_{5}|x|^{m+1}, (5.25)

    and for some c1,c2,c3>0c_{1},c_{2},c_{3}>0,

    |b2(t,x,y)|c1+c2|x|+c3|y|.\displaystyle|b_{2}(t,x,y)|\leqslant c_{1}+c_{2}|x|+c_{3}|y|. (5.26)

As in (3.19), we introduce the approximating coefficients \sigma_{N} and b_{N}:

σN(t,x,y,z):=N12σ(t,x,y,z),\sigma_{N}(t,x,y,z):=N^{-\frac{1}{2}}\sigma(t,x,y,z),

and

bN(t,x,y):=b1(t,x)N+N|b1(t,x)|11m+b2(t,x,y)N.\displaystyle b_{N}(t,x,y):=\frac{b_{1}(t,x)}{N+\sqrt{N}|b_{1}(t,x)|^{1-\frac{1}{m}}}+\frac{b_{2}(t,x,y)}{N}. (5.27)
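The taming in (5.27) keeps the per-jump displacement due to b_1 of order N^{-1/2} even though b_1 grows polynomially. A minimal numerical sketch of this mechanism follows; the helper name and the cubic test drift are illustrative assumptions, not from the paper.

```python
import numpy as np

def tamed_drift(b1, x, N, m):
    """Tamed first drift component b1(x) / (N + sqrt(N) |b1(x)|^{1-1/m}),
    as in (5.27); returns the per-jump displacement.  The taming yields
    N * |b_N| <= sqrt(N) * |b1(x)|^{1/m}, which is O(sqrt(N)(1+|x|))."""
    v = np.asarray(b1(x), dtype=float)
    r = np.linalg.norm(v)
    return v / (N + np.sqrt(N) * r ** (1.0 - 1.0 / m))
```

For example, with the cubic drift b1(x) = -x|x|^2 (so m = 3), N = 100 and x = 10, the untamed drift is -1000 while the tamed displacement is -1000/1100, so the effective drift N*b_N stays below sqrt(N)|b1|^{1/3}.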

For t0t\geqslant 0 and μ𝒫(d)\mu\in{\mathcal{P}}({\mathbb{R}}^{d}), we also define

μf(t,x):=t,μf(x):=𝒜μf(t,x)+b[t,x,μ]f(x),\displaystyle{\mathscr{L}}^{\infty}_{\mu}f(t,x):={\mathscr{L}}^{\infty}_{t,\mu}f(x):={\mathcal{A}}^{\infty}_{\mu}f(t,x)+b[t,x,\mu]\cdot\nabla f(x), (5.28)

where

{\mathcal{A}}^{\infty}_{\mu}f(t,x):=\frac{1}{2}\mathrm{tr}\left(\int_{{\mathbb{R}}^{d}}\Big{(}\sigma[t,x,\mu,z]\otimes\sigma[t,x,\mu,z]\Big{)}\nu({\mathord{{\rm d}}}z)\cdot\nabla^{2}f(x)\right).

Consider the following McKean-Vlasov SDE:

dXt=σν(2)(t,Xt,μXt)dWt+b[t,Xt,μXt]dt,\displaystyle{\mathord{{\rm d}}}X_{t}=\sigma_{\nu}^{(2)}{\big{(}}t,X_{t},\mu_{X_{t}}{\big{)}}{\mathord{{\rm d}}}W_{t}+b[t,X_{t},\mu_{X_{t}}]{\mathord{{\rm d}}}t, (5.29)

where WtW_{t} is a dd-dimensional standard Brownian motion, and

σν(2)(t,x,μ):=(dσ[t,x,μ,z]σ[t,x,μ,z]ν(dz))12.\sigma_{\nu}^{(2)}(t,x,\mu):=\left(\int_{{\mathbb{R}}^{d}}\sigma[t,x,\mu,z]\otimes\sigma[t,x,\mu,z]\nu({\mathord{{\rm d}}}z)\right)^{\frac{1}{2}}.

By Itô’s formula, the nonlinear time-inhomogeneous generator of DDSDE (5.29) is given by t,μ{\mathscr{L}}^{\infty}_{t,\mu}.
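When the intensity measure \nu is discrete, the matrix square root \sigma_\nu^{(2)} can be computed by an eigendecomposition of the covariance \int \sigma\otimes\sigma\,d\nu. The following sketch is illustrative (the function name is an assumption); it forms the covariance from atoms of \nu and returns its symmetric positive-semidefinite square root.

```python
import numpy as np

def sigma_nu2(sigmas, weights):
    """Given vectors sigma[t,x,mu,z_k] for atoms z_k of a discrete
    intensity measure nu = sum_k weights[k] * delta_{z_k}, form the
    covariance A = sum_k w_k sigma_k (x) sigma_k and return its
    symmetric square root, i.e. sigma_nu^{(2)} of (5.29)."""
    A = sum(w * np.outer(s, s) for s, w in zip(sigmas, weights))
    lam, U = np.linalg.eigh(A)  # A is symmetric positive semidefinite
    return U @ np.diag(np.sqrt(np.clip(lam, 0.0, None))) @ U.T
```

By construction the returned matrix S satisfies S @ S = A up to floating-point error.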

The following lemma is proven in the same way as Lemmas 5.1 and 3.9; we omit the details.

Lemma 5.7.

Under (𝐇~ν,ασ,b)\rm({\bf\widetilde{H}}^{\sigma,b}_{\nu,\alpha}), where α>2\alpha>2, for any R>0R>0, there is a constant CR>0C_{R}>0 such that for any fCbα(d)f\in C^{\alpha}_{b}({\mathbb{R}}^{d}), and for all NN and μ𝒫(d)\mu\in{\mathcal{P}}({\mathbb{R}}^{d}) with μ(||)R\mu(|\cdot|)\leqslant R,

supt0,|x|R|t,μNf(x)t,μf(x)|CRN(α2)12fCbα3.\sup_{t\geqslant 0,|x|\leqslant R}\big{|}{\mathscr{L}}^{N}_{t,\mu}f(x)-{\mathscr{L}}^{\infty}_{t,\mu}f(x)\big{|}\leqslant C_{R}N^{-\frac{(\alpha-2)\wedge 1}{2}}\|f\|_{C^{\alpha\wedge 3}_{b}}.

Moreover, if bb is bounded measurable and κ1=0\kappa_{1}=0, then CRC_{R} can be independent of R>0R>0.

The following lemma is similar to Lemma 5.2.

Lemma 5.8.

Under (𝐇~ν,ασ,b)\rm({\bf\widetilde{H}}^{\sigma,b}_{\nu,\alpha}), where α>2\alpha>2, for any β[2,α]\beta\in[2,\alpha], there are constants C0,C1,C2>0C_{0},C_{1},C_{2}>0 such that for all NN\in{\mathbb{N}},

s,μN(||β)(x)C0|x|β+C1μ(||)β+C2.\displaystyle{\mathscr{L}}^{N}_{s,\mu}(|\cdot|^{\beta})(x)\leqslant C_{0}|x|^{\beta}+C_{1}\mu(|\cdot|)^{\beta}+C_{2}. (5.30)

Moreover, if m>1m>1, then for any κ6<0\kappa_{6}<0, there are constants N0N_{0}\in{\mathbb{N}}, C3=C3(β,ν)>0C_{3}=C_{3}(\beta,\nu)>0 and C4=C4(N0,κi,ci)>0C_{4}=C_{4}(N_{0},\kappa_{i},c_{i})>0 such that for all NN0N\geqslant N_{0}, μ𝒫(d)\mu\in{\mathcal{P}}({\mathbb{R}}^{d}) and s0s\geqslant 0, xdx\in{\mathbb{R}}^{d},

s,μN(||β)(x)κ6|x|β+(βc3+C3κ1β)μ(||)β+C4.\displaystyle{\mathscr{L}}^{N}_{s,\mu}(|\cdot|^{\beta})(x)\leqslant\kappa_{6}|x|^{\beta}+(\beta c_{3}+C_{3}\kappa^{\beta}_{1})\mu(|\cdot|)^{\beta}+C_{4}. (5.31)
Proof.

We only prove (5.31). For simplicity we drop the time variable. By (5.24)-(5.26), we have

Nx,bN[x,μ]+N|bN[x,μ]|2\displaystyle N\langle x,b_{N}[x,\mu]\rangle+N|b_{N}[x,\mu]|^{2} κ3+κ4|x|2+κ5|x|m+11+N1|b1(x)|11m+|x|(c1+c2|x|+c3μ(||))\displaystyle\leqslant\frac{\kappa_{3}+\kappa_{4}|x|^{2}+\kappa_{5}|x|^{m+1}}{1+\sqrt{N^{-1}}|b_{1}(x)|^{1-\frac{1}{m}}}+|x|\cdot(c_{1}+c_{2}|x|+c_{3}\mu(|\cdot|))
+2κ22(1+|x|)2N+2(c1+c2|x|+c3μ(||))2N\displaystyle\quad+\frac{2\kappa^{2}_{2}(1+|x|)^{2}}{N}+\frac{2(c_{1}+c_{2}|x|+c_{3}\mu(|\cdot|))^{2}}{N}
κ5|x|m+11+N1|b1(x)|11m+C0+C1|x|2+(c34+4c3N)μ(||)2.\displaystyle\leqslant\frac{\kappa_{5}|x|^{m+1}}{1+\sqrt{N^{-1}}|b_{1}(x)|^{1-\frac{1}{m}}}+C_{0}+C_{1}|x|^{2}+\big{(}\tfrac{c_{3}}{4}+\tfrac{4c_{3}}{N}\big{)}\mu(|\cdot|)^{2}.

Since m>1m>1 and κ5<0\kappa_{5}<0, for any K>0K>0, by (5.24), there are N0N_{0} large enough and C3>0C_{3}>0 such that for all NN0N\geqslant N_{0},

κ5|x|m+11+N1|b1(x)|11mκ5|x|m+11+N1(κ2(1+|x|))m1Kκ5|x|2+C3.\frac{\kappa_{5}|x|^{m+1}}{1+\sqrt{N^{-1}}|b_{1}(x)|^{1-\frac{1}{m}}}\leqslant\frac{\kappa_{5}|x|^{m+1}}{1+\sqrt{N^{-1}}(\kappa_{2}(1+|x|))^{m-1}}\leqslant K\kappa_{5}|x|^{2}+C_{3}.

Thus, for any κ6<0\kappa_{6}<0, there is an N0N_{0} large enough such that for all NN0N\geqslant N_{0},

Nx,bN[x,μ]+N|bN[x,μ]|2κ6|x|2+c32μ(||)2+C4.\displaystyle N\langle x,b_{N}[x,\mu]\rangle+N|b_{N}[x,\mu]|^{2}\leqslant\kappa_{6}|x|^{2}+\tfrac{c_{3}}{2}\mu(|\cdot|)^{2}+C_{4}. (5.32)

For μN(||β){\mathcal{B}}^{N}_{\mu}(|\cdot|^{\beta}), substituting (5.32) into (5.10), we get

μN(||β)(x)\displaystyle{\mathcal{B}}^{N}_{\mu}(|\cdot|^{\beta})(x) β(κ6|x|2+c32μ(||)2+C4)01|x+θbN[x,μ]|β2dθ.\displaystyle\leqslant\beta\Big{(}\kappa_{6}|x|^{2}+\tfrac{c_{3}}{2}\mu(|\cdot|)^{2}+C_{4}\Big{)}\int^{1}_{0}|x+\theta b_{N}[x,\mu]|^{\beta-2}{\mathord{{\rm d}}}\theta.

On the other hand, for any ε,θ(0,1)\varepsilon,\theta\in(0,1), by |a+b|p(1+ε)|a|p+Cε|b|p|a+b|^{p}\leqslant(1+\varepsilon)|a|^{p}+C_{\varepsilon}|b|^{p}, we have

\displaystyle(1-\varepsilon)|x|^{\beta-2}-C_{\varepsilon}|b_{N}[x,\mu]|^{\beta-2}\leqslant|x+\theta b_{N}[x,\mu]|^{\beta-2}\leqslant(1+\varepsilon)|x|^{\beta-2}+C^{\prime}_{\varepsilon}|b_{N}[x,\mu]|^{\beta-2}.

Moreover, for any δ>0\delta>0, by (5.24) and (5.26), there is an N0N_{0} large enough so that for all NN0N\geqslant N_{0},

\displaystyle|b_{N}[x,\mu]|\leqslant\frac{\kappa_{2}(1+|x|)}{\sqrt{N}}+\frac{c_{1}+c_{2}|x|+c_{3}\mu(|\cdot|)}{N}\leqslant\delta(1+|x|+\mu(|\cdot|)). (5.33)

Thus for any ε(0,1)\varepsilon\in(0,1), one can choose N0N_{0} large enough so that for all NN0N\geqslant N_{0} and θ(0,1)\theta\in(0,1),

(1ε)|x|β2ε(1+μ(||)β2)|x+θbN[x,μ]|β2(1+ε)|x|β2+ε(1+μ(||)β2).\displaystyle(1-\varepsilon)|x|^{\beta-2}-\varepsilon(1+\mu(|\cdot|)^{\beta-2})\leqslant|x+\theta b_{N}[x,\mu]|^{\beta-2}\leqslant(1+\varepsilon)|x|^{\beta-2}+\varepsilon(1+\mu(|\cdot|)^{\beta-2}).

Hence, for any κ6<0\kappa_{6}<0, there is an N0N_{0} large enough so that for all NN0N\geqslant N_{0},

μN(||β)(x)β(κ6|x|β+c3μ(||)β+C5).\displaystyle{\mathcal{B}}^{N}_{\mu}(|\cdot|^{\beta})(x)\leqslant\beta(\kappa_{6}|x|^{\beta}+c_{3}\mu(|\cdot|)^{\beta}+C_{5}). (5.34)

For {\mathcal{A}}^{N}_{\mu}(|\cdot|^{\beta})(x), as in Step 2 of the proof of Lemma 3.12 and by (5.33), we have

|𝒜μN(||β)(x)|\displaystyle|{\mathcal{A}}^{N}_{\mu}(|\cdot|^{\beta})(x)| d|σ[x,μ,z]|201θ11|x+θθN1/2σ[x,μ,z]+bN[x,μ]|β2dθdθν(dz)\displaystyle\lesssim\int_{{\mathbb{R}}^{d}}|\sigma[x,\mu,z]|^{2}\int^{1}_{0}\theta\int^{1}_{-1}|x+\theta\theta^{\prime}N^{-1/2}\sigma[x,\mu,z]+b_{N}[x,\mu]|^{\beta-2}{\mathord{{\rm d}}}\theta^{\prime}{\mathord{{\rm d}}}\theta\nu({\mathord{{\rm d}}}z)
d|σ[x,μ,z]|2(|x|β2+|σ[x,μ,z]|β2+|bN[x,μ]|β2)ν(dz)\displaystyle\lesssim\int_{{\mathbb{R}}^{d}}|\sigma[x,\mu,z]|^{2}\Big{(}|x|^{\beta-2}+|\sigma[x,\mu,z]|^{\beta-2}+|b_{N}[x,\mu]|^{\beta-2}\Big{)}\nu({\mathord{{\rm d}}}z)
(1+κ1β(|x|β+μ(||)β))d(1+|z|β)ν(dz),\displaystyle\lesssim(1+\kappa_{1}^{\beta}(|x|^{\beta}+\mu(|\cdot|)^{\beta}))\int_{{\mathbb{R}}^{d}}(1+|z|^{\beta})\nu({\mathord{{\rm d}}}z),

which together with (5.34) and the arbitrariness of κ6<0\kappa_{6}<0 yields the desired estimate. ∎

Remark 5.9.

When m=1, a similar estimate to (5.31) still holds, but the parameter dependence becomes cumbersome.

We have the following corollary.

Corollary 5.10.

Under (𝐇~ν,ασ,b)\rm({\bf\widetilde{H}}^{\sigma,b}_{\nu,\alpha}), where α>2\alpha>2, for any β[2,α)\beta\in[2,\alpha) and T>0T>0, it holds that

supi=1,,N𝔼(supt[0,T]|XtN,i|β)<.\displaystyle\sup_{i=1,\cdots,N}{\mathbb{E}}\left(\sup_{t\in[0,T]}|X^{N,i}_{t}|^{\beta}\right)<\infty. (5.35)

Moreover, if m>1m>1, then for any κ<0\kappa<0, there is a constant C>0C>0 such that for all t>0t>0,

1Ni=1N𝔼|XtN,i|βeκtNi=1N𝔼|X0N,i|β+C,\displaystyle\frac{1}{N}\sum_{i=1}^{N}{\mathbb{E}}|X^{N,i}_{t}|^{\beta}\leqslant\frac{\mathrm{e}^{\kappa t}}{N}\sum_{i=1}^{N}{\mathbb{E}}|X^{N,i}_{0}|^{\beta}+C, (5.36)

and for any i=1,,Ni=1,\cdots,N,

𝔼|XtN,i|βeκt𝔼|X0N,i|β+Ceκt1Nj=1N𝔼|X0N,j|β+C.\displaystyle{\mathbb{E}}|X^{N,i}_{t}|^{\beta}\leqslant\mathrm{e}^{\kappa t}{\mathbb{E}}|X^{N,i}_{0}|^{\beta}+C\mathrm{e}^{\kappa t}\frac{1}{N}\sum_{j=1}^{N}{\mathbb{E}}|X^{N,j}_{0}|^{\beta}+C. (5.37)
Proof.

For fixed β[2,α)\beta\in[2,\alpha), by Itô’s formula (5.2) and (5.30), we have

|XtN,i|β\displaystyle|X^{N,i}_{t}|^{\beta} =|X0N,i|β+0ts,μ𝐗NN(||β)(XsN,i)ds+MtN,i\displaystyle=|X^{N,i}_{0}|^{\beta}+\int^{t}_{0}{\mathscr{L}}^{N}_{s,\mu_{{\mathbf{X}}^{N}}}(|\cdot|^{\beta})(X^{N,i}_{s}){\mathord{{\rm d}}}s+M^{N,i}_{t} (5.38)
|X0N,i|β+0t(C0|XsN,i|β+C1μ𝐗sN(||)β+C2)ds+MtN,i,\displaystyle\leqslant|X^{N,i}_{0}|^{\beta}+\int^{t}_{0}\Big{(}C_{0}|X^{N,i}_{s}|^{\beta}+C_{1}\mu_{{\mathbf{X}}^{N}_{s}}(|\cdot|)^{\beta}+C_{2}\Big{)}{\mathord{{\rm d}}}s+M^{N,i}_{t},

where

MtN,i=0tdΘμ𝐗NN(||β)(XsN,i,z)~N,i(ds,dz)M^{N,i}_{t}=\int^{t}_{0}\!\!\int_{{\mathbb{R}}^{d}}\Theta^{N}_{\mu_{{\mathbf{X}}^{N}}}(|\cdot|^{\beta})(X^{N,i}_{s-},z)\widetilde{\mathcal{H}}^{N,i}({\mathord{{\rm d}}}s,{\mathord{{\rm d}}}z)

is a local martingale. Noting that

μ𝐗sN(||)β1Nj=1N|XsN,j|β=:AsN,β,\displaystyle\mu_{{\mathbf{X}}^{N}_{s}}(|\cdot|)^{\beta}\leqslant\frac{1}{N}\sum_{j=1}^{N}|X^{N,j}_{s}|^{\beta}=:A^{N,\beta}_{s}, (5.39)

we have

AtN,βA0N,β+(C0+C1)0tAsN,βds+C2t+1Ni=1NMtN,i.A^{N,\beta}_{t}\leqslant A^{N,\beta}_{0}+(C_{0}+C_{1})\int^{t}_{0}A^{N,\beta}_{s}{\mathord{{\rm d}}}s+C_{2}t+\frac{1}{N}\sum_{i=1}^{N}M^{N,i}_{t}.

For any q(0,1)q\in(0,1) and T>0T>0, by stochastic Gronwall’s inequality (see [41, Lemma 3.7]), we have

supN𝔼(supt[0,T]|AtN,β|q)<\sup_{N}{\mathbb{E}}\left(\sup_{t\in[0,T]}|A^{N,\beta}_{t}|^{q}\right)<\infty

and

𝔼(supt[0,T]|XtN,i|βq)C(𝔼|X0N,i|β+C2𝔼0TAsN,βds+C3T)q<.\displaystyle{\mathbb{E}}\left(\sup_{t\in[0,T]}|X^{N,i}_{t}|^{\beta q}\right)\leqslant C\left({\mathbb{E}}|X^{N,i}_{0}|^{\beta}+C_{2}{\mathbb{E}}\int^{T}_{0}A^{N,\beta}_{s}{\mathord{{\rm d}}}s+C_{3}T\right)^{q}<\infty.

In particular, MtN,iM^{N,i}_{t} is a martingale. If m>1m>1, then by (5.38) and (5.31), for any κ<0\kappa<0,

d𝔼|XtN,i|β/dt(κ(βc3+C3κ1β))𝔼|XtN,i|β+(βc3+C3κ1β)𝔼AtN,β+C4,\displaystyle{\mathord{{\rm d}}}{\mathbb{E}}|X^{N,i}_{t}|^{\beta}/{\mathord{{\rm d}}}t\leqslant(\kappa-(\beta c_{3}+C_{3}\kappa^{\beta}_{1})){\mathbb{E}}|X^{N,i}_{t}|^{\beta}+(\beta c_{3}+C_{3}\kappa^{\beta}_{1}){\mathbb{E}}A^{N,\beta}_{t}+C_{4},

and

d𝔼AtN,β/dtκ𝔼AtN,β+C4.{\mathord{{\rm d}}}{\mathbb{E}}A^{N,\beta}_{t}/{\mathord{{\rm d}}}t\leqslant\kappa{\mathbb{E}}A^{N,\beta}_{t}+C_{4}.

Solving these two differential inequalities, we obtain the desired estimates. ∎

Remark 5.11.

If m>1m>1, then by (5.37), as in Remark 5.6 one can show the existence of invariant probability measures for the semigroup PtNP^{N}_{t} defined through SDE (5.1).

The following lemma is similar to Lemma 5.4.

Lemma 5.12.

For any T,\gamma>0, where the supremum below is taken over stopping times \tau,\eta\in{\mathscr{T}}_{T}, it holds that

limδ0supNsupτητ+δT(|XηN,1XτN,1|γ)=0.\displaystyle\lim_{\delta\to 0}\sup_{N}\sup_{\tau\leqslant\eta\leqslant\tau+\delta\leqslant T}{\mathbb{P}}\Big{(}|X^{N,1}_{\eta}-X^{N,1}_{\tau}|\geqslant\gamma\Big{)}=0. (5.40)
Proof.

Let τ,η𝒯T\tau,\eta\in{\mathscr{T}}_{T} with τητ+δ\tau\leqslant\eta\leqslant\tau+\delta. For any R>0R>0, define

\zeta_{R}:=\inf\left\{t>0:|X^{N,1}_{t}|\vee A^{N,2}_{t}>R\right\},

where AtN,2A^{N,2}_{t} is defined by (5.39), and

τR:=ζRτ,ηR:=ζRη.\tau_{R}:=\zeta_{R}\wedge\tau,\ \eta_{R}:=\zeta_{R}\wedge\eta.

By (5.1), we can write

XηRN,1XτRN,1\displaystyle X^{N,1}_{\eta_{R}}-X^{N,1}_{\tau_{R}} =τRηRdσN[s,XsN,1,μ𝐗sN,z]N,1(ds,dz)\displaystyle=\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{{\mathbb{R}}^{d}}\sigma_{N}{\Big{[}}s,X^{N,1}_{s-},\mu_{{\mathbf{X}}^{N}_{s-}},z{\Big{]}}{\mathcal{H}}^{N,1}({\mathord{{\rm d}}}s,{\mathord{{\rm d}}}z)
+τRηRbN[s,XsN,1,μ𝐗sN]d𝒩sN,1=:I1+I2.\displaystyle\quad+\int^{\eta_{R}}_{\tau_{R}}b_{N}{\Big{[}}s,X^{N,1}_{s-},\mu_{{\mathbf{X}}^{N}_{s-}}{\Big{]}}{\mathord{{\rm d}}}{\mathcal{N}}^{N,1}_{s}=:I_{1}+I_{2}.

For I1I_{1}, by (3.10) and the isometry of stochastic integrals, we have

𝔼|I1|2\displaystyle{\mathbb{E}}|I_{1}|^{2} =𝔼|τRηRdσN[s,XsN,1,μ𝐗sN,z]~N,1(ds,dz)|2\displaystyle={\mathbb{E}}\left|\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{{\mathbb{R}}^{d}}\sigma_{N}{\Big{[}}s,X^{N,1}_{s-},\mu_{{\mathbf{X}}^{N}_{s-}},z{\Big{]}}\widetilde{\mathcal{H}}^{N,1}({\mathord{{\rm d}}}s,{\mathord{{\rm d}}}z)\right|^{2}
\displaystyle={\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{{\mathbb{R}}^{d}}\Big{|}\sigma{\Big{[}}s,X^{N,1}_{s},\mu_{{\mathbf{X}}^{N}_{s}},z{\Big{]}}\Big{|}^{2}\nu({\mathord{{\rm d}}}z){\mathord{{\rm d}}}s\right)
𝔼(τRηRd1Nj=1N|σ(s,XsN,1,XsN,j,z)|2ν(dz)ds)\displaystyle\leqslant{\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}\!\!\!\int_{{\mathbb{R}}^{d}}\frac{1}{N}\sum_{j=1}^{N}\Big{|}\sigma(s,X^{N,1}_{s},X^{N,j}_{s},z)\Big{|}^{2}\nu({\mathord{{\rm d}}}z){\mathord{{\rm d}}}s\right)
𝔼(τRηR(1+|XsN,1|2+AsN,2)ds)d|z|2ν(dz)CRδ.\displaystyle\lesssim{\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}(1+|X^{N,1}_{s}|^{2}+A^{N,2}_{s}){\mathord{{\rm d}}}s\right)\int_{{\mathbb{R}}^{d}}|z|^{2}\nu({\mathord{{\rm d}}}z)\leqslant C_{R}\delta.

For I2I_{2}, by (5.24) and (5.26) we similarly have

𝔼|I2|\displaystyle{\mathbb{E}}|I_{2}| 𝔼(τRηR|bN[s,XsN,1,μ𝐗sN]|d𝒩sN,1)\displaystyle\leqslant{\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}|b_{N}\big{[}s,X^{N,1}_{s-},\mu_{{\mathbf{X}}^{N}_{s-}}\big{]}|{\mathord{{\rm d}}}{\mathcal{N}}^{N,1}_{s}\right)
=𝔼(τRηR|bN[s,XsN,1,μ𝐗sN]|d(Ns))\displaystyle={\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}\big{|}b_{N}\big{[}s,X^{N,1}_{s},\mu_{{\mathbf{X}}^{N}_{s}}\big{]}\big{|}{\mathord{{\rm d}}}(Ns)\right)
𝔼(τRηR(|b1(s,XsN,1)|+|b2[s,XsN,1,μ𝐗sN]|)ds)\displaystyle\leqslant{\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}{\big{(}}\big{|}b_{1}{\big{(}}s,X^{N,1}_{s}{\big{)}}\big{|}+\big{|}b_{2}\big{[}s,X^{N,1}_{s},\mu_{{\mathbf{X}}^{N}_{s}}\big{]}\big{|}{\big{)}}{\mathord{{\rm d}}}s\right)
\displaystyle\lesssim{\mathbb{E}}\left(\int^{\eta_{R}}_{\tau_{R}}{\big{(}}1+|X^{N,1}_{s}|^{m}+A^{N,2}_{s}{\big{)}}{\mathord{{\rm d}}}s\right)\leqslant C_{R}\delta.

Hence, by Chebyshev’s inequality and (5.35),

(|XηN,1XτN,1|γ)\displaystyle{\mathbb{P}}(|X^{N,1}_{\eta}-X^{N,1}_{\tau}|\geqslant\gamma) (|XηRN,1XτRN,1|γ;ζR>T)+(ζRT)\displaystyle\leqslant{\mathbb{P}}(|X^{N,1}_{\eta_{R}}-X^{N,1}_{\tau_{R}}|\geqslant\gamma;\zeta_{R}>T)+{\mathbb{P}}(\zeta_{R}\leqslant T)
\displaystyle\leqslant\sum_{i=1}^{2}{\mathbb{P}}(|I_{i}|\geqslant\tfrac{\gamma}{3})+{\mathbb{P}}\left(\sup_{t\in[0,T]}{\big{(}}|X^{N,1}_{t}|\vee A^{N,2}_{t}{\big{)}}\geqslant R\right)
(3γ)2𝔼|I1|2+3γ𝔼|I2|+CR\displaystyle\leqslant(\tfrac{3}{\gamma})^{2}{\mathbb{E}}|I_{1}|^{2}+\tfrac{3}{\gamma}{\mathbb{E}}|I_{2}|+\tfrac{C}{R}
CR,γδ+C/R,\displaystyle\leqslant C_{R,\gamma}\delta+C/R,

which converges to zero by firstly letting δ0\delta\to 0 and then RR\to\infty. ∎

The following propagation of chaos result can be proven using the same methodology as presented in Theorem 5.5. Due to the similarity of the arguments, we omit the detailed proof here.

Theorem 5.13.

Let μ0𝒫(d)\mu_{0}\in{\mathcal{P}}({\mathbb{R}}^{d}) and NN\in{\mathbb{N}}. Suppose that for any kNk\leqslant N,

(XN,10,,XN,k0)1μ0k,N,{\mathbb{P}}\circ{\big{(}}X^{N,1}_{0},\cdots,X^{N,k}_{0}{\big{)}}^{-1}\to\mu_{0}^{\otimes k},\ \ N\to\infty,

and DDSDE (5.29) admits a unique martingale solution 0μ00(){\mathbb{P}}_{0}\in{\mathcal{M}}^{\mu_{0}}_{0}({\mathscr{L}}^{\infty}) with initial distribution μ0\mu_{0} in the sense of Definition 6.2 in appendix. Then under (𝐇~σ,bν,α)\rm(\widetilde{\bf H}^{\sigma,b}_{\nu,\alpha}), for any kNk\leqslant N,

(XN,1,,XN,k)10k,N.{\mathbb{P}}\circ{\big{(}}X^{N,1}_{\cdot},\cdots,X^{N,k}_{\cdot}{\big{)}}^{-1}\to{\mathbb{P}}_{0}^{\otimes k},\ \ N\to\infty.

5.3. 𝒲1{\mathcal{W}}_{1}-convergence rate under Lipschitz assumptions

In this section, we establish the quantitative convergence rate of the propagation of chaos phenomenon for the additive noise particle system given by:

XN,it=XN,i0+N1t0b[s,XN,is,μ𝐗Ns]d𝒩N,is+N1/αHN,it,\displaystyle X^{N,i}_{t}=X^{N,i}_{0}+N^{-1}\int^{t}_{0}b\big{[}s,X^{N,i}_{s-},\mu_{{\mathbf{X}}^{N}_{s-}}\big{]}{\mathord{{\rm d}}}{\mathcal{N}}^{N,i}_{s}+N^{-1/\alpha}H^{N,i}_{t}, (5.41)

where \alpha\in(0,2], {\mathcal{N}}^{N,i} and H^{N,i} are the same as at the beginning of this section, and b(s,x,y):{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}\times{\mathbb{R}}^{d}\to{\mathbb{R}}^{d} satisfies, for some \kappa>0 and all s,x,x^{\prime},y,y^{\prime},

|b(s,x,y)|κ,|b(s,x,y)b(s,x,y)|κ(|xx|+|yy|).\displaystyle|b(s,x,y)|\leqslant\kappa,\ \ |b(s,x,y)-b(s,x^{\prime},y^{\prime})|\leqslant\kappa(|x-x^{\prime}|+|y-y^{\prime}|). (5.42)
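A toy simulation of the particle system (5.41) can be sketched as follows. This is a sketch under simplifying assumptions, not the exact scheme: we take \alpha = 2 with Gaussian noise increments standing in for the compound Poisson increments of H^{N,i}, and a drift whose dependence on the empirical measure is only through its mean; all names are illustrative.

```python
import numpy as np

def simulate_particles(N, T, b, alpha=2.0, seed=0):
    """Toy version of system (5.41): particle i jumps at the arrival
    times of its own rate-N Poisson clock; each jump moves it by
    b(s, X^i, mean of the empirical measure)/N plus a noise increment
    of size N^{-1/alpha} (Gaussian here, an illustrative choice)."""
    rng = np.random.default_rng(seed)
    X = np.zeros(N)
    t = rng.exponential(1.0 / N, size=N)      # first arrival times
    while True:
        i = int(np.argmin(t))                 # next clock to ring
        if t[i] > T:
            break
        X[i] += b(t[i], X[i], X.mean()) / N \
            + N ** (-1.0 / alpha) * rng.standard_normal()
        t[i] += rng.exponential(1.0 / N)      # schedule next arrival
    return X
```

Each particle makes on average NT jumps of drift size O(1/N) on [0, T], so the accumulated drift is O(T), matching the scaling of the limiting equation (5.43).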

The associated limiting McKean-Vlasov SDE is given by

Xt=X0+t0b[s,Xs,μXs]ds+L(α)t.\displaystyle X_{t}=X_{0}+\int^{t}_{0}b[s,X_{s},\mu_{X_{s}}]{\mathord{{\rm d}}}s+L^{(\alpha)}_{t}. (5.43)

Under (5.42), it is well-known that (5.43) has a unique solution for any α(0,2)\alpha\in(0,2). We aim to show the following result.

Theorem 5.14.

Suppose that {XN,i0,i=1,,N}\{X^{N,i}_{0},i=1,\cdots,N\} are i.i.d. 0{\mathcal{F}}_{0}-measurable random variables with common distribution μ0\mu_{0}. Under (Hαν{}^{\alpha}_{\nu}) and (5.42), where α>1\alpha>1, for any T>0T>0, there is a constant C=C(κ,α,T,d,b)>0C=C(\kappa,\alpha,T,d,\|b\|_{\infty})>0 such that for all t[0,T]t\in[0,T],

𝒲1(μXN,1t,μXt)C(N(α2)12𝟙α(2,3)+N2α2β1𝟙α(1,2)),{\mathcal{W}}_{1}\big{(}\mu_{X^{N,1}_{t}},\mu_{X_{t}}\big{)}\leqslant C\Big{(}N^{-\frac{(\alpha-2)\wedge 1}{2}}{\mathbbm{1}}_{\alpha\in(2,3)}+N^{-\frac{2-\alpha}{2}\wedge\beta_{1}}{\mathbbm{1}}_{\alpha\in(1,2)}\Big{)},

where \beta_{1} is from (H^{\alpha}_{\nu}), and for two probability measures \mu_{1},\mu_{2}\in{\mathcal{P}}({\mathbb{R}}^{d}), {\mathcal{W}}_{1}(\mu_{1},\mu_{2}) denotes the Wasserstein 1-distance defined by

𝒲1(μ1,μ2):=supϕC1b1|μ1(ϕ)μ2(ϕ)|.\displaystyle{\mathcal{W}}_{1}(\mu_{1},\mu_{2}):=\sup_{\|\phi\|_{C^{1}_{b}}\leqslant 1}|\mu_{1}(\phi)-\mu_{2}(\phi)|. (5.44)
Proof.

Let μt:=μXt\mu_{t}:=\mu_{X_{t}} and X~N,it\widetilde{X}^{N,i}_{t} solve the following particle system:

X~N,it=XN,i0+N1t0b[s,X~N,is,μs]d𝒩N,is+N1/αHN,it.\displaystyle\widetilde{X}^{N,i}_{t}=X^{N,i}_{0}+N^{-1}\int^{t}_{0}b\big{[}s,\widetilde{X}^{N,i}_{s-},\mu_{s}\big{]}{\mathord{{\rm d}}}{\mathcal{N}}^{N,i}_{s}+N^{-1/\alpha}H^{N,i}_{t}. (5.45)

Clearly, {X~N,i,i=1,,N}\{\widetilde{X}^{N,i}_{\cdot},i=1,\cdots,N\} are i.i.d. By (5.41) and (5.45), we have

𝔼|XN,itX~N,it|\displaystyle{\mathbb{E}}|X^{N,i}_{t}-\widetilde{X}^{N,i}_{t}| 𝔼t0|b[s,XN,is,μ𝐗Ns]b[s,X~N,is,μs]|ds\displaystyle\leqslant{\mathbb{E}}\int^{t}_{0}\Big{|}b\big{[}s,X^{N,i}_{s},\mu_{{\mathbf{X}}^{N}_{s}}\big{]}-b\big{[}s,\widetilde{X}^{N,i}_{s},\mu_{s}\big{]}\Big{|}{\mathord{{\rm d}}}s
𝔼t0|b[s,XN,is,μ𝐗Ns]b[s,X~N,is,μ𝐗~Ns]|ds\displaystyle\leqslant{\mathbb{E}}\int^{t}_{0}\Big{|}b\big{[}s,X^{N,i}_{s},\mu_{{\mathbf{X}}^{N}_{s}}\big{]}-b\big{[}s,\widetilde{X}^{N,i}_{s},\mu_{\widetilde{\mathbf{X}}^{N}_{s}}\big{]}\Big{|}{\mathord{{\rm d}}}s
+𝔼t0|b[s,X~N,is,μ𝐗~Ns]b[s,X~N,is,μs]|ds=:I1+I2.\displaystyle\quad+{\mathbb{E}}\int^{t}_{0}\Big{|}b\big{[}s,\widetilde{X}^{N,i}_{s},\mu_{\widetilde{\mathbf{X}}^{N}_{s}}\big{]}-b\big{[}s,\widetilde{X}^{N,i}_{s},\mu_{s}\big{]}\Big{|}{\mathord{{\rm d}}}s=:I_{1}+I_{2}.

For I1I_{1}, by (5.42) we have

I1κNj=1Nt0(𝔼|XN,isX~N,is|+𝔼|XN,jsX~N,js|)ds.I_{1}\leqslant\frac{\kappa}{N}\sum_{j=1}^{N}\int^{t}_{0}\left({\mathbb{E}}|X^{N,i}_{s}-\widetilde{X}^{N,i}_{s}|+{\mathbb{E}}|X^{N,j}_{s}-\widetilde{X}^{N,j}_{s}|\right){\mathord{{\rm d}}}s.

For I2I_{2}, since {X~N,is,i=1,,N}\{\widetilde{X}^{N,i}_{s},i=1,\cdots,N\} are i.i.d., by (2.27), (5.42) and definition (5.44), we have

I2\displaystyle I_{2} t0(𝔼|b[s,X~N,is,μ𝐗~Ns]b[s,X~N,is,μs]|2)1/2ds\displaystyle\leqslant\int^{t}_{0}\left({\mathbb{E}}\Big{|}b\big{[}s,\widetilde{X}^{N,i}_{s},\mu_{\widetilde{\mathbf{X}}^{N}_{s}}\big{]}-b\big{[}s,\widetilde{X}^{N,i}_{s},\mu_{s}\big{]}\Big{|}^{2}\right)^{1/2}{\mathord{{\rm d}}}s
t0𝒲1(μX~N,1s,μs)ds+bN.\displaystyle\lesssim\int^{t}_{0}{\mathcal{W}}_{1}(\mu_{\widetilde{X}^{N,1}_{s}},\mu_{s}){\mathord{{\rm d}}}s+\frac{\|b\|_{\infty}}{\sqrt{N}}.

On the other hand, by Theorem 3.19 and Remark 3.20, we have

sups[0,T]𝒲1(μX~N,1s,μs)N(α2)12𝟙α(2,3)+N2α2β1𝟙α(1,2).\displaystyle\sup_{s\in[0,T]}{\mathcal{W}}_{1}(\mu_{\widetilde{X}^{N,1}_{s}},\mu_{s})\lesssim N^{-\frac{(\alpha-2)\wedge 1}{2}}{\mathbbm{1}}_{\alpha\in(2,3)}+N^{-\frac{2-\alpha}{2}\wedge\beta_{1}}{\mathbbm{1}}_{\alpha\in(1,2)}. (5.46)

Combining the above calculations, we get

𝔼|XN,itX~N,it|\displaystyle{\mathbb{E}}|X^{N,i}_{t}-\widetilde{X}^{N,i}_{t}| κNj=1Nt0(𝔼|XN,isX~N,is|+𝔼|XN,jsX~N,js|)ds\displaystyle\leqslant\frac{\kappa}{N}\sum_{j=1}^{N}\int^{t}_{0}\left({\mathbb{E}}|X^{N,i}_{s}-\widetilde{X}^{N,i}_{s}|+{\mathbb{E}}|X^{N,j}_{s}-\widetilde{X}^{N,j}_{s}|\right){\mathord{{\rm d}}}s
+C(N(α2)12𝟙α(2,3)+N2α2β1𝟙α(1,2)),\displaystyle\quad+C\Big{(}N^{-\frac{(\alpha-2)\wedge 1}{2}}{\mathbbm{1}}_{\alpha\in(2,3)}+N^{-\frac{2-\alpha}{2}\wedge\beta_{1}}{\mathbbm{1}}_{\alpha\in(1,2)}\Big{)},

which implies that for all t\in[0,T],

𝔼|XN,itX~N,it|κNj=1Nt0𝔼|XN,jsX~N,js|ds+N(α2)12𝟙α(2,3)+N2α2β1𝟙α(1,2){\mathbb{E}}|X^{N,i}_{t}-\widetilde{X}^{N,i}_{t}|\lesssim\frac{\kappa}{N}\sum_{j=1}^{N}\int^{t}_{0}{\mathbb{E}}|X^{N,j}_{s}-\widetilde{X}^{N,j}_{s}|{\mathord{{\rm d}}}s+N^{-\frac{(\alpha-2)\wedge 1}{2}}{\mathbbm{1}}_{\alpha\in(2,3)}+N^{-\frac{2-\alpha}{2}\wedge\beta_{1}}{\mathbbm{1}}_{\alpha\in(1,2)}

and hence, averaging over i and applying Gronwall’s inequality,

\frac{1}{N}\sum_{j=1}^{N}{\mathbb{E}}|X^{N,j}_{t}-\widetilde{X}^{N,j}_{t}|\lesssim N^{-\frac{(\alpha-2)\wedge 1}{2}}{\mathbbm{1}}_{\alpha\in(2,3)}+N^{-\frac{2-\alpha}{2}\wedge\beta_{1}}{\mathbbm{1}}_{\alpha\in(1,2)}.

These together with (5.46) yield the desired estimate. ∎
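On the real line, the classical Wasserstein-1 distance between two equal-size empirical measures can be evaluated exactly by sorting, which is convenient for checking rates of the above type numerically; for the dual metric (5.44) (supremum over \|\phi\|_{C^1_b}\leqslant 1) this quantity is an upper bound, since such \phi are 1-Lipschitz. The function name below is illustrative.

```python
import numpy as np

def w1_empirical_1d(xs, ys):
    """W_1 distance between two equal-size empirical measures on R:
    sort both samples and average the coordinatewise gaps
    (the exact optimal coupling in one dimension)."""
    xs = np.sort(np.asarray(xs, dtype=float))
    ys = np.sort(np.asarray(ys, dtype=float))
    assert xs.shape == ys.shape, "equal sample sizes expected"
    return float(np.mean(np.abs(xs - ys)))
```

For instance, empirical measures supported at 0 and at 1 are at distance 1, while two samples that are permutations of each other are at distance 0.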

Remark 5.15.

Based on the aforementioned convergence result, an interesting direction for future work is to investigate the convergence of the fluctuation of the empirical measure given by:

ηNt:=N(μ𝐗NtμXt).\eta^{N}_{t}:=\sqrt{N}(\mu_{{\mathbf{X}}^{N}_{t}}-\mu_{X_{t}}).

This corresponds to studying the central limit theorem for the particle system, which characterizes the asymptotic behavior of the fluctuations around the mean behavior.

6. Appendix: Martingale solutions

In this section, we provide a brief overview of some key notions and results related to the martingale solutions associated with the operators {\mathscr{L}}_{t}. These concepts and results are well-known and can be found in Jacod-Shiryaev’s textbook [23]. We include them here for the convenience of the readers.

Let 𝔻:=𝔻(d){\mathbb{D}}:={\mathbb{D}}({\mathbb{R}}^{d}) be the space of all càdlàg functions from +{\mathbb{R}}_{+} to d{\mathbb{R}}^{d}, which is endowed with the Skorokhod topology (see [23, p325] for precise definition). The canonical process in 𝔻(d){\mathbb{D}}({\mathbb{R}}^{d}) is defined by

wt(ω)=ωt,ω𝔻(d).w_{t}(\omega)=\omega_{t},\ \ \omega\in{\mathbb{D}}({\mathbb{R}}^{d}).

Let 0t:=σ{ws,st}{\mathscr{B}}^{0}_{t}:=\sigma\{w_{s},s\leqslant t\} be the natural filtration and t:=s>t0s{\mathscr{B}}_{t}:=\cap_{s>t}{\mathscr{B}}^{0}_{s}. For R>0R>0, we introduce

τR(ω):=inf{t>0:|ωt||ωt|R},ω𝔻(d),\tau_{R}(\omega):=\inf\big{\{}t>0:|\omega_{t}|\vee|\omega_{t-}|\geqslant R\big{\}},\ \ \omega\in{\mathbb{D}}({\mathbb{R}}^{d}),

and

\displaystyle J(\omega):=\big{\{}t>0:|\omega(t)-\omega(t-)|>0\big{\}},\ \ V(\omega):=\big{\{}R>0:\tau_{R}(\omega)<\tau_{R+}(\omega)\big{\}} (6.1)

and

V(ω):={R>0:τR(ω)J(ω),|ω(τR(ω))|=R}.\displaystyle V^{\prime}(\omega):=\big{\{}R>0:\tau_{R}(\omega)\in J(\omega),|\omega(\tau_{R}(\omega)-)|=R\big{\}}. (6.2)

It is well-known that τR\tau_{R} is an 0t{\mathscr{B}}^{0}_{t}-stopping time, that is, for all t0t\geqslant 0, {τRt}0t\{\tau_{R}\leqslant t\}\in{\mathscr{B}}^{0}_{t}. Moreover, the function RτR(ω)R\mapsto\tau_{R}(\omega) is nondecreasing and left continuous, and J(ω)J(\omega), V(ω)V(\omega) and V(ω)V^{\prime}(\omega) are at most countable (see [23, p340, Lemma 2.10]). The following proposition can be found in [23, p341, Propositions 2.11 and 2.12] and [23, p349, Lemma 3.12].

Proposition 6.1.

For each R,t>0, the mappings \omega\mapsto\tau_{R}(\omega) and \omega\mapsto w_{t\wedge\tau_{R}}(\omega) are continuous with respect to the Skorokhod topology at each point \omega such that R\notin V(\omega)\cup V^{\prime}(\omega). Moreover, for any {\mathbb{P}}\in{\mathcal{P}}({\mathbb{D}}({\mathbb{R}}^{d})), the set \{R>0:{\mathbb{P}}(\omega:R\in V(\omega)\cup V^{\prime}(\omega))>0\} is at most countable.

Let :=(s)s0{\mathscr{L}}:=({\mathscr{L}}_{s})_{s\geqslant 0} be a family of linear operators from Cc2(d)C_{c}^{2}({\mathbb{R}}^{d}) to C(d)C({\mathbb{R}}^{d}). We introduce the following notion of martingale solutions (see [38]).

Definition 6.2.

Let s\geqslant 0 and \mu_{0}\in{\mathcal{P}}({\mathbb{R}}^{d}). We call a probability measure {\mathbb{P}}\in{\mathcal{P}}({\mathbb{D}}({\mathbb{R}}^{d})) a martingale solution associated with {\mathscr{L}} and with initial distribution \mu_{0} at time s if {\mathbb{P}}\circ w^{-1}_{s}=\mu_{0} and, for all f\in C^{2}_{c}({\mathbb{R}}^{d}), the process

Mt:=f(wt)f(ws)tsrf(wr)drM_{t}:=f(w_{t})-f(w_{s})-\int^{t}_{s}{\mathscr{L}}_{r}f(w_{r}){\mathord{{\rm d}}}r

is a local t{\mathscr{B}}_{t}-martingale after time ss under the probability measure {\mathbb{P}}. The set of all martingale solutions starting from μ0\mu_{0} at time ss is denoted by μ0s(){\mathcal{M}}^{\mu_{0}}_{s}({\mathscr{L}}). If μ0=δx\mu_{0}=\delta_{x} for some xdx\in{\mathbb{R}}^{d}, we shall simply write xs()=δxs(){\mathcal{M}}^{x}_{s}({\mathscr{L}})={\mathcal{M}}^{\delta_{x}}_{s}({\mathscr{L}}). If the operator {\mathscr{L}} also depends on the probability measure {\mathbb{P}} itself, then we shall call {\mathbb{P}} a solution of the nonlinear martingale problem.
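As a concrete sanity check (our own illustration, not part of the paper), take =12Δ{\mathscr{L}}=\tfrac{1}{2}\Delta in dimension one, whose martingale solution is Brownian motion, and f(x)=cosxf(x)=\cos x; then MtM_{t} above is a true martingale, so 𝔼Mt=0{\mathbb{E}}M_{t}=0. A minimal Monte Carlo sketch:

```python
# Monte Carlo check that M_t = f(W_t) - f(W_0) - int_0^t (1/2) f''(W_r) dr
# has mean zero for 1d Brownian motion W with L = (1/2) Laplacian, f = cos.
import math
import random

random.seed(0)
n_paths, n_steps, t = 5000, 100, 1.0
dt = t / n_steps
acc = 0.0
for _ in range(n_paths):
    w, integral = 0.0, 0.0
    for _ in range(n_steps):
        integral += 0.5 * (-math.cos(w)) * dt  # f''(x) = -cos(x), left Riemann sum
        w += random.gauss(0.0, math.sqrt(dt))  # exact Brownian increment
    acc += math.cos(w) - math.cos(0.0) - integral
mean_M = acc / n_paths  # should be close to 0 up to Monte Carlo error
```

Here the only approximation is the Riemann sum for the time integral; the Brownian increments are sampled exactly, so the empirical mean of MtM_{t} vanishes up to statistical noise.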

First of all we present the following purely technical result.

Proposition 6.3.

Suppose that for each (s,x)+×d(s,x)\in{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}, there is a unique martingale solution s,xxs(){\mathbb{P}}_{s,x}\in{\mathcal{M}}^{x}_{s}({\mathscr{L}}) so that for each measurable A𝔻(d)A\subset{\mathbb{D}}({\mathbb{R}}^{d}), (s,x)s,x(A)(s,x)\mapsto{\mathbb{P}}_{s,x}(A) is Borel measurable. Then {s,x,(s,x)+×d}\{{\mathbb{P}}_{s,x},(s,x)\in{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}\} is a family of strong Markov probability measures. If in addition, {\mathscr{L}} is a second order differential operator with the form:

sf(x)=tr(a(s,x)2f(x))+b(s,x)f(x),{\mathscr{L}}_{s}f(x)=\mathrm{tr}(a(s,x)\cdot\nabla^{2}f(x))+b(s,x)\cdot\nabla f(x),

where a:+×ddda:{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}\to{\mathbb{R}}^{d}\otimes{\mathbb{R}}^{d} is a symmetric matrix-valued locally bounded measurable function and b:+×ddb:{\mathbb{R}}_{+}\times{\mathbb{R}}^{d}\to{\mathbb{R}}^{d} is a vector-valued locally bounded measurable function, then for each (s,x)(s,x), s,x{\mathbb{P}}_{s,x} concentrates on the space of continuous functions.

Proof.

The statement that the uniqueness of martingale solutions implies the strong Markov property is a well-known result (see [38, Theorem 6.2.2]). We omit the details here. Now, let us prove the second conclusion. Without loss of generality we assume s=0s=0. To show that 0,x{\mathbb{P}}_{0,x} concentrates on the space of continuous functions, by Kolmogorov’s continuity criterion, it suffices to show that for any R,T>0R,T>0 and 0t0<t1T0\leqslant t_{0}<t_{1}\leqslant T,

𝔼0,x|wt1τRwt0τR|4CR|t1t0|2.\displaystyle{\mathbb{E}}^{{\mathbb{P}}_{0,x}}|w_{t_{1}\wedge\tau_{R}}-w_{t_{0}\wedge\tau_{R}}|^{4}\leqslant C_{R}|t_{1}-t_{0}|^{2}. (6.3)

Let 0t0<t1T0\leqslant t_{0}<t_{1}\leqslant T. Since τRθt0=τRt0\tau_{R}\circ\theta_{t_{0}}=\tau_{R}-t_{0} for t0<τRt_{0}<\tau_{R}, we have

t1τR=t0+(t1t0)(τRθt0),t0<τR.t_{1}\wedge\tau_{R}=t_{0}+(t_{1}-t_{0})\wedge(\tau_{R}\circ\theta_{t_{0}}),\ \ t_{0}<\tau_{R}.

Since {t0<τR}t0\{t_{0}<\tau_{R}\}\in{\mathscr{B}}_{t_{0}}, by the Markov property one sees that

𝔼0,x|wt1τRwt0τR|4\displaystyle{\mathbb{E}}^{{\mathbb{P}}_{0,x}}|w_{t_{1}\wedge\tau_{R}}-w_{t_{0}\wedge\tau_{R}}|^{4} =𝔼0,x[|wt1τRwt0|4𝟙t0<τR]\displaystyle={\mathbb{E}}^{{\mathbb{P}}_{0,x}}\Big{[}|w_{t_{1}\wedge\tau_{R}}-w_{t_{0}}|^{4}{\mathbbm{1}}_{t_{0}<\tau_{R}}\Big{]}
=𝔼0,x[𝔼0,x(|wt0+(t1t0)(τRθt0)wt0|4|t0)𝟙t0<τR]\displaystyle={\mathbb{E}}^{{\mathbb{P}}_{0,x}}\Big{[}{\mathbb{E}}^{{\mathbb{P}}_{0,x}}\Big{(}|w_{t_{0}+(t_{1}-t_{0})\wedge(\tau_{R}\circ\theta_{t_{0}})}-w_{t_{0}}|^{4}|{\mathscr{B}}_{t_{0}}\Big{)}{\mathbbm{1}}_{t_{0}<\tau_{R}}\Big{]}
=𝔼0,x[(𝔼s,y|ws+(t1t0)(τRθs)y|4)|(s,y)=(t0,wt0)𝟙t0<τR].\displaystyle={\mathbb{E}}^{{\mathbb{P}}_{0,x}}\Big{[}\Big{(}{\mathbb{E}}^{{\mathbb{P}}_{s,y}}|w_{s+(t_{1}-t_{0})\wedge(\tau_{R}\circ\theta_{s})}-y|^{4}\Big{)}\big{|}_{(s,y)=(t_{0},w_{t_{0}})}{\mathbbm{1}}_{t_{0}<\tau_{R}}\Big{]}. (6.4)

Fix ydy\in{\mathbb{R}}^{d} and β1\beta\geqslant 1. Define f(x)=|xy|2βf(x)=|x-y|^{2\beta}. Note that

sf(x)\displaystyle{\mathscr{L}}_{s}f(x) =2β|xy|2(β1)[tr(a(s,x))+xy,b(s,x)]\displaystyle=2\beta|x-y|^{2(\beta-1)}\Big{[}\mathrm{tr}(a(s,x))+\langle x-y,b(s,x)\rangle\Big{]}
+4β(β1)|xy|2(β2)a(s,x)(xy),xy.\displaystyle+4\beta(\beta-1)|x-y|^{2(\beta-2)}\langle a(s,x)(x-y),x-y\rangle.

In particular, for any R>0R>0 and T>0T>0,

|sf(x)|CR(|xy|2(β1)+|xy|2β1) for all s[0,T] and |x|R.|{\mathscr{L}}_{s}f(x)|\leqslant C_{R}\big{(}|x-y|^{2(\beta-1)}+|x-y|^{2\beta-1}\big{)}\ \text{ for all }s\in[0,T]\text{ and }|x|\leqslant R.

Now for s,t[0,T]s,t\in[0,T], by the definition of martingale solutions, we have

𝔼s,y|ws+t(τRθs)y|2β\displaystyle{\mathbb{E}}^{{\mathbb{P}}_{s,y}}|w_{s+t\wedge(\tau_{R}\circ\theta_{s})}-y|^{2\beta} =𝔼s,yf(ws+t(τRθs))=𝔼s,y(s+t(τRθs)srf(wr)dr)\displaystyle={\mathbb{E}}^{{\mathbb{P}}_{s,y}}f(w_{s+t\wedge(\tau_{R}\circ\theta_{s})})={\mathbb{E}}^{{\mathbb{P}}_{s,y}}\left(\int^{s+t\wedge(\tau_{R}\circ\theta_{s})}_{s}{\mathscr{L}}_{r}f(w_{r}){\mathord{{\rm d}}}r\right)
CR𝔼s,y(s+t(τRθs)s(|wry|2(β1)+|wry|2β1)dr)\displaystyle\leqslant C_{R}{\mathbb{E}}^{{\mathbb{P}}_{s,y}}\left(\int^{s+t\wedge(\tau_{R}\circ\theta_{s})}_{s}\Big{(}|w_{r}-y|^{2(\beta-1)}+|w_{r}-y|^{2\beta-1}\Big{)}{\mathord{{\rm d}}}r\right)
=CR𝔼s,y(t(τRθs)0(|ws+ry|2(β1)+|ws+ry|2β1)dr)\displaystyle=C_{R}{\mathbb{E}}^{{\mathbb{P}}_{s,y}}\left(\int^{t\wedge(\tau_{R}\circ\theta_{s})}_{0}\Big{(}|w_{s+r}-y|^{2(\beta-1)}+|w_{s+r}-y|^{2\beta-1}\Big{)}{\mathord{{\rm d}}}r\right)
CR𝔼s,y(t0(|ws+r(τRθs)y|2(β1)+|ws+r(τRθs)y|2β)dr).\displaystyle\leqslant C_{R}{\mathbb{E}}^{{\mathbb{P}}_{s,y}}\left(\int^{t}_{0}\Big{(}|w_{s+r\wedge(\tau_{R}\circ\theta_{s})}-y|^{2(\beta-1)}+|w_{s+r\wedge(\tau_{R}\circ\theta_{s})}-y|^{2\beta}\Big{)}{\mathord{{\rm d}}}r\right).

By Gronwall’s inequality, we get

𝔼s,y|ws+t(τRθs)y|2βCR𝔼s,y(t0|ws+r(τRθs)y|2(β1)dr).{\mathbb{E}}^{{\mathbb{P}}_{s,y}}|w_{s+t\wedge(\tau_{R}\circ\theta_{s})}-y|^{2\beta}\leqslant C_{R}{\mathbb{E}}^{{\mathbb{P}}_{s,y}}\left(\int^{t}_{0}|w_{s+r\wedge(\tau_{R}\circ\theta_{s})}-y|^{2(\beta-1)}{\mathord{{\rm d}}}r\right).

In particular, if one takes β=1\beta=1, then for any s,t[0,T]s,t\in[0,T],

𝔼s,y|ws+t(τRθs)y|2CRt.{\mathbb{E}}^{{\mathbb{P}}_{s,y}}|w_{s+t\wedge(\tau_{R}\circ\theta_{s})}-y|^{2}\leqslant C_{R}t.

Furthermore, taking β=2\beta=2, we get

𝔼s,y|ws+t(τRθs)y|4\displaystyle{\mathbb{E}}^{{\mathbb{P}}_{s,y}}|w_{s+t\wedge(\tau_{R}\circ\theta_{s})}-y|^{4} CR𝔼s,y(t0|ws+r(τRθs)y|2dr)CRt2.\displaystyle\leqslant C_{R}{\mathbb{E}}^{{\mathbb{P}}_{s,y}}\left(\int^{t}_{0}|w_{s+r\wedge(\tau_{R}\circ\theta_{s})}-y|^{2}{\mathord{{\rm d}}}r\right)\leqslant C_{R}t^{2}.

Substituting this into (6.4), we obtain (6.3). The proof is complete. ∎
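For orientation, the fourth-moment estimate (6.3) is exact for one-dimensional Brownian motion, where 𝔼|Wt1Wt0|4=3|t1t0|2{\mathbb{E}}|W_{t_{1}}-W_{t_{0}}|^{4}=3|t_{1}-t_{0}|^{2}. A quick Monte Carlo sketch of this fact (our own illustrative check, not part of the proof):

```python
# Monte Carlo check of the fourth-moment bound behind Kolmogorov's criterion:
# a Brownian increment over a step dt satisfies E|W_{t1}-W_{t0}|^4 = 3*dt^2.
import math
import random

random.seed(1)
dt = 0.1                      # time step t1 - t0
n = 100000                    # number of independent increments
est = sum(random.gauss(0.0, math.sqrt(dt)) ** 4 for _ in range(n)) / n
exact = 3.0 * dt * dt         # fourth moment of N(0, dt), here 0.03
```

The quadratic dependence on |t1t0||t_{1}-t_{0}| is precisely what Kolmogorov's continuity criterion needs to conclude that the stopped paths are continuous.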

Next we show a result that provides a way to construct a martingale solution for the operator s{\mathscr{L}}_{s}. Let {(Xεt)t0,ε(0,1)}\{(X^{\varepsilon}_{t})_{t\geqslant 0},\varepsilon\in(0,1)\} be a family of d{\mathbb{R}}^{d}-valued càdlàg adapted processes on some stochastic basis (Ωε,ε,ε;(εt)t0)(\Omega^{\varepsilon},{\mathcal{F}}^{\varepsilon},{\mathbb{P}}^{\varepsilon};({\mathcal{F}}^{\varepsilon}_{t})_{t\geqslant 0}). Let ε{\mathbb{Q}}_{\varepsilon} be the law of XεX^{\varepsilon} in 𝔻(d){\mathbb{D}}({\mathbb{R}}^{d}). Let {ε=(εt)t0,ε(0,1)}\{{\mathscr{L}}^{\varepsilon}=({\mathscr{L}}^{\varepsilon}_{t})_{t\geqslant 0},\varepsilon\in(0,1)\} be a family of random linear operators from Cb(d)C^{\infty}_{b}({\mathbb{R}}^{d}) to C(d)C({\mathbb{R}}^{d}). Suppose that

  1. (H)

    ε{\mathbb{Q}}_{\varepsilon} weakly converges to 0{\mathbb{Q}}_{0} in 𝒫(𝔻(d)){\mathcal{P}}({\mathbb{D}}({\mathbb{R}}^{d})) as ε0\varepsilon\downarrow 0, and for any fC2b(d)f\in C^{2}_{b}({\mathbb{R}}^{d}),

    Mεt:=f(Xεt)f(Xε0)t0εsf(Xεs)ds\displaystyle M^{\varepsilon}_{t}:=f(X^{\varepsilon}_{t})-f(X^{\varepsilon}_{0})-\int^{t}_{0}{\mathscr{L}}^{\varepsilon}_{s}f(X^{\varepsilon}_{s}){\mathord{{\rm d}}}s (6.5)

    is a local εt{\mathcal{F}}^{\varepsilon}_{t}-martingale with localizing sequence of stopping times (τεR)R(\tau^{\varepsilon}_{R})_{R\in{\mathbb{N}}}, where for each R>0R>0,

    τεR:=inf{t>0:|Xεt||Xεt|R}.\tau^{\varepsilon}_{R}:=\inf\big{\{}t>0:|X^{\varepsilon}_{t}|\vee|X^{\varepsilon}_{t-}|\geqslant R\big{\}}.

    Moreover, for each t,R>0t,R>0,

    limε0𝔼ε|tτεR0(εsfsf)(Xεs)ds|=0.\displaystyle\lim_{\varepsilon\to 0}{\mathbb{E}}^{{\mathbb{P}}^{\varepsilon}}\left|\int^{t\wedge\tau^{\varepsilon}_{R}}_{0}({\mathscr{L}}^{\varepsilon}_{s}f-{\mathscr{L}}_{s}f)(X^{\varepsilon}_{s}){\mathord{{\rm d}}}s\right|=0. (6.6)

We have the following result about the martingale solutions.

Theorem 6.4.

Under (H), it holds that 0μ00(){\mathbb{Q}}_{0}\in{\mathcal{M}}^{\mu_{0}}_{0}({\mathscr{L}}), where μ0:=0w01\mu_{0}:={\mathbb{Q}}_{0}\circ w_{0}^{-1}.

Proof.

For given fC2b(d)f\in C^{2}_{b}({\mathbb{R}}^{d}), define

Mt:=f(wt)f(w0)t0sf(ws)ds.\displaystyle M_{t}:=f(w_{t})-f(w_{0})-\int^{t}_{0}{\mathscr{L}}_{s}f(w_{s}){\mathord{{\rm d}}}s. (6.7)

Recall the definitions of V(ω)V(\omega) and V(ω)V^{\prime}(\omega) in (6.1) and (6.2). Since 𝕋:={R>0:0(ω:RV(ω)V(ω))>0}{\mathbb{T}}:=\{R>0:{\mathbb{Q}}_{0}(\omega:R\in V(\omega)\cup V^{\prime}(\omega))>0\} is at most countable and limRτR=\lim_{R\to\infty}\tau_{R}=\infty, to show 0μ00(){\mathbb{Q}}_{0}\in{\mathcal{M}}^{\mu_{0}}_{0}({\mathscr{L}}), it suffices to show that for each R𝕋R\notin{\mathbb{T}} and s<ts<t,

𝔼0(MtτR|sτR)=MsτR,{\mathbb{E}}^{{\mathbb{Q}}_{0}}\big{(}M_{t\wedge\tau_{R}}|{\mathscr{B}}_{s\wedge\tau_{R}}\big{)}=M_{s\wedge\tau_{R}},

or equivalently, for any nn\in{\mathbb{N}}, gCb(nd)g\in C_{b}({\mathbb{R}}^{nd}) and s1<s2<<snss_{1}<s_{2}<\cdots<s_{n}\leqslant s,

𝔼0[(MtτRMsτR)G(wτR)]=0,\displaystyle{\mathbb{E}}^{{\mathbb{Q}}_{0}}\Big{[}\big{(}M_{t\wedge\tau_{R}}-M_{s\wedge\tau_{R}}\big{)}G(w_{\cdot\wedge\tau_{R}})\Big{]}=0, (6.8)

where G(w):=g(ws1,,wsn).G(w):=g(w_{s_{1}},\cdots,w_{s_{n}}). Note that by the assumption,

𝔼ε[(MεtτεRMεsτεR)G(XετεR)]=0,\displaystyle{\mathbb{E}}^{{\mathbb{P}}^{\varepsilon}}\Big{[}\big{(}M^{\varepsilon}_{t\wedge\tau^{\varepsilon}_{R}}-M^{\varepsilon}_{s\wedge\tau^{\varepsilon}_{R}}\big{)}G(X^{\varepsilon}_{\cdot\wedge\tau^{\varepsilon}_{R}})\Big{]}=0, (6.9)

where MεtM^{\varepsilon}_{t} is defined by (6.5) and τεR:=inf{t>0:|Xεt||Xεt|R}\tau^{\varepsilon}_{R}:=\inf\big{\{}t>0:|X^{\varepsilon}_{t}|\vee|X^{\varepsilon}_{t-}|\geqslant R\big{\}}. We want to take weak limits. Since by Proposition 6.1,

𝔻(d)ω[(f(wtτR)f(wsτR))G(wτR)](ω)=:H(ω){\mathbb{D}}({\mathbb{R}}^{d})\ni\omega\mapsto\Big{[}\big{(}f(w_{t\wedge\tau_{R}})-f(w_{s\wedge\tau_{R}})\big{)}G(w_{\cdot\wedge\tau_{R}})\Big{]}(\omega)=:H(\omega)\in{\mathbb{R}}

is bounded and 0{\mathbb{Q}}_{0}-a.s. continuous, we have

limε0𝔼εH=𝔼0H.\lim_{\varepsilon\to 0}{\mathbb{E}}^{{\mathbb{Q}}_{\varepsilon}}H={\mathbb{E}}^{{\mathbb{Q}}_{0}}H.

Thus, by definitions (6.5) and (6.7), to prove (6.8), it remains to show

limε0𝔼ε(G(XετεR)tτεRsτεRεrf(Xεr)dr)=𝔼0(G(wτR)tτRsτRrf(wr)dr).\displaystyle\lim_{\varepsilon\to 0}{\mathbb{E}}^{{\mathbb{P}}^{\varepsilon}}\left(G(X^{\varepsilon}_{\cdot\wedge\tau^{\varepsilon}_{R}})\int^{t\wedge\tau^{\varepsilon}_{R}}_{s\wedge\tau^{\varepsilon}_{R}}{\mathscr{L}}^{\varepsilon}_{r}f(X^{\varepsilon}_{r}){\mathord{{\rm d}}}r\right)={\mathbb{E}}^{{\mathbb{Q}}_{0}}\left(G(w_{\cdot\wedge\tau_{R}})\int^{t\wedge\tau_{R}}_{s\wedge\tau_{R}}{\mathscr{L}}_{r}f(w_{r}){\mathord{{\rm d}}}r\right). (6.10)

Since for each rr, xrf(x)x\mapsto{\mathscr{L}}_{r}f(x) is a continuous function, by Proposition 6.1, one sees that

𝔻(d)ω(G(wτR)tτRsτRrf(wr)dr)(ω){\mathbb{D}}({\mathbb{R}}^{d})\ni\omega\mapsto\left(G(w_{\cdot\wedge\tau_{R}})\int^{t\wedge\tau_{R}}_{s\wedge\tau_{R}}{\mathscr{L}}_{r}f(w_{r}){\mathord{{\rm d}}}r\right)(\omega)\in{\mathbb{R}}

is bounded and 0{\mathbb{Q}}_{0}-a.s. continuous. Thus,

limε0𝔼ε(G(XετεR)tτεRsτεRrf(Xεr)dr)\displaystyle\lim_{\varepsilon\to 0}{\mathbb{E}}^{{\mathbb{P}}^{\varepsilon}}\left(G(X^{\varepsilon}_{\cdot\wedge\tau^{\varepsilon}_{R}})\int^{t\wedge\tau^{\varepsilon}_{R}}_{s\wedge\tau^{\varepsilon}_{R}}{\mathscr{L}}_{r}f(X^{\varepsilon}_{r}){\mathord{{\rm d}}}r\right) =limε0𝔼ε(G(wτR)tτRsτRrf(wr)dr)\displaystyle=\lim_{\varepsilon\to 0}{\mathbb{E}}^{{\mathbb{Q}}_{\varepsilon}}\left(G(w_{\cdot\wedge\tau_{R}})\int^{t\wedge\tau_{R}}_{s\wedge\tau_{R}}{\mathscr{L}}_{r}f(w_{r}){\mathord{{\rm d}}}r\right)
=𝔼0(G(wτR)tτRsτRrf(wr)dr),\displaystyle={\mathbb{E}}^{{\mathbb{Q}}_{0}}\left(G(w_{\cdot\wedge\tau_{R}})\int^{t\wedge\tau_{R}}_{s\wedge\tau_{R}}{\mathscr{L}}_{r}f(w_{r}){\mathord{{\rm d}}}r\right),

which together with (6.6) yields (6.10). The proof is complete. ∎
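To illustrate the generator convergence (6.6) in the simplest possible setting (an example of our own, not taken from the paper), consider a symmetric jump process making ±ε jumps at rate ε−2; its generator εf(x)=ε2((f(x+ε)+f(xε))/2f(x)){\mathscr{L}}^{\varepsilon}f(x)=\varepsilon^{-2}\big((f(x+\varepsilon)+f(x-\varepsilon))/2-f(x)\big) converges to f(x)=12f′′(x){\mathscr{L}}f(x)=\tfrac{1}{2}f^{\prime\prime}(x):

```python
# Generator of a symmetric compound-Poisson-type approximation: jumps of
# size +-eps at rate 1/eps^2.  As eps -> 0, L_eps f -> (1/2) f''.
import math

def L_eps(f, x, eps):
    # lambda * E[f(x + xi) - f(x)] with lambda = eps^{-2}, xi = +-eps w.p. 1/2
    return ((f(x + eps) + f(x - eps)) / 2.0 - f(x)) / eps ** 2

x = 0.3
limit = -0.5 * math.sin(x)  # (1/2) f''(x) for the test function f = sin
errs = [abs(L_eps(math.sin, x, e) - limit) for e in (0.1, 0.01, 0.001)]
# by Taylor expansion the error decreases at rate O(eps^2)
```

For smooth ff the pointwise error is O(ε2)O(\varepsilon^{2}), which is more than enough for the L1L^{1}-type convergence required in (6.6) along the stopped paths.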

Acknowledgement: The author would like to express their gratitude to Zimo Hao, Rongchan Zhu, and Xiangchan Zhu for their valuable discussions and helpful suggestions. The numerical experiments presented in Remark 2.3 were conducted by Ming-Yang Lai.

References

  • [1] Ambrosio L. (2004). Transport equation and Cauchy problem for BV vector fields. Invent. Math. 158, 2 (2004), 227-260.
  • [2] Aubin J.P. and Cellina A. (1984). Differential inclusions: set-valued maps and viability theory. Springer-Verlag, 1984.
  • [3] Bao J., Huang X. and Yuan C. (2019). Convergence rate of Euler-Maruyama scheme for SDEs with Hölder-Dini continuous drifts. J. Theoret. Probab., 32(2):848-871, 2019.
  • [4] Bally V. and Talay D. (1996). The law of the Euler scheme for stochastic differential equations. I. Convergence rate of the distribution function. Probab. Theory Related Fields 104 43-60.
  • [5] Bouchut F., James F., and Mancini S. (2005). Uniqueness and weak stability for multi-dimensional transport equations with one-sided Lipschitz coefficient. Ann. Sc. Norm. Super. Pisa Cl. Sci. (5) 4, 1 (2005), 1-25.
  • [6] Chaintron L.P. and Diez A. (2021). Propagation of chaos: a review of models, methods and applications. Available at arXiv:2106.14812.
  • [7] Chen Z.Q. and Zhang X. (2023). Heat kernels for time-dependent non-symmetric mixed Lévy-type operators. Journal of Functional Analysis 285 (2023) 109947.
  • [8] Constantin P. and Iyer G. (2008). A stochastic Lagrangian representation of the three-dimensional incompressible Navier-Stokes equations. Comm. Pure Appl. Math. LXI, 330-345 (2008).
  • [9] Crippa G. and De Lellis C. (2008). Estimates and regularity results for the DiPerna-Lions flow. J. Reine Angew. Math. 616 (2008), 15-46.
  • [10] Da Prato G., Zabczyk J. (1996). Ergodicity for Infinite-Dimensional Systems. London Mathematical Society Lecture Note Series, vol. 229. Cambridge University Press, Cambridge (1996).
  • [11] DiPerna R.J., Lions P.L. (1989). Ordinary differential equations, transport theory and Sobolev spaces, Invent. Math. 98(1989) 511-547.
  • [12] Evans L.C., Gariepy R.F. (1992). Measure Theory and Fine Properties of Functions, Studies in Advanced Mathematics, CRC Press, London, 1992.
  • [13] Fernandez B. and Méléard S.: A Hilbertian approach for fluctuations on the McKean-Vlasov model. Stochastic Processes and their Applications, 71 (1997) 33-53.
  • [14] Filippov A.F. (1960). Differential equations with discontinuous right-hand side. Amer. Math. Soc. Transl. Ser. 2, 42 (1960), 199-231.
  • [15] Gyöngy I. (1998). A note on Euler’s approximations. Potential Anal. 8, (1998), 205–216.
  • [16] Hammersley W.R.P., Šiška D. and Szpruch L. (2021). McKean-Vlasov SDEs under measure dependent Lyapunov conditions. Annales de l’Institut Henri Poincaré - Probabilités et Statistiques 2021, Vol. 57, No. 2, 1032-1057.
  • [17] Hao Z., Wu M. and Zhang X. (2020). Schauder estimates for nonlocal kinetic equations and applications. J. Math. Pures Appl., 140 (2020) 139-184.
  • [18] Hao Z., Röckner M. and Zhang X. (2022). Strong convergence of propagation of chaos for McKean-Vlasov SDEs with singular interactions. Available at arXiv:2204.07952.
  • [19] Higham D. J., Mao X. and Stuart A. M. (2002). Strong convergence of Euler type methods for nonlinear stochastic differential equations. SIAM J. Numer. Anal. 40, (2002), 1041–1063.
  • [20] Hörmander L.: Lectures on Nonlinear Hyperbolic Differential Equations. Springer-Verlag, Berlin Heidelberg, 1997.
  • [21] Hutzenthaler M., Jentzen A. and Kloeden P.E. (2011). Strong and weak divergence in finite time of Euler’s method for stochastic differential equations with non-globally Lipschitz continuous coefficients. Proc. R. Soc. A 467, (2011), 1563-1576.
  • [22] Hutzenthaler M., Jentzen A. and Kloeden P.E. (2012). Strong convergence of an explicit numerical method for SDEs with non-globally Lipschitz continuous coefficients. Ann. Appl. Probab. 22, (2012), 1611–1641.
  • [23] Jacod J. and Shiryaev A.N. (2002). Limit Theorems for Stochastic Processes. Grundlehren der mathematischen Wissenschaften, vol. 288, Springer, Heidelberg.
  • [24] Kac M. (1958). Probability and Related Topics in the Physical Sciences. Interscience, New York, 1958.
  • [25] Kloeden P. E. and Platen E. (1992). Numerical solution of stochastic differential equations, vol. 23 of Applications of Mathematics (New York). Springer-Verlag, Berlin, 1992.
  • [26] Krylov N. V. (1990). A simple proof of the existence of a solution to the Ito equation with monotone coefficients. Theory Probab. Appl. 35, (1990), 583–587.
  • [27] Li Y. and Zhao G. (2023). Euler-Maruyama scheme for SDE driven by Lévy processes with Hölder drift. https://arxiv.org/abs/2304.13952v1.
  • [28] Lions P.L. and Seeger B. (2023). Transport equations and flows with one-sided Lipschitz velocity fields. arXiv:2306.13288v1.
  • [29] Liu W., Wu L. and Zhang C. (2021). Long-Time Behaviors of Mean-Field Interacting Particle Systems Related to McKean–Vlasov Equations. Commun. Math. Phys. 387, 179-214 (2021).
  • [30] Majda A.J., Bertozzi A.L.(2002). Vorticity and incompressible flow. Cambridge Texts in Applied Mathematics. Cambridge: Cambridge University Press, 2002.
  • [31] McKean H. P. (1967). Propagation of chaos for a class of non-linear parabolic equations. In Stochastic Differential Equations (Lecture Series in Differential Equations, Session 7, Catholic Univ., 1967), pages 41-57. Air Force Office Sci. Res., Arlington, Va.
  • [32] Mikulevicius M. and Xu F. (2018). On the rate of convergence of strong Euler approximation for SDEs driven by Lévy processes. Stochastics, 90(4):569-604, 2018.
  • [33] Ngo H.L. and Taguchi D. (2016). Strong rate of convergence for the Euler-Maruyama approximation of stochastic differential equations with irregular coefficients. Mathematics of Computation, Vol. 85, Number 300, July 2016, Pages 1793-1819.
  • [34] Röckner M. and Zhang X. (2010). Weak uniqueness of Fokker–Planck equations with degenerate and bounded coefficients. C. R. Acad. Sci. Paris, Ser. I 348 (2010) 435-438.
  • [35] Sabanis S. (2013). A note on tamed Euler approximations. Electron. Commun. Probab. 18 (2013), no. 47, 1-10.
  • [36] Stein E.M. (1970). Singular Integrals and Differentiability Properties of Functions. Princeton University Press, 1970.
  • [37] Stein E.M. and Weiss G. (1971). Introduction to Fourier analysis on Euclidean spaces. Princeton University Press, 1971.
  • [38] Stroock D.W. and Varadhan S.R.S. (2006). Multidimensional Diffusion Processes. Springer-Verlag Berlin Heidelberg, 2006.
  • [39] Suo Y., Yuan C. and Zhang S.Q. (2022). Weak convergence of Euler scheme for SDEs with low regular drift. Numer. Algorithms, 90(2):731-747, 2022.
  • [40] Sznitman A.-S. (1991). Topics in propagation of chaos. In École d’Été de Probabilités de Saint-Flour XIX—1989, volume 1464 of Lecture Notes in Math., pages 165-251. Springer, Berlin.
  • [41] Xie L. and Zhang X. (2020). Ergodicity of stochastic differential equations with jumps and singular coefficients. Ann. Inst. Henri Poincaré Probab. Stat. 56 (2020), no. 1, 175-229.
  • [42] Zhang X. and Zhang X.(2023). Ergodicity of supercritical SDEs driven by α\alpha-stable processes and heavy-tailed sampling. Bernoulli 29(3), 2023, 1933-1958.
  • [43] Zhang X. (2010). A stochastic representation for backward incompressible Navier-Stokes equations. Prob.Theory and Rel. Fields 148(1-2), 305-332 (2010).
  • [44] Zhang X. (2012). Stochastic functional differential equations driven by Lévy processes and quasi-linear partial integro-differential equations. Ann. Appl. Probab., Vol. 22, No. 6, 2505-2538 (2012).
  • [45] Zhang X. (2013). Degenerate irregular SDEs with jumps and application to integro-differential equations of Fokker-Planck type. Electron. J. Probab. 18 (2013), no. 55, 1-25.