On the speed of convergence of Picard iterations
of backward stochastic differential equations

Martin Hutzenthaler Faculty of Mathematics, University of Duisburg-Essen, Essen, Germany;
      e-mail: martin.hutzenthaler\texttt{a}⃝uni-due.de Thomas Kruse Institute of Mathematics, University of Gießen, Gießen, Germany;
      e-mail: thomas.kruse\texttt{a}⃝math.uni-giessen.de Tuan Anh Nguyen Faculty of Mathematics, University of Duisburg-Essen, Essen, Germany;
      e-mail: tuan.nguyen\texttt{a}⃝uni-due.de

Abstract

It is a well-established fact in the scientific literature that Picard iterations of backward stochastic differential equations with globally Lipschitz continuous nonlinearity converge at least exponentially fast to the solution. In this paper we prove that this convergence is in fact at least square-root factorially fast. We show for one example that no higher convergence speed is possible in general. Moreover, if the nonlinearity is $z$ -independent, then the convergence is even factorially fast. Thus we reveal a phase transition in the speed of convergence of Picard iterations of backward stochastic differential equations.

^†^†footnotetext: Key words and phrases: backward stochastic differential equation, Picard iteration, a priori estimate, semilinear parabolic partial differential equation^†^†footnotetext: AMS 2020 subject classification: 65C99, 60H99, 60G99

1 Introduction

Since their introduction by Pardoux & Peng in [17] backward stochastic differential equations (BSDEs) have been extensively studied in the scientific literature and have found numerous applications. For example, BSDEs provide a solution approach for stochastic optimal control problems, BSDEs appear in the pricing and hedging of options in mathematical finance, and BSDEs provide stochastic representations of semilinear parabolic partial differential equations (PDEs).

A standard approach for proving existence results for BSDEs is to construct a contraction mapping whose fixed point is the solution $(Y,Z)$ of the BSDE. The associated fixed point iterations, the so-called Picard iterations, are a key component of several numerical approximation methods for BSDEs. We refer, e.g., to [2, 3] for numerical approximation methods for BSDEs based on Picard iterations and the least squares Monte Carlo method, we refer, e.g., to [10, 15] for numerical approximation methods for BSDEs based on Picard iterations and adaptive control variates, we refer, e.g., to [4, 9] for numerical approximation methods for BSDEs based on Picard iterations and Wiener chaos expansions, and we refer, e.g., to [6, 13, 7, 14, 11, 1, 12] for numerical approximation methods for BSDEs based on Picard iterations and a multilevel technique. Precise estimates on the speed of convergence of the Picard iterations $(Y_{n},Z_{n})_{n\in{\mathbbm{N}}_{0}}$ to the solution $(Y,Z)$ of the BSDE are essential for the error analyses of these numerical approximation methods for BSDEs where ${\mathbbm{N}}=\{1,2,\ldots\}$ and ${\mathbbm{N}}_{0}={\mathbbm{N}}\cup\{0\}$ .

Picard iterations, e.g., of ordinary differential equations converge not only exponentially fast but even factorially fast under suitable assumptions. Picard iterations of BSDEs are known to converge at least square-root factorially fast if the nonlinearity is z-independent; see the proof of [17, Theorem 3.1]. In the general case of z-dependent nonlinearities we have only found results proving that Picard iterations converge at least exponentially fast (see, e.g., [8, Theorem 2.1], [21, Theorem 4.3.1], and [19, Theorem 6.2.1]).

In this article we prove for BSDEs with z-independent and globally Lipschitz continuous nonlinearities that the Picard iterations converge in fact factorially fast. Moreover, we show for BSDEs with z-dependent and globally Lipschitz continuous nonlinearities that the Picard iterations converge at least square-root factorially fast. Somewhat surprisingly this speed of convergence cannot be improved in general. More precisely, we establish for a linear example BSDE a corresponding lower bound. We thereby reveal a phase transition in the speed of convergence of Picard iterations between the z-independent and the z-dependent case. Theorem 1.1 below illustrates the main results of this article.

Theorem 1.1.

Let $T\in(0,\infty)$ , $d,m\in{\mathbbm{N}}$ , $L_{\mathfrak{y}},L_{\mathfrak{z}}\in[0,\infty)$ , $b\in{\mathbbm{R}}^{m}$ , let $\lVert\cdot\rVert\colon\cup_{n\in{\mathbbm{N}}}{\mathbbm{R}}^{n}\to[0,\infty)$ satisfy for all $n\in{\mathbbm{N}}$ that $\lVert\cdot\rVert|_{{\mathbbm{R}}^{n}}$ is the standard norm on ${\mathbbm{R}}^{n}$ , let $\lVert\cdot\rVert_{\mathsf{F}}\colon{\mathbbm{R}}^{d\times m}\to[0,\infty)$ denote the Frobenius norm on ${\mathbbm{R}}^{d\times m}$ , let $(\Omega,\mathcal{F},{\mathbb{P}},({\mathbbm{F}}_{t})_{t\in[0,T]})$ be a filtered probability space which satisfies the usual conditions¹¹1Let $T\in(0,\infty)$ and let ${\bf\Omega}=(\Omega,\mathcal{F},{\mathbb{P}},({\mathbbm{F}}_{t})_{t\in[0,T]})$ be a filtered probability space. Then we say that ${\bf\Omega}$ satisfies the usual conditions if and only if it holds for all $t\in[0,T)$ that $\{A\in\mathcal{F}:{\mathbb{P}}(A)=0\}\subseteq{\mathbbm{F}}_{t}=\cap_{s\in(t,T]}{\mathbbm{F}}_{s}$ ., let $f\colon[0,T]\times\Omega\times{\mathbbm{R}}^{d}\times{\mathbbm{R}}^{d\times m}\to{\mathbbm{R}}^{d}$ be measurable, assume for all $t\in[0,T]$ , $y,\tilde{y}\in{\mathbbm{R}}^{d}$ , $z,\tilde{z}\in{\mathbbm{R}}^{d\times m}$ it holds a.s. that

\begin{split}\lVert f(t,y,z)-f(t,\tilde{y},\tilde{z})\rVert\leq L_{\mathfrak{y}}\lVert y-\tilde{y}\rVert+L_{\mathfrak{z}}\lVert z-\tilde{z}\rVert_{\mathsf{F}},\end{split}

(1)

let $W\colon[0,T]\times\Omega\to{\mathbbm{R}}^{m}$ be a standard $({\mathbbm{F}}_{t})_{t\in[0,T]}$ -Brownian motion with continuous sample paths, let $\xi\colon\Omega\to{\mathbbm{R}}^{d}$ be ${\mathbbm{F}}_{T}$ -measurable, let $Y^{k}\colon[0,T]\times\Omega\to{\mathbbm{R}}^{d}$ , $k\in{\mathbbm{N}}_{0}\cup\{\infty\}$ , be adapted with continuous sample paths, let $Z^{k}\colon[0,T]\times\Omega\to{\mathbbm{R}}^{d\times m}$ , $k\in{\mathbbm{N}}_{0}\cup\{\infty\}$ , be progressively measurable, assume that for all $s\in[0,T]$ , $k\in{\mathbbm{N}}_{0}\cup\{\infty\}$ it holds a.s. that $\int_{0}^{T}{\mathbb{E}}[\|\xi\rVert^{2}+\lVert f(t,0,0)\rVert^{2}+\lVert Y_{t}^{\infty}\rVert^{2}+\lVert Z_{t}^{k}\rVert^{2}_{\mathsf{F}}]\,dt<\infty$ , $Y^{0}_{s}=0$ , $Z^{0}_{s}=0$ , and

\displaystyle Y^{k+1}_{s}=\xi+\int_{s}^{T}f(t,Y_{t}^{k},Z_{t}^{k})\,dt-\int_{s}^{T}Z^{k+1}_{t}\,dW_{t},

(2)

and let $e_{k}\in[0,\infty]$ , $k\in{\mathbbm{N}}$ , satisfy for all $k\in{\mathbbm{N}}$ that

\displaystyle e_{k}=\left({\mathbb{E}}\!\left[\sup_{t\in[0,T]}\left(\left\lVert Y^{k}_{t}-Y_{t}^{\infty}\right\rVert^{2}\right)+\int_{0}^{T}\left\lVert Z^{k}_{t}-Z_{t}^{\infty}\right\rVert^{2}_{\mathsf{F}}\,dt\right]\right)^{\nicefrac{{1}}{{2}}}.

(3)

Then

(i)

there exists $c\in[0,\infty)$ such that for all $k\in{\mathbbm{N}}$ it holds that $e_{k}\leq\frac{c^{k}}{\sqrt{k!}},$
(ii)

if, in addition to the above assumptions, it holds that $L_{\mathfrak{z}}=0$ , then there exists $c\in[0,\infty)$ such that for all $k\in{\mathbbm{N}}$ it holds that $e_{k}\leq\frac{c^{k}}{k!},$ and
(iii)

if, in addition to the above assumptions, $d=T=1$ , $\xi=2^{m/2}e^{-\frac{\lVert W_{1}\rVert^{2}}{2}}$ , and for all $t\in[0,T]$ , $y\in{\mathbbm{R}}$ , $z\in{\mathbbm{R}}^{1\times m}$ it holds a.s. that $f(t,y,z)=z\cdot b$ , then there exists $c\in[0,\infty)$ such that for all $k\in{\mathbbm{N}}\cap[\lVert b\rVert^{2}-1,\infty)$ it holds that $\frac{1}{2}\left(\frac{\lVert b\rVert^{2}}{4}\right)^{\lfloor\frac{k+1}{2}\rfloor}\frac{1}{\sqrt{k!}}\leq e_{k}\leq\frac{c^{k}}{\sqrt{k!}}.$

Item (i) of Theorem 1.1 is a direct consequence of Proposition 4.1 and Remark 4.2. Item (ii) of Theorem 1.1 follows from Proposition 4.1 and Remark 4.3. Item (i) of Theorem 1.1 and Corollary 2.2 prove Item (iii) of Theorem 1.1. The proof of Item (i) of Theorem 1.1 is based on Lemma 3.1 which shows for all $t\in[0,T]$ , $k\in{\mathbbm{N}}$ , $\lambda\in(0,\infty)$ that

\displaystyle\begin{split}&{\mathbb{E}}\!\left[e^{\lambda t}\lvert Y^{k}_{t}-Y^{\infty}_{t}\rvert^{2}+\int_{t}^{T}e^{\lambda s}\lvert Z^{k}_{s}-Z^{\infty}_{s}\rvert^{2}\,ds\right]\\ &\leq\frac{1}{\lambda}{\mathbb{E}}\!\left[\int_{t}^{T}e^{\lambda s}\lvert f_{s}(Y_{s}^{k-1},Z_{s}^{k-1})-f_{s}(Y_{s}^{\infty},Z_{s}^{\infty})\rvert^{2}\,ds\right],\end{split}

(4)

based on the Lipschitz continuity of $f$ , iterating (4) $k\in{\mathbbm{N}}$ times and then setting $\lambda=k$ to get an upper bound of the form $c^{k}/\sqrt{k^{k}}$ for $e_{k}$ .

We finally discuss some possible consequences of Item (iii) of Theorem 1.1 on the performance of numerical approximation methods for BSDEs based on Picard iterations in high-dimensional situations. To this end we consider a sequence of BSDEs indexed by the dimension $m\in{\mathbbm{N}}$ of the driving Brownian motion $W$ whose associated Lipschitz constants $L_{\mathfrak{z},m}\in[0,\infty)$ , $m\in{\mathbbm{N}}$ , grow for some $\alpha\in(0,\infty)$ like $m^{\alpha}$ as $m\to\infty$ . Item (iii) of Theorem 1.1 shows that it is possible in such a situation that the approximation errors $e_{k,m}$ , $k,m\in{\mathbbm{N}}$ , grow faster in the dimension $m\in{\mathbbm{N}}$ than any polynomial in the sense that for all $p\in[0,\infty)$ there exists $N\in{\mathbbm{N}}$ such that for all $k\in{\mathbbm{N}}\cap[N,\infty)$ it holds that $\liminf_{m\to\infty}\frac{e_{k,m}}{m^{p}}=\infty$ .

The remainder of this article is organized as follows. In Section 2 we provide lower bounds for the convergence speed of Picard iterations. In Section 2.2 we establish in Corollary 2.2 lower bounds for the convergence speed of Picard iterations for a linear example BSDE. In our proof of Corollary 2.2 we employ lower bounds for the convergence speed of Picard iterations for a linear example PDE which we prove in Lemma 2.1 in Section 2.1. In Lemma 3.1 in Section 3 we establish explicit a priori estimates for certain backward Itô processes in appropriate $L^{2}$ -norms. In Section 4 we provide upper bounds for the convergence speed of Picard iterations of BSDEs. Proposition 4.1 establishes an explicit bound for the $L^{2}$ -distance between the Picard iterations and the solution of a BSDE with a globally Lipschitz continuous nonlinearity. In Remark 4.2 we employ the estimate of Proposition 4.1 to obtain the square root-factorial speed of convergence of Picard iterations. In Remark 4.3 we employ the estimate of Proposition 4.1 to obtain the factorial speed of convergence of Picard iterations in the z-independent case.

2 Lower bounds for the convergence speed of Picard iterations

In this section we provide lower bounds for the convergence speed of Picard iterations of BSDEs. In Lemma 2.1 in Section 2.1 we establish lower bounds for the convergence speed of Picard iterations for a linear example PDE. We employ Lemma 2.1 in our proof of Corollary 2.2 in Section 2.2 to provide lower bounds for the convergence speed of Picard iterations for a linear example BSDE. Corollary 2.2 shows that square-root factorial convergence speed cannot be improved up to exponential factors in the case of $z$ -dependent drivers. Item (ii) of Theorem 1.1 shows in the case of $z$ -independent drivers that factorial speed of convergence is possible. Lemma 2.3 observes that factorial speed of convergence cannot be improved up to exponential factors in the case of $y$ -dependent drivers.

2.1 Lower bounds for the convergence speed of Picard iterations for an example PDE

Lemma 2.1.

Let $d\in{\mathbbm{N}}$ , $b=(b_{1},b_{2},\ldots,b_{d})\in{\mathbbm{R}}^{d}$ , let $\langle\cdot,\cdot\rangle\colon{\mathbbm{R}}^{d}\times{\mathbbm{R}}^{d}\to{\mathbbm{R}}$ denote the standard scalar product on ${\mathbbm{R}}^{d}$ , let $\lVert\cdot\rVert\colon{\mathbbm{R}}^{d}\to[0,\infty)$ denote the standard norm on ${\mathbbm{R}}^{d}$ , let $(\Omega,\mathcal{F},{\mathbb{P}})$ be a probability space, let $W=(W^{1},W^{2},\ldots,W^{d})\colon[0,1]\times\Omega\to{\mathbbm{R}}^{d}$ be a standard Brownian motion, let $v^{n}\colon[0,1]\times{\mathbbm{R}}^{d}\to{\mathbbm{R}}$ , $n\in{\mathbbm{N}}_{0}\cup\{\infty\}$ , satisfy for all $t\in[0,1]$ , $x\in{\mathbbm{R}}^{d}$ , $n\in{\mathbbm{N}}$ that $v^{0}(t,x)=0$ ,

\displaystyle\begin{split}&v^{n}(t,x)={\mathbb{E}}\!\left[2^{d/2}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{t}\rVert^{2}}{2}\right)\right]\\ &+\sum_{k=1}^{n-1}\sum_{\mu_{1},\mu_{2},\ldots,\mu_{k}=1}^{d}\Biggl{[}\tfrac{(1-t)^{k}}{k!}b_{\mu_{1}}b_{\mu_{2}}\cdots b_{\mu_{k}}{\mathbb{E}}\!\left[2^{d/2}\tfrac{\partial^{k}}{\partial x_{\mu_{1}}\partial x_{\mu_{2}}\ldots\partial x_{\mu_{k}}}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{t}\rVert^{2}}{2}\right)\right]\Biggr{]},\end{split}

(5)

and

v^{\infty}(t,x)={\mathbb{E}}\!\left[2^{d/2}\exp\!\left(-\tfrac{\lVert x+b(1-t)+W_{1}-W_{t}\rVert^{2}}{2}\right)\right].

(6)

Then

(i)

it holds for all $t\in[0,1]$ , $x\in{\mathbbm{R}}^{d}$ that $v^{\infty}\in C^{\infty}([0,1]\times{\mathbbm{R}}^{d},{\mathbbm{R}})$ , $v^{\infty}(1,x)=2^{d/2}e^{-\frac{\lVert x\rVert^{2}}{2}}$ , and

\frac{\partial v^{\infty}}{\partial t}(t,x)+\frac{1}{2}(\mathop{}\!\mathbin{\bigtriangleup}_{x}v^{\infty})(t,x)+\left\langle b,(\nabla_{x}v^{\infty})(t,x)\right\rangle=0,

(7)

(ii)

it holds for all $n\in{\mathbbm{N}}_{0}$ , $t\in[0,1)$ , $x\in{\mathbbm{R}}^{d}$ that $v^{n}\in C^{\infty}([0,1]\times{\mathbbm{R}}^{d},{\mathbbm{R}})$ , $v^{n}(1,x)=2^{d/2}e^{-\frac{\lVert x\rVert^{2}}{2}}$ ,

\displaystyle\begin{split}v^{n+1}(t,x)&={\mathbb{E}}\!\left[2^{d/2}\exp\!\left(-\frac{\lVert x+W_{1}-W_{t}\rVert^{2}}{2}\right)\right]\\ &\qquad+\int_{t}^{1}{\mathbb{E}}\!\left[\Bigl{\langle}b,(\nabla_{x}v^{n})(s,x+W_{s}-W_{t})\Bigr{\rangle}\right]\!ds,\end{split}

(8)

and

\displaystyle\begin{split}\nabla_{x}v^{n+1}(t,x)&={\mathbb{E}}\!\left[2^{d/2}\exp\!\left(-\frac{\lVert x+W_{1}-W_{t}\rVert^{2}}{2}\right)\frac{W_{1}-W_{t}}{1-t}\right]\\ &\qquad+\int_{t}^{1}{\mathbb{E}}\!\left[\Bigl{\langle}b,(\nabla_{x}v^{n})(s,x+W_{s}-W_{t})\Bigr{\rangle}\frac{W_{s}-W_{t}}{s-t}\right]\!ds,\end{split}

(9)

(iii)

it holds for all $n\in{\mathbbm{N}}$ that

$\displaystyle v^{n}(0,0)=1+\sum_{i=1}^{\lfloor\frac{n-1}{2}\rfloor}\frac{(-1)^{i}\lVert b\rVert^{2i}}{4^{i}i!},$ (10)
(iv)

it holds that

$\displaystyle v^{\infty}(0,0)=\exp\!\left(-\frac{\lVert b\rVert^{2}}{4}\right),$ (11)

and

(v)

it holds for all $\epsilon\in(0,1)$ , $n\in{\mathbbm{N}}\cap\bigl{[}\frac{1}{2\epsilon}\lVert b\rVert^{2}-1,\infty\bigr{)}$ that

\displaystyle\left(\frac{\lVert b\rVert^{2}}{4}\right)^{\lfloor\frac{n+1}{2}\rfloor}\frac{1-\epsilon}{\lfloor\frac{n+1}{2}\rfloor!}\leq\lvert v^{\infty}(0,0)-v^{n}(0,0)\rvert\leq\left(\frac{\lVert b\rVert^{2}}{4}\right)^{\lfloor\frac{n+1}{2}\rfloor}\frac{1}{\lfloor\frac{n+1}{2}\rfloor!}\frac{1}{1-\epsilon}.

(12)

Proof of Lemma 2.1.

First note that the Feyman-Kac formula (cf., e.g., [16, Theorem 8.2.1]) proves (i).

Next observe that (5) proves for all $\ell\in\{1,2,\ldots,d\}$ , $s\in[0,1)$ , $x\in{\mathbbm{R}}^{d}$ , $n\in{\mathbbm{N}}$ that $v^{n}\in C^{\infty}([0,1]\times{\mathbbm{R}}^{d},{\mathbbm{R}})$ , $v^{n}(1,x)=2^{d/2}e^{-\frac{\lVert x\rVert^{2}}{2}}$ , and

\displaystyle\begin{split}&\tfrac{\partial v^{n}}{\partial x_{\ell}}(s,x)-{\mathbb{E}}\!\left[2^{d/2}\tfrac{\partial}{\partial x_{\ell}}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{s}\rVert^{2}}{2}\right)\right]=\tfrac{\partial v^{n}}{\partial x_{\ell}}(s,x)-\tfrac{\partial}{\partial x_{\ell}}{\mathbb{E}}\!\left[2^{d/2}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{s}\rVert^{2}}{2}\right)\right]\\ &=\sum_{k=1}^{n-1}\sum_{\mu_{1},\mu_{2},\ldots,\mu_{k}=1}^{d}\Biggl{[}\tfrac{(1-s)^{k}}{k!}b_{\mu_{1}}b_{\mu_{2}}\cdots b_{\mu_{k}}\tfrac{\partial}{\partial x_{\ell}}{\mathbb{E}}\!\left[2^{d/2}\tfrac{\partial^{k}}{\partial x_{\mu_{1}}\partial x_{\mu_{2}}\ldots\partial x_{\mu_{k}}}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{s}\rVert^{2}}{2}\right)\right]\Biggr{]}\\ &=\sum_{k=1}^{n-1}\sum_{\mu_{1},\mu_{2},\ldots,\mu_{k}=1}^{d}\Biggl{[}\tfrac{(1-s)^{k}}{k!}b_{\mu_{1}}b_{\mu_{2}}\cdots b_{\mu_{k}}{\mathbb{E}}\!\left[2^{d/2}\tfrac{\partial^{k+1}}{\partial x_{\ell}\partial x_{\mu_{1}}\partial x_{\mu_{2}}\ldots\partial x_{\mu_{k}}}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{s}\rVert^{2}}{2}\right)\right]\Biggr{]}.\end{split}

(13)

This, the disintegration theorem, and independence of Brownian increments show for all $t\in[0,1)$ , $s\in(t,1)$ , $x\in{\mathbbm{R}}^{d}$ , $\ell\in\{1,2,\ldots,d\}$ , $n\in{\mathbbm{N}}$ that

	$\displaystyle{\mathbb{E}}\!\left[\tfrac{\partial v^{n}}{\partial x_{\ell}}(s,x+W_{s}-W_{t})\right]-{\mathbb{E}}\!\left[2^{d/2}\tfrac{\partial}{\partial x_{\ell}}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{t}\rVert^{2}}{2}\right)\right]$
	$\displaystyle={\mathbb{E}}\!\left[\tfrac{\partial v^{n}}{\partial x_{\ell}}(s,x+W_{s}-W_{t})\right]-{\mathbb{E}}\!\left[{\mathbb{E}}\!\left[2^{d/2}\tfrac{\partial}{\partial x_{\ell}}\exp\!\left(-\tfrac{\lVert z+W_{1}-W_{s}\rVert^{2}}{2}\right)\right]\bigr{\|}_{z=x+W_{s}-W_{t}}\right]$
	$\displaystyle=\sum_{k=1}^{n-1}\sum_{\mu_{1},\mu_{2},\ldots,\mu_{k}=1}^{d}\Biggl{[}\tfrac{(1-s)^{k}}{k!}b_{\mu_{1}}b_{\mu_{2}}\cdots b_{\mu_{k}}$
	$\displaystyle\qquad\qquad\qquad\qquad\qquad\cdot{\mathbb{E}}\biggl{[}{\mathbb{E}}\!\left[2^{d/2}\tfrac{\partial^{k+1}}{\partial z_{\ell}\partial z_{\mu_{1}}\partial z_{\mu_{2}}\ldots\partial z_{\mu_{k}}}\exp\!\left(-\tfrac{\lVert z+W_{1}-W_{s}\rVert^{2}}{2}\right)\right]\bigr{\|}_{z=x+W_{s}-W_{t}}\biggr{]}\Biggr{]}$
	$\displaystyle=\sum_{k=1}^{n-1}\sum_{\mu_{1},\mu_{2},\ldots,\mu_{k}=1}^{d}\Biggl{[}\tfrac{(1-s)^{k}}{k!}b_{\mu_{1}}b_{\mu_{2}}\cdots b_{\mu_{k}}{\mathbb{E}}\!\left[2^{d/2}\tfrac{\partial^{k+1}}{\partial x_{\mu_{1}}\partial x_{\mu_{2}}\ldots\partial x_{\mu_{k}}\partial x_{\ell}}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{t}\rVert^{2}}{2}\right)\right]\Biggr{]}.$		(14)

This, the fact that $\forall\,k\in{\mathbbm{N}}_{0},t\in[0,1]\colon\int_{t}^{1}\frac{(1-s)^{k}}{k!}\,ds=\frac{(1-t)^{k+1}}{(k+1)!}$ , and (5) show for all $t\in[0,1)$ , $x\in{\mathbbm{R}}^{d}$ , $n\in{\mathbbm{N}}$ that

	$\displaystyle\int_{t}^{1}{\mathbb{E}}\!\left[\Bigl{\langle}b,(\nabla_{x}v^{n})(s,x+W_{s}-W_{t})\Bigr{\rangle}\right]\!ds=\sum_{\mu_{k+1}=1}^{d}\int_{t}^{1}{\mathbb{E}}\!\left[b_{\mu_{k+1}}\tfrac{\partial v^{n}}{\partial x_{\mu_{k+1}}}(s,x+W_{s}-W_{t})\right]\!ds$
	$\displaystyle=\sum_{k=0}^{n-1}\sum_{\mu_{1},\mu_{2},\ldots,\mu_{k+1}=1}^{d}\int_{t}^{1}\tfrac{(1-s)^{k}}{k!}b_{\mu_{1}}b_{\mu_{2}}\cdots b_{\mu_{k+1}}{\mathbb{E}}\!\left[2^{d/2}\tfrac{\partial^{k+1}}{\partial x_{\mu_{1}}\partial x_{\mu_{2}}\ldots\partial x_{\mu_{k+1}}}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{t}\rVert^{2}}{2}\right)\right]\!ds$
	$\displaystyle=\sum_{k=0}^{n-1}\sum_{\mu_{1},\mu_{2},\ldots,\mu_{k+1}=1}^{d}\Biggl{[}\tfrac{(1-t)^{k+1}}{(k+1)!}b_{\mu_{1}}b_{\mu_{2}}\cdots b_{\mu_{k+1}}{\mathbb{E}}\!\left[2^{d/2}\tfrac{\partial^{k+1}}{\partial x_{\mu_{1}}\partial x_{\mu_{2}}\ldots\partial x_{\mu_{k+1}}}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{t}\rVert^{2}}{2}\right)\right]\Biggr{]}$
	$\displaystyle=\sum_{k=1}^{n}\sum_{\mu_{1},\mu_{2},\ldots,\mu_{k}=1}^{d}\Biggl{[}\tfrac{(1-t)^{k}}{k!}b_{\mu_{1}}b_{\mu_{2}}\cdots b_{\mu_{k}}{\mathbb{E}}\!\left[2^{d/2}\tfrac{\partial^{k}}{\partial x_{\mu_{1}}\partial x_{\mu_{2}}\ldots\partial x_{\mu_{k}}}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{t}\rVert^{2}}{2}\right)\right]\Biggr{]}$
	$\displaystyle=v^{n+1}(t,x)-{\mathbb{E}}\Bigl{[}2^{d/2}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{t}\rVert^{2}}{2}\right)\Bigr{]}.$		(15)

This, the fact that $\forall\,t\in[0,1],x\in{\mathbbm{R}}^{d}\colon v^{0}(t,x)=0$ , and (5) show for all $t\in[0,1)$ , $x\in{\mathbbm{R}}^{d}$ , $n\in{\mathbbm{N}}_{0}$ that

\displaystyle v^{n+1}(t,x)={\mathbb{E}}\Bigl{[}2^{d/2}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{t}\rVert^{2}}{2}\right)\Bigr{]}+\int_{t}^{1}{\mathbb{E}}\!\left[\Bigl{\langle}b,(\nabla_{x}v^{n})(s,x+W_{s}-W_{t})\Bigr{\rangle}\right]\!ds.

(16)

Next note that Stein’s lemma proves for all $\ell\in\{1,2,\ldots,d\}$ , $x\in{\mathbbm{R}}^{d}$ , $s\in(0,1]$ , $t\in[0,s)$ , $h\in C^{1}({\mathbbm{R}}^{d},{\mathbbm{R}})$ with $\sup_{y\in{\mathbbm{R}}^{d}}\bigl{(}\lvert h(y)\rvert+\lvert\tfrac{\partial h}{\partial y_{\ell}}(y)\rvert\bigr{)}<\infty$ that

\displaystyle\frac{\partial}{\partial x_{\ell}}{\mathbb{E}}\!\left[h(x+W_{s}-W_{t})\right]={\mathbb{E}}\!\left[\frac{\partial h}{\partial x_{\ell}}(x+W_{s}-W_{t})\right]={\mathbb{E}}\!\left[h(x+W_{s}-W_{t})\frac{W^{\ell}_{s}-W^{\ell}_{t}}{s-t}\right].

(17)

This, (16), differentiation under integrals, the fact that $\forall\,\ell\in\{1,2,\ldots,d\},n\in{\mathbbm{N}}_{0},s\in[0,1)\colon\sup_{x\in{\mathbbm{R}}^{d}}\bigl{[}\exp(-\frac{\lVert x\rVert^{2}}{2})+\lvert\frac{\partial}{\partial x_{\ell}}\exp(-\frac{\lVert x\rVert^{2}}{2})\rvert+\lvert\langle b,(\nabla_{x}v^{n})(s,x)\rangle\rvert+\lvert\frac{\partial}{\partial x_{\ell}}\langle b,(\nabla_{x}v^{n})(s,x)\rangle\rvert\bigr{]}<\infty$ , and the fact that $\forall\,t\in[0,1],x\in{\mathbbm{R}}^{d}\colon v^{0}(t,x)=0$ show for all $\ell\in\{1,2,\ldots,d\}$ , $n\in{\mathbbm{N}}_{0}$ , $t\in[0,1)$ , $x\in{\mathbbm{R}}^{d}$ that

	$\displaystyle\frac{\partial v^{n+1}}{\partial x_{\ell}}(t,x)=\frac{\partial}{\partial x_{\ell}}{\mathbb{E}}\!\left[2^{d/2}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{t}\rVert^{2}}{2}\right)\right]+\int_{t}^{1}\frac{\partial}{\partial x_{\ell}}{\mathbb{E}}\!\left[\Bigl{\langle}b,(\nabla_{x}v^{n})(s,x+W_{s}-W_{t})\Bigr{\rangle}\right]\!ds$
	$\displaystyle={\mathbb{E}}\!\left[2^{d/2}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{t}\rVert^{2}}{2}\right)\frac{W_{1}^{\ell}-W_{t}^{\ell}}{1-t}\right]$
	$\displaystyle\qquad\qquad\qquad+\int_{t}^{1}{\mathbb{E}}\!\left[\Bigl{\langle}b,(\nabla_{x}v^{n})(s,x+W_{s}-W_{t})\Bigr{\rangle}\frac{W^{\ell}_{s}-W^{\ell}_{t}}{s-t}\right]\!ds.$		(18)

This and (16) show (ii).

For the next step let $0^{0}=1$ , let $H_{k}\colon{\mathbbm{R}}\to{\mathbbm{R}}$ , $k\in{\mathbbm{N}}_{0}$ , satisfy for all $k\in{\mathbbm{N}}_{0}$ , $x\in{\mathbbm{R}}$ that

H_{k}(x)=\sum_{\ell=0}^{\lfloor\frac{k}{2}\rfloor}\left[\frac{k!(-1)^{\ell}}{\ell!(k-2\ell)!}\frac{x^{k-2\ell}}{2^{\ell}}\right]

(19)

and for every $n\in{\mathbbm{N}}_{0}\cup\{-1\}$ let $n!!\in{\mathbbm{N}}$ satisfy that $n!!=\prod_{k=0}^{\left\lceil{{n}/{2}}\right\rceil-1}(n-2k)$ . A well-known fact on Hermite polynomials shows for all $x\in{\mathbbm{R}}$ , $k\in{\mathbbm{N}}_{0}$ that $\tfrac{d^{k}}{dx^{k}}(e^{-\frac{x^{2}}{2}})=e^{-\frac{x^{2}}{2}}H_{k}(x).$ Furthermore, a well-known fact on moments of normally distributed random variables shows for all $k\in{\mathbbm{N}}_{0}$ , $\ell\in[0,k]\cap{\mathbbm{N}}_{0}$ that $\tfrac{1}{\sqrt{\pi}}\int_{{\mathbbm{R}}}z^{2k+1-2\ell}e^{-z^{2}}\,dz=0$ and

\displaystyle\frac{1}{\sqrt{\pi}}\int_{{\mathbbm{R}}}z^{2k-2\ell}e^{-z^{2}}\,dz=\left[\frac{1}{\sqrt{2\pi\sigma^{2}}}\int_{{\mathbbm{R}}}z^{2k-2\ell}e^{-\frac{z^{2}}{2\sigma^{2}}}\,dz\right]\Bigr{|}_{\sigma^{2}=\frac{1}{2}}=\frac{(2k-2\ell-1)!!}{2^{k-\ell}}.

(20)

This, (19), and the binomial theorem imply for all $k\in{\mathbbm{N}}_{0}$ that

	$\displaystyle{\mathbb{E}}\!\left[\sqrt{2}\exp\!\left(-\frac{\lvert W_{1}^{1}\rvert^{2}}{2}\right)H_{2k}(W_{1}^{1})\right]=\frac{1}{\sqrt{2\pi}}\int_{{\mathbbm{R}}}\sqrt{2}e^{-\frac{z^{2}}{2}}\left[\sum_{\ell=0}^{k}\left(\frac{(2k)!(-1)^{\ell}}{\ell!(2k-2\ell)!}\frac{z^{2k-2\ell}}{2^{\ell}}\right)\right]e^{-\frac{z^{2}}{2}}\,dz$
	$\displaystyle=\sum_{\ell=0}^{k}\left[\frac{(2k)!(-1)^{\ell}}{\ell!(2k-2\ell)!2^{\ell}}\left(\frac{1}{\sqrt{\pi}}\int_{{\mathbbm{R}}}z^{2k-2\ell}e^{-z^{2}}\,dz\right)\right]=\sum_{\ell=0}^{k}\left[\frac{(2k)!(-1)^{\ell}}{\ell!(2k-2\ell)!2^{\ell}}\frac{(2k-2\ell-1)!!}{2^{k-\ell}}\right]$
	$\displaystyle=\sum_{\ell=0}^{k}\frac{(2k)!(-1)^{\ell}(2k-2\ell-1)!!}{\ell!(2k-2\ell)!2^{k}}=\sum_{\ell=0}^{k}\frac{(2k)!(-1)^{\ell}}{\ell!(2k-2\ell)!!2^{k}}$
	$\displaystyle=\sum_{\ell=0}^{k}\frac{(2k)!(-1)^{\ell}}{\ell!(k-\ell)!2^{k-\ell}2^{k}}=\frac{(2k)!}{4^{k}k!}\sum_{\ell=0}^{k}\frac{k!(-2)^{\ell}}{(k-\ell)!\ell!}=\frac{(2k)!}{4^{k}k!}(-1)^{k}$		(21)

and

\displaystyle\begin{split}&{\mathbb{E}}\!\left[\sqrt{2}\exp\!\left(-\frac{\lvert W_{1}^{1}\rvert^{2}}{2}\right)H_{2k+1}(W_{1}^{1})\right]\\ &=\frac{1}{\sqrt{2\pi}}\int_{{\mathbbm{R}}}\sqrt{2}e^{-\frac{z^{2}}{2}}\left[\sum_{\ell=0}^{\lfloor(2k+1)/{2}\rfloor}\left(\frac{(2k+1)!(-1)^{\ell}}{\ell!(2k+1-2\ell)!}\frac{z^{2k+1-2\ell}}{2^{\ell}}\right)\right]e^{-\frac{z^{2}}{2}}\,dz=0.\end{split}

(22)

Thus, it holds for all $k\in{\mathbbm{N}}_{0}$ that

{\mathbb{E}}\!\left[\sqrt{2}\exp\!\left(-\frac{\lvert W_{1}^{1}\rvert^{2}}{2}\right)H_{k}(W_{1}^{1})\right]=\mathbbm{1}_{2{\mathbbm{N}}_{0}}(k)\frac{k!(-1)^{\lfloor k/2\rfloor}}{4^{k/2}\lfloor k/2\rfloor!}.

(23)

Furthermore, the combinatorial interpretation of multinomial coefficients yields for all $k\in{\mathbbm{N}}$ , $\alpha\in({\mathbbm{N}}_{0})^{k}$ with $\lvert\alpha\rvert=k$ that

\displaystyle\begin{split}&\#\left[\bigcap_{i=1}^{d}\Bigl{\{}(\mu_{1},\mu_{2},\ldots,\mu_{k})\in\{1,2,\ldots,d\}^{k}\colon\#\{\ell\in\{1,2,\ldots,k\}\colon\mu_{\ell}=i\}=\alpha_{i}\Bigr{\}}\right]\\ &=\frac{k!}{\alpha_{1}!\alpha_{2}!\cdots\alpha_{d}!}.\end{split}

(24)

This, the fact that $W^{1},W^{2},\ldots,W^{d}$ are independent, (5), and the fact that $\forall\,x\in{\mathbbm{R}},k\in{\mathbbm{N}}_{0}\colon\frac{d^{k}}{dx^{k}}(e^{-\frac{x^{2}}{2}})=e^{-\frac{x^{2}}{2}}H_{k}(x)$ show for all $n\in{\mathbbm{N}}$ that

	$\displaystyle v^{n}(0,0)-\left[\prod_{\ell=1}^{d}{\mathbb{E}}\!\left[\sqrt{2}\exp\!\left(-\frac{(W_{1}^{\ell})^{2}}{2}\right)\right]\right]=v^{n}(0,0)-{\mathbb{E}}\!\left[2^{d/2}\exp\!\left(-\frac{\lVert W_{1}\rVert^{2}}{2}\right)\right]$
	$\displaystyle=\sum_{k=1}^{n-1}\sum_{\mu_{1},\mu_{2},\ldots,\mu_{k}=1}^{d}\Biggl{[}\frac{1}{k!}b_{\mu_{1}}b_{\mu_{2}}\cdots b_{\mu_{k}}{\mathbb{E}}\!\left[2^{d/2}\frac{\partial^{k}}{\partial x_{\mu_{1}}\partial x_{\mu_{2}}\ldots\partial x_{\mu_{k}}}\exp\!\left(-\frac{\lVert x+W_{1}\rVert^{2}}{2}\right)\right]\Bigr{\|}_{x=0}\Biggr{]}$
	$\displaystyle=\sum_{k=1}^{n-1}\sum_{\alpha\in({\mathbbm{N}}_{0})^{d}\colon\lvert\alpha\rvert=k}\left[\prod_{\ell=1}^{d}{\mathbb{E}}\!\left[\frac{b_{\ell}^{\alpha_{\ell}}\sqrt{2}}{\alpha_{\ell}!}\frac{\partial^{\alpha_{\ell}}}{\partial x_{\ell}^{\alpha_{\ell}}}\exp\!\left(-\frac{(x_{\ell}+W_{1}^{\ell})^{2}}{2}\right)\right]\Bigr{\|}_{x_{\ell}=0}\right]$
	$\displaystyle=\sum_{k=1}^{n-1}\sum_{\alpha\in({\mathbbm{N}}_{0})^{d}\colon\lvert\alpha\rvert=k}\left[\prod_{\ell=1}^{d}{\mathbb{E}}\!\left[\frac{b_{\ell}^{\alpha_{\ell}}\sqrt{2}}{\alpha_{\ell}!}\exp\!\left(-\frac{(W_{1}^{\ell})^{2}}{2}\right)H_{\alpha_{\ell}}(W_{1}^{\ell})\right]\right].$		(25)

This, (23), and the multinomial theorem show for all $n\in{\mathbbm{N}}$ that

	$\displaystyle v^{n}(0,0)=\sum_{k=0}^{n-1}\sum_{\alpha\in({\mathbbm{N}}_{0})^{d}\colon\lvert\alpha\rvert=k}\prod_{\ell=1}^{d}\left[\frac{b_{\ell}^{\alpha_{\ell}}}{\alpha_{\ell}!}\mathbbm{1}_{2{\mathbbm{N}}_{0}}(\alpha_{\ell})\frac{\alpha_{\ell}!(-1)^{\lfloor\alpha_{\ell}/2\rfloor}}{4^{\alpha_{\ell}/2}\lfloor\alpha_{\ell}/2\rfloor!}\right]$
	$\displaystyle=\sum_{i=0}^{\lfloor\frac{n-1}{2}\rfloor}\sum_{\beta\in({\mathbbm{N}}_{0})^{d}\colon\lvert\beta\rvert=i}\prod_{\ell=1}^{d}\frac{b_{\ell}^{2\beta_{\ell}}(-1)^{\beta_{\ell}}}{4^{\beta_{\ell}}\beta_{\ell}!}=\sum_{i=0}^{\lfloor\frac{n-1}{2}\rfloor}\left[\frac{(-1)^{i}}{4^{i}}\sum_{\beta\in({\mathbbm{N}}_{0})^{d}\colon\lvert\beta\rvert=i}\prod_{\ell=1}^{d}\frac{b_{\ell}^{2\beta_{\ell}}}{\beta_{\ell}!}\right]$
	$\displaystyle=\sum_{i=0}^{\lfloor\frac{n-1}{2}\rfloor}\left[\frac{(-1)^{i}}{4^{i}i!}\sum_{\beta\in({\mathbbm{N}}_{0})^{d}\colon\lvert\beta\rvert=i}\left[\frac{i!b_{1}^{2\beta_{1}}b_{2}^{2\beta_{2}}\ldots b_{d}^{2\beta_{d}}}{\beta_{1}!\beta_{2}!\cdots\beta_{d}!}\right]\right]=\sum_{i=0}^{\lfloor\frac{n-1}{2}\rfloor}\frac{(-1)^{i}\lVert b\rVert^{2i}}{4^{i}i!}.$		(26)

This establishes (iii).

Next observe that for all $a\in{\mathbbm{R}}$ , $\ell\in\{1,2,\ldots,d\}$ it holds that

\begin{split}&{\mathbb{E}}\!\left[\exp\!\left(-\frac{(a+W^{\ell}_{1})^{2}}{2}\right)\right]=\frac{1}{\sqrt{2\pi}}\int_{{\mathbbm{R}}}e^{-\frac{(a+z)^{2}}{2}}e^{-\frac{z^{2}}{2}}\,dz=\frac{1}{\sqrt{2\pi}}\int_{{\mathbbm{R}}}e^{-\frac{a^{2}}{2}}e^{-az}e^{-z^{2}}\,dz\\ &=\frac{1}{\sqrt{2\pi}}\int_{{\mathbbm{R}}}e^{-\frac{a^{2}}{2}+\frac{1}{4}a^{2}}e^{-(z+\frac{1}{2}a)^{2}}\,dz=\frac{1}{\sqrt{2\pi}}\int_{{\mathbbm{R}}}e^{-\frac{a^{2}}{4}}e^{-\frac{y^{2}}{2}}\frac{1}{\sqrt{2}}\,dy=\frac{e^{-\frac{a^{2}}{4}}}{\sqrt{2}}.\end{split}

(27)

This, (6), and the fact that $W^{1},W^{2},\ldots,W^{d}$ are independent prove that

v^{\infty}(0,0)={\mathbb{E}}\!\left[2^{d/2}\exp\!\left(-\frac{\lVert b+W_{1}\rVert^{2}}{2}\right)\right]=2^{d/2}\prod_{\ell=1}^{d}{\mathbb{E}}\!\left[\exp\!\left(-\frac{(b_{\ell}+W^{\ell}_{1})^{2}}{2}\right)\right]=e^{-\frac{\lVert b\rVert^{2}}{4}}.

(28)

This shows (iv).

Next note that (iii), (iv), and the fact that $\forall\,x\in{\mathbbm{R}}\colon e^{-\frac{x^{2}}{4}}=1+\sum_{i=1}^{\infty}\frac{(-1)^{i}x^{2i}}{4^{i}i!}$ show for all $n\in{\mathbbm{N}}$ that

\displaystyle v^{\infty}(0,0)-v^{n}(0,0)=\sum^{\infty}_{i=\lfloor\frac{n-1}{2}\rfloor+1}\frac{(-1)^{i}\lVert b\rVert^{2i}}{i!4^{i}}=\sum^{\infty}_{i=\lfloor\frac{n+1}{2}\rfloor}\frac{(-1)^{i}\lVert b\rVert^{2i}}{i!4^{i}}.

(29)

Therefore, for all $\epsilon\in(0,1)$ , $n\in{\mathbbm{N}}\cap\bigl{[}\frac{1}{2\epsilon}\lVert b\rVert^{2}-1,\infty\bigr{)}$ , $j\in{\mathbbm{N}}$ with $j=\lfloor\tfrac{n+1}{2}\rfloor$ it holds that $\frac{1}{2\epsilon}\lVert b\rVert^{2}\leq n+1=2\frac{n+1}{2}\leq 2(\lfloor\frac{n+1}{2}\rfloor+1)=2(j+1)$ , $\frac{\lVert b\rVert^{2}}{4(j+1)}\leq\epsilon$ ,

	$\displaystyle(v^{\infty}(0,0)-v^{n}(0,0))(-1)^{j}=\left(\frac{\lVert b\rVert^{2j}}{j!4^{j}}-\frac{\lVert b\rVert^{2(j+1)}}{(j+1)!4^{j+1}}\right)+\left(\frac{\lVert b\rVert^{2(j+2)}}{(j+2)!4^{j+2}}-\frac{\lVert b\rVert^{2(j+3)}}{(j+3)!4^{j+3}}\right)+\ldots$
	$\displaystyle=\frac{\lVert b\rVert^{2j}}{j!4^{j}}\left(1-\frac{\lVert b\rVert^{2}}{4(j+1)}\right)+\frac{\lVert b\rVert^{2(j+2)}}{(j+2)!4^{j+2}}\left(1-\frac{\lVert b\rVert^{2}}{4(j+3)}\right)+\ldots$
	$\displaystyle\geq\frac{\lVert b\rVert^{2j}}{j!4^{j}}\left(1-\frac{\lVert b\rVert^{2}}{4(j+1)}\right)\geq\left(\frac{\lVert b\rVert^{2}}{4}\right)^{j}\frac{1}{j!}(1-\epsilon),$		(30)

and

\displaystyle\begin{split}&\lvert v^{\infty}(0,0)-v^{n}(0,0)\rvert\\ &\leq\sum^{\infty}_{i=j}\frac{\lVert b\rVert^{2i}}{i!4^{i}}\leq\frac{\lVert b\rVert^{2j}}{j!4^{j}}\left(1+\sum_{i=j+1}^{\infty}\left[\frac{\lVert b\rVert^{2}}{4(j+1)}\right]^{{}^{i-j}}\right)\leq\frac{\lVert b\rVert^{2j}}{j!4^{j}}\sum_{\ell=0}^{\infty}\epsilon^{\ell}=\left(\frac{\lVert b\rVert^{2}}{4}\right)^{j}\frac{1}{j!}\frac{1}{1-\epsilon}.\end{split}

(31)

This establishes (v). The proof of Lemma 2.1 is thus completed. ∎

2.2 Lower bounds for the convergence speed of Picard iterations for an example BSDE

Corollary 2.2.

Let $d\in{\mathbbm{N}}$ , $b\in{\mathbbm{R}}^{d}$ , let $\langle\cdot,\cdot\rangle\colon{\mathbbm{R}}^{d}\times{\mathbbm{R}}^{d}\to{\mathbbm{R}}$ denote the standard scalar product on ${\mathbbm{R}}^{d}$ , let $\lVert\cdot\rVert\colon{\mathbbm{R}}^{d}\to[0,\infty)$ denote the standard norm on ${\mathbbm{R}}^{d}$ , let $(\Omega,\mathcal{F},{\mathbb{P}},({\mathbbm{F}}_{t})_{t\in[0,1]})$ be a filtered probability space which satisfies the usual conditions, let $W\colon[0,1]\times\Omega\to{\mathbbm{R}}^{d}$ be a standard $({\mathbbm{F}}_{t})_{t\in[0,1]}$ -Brownian motion with continuous sample paths, let $Y^{n}\colon[0,1]\times\Omega\to{\mathbbm{R}}$ , $n\in{\mathbbm{N}}_{0}\cup\{\infty\}$ , be adapted with continuous sample paths, let $Z^{n}\colon[0,1]\times\Omega\to{\mathbbm{R}}^{d}$ , $n\in{\mathbbm{N}}_{0}\cup\{\infty\}$ , be progressively measurable, and assume for all $s\in[0,1]$ , $n\in{\mathbbm{N}}\cup\{\infty\}$ that a.s. it holds that $Y^{0}_{s}=0$ , $Z^{0}_{s}=0$ , $\int_{0}^{T}{\mathbb{E}}\bigl{[}\lVert Z^{n}_{t}\rVert^{2}\bigr{]}\,dt<\infty$ and

\displaystyle\begin{split}Y_{s}^{n+1}=2^{d/2}e^{-\frac{\lVert W_{1}\rVert^{2}}{2}}+\int_{s}^{1}\langle b,Z^{n}_{t}\rangle\,dt-\int_{s}^{1}\langle Z^{n+1}_{t},dW_{t}\rangle.\end{split}

(32)

Then for all $n\in{\mathbbm{N}}\cap\bigl{[}\lVert b\rVert^{2}-1,\infty\bigr{)}$ it holds a.s. that $\left\lvert Y_{0}^{\infty}-Y^{n}_{0}\right\rvert\geq\frac{1}{2}\left(\frac{\lVert b\rVert^{2}}{4}\right)^{\lfloor\frac{n+1}{2}\rfloor}\frac{1}{\sqrt{n!}}$ .

Proof of Corollary 2.2.

Throughout this proof let $v^{n}\colon[0,1]\times{\mathbbm{R}}^{d}\to{\mathbbm{R}}$ , $n\in{\mathbbm{N}}_{0}\cup\{\infty\}$ , satisfy for all $t\in[0,1]$ , $x\in{\mathbbm{R}}^{d}$ , $n\in{\mathbbm{N}}$ that $v^{0}(t,x)=0$ ,

\displaystyle v^{\infty}(s,x)={\mathbb{E}}\!\left[2^{d/2}\exp\!\left(-\frac{\lVert x+b(1-s)+W_{1}-W_{s}\rVert^{2}}{2}\right)\right],

(33)

and

\displaystyle\begin{split}&v^{n}(s,x)={\mathbb{E}}\!\left[2^{d/2}\exp\!\left(-\frac{\lVert x+W_{1}-W_{s}\rVert^{2}}{2}\right)\right]\\ &+\sum_{k=1}^{n-1}\sum_{\mu_{1},\mu_{2},\ldots,\mu_{k}=1}^{d}\Biggl{[}\frac{(1-s)^{k}}{k!}b_{\mu_{1}}b_{\mu_{2}}\cdots b_{\mu_{k}}{\mathbb{E}}\!\left[2^{d/2}\frac{\partial^{k}}{\partial x_{\mu_{1}}\partial x_{\mu_{2}}\ldots\partial x_{\mu_{k}}}\exp\!\left(-\frac{\lVert x+W_{1}-W_{s}\rVert^{2}}{2}\right)\right]\Biggr{]}.\end{split}

(34)

Then Lemma 2.1 proves

for all $t\in[0,1]$ , $x\in{\mathbbm{R}}^{d}$ that $v^{\infty}\in C^{\infty}([0,1]\times{\mathbbm{R}}^{d},{\mathbbm{R}})$ and

\frac{\partial v^{\infty}}{\partial t}(t,x)+\frac{1}{2}(\mathop{}\!\mathbin{\bigtriangleup}_{x}v^{\infty})(t,x)+\left\langle b,(\nabla_{x}v^{\infty})(t,x)\right\rangle=0,

(35)

for all $s\in[0,1]$ , $x\in{\mathbbm{R}}^{d}$ , $n\in{\mathbbm{N}}_{0}$ that $v^{n}\in C^{\infty}([0,1]\times{\mathbbm{R}}^{d},{\mathbbm{R}})$ and

v^{n+1}(s,x)={\mathbb{E}}\!\left[2^{d/2}\exp\!\left(-\tfrac{\lVert x+W_{1}-W_{s}\rVert^{2}}{2}\right)\right]+\int_{s}^{1}{\mathbb{E}}\!\left[\Bigl{\langle}b,(\nabla_{x}v^{n})(t,x+W_{t}-W_{s})\Bigr{\rangle}\right]\!dt,

(36)

and

for all $n\in{\mathbbm{N}}\cap\bigl{[}\lVert b\rVert^{2}-1,\infty\bigr{)}$ that

\lvert v^{n}(0,0)-v^{\infty}(0,0)\rvert\geq\frac{1}{2}\left(\frac{\lVert b\rVert^{2}}{4}\right)^{\lfloor\frac{n+1}{2}\rfloor}\frac{1}{\lfloor\frac{n+1}{2}\rfloor!}.

(37)

This and Itô’s formula prove that for all $s\in[0,1]$ it holds a.s. that

\displaystyle\begin{split}&2^{d/2}e^{-\frac{\lVert W_{1}\rVert^{2}}{2}}-v^{\infty}(s,W_{s})=v^{\infty}(1,W_{1})-v^{\infty}(s,W_{s})\\ &=\int_{s}^{1}\left(\frac{\partial v^{\infty}}{\partial t}+\frac{1}{2}\mathop{}\!\mathbin{\bigtriangleup}_{x}v^{\infty}\right)\!(t,W_{t})\,dt+\int_{s}^{1}\left\langle(\nabla_{x}v^{\infty})(t,W_{t}),dW_{t}\right\rangle\\ &=-\int_{s}^{1}\left\langle b,(\nabla_{x}v^{\infty})(t,W_{t})\right\rangle dt+\int_{s}^{1}\left\langle(\nabla_{x}v^{\infty})(t,W_{t}),dW_{t}\right\rangle.\end{split}

(38)

This, (32), and a standard result on uniqueness of backward stochastic differential equations (cf., e.g., [21, Theorem 4.3.1]) prove for all $s\in[0,1]$ that ${\mathbb{P}}\bigl{(}Y^{\infty}_{s}=v^{\infty}(s,W_{s})\bigr{)}=1$ and ${\mathbb{P}}\bigl{(}Z^{\infty}_{s}=(\nabla_{x}v^{\infty})(s,W_{s})\bigr{)}=1$ .

Next, we prove by induction on $n\in{\mathbbm{N}}_{0}$ that for all $n\in{\mathbbm{N}}_{0}$ , $s\in[0,1]$ it holds that ${\mathbb{P}}\bigl{(}Y^{n}_{s}=v^{n}(s,W_{s})\bigr{)}=1$ and ${\mathbb{P}}\bigl{(}Z^{n}_{s}=(\nabla_{x}v^{n})(s,W_{s})\bigr{)}=1$ . First, the fact that $\forall\,t\in[0,1],x\in{\mathbbm{R}}^{d}\colon v^{0}(t,x)=0$ and the fact that $\forall\,s\in[0,1]\colon{\mathbb{P}}\bigl{(}(Y^{0}_{s},Z^{0}_{s})=(0,0)\bigr{)}=1$ establish the base case $n=0$ . For the induction step ${\mathbbm{N}}_{0}\ni n\mapsto n+1\in{\mathbbm{N}}$ let $n\in{\mathbbm{N}}_{0}$ satisfy for all $s\in[0,1]$ that ${\mathbb{P}}\bigl{(}Y^{n}_{s}=v^{n}(s,W_{s})\bigr{)}=1$ and ${\mathbb{P}}\bigl{(}Z^{n}_{s}=(\nabla_{x}v^{n})(s,W_{s})\bigr{)}=1$ . This, (36), the Markov property of $W$ , the fact that for all $s\in[0,1]$ it holds a.s. that ${\mathbb{E}}\bigl{[}\int_{s}^{1}\langle{Z}^{n+1}_{t},dW_{t}\rangle|{\mathbbm{F}}_{s}\bigr{]}=0$ , (32), and adaptedness of $Y^{n+1}$ imply that for all $s\in[0,1]$ it holds a.s. that

\begin{split}&v^{n+1}(s,W_{s})={\mathbb{E}}\Bigl{[}2^{d/2}e^{-\frac{\lVert W_{1}\rVert^{2}}{2}}\big{|}{\mathbbm{F}}_{s}\Bigr{]}+\int_{s}^{1}{\mathbb{E}}\!\left[\Bigl{\langle}b,(\nabla_{x}v^{n})(t,W_{t})\Bigr{\rangle}\big{|}{\mathbbm{F}}_{s}\right]\!dt\\ &={\mathbb{E}}\!\left[2^{d/2}e^{-\frac{\lVert W_{1}\rVert^{2}}{2}}\big{|}{\mathbbm{F}}_{s}\right]+\int_{s}^{1}{\mathbb{E}}\bigl{[}\langle b,{Z}^{n}_{t}\rangle\big{|}{\mathbbm{F}}_{s}\bigr{]}dt\\ &={\mathbb{E}}\!\left[2^{d/2}e^{-\frac{\lVert W_{1}\rVert^{2}}{2}}+\int_{s}^{1}\langle b,Z^{n}_{t}\rangle\,dt+\int_{s}^{1}\langle{Z}^{n+1}_{t},dW_{t}\rangle\middle|{\mathbbm{F}}_{s}\right]={\mathbb{E}}\!\left[Y_{s}^{n+1}\middle|{\mathbbm{F}}_{s}\right]=Y_{s}^{n+1}.\end{split}

(39)

This, Itô’s formula, the fact that $v^{n+1}\in C^{2}([0,1]\times{\mathbbm{R}}^{d},{\mathbbm{R}})$ , (34), and (32) show that for all $s\in[0,1]$ it holds a.s. that

\displaystyle\begin{split}&0=v^{n+1}(s,W_{s})-Y_{s}^{n+1}=-\int_{s}^{1}\left[\left(\frac{\partial v^{n+1}}{\partial t}+\frac{1}{2}\mathop{}\!\mathbin{\bigtriangleup}_{x}v^{n+1}\right)\!(t,W_{t})-\left\langle b,Z^{n}_{t}\right\rangle\right]dt\\ &\qquad\qquad\qquad-\int_{s}^{1}\left\langle(\nabla_{x}v^{n+1})(t,W_{t})-Z^{n+1}_{t},dW_{t}\right\rangle.\end{split}

(40)

This and the uniqueness of the decomposition of continuous semimartingales show for all $s\in[0,1]$ that ${\mathbb{P}}\bigl{(}Y^{n+1}_{s}=v^{n+1}(s,W_{s})\bigr{)}=1$ and ${\mathbb{P}}\bigl{(}Z^{n+1}_{s}=(\nabla_{x}v^{n+1})(s,W_{s})\bigr{)}=1$ . This completes the induction step. Induction thus shows for all $n\in{\mathbbm{N}}_{0}$ , $s\in[0,1]$ that ${\mathbb{P}}\bigl{(}Y^{n}_{s}=v^{n}(s,W_{s})\bigr{)}=1$ and ${\mathbb{P}}\bigl{(}Z^{n}_{s}=(\nabla_{x}v^{n})(s,W_{s})\bigr{)}=1$ . This and the fact that ${\mathbb{P}}\bigl{(}Y^{\infty}_{0}=v^{\infty}(0,W_{0})\bigr{)}=1$ imply that for all $n\in{\mathbbm{N}}$ it holds a.s. that $Y^{n}_{0}-Y_{0}^{\infty}=v^{n}(0,0)-v^{\infty}(0,0)$ . This, the fact that for all $k,n\in{\mathbbm{N}}_{0}$ with $n=2k$ it holds that

\begin{split}&\left\lfloor\frac{n+1}{2}\right\rfloor!=\left\lfloor\frac{2k+1}{2}\right\rfloor!=k!\\ &=1\cdot 2\cdots k\leq\sqrt{1\cdot 2\cdots k\cdot(k+1)(k+2)\cdots(2k)}=\sqrt{(2k)!}=\sqrt{n!},\end{split}

(41)

the fact that for all $k,n\in{\mathbbm{N}}_{0}$ with $n=2k+1$ it holds that

\begin{split}&\left\lfloor\frac{n+1}{2}\right\rfloor!=\left\lfloor\frac{2k+1+1}{2}\right\rfloor!=(k+1)!\\ &=2\cdot 3\cdots(k+1)\leq\sqrt{2\cdot 3\cdots(k+1)(k+2)(k+3)\cdots(2k+1)}=\sqrt{n!},\end{split}

(42)

and (37) imply that for all $n\in{\mathbbm{N}}\cap\bigl{[}\lVert b\rVert^{2}-1,\infty\bigr{)}$ it holds a.s. that

\left\lvert Y_{0}^{n}-Y_{0}^{\infty}\right\rvert=\left\lvert v^{n}(0,0)-v^{\infty}(0,0)\right\rvert\geq\frac{1}{2}\left(\frac{\lVert b\rVert^{2}}{4}\right)^{\lfloor\frac{n+1}{2}\rfloor}\frac{1}{\sqrt{n!}}.

(43)

This completes the proof of Corollary 2.2. ∎

The following Lemma 2.3 gives an example where the BSDE solution is an expoential function (see Item (i)) and the Picard approximations are just partial sums of the exponential series (see Item (ii)). Thereby, Lemma 2.3 shows that factorial speed of convergence cannot be improved up to exponential factors in the case of $y$ -dependent drivers.

Lemma 2.3.

Let $T\in(0,\infty)$ and let $Y^{k}\in C([0,T],{\mathbbm{R}})$ , $k\in{\mathbbm{N}}_{0}\cup\{\infty\}$ , satisfy for all $k\in{\mathbbm{N}}_{0}$ , $s\in[0,T]$ that $Y_{s}^{0}=1$ , $Y_{s}^{k+1}=1+\int_{s}^{T}Y_{r}^{k}\,dr$ , and $Y_{s}^{\infty}=1+\int_{s}^{T}Y_{r}^{\infty}dr$ . Then

i)

it holds for all $s\in[0,T]$ that $Y_{s}^{\infty}=e^{T-s}$ ,
ii)

it holds for all $s\in[0,T]$ , $n\in{\mathbbm{N}}_{0}$ that $Y_{s}^{n}=1+\sum_{k=1}^{n}\frac{(T-s)^{k}}{k!}$ , and
iii)

it holds that $\sup_{s\in[0,T]}\lvert Y_{s}^{\infty}-Y_{s}^{n}\rvert=\sum_{k=n+1}^{\infty}\frac{T^{k}}{k!}\geq\frac{T^{n+1}}{(n+1)!}$ .

Proof of Lemma 2.3.

The fact that $\forall\,s\in[0,T]\colon Y_{s}^{\infty}=1+\int_{s}^{T}Y_{r}^{\infty}\,dr$ and the substitution rule show for all $s\in[0,T]$ that $Y^{\infty}_{T-s}=1+\int_{T-s}^{T}Y_{r}^{\infty}\,dr=1+\int_{0}^{s}Y_{T-r}^{\infty}\,dr$ . This, the fact that $\forall\,s\in[0,T]\colon e^{T-s}=1+\int_{0}^{s}e^{T-r}\,dr$ , and the Picard–Lindelöf theorem show (i). Next, we prove (ii) by induction on $n\in{\mathbbm{N}}_{0}$ . The fact that $\forall\,s\in[0,T]\colon Y_{s}^{0}=1$ shows the base case $n=0$ . For the induction step ${\mathbbm{N}}_{0}\ni n\mapsto n+1\in{\mathbbm{N}}$ let $n\in{\mathbbm{N}}_{0}$ and assume for all $s\in[0,T]$ that $Y_{s}^{n}=1+\sum_{k=1}^{n}\frac{(T-s)^{k}}{k!}$ . The assumptions of Lemma 2.3 then show for all $s\in[0,T]$ that

\displaystyle\begin{split}&Y_{s}^{n+1}=1+\int_{s}^{T}Y_{r}^{n}\,dr=1+\int_{s}^{T}\left(1+\sum_{k=1}^{n}\frac{(T-r)^{k}}{k!}\right)dr\\ &=1+\sum_{k=1}^{n+1}\frac{-(T-r)^{k+1}}{(k+1)!}\Bigr{|}_{r=s}^{T}=1+\sum_{k=1}^{n+1}\frac{(T-s)^{k}}{k!}.\end{split}

(44)

This completes the induction step. Induction hence shows (ii). Combining (i), (ii), and the fact that $\forall\,x\in{\mathbbm{R}}\colon e^{x}=1+\sum_{k=1}^{\infty}\frac{x^{k}}{k!}$ yields (iii). The proof of Lemma 2.3 is thus completed. ∎

3 A priori estimates for backward Itô processes

In this section we establish a priori estimates for certain backward Itô processes. Results of this form are well-known in the scientific literature on BSDEs (see, e.g., [17, Proof of Theorem 3.1], [8, Proposition 2.1], [21, Theorem 4.2.1], [18, Proposition 5.2]). Lemma 3.1 below establishes estimates for an Itô process and its diffusion process in terms of the drift process and in terms of the terminal value of the Itô process. The contribution of Lemma 3.1 is to provide explicit universal constants. Moreover, the Itô process in Lemma 3.1 and its drift process are not assumed to be square-integrable and, in particular, the right-hand sides of (46), (47), and (LABEL:eq:b03) are allowed to be infinite (with positive probability). We note that square-integrability of the diffusion process $Z$ in Lemma 3.1, however, is in general required; e.g., choose $A\equiv 0$ and $Z$ such that the Itô isometry does not hold for the Itô integral $Y_{T}$ .

Lemma 3.1.

Let $T\in(0,\infty)$ , $d,m\in{\mathbbm{N}}$ , let $\langle\cdot,\cdot\rangle\colon{\mathbbm{R}}^{d}\times{\mathbbm{R}}^{d}\to{\mathbbm{R}}$ denote the standard scalar product on ${\mathbbm{R}}^{d}$ , let $\lVert\cdot\rVert\colon{\mathbbm{R}}^{d}\to[0,\infty)$ denote the standard norm on ${\mathbbm{R}}^{d}$ , let $\lVert\cdot\rVert_{\mathsf{F}}\colon{\mathbbm{R}}^{d\times m}\to[0,\infty)$ denote the Frobenius norm on ${\mathbbm{R}}^{d\times m}$ , let $(\Omega,\mathcal{F},{\mathbb{P}},({\mathbbm{F}}_{t})_{t\in[0,T]})$ be a filtered probability space which satisfies the usual conditions, let $W\colon[0,T]\times\Omega\to{\mathbbm{R}}^{m}$ be a standard $({\mathbbm{F}}_{t})_{t\in[0,T]}$ -Brownian motion with continuous sample paths, let $Y\colon[0,T]\times\Omega\to{\mathbbm{R}}^{d}$ be adapted with continuous sample paths, let $A\colon[0,T]\times\Omega\to{\mathbbm{R}}^{d}$ be measurable, let $Z\colon[0,T]\times\Omega\to{\mathbbm{R}}^{d\times m}$ be progressively measurable, and assume that for all $s\in[0,T]$ it holds a.s. that

\displaystyle\int_{0}^{T}\Bigl{(}\lVert A_{t}\rVert+{\mathbb{E}}[\lVert Z_{t}\rVert_{\mathsf{F}}^{2}]\Bigr{)}\,dt<\infty\quad\text{and}\quad Y_{s}=Y_{T}+\int_{s}^{T}A_{t}\,dt-\int_{s}^{T}Z_{t}\,dW_{t}.

(45)

Then

(i)

for all $s\in[0,T]$ , $\lambda\in(0,\infty)$ it holds a.s. that

\displaystyle{\mathbb{E}}\!\left[e^{\lambda s}\lVert Y_{s}\rVert^{2}+\int_{s}^{T}e^{\lambda t}\lVert Z_{t}\rVert_{\mathsf{F}}^{2}\,dt\Big{|}{\mathbbm{F}}_{s}\right]\leq{\mathbb{E}}\!\left[e^{\lambda T}\lVert Y_{T}\rVert^{2}+\int_{s}^{T}\frac{e^{\lambda t}}{\lambda}\lVert A_{t}\rVert^{2}\,dt\Big{|}{\mathbbm{F}}_{s}\right],

(46)

(ii)

for all $s\in[0,T]$ , $\lambda\in(0,\infty)$ it holds a.s. that

\displaystyle{\mathbb{E}}\!\left[\sup_{t\in[s,T]}\left(e^{\lambda t}\lVert Y_{t}\rVert^{2}+\int_{t}^{T}e^{\lambda u}\lVert Z_{u}\rVert_{\mathsf{F}}^{2}\,du\right)\middle|{\mathbbm{F}}_{s}\right]

\displaystyle\leq 34{\mathbb{E}}\!\left[e^{\lambda T}\lVert Y_{T}\rVert^{2}+\int_{s}^{T}\frac{e^{\lambda t}}{\lambda}\lVert A_{t}\rVert^{2}\,dt\middle|{\mathbbm{F}}_{s}\right],

(47)

and

(iii)

it holds for all $\alpha,\lambda\in(0,\infty)$ that

\displaystyle\begin{split}&\int_{0}^{T}\left[\frac{t^{\alpha-1}e^{\lambda t}{\mathbb{E}}\!\left[\left\lVert Y_{t}\right\rVert^{2}\right]}{\Gamma(\alpha)}+\frac{t^{\alpha}e^{\lambda t}{\mathbb{E}}\!\left[\left\lVert Z_{t}\right\rVert_{\mathsf{F}}^{2}\right]}{\Gamma(\alpha+1)}\right]dt\leq\frac{e^{\lambda T}T^{\alpha}{\mathbb{E}}\!\left[\lVert Y_{T}\rVert^{2}\right]}{\Gamma(\alpha+1)}+\frac{1}{\lambda}\int_{0}^{T}\frac{e^{\lambda t}t^{\alpha}{\mathbb{E}}\!\left[\lVert A_{t}\rVert^{2}\right]}{\Gamma(\alpha+1)}dt.\end{split}

(48)

Proof of Lemma 3.1.

Throughout this proof for every $s\in[0,T]$ let $B_{s}\in\mathbb{F}_{s}$ satisfy that a.s. on $B_{s}$ it holds that ${\mathbb{E}}\!\left[\|Y_{T}\|^{2}+\int_{s}^{T}\|A_{t}\|^{2}\,dt\Big{|}{\mathbbm{F}}_{s}\right]<\infty$ and a.s. on $\Omega\setminus B_{s}$ it holds that ${\mathbb{E}}\!\left[\|Y_{T}\|^{2}+\int_{s}^{T}\|A_{t}\|^{2}\,dt\Big{|}{\mathbbm{F}}_{s}\right]=\infty$ , let $\{e_{1},e_{2},\ldots,e_{m}\}\subseteq{\mathbbm{R}}^{m}$ be an orthonormal basis of ${\mathbbm{R}}^{m}$ , and let $\alpha,\lambda\in(0,\infty)$ . First note that (45), Jensen’s inequality, and the Burkholder-Davis-Gundy inequality (see, e.g., [5, Lemma 7.2]) yield that for all $s\in[0,T]$ it holds a.s. on $B_{s}$ that

\begin{split}{\mathbb{E}}\!\left[\sup_{t\in[s,T]}\lVert Y_{t}\rVert^{2}\Big{|}\mathbb{F}_{s}\right]&\leq 3\left(\lVert Y_{s}\rVert^{2}+{\mathbb{E}}\!\left[\left(\int_{s}^{T}\lVert A_{t}\rVert\,dt\right)^{2}+\sup_{u\in[s,T]}\left\lVert\int_{s}^{u}Z_{t}\,dW_{t}\right\rVert^{2}\Big{|}\mathbb{F}_{s}\right]\right)\\ &\leq 12\left(\lVert Y_{s}\rVert^{2}+{\mathbb{E}}\left[T\int_{s}^{T}\lVert A_{t}\rVert^{2}\,dt\Big{|}\mathbb{F}_{s}\right]+{\mathbb{E}}\left[\int_{s}^{T}\lVert Z_{t}\rVert_{\mathsf{F}}^{2}\,dt\Big{|}\mathbb{F}_{s}\right]\right)<\infty.\end{split}

(49)

This, the $L^{1}$ -Burkholder-Davis-Gundy inequality (e.g., [20, Theorem 1]), the Cauchy-Schwarz inequality, and Hölder’s inequality imply that for all $s\in[0,T]$ it holds a.s. on $B_{s}$ that

\begin{split}&{\mathbb{E}}\!\left[\sup_{u\in[s,T]}\left\lvert\int_{s}^{u}e^{\lambda t}\langle Y_{t},Z_{t}\,dW_{t}\rangle\right\rvert\Big{|}\mathbb{F}_{s}\right]\leq\sqrt{8}{\mathbb{E}}\left[\left(\int_{s}^{T}e^{2\lambda t}\left[\sum_{i=1}^{m}\left\lvert\left\langle Y_{t},Z_{t}e_{i}\right\rangle\right\rvert^{2}\right]dt\right)^{\nicefrac{{1}}{{2}}}\Big{|}\mathbb{F}_{s}\right]\\ &\leq\sqrt{8}{\mathbb{E}}\!\left[\left(\int_{s}^{T}e^{\lambda t}\left\lVert Z_{t}\right\rVert_{\mathsf{F}}^{2}\,dt\right)^{\!\nicefrac{{1}}{{2}}}\left(\sup_{t\in[s,T]}e^{\lambda t}\lVert Y_{t}\rVert^{2}\right)^{\!\nicefrac{{1}}{{2}}}\Big{|}\mathbb{F}_{s}\right]\\ &\leq\sqrt{8}\left({\mathbb{E}}\!\left[\int_{s}^{T}e^{\lambda t}\left\lVert Z_{t}\right\rVert_{\mathsf{F}}^{2}\,dt\Big{|}\mathbb{F}_{s}\right]{\mathbb{E}}\!\left[\sup_{t\in[s,T]}e^{\lambda t}\lVert Y_{t}\rVert^{2}\Big{|}\mathbb{F}_{s}\right]\right)^{\frac{1}{2}}.\end{split}

(50)

This, (45), and (49) yield that for all $s\in[0,T]$ it holds a.s. that $\big{(}\mathbbm{1}_{B_{s}}\int_{s}^{u}e^{\lambda t}\langle Y_{t},Z_{t}\,dW_{t}\rangle\big{)}_{u\in[s,T]}$ is a martingale with respect to ${\mathbb{P}}(\cdot|\mathbb{F}_{s})$ and

\begin{split}{\mathbb{E}}\!\left[\mathbbm{1}_{B_{s}}\int_{s}^{T}e^{\lambda t}\langle Y_{t},Z_{t}\,dW_{t}\rangle\Big{|}{\mathbbm{F}}_{s}\right]=0.\end{split}

(51)

Next note that (45) and Itô’s formula show that for all $s\in[0,T]$ it holds a.s. that

\displaystyle\begin{split}&e^{\lambda T}\lVert Y_{T}\rVert^{2}-e^{\lambda s}\lVert Y_{s}\rVert^{2}=\int_{s}^{T}d(e^{\lambda t}\lVert Y_{t}\rVert^{2})=\int_{s}^{T}\lambda e^{\lambda t}\lVert Y_{t}\rVert^{2}\,dt+\int_{s}^{T}e^{\lambda t}d(\lVert Y_{t}\rVert^{2})\\ &=\int_{s}^{T}\lambda e^{\lambda t}\lVert Y_{t}\rVert^{2}\,dt+\int_{s}^{T}e^{\lambda t}\left(\lVert Z_{t}\rVert_{\mathsf{F}}^{2}-2\langle Y_{t},A_{t}\rangle\right)dt+\int_{s}^{T}2e^{\lambda t}\langle Y_{t},Z_{t}dW_{t}\rangle\\ &=\int_{s}^{T}e^{\lambda t}\lVert Z_{t}\rVert_{\mathsf{F}}^{2}\,dt+\int_{s}^{T}\tfrac{e^{\lambda t}}{\lambda}\bigl{[}\lVert\lambda Y_{t}-A_{t}\rVert^{2}-\lVert A_{t}\rVert^{2}\bigr{]}\,dt+\int_{s}^{T}2e^{\lambda t}\langle Y_{t},Z_{t}dW_{t}\rangle.\end{split}

(52)

This shows that for all $s\in[0,T]$ it holds a.s. that

\displaystyle\begin{split}&e^{\lambda s}\lVert Y_{s}\rVert^{2}+\int_{s}^{T}e^{\lambda t}\lVert Z_{t}\rVert_{\mathsf{F}}^{2}\,dt+\int_{s}^{T}\tfrac{e^{\lambda t}}{\lambda}\lVert\lambda Y_{t}-A_{t}\rVert^{2}\,dt\\ &=e^{\lambda T}\lVert Y_{T}\rVert^{2}+\int_{s}^{T}\tfrac{e^{\lambda t}}{\lambda}\lVert A_{t}\rVert^{2}\,dt-\int_{s}^{T}2e^{\lambda t}\langle Y_{t},Z_{t}dW_{t}\rangle.\end{split}

(53)

This and (51) show that for all $s\in[0,T]$ it holds a.s. on $B_{s}$ that

\displaystyle{\mathbb{E}}\!\left[e^{\lambda s}\lVert Y_{s}\rVert^{2}+\int_{s}^{T}e^{\lambda t}\lVert Z_{t}\rVert_{\mathsf{F}}^{2}\,dt\Big{|}{\mathbbm{F}}_{s}\right]\leq{\mathbb{E}}\!\left[e^{\lambda T}\lVert Y_{T}\rVert^{2}+\int_{s}^{T}\tfrac{e^{\lambda t}}{\lambda}\lVert A_{t}\rVert^{2}\,dt\Big{|}{\mathbbm{F}}_{s}\right].

(54)

This and the definition of $B_{s}$ , $s\in[0,T]$ , prove (i).

Next observe that (LABEL:eq:WIntegralYZ), (51) and (i) yield that for all $s\in[0,T]$ it holds a.s. on $B_{s}$ that

\begin{split}&{\mathbb{E}}\bigg{[}\sup_{t\in[s,T]}\left(-2\int_{t}^{T}e^{\lambda u}\langle Y_{u},Z_{u}dW_{u}\rangle\right)\Big{|}\mathbb{F}_{s}\bigg{]}\\ &=2{\mathbb{E}}\bigg{[}\sup_{t\in[s,T]}\left(\int_{s}^{t}e^{\lambda u}\langle Y_{u},Z_{u}\,dW_{u}\rangle\right)\big{|}\mathbb{F}_{s}\bigg{]}-2{\mathbb{E}}\bigg{[}\int_{s}^{T}e^{\lambda u}\langle Y_{u},Z_{u}\,dW_{u}\rangle\big{|}\mathbb{F}_{s}\bigg{]}\\ &\leq 2\sqrt{8}\left({\mathbb{E}}\!\left[\int_{s}^{T}e^{\lambda t}\left\lVert Z_{t}\right\rVert^{2}_{\mathsf{F}}\,dt\big{|}\mathbb{F}_{s}\right]{\mathbb{E}}\!\left[\sup_{t\in[s,T]}e^{\lambda t}\|Y_{t}\|^{2}\big{|}\mathbb{F}_{s}\right]\right)^{\frac{1}{2}}.\\ &\leq 2\sqrt{8}\left({\mathbb{E}}\!\left[e^{\lambda T}\|Y_{T}\|^{2}+\int_{s}^{T}\tfrac{e^{\lambda t}}{\lambda}\|A_{t}\|^{2}\,dt\big{|}\mathbb{F}_{s}\right]{\mathbb{E}}\!\left[\sup_{t\in[s,T]}e^{\lambda t}\|Y_{t}\|^{2}\big{|}\mathbb{F}_{s}\right]\right)^{\frac{1}{2}}.\end{split}

(55)

This, (53), (45), and (49) yield that for all $s\in[0,T]$ it holds a.s. on $B_{s}$ that

\begin{split}&{\mathbb{E}}\!\left[\sup_{t\in[s,T]}\left(e^{\lambda t}\lVert Y_{t}\rVert^{2}+\int_{t}^{T}e^{\lambda u}\lVert Z_{u}\rVert^{2}_{\mathsf{F}}\,du\right)\big{|}\mathbb{F}_{s}\right]\\ &\leq{\mathbb{E}}\!\left[e^{\lambda T}\lVert Y_{T}\rVert^{2}+\int_{s}^{T}\frac{e^{\lambda t}}{\lambda}\lVert A_{t}\rVert^{2}\,dt\big{|}\mathbb{F}_{s}\right]+{\mathbb{E}}\left[\sup_{t\in[s,T]}\left(-2\int_{t}^{T}e^{\lambda u}\langle Y_{u},Z_{u}\,dW_{u}\rangle\right)\big{|}\mathbb{F}_{s}\right]\\ &\leq{\mathbb{E}}\!\left[e^{\lambda T}\lVert Y_{T}\rVert^{2}+\int_{s}^{T}\frac{e^{\lambda t}}{\lambda}\lVert A_{t}\rVert^{2}\,dt\big{|}\mathbb{F}_{s}\right]+2\sqrt{8}\left({\mathbb{E}}\left[e^{\lambda T}\lVert Y_{T}\rVert^{2}+\int_{s}^{T}\frac{e^{\lambda t}}{\lambda}\lVert A_{t}\rVert^{2}\,dt\big{|}\mathbb{F}_{s}\right]\right)^{\frac{1}{2}}\\ &\qquad\cdot\left({\mathbb{E}}\!\left[\sup_{t\in[s,T]}\left(e^{\lambda t}\lVert Y_{t}\rVert^{2}+\int_{t}^{T}e^{\lambda u}\lVert Z_{u}\rVert^{2}_{\mathsf{F}}\,du\right)\big{|}{\mathbbm{F}}_{s}\right]\right)^{\frac{1}{2}}<\infty.\end{split}

(56)

This and the fact that $\forall\,x,c\in[0,\infty)\colon\bigl{(}\bigl{[}x\leq c+2\sqrt{8}\sqrt{c}\sqrt{x}\bigr{]}\Rightarrow\bigl{[}x\leq\bigl{(}\sqrt{8}+\sqrt{8+1}\bigr{)}^{2}c\leq 34c\bigr{]}\bigr{)}$ imply for all $s\in[0,T]$ that a.s. on $B_{s}$ it holds that

\displaystyle{\mathbb{E}}\!\left[\sup_{t\in[s,T]}\left(e^{\lambda t}\lVert Y_{t}\rVert^{2}+\int_{t}^{T}e^{\lambda u}\lVert Z_{u}\rVert_{\mathsf{F}}^{2}\,du\right)\middle|{\mathbbm{F}}_{s}\right]

\displaystyle\leq 34{\mathbb{E}}\!\left[e^{\lambda T}\lVert Y_{T}\rVert^{2}+\int_{s}^{T}\frac{e^{\lambda t}}{\lambda}\lVert A_{t}\rVert^{2}\,dt\middle|{\mathbbm{F}}_{s}\right].

(57)

This and the definition of $B_{s}$ , $s\in[0,T]$ , prove (ii).

Next, the fact that $\forall\,t\in[0,T]\colon\frac{t^{\alpha}}{\Gamma(\alpha+1)}=\int_{0}^{t}\frac{s^{\alpha-1}\,ds}{\Gamma(\alpha)}$ , Tonelli’s theorem, the tower property, and (i) show that

	$\displaystyle\int_{0}^{T}\left[\frac{t^{\alpha-1}}{\Gamma(\alpha)}e^{\lambda t}{\mathbb{E}}\!\left[\left\lVert Y_{t}\right\rVert^{2}\right]+\frac{t^{\alpha}e^{\lambda t}}{\Gamma(\alpha+1)}{\mathbb{E}}\!\left[\left\lVert Z_{t}\right\rVert_{\mathsf{F}}^{2}\right]\right]\!dt$
	$\displaystyle=\int_{0}^{T}\frac{t^{\alpha-1}}{\Gamma(\alpha)}e^{\lambda t}{\mathbb{E}}\!\left[\left\lVert Y_{t}\right\rVert^{2}\right]\!dt+\int_{0}^{T}\int_{0}^{t}\frac{s^{\alpha-1}e^{\lambda t}}{\Gamma(\alpha)}{\mathbb{E}}\!\left[\left\lVert Z_{t}\right\rVert_{\mathsf{F}}^{2}\right]\!dsdt$
	$\displaystyle=\int_{0}^{T}\frac{s^{\alpha-1}}{\Gamma(\alpha)}e^{\lambda s}{\mathbb{E}}\!\left[\left\lVert Y_{s}\right\rVert^{2}\right]\!ds+\int_{0}^{T}\int_{s}^{T}\frac{s^{\alpha-1}e^{\lambda t}}{\Gamma(\alpha)}{\mathbb{E}}\!\left[\left\lVert Z_{t}\right\rVert_{\mathsf{F}}^{2}\right]\!dtds$
	$\displaystyle=\int_{0}^{T}\frac{s^{\alpha-1}}{\Gamma(\alpha)}{\mathbb{E}}\!\left[{\mathbb{E}}\!\left[e^{\lambda s}\left\lVert Y_{s}\right\rVert^{2}+\int_{s}^{T}e^{\lambda t}\left\lVert Z_{t}\right\rVert_{\mathsf{F}}^{2}dt\Bigr{\|}{\mathbbm{F}}_{s}\right]\right]\!ds$
	$\displaystyle\leq\int_{0}^{T}\frac{s^{\alpha-1}}{\Gamma(\alpha)}{\mathbb{E}}\!\left[{\mathbb{E}}\!\left[e^{\lambda T}\lVert Y_{T}\rVert^{2}+\int_{s}^{T}\frac{e^{\lambda t}}{\lambda}\lVert A_{t}\rVert^{2}\,dt\Big{\|}{\mathbbm{F}}_{s}\right]\right]\!ds$
	$\displaystyle=e^{\lambda T}{\mathbb{E}}\!\left[\lVert Y_{T}\rVert^{2}\right]\left(\int_{0}^{T}\frac{s^{\alpha-1}\,ds}{\Gamma(\alpha)}\right)+\int_{0}^{T}\int_{0}^{t}\frac{s^{\alpha-1}}{\Gamma(\alpha)}\frac{e^{\lambda t}}{\lambda}{\mathbb{E}}\!\left[\lVert A_{t}\rVert^{2}\right]dsdt$
	$\displaystyle=\frac{e^{\lambda T}T^{\alpha}{\mathbb{E}}\!\left[\lVert Y_{T}\rVert^{2}\right]}{\Gamma(\alpha+1)}+\frac{1}{\lambda}\int_{0}^{T}\frac{e^{\lambda t}t^{\alpha}{\mathbb{E}}\!\left[\lVert A_{t}\rVert^{2}\right]}{\Gamma(\alpha+1)}\,dt.$		(58)

This shows (iii). The proof of Lemma 3.1 is thus completed. ∎

4 Upper bounds for the convergence speed of Picard iterations

In this section we provide upper bounds for the convergence speed of Picard iterations of BSDEs. Proposition 4.1 establishes an explicit bound for the $L^{2}$ -distance between the Picard iterations and the solution of a BSDE with a globally Lipschitz continuous nonlinearity. Our proof of Proposition 4.1 relies on the a priori estimates for backward Itô processes provided in Lemma 3.1. In Remark 4.2 we employ the estimate of Proposition 4.1 to obtain the square root-factorial speed of convergence of Picard iterations. In Remark 4.3 we employ the estimate of Proposition 4.1 to obtain the factorial speed of convergence of Picard iterations in the z-independent case.

Proposition 4.1.

Let $T\in(0,\infty)$ , $d,m\in{\mathbbm{N}}$ , $L_{\mathfrak{y}},L_{\mathfrak{z}}\in[0,\infty)$ , let $0^{0}=1$ , let $\langle\cdot,\cdot\rangle\colon{\mathbbm{R}}^{d}\times{\mathbbm{R}}^{d}\to{\mathbbm{R}}$ denote the standard scalar product on ${\mathbbm{R}}^{d}$ , let $\lVert\cdot\rVert\colon{\mathbbm{R}}^{d}\to[0,\infty)$ denote the standard norm on ${\mathbbm{R}}^{d}$ , let $\lVert\cdot\rVert_{\mathsf{F}}\colon{\mathbbm{R}}^{d\times m}\to[0,\infty)$ denote the Frobenius norm on ${\mathbbm{R}}^{d\times m}$ , let $(\Omega,\mathcal{F},{\mathbb{P}},({\mathbbm{F}}_{t})_{t\in[0,T]})$ be a filtered probability space which satisfies the usual conditions, let $f\colon[0,T]\times\Omega\times{\mathbbm{R}}^{d}\times{\mathbbm{R}}^{d\times m}\to{\mathbbm{R}}^{d}$ be measurable, assume that for all $t\in[0,T]$ , $y,\tilde{y}\in{\mathbbm{R}}^{d}$ , $z,\tilde{z}\in{\mathbbm{R}}^{d\times m}$ it holds a.s. that

\begin{split}\left\lVert f(t,y,z)-f(t,\tilde{y},\tilde{z})\right\rVert\leq L_{\mathfrak{y}}\lVert y-\tilde{y}\rVert+L_{\mathfrak{z}}\lVert z-\tilde{z}\rVert_{\mathsf{F}},\end{split}

(59)

let $W\colon[0,T]\times\Omega\to{\mathbbm{R}}^{m}$ be a standard $({\mathbbm{F}}_{t})_{t\in[0,T]}$ -Brownian motion with continuous sample paths, let $\xi\colon\Omega\to{\mathbbm{R}}^{d}$ be ${\mathbbm{F}}_{T}$ -measurable, let $Y^{k}\colon[0,T]\times\Omega\to{\mathbbm{R}}^{d}$ , $k\in{\mathbbm{N}}_{0}\cup\{\infty\}$ , be adapted with continuous sample paths, let $Z^{k}\colon[0,T]\times\Omega\to{\mathbbm{R}}^{d\times m}$ , $k\in{\mathbbm{N}}_{0}\cup\{\infty\}$ , be progressively measurable, and assume that for all $s\in[0,T]$ , $k\in{\mathbbm{N}}_{0}\cup\{\infty\}$ it holds a.s. that $\int_{0}^{T}{\mathbb{E}}[\lVert\xi\rVert^{2}+\lVert f(t,0,0)\rVert^{2}+\lVert Y^{\infty}_{t}\rVert^{2}+\lVert Z_{t}^{k}\rVert^{2}_{\mathsf{F}}]\,dt<\infty$ , $Y^{0}_{s}=0$ , $Z^{0}_{s}=0$ , and

\displaystyle Y^{k+1}_{s}=\xi+\int_{s}^{T}f(t,Y_{t}^{k},Z_{t}^{k})\,dt-\int_{s}^{T}Z^{k+1}_{t}\,dW_{t}.

(60)

Then it holds for all $k\in{\mathbbm{N}}$ that

\displaystyle\begin{split}&{\mathbb{E}}\!\left[\sup_{t\in[0,T]}\left(\left\lVert Y^{k}_{t}-Y_{t}^{\infty}\right\rVert^{2}\right)+\int_{0}^{T}\left\lVert Z^{k}_{t}-Z_{t}^{\infty}\right\rVert^{2}_{\mathsf{F}}\,dt\right]\\ &\leq 35\left(\frac{Te}{k}\right)^{k}\left[\sum_{\ell=0}^{k}\frac{k!L_{\mathfrak{y}}^{\ell}L_{\mathfrak{z}}^{k-\ell}T^{\ell/2}}{\ell!(k-\ell)!\sqrt{\ell!}}\right]^{2}\left({\mathbb{E}}\bigl{[}\lVert\xi\rVert^{2}\bigr{]}+\frac{T}{k}\int_{0}^{T}{\mathbb{E}}\!\left[\bigl{\lVert}f(t,Y_{t}^{\infty},Z_{t}^{\infty})\bigr{\rVert}^{2}\right]\!dt\right)<\infty.\end{split}

(61)

Proof of Proposition 4.1.

First note that (60) proves that for all $s\in[0,T]$ , $k\in{\mathbbm{N}}_{0}$ it holds a.s. that

\displaystyle\begin{split}&Y^{k}_{s}-Y_{s}^{\infty}\\ &=-\mathbbm{1}_{\{0\}}(k)\xi+\int_{s}^{T}\Bigl{[}\mathbbm{1}_{{\mathbbm{N}}}(k)f(s,Y^{\lvert k-1\rvert}_{s},Z^{\lvert k-1\rvert}_{s})-f(s,Y_{s}^{\infty},Z_{s}^{\infty})\Bigr{]}dt-\int_{s}^{T}\bigl{[}Z^{k}_{t}-Z_{t}^{\infty}\bigr{]}dW_{t}.\end{split}

(62)

This, the tower property, Tonelli’s theorem, and Lemma 3.1 (applied for every $k\in{\mathbbm{N}}_{0}$ with $Y\leftarrow Y^{k}-Y^{\infty}$ , $A\leftarrow\mathbbm{1}_{{\mathbbm{N}}}(k)f(\cdot,Y^{\lvert k-1\rvert},Z^{\lvert k-1\rvert})-f(\cdot,Y^{\infty},Z^{\infty})$ , $Z\leftarrow Z^{k}-Z^{\infty}$ in the notation of Lemma 3.1) prove

(i)

that for all $k\in{\mathbbm{N}}_{0}$ , $\lambda\in(0,\infty)$ it holds that

\begin{split}&{\mathbb{E}}\!\left[\left[\sup_{t\in[0,T]}\left(e^{\lambda t}\lVert Y^{k}_{t}-Y_{t}^{\infty}\rVert^{2}\right)\right]+\int_{0}^{T}e^{\lambda t}\left\lVert Z^{k}_{t}-Z_{t}^{\infty}\right\rVert^{2}_{\mathsf{F}}\,dt\right]\\ &\leq\frac{35}{\lambda}{\mathbb{E}}\!\left[\lambda e^{\lambda T}\lVert\xi\rVert^{2}\mathbbm{1}_{\{0\}}(k)+\int_{0}^{T}e^{\lambda t}\left\lVert\mathbbm{1}_{{\mathbbm{N}}}(k)f(t,Y^{\lvert k-1\rvert}_{t},Z^{\lvert k-1\rvert}_{t})-f(t,Y_{t}^{\infty},Z_{t}^{\infty})\right\rVert^{2}dt\right],\end{split}

(63)

and

(ii)

that for all $k\in{\mathbbm{N}}_{0}$ , $\alpha,\lambda\in(0,\infty)$ it holds that

\displaystyle\begin{split}&\int_{0}^{T}\frac{t^{\alpha-1}e^{\lambda t}{\mathbb{E}}\!\left[\left\lVert Y^{k}_{t}-Y_{t}^{\infty}\right\rVert^{2}\right]}{\Gamma(\alpha)}+\frac{t^{\alpha}e^{\lambda t}{\mathbb{E}}\!\left[\left\lVert Z^{k}_{t}-Z_{t}^{\infty}\right\rVert_{\mathsf{F}}^{2}\right]}{\Gamma(\alpha+1)}dt\\ &\leq\frac{e^{\lambda T}T^{\alpha}{\mathbb{E}}\!\left[\lVert\xi\rVert^{2}\right]\mathbbm{1}_{\{0\}}(k)}{\Gamma(\alpha+1)}+\frac{1}{\lambda}\int_{0}^{T}\frac{e^{\lambda t}t^{\alpha}{\mathbb{E}}\!\left[\left\lVert\mathbbm{1}_{{\mathbbm{N}}}(k)f(t,Y^{\lvert k-1\rvert}_{t},Z^{\lvert k-1\rvert}_{t})-f(t,Y_{t}^{\infty},Z_{t}^{\infty})\right\rVert^{2}\right]}{\Gamma(\alpha+1)}\,dt.\end{split}

(64)

This, (59), and the fact that $\forall\,\alpha\in{\mathbbm{N}}_{0}\colon\Gamma(\alpha+1)=\alpha!$ show for all $\alpha,k\in{\mathbbm{N}}_{0}$ , $\lambda\in(0,\infty)$ that

	$\displaystyle\left(\int_{0}^{T}\frac{t^{\alpha}e^{\lambda t}}{\alpha!}{\mathbb{E}}\!\left[\bigl{\lVert}f(t,Y^{k}_{t},Z^{k}_{t})-f(t,Y_{t}^{\infty},Z^{\infty}_{t})\bigr{\rVert}^{2}\right]\!dt\right)^{\!\nicefrac{{1}}{{2}}}$
	$\displaystyle\leq L_{\mathfrak{y}}\left(\int_{0}^{T}\frac{t^{\alpha}e^{\lambda t}}{\alpha!}{\mathbb{E}}\!\left[\bigl{\lVert}Y^{k}_{t}-Y_{t}^{\infty}\bigr{\rVert}^{2}\right]\!dt\right)^{\!\nicefrac{{1}}{{2}}}+L_{\mathfrak{z}}\left(\int_{0}^{T}\frac{t^{\alpha}e^{\lambda t}}{\alpha!}{\mathbb{E}}\!\left[\bigl{\lVert}Z^{k}_{t}-Z_{t}^{\infty}\bigr{\rVert}^{2}_{\mathsf{F}}\right]\!dt\right)^{\!\nicefrac{{1}}{{2}}}$
	$\displaystyle\leq\sum_{\nu=0}^{1}\Biggl{[}\frac{L_{\mathfrak{y}}^{\nu}L_{\mathfrak{z}}^{1-\nu}}{\sqrt{\lambda}}\Biggl{(}\frac{T^{\alpha+\nu}}{(\alpha+\nu)!}\lambda e^{\lambda T}{\mathbb{E}}\bigl{[}\lVert\xi\rVert^{2}\bigr{]}\mathbbm{1}_{\{0\}}(k)$
	$\displaystyle\qquad\qquad+\int_{0}^{T}\frac{t^{\alpha+\nu}e^{\lambda t}}{(\alpha+\nu)!}{\mathbb{E}}\!\left[\bigl{\lVert}\mathbbm{1}_{{\mathbbm{N}}}(k)f(t,Y^{\lvert k-1\rvert}_{t},Z^{\lvert k-1\rvert}_{t})-f(t,Y_{t}^{\infty},Z_{t}^{\infty})\bigr{\rVert}^{2}\right]\!dt\Biggr{)}^{\!\nicefrac{{1}}{{2}}}\Biggr{]}.$		(65)

This and induction prove for all $k\in{\mathbbm{N}}\cap[2,\infty)$ , $\lambda\in(0,\infty)$ that

	$\displaystyle\left(\int_{0}^{T}\frac{t^{0}e^{\lambda t}}{0!}{\mathbb{E}}\!\left[\bigl{\lVert}f(t,Y^{k-1}_{t},Z^{k-1}_{t})-f(t,Y_{t}^{\infty},Z_{t}^{\infty})\bigr{\rVert}^{2}\right]\!dt\right)^{\!\nicefrac{{1}}{{2}}}$
	$\displaystyle\leq\sum_{\nu_{1},\nu_{2},\ldots,\nu_{k-1}=0}^{1}\left[\frac{L_{\mathfrak{y}}^{\sum_{i=1}^{k-1}\nu_{i}}L_{\mathfrak{z}}^{k-1-\sum_{i=1}^{k-1}\nu_{i}}}{\lambda^{(k-1)/2}}\left(\int_{0}^{T}\frac{t^{\sum_{i=1}^{k-1}\nu_{i}}e^{\lambda t}}{(\sum_{i=1}^{k-1}\nu_{i})!}{\mathbb{E}}\!\left[\bigl{\lVert}f(t,Y_{t}^{0},Z_{t}^{0})-f(t,Y_{t}^{\infty},Z_{t}^{\infty})\bigr{\rVert}^{2}\right]\!dt\right)^{\!\nicefrac{{1}}{{2}}}\right]$
	$\displaystyle\leq\sum_{\nu_{1},\nu_{2},\ldots,\nu_{k-1}=0}^{1}\Biggl{[}\frac{L_{\mathfrak{y}}^{\sum_{i=1}^{k-1}\nu_{i}}L_{\mathfrak{z}}^{k-1-\sum_{i=1}^{k-1}\nu_{i}}}{\lambda^{(k-1)/2}}\sum_{\nu_{k}=0}^{1}\frac{L_{\mathfrak{y}}^{\nu_{k}}L_{\mathfrak{z}}^{1-\nu_{k}}}{\sqrt{\lambda}}$
	$\displaystyle\qquad\qquad\qquad\qquad\cdot\left(\frac{T^{\sum_{i=1}^{k}\nu_{i}}}{(\sum_{i=1}^{k}\nu_{i})!}\lambda e^{\lambda T}{\mathbb{E}}\bigl{[}\lVert\xi\rVert^{2}\bigr{]}+\int_{0}^{T}\frac{t^{\sum_{i=1}^{k}\nu_{i}}e^{\lambda t}}{(\sum_{i=1}^{k}\nu_{i})!}{\mathbb{E}}\!\left[\bigl{\lVert}f(t,Y_{t}^{\infty},Z_{t}^{\infty})\bigr{\rVert}^{2}\right]\!dt\right)^{\!\nicefrac{{1}}{{2}}}\Biggr{]}$
	$\displaystyle=\sum_{\nu_{1},\nu_{2},\ldots,\nu_{k}=0}^{1}\frac{L_{\mathfrak{y}}^{\sum_{i=1}^{k}\nu_{i}}L_{\mathfrak{z}}^{k-\sum_{i=1}^{k}\nu_{i}}}{\lambda^{k/2}}$
	$\displaystyle\qquad\qquad\qquad\cdot\left(\frac{T^{\sum_{i=1}^{k}\nu_{i}}}{(\sum_{i=1}^{k}\nu_{i})!}\lambda e^{\lambda T}{\mathbb{E}}\bigl{[}\lVert\xi\rVert^{2}\bigr{]}+\int_{0}^{T}\frac{t^{\sum_{i=1}^{k}\nu_{i}}e^{\lambda t}}{(\sum_{i=1}^{k}\nu_{i})!}{\mathbb{E}}\!\left[\bigl{\lVert}f(t,Y_{t}^{\infty},Z_{t}^{\infty})\bigr{\rVert}^{2}\right]\!dt\right)^{\!\nicefrac{{1}}{{2}}}$		(66)

This and (65) show for all $k\in{\mathbbm{N}}$ , $\lambda\in(0,\infty)$ that

\displaystyle\begin{split}&\left(\int_{0}^{T}e^{\lambda t}{\mathbb{E}}\!\left[\bigl{\lVert}f(t,Y^{k-1}_{t},Z^{k-1}_{t})-f(t,Y_{t}^{\infty},Z_{t}^{\infty})\bigr{\rVert}^{2}\right]\!dt\right)^{\!\nicefrac{{1}}{{2}}}\\ &\leq\sum_{\ell=0}^{k}\left[\frac{k!}{\ell!(k-\ell)!}\frac{L_{\mathfrak{y}}^{\ell}L_{\mathfrak{z}}^{k-\ell}}{\lambda^{k/2}}\left(\frac{T^{\ell}}{\ell!}\lambda e^{\lambda T}{\mathbb{E}}\bigl{[}\lVert\xi\rVert^{2}\bigr{]}+\int_{0}^{T}\frac{t^{\ell}e^{\lambda t}}{\ell!}{\mathbb{E}}\!\left[\bigl{\lVert}f(t,Y_{t}^{\infty},Z_{t}^{\infty})\bigr{\rVert}^{2}\right]\!dt\right)^{\!\nicefrac{{1}}{{2}}}\right]\\ &\leq\left[\sum_{\ell=0}^{k}\frac{k!}{\ell!(k-\ell)!}\frac{L_{\mathfrak{y}}^{\ell}L_{\mathfrak{z}}^{k-\ell}}{\lambda^{k/2}}\frac{T^{\ell/2}e^{\lambda T/2}}{\sqrt{\ell!}}\right]\left(\lambda{\mathbb{E}}\bigl{[}\lVert\xi\rVert^{2}\bigr{]}+\int_{0}^{T}{\mathbb{E}}\!\left[\bigl{\lVert}f(t,Y_{t}^{\infty},Z_{t}^{\infty})\bigr{\rVert}^{2}\right]\!dt\right)^{\!\nicefrac{{1}}{{2}}}.\end{split}

(67)

This and (63) prove for all $k\in{\mathbbm{N}}$ , $\lambda\in(0,\infty)$ that

\displaystyle\begin{split}&{\mathbb{E}}\!\left[\sup_{t\in[0,T]}\left(e^{\lambda t}\left\lVert Y^{k}_{t}-Y_{t}^{\infty}\right\rVert^{2}\right)+\int_{0}^{T}e^{\lambda t}\left\lVert Z^{k}_{t}-Z_{t}^{\infty}\right\rVert^{2}_{\mathsf{F}}\,dt\right]\\ &\leq\frac{35}{\lambda}{\mathbb{E}}\!\left[\int_{0}^{T}e^{\lambda t}\left\lVert f(t,Y^{k-1}_{t},Z^{k-1}_{t})-f(t,Y_{t}^{\infty},Z_{t}^{\infty})\right\rVert^{2}\,dt\right]\\ &\leq\frac{35}{\lambda}\left[\sum_{\ell=0}^{k}\frac{k!}{\ell!(k-\ell)!}\frac{L_{\mathfrak{y}}^{\ell}L_{\mathfrak{z}}^{k-\ell}}{\lambda^{k/2}}\frac{T^{\ell/2}e^{\lambda T/2}}{\sqrt{\ell!}}\right]^{2}\left(\lambda{\mathbb{E}}\bigl{[}\lVert\xi\rVert^{2}\bigr{]}+\int_{0}^{T}{\mathbb{E}}\!\left[\bigl{\lVert}f(t,Y_{t}^{\infty},Z_{t}^{\infty})\bigr{\rVert}^{2}\right]\!dt\right).\end{split}

(68)

Furthermore, observe for all $k\in{\mathbbm{N}}$ that

\displaystyle\left[\sum_{\ell=0}^{k}\frac{k!}{\ell!(k-\ell)!}\frac{L_{\mathfrak{y}}^{\ell}L_{\mathfrak{z}}^{k-\ell}}{\lambda^{k/2}}\frac{T^{\ell/2}e^{\lambda T/2}}{\sqrt{\ell!}}\right]^{2}\Biggr{|}_{\lambda=\frac{k}{T}}=\left(\frac{Te}{k}\right)^{k}\left[\sum_{\ell=0}^{k}\frac{k!L_{\mathfrak{y}}^{\ell}L_{\mathfrak{z}}^{k-\ell}T^{\ell/2}}{\ell!(k-\ell)!\sqrt{\ell!}}\right]^{2}.

(69)

This and (68) yield for all $k\in{\mathbbm{N}}$ that

\displaystyle\begin{split}&{\mathbb{E}}\!\left[\sup_{t\in[0,T]}\left(\left\lVert Y^{k}_{t}-Y_{t}^{\infty}\right\rVert^{2}\right)+\int_{0}^{T}\left\lVert Z^{k}_{t}-Z_{t}^{\infty}\right\rVert^{2}_{\mathsf{F}}\,dt\right]\\ &\leq\frac{35T}{k}\left(\frac{Te}{k}\right)^{k}\left[\sum_{\ell=0}^{k}\frac{k!L_{\mathfrak{y}}^{\ell}L_{\mathfrak{z}}^{k-\ell}T^{\ell/2}}{\ell!(k-\ell)!\sqrt{\ell!}}\frac{}{}\frac{}{}\right]^{2}\left(\frac{k}{T}{\mathbb{E}}\bigl{[}\lVert\xi\rVert^{2}\bigr{]}+\int_{0}^{T}{\mathbb{E}}\!\left[\bigl{\lVert}f(t,Y_{t}^{\infty},Z_{t}^{\infty})\bigr{\rVert}^{2}\right]\!dt\right).\end{split}

(70)

Next note that (59) ensures that

	$\displaystyle\left(\int_{0}^{T}{\mathbb{E}}\bigl{[}\lVert f(t,Y_{t}^{\infty},Z_{t}^{\infty})\rVert^{2}\bigr{]}dt\right)^{\nicefrac{{1}}{{2}}}$		(71)
	$\displaystyle\leq\left(\int_{0}^{T}{\mathbb{E}}\bigl{[}\lVert f(t,0,0)\rVert^{2}\bigr{]}dt\right)^{\nicefrac{{1}}{{2}}}+L_{\mathfrak{y}}\left(\int_{0}^{T}{\mathbb{E}}\bigl{[}\lVert Y_{t}^{\infty}\rVert^{2}\bigr{]}dt\right)^{\nicefrac{{1}}{{2}}}+L_{\mathfrak{z}}\left(\int_{0}^{T}{\mathbb{E}}\bigl{[}\lVert Z_{t}^{\infty}\rVert^{2}_{\mathsf{F}}\bigr{]}dt\right)^{\nicefrac{{1}}{{2}}}<\infty.$

This, the fact that ${\mathbb{E}}[\lVert\xi\rVert^{2}]<\infty$ , and (70) complete the proof of Proposition 4.1. ∎

Remark 4.2.

Assume the setting of Proposition 4.1. Then it holds for all $k\in{\mathbbm{N}}$ that

\displaystyle\begin{split}&35\left(\frac{Te}{k}\right)^{k}\left[\sum_{\ell=0}^{k}\frac{k!L_{\mathfrak{y}}^{\ell}L_{\mathfrak{z}}^{k-\ell}T^{\ell/2}}{\ell!(k-\ell)!\sqrt{\ell!}}\right]^{2}\leq 35\left(\frac{\max\{T^{2},1\}e\max\{L_{\mathfrak{y}}^{2},L_{\mathfrak{z}}^{2}\}}{k}\right)^{k}\left[\sum_{\ell=0}^{k}\frac{k!}{(k-\ell)!\ell!}\right]^{2}\\ &=35\left(\frac{4\max\{T^{2},1\}e\max\{L_{\mathfrak{y}}^{2},L_{\mathfrak{z}}^{2}\}}{k}\right)^{k}\leq 35\frac{\left(4\max\{T^{2},1\}e\max\{L_{\mathfrak{y}}^{2},L_{\mathfrak{z}}^{2}\}\right)^{k}}{k!}.\end{split}

(72)

Remark 4.3.

Assume the setting of Proposition 4.1 and assume that $L_{\mathfrak{z}}=0$ . Then it holds for all $k\in{\mathbbm{N}}$ that

\displaystyle\begin{split}&35\left(\frac{Te}{k}\right)^{k}\left[\sum_{\ell=0}^{k}\frac{k!L_{\mathfrak{y}}^{\ell}L_{\mathfrak{z}}^{k-\ell}T^{\ell/2}}{\ell!(k-\ell)!\sqrt{\ell!}}\right]^{2}=35\left(\frac{Te}{k}\right)^{k}\left[\frac{L_{\mathfrak{y}}^{k}T^{k/2}}{\sqrt{k!}}\right]^{2}\leq 35\frac{(T^{2}eL_{\mathfrak{y}}^{2})^{k}}{(k!)^{2}}.\end{split}

(73)

Acknowledgements

This work has been funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) through the research grant HU1889/7-1.

References

[1] Beck, C., Hornung, F., Hutzenthaler, M., Jentzen, A., and Kruse, T. Overcoming the curse of dimensionality in the numerical approximation of Allen–Cahn partial differential equations via truncated full-history recursive multilevel Picard approximations. Journal of Numerical Mathematics 28, 4 (2020), 197–222.
[2] Bender, C., and Denk, R. A forward scheme for backward SDEs. Stochastic Process. Appl. 117, 12 (2007), 1793–1812.
[3] Bender, C., and Zhang, J. Time discretization and Markovian iteration for coupled FBSDEs. The Annals of Applied Probability 18, 1 (2008), 143–177.
[4] Briand, P., and Labart, C. Simulation of BSDEs by Wiener chaos expansion. Ann. Appl. Probab. 24, 3 (2014), 1129–1171.
[5] Da Prato, G., and Zabczyk, J. Stochastic equations in infinite dimensions, vol. 44 of Encyclopedia of Mathematics and its Applications. Cambridge University Press, Cambridge, 1992.
[6] E, W., Hutzenthaler, M., Jentzen, A., and Kruse, T. Multilevel Picard iterations for solving smooth semilinear parabolic heat equations. arXiv:1607.03295 (2016). Accepted in Partial Differential Equations and Applications.
[7] E, W., Hutzenthaler, M., Jentzen, A., and Kruse, T. On multilevel Picard numerical approximations for high-dimensional nonlinear parabolic partial differential equations and high-dimensional nonlinear backward stochastic differential equations. Journal of Scientific Computing 79, 3 (2019), 1534–1571.
[8] El Karoui, N., Peng, S., and Quenez, M. C. Backward stochastic differential equations in finance. Math. Finance 7, 1 (1997), 1–71.
[9] Geiss, C., and Labart, C. Simulation of BSDEs with jumps by Wiener chaos expansion. Stochastic Process. Appl. 126, 7 (2016), 2123–2162.
[10] Gobet, E., and Labart, C. Solving BSDE with adaptive control variate. SIAM J. Numer. Anal. 48, 1 (2010), 257–277.
[11] Hutzenthaler, M., Jentzen, A., and Kruse, T. Overcoming the curse of dimensionality in the numerical approximation of parabolic partial differential equations with gradient-dependent nonlinearities. arXiv:1912.02571 (2019). Accepted in Foundations of Computational Mathematics.
[12] Hutzenthaler, M., Jentzen, A., Kruse, T., and Nguyen, T. A. Multilevel Picard approximations for high-dimensional semilinear second-order PDEs with Lipschitz nonlinearities. arXiv:2009.02484 (2020).
[13] Hutzenthaler, M., Jentzen, A., Kruse, T., Nguyen, T. A., and von Wurstemberger, P. Overcoming the curse of dimensionality in the numerical approximation of semilinear parabolic partial differential equations. Proceedings of the Royal Society A 476, 2244 (2020), 20190630.
[14] Hutzenthaler, M., and Kruse, T. Multilevel Picard approximations of high-dimensional semilinear parabolic differential equations with gradient-dependent nonlinearities. SIAM Journal on Numerical Analysis 58, 2 (2020), 929–961.
[15] Labart, C., and Lelong, J. A parallel algorithm for solving BSDEs. Monte Carlo Methods Appl. 19, 1 (2013), 11–39.
[16] Øksendal, B. Stochastic differential equations. Universitext. Springer-Verlag, Berlin, 1985. An introduction with applications.
[17] Pardoux, É., and Peng, S. Adapted solution of a backward stochastic differential equation. Systems Control Lett. 14, 1 (1990), 55–61.
[18] Pardoux, E., and Răşcanu, A. Stochastic differential equations, Backward SDEs, Partial differential equations. Springer, 2016.
[19] Pham, H. Continuous-time stochastic control and optimization with financial applications, vol. 61. Springer Science & Business Media, 2009.
[20] Ren, Y.-F. On the Burkholder–Davis–Gundy inequalities for continuous martingales. Statistics & probability letters 78, 17 (2008), 3034–3039.
[21] Zhang, J. Backward Stochastic Differential Equations. From Linear to Fully Nonlinear Theory. Springer, 2017.

On the speed of convergence of Picard iterations of backward stochastic differential equations

Abstract

1 Introduction

Theorem 1.1.

2 Lower bounds for the convergence speed of Picard iterations

2.1 Lower bounds for the convergence speed of Picard iterations for an example PDE

Lemma 2.1.

Proof of Lemma 2.1.

2.2 Lower bounds for the convergence speed of Picard iterations for an example BSDE

Corollary 2.2.

Proof of Corollary 2.2.

Lemma 2.3.

Proof of Lemma 2.3.

3 A priori estimates for backward Itô processes

Lemma 3.1.

Proof of Lemma 3.1.

4 Upper bounds for the convergence speed of Picard iterations

Proposition 4.1.

Proof of Proposition 4.1.

Remark 4.2.

Remark 4.3.

Acknowledgements

References

On the speed of convergence of Picard iterations
of backward stochastic differential equations