
Stochastic Maximum Principle for a Generalized Volterra Control System

Yuhang Li and Yuecai Han

School of Mathematics, Jilin University, Changchun 130012, China

Correspondence: Yuecai Han, School of Mathematics, Jilin University, Changchun 130012, China. Email: hanyc@jlu.edu.cn
Abstract

In this paper, we consider the stochastic optimal control problem for a generalized Volterra control system. The corresponding state process is governed by a kind of generalized stochastic Volterra integral differential equation. We prove the existence and uniqueness of the solution of this type of equation. We obtain the stochastic maximum principle for the optimal control system by introducing a kind of generalized anticipated backward stochastic differential equation. We prove the existence and uniqueness of the solution of this adjoint equation, which may be singular at some points. As an application, the linear quadratic control problem is investigated to illustrate the main results.

keywords:
Generalized Volterra control system; Volterra integral differential equations; Maximum principle; Linear quadratic optimal control.

1 Introduction

To better describe the real world, stochastic integral differential equations have been studied in many areas, such as biological science, applied mathematics, physics, and other disciplines 1, 2, 3, 4. Mao and Riedle 5 study the stability of some types of stochastic Volterra integral differential equations. Nesterenko 6 substantiates the application of a modified projection-iterative method to the solution of boundary value problems for weakly nonlinear integro-differential equations with parameters. Dzhumabaev 7 establishes the necessary and sufficient conditions for the well-posedness of linear boundary value problems for Fredholm integro-differential equations. Zhang et al. 8 investigate the numerical analysis of the following generalized stochastic Volterra integral differential equations

dY(t)=f\left(Y(t),\int_{0}^{t}k_{1}(t,s)Y(s)ds,\int_{0}^{t}\sigma_{1}(t,s)Y(s)dw(s)\right)dt+g\left(Y(t),\int_{0}^{t}k_{2}(t,s)Y(s)ds,\int_{0}^{t}\sigma_{2}(t,s)Y(s)dw(s)\right)dw(t).

They prove the existence and uniqueness of the solution when \|k_{i}\| and \|\sigma_{i}\| are bounded.

Control problems for integral differential equations have also been studied. Kim 9 discusses a reachability problem for a second-order integro-differential equation based upon a new kind of unique continuation property. Mashayekhi et al. 10 give a new numerical method for solving the optimal control of a class of systems described by integro-differential equations with a quadratic performance index. Assanova et al. 11 present the existence of optimal controls of systems governed by impulsive integro-differential equations of mixed type. Wang 12 investigates maximum principles and linear quadratic optimal control problems for forward stochastic Volterra integro-differential equations.

In this paper we focus on the following Volterra control system

dX_{t}=b\Big(t,X_{t},\int_{0}^{t}k(t,s)X_{s}ds,u_{t},\int_{0}^{t}l(t,s)u_{s}ds\Big)dt+\sigma\Big(t,X_{t},\int_{0}^{t}k(t,s)X_{s}ds,u_{t},\int_{0}^{t}l(t,s)u_{s}ds\Big)dW_{t},\quad 0\leq t\leq T,\qquad X_{0}=x, \qquad (3)

to minimize the cost function

J(u)=E\left[\int_{0}^{T}f\left(t,X_{t},\int_{0}^{t}k(t,s)X_{s}ds,u_{t},\int_{0}^{t}l(t,s)u_{s}ds\right)dt+g(X_{T})\right].

We call the corresponding stochastic differential equation (SDE for short) a generalized stochastic Volterra integral differential equation (SVIDE for short), which is a specific type of integral differential equation. This type of equation is influenced by the past information of both the state process and the control process from the initial time up to the present.

It should be pointed out that most SVIDEs cannot be written as stochastic Volterra integral equations of the following form

X(t)=\varphi(t)+\int_{0}^{t}b(t,s,X(t),X(s))ds+\int_{0}^{t}\sigma(t,s,X(t),X(s))dW(s),\quad t\in[0,T].

For example, consider the following SDE,

dX_{t}=\left(X_{t}+\sin\left(\int_{0}^{t}X_{s}ds\right)\right)dt+\sigma(t)dW_{t},

where the drift term is nonlinear in the integral of the state process.
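Heuristically (this is only a sketch of the reason, not a formal proof), fix t and suppose this drift could be written in the Volterra form above. For fixed X(t), the right-hand side splits additively over subintervals of [0,t],

\int_{0}^{t}b(t,s,X(t),X(s))ds=\int_{0}^{t/2}b(t,s,X(t),X(s))ds+\int_{t/2}^{t}b(t,s,X(t),X(s))ds,

whereas \sin\big(\int_{0}^{t}X_{s}ds\big)=\sin(a+b), with a=\int_{0}^{t/2}X_{s}ds and b=\int_{t/2}^{t}X_{s}ds, cannot be decomposed as g_{1}(a)+g_{2}(b), since \partial^{2}\sin(a+b)/\partial a\partial b=-\sin(a+b)\not\equiv 0. So no kernel b of the above form reproduces this drift.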

We study the uniqueness of the solution of the SVIDE. Different from the classical case, in order to deal with the term containing the past information, we apply the Gronwall inequality to \sup_{0\leq r\leq t}E|\tilde{X}_{r}-X_{r}|^{2}, which is an upper bound for CE\big[\big|\frac{1}{t}\int_{0}^{t}\tilde{X}_{s}ds-\frac{1}{t}\int_{0}^{t}X_{s}ds\big|\big]^{2}. Then a new stochastic maximum principle for the control system (3) is established. We define the Hamiltonian function and the adjoint equation to obtain the optimal system. To study the properties of the adjoint equation, we prove the existence and uniqueness of the solution of the following equation

-dy_{t}=h\left(t,y_{t},z_{t},E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)a_{1}(s)y_{s}ds\big],E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)a_{2}(s)z_{s}ds\big]\right)dt-z_{t}dW_{t},\qquad y_{T}=\xi,

which is singular at the point 0. Compared with the classical type investigated by El Karoui and Peng 13, we construct a contraction mapping under a new \beta-norm

\|(Y,Z)\|_{\beta}=\sup_{0\leq s\leq T}Ee^{\beta s}|Y_{s}|^{2}+E\int_{0}^{T}e^{\beta s}Z_{s}^{2}ds.

Furthermore, we obtain the necessary condition that the optimal control process should satisfy. Considering the linear quadratic case, which can be applied to a Volterra linear quadratic state regulator, we obtain the unique optimal control process for the linear quadratic Volterra control system.

The rest of this paper is organized as follows. In Section 2, we introduce a type of generalized SVIDE and prove the existence and uniqueness of the solution of this type of equation. In Section 3, we prove the stochastic maximum principle by introducing a kind of anticipated backward stochastic differential equation, and the existence and uniqueness of the solution of this kind of equation is proved. In Section 4, the linear quadratic case is investigated to illustrate the main results.

2 A generalized stochastic Volterra integral differential equation

Let (\Omega,\mathcal{F},\mathbb{P}) be a probability space, let \mathcal{F}_{0}\subset\mathcal{F} be a sub-\sigma-algebra, and let \mathbb{F}=(\mathcal{F}_{t})_{0\leq t\leq T} be the filtration generated by \mathcal{F}_{0} and an m-dimensional standard Brownian motion W=(W_{t})_{0\leq t\leq T}. We consider the following stochastic differential equation.

dX_{t}=b(t,X_{t},Y_{t})dt+\sigma(t,X_{t},Y_{t})dW_{t},\qquad 0\leq t\leq T,\qquad X_{0}=x, \qquad (6)

where E|X_{0}|^{2}<\infty, and b and \sigma are measurable functions on [0,T]\times\mathbf{R}^{d}\times\mathbf{R}^{d} with values in \mathbf{R}^{d} and \mathbf{R}^{d\times m}, respectively. Here

Y_{t}=\left\{\begin{array}{ll}\int_{0}^{t}k(t,s)X_{s}ds,& t>0,\\ X_{0},& t=0,\end{array}\right. \qquad (9)

where k(t,s) satisfies \sup_{0\leq t\leq T}\int_{0}^{t}|k(t,s)|ds\leq M for some constant M>0. It is obvious that Y_{t} is continuous. This class of equations describes the effect of past states on the current state. Assume that

|b(t,x,y)|^{2}\lor|\sigma(t,x,y)|^{2}\leq L(1+|x|^{2}+|y|^{2}),\qquad x,y\in\mathbf{R}^{n},\ t\in[0,T], \qquad (10)

and

|b(t,x_{1},y_{1})-b(t,x_{2},y_{2})|^{2}\lor|\sigma(t,x_{1},y_{1})-\sigma(t,x_{2},y_{2})|^{2}\leq L(|x_{1}-x_{2}|^{2}+|y_{1}-y_{2}|^{2}),\qquad x_{1},x_{2},y_{1},y_{2}\in\mathbf{R}^{n},\ t\in[0,T], \qquad (11)

for some constant L>0 (where |\sigma|^{2}=\sum|\sigma_{ij}|^{2}).
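For example (this illustration is ours), the moving average kernel k(t,s)=\frac{1}{t}, which reappears in Section 3, satisfies the above bound on k with M=1, since

\int_{0}^{t}|k(t,s)|ds=\int_{0}^{t}\frac{1}{t}ds=1,\qquad Y_{t}=\frac{1}{t}\int_{0}^{t}X_{s}ds\quad(t>0),

so that Y_{t} is exactly the moving average of the state over [0,t]; conditions (10) and (11) then only concern b and \sigma.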

Now we show the existence and uniqueness of the solution of equation (6).  

Lemma 2.1  If conditions (10) and (11) hold, there exists a unique solution to equation (6).

Proof: Uniqueness. Let X_{t} and \tilde{X}_{t} be two solutions of equation (6), let Y_{t} and \tilde{Y}_{t} be the corresponding moving average processes, and let X_{0}=Y_{0}=\tilde{X}_{0}=\tilde{Y}_{0}=x. Thus, we have

E|\tilde{X}_{t}-X_{t}|^{2} = E\Big[\int_{0}^{t}\big(b(s,\tilde{X}_{s},\tilde{Y}_{s})-b(s,X_{s},Y_{s})\big)ds+\int_{0}^{t}\big(\sigma(s,\tilde{X}_{s},\tilde{Y}_{s})-\sigma(s,X_{s},Y_{s})\big)dW_{s}\Big]^{2}
\leq 2(T+1)LE\int_{0}^{t}\big(|\tilde{X}_{s}-X_{s}|^{2}+|\tilde{Y}_{s}-Y_{s}|^{2}\big)ds
\leq (2M^{2}+1)(T+1)L\int_{0}^{t}\sup_{0\leq r\leq s}E|\tilde{X}_{r}-X_{r}|^{2}ds.

The last inequality holds because

E|\tilde{Y}_{s}-Y_{s}|^{2}=E\left|\int_{0}^{s}k(s,r)(\tilde{X}_{r}-X_{r})dr\right|^{2}\leq E\left\{\left[\int_{0}^{s}|k(s,r)|dr\right]\left[\int_{0}^{s}|k(s,r)||\tilde{X}_{r}-X_{r}|^{2}dr\right]\right\}\leq M^{2}\sup_{0\leq r\leq s}E|\tilde{X}_{r}-X_{r}|^{2}.

For every \varepsilon>0, there exists \xi_{t}\in[0,t] such that

E|\tilde{X}_{\xi_{t}}-X_{\xi_{t}}|^{2}\geq\sup_{0\leq r\leq t}E|\tilde{X}_{r}-X_{r}|^{2}-\varepsilon,

so that

\sup_{0\leq r\leq t}E|\tilde{X}_{r}-X_{r}|^{2}\leq E|\tilde{X}_{\xi_{t}}-X_{\xi_{t}}|^{2}+\varepsilon
\leq (2M^{2}+1)(T+1)L\int_{0}^{\xi_{t}}\sup_{0\leq r\leq s}E|\tilde{X}_{r}-X_{r}|^{2}ds+\varepsilon
\leq (2M^{2}+1)(T+1)L\int_{0}^{t}\sup_{0\leq r\leq s}E|\tilde{X}_{r}-X_{r}|^{2}ds+\varepsilon.

Applying Gronwall's inequality to g(t)=\sup_{0\leq r\leq t}E|\tilde{X}_{r}-X_{r}|^{2} gives g(t)\leq\varepsilon e^{(2M^{2}+1)(T+1)Lt}, and by the arbitrariness of \varepsilon we get \sup_{0\leq t\leq T}E|\tilde{X}_{t}-X_{t}|^{2}=0. Thus, the solution X_{t} is unique.

Existence. Let

X_{t}^{(k+1)}=x+\int_{0}^{t}b(s,X_{s}^{(k)},Y_{s}^{(k)})ds+\int_{0}^{t}\sigma(s,X_{s}^{(k)},Y_{s}^{(k)})dW_{s},\qquad Y_{t}^{(k)}=\int_{0}^{t}k(t,s)X_{s}^{(k)}ds,\qquad X_{t}^{(0)}=x,

and

u_{t}^{(k)}=\sup_{0\leq r\leq t}E\big|X_{r}^{(k+1)}-X_{r}^{(k)}\big|^{2}.

Similar to the proof in the classical case, we get

u_{t}^{(k)}\leq\frac{A^{k+1}t^{k+1}}{(k+1)!}

for some constant A>0. Let \lambda be the Lebesgue measure on [0,T] and let 0\leq n<m with m,n\to\infty. Then we have

\left\|X_{t}^{(m)}-X_{t}^{(n)}\right\|_{L^{2}(\lambda\times P)}\leq\sum_{k=n}^{m-1}\left(\frac{A^{k+2}T^{k+2}}{(k+2)!}\right)^{\frac{1}{2}}\rightarrow 0.

Therefore, \{X_{t}^{(n)}\}_{n\geq 0} is a Cauchy sequence in L^{2}(\lambda\times P). Define

X_{t}:=\lim_{n\to\infty}X_{t}^{(n)},\qquad Y_{t}:=\lim_{n\to\infty}Y_{t}^{(n)}=\lim_{n\to\infty}\int_{0}^{t}k(t,s)X_{s}^{(n)}ds.

Since each X_{t}^{(n)} and Y_{t}^{(n)} is \mathcal{F}_{t}-measurable, X_{t} and Y_{t} are \mathcal{F}_{t}-measurable for all t. Thus X_{t} is the solution of (6).
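As a numerical illustration (our own sketch, not part of the original argument; the function names and parameter values below are chosen only for the example), the solution constructed above can be approximated by an Euler–Maruyama scheme in which the memory term Y_{t}=\int_{0}^{t}k(t,s)X_{s}ds is replaced by a Riemann sum over the past grid points:

import numpy as np

def euler_maruyama_svide(b, sigma, k, x0, T=1.0, n=1000, seed=0):
    # Euler-Maruyama discretization of
    #   dX_t = b(t, X_t, Y_t) dt + sigma(t, X_t, Y_t) dW_t,  X_0 = x0,
    # with Y_t = int_0^t k(t, s) X_s ds approximated by a Riemann sum
    # and Y_0 = X_0 as in (9).  Illustrative only; no convergence claims.
    rng = np.random.default_rng(seed)
    dt = T / n
    t = np.linspace(0.0, T, n + 1)
    X = np.empty(n + 1)
    X[0] = x0
    dW = rng.normal(0.0, np.sqrt(dt), size=n)
    for i in range(n):
        if i == 0:
            Y = x0
        else:
            # Riemann sum for Y_{t_i} = int_0^{t_i} k(t_i, s) X_s ds
            Y = np.sum(k(t[i], t[:i]) * X[:i]) * dt
        X[i + 1] = X[i] + b(t[i], X[i], Y) * dt + sigma(t[i], X[i], Y) * dW[i]
    return t, X

if __name__ == "__main__":
    # Example: moving average kernel k(t,s) = 1/t, drift x + sin(y),
    # constant diffusion; these coefficients satisfy (10)-(11).
    grid, path = euler_maruyama_svide(
        b=lambda t, x, y: x + np.sin(y),
        sigma=lambda t, x, y: 0.5,
        k=lambda t, s: np.ones_like(s) / t,
        x0=1.0,
    )
    print(path[-1])

The drift x+\sin(y) and constant diffusion used in this example satisfy (10) and (11), so the equation being discretized falls under Lemma 2.1; no convergence rate is claimed here.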

3 The Maximum Principle

Consider the following control problem. The state equation is

dX_{t}=b\Big(t,X_{t},\int_{0}^{t}k(t,s)X_{s}ds,u_{t},\int_{0}^{t}l(t,s)u_{s}ds\Big)dt+\sigma\Big(t,X_{t},\int_{0}^{t}k(t,s)X_{s}ds,u_{t},\int_{0}^{t}l(t,s)u_{s}ds\Big)dW_{t},\quad 0\leq t\leq T,\qquad X_{0}=x, \qquad (14)

with the cost function

J(u)=E\left[\int_{0}^{T}f\left(t,X_{t},\int_{0}^{t}k(t,s)X_{s}ds,u_{t},\int_{0}^{t}l(t,s)u_{s}ds\right)dt+g(X_{T})\right], \qquad (15)

where b(t,x,y,u,v) and \sigma(t,x,y,u,v) are measurable functions on \mathbf{R}\times\mathbf{R}^{d}\times\mathbf{R}^{d}\times\mathbf{R}^{k}\times\mathbf{R}^{k} with values in \mathbf{R}^{d} and \mathbf{R}^{d\times m}, respectively, and f(t,x,y,u,v) and g(x) are measurable functions on \mathbf{R}\times\mathbf{R}^{d}\times\mathbf{R}^{d}\times\mathbf{R}^{k}\times\mathbf{R}^{k} and \mathbf{R}^{d}, respectively, with values in \mathbf{R}. We denote by \mathbb{U} the set of progressively measurable processes u=(u_{t})_{0\leq t\leq T} taking values in a given closed convex set \textbf{U}\subset\mathbb{R}^{k} and satisfying E\int_{0}^{T}|u_{t}|^{2}dt<\infty.

To simplify the notation, and without loss of generality, we consider only the case d=m=k=1. We assume u_{t}^{*} is the optimal control process, i.e.,

J(u_{t}^{*})=\min_{u_{t}\in\mathbb{U}}J(u_{t}).

For all 0<\varepsilon<1, let

u_{t}^{\varepsilon}=(1-\varepsilon)u_{t}^{*}+\varepsilon\alpha_{t}\triangleq u^{*}_{t}+\varepsilon\beta_{t},

where \alpha_{t} is any other admissible control.

We define the Hamiltonian function H by

H(t,x,y,u,v,p,q)=b(t,x,y,u,v)p+\sigma(t,x,y,u,v)q+f(t,x,y,u,v). \qquad (16)

Denote

\phi^{*}(t)=\phi\left(t,X_{t}^{*},\int_{0}^{t}k(t,s)X_{s}^{*}ds,u_{t}^{*},\int_{0}^{t}l(t,s)u_{s}^{*}ds\right),

for \phi=b,\sigma,f,b_{x},\sigma_{x},f_{x},b_{y},\sigma_{y},f_{y},b_{u},\sigma_{u},f_{u},b_{v},\sigma_{v},f_{v}.

Theorem 3.1  Let (u_{t}^{*})_{0\leq t\leq T} be the optimal control process, let (X_{t}^{*})_{0\leq t\leq T} be the corresponding state process, and let (p_{t},q_{t}) be the process satisfying

-dp_{t}=\Big[b_{x}^{*}(t)p_{t}+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)b_{y}^{*}(s)p_{s}ds\big]+\sigma_{x}^{*}(t)q_{t}+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)\sigma_{y}^{*}(s)q_{s}ds\big]+f_{x}^{*}(t)+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)f_{y}^{*}(s)ds\big]\Big]dt-q_{t}dW_{t},\qquad p_{T}=g_{x}(X_{T}^{*}). \qquad (21)

Then we have

\Big[H^{*}_{u}(t)+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}l(s,t)H^{*}_{v}(s)ds\big]\Big]\cdot(\alpha_{t}-u^{*}_{t})\geq 0,\qquad\forall\alpha_{t}\in\mathbb{U},\quad d\lambda\otimes dP\ a.s. \qquad (22)

for any admissible control process \alpha_{t}, where

H^{*}(t)=H\Big(t,X_{t}^{*},\int_{0}^{t}k(t,s)X_{s}^{*}ds,u_{t}^{*},\int_{0}^{t}l(t,s)u_{s}^{*}ds,p_{t},q_{t}\Big).
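As a quick consistency check (our remark, not part of the theorem), if k\equiv 0 and l\equiv 0, so that the state equation contains no integral terms, then the conditional expectations in (21) and (22) vanish and the statement reduces to the classical stochastic maximum principle for controlled diffusions:

-dp_{t}=\big[b_{x}^{*}(t)p_{t}+\sigma_{x}^{*}(t)q_{t}+f_{x}^{*}(t)\big]dt-q_{t}dW_{t},\qquad p_{T}=g_{x}(X_{T}^{*}),\qquad H_{u}^{*}(t)\cdot(\alpha_{t}-u_{t}^{*})\geq 0.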

Remark 3.2  To investigate the adjoint equation (21), we consider a more general type of anticipated backward stochastic differential equations:

-dy_{t}=h\left(t,y_{t},z_{t},E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)a_{1}(s)y_{s}ds\big],E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)a_{2}(s)z_{s}ds\big]\right)dt-z_{t}dW_{t},\qquad y_{T}=\xi. \qquad (26)

Without loss of generality, we assume \sup_{0\leq t\leq T}\int_{0}^{t}|k(t,s)|ds=\sup_{0\leq t\leq T}\int_{0}^{t}|l(t,s)|ds=1. This type of anticipated backward stochastic differential equation may be singular even if we assume h(t,y,z,\tilde{y},\tilde{z}) is Lipschitz continuous, for example when k(t,s)=\frac{1}{t}. We deal with this in the following lemma.

Anticipated backward stochastic differential equations were first studied by Peng and Yang 14, who study the following type of equation:

-dY_{t}=f\left(t,Y_{t},Z_{t},Y_{t+\delta(t)},Z_{t+\zeta(t)}\right)dt-Z_{t}dW_{t},\qquad t\in[0,T],
Y_{t}=\xi_{t},\qquad t\in[T,T+K],
Z_{t}=\eta_{t},\qquad t\in[T,T+K],

where the existence and uniqueness of solutions, a comparison theorem, and a duality between such equations and stochastic differential delay equations are established. For more properties of generalized anticipated backward stochastic differential equations, we refer to Yang and Elliott 15.

Lemma 3.3  The anticipated backward stochastic differential equation (26) has a unique solution pair if the following conditions hold:

|h(t,y,z,\tilde{y},\tilde{z})| \leq M_{1}(|y|+|z|+|\tilde{y}|+|\tilde{z}|),
|h(t,y_{1},z_{1},\tilde{y}_{1},\tilde{z}_{1})-h(t,y_{2},z_{2},\tilde{y}_{2},\tilde{z}_{2})| \leq M_{1}(|y_{1}-y_{2}|+|z_{1}-z_{2}|+|\tilde{y}_{1}-\tilde{y}_{2}|+|\tilde{z}_{1}-\tilde{z}_{2}|),

and

a_{1}(s)\vee a_{2}(s)\leq M_{2},\qquad\forall s\in[0,T],\quad a.s., \qquad (27)

for some constants M_{1},M_{2}>0 satisfying M_{1}M_{2}<8^{-1}T^{-\frac{1}{2}}.
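To see how this relates to the adjoint equation (our observation, under the additional assumption that the partial derivatives of b, \sigma and f are bounded), note that (21) is of the form (26) with the linear generator

h(t,y,z,\tilde{y},\tilde{z})=b_{x}^{*}(t)y+\sigma_{x}^{*}(t)z+\tilde{y}+\tilde{z}+f_{x}^{*}(t)+E^{\mathcal{F}_{t}}\Big[\int_{t}^{T}k(s,t)f_{y}^{*}(s)ds\Big],\qquad a_{1}(s)=b_{y}^{*}(s),\quad a_{2}(s)=\sigma_{y}^{*}(s).

The free term f_{x}^{*}(t)+E^{\mathcal{F}_{t}}[\int_{t}^{T}k(s,t)f_{y}^{*}(s)ds] does not depend on (y,z,\tilde{y},\tilde{z}) and does not affect the contraction argument below; the Lipschitz condition then holds with M_{1}=\max\{\sup_{t}|b_{x}^{*}(t)|,\sup_{t}|\sigma_{x}^{*}(t)|,1\}, and (27) holds with M_{2}=\sup_{s}\big(|b_{y}^{*}(s)|\vee|\sigma_{y}^{*}(s)|\big).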

Proof: Denote by \mathbb{H}_{T}^{2}\left(\mathbb{R}^{d}\right) the space of all predictable processes \varphi:\Omega\times[0,T]\mapsto\mathbb{R}^{d} such that \|\varphi\|^{2}=\mathbb{E}\int_{0}^{T}\left|\varphi_{t}\right|^{2}dt<+\infty. We define the \beta-norm \|(Y,Z)\|_{\beta}=\sup_{0\leq s\leq T}Ee^{\beta s}|Y_{s}|^{2}+E\int_{0}^{T}e^{\beta s}Z_{s}^{2}ds on \mathbb{H}_{T}^{2}\left(\mathbb{R}^{d}\right)\times\mathbb{H}_{T}^{2}\left(\mathbb{R}^{d\times m}\right). For any \mathcal{F}_{t}-adapted continuous process pairs (y_{t}^{1},z_{t}^{1}) and (y_{t}^{2},z_{t}^{2}) with bounded \beta-norm, let

-dY^{i}_{t}=h\left(t,y^{i}_{t},z^{i}_{t},E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)a_{1}(s)y^{i}_{s}ds\big],E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)a_{2}(s)z^{i}_{s}ds\big]\right)dt-Z^{i}_{t}dW_{t},\qquad Y^{i}_{T}=\xi,

for i=1,2.

We denote

\delta\phi_{t}=\phi^{1}_{t}-\phi^{2}_{t},

for \phi=Y,Z,y,z, and

\delta h_{t}= h\left(t,y^{1}_{t},z^{1}_{t},E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)a_{1}(s)y^{1}_{s}ds\big],E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)a_{2}(s)z^{1}_{s}ds\big]\right)-h\left(t,y^{2}_{t},z^{2}_{t},E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)a_{1}(s)y^{2}_{s}ds\big],E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)a_{2}(s)z^{2}_{s}ds\big]\right).

We can directly get

|\delta h_{t}|\leq M_{1}\left(|\delta y_{t}|+|\delta z_{t}|\right)+M_{1}M_{2}\left(E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)|\delta y_{s}|ds\big]+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)|\delta z_{s}|ds\big]\right).

By Itô's formula,

d\left(e^{\beta t}\delta Y_{t}^{2}\right)= \beta e^{\beta t}\delta Y_{t}^{2}dt+2e^{\beta t}\delta Y_{t}d\delta Y_{t}+e^{\beta t}\left(d\delta Y_{t}\right)^{2}
= e^{\beta t}\left[\beta\delta Y_{t}^{2}-2\delta Y_{t}\delta h_{t}+\delta Z_{t}^{2}\right]dt+e^{\beta t}\delta Y_{t}\delta Z_{t}dW_{t}.

Integrating from t to T and taking the expectation, we have

Ee^{\beta t}\delta Y_{t}^{2}+E\int_{t}^{T}e^{\beta s}\delta Z_{s}^{2}ds = E\int_{t}^{T}e^{\beta s}\left[-\beta\delta Y_{s}^{2}+2\delta Y_{s}\delta h_{s}\right]ds
\leq E\int_{t}^{T}e^{\beta s}\left[-\beta\delta Y_{s}^{2}+(c_{1}^{-1}+c_{2}^{-1})M_{1}|\delta Y_{s}|^{2}+c_{1}M_{1}|\delta y_{s}|^{2}+c_{2}M_{1}|\delta z_{s}|^{2}\right]ds
\quad+2M_{1}M_{2}E\int_{t}^{T}e^{\beta s}|\delta Y_{s}|\left(E^{\mathcal{F}_{s}}\big[\int_{s}^{T}k(r,s)|\delta y_{r}|dr\big]+E^{\mathcal{F}_{s}}\big[\int_{s}^{T}k(r,s)|\delta z_{r}|dr\big]\right)ds. \qquad (28)

Notice that

2E\left[\int_{t}^{T}e^{\beta s}|\delta Y_{s}|E^{\mathcal{F}_{s}}\big[\int_{s}^{T}k(r,s)|\delta y_{r}|dr\big]ds\right]= 2E\int_{t}^{T}\int_{s}^{T}k(r,s)e^{\beta s}|\delta Y_{s}||\delta y_{r}|drds
\leq c_{3}^{-1}E\int_{t}^{T}\int_{s}^{T}k(r,s)e^{\beta s}|\delta Y_{s}|^{2}drds+c_{3}E\int_{t}^{T}\int_{s}^{T}k(r,s)e^{\beta s}|\delta y_{r}|^{2}drds,

and

\int_{t}^{T}\int_{s}^{T}k(r,s)drds=\int_{t}^{T}\int_{t}^{r}k(r,s)dsdr\leq T-t\leq T,

so we have

2E\left[\int_{t}^{T}e^{\beta s}|\delta Y_{s}|E^{\mathcal{F}_{s}}\big[\int_{s}^{T}k(r,s)|\delta y_{r}|dr\big]ds\right]\leq c_{3}^{-1}T\sup_{t\leq s\leq T}Ee^{\beta s}|\delta Y_{s}|^{2}+c_{3}E\int_{t}^{T}e^{\beta r}|\delta y_{r}|^{2}dr
\leq c_{3}^{-1}T\sup_{t\leq s\leq T}Ee^{\beta s}|\delta Y_{s}|^{2}+c_{3}T\sup_{t\leq s\leq T}Ee^{\beta s}|\delta y_{s}|^{2}. \qquad (29)

In the same way

2E\left[\int_{t}^{T}e^{\beta s}|\delta Y_{s}|E^{\mathcal{F}_{s}}\big[\int_{s}^{T}k(r,s)|\delta z_{r}|dr\big]ds\right]\leq c_{3}^{-1}T\sup_{t\leq s\leq T}Ee^{\beta s}|\delta Y_{s}|^{2}+c_{3}E\int_{t}^{T}e^{\beta r}|\delta z_{r}|^{2}dr. \qquad (30)

Substituting (29) and (30) into (28), choosing \beta>(c_{1}^{-1}+c_{2}^{-1})M_{1} and letting c_{1},c_{2}\to\infty, we get

Ee^{\beta t}|\delta Y_{t}|^{2}+E\int_{t}^{T}e^{\beta s}\delta Z_{s}^{2}ds\leq 2M_{1}M_{2}c_{3}^{-1}\sup_{t\leq s\leq T}Ee^{\beta s}|\delta Y_{s}|^{2}+2c_{3}M_{1}M_{2}T\left[\sup_{t\leq s\leq T}Ee^{\beta s}|\delta y_{s}|^{2}+E\int_{t}^{T}e^{\beta s}|\delta z_{s}|^{2}ds\right].

Then it is not difficult to get

(1-4M_{1}M_{2}c_{3}^{-1})\sup_{0\leq s\leq T}Ee^{\beta s}|\delta Y_{s}|^{2}+E\int_{0}^{T}e^{\beta s}\delta Z_{s}^{2}ds\leq 4c_{3}M_{1}M_{2}T\left[\sup_{0\leq s\leq T}Ee^{\beta s}|\delta y_{s}|^{2}+E\int_{0}^{T}e^{\beta s}|\delta z_{s}|^{2}ds\right],

which shows

(1-4M_{1}M_{2}c_{3}^{-1})\|(\delta Y,\delta Z)\|_{\beta}\leq 4c_{3}M_{1}M_{2}T\|(\delta y,\delta z)\|_{\beta}.

Under the assumption M_{1}M_{2}<8^{-1}T^{-\frac{1}{2}}, taking c_{3}=T^{-\frac{1}{2}} we obtain that the mapping \Phi:(y,z)\to(Y,Z) from \mathbb{H}_{T,\beta}^{2}\left(\mathbb{R}^{d}\right)\times\mathbb{H}_{T,\beta}^{2}\left(\mathbb{R}^{d\times m}\right) onto itself is a contraction, and hence there exists a fixed point, which is the unique continuous solution of the anticipated backward stochastic differential equation.
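For concreteness, one can verify that this choice of c_{3} indeed gives a contraction: since M_{1}M_{2}<8^{-1}T^{-\frac{1}{2}} and c_{3}=T^{-\frac{1}{2}}, we have 4M_{1}M_{2}c_{3}^{-1}=4c_{3}M_{1}M_{2}T=4M_{1}M_{2}T^{\frac{1}{2}}<\frac{1}{2}, so the inequality above yields

\|(\delta Y,\delta Z)\|_{\beta}\leq\frac{4M_{1}M_{2}T^{\frac{1}{2}}}{1-4M_{1}M_{2}T^{\frac{1}{2}}}\|(\delta y,\delta z)\|_{\beta}<\|(\delta y,\delta z)\|_{\beta}.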

This completes the proof of Lemma 3.3.

Lemma 3.4  Let (u_{t}^{*})_{0\leq t\leq T} be the optimal control process, let (X_{t}^{*})_{0\leq t\leq T} be the corresponding state process, and let (p_{t},q_{t}) be the adjoint process satisfying (21). Then the G\hat{a}teaux derivative of J at u^{*}_{t} in the direction \beta_{t} is

\frac{d}{d\varepsilon}J(u_{t}^{*}+\varepsilon\beta_{t})\Big|_{\varepsilon=0}=E\int_{0}^{T}\Big[H^{*}_{u}(t)+\int_{t}^{T}l(s,t)H^{*}_{v}(s)ds\Big]\cdot\beta_{t}dt. \qquad (31)

Proof: Let X_{t}^{*} and X_{t}^{\varepsilon} be the state processes corresponding to u_{t}^{*} and u_{t}^{\varepsilon}, respectively. Define V_{t} by

dV_{t}=\Big[b_{x}^{*}(t)V_{t}+b_{y}^{*}(t)\int_{0}^{t}k(t,s)V_{s}ds+b_{u}^{*}(t)\beta_{t}+b_{v}^{*}(t)\int_{0}^{t}l(t,s)\beta_{s}ds\Big]dt+\Big[\sigma_{x}^{*}(t)V_{t}+\sigma_{y}^{*}(t)\int_{0}^{t}k(t,s)V_{s}ds+\sigma_{u}^{*}(t)\beta_{t}+\sigma_{v}^{*}(t)\int_{0}^{t}l(t,s)\beta_{s}ds\Big]dW_{t},\qquad V_{0}=0. \qquad (35)

It is easy to get

\sup_{0\leq t\leq T}\lim_{\varepsilon\to 0}E\Big[\frac{X_{t}^{\varepsilon}-X_{t}^{*}}{\varepsilon}-V_{t}\Big]^{2}=0.

So we have that

\frac{J(u_{t}^{\varepsilon})-J(u_{t}^{*})}{\varepsilon}\to E\left[\int_{0}^{T}\left(f_{x}^{*}(t)V_{t}+f_{y}^{*}(t)\int_{0}^{t}k(t,s)V_{s}ds+f_{u}^{*}(t)\beta_{t}+f_{v}^{*}(t)\int_{0}^{t}l(t,s)\beta_{s}ds\right)dt+g_{x}(X_{T}^{*})V_{T}\right], \qquad (36)

as \varepsilon\to 0.

By Itô's formula, we have that

d(p_{t}V_{t}) = p_{t}dV_{t}+V_{t}dp_{t}+dp_{t}dV_{t}
= p_{t}\Big[b_{x}^{*}(t)V_{t}+b_{y}^{*}(t)\int_{0}^{t}k(t,s)V_{s}ds+b_{u}^{*}(t)\beta_{t}+b_{v}^{*}(t)\int_{0}^{t}l(t,s)\beta_{s}ds\Big]dt
\quad-V_{t}\Big[b_{x}^{*}(t)p_{t}+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)b_{y}^{*}(s)p_{s}ds\big]+\sigma_{x}^{*}(t)q_{t}+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)\sigma_{y}^{*}(s)q_{s}ds\big]+f_{x}^{*}(t)+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)f_{y}^{*}(s)ds\big]\Big]dt
\quad+q_{t}\Big[\sigma_{x}^{*}(t)V_{t}+\sigma_{y}^{*}(t)\int_{0}^{t}k(t,s)V_{s}ds+\sigma_{u}^{*}(t)\beta_{t}+\sigma_{v}^{*}(t)\int_{0}^{t}l(t,s)\beta_{s}ds\Big]dt+M_{t}dW_{t}
= \Big[b_{y}^{*}(t)p_{t}\int_{0}^{t}k(t,s)V_{s}ds-V_{t}E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)b_{y}^{*}(s)p_{s}ds\big]+\sigma_{y}^{*}(t)q_{t}\int_{0}^{t}k(t,s)V_{s}ds-V_{t}E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)\sigma_{y}^{*}(s)q_{s}ds\big]\Big]dt
\quad-\Big[f_{x}^{*}(t)V_{t}+V_{t}E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)f_{y}^{*}(s)ds\big]\Big]dt+\Big[b_{u}^{*}(t)p_{t}+\sigma_{u}^{*}(t)q_{t}\Big]\beta_{t}dt
\quad+\Big[b_{v}^{*}(t)p_{t}+\sigma_{v}^{*}(t)q_{t}\Big]\int_{0}^{t}l(t,s)\beta_{s}ds\,dt+M_{t}dW_{t}, \qquad (37)

where (M_{t})_{0\leq t\leq T} is an \mathcal{F}_{t}-adapted process.

Consider

Eg_{x}(X_{T}^{*})V_{T} = Ep_{T}V_{T}=E\int_{0}^{T}d(p_{t}V_{t})+Ep_{0}V_{0}
= E\int_{0}^{T}b_{y}^{*}(t)p_{t}\int_{0}^{t}k(t,s)V_{s}ds\,dt-E\int_{0}^{T}V_{t}\int_{t}^{T}k(s,t)b_{y}^{*}(s)p_{s}ds\,dt
\quad+E\int_{0}^{T}\sigma_{y}^{*}(t)q_{t}\int_{0}^{t}k(t,s)V_{s}ds\,dt-E\int_{0}^{T}V_{t}\int_{t}^{T}k(s,t)\sigma_{y}^{*}(s)q_{s}ds\,dt
\quad-E\int_{0}^{T}f_{x}^{*}(t)V_{t}dt-E\int_{0}^{T}V_{t}\int_{t}^{T}k(s,t)f_{y}^{*}(s)ds\,dt
\quad+E\int_{0}^{T}\Big[b_{u}^{*}(t)p_{t}+\sigma_{u}^{*}(t)q_{t}\Big]\beta_{t}dt+E\int_{0}^{T}\Big[b_{v}^{*}(t)p_{t}+\sigma_{v}^{*}(t)q_{t}\Big]\int_{0}^{t}l(t,s)\beta_{s}ds\,dt. \qquad (38)

By exchanging the order of integration, we have

\int_{0}^{T}b_{y}^{*}(t)p_{t}\int_{0}^{t}k(t,s)V_{s}ds\,dt=\int_{0}^{T}V_{t}\int_{t}^{T}k(s,t)b_{y}^{*}(s)p_{s}ds\,dt, \qquad (39)
\int_{0}^{T}\sigma_{y}^{*}(t)q_{t}\int_{0}^{t}k(t,s)V_{s}ds\,dt=\int_{0}^{T}V_{t}\int_{t}^{T}k(s,t)\sigma_{y}^{*}(s)q_{s}ds\,dt, \qquad (40)

and

\int_{0}^{T}V_{t}\int_{t}^{T}k(s,t)f_{y}^{*}(s)ds\,dt=\int_{0}^{T}f_{y}^{*}(t)\int_{0}^{t}k(t,s)V_{s}ds\,dt. \qquad (41)

Substituting (39) and (40) into (38), we get

Eg_{x}(X_{T}^{*})V_{T} = -E\int_{0}^{T}f_{x}^{*}(t)V_{t}dt-E\int_{0}^{T}V_{t}\int_{t}^{T}k(s,t)f_{y}^{*}(s)ds\,dt
\quad+E\int_{0}^{T}\Big[b_{u}^{*}(t)p_{t}+\sigma_{u}^{*}(t)q_{t}\Big]\beta_{t}dt+E\int_{0}^{T}\Big[b_{v}^{*}(t)p_{t}+\sigma_{v}^{*}(t)q_{t}\Big]\int_{0}^{t}l(t,s)\beta_{s}ds\,dt. \qquad (42)

Then, substituting (42) into (36) and using (41), we have

\frac{d}{d\varepsilon}J(u_{t}^{*}+\varepsilon\beta_{t})\Big|_{\varepsilon=0} = E\int_{0}^{T}\Big(f_{x}^{*}(t)V_{t}+f_{y}^{*}(t)\int_{0}^{t}k(t,s)V_{s}ds+f_{u}^{*}(t)\beta_{t}+f_{v}^{*}(t)\int_{0}^{t}l(t,s)\beta_{s}ds\Big)dt
\quad-E\int_{0}^{T}f_{x}^{*}(t)V_{t}dt-E\int_{0}^{T}V_{t}\int_{t}^{T}k(s,t)f_{y}^{*}(s)ds\,dt
\quad+E\int_{0}^{T}\Big[b_{u}^{*}(t)p_{t}+\sigma_{u}^{*}(t)q_{t}\Big]\beta_{t}dt+E\int_{0}^{T}\Big[b_{v}^{*}(t)p_{t}+\sigma_{v}^{*}(t)q_{t}\Big]\int_{0}^{t}l(t,s)\beta_{s}ds\,dt
= E\int_{0}^{T}\Big[b_{u}^{*}(t)p_{t}+\sigma_{u}^{*}(t)q_{t}+f_{u}^{*}(t)\Big]\beta_{t}dt+E\int_{0}^{T}\Big[b_{v}^{*}(t)p_{t}+\sigma_{v}^{*}(t)q_{t}+f_{v}^{*}(t)\Big]\int_{0}^{t}l(t,s)\beta_{s}ds\,dt
= E\int_{0}^{T}\Big[H_{u}^{*}(t)\beta_{t}+H_{v}^{*}(t)\int_{0}^{t}l(t,s)\beta_{s}ds\Big]dt
= E\int_{0}^{T}\Big[H^{*}_{u}(t)+\int_{t}^{T}l(s,t)H^{*}_{v}(s)ds\Big]\cdot\beta_{t}dt. \qquad (43)

The last equality holds because

\int_{0}^{T}H_{v}^{*}(t)\int_{0}^{t}l(t,s)\beta_{s}ds\,dt=\int_{0}^{T}\beta_{t}\int_{t}^{T}l(s,t)H_{v}^{*}(s)ds\,dt.

This completes the proof of Lemma 3.4.  

Since (u_{t}^{*})_{0\leq t\leq T} is the optimal control process, we have the inequality

\frac{d}{d\varepsilon}J\big(u_{t}^{*}+\varepsilon(\alpha_{t}-u_{t}^{*})\big)\Big|_{\varepsilon=0}\geq 0.

By Lemma 3.4, we get

E\int_{0}^{T}\Big[H^{*}_{u}(t)+\int_{t}^{T}l(s,t)H^{*}_{v}(s)ds\Big]\cdot(\alpha_{t}-u^{*}_{t})dt\geq 0.

So

E\Big[\mathbf{1}_{A}\Big[H^{*}_{u}(t)+\int_{t}^{T}l(s,t)H^{*}_{v}(s)ds\Big]\cdot(\alpha_{t}-u^{*}_{t})\Big]\geq 0,\quad\forall t\in[0,T],\quad\forall A\in\mathcal{F}_{t}.

To ensure adaptedness, we can rewrite the above inequality as

E\Big[\mathbf{1}_{A}\Big[H^{*}_{u}(t)+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}l(s,t)H^{*}_{v}(s)ds\big]\Big]\cdot(\alpha_{t}-u^{*}_{t})\Big]\geq 0,\quad\forall t\in[0,T],\quad\forall A\in\mathcal{F}_{t},

and obtain that

\Big[H^{*}_{u}(t)+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}l(s,t)H^{*}_{v}(s)ds\big]\Big]\cdot(\alpha_{t}-u^{*}_{t})\geq 0,\qquad\forall t\in[0,T].

This completes the proof of Theorem 3.1.

Remark 3.5  If the optimal control process (u_{t}^{*})_{0\leq t\leq T} takes values in the interior of \mathbb{U}, then we can replace (22) with the following condition

H_{u}^{*}(t)+E^{\mathcal{F}_{t}}\Big[\int_{t}^{T}l(s,t)H_{v}^{*}(s)ds\Big]=0.

Thus, we give the optimal system

dX_{t}^{*}=H_{p}^{*}(t)dt+H_{q}^{*}(t)dW_{t},
-dp_{t}=\Big[H^{*}_{x}(t)+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)H^{*}_{y}(s)ds\big]\Big]dt-q_{t}dW_{t},
X_{0}^{*}=x,\qquad p_{T}=g_{x}(X_{T}^{*}),
H_{u}^{*}(t)+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}l(s,t)H_{v}^{*}(s)ds\big]=0, \qquad (51)

where

H^{*}(t)=H\Big(t,X_{t}^{*},\int_{0}^{t}k(t,s)X_{s}^{*}ds,u_{t}^{*},\int_{0}^{t}l(t,s)u_{s}^{*}ds,p_{t},q_{t}\Big),
H(t,x,y,u,v,p,q)=b(t,x,y,u,v)p+\sigma(t,x,y,u,v)q+f(t,x,y,u,v).

4 Linear quadratic case

In this section, we consider a linear quadratic (LQ for short) case, which describes a moving average linear quadratic regulator problem. For simplicity, let Y_{t} be the moving average process defined in (9) and let v_{t}=\int_{0}^{t}l(t,s)u_{s}ds. The state process is defined as follows:

dX_{t}=\big(A_{t}X_{t}+B_{t}Y_{t}+C_{t}u_{t}+P_{t}v_{t}\big)dt+\big(D_{t}X_{t}+F_{t}Y_{t}+H_{t}u_{t}+N_{t}v_{t}\big)dW_{t}, \qquad (52)

with the cost function

J(u)=\frac{1}{2}E\Big[\int_{0}^{T}\big(Q_{t}X_{t}^{2}+S_{t}Y_{t}^{2}+R_{t}u_{t}^{2}\big)dt+GX_{T}^{2}\Big]. \qquad (53)

Here G>0 and Q_{t},S_{t},R_{t} are positive functions.

Using the conclusions of Section 3, we can get the adjoint equation

-dp_{t}=\Big[A_{t}p_{t}+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)B_{s}p_{s}ds\big]+D_{t}q_{t}+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)F_{s}q_{s}ds\big]+Q_{t}X^{*}_{t}+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)S_{s}Y^{*}_{s}ds\big]\Big]dt-q_{t}dW_{t},\qquad p_{T}=GX_{T}^{*}, \qquad (59)

and the optimal control process u_{t}^{*} should satisfy

C_{t}p_{t}+H_{t}q_{t}+R_{t}u^{*}_{t}+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}l(s,t)P_{s}p_{s}ds\big]+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}l(s,t)N_{s}q_{s}ds\big]=0,

i.e.,

u_{t}^{*}=-R_{t}^{-1}\Big(C_{t}p_{t}+H_{t}q_{t}+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}l(s,t)P_{s}p_{s}ds\big]+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}l(s,t)N_{s}q_{s}ds\big]\Big). \qquad (60)

Theorem 4.1  The process u_{t}^{*}, t\in[0,T], given by (60) is the unique optimal control for the moving average LQ problem (52)-(53), where (p_{t},q_{t}) is defined by (59).

Proof:  We first prove that u_{t}^{*} is an optimal control. For any \tilde{u}_{t}\in\mathbb{U}, let (\tilde{X}_{t},\tilde{Y}_{t},\tilde{v}_{t}) and (X_{t}^{*},Y_{t}^{*},v_{t}^{*}) be the processes corresponding to \tilde{u}_{t} and u_{t}^{*}, respectively. We have that

d(\tilde{X}_{t}-X_{t}^{*})= \big[A_{t}(\tilde{X}_{t}-X^{*}_{t})+B_{t}(\tilde{Y}_{t}-Y^{*}_{t})+C_{t}(\tilde{u}_{t}-u^{*}_{t})+P_{t}(\tilde{v}_{t}-v^{*}_{t})\big]dt+\big[D_{t}(\tilde{X}_{t}-X^{*}_{t})+F_{t}(\tilde{Y}_{t}-Y^{*}_{t})+H_{t}(\tilde{u}_{t}-u^{*}_{t})+N_{t}(\tilde{v}_{t}-v^{*}_{t})\big]dW_{t}.

Consider

d\big(p_{t}(\tilde{X}_{t}-X_{t}^{*})\big)= p_{t}d(\tilde{X}_{t}-X^{*}_{t})+(\tilde{X}_{t}-X^{*}_{t})dp_{t}+dp_{t}\,d(\tilde{X}_{t}-X_{t}^{*})
= p_{t}\left[A_{t}(\tilde{X}_{t}-X^{*}_{t})+B_{t}(\tilde{Y}_{t}-Y^{*}_{t})+C_{t}(\tilde{u}_{t}-u^{*}_{t})+P_{t}(\tilde{v}_{t}-v^{*}_{t})\right]dt
\quad-(\tilde{X}_{t}-X_{t}^{*})\Big[A_{t}p_{t}+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)B_{s}p_{s}ds\big]+D_{t}q_{t}+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)F_{s}q_{s}ds\big]+Q_{t}X^{*}_{t}+E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)S_{s}Y^{*}_{s}ds\big]\Big]dt
\quad+q_{t}\left[D_{t}(\tilde{X}_{t}-X^{*}_{t})+F_{t}(\tilde{Y}_{t}-Y^{*}_{t})+H_{t}(\tilde{u}_{t}-u^{*}_{t})+N_{t}(\tilde{v}_{t}-v^{*}_{t})\right]dt+M_{t}dW_{t}
= p_{t}B_{t}(\tilde{Y}_{t}-Y_{t}^{*})dt-(\tilde{X}_{t}-X_{t}^{*})E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)B_{s}p_{s}ds\big]dt
\quad+q_{t}F_{t}(\tilde{Y}_{t}-Y^{*}_{t})dt-(\tilde{X}_{t}-X_{t}^{*})E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)F_{s}q_{s}ds\big]dt
\quad+(C_{t}p_{t}+H_{t}q_{t})(\tilde{u}_{t}-u^{*}_{t})dt-Q_{t}X_{t}^{*}(\tilde{X}_{t}-X_{t}^{*})dt
\quad+p_{t}P_{t}(\tilde{v}_{t}-v^{*}_{t})dt+q_{t}N_{t}(\tilde{v}_{t}-v^{*}_{t})dt-(\tilde{X}_{t}-X_{t}^{*})E^{\mathcal{F}_{t}}\big[\int_{t}^{T}k(s,t)S_{s}Y^{*}_{s}ds\big]dt+M_{t}dW_{t}, \qquad (62)

where (M_{t})_{0\leq t\leq T} is an \mathcal{F}_{t}-adapted process. By exchanging the order of integration, we get

\int_{0}^{T}p_{t}B_{t}(\tilde{Y}_{t}-Y_{t}^{*})dt=\int_{0}^{T}p_{t}B_{t}\int_{0}^{t}k(t,s)(\tilde{X}_{s}-X_{s}^{*})ds\,dt=\int_{0}^{T}(\tilde{X}_{t}-X_{t}^{*})\int_{t}^{T}k(s,t)B_{s}p_{s}ds\,dt. \qquad (63)

In the same way, we have

\int_{0}^{T}q_{t}F_{t}(\tilde{Y}_{t}-Y_{t}^{*})dt=\int_{0}^{T}(\tilde{X}_{t}-X_{t}^{*})\int_{t}^{T}k(s,t)F_{s}q_{s}ds\,dt, \qquad (64)
\int_{0}^{T}p_{t}P_{t}(\tilde{v}_{t}-v_{t}^{*})dt=\int_{0}^{T}(\tilde{u}_{t}-u_{t}^{*})\int_{t}^{T}l(s,t)P_{s}p_{s}ds\,dt, \qquad (65)
\int_{0}^{T}q_{t}N_{t}(\tilde{v}_{t}-v_{t}^{*})dt=\int_{0}^{T}(\tilde{u}_{t}-u_{t}^{*})\int_{t}^{T}l(s,t)N_{s}q_{s}ds\,dt, \qquad (66)

and

\int_{0}^{T}(\tilde{X}_{t}-X_{t}^{*})\int_{t}^{T}k(s,t)S_{s}Y^{*}_{s}ds\,dt=\int_{0}^{T}S_{t}Y^{*}_{t}(\tilde{Y}_{t}-Y_{t}^{*})dt. \qquad (67)

Integrating (62) from 0 to T and taking the expectation, by (60), (63), (64), (65), (66) and (67) we have

EGX_{T}^{*}(\tilde{X}_{T}-X_{T}^{*})= Ep_{T}(\tilde{X}_{T}-X_{T}^{*})
= E\int_{0}^{T}d\big(p_{t}(\tilde{X}_{t}-X_{t}^{*})\big)
= -E\int_{0}^{T}\Big[R_{t}u_{t}^{*}(\tilde{u}_{t}-u^{*}_{t})+Q_{t}X_{t}^{*}(\tilde{X}_{t}-X^{*}_{t})+S_{t}Y^{*}_{t}(\tilde{Y}_{t}-Y_{t}^{*})\Big]dt. \qquad (68)

It follows that

J(\tilde{u}_{t})-J(u_{t}^{*})= \frac{1}{2}E\int_{0}^{T}\Big[Q_{t}(\tilde{X}_{t}^{2}-X_{t}^{*2})+S_{t}(\tilde{Y}_{t}^{2}-Y_{t}^{*2})+R_{t}(\tilde{u}_{t}^{2}-u_{t}^{*2})\Big]dt+\frac{1}{2}EG(\tilde{X}_{T}^{2}-X_{T}^{*2})
= \frac{1}{2}E\int_{0}^{T}\Big[Q_{t}(\tilde{X}_{t}^{2}-X_{t}^{*2})-2Q_{t}X_{t}^{*}(\tilde{X}_{t}-X^{*}_{t})+S_{t}(\tilde{Y}_{t}^{2}-Y_{t}^{*2})-2S_{t}Y^{*}_{t}(\tilde{Y}_{t}-Y_{t}^{*})+R_{t}(\tilde{u}_{t}^{2}-u_{t}^{*2})-2R_{t}u_{t}^{*}(\tilde{u}_{t}-u^{*}_{t})\Big]dt
\quad+\frac{1}{2}EG(\tilde{X}_{T}^{2}-X_{T}^{*2})-EGX_{T}^{*}(\tilde{X}_{T}-X_{T}^{*})
= \frac{1}{2}E\int_{0}^{T}\Big[Q_{t}(\tilde{X}_{t}-X_{t}^{*})^{2}+S_{t}(\tilde{Y}_{t}-Y_{t}^{*})^{2}+R_{t}(\tilde{u}_{t}-u_{t}^{*})^{2}\Big]dt+\frac{1}{2}EG(\tilde{X}_{T}-X_{T}^{*})^{2}
\geq 0.

This shows that utu_{t}^{*} is an optimal control.

Next, we prove that u^{*}_{t} is unique. Assume that both u_{t}^{*,1} and u_{t}^{*,2} are optimal controls, and let X_{t}^{1} and X_{t}^{2} be the corresponding state processes, respectively. By linearity, \frac{X_{t}^{1}+X_{t}^{2}}{2} is the state process corresponding to \frac{u_{t}^{*,1}+u_{t}^{*,2}}{2}. We assume there exist constants \delta>0 and \alpha\geq 0 such that R_{t}\geq\delta and

J(u_{t}^{*,1})=J(u_{t}^{*,2})=\alpha.

Using the fact that a^{2}+b^{2}=2\big[\big(\frac{a+b}{2}\big)^{2}+\big(\frac{a-b}{2}\big)^{2}\big], we have that

2\alpha= J(u_{t}^{*,1})+J(u_{t}^{*,2})
= \frac{1}{2}E\int_{0}^{T}\Big[Q_{t}\big((X_{t}^{1})^{2}+(X_{t}^{2})^{2}\big)+S_{t}\big((Y_{t}^{1})^{2}+(Y_{t}^{2})^{2}\big)+R_{t}\big((u_{t}^{*,1})^{2}+(u_{t}^{*,2})^{2}\big)\Big]dt+\frac{1}{2}EG\big((X_{T}^{1})^{2}+(X_{T}^{2})^{2}\big)
\geq E\int_{0}^{T}\Big[Q_{t}\Big(\frac{X_{t}^{1}+X_{t}^{2}}{2}\Big)^{2}+S_{t}\Big(\frac{Y_{t}^{1}+Y_{t}^{2}}{2}\Big)^{2}+R_{t}\Big(\frac{u_{t}^{*,1}+u_{t}^{*,2}}{2}\Big)^{2}\Big]dt+EG\Big(\frac{X_{T}^{1}+X_{T}^{2}}{2}\Big)^{2}+E\int_{0}^{T}R_{t}\Big(\frac{u_{t}^{*,1}-u_{t}^{*,2}}{2}\Big)^{2}dt
= 2J\Big(\frac{u_{t}^{*,1}+u_{t}^{*,2}}{2}\Big)+E\int_{0}^{T}R_{t}\Big(\frac{u_{t}^{*,1}-u_{t}^{*,2}}{2}\Big)^{2}dt
\geq 2\alpha+\frac{\delta}{4}E\int_{0}^{T}|u_{t}^{*,1}-u_{t}^{*,2}|^{2}dt.

Thus, we have

E\int_{0}^{T}|u_{t}^{*,1}-u_{t}^{*,2}|^{2}dt\leq 0,

which shows that u_{t}^{*,1}=u_{t}^{*,2}. This completes the proof of Theorem 4.1.
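As a sanity check (our remark), when k\equiv l\equiv 0 the memory terms Y_{t} and v_{t} vanish, the conditional expectations in (59) and (60) disappear, and the optimal control reduces to the familiar stochastic LQ feedback

u_{t}^{*}=-R_{t}^{-1}\big(C_{t}p_{t}+H_{t}q_{t}\big),\qquad -dp_{t}=\big[A_{t}p_{t}+D_{t}q_{t}+Q_{t}X_{t}^{*}\big]dt-q_{t}dW_{t},\quad p_{T}=GX_{T}^{*}.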

Acknowledgments

The authors acknowledge the financial support from the National Natural Science Foundation of China (grant no. 11871244).

References

  • 1 Bloom F. Bounds for Solutions to a Class of Integro-differential Equations Associated with a Theory of Rigid Nonconducting Material Dielectrics. SIAM Journal on Mathematical Analysis 1980; 11(2): 265–291.
  • 2 Holmåker K. Global asymptotic stability for a stationary solution of a system of integro-differential equations describing the formation of liver zones. SIAM Journal on Mathematical Analysis 1993; 24(1): 116–128.
  • 3 Forbes LK, Crozier S, Doddrell DM. Calculating current densities and fields produced by shielded magnetic resonance imaging probes. SIAM Journal on Applied Mathematics 1997; 57(2): 401–425.
  • 4 Du H, Zhao G, Zhao C. Reproducing kernel method for solving Fredholm integro-differential equations with weakly singularity. Journal of Computational and Applied Mathematics 2014; 255: 122–132.
  • 5 Mao X, Riedle M. Mean square stability of stochastic Volterra integro-differential equations. Systems &\& Control Letters 2006; 55(6): 459–465.
  • 6 Nesterenko O. Modified projection-iterative method for weakly nonlinear integrodifferential equations with parameters. Journal of Mathematical Sciences 2014; 198(3): 328–336.
  • 7 Dzhumabaev DS. On one approach to solve the linear boundary value problems for Fredholm integro-differential equations. Journal of Computational and Applied Mathematics 2016; 294: 342–357.
  • 8 Zhang W, Liang H, Gao J. Theoretical and numerical analysis of the Euler–Maruyama method for generalized stochastic Volterra integro-differential equations. Journal of Computational and Applied Mathematics 2020; 365: 112364.
  • 9 Kim JU. Control of a second-order integro-differential equation. SIAM Journal on Control and Optimization 1993; 31(1): 101–110.
  • 10 Mashayekhi S, Ordokhani Y, Razzaghi M. Hybrid functions approach for optimal control of systems described by integro-differential equations. Applied Mathematical Modelling 2013; 37(5): 3355–3368.
  • 11 Assanova AT, Bakirova E, Kadirbayeva ZM. Numerical solution to a control problem for integro-differential equations. Computational Mathematics and Mathematical Physics 2020; 60: 203–221.
  • 12 Wang T. Backward stochastic Volterra integro-differential equations and applications in optimal control problems. SIAM Journal on Control and Optimization 2022; 60(4): 2393–2419.
  • 13 El Karoui N, Peng S, Quenez MC. Backward stochastic differential equations in finance. Mathematical Finance 1997; 7(1): 1–71.
  • 14 Peng S, Yang Z. Anticipated backward stochastic differential equations. The Annals of Probability 2009; 37(3): 877–902.
  • 15 Yang Z, Elliott R. Some properties of generalized anticipated backward stochastic differential equations. Electronic Communications in Probability 2013; 18: 1–10.