This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Optimal investment-consumption problem: post-retirement with minimum guarantee

Abstract

We study the optimal investment-consumption problem for a member of defined contribution plan during the decumulation phase. For a fixed annuitization time, to achieve higher final annuity, we consider a variable consumption rate. Moreover, to have a minimum guarantee for the final annuity, a safety level for the wealth process is considered. To solve the stochastic optimal control problem via dynamic programming, we obtain a Hamilton-Jacobi-Bellman (HJB) equation on a bounded domain. The existence and uniqueness of classical solutions are proved through the dual transformation. We apply the finite difference method to find numerical approximations of the solution of the HJB equation. Finally, the simulation results for the optimal investment-consumption strategies, optimal wealth process and the final annuity for different admissible ranges of consumption are given. Furthermore, by taking into account the market present value of the cash flows before and after the annuitization, we compare the outcomes of different scenarios.

Hassan Dadashi

Department of Mathematics,
Institute for Advanced Studies in Basic Sciences (IASBS),
Zanjan, 45137-66731, Iran
dadashi@iasbs.ac.ir

Keywords: Defined contribution plan, Decumulation phase, Final annuity guarantee, HJB equation, Policy iteration method

AMS Subject Classification: 60J70, 93E20, 65N06

1 Introduction

In this work, focusing on the decumulation phase in a defined contribution plan, we fix the annuitization time and investigate the optimal investment-consumption strategies in a Brownian market model with a time dependent mortality rate. In formulating the loss function, we consider a target for the consumption during the decumulation phase and a target for the terminal accumulated wealth. Since avoiding the ruin possibility, and sometimes having a minimum guarantee on the return, is an essential issue in pension fund management, motivated from [12], we assume a minimum guarantee for the terminal wealth.

In this paper, different from the existing literature, we assume a variable consumption rate. Assuming a fixed rate of consumption during the whole decumulation phase, which is usually a long period, is far from optimality. Our simulation results justify that assuming a variable consumption rate, although quite restricted, yields much higher final annuities. However, to cover the essential expenses of the retiree, it is reasonable to fix a lower bound for the consumption rate. Therefore, having in mind a desired rate of consumption, we assume that the consumption rate varies between the two limits C1C_{1} and C2C_{2}, in which C1>C2C_{1}>C_{2}. To compare the optimal portfolios corresponding to different scenarios of the admissible range of consumption [C2,C1][C_{2},C_{1}], we take into account the market present value of the cash flows before and after the annuitization.

Gerrard et al. [18] study the portfolio optimization problem post-retirement when the loss function is defined by a target function on the wealth process and the annuitization time and the consumption rate are fixed. In [19], the authors violate the fixed consumption rate assumption. In a similar framework, Di Giacinto et al. [12] explore the optimal investment strategy when a minimum guarantee for the final wealth is assumed and the consumption rate is fixed. They obtain the closed form for the optimal strategies just in the case that the running cost is neglected. For the general form of the loss function, a numerical algorithm for finding approximations of optimal investment strategies is developed in [8]. Using a loss function similar to the one that is used in this work but without any constraint on the wealth process, Gerrard et al. [20] investigate the optimal annuitization time together with the optimal consumption-investment strategies. Vigna [29] shows that the target based approach in portfolio optimization of pension funds yields portfolios that are efficient in the mean-variance setting.

As an extension of our framework, the annuitization time can be considered as a control variable which can reduce to some extent the annuity risk, the risk that exists when the interest rate is high. Furthermore, since in our problem the time horizon is long, considering a non-constant interest rate model yields a more realistic framework.

Assuming the same constraint on the final wealth as in Di Giacinto et al. [12], we consider a variable consumption rate, as a novelty in our framework. Moreover, we consider a running cost term in the loss function based on the consumptions. To solve the given stochastic optimal control by applying dynamic programming, we get a nonlinear HJB equation which does not seem to have an explicit solution. Using dual transformation, we prove that the value function is sufficiently smooth to imply the verification theorem. Then, to obtain numerical approximations of the value function and the optimal investment-consumption strategies, we employ a numerical scheme based on a fully implicit backward in time finite difference method. Moreover, to tackle the algebraic nonlinear systems that appear in the numerical procedure, the policy iteration method is applied. The policy iteration method is a well studied method in finding approximations of solutions of optimal control problems (see [14], [15] and [27]) in which the value function and the strategies are derived iteratively to converge to the solution of the corresponding HJB equation and the optimal strategies. In [6], this method is employed in a portfolio selection problem with transaction costs.

The portfolio optimization problem post-retirement has been studied by many scholars who consider different utility or loss functions, different control variables and different wealth dynamics. Furthermore, different constraints on the control variables or on the wealth process can be assumed. We mention here a few related works. Milevsky and Young [24] study extensively the optimal annuitization and investment-consumption problem for a time dependent mortality rate in the all or nothing market and also the more general anything anytimes market, where gradual annuitization strategies are allowed. Milevsky et al. [23] derive the optimal investment and annuitization strategies for a retiree whose objective is to minimize the ruin probability when the consumption rate is fixed. Stabile [28] studies the optimal investment-consumption problem and investigates the optimal time for purchasing the annuity subject to a constant force of mortality by considering different utility functions based on the consumption and the final annuity. Blake et al. [4] compare the immediate annuitization at the retirement time with distribution programs involving differing exposures to equities during the retirement. Albrecht and Maurer [1] compare the immediate annuitization and the income drawdown option and determine the probability of running out of money before an uncertain date of death.

The article is organized as follows. In the next section, we specify the market model and the restrictions of control variables. In the third section, the set of admissible strategies, the loss function and also the value function are determined. In addition, some properties and the domain of the wealth process are identified. In Section 4, the corresponding HJB equation is written and, using dual transformation, the regularity of the solution is proved. In Section 5, we express the numerical algorithm to find approximations of the solution of the HJB equation. Finally in Section 6, the simulation results for the final annuity, optimal investment-consumption strategies and optimal wealth process are presented. Furthermore, to compare the outcomes of different scenarios, the present values of the corresponding cash flows are revealed.

2 The Market Model

We consider a Brownian market model that consists of a risky and a risk-less asset with the dynamics:

dPt=Pt(μdt+σdBt),\displaystyle dP_{t}=P_{t}(\mu dt+\sigma dB_{t}),
dAt=rAtdt,\displaystyle dA_{t}=rA_{t}dt,

where B()B(\cdot) is a Brownian motion on the filtered probability space (Ω,,𝔽,𝐏)(\Omega,\mathcal{F},\mathbb{F},\mathbf{P}) and rr is the fixed interest rate. So, the risky asset is a geometric Brownian motion with constant volatility σ\sigma and expected return μ=r+σβ\mu=r+\sigma\beta, in which β\beta is its Sharpe ratio.

At any time tt, let πt1\pi^{1}_{t} and 1πt11-\pi^{1}_{t} be the fractions of the fund’s portfolio that are invested in the risky and in the risk-less asset, respectively. Denoting the consumption rate at this time by πt2\pi^{2}_{t}, we have the following dynamics of the wealth process (or the fund value dynamics)

{dXt={[πt1(μr)+r]Xtπt2}dt+σπt1XtdBt,X0=x0.\displaystyle\begin{cases}dX_{t}=\{[\pi^{1}_{t}(\mu-r)+r]X_{t}-\pi^{2}_{t}\}dt+\sigma\pi^{1}_{t}X_{t}dB_{t},\\ X_{0}=x_{0}.\end{cases} (2.1)

We restrict the consumption rate to the interval [C2,C1][C_{2},C_{1}], in which C1C_{1} is the desired rate of consumption and usually is set to be equal to the annuity that is purchasable by the accumulated wealth at the retirement time and C2C_{2} is set as the minimum admissible consumption rate during the decumulation phase. Furthermore, the investment control variable π1\pi^{1} is restricted to the interval [0,L][0,L]. This means that the short-selling is prohibited. But, we can borrow from the money market up to LL times the fund value.

Our market model can be extended in several directions. Among many other models, Boulier et al. [5] consider a market model with stochastic interest rate. Han and Hung [17] equip the model with the inflation rate which has important consequences on the optimal strategies. Deelstra et al. [10], in a stochastic interest rate framework, consider a market model that consists of three assets, a risky asset, a risk-less asset and a bond. Hainaut and Deelstra [16] investigate the optimal time for the annuity purchase when the risky asset has the jump-diffusion dynamics and the economic utility function is replaced by the expected present value operator.

3 Optimal Control Problem

Let the decumulation phase be represented by the time interval [0,T][0,T]. We consider a retiree with age a0a_{0} who is going to postpone the annuitization until the age a1=a0+Ta_{1}=a_{0}+T. The main aim of a retiree in postponing the annuity purchase is to reach the desired annuity. Let FF be the target for the terminal accumulated wealth, XTX_{T}, by which the retiree can purchase the desired annuity at the age a1a_{1}. In other words, if aa1a_{a_{1}} is the actuarial value of the unitary lifetime annuity at the age a1a_{1}, then Faa1\frac{F}{a_{a_{1}}} would be the desired annuity. Furthermore, during the decumulation phase, part of the retiree’s concern is on the consumption and he or she would like to consume at the maximum admissible rate C1C_{1}. Therefore, we write the loss function by two terms. One term, as the running cost, is defined as the penalizing deviation of the interim consumption from the desired rate C1C_{1} and is weighted by κ>0\kappa>0. The second term is written as the penalizing deviation from the targeted annuity Faa1\frac{F}{a_{a_{1}}} at the terminal time.

We assume that the mortality rate νu\nu_{u}, ua0u\geq a_{0}, is independent of the asset dynamics. So, denoting ηt=e0t(r+νs+a0)𝑑s\eta_{t}=e^{-\int^{t}_{0}(r+\nu_{s+a_{0}})ds}, the loss function is written as

κ0Tηt(C1πt2)2𝑑t+ηT(FXTaa1)2,\displaystyle\kappa\int^{T}_{0}\eta_{t}(C_{1}-\pi^{2}_{t})^{2}dt+\eta_{T}\left(\frac{F-X_{T}}{a_{a_{1}}}\right)^{2}, (3.1)

where XTX_{T} is the value of the solution of Equation (2.1) at the terminal time.

For any 0tT0\leq t\leq T, consider the filtration 𝔽t:=(st)s[t,T]\mathbb{F}^{t}:=(\mathcal{F}^{t}_{s})_{s\in[t,T]} where st\mathcal{F}^{t}_{s} is the σ\sigma-algebra generated by the random variables (BuBt)u[t,s](B_{u}-B_{t})_{u\in[t,s]}. Then the strategies π1()\pi^{1}(\cdot) and π2()\pi^{2}(\cdot), that are supposed to be 𝔽t\mathbb{F}^{t}-progressively measurable processes, are chosen from 2(Ω×[t,T];[0,L])\mathcal{L}^{2}(\Omega\times[t,T];[0,L]) and 2(Ω×[t,T];[C2,C1])\mathcal{L}^{2}(\Omega\times[t,T];[C_{2},C_{1}]), respectively. It should be noted that for the controls that are chosen from these classes, the following equation has a unique strong solution; see [22, Section 5.6.C],

{dXs={[πs1(μr)+r]Xsπs2}ds+σπs1XtdBs,st,Xt=x.\displaystyle\begin{cases}dX_{s}=\{[\pi^{1}_{s}(\mu-r)+r]X_{s}-\pi^{2}_{s}\}ds+\sigma\pi^{1}_{s}X_{t}dB_{s},\qquad s\geq t,\\ X_{t}=x.\end{cases} (3.2)

which is denoted by X(;t,x,π1(),π2()).X(\cdot;t,x,\pi^{1}(\cdot),\pi^{2}(\cdot)).

To apply dynamic programming in finding the optimal strategies which minimize the above loss function, we have to embed the stochastic optimal control problem in a family of problems with varying initial time. So, for any initial time 0tT0\leq t\leq T and initial wealth x>0x>0, a stochastic optimal control problem is considered, by minimizing the following loss function over the set of admissible strategies that are specified in (3.4),

κtTηs(C1πs2)2𝑑s+ηT(FX(T;t,x,π1(),π2())aa1)2.\displaystyle\kappa\int^{T}_{t}\eta_{s}(C_{1}-\pi^{2}_{s})^{2}ds+\eta_{T}\left(\frac{F-X(T;t,x,\pi^{1}(\cdot),\pi^{2}(\cdot))}{a_{a_{1}}}\right)^{2}. (3.3)

The main feature of this work’s framework is assuming a minimum guarantee for the final annuity or equivalently for the terminal wealth. Let the safety level for the terminal wealth amount be SS, which depends on the retiree’s level of risk aversion. So, the set of admissible strategies at time 0tT0\leq t\leq T, when Xt=xX_{t}=x, reduces to

Π~ad(t,x):={\displaystyle\tilde{\Pi}_{ad}(t,x):=\{ π12(Ω×[t,T];[0,L]),π22(Ω×[t,T];[C1,C2])|π1,π2are\displaystyle\pi^{1}\in\mathcal{L}^{2}(\Omega\times[t,T];[0,L]),\pi^{2}\in\mathcal{L}^{2}(\Omega\times[t,T];[C_{1},C_{2}])|\;\pi^{1},\pi^{2}are
𝔽tprog.meas.,X(T;t,x,π1(),π2())Sa.s.}.\displaystyle\mathbb{F}^{t}-prog.\;meas.,X(T;t,x,\pi^{1}(\cdot),\pi^{2}(\cdot))\geq S\;a.s.\}. (3.4)

Regarding the loss function (3.3), for any (t,x)[0,T]×+(t,x)\in[0,T]\times\mathbb{R}^{+}, the following objective functional on the set of admissible strategies Π~ad(t,x)\tilde{\Pi}_{ad}(t,x) is defined

J~(t,x;π1(),π2()):=𝔼x\displaystyle\tilde{J}(t,x;\pi^{1}(\cdot),\pi^{2}(\cdot)):=\mathbb{E}^{x} [κtTηs(C1πs2)2ds\displaystyle[\kappa\int^{T}_{t}\eta_{s}(C_{1}-\pi^{2}_{s})^{2}ds
+ηT(FX(T;t,x,π1(),π2())aa1)2],\displaystyle+\eta_{T}\left(\frac{F-X(T;t,x,\pi^{1}(\cdot),\pi^{2}(\cdot))}{a_{a_{1}}}\right)^{2}], (3.5)

where 𝔼x\mathbb{E}^{x} stands for the expectation subject to Xt=xX_{t}=x. Our goal is to find the admissible strategies that minimize the above functional. To solve this stochastic optimal control problem via dynamic programming, the following value function is defined

V~(t,x):=infπ1(),π2()Π~ad(t,x)J~(t,x;π1(),π2()).\displaystyle\tilde{V}(t,x):=\inf_{\pi^{1}(\cdot),\pi^{2}(\cdot)\in\tilde{\Pi}_{ad}(t,x)}\tilde{J}(t,x;\pi^{1}(\cdot),\pi^{2}(\cdot)). (3.6)

Moreover, the corresponding HJB equation is specified in the next section.

Now, using the loss function definition, dynamic programming and the safety level constraint, we explore some properties and the domain of the state process. The minimum guarantee constraint at the terminal time imposes consequently a constraint on the wealth process during the decumulation phase. Actually, the following curve is as a barrier for the wealth process,

S(t)=C2r(C2rS)er(Tt),0tT.\displaystyle S(t)=\frac{C_{2}}{r}-(\frac{C_{2}}{r}-S)e^{-r(T-t)},\qquad 0\leq t\leq T. (3.7)

Moreover, the loss function formulation, (3.3), indicates that, at any time t[0,T]t\in[0,T], the wealth amount

F(t)=C1r+(FC1r)er(Tt),\displaystyle F(t)=\frac{C_{1}}{r}+(F-\frac{C_{1}}{r})e^{-r(T-t)}, (3.8)

guarantees reaching the target FF at the terminal time T, by investing the whole portfolio during the time interval [t,T][t,T] in the risk-less asset and consuming at the maximum rate C1C_{1}. Here, we call the function F()F(\cdot) the target function.

The above statements are proved precisely in the following proposition.

Theorem 3.1.

Let the initial wealth xx of the stochastic optimal control problem with initial time tt and loss function (3.3) lies in the interval [S(t),F(t)][S(t),F(t)]. If at any time s[t,T]s\in[t,T], the wealth reaches the boundary of the interval [S(s),F(s)][S(s),F(s)], then, from that ss on, the optimal wealth amount remains in such boundary till time TT. We call this property of the boundaries the absorbing property.

Proof.

The target function formulation (3.8), implies

dF(s)=(rF(s)C1)ds.dF(s)=(rF(s)-C_{1})ds.

Moreover, notice that taking the null investment strategy, π10\pi^{1}\equiv 0, and the maximum consumption strategy, π2C1\pi^{2}\equiv C_{1}, yield the same dynamics for the wealth process

dXs=(rXsC1)ds.dX_{s}=(rX_{s}-C_{1})ds.

Suppose that, by applying the optimal strategy π=(π1,π2)\pi=(\pi^{1},\pi^{2}) at the point (t,x)(t,x), the wealth process hits the target F(s)F(s) at some time s[t,T]s\in[t,T]. The above observation indicates that the strategy (0,C1)(0,C_{1}) at the point (s,F(s))(s,F(s)) yields zero cost, and therefore it is optimal. On the other hand, due to the dynamic programming principle, the restriction of the optimal strategy π=(π1,π2)\pi=(\pi^{1},\pi^{2}) to [s,T][s,T] gives the optimal strategy at the point (s,F(s))(s,F(s)), which is (0,C1)(0,C_{1}). So, after the time ss, the optimal wealth process X(;t,x,π1(),π2())X(\cdot;t,x,\pi^{1}(\cdot),\pi^{2}(\cdot)) will remain on the curve {F(u),suT}\{F(u),s\leq u\leq T\} and never overtakes the upper barrier.

Now, assume that the wealth amount be equal to S(s)S(s) at a time s[t,T]s\in[t,T], Xs=S(s)X_{s}=S(s). Then by applying the strategy π=(0,C2)\pi=(0,C_{2}), we have dXu=dS(u)dX_{u}=dS(u), for usu\geq s. So, this strategy keeps the wealth process on the curve {S(u),suT}\{S(u),s\leq u\leq T\}. We prove that this is the only admissible strategy at the point (s,S(s))(s,S(s)).

For any admissible investment strategy π1()\pi^{1}(\cdot), the process

Yu=X(u;s,S(s),π1(),C2),us,Y_{u}=X(u;s,S(s),\pi^{1}(\cdot),C_{2}),\quad u\geq s,

has the following dynamics under the risk-neutral measure, the measure that makes the expected return of the risky asset equal to the interest rate rr,

dYu=(rYuC2)du+πu1YudWu,dY_{u}=(rY_{u}-C_{2})du+\pi^{1}_{u}Y_{u}dW_{u},

in which Wu=Bu+μrσuW_{u}=B_{u}+\frac{\mu-r}{\sigma}u stands for the Brownian motion under this measure.

Therefor, letting Zu=YuS(u)Z_{u}=Y_{u}-S(u), we have Zs=0Z_{s}=0 and for s<u<Ts<u<T

dZu=rZudu+πu1(Zu+Su)dWu,dZ_{u}=rZ_{u}du+\pi^{1}_{u}(Z_{u}+S_{u})dW_{u},

which indicates that the discounted value of ZuZ_{u}, Z~u=eruZu\tilde{Z}_{u}=e^{-ru}Z_{u}, is a martingale under the risk-neutral measure. So, we have 𝔼~(Z~T)=𝔼~(Z~s)=0\tilde{\mathbb{E}}(\tilde{Z}_{T})=\tilde{\mathbb{E}}(\tilde{Z}_{s})=0, in which 𝔼~\tilde{\mathbb{E}} denotes the expectation operator under this measure. This equality indicates

𝔼~X(T;s,S(s),π1(),C2)=S(T)=S.\displaystyle\tilde{\mathbb{E}}X(T;s,S(s),\pi^{1}(\cdot),C_{2})=S(T)=S. (3.9)

On the other hand, the minimum guarantee constraint imposes the following inequality under the physical and equivalently under the risk-neutral measure

X(T;s,S(s),π1(),C2)S,a.s.\displaystyle X(T;s,S(s),\pi^{1}(\cdot),C_{2})\geq S,\quad a.s. (3.10)

Furthermore, any investment strategy other than the null strategy yields a non-constant random terminal wealth. So, due to (3.9) and (3.10), when π2C2\pi^{2}\equiv C_{2}, the only admissible investment strategy is π10\pi^{1}\equiv 0.

Moreover, the higher consumption rate the less terminal wealth amount. This means that the only admissible strategy, and therefore optimal strategy, is π=(0,C2)\pi=(0,C_{2}). ∎

Remark 3.2.

If X0>F(0)X_{0}>F(0), it is clear from the first part of the above argument that the strategy π=(0,C1)\pi=(0,C_{1}) yields the desired consumption and a terminal wealth that is greater than the target FF. However, due to the second term of the loss function (3.1), the loss function increases when the terminal wealth increases in the region [F,+)[F,+\infty). Therefore, in the case X0>F(0)X_{0}>F(0), this loss function, and consequently the corresponding optimal control problem, is not meaningful.

If X0<S(0)X_{0}<S(0), we conclude from the second part of the argument that the constraint on the terminal wealth does not hold for any admissible strategy. So, the problem does not have a solution.

Corollary 3.3.

The absorbing property of the lower border indicates the following equivalent representation for the admissible strategies

Π~ad(t,x):={\displaystyle\tilde{\Pi}_{ad}(t,x):=\{ π12(Ω×[t,T];[0,L]),π22(Ω×[t,T];[C1,C2])|π1,π2are\displaystyle\pi^{1}\in\mathcal{L}^{2}(\Omega\times[t,T];[0,L]),\pi^{2}\in\mathcal{L}^{2}(\Omega\times[t,T];[C_{1},C_{2}])|\pi^{1},\pi^{2}\;are
𝔽tprog.meas.,X(s;t,x,π1(),π2())S(s),tsTa.s.}.\displaystyle\mathbb{F}^{t}-prog.\;meas.,X(s;t,x,\pi^{1}(\cdot),\pi^{2}(\cdot))\geq S(s),\;t\leq s\leq T\;a.s.\}. (3.11)

4 The HJB Equation

Due to the definition of the value function V~\tilde{V}, (3.6), the Bellman principle yields the following HJB equation on the domain 𝒞:={(t,x)|t[0,T],S(t)xF(t)}\mathcal{C}:=\{(t,x)|t\in[0,T],S(t)\leq x\leq F(t)\}; see [25, Chapter 11],

inf(π1,π2)[0,L]×[C2,C1]{V~(t,x)t\displaystyle\inf_{(\pi^{1},\pi^{2})\in[0,L]\times[C_{2},C_{1}]}\{\frac{\partial\tilde{V}(t,x)}{\partial t} +𝒜~V~(t,x)+κηt(C1π2)2}=0,\displaystyle+\tilde{\mathcal{A}}\tilde{V}(t,x)+\kappa\eta_{t}(C_{1}-\pi^{2})^{2}\}=0, (4.1)

where 𝒜~\tilde{\mathcal{A}} is the generator of the diffusion process XX, given in (3.2),

𝒜~={(π1[μr]+r)xπ2}x+12σ2(π1)2x22x2.\displaystyle\tilde{\mathcal{A}}=\{(\pi^{1}[\mu-r]+r)x-\pi^{2}\}\frac{\partial}{\partial x}+\frac{1}{2}\sigma^{2}(\pi^{1})^{2}x^{2}\frac{\partial^{2}}{\partial x^{2}}.

Additionally, the definition of V~\tilde{V} and the absorbing property of the upper and lower borders imply the following boundary conditions

(i)V~(T,x)=ηT(Fxaa1)2,x[S,F],\displaystyle(i)\tilde{V}(T,x)=\eta_{T}\left(\frac{F-x}{a_{a_{1}}}\right)^{2},\qquad\qquad\qquad\qquad\qquad\qquad\quad x\in[S,F],
(ii)V~(t,S(t))=ηT(FSaa1)2+κtTηs𝑑s(C1C2)2,t[0,T],\displaystyle(ii)\tilde{V}(t,S(t))=\eta_{T}\left(\frac{F-S}{a_{a_{1}}}\right)^{2}+\kappa\int^{T}_{t}\eta_{s}ds\left(C_{1}-C_{2}\right)^{2},\quad t\in[0,T], (4.2)
(iii)V~(t,F(t))=0,t[0,T].\displaystyle(iii)\tilde{V}(t,F(t))=0,\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\quad t\in[0,T].

4.1 Reducing the domain to a rectangle

The domain of Equation (4.1) is irregular, the upper and the lower borders are curved. Since we are going to apply the finite difference method as part of our algorithm, we apply a change of variable that converts the domain to a rectangle. To this end, we define the diffeomorphism :𝒞𝒞\mathcal{L}:\mathcal{C}\rightarrow\mathcal{C}^{\prime}, where 𝒞:={(t,z)|t[0,T],SzF}\mathcal{C}^{\prime}:=\{(t,z)|t\in[0,T],S\leq z\leq F\} and

(t,x)(t,z)\displaystyle(t,x)\rightarrow(t,z) =(t,x)=(t,1(t,x))\displaystyle=\mathcal{L}(t,x)=(t,\mathcal{L}_{1}(t,x))
:=(t,xer(Tt)+[C1+(C2C1)F(t)xF(t)S(t)]1er(Tt)r).\displaystyle:=\left(t,xe^{r(T-t)}+\left[C_{1}+(C_{2}-C_{1})\frac{F(t)-x}{F(t)-S(t)}\right]\frac{1-e^{r(T-t)}}{r}\right). (4.3)

Notice that for all 0tT0\leq t\leq T

1(t,S(t))=S,1(t,F(t))=F.\displaystyle\mathcal{L}_{1}(t,S(t))=S,\qquad\qquad\mathcal{L}_{1}(t,F(t))=F. (4.4)

Then, we define the diffusion process ZZ as

Zt:=1(t,Xt)=XtG(t)+H(t),t[0,T],Z_{t}:=\mathcal{L}_{1}(t,X_{t})=X_{t}G(t)+H(t),\qquad\qquad t\in[0,T],

in which

G(t)=er(Tt)+C1C2F(t)S(t)1er(Tt)r=FSF(t)S(t),\displaystyle G(t)=e^{r(T-t)}+\frac{C_{1}-C_{2}}{F(t)-S(t)}\frac{1-e^{r(T-t)}}{r}=\frac{F-S}{F(t)-S(t)}, (4.5)
H(t)=(C1+F(t)(C2C1)F(t)S(t))1er(Tt)r=C2FC1SF(t)S(t)er(Tt)1r.\displaystyle H(t)=\left(C_{1}+\frac{F(t)(C_{2}-C_{1})}{F(t)-S(t)}\right)\frac{1-e^{r(T-t)}}{r}=\frac{C_{2}F-C_{1}S}{F(t)-S(t)}\frac{e^{-r(T-t)}-1}{r}.
Proposition 4.1.

The process ZZ satisfies the following dynamics

dZt=\displaystyle dZ_{t}= Xt[(FS)(C2C1+r(FS))er(Tt)(F(t)S(t))2]dt\displaystyle X_{t}\left[-\frac{(F-S)(C_{2}-C_{1}+r(F-S))e^{-r(T-t)}}{(F(t)-S(t))^{2}}\right]dt
+G(t){[(πt1(μr)+r)Xtπt2]dt+σπt1XtdWt}\displaystyle+G(t)\left\{[(\pi^{1}_{t}(\mu-r)+r)X_{t}-\pi^{2}_{t}]dt+\sigma\pi^{1}_{t}X_{t}dW_{t}\right\}
+[C2FC1SF(t)S(t)]er(Tt)dt\displaystyle+\left[\frac{C_{2}F-C_{1}S}{F(t)-S(t)}\right]e^{-r(T-t)}dt
+(C2FC1S)(C2C1+r(FS))(F(t)S(t))2er(Tt)e2r(Tt)rdt.\displaystyle+\frac{(C_{2}F-C_{1}S)(C_{2}-C_{1}+r(F-S))}{(F(t)-S(t))^{2}}\frac{e^{-r(T-t)}-e^{-2r(T-t)}}{r}dt. (4.6)
Proof.

Due to the product rule dZt=XtdG(t)+G(t)dXt+dH(t)dZ_{t}=X_{t}dG(t)+G(t)dX_{t}+dH(t) and regarding the dynamics of XX, (2.1), we get directly the above dynamics. ∎

Notice that by a few manipulations we get

Xt=r[F(t)S(t)]Zt[C2FC1S](er(Tt)1)r(FS).X_{t}=\frac{r[F(t)-S(t)]Z_{t}-[C_{2}F-C_{1}S](e^{-r(T-t)}-1)}{r(F-S)}.

Therefore, defining the function

K(t,z):=\displaystyle K(t,z):= [F(t)S(t)]FSz[C2FC1S](er(Tt)1)r(FS)\displaystyle\frac{[F(t)-S(t)]}{F-S}z-\frac{[C_{2}F-C_{1}S](e^{-r(T-t)}-1)}{r(F-S)}
=\displaystyle= K1(t)zK2(t),\displaystyle K_{1}(t)z-K_{2}(t), (4.7)

we can rewrite the dynamics (4.6) as

dZt=\displaystyle dZ_{t}= {K(t,Zt)G(t)+G(t)[(πt1(μr)+r)K(t,Zt)πt2]+H(t)}dt\displaystyle\left\{K(t,Z_{t})G^{\prime}(t)+G(t)[(\pi^{1}_{t}(\mu-r)+r)K(t,Z_{t})-\pi^{2}_{t}]+H^{\prime}(t)\right\}dt
+G(t)σπt1K(t,Zt)dWt.\displaystyle+G(t)\sigma\pi^{1}_{t}K(t,Z_{t})dW_{t}. (4.8)
Proposition 4.2.

For each (t,z)𝒞(t,z)\in\mathcal{C}^{\prime}, the set of admissible strategies analogous to (3.11), can be written as

Πad(t,z)={\displaystyle\Pi_{ad}(t,z)=\{ π12(Ω×[t,T];[0,L]),π22(Ω×[t,T];[C2,C1])|π1,π2are\displaystyle\pi^{1}\in\mathcal{L}^{2}(\Omega\times[t,T];[0,L]),\pi^{2}\in\mathcal{L}^{2}(\Omega\times[t,T];[C_{2},C_{1}])|\pi^{1},\pi^{2}\;are
𝔽tprog.meas.,Z(s;t,z,π1(),π2())S,tsTa.s.},\displaystyle\mathbb{F}^{t}-prog.meas.,Z(s;t,z,\pi^{1}(\cdot),\pi^{2}(\cdot))\geq S,\;t\leq s\leq T\;a.s.\},

where we define

Z(s;t,z,π1(),π2()):=1(s,X(s;t,x,π1(),π2())),tsT,Z(s;t,z,\pi^{1}(\cdot),\pi^{2}(\cdot)):=\mathcal{L}_{1}(s,X(s;t,x,\pi^{1}(\cdot),\pi^{2}(\cdot))),\quad t\leq s\leq T,

in which x=11(t,)(z)x=\mathcal{L}^{-1}_{1}(t,\cdot)(z).

Proof.

This representation is obtained due to relations in (4.4) and the absorbing property of upper and lower borders of the domain 𝒞\mathcal{C}. ∎

We should emphasize that there is the equality Πad(t,z)=Π~ad(t,x)\Pi_{ad}(t,z)=\tilde{\Pi}_{ad}(t,x).

Because of the above change of variable, we should reformulate our stochastic optimal control problem on the new domain using the process ZZ. Analogous to (3.5), we define for any (t,z)𝒞(t,z)\in\mathcal{C}^{\prime} the following functional on Πad(t,z)\Pi_{ad}(t,z),

J(t,z,π1(),π2()):=𝔼z[\displaystyle J(t,z,\pi^{1}(\cdot),\pi^{2}(\cdot)):=\mathbb{E}^{z}[ κtTηs(C1πs2)2𝑑s\displaystyle\kappa\int^{T}_{t}\eta_{s}(C_{1}-\pi^{2}_{s})^{2}ds
+ηT(FZ(T;t,z,π1(),π2())aa1)2].\displaystyle+\eta_{T}\left(\frac{F-Z(T;t,z,\pi^{1}(\cdot),\pi^{2}(\cdot))}{a_{a_{1}}}\right)^{2}]. (4.9)

Then, defining the value function VV as

V(t,z):=infπ1(),π2()Πad(t,z)J(t,z;π1(),π2()),(t,z)𝒞,\displaystyle V(t,z):=\inf_{\pi^{1}(\cdot),\pi^{2}(\cdot)\in\Pi_{ad}(t,z)}J(t,z;\pi^{1}(\cdot),\pi^{2}(\cdot)),\qquad(t,z)\in\mathcal{C}^{\prime}, (4.10)

we get the following HJB equation on the domain 𝒞\mathcal{C}^{\prime}

inf(π1,π2)[0,L]×[C2,C1]{V(t,z)t+𝒜V(t,z)+κηt(C1π2)2}=0,\displaystyle\inf_{(\pi^{1},\pi^{2})\in[0,L]\times[C_{2},C_{1}]}\left\{\frac{\partial V(t,z)}{\partial t}+\mathcal{A}V(t,z)+\kappa\eta_{t}(C_{1}-\pi^{2})^{2}\right\}=0, (4.11)

where 𝒜\mathcal{A}, the generator of the diffusion process ZZ, is written, due to the dynamics (4.8), as

𝒜=\displaystyle\mathcal{A}= α(t,z)z+β(t,z)2z2:=\displaystyle\alpha(t,z)\frac{\partial}{\partial z}+\beta(t,z)\frac{\partial^{2}}{\partial z^{2}}:=
{K(t,z)G(t)+G(t)[(π1(μr)+r)K(t,z)π2]+H(t)}z\displaystyle\left\{K(t,z)G^{\prime}(t)+G(t)[(\pi^{1}(\mu-r)+r)K(t,z)-\pi^{2}]+H^{\prime}(t)\right\}\frac{\partial}{\partial z}
+12G2(t)σ2(π1)2K2(t,z)2z2.\displaystyle+\frac{1}{2}G^{2}(t)\sigma^{2}(\pi^{1})^{2}K^{2}(t,z)\frac{\partial^{2}}{\partial z^{2}}. (4.12)

Since before the annuitization the loss function just depends on the consumptions, the boundary conditions (4.2) are concluded similarly for VV. Moreover, in characterizing the dual transformation, the Neumann boundary condition at z=Fz=F, that is proved in Prop. 4.3, is needed. So, the boundary conditions of VV are written as

(i)V(T,z)=ηT(Fzaa1)2,z[S,F],\displaystyle(i)\;V(T,z)=\eta_{T}\left(\frac{F-z}{a_{a_{1}}}\right)^{2},\qquad\qquad\qquad\qquad\qquad\qquad\quad\;\;\;z\in[S,F],
(ii)V(t,S)=ηT(FSaa1)2+κ(C1C2)2tTηs𝑑s,t[0,T],\displaystyle(ii)\;V(t,S)=\eta_{T}\left(\frac{F-S}{a_{a_{1}}}\right)^{2}+\kappa(C_{1}-C_{2})^{2}\int^{T}_{t}\eta_{s}ds,\quad\qquad\;t\in[0,T],
(iii)Vz(t,F)=0,t[0,T],\displaystyle(iii)\;V_{z}(t,F)=0,\qquad\qquad\qquad\qquad\qquad\qquad\qquad\quad\qquad\qquad t\in[0,T], (4.13)
(iii)V(t,F)=0,t[0,T].\displaystyle(iii^{\prime})V(t,F)=0,\qquad\qquad\qquad\qquad\qquad\qquad\qquad\quad\qquad\qquad t\in[0,T].
Proposition 4.3.

For any t[0,T]t\in[0,T], the value function has left derivative at z=Fz=F and it is equal to zero, Vz(t,F)=0V_{z}(t,F)=0.

Proof.

Let z(S,F)z\in(S,F) be the initial state. To find the result of applying the strategy π=(π1,π2)=(0,C3)\pi=(\pi^{1},\pi^{2})=(0,C_{3}), in which C3=C1+(C2C1)FzFSC_{3}=C_{1}+(C_{2}-C_{1})\frac{F-z}{F-S}, to the wealth process (4.8), we apply this strategy to (3.2) and then use the transformation 1\mathcal{L}_{1}. Let x=[1(t,)]1(z)x=[\mathcal{L}_{1}(t,\cdot)]^{-1}(z) be the corresponding initial state in the domain 𝒞\mathcal{C}. Denoting Xs=X(s;t,x,π1(),π2())X_{s}=X(s;t,x,\pi^{1}(\cdot),\pi^{2}(\cdot)), we have Xt=xX_{t}=x and

Xs=C3r+(zC3r)er(Ts),st.X_{s}=\frac{C_{3}}{r}+(z-\frac{C_{3}}{r})e^{-r(T-s)},\quad s\geq t.

Like to the equality (8.2), we can conclude that the process Zs=1(s,,Xs)Z_{s}=\mathcal{L}_{1}(s,,X_{s}) does not move from the level zz, and in particular ZT=zZ_{T}=z.

Now, the inequality V(t,z)J(t,z,0,C3)V(t,z)\leq J(t,z,0,C_{3}) and the relation V(t,F)=0V(t,F)=0 imply that

κ(C1C3)2tTηs𝑑s+ηT(Fzaa1)2FzV(t,F)V(t,z)Fz0.\displaystyle-\frac{\kappa(C_{1}-C_{3})^{2}\int^{T}_{t}\eta_{s}ds+\eta_{T}(\frac{F-z}{a_{a_{1}}})^{2}}{F-z}\leq\frac{V(t,F)-V(t,z)}{F-z}\leq 0.

Regarding the definition of C3C_{3}, the left hand side is rewritten as

(Fz)[κ(C2C1FS)2tTηs𝑑s+ηTaa12],(F-z)\left[\kappa(\frac{C_{2}-C_{1}}{F-S})^{2}\int^{T}_{t}\eta_{s}ds+\frac{\eta_{T}}{a^{2}_{a_{1}}}\right],

which tends to zero, as zFz\uparrow F, and concludes the claim. ∎

In the following proposition, we state that the strategies that are optimal in the new optimal control problem, that has been formulated after the change of variable, are optimal in the original problem too.

Proposition 4.4.

At any point (t,z)𝒞(t,z)\in\mathcal{C}^{\prime}, the strategies that yield the minimum value of the functional JJ, (4.9), are the optimal strategies for the problem (3.6) at the point (t,x)=(t,11(t,)(z))𝒞(t,x)=(t,\mathcal{L}^{-1}_{1}(t,\cdot)(z))\in\mathcal{C}.

Proof.

Notice that at the terminal time TT there is the equality

Z(T;t,z,π1(),π2())=X(T;t,x,π1(),π2()).Z(T;t,z,\pi^{1}(\cdot),\pi^{2}(\cdot))=X(T;t,x,\pi^{1}(\cdot),\pi^{2}(\cdot)).

So, we have the equality

J(t,z,π1(),π2())=J~(t,x,π1(),π2()),J(t,z,\pi^{1}(\cdot),\pi^{2}(\cdot))=\tilde{J}(t,x,\pi^{1}(\cdot),\pi^{2}(\cdot)),

which trivially concludes the proposition. ∎

By assuming some smoothness property, the strict decreasing property of the value function is proved in Subsection 4.3. However, the decreasing property is obtained directly from the definitions.

Proposition 4.5.

For any t[0,T]t\in[0,T] and Sz1<z2FS\leq z_{1}<z_{2}\leq F, we have

V(t,z1)V(t,z2).V(t,z_{1})\geq V(t,z_{2}).
Proof.

For any admissible strategy (π1,π2)Πad(t,z1)(\pi^{1},\pi^{2})\in\Pi_{ad}(t,z_{1}), we have

X(T;t,z1,π1(),π2())X(T;t,z2,π1(),π2()).\displaystyle X(T;t,z_{1},\pi^{1}(\cdot),\pi^{2}(\cdot))\leq X(T;t,z_{2},\pi^{1}(\cdot),\pi^{2}(\cdot)). (4.14)

Notice that two processes X(;t,z1,π1(),π2())X(\cdot;t,z_{1},\pi^{1}(\cdot),\pi^{2}(\cdot)) and X(;t,z2,π1(),π2())X(\cdot;t,z_{2},\pi^{1}(\cdot),\pi^{2}(\cdot)) are continuous. So, if they are equal for some s(t,T]s\in(t,T] then they are equal from that ss on. This observation implies that, by applying the strategy (π1,π2)(\pi^{1},\pi^{2}) at the point (t,z2)(t,z_{2}) the wealth process will remain upper than, or be equal to, the safety level SS which means that this strategy belongs to Πad(t,z2)\Pi_{ad}(t,z_{2}). Now, due to the definition of the loss function (3.3), the inequality (4.14) indicates a similar inequality for the corresponding action functional JJ, which concludes the claim. ∎

4.2 Viscosity solution

To prove that the value function (4.10) satisfy the HJB Equation (4.11) in the viscosity sense, the continuity of the value function is required. To this end, we prove at first it is convex as a function of the space variable.

Proposition 4.6.

For any t[0,T]t\in[0,T], the function [S,F]+[S,F]\rightarrow\mathbb{R}^{+}, xV(t,x)x\rightarrow V(t,x) is strictly convex.

Proof.

The definitions of objective functionals J,J~J,\tilde{J} imply the following equality for any 0tT0\leq t\leq T and S(t)xF(t)S(t)\leq x\leq F(t),

V~(t,x)=V(t,1(t,x)).\displaystyle\tilde{V}(t,x)=V(t,\mathcal{L}_{1}(t,x)). (4.15)

Moreover, the definition (4.3) shows that for a fixed 0tT0\leq t\leq T, the function x1(t,x)x\rightarrow\mathcal{L}_{1}(t,x) is linear. So, the convexity of the functions xV~(t,x)x\rightarrow\tilde{V}(t,x) and xV(t,x)x\rightarrow V(t,x) are equivalent. Here, we prove the convexity of the former function.

For δ>0\delta>0, let π1,δ,x(),π2,δ,x()\pi^{1,\delta,x}(\cdot),\pi^{2,\delta,x}(\cdot) and π1,δ,y(),π2,δ,y()\pi^{1,\delta,y}(\cdot),\pi^{2,\delta,y}(\cdot) be δ\delta-optimal controls corresponding to the points (t,x)(t,x) and (t,y)(t,y), respectively, or

J~(t,x;π1,δ,x(),π2,δ,x())V~(t,x)+δ,J~(t,y;π1,δ,y(),π2,δ,y())V~(t,y)+δ.\tilde{J}(t,x;\pi^{1,\delta,x}(\cdot),\pi^{2,\delta,x}(\cdot))\leq\tilde{V}(t,x)+\delta,\qquad\tilde{J}(t,y;\pi^{1,\delta,y}(\cdot),\pi^{2,\delta,y}(\cdot))\leq\tilde{V}(t,y)+\delta.

Set Xsδ:=X(s;t,x,π1,δ,x(),π2,δ,x())X^{\delta}_{s}:=X(s;t,x,\pi^{1,\delta,x}(\cdot),\pi^{2,\delta,x}(\cdot)) and Ysδ:=X(s;t,y,π1,δ,y(),π2,δ,y())Y^{\delta}_{s}:=X(s;t,y,\pi^{1,\delta,y}(\cdot),\pi^{2,\delta,y}(\cdot)), for tsTt\leq s\leq T. Then, for a fixed γ[0,1]\gamma\in[0,1], setting Zsδ:=γXsδ+(1γ)YsδZ^{\delta}_{s}:=\gamma X^{\delta}_{s}+(1-\gamma)Y^{\delta}_{s} and πs2,δ,z:=γπs2,δ,x+(1γ)πs2,δ,y\pi^{2,\delta,z}_{s}:=\gamma\pi^{2,\delta,x}_{s}+(1-\gamma)\pi^{2,\delta,y}_{s}, we get

γV~(\displaystyle\gamma\tilde{V}( t,x)+(1γ)V~(t,y)+δ\displaystyle t,x)+(1-\gamma)\tilde{V}(t,y)+\delta
γJ~(t,x;π1,δ,x(),π2,δ,x())+(1γ)J~(t,y;π1,δ,y(),π2,δ,y())\displaystyle\geq\gamma\tilde{J}(t,x;\pi^{1,\delta,x}(\cdot),\pi^{2,\delta,x}(\cdot))+(1-\gamma)\tilde{J}(t,y;\pi^{1,\delta,y}(\cdot),\pi^{2,\delta,y}(\cdot))
=γ𝔼[κtTηs(C1πs2,δ,x)2𝑑s+ηT(FXTδaa1)2]\displaystyle=\gamma\mathbb{E}[\kappa\int^{T}_{t}\eta_{s}(C_{1}-\pi^{2,\delta,x}_{s})^{2}ds+\eta_{T}\left(\frac{F-X^{\delta}_{T}}{a_{a_{1}}}\right)^{2}]
+(1γ)𝔼[κtTηs(C1πs2,δ,y)2𝑑s+ηT(FYTδaa1)2]\displaystyle\quad+(1-\gamma)\mathbb{E}[\kappa\int^{T}_{t}\eta_{s}(C_{1}-\pi^{2,\delta,y}_{s})^{2}ds+\eta_{T}\left(\frac{F-Y^{\delta}_{T}}{a_{a_{1}}}\right)^{2}]
𝔼[κtTηs(C1πs2,δ,z)2𝑑s+ηT(FZTδaa1)2],\displaystyle\geq\mathbb{E}[\kappa\int^{T}_{t}\eta_{s}(C_{1}-\pi^{2,\delta,z}_{s})^{2}ds+\eta_{T}\left(\frac{F-Z^{\delta}_{T}}{a_{a_{1}}}\right)^{2}], (4.16)

where the last inequality is obtained from the convexity of x(C1x)2x\rightarrow(C_{1}-x)^{2} and x(Fxaa1)2x\rightarrow(\frac{F-x}{a_{a_{1}}})^{2}. Moreover, setting the control

πs1,δ,z:=1Zsδ(γπs1,δ,xXsδ+(1γ)πs1,δ,yYsδ),tsT,\pi^{1,\delta,z}_{s}:=\frac{1}{Z^{\delta}_{s}}\left(\gamma\pi^{1,\delta,x}_{s}X^{\delta}_{s}+(1-\gamma)\pi^{1,\delta,y}_{s}Y^{\delta}_{s}\right),\qquad t\leq s\leq T,

we conclude from the dynamics (3.2) that

Zsδ=X(s;t,γx+(1γ)y,π1,δ,z(),π2,δ,z()).Z^{\delta}_{s}=X(s;t,\gamma x+(1-\gamma)y,\pi^{1,\delta,z}(\cdot),\pi^{2,\delta,z}(\cdot)).

Hence, the definition of the value function V~\tilde{V} indicates

V~(t,γx+(1γ)y)𝔼[κtTηs(C1πs2,δ,z)2𝑑s+ηT(FZTδaa1)2].\displaystyle\tilde{V}(t,\gamma x+(1-\gamma)y)\leq\mathbb{E}[\kappa\int^{T}_{t}\eta_{s}(C_{1}-\pi^{2,\delta,z}_{s})^{2}ds+\eta_{T}\left(\frac{F-Z^{\delta}_{T}}{a_{a_{1}}}\right)^{2}]. (4.17)

Since δ\delta is arbitrary in (4.16), we conclude from (4.16) and (4.17) the following inequality which indicates the convexity of xV~(t,x)x\rightarrow\tilde{V}(t,x)

V~(t,γx+(1γ)y)γV~(t,x)+(1γ)V~(t,y).\displaystyle\tilde{V}(t,\gamma x+(1-\gamma)y)\leq\gamma\tilde{V}(t,x)+(1-\gamma)\tilde{V}(t,y). (4.18)

Now, by contradiction suppose that V~\tilde{V} is not strictly convex which means that (4.18) must be an equality. Hence, as δ0\delta\rightarrow 0, the last inequality in (4.16) turns into an equality. Then, from the strict convexity of x(C1x)2x\rightarrow(C_{1}-x)^{2} and x(Fxaa1)2x\rightarrow(\frac{F-x}{a_{a_{1}}})^{2}, we conclude that π2,δ,x()π2,δ,y()a.s.\pi^{2,\delta,x}(\cdot)\rightarrow\pi^{2,\delta,y}(\cdot)\;a.s. and XTδYTδa.s.X^{\delta}_{T}\rightarrow Y^{\delta}_{T}\;a.s., as δ0\delta\rightarrow 0. Since the control variables π2,δ,x,π2,δ,y\pi^{2,\delta,x},\pi^{2,\delta,y} are chosen from the bounded interval [C2,C1][C_{2},C_{1}], using the dominated convergence theorem and the dynamics (3.2), we conclude that, under the risk-neutral measure, the drift coefficient of the following Itô process is rMsrM_{s},

Ms:=limδ0(XsδYsδ),st.\displaystyle M_{s}:=\lim_{\delta\rightarrow 0}(X^{\delta}_{s}-Y^{\delta}_{s}),\qquad s\geq t.

So, the discounted process M~s=er(st)Ms,st,\tilde{M}_{s}=e^{-r(s-t)}M_{s},s\geq t, is a martingale under the risk-neutral measure with initial value M~t=xy0\tilde{M}_{t}=x-y\neq 0. Therefore, under any equivalent measure ~\tilde{\mathbb{P}} we have ~{MT=0}<1\tilde{\mathbb{P}}\{M_{T}=0\}<1 or equivalently

~{limδ0XTδlimδ0YTδ}>0,\tilde{\mathbb{P}}\{\lim_{\delta\rightarrow 0}X^{\delta}_{T}\neq\lim_{\delta\rightarrow 0}Y^{\delta}_{T}\}>0,

which is a contradiction. ∎

Proposition 4.7.

The value function VV is continuous on the domain [0,T]×[S,F][0,T]\times[S,F].

Proof.

the proof is in the Appendix. ∎

The continuity of the value function supports the following theorem.

Theorem 4.8.

The value function VV is the unique viscosity solution of the HJB Equation (4.11) with boundary conditions (4.13)-(i)-(ii)-(iii) or (4.13)-(i)-(ii)-(iii)(iii^{\prime}).

Proof.

It can be checked easily that the coefficients of differential operator 𝒜\mathcal{A}, and the running cost term κηt(C1π2)2\kappa\eta_{t}(C_{1}-\pi^{2})^{2} have continuous partial derivatives with respect to the variables tt and zz, for any fixed control variable. Furthermore, the values of control variables π1,π2\pi^{1},\pi^{2} lie in the compact intervals. So, due to the continuity of value function, [13, V. Theorem 3.1] together with [13, V. Corollary 3.1] imply that VV is the viscosity solution of (4.11). Moreover, [13, V, Corollary 8.1] indicates the uniqueness of viscosity solution of (4.11) with boundary conditions (4.13)-(i)-(ii)-(iii)(iii^{\prime}).

In addition, since the Dirichlet-type and Neumann-type boundary conditions in (4.13)-(i)-(ii)-(iii) are on unconnected parts of the boundary, on z=Sz=S and z=Fz=F, respectively, we can apply [2, Theorem 3.1] on z=Fz=F, to conclude the comparison principle and then the uniqueness of viscosity solution. ∎

4.3 Classical solution and the dual equation

To apply the verification theorem, we must show that V𝒞1,2([0,T)×(S,F);)V\in\mathcal{C}^{1,2}([0,T)\times(S,F);\mathbb{R}). Our argument is similar to the one that has been employed in [9, 11]. In fact, the dual equation of the HJB Equation (4.11) is considered.

Let the viscosity solution of Equation (4.11), VV, belongs to 𝒞1,2([0,T),(S,F);)\mathcal{C}^{1,2}([0,T),(S,F);\mathbb{R}) which implies that VV is a classical solution too. Then, due to strict convexity of V, which indicates

Vzz(t,z)>0,(t,z)[0,T]×(S,F),\displaystyle V_{zz}(t,z)>0,\qquad\quad\qquad\qquad(t,z)\in[0,T]\times(S,F), (4.19)

the investment control variable, that yields the infimum in (4.11), is given by

π1,=(μr)Vz(t,z)G(t)σ2K(t,z)Vzz(t,z).\displaystyle\pi^{1,*}=\frac{-(\mu-r)V_{z}(t,z)}{G(t)\sigma^{2}K(t,z)V_{zz}(t,z)}. (4.20)

Moreover, since the consumption control variable must be in the interval [C2,C1][C_{2},C_{1}], we get

π2,=(G(t)Vz(t,z)2κηt+C1)C2.\displaystyle\pi^{2,*}=(\frac{G(t)V_{z}(t,z)}{2\kappa\eta_{t}}+C_{1})\vee C_{2}. (4.21)

The formula (4.21), divides the domain into two parts:

A={(t,z)[0,T)×(S,F):G(t)Vz(t,z)2κηt+C1>C2},\displaystyle A=\{(t,z)\in[0,T)\times(S,F):\frac{G(t)V_{z}(t,z)}{2\kappa\eta_{t}}+C_{1}>C_{2}\}, (4.22)
B={(t,z)[0,T)×(S,F):G(t)Vz(t,z)2κηt+C1C2}.\displaystyle B=\{(t,z)\in[0,T)\times(S,F):\frac{G(t)V_{z}(t,z)}{2\kappa\eta_{t}}+C_{1}\leq C_{2}\}. (4.23)

Now, inserting the control variables (4.20) and (4.21) into (4.11), V satisfies the following equations in classical sense

Vt(t,z)\displaystyle V_{t}(t,z) +(H(t)+K(t,z)G(t)+rK(t,z)G(t)C1G(t))Vz(t,z)\displaystyle+\left(H^{\prime}(t)+K(t,z)G^{\prime}(t)+rK(t,z)G(t)-C_{1}G(t)\right)V_{z}(t,z)
12β2Vz2(t,z)Vzz(t,z)14G2(t)Vz2(t,z)κηt=0,(t,z)A,\displaystyle-\frac{1}{2}\frac{\beta^{2}V^{2}_{z}(t,z)}{V_{zz}(t,z)}-\frac{1}{4}\frac{G^{2}(t)V^{2}_{z}(t,z)}{\kappa\eta_{t}}=0,\qquad\quad(t,z)\in A, (4.24)
Vt(t,z)\displaystyle V_{t}(t,z) +(H(t)+K(t,z)G(t)+rK(t,z)G(t)C2G(t))Vz(t,z)\displaystyle+\left(H^{\prime}(t)+K(t,z)G^{\prime}(t)+rK(t,z)G(t)-C_{2}G(t)\right)V_{z}(t,z)
12β2Vz2(t,z)Vzz(t,z)+κηt(C1C2)2=0,(t,z)B.\displaystyle-\frac{1}{2}\frac{\beta^{2}V^{2}_{z}(t,z)}{V_{zz}(t,z)}+\kappa\eta_{t}(C_{1}-C_{2})^{2}=0,\qquad\quad(t,z)\in B. (4.25)

It should be noted that the coefficients of the above two equations are the same on the common boundary of AA and BB.

The relation (4.19) together with (4.13)-(iii), imply

Vz(t,z)<0,(t,z)[0,T]×(S,F).\displaystyle V_{z}(t,z)<0,\qquad\quad\qquad\qquad(t,z)\in[0,T]\times(S,F). (4.26)

In addition, we assume

limzSVz(t,z)=,t[0,T).\displaystyle\lim_{z\downarrow S}V_{z}(t,z)=-\infty,\qquad\qquad\qquad t\in[0,T). (4.27)

Intuitively, this assumption means that the marginal loss when the wealth process approaches the safety level is very large, which seems to be reasonable.

Now, we are ready to define the dual transformation. The relations (4.19), (4.26), (4.13)-(iii) and (4.27) indicate that, for every (t,y)[0,T)×[0,+)(t,y)\in[0,T)\times[0,+\infty), there is a unique minimizer g(t,y)(S,F]g(t,y)\in(S,F] of the function [S,F]+,zV(t,z)+zy[S,F]\rightarrow\mathbb{R}^{+},\;z\rightarrow V(t,z)+zy. Moreover, it is characterized by the equation

Vz(t,g(t,y))=y,(t,y)[0,T)×[0,+).\displaystyle V_{z}(t,g(t,y))=-y,\qquad\qquad(t,y)\in[0,T)\times[0,+\infty). (4.28)

This characterization together with (4.27) imply

(i)g(t,y)(S,F),(ii)limy+g(t,y)=S,(t,y)[0,T)×(0,+).\displaystyle(i)g(t,y)\in(S,F),\qquad(ii)\lim_{y\rightarrow+\infty}g(t,y)=S,\quad(t,y)\in[0,T)\times(0,+\infty). (4.29)

Furthermore, from (4.19) and (4.28), we conclude that gg is differentiable in the space variable and

gy(t,y)<0,(t,y)[0,T)×(0,+).\displaystyle g_{y}(t,y)<0,\qquad\qquad\quad\qquad(t,y)\in[0,T)\times(0,+\infty). (4.30)

Analogous to the subsets AA and BB, defined in (4.22) and (4.23), the domain [0,T)×(0,+)[0,T)\times(0,+\infty) is divided to two subsets:

A={(t,y)[0,T)×(0,+):y<2κηtG(t)(C1C2)},\displaystyle A^{\prime}=\{(t,y)\in[0,T)\times(0,+\infty):y<\frac{2\kappa\eta_{t}}{G(t)}(C_{1}-C_{2})\},
B={(t,y)[0,T)×(0,+):y2κηtG(t)(C1C2)}.\displaystyle B^{\prime}=\{(t,y)\in[0,T)\times(0,+\infty):y\geq\frac{2\kappa\eta_{t}}{G(t)}(C_{1}-C_{2})\}.

In the next proposition, we derive from the fully nonlinear Equations (4.24)-(4.25) with the boundary conditions (4.13)-(i)-(ii)-(iii) the following equations for the function gg on AA^{\prime} and BB^{\prime} with the boundary conditions (4.33).

gt(t,y)\displaystyle g_{t}(t,y) +(β2K1(t)G(t)rK1(t)G(t))ygy(t,y)+12β2y2gyy\displaystyle+(\beta^{2}-K_{1}(t)G^{\prime}(t)-rK_{1}(t)G(t))yg_{y}(t,y)+\frac{1}{2}\beta^{2}y^{2}g_{yy}
H(t)K(t,g(t,y))G(t)rK(t,g(t,y))G(t)\displaystyle-H^{\prime}(t)-K(t,g(t,y))G^{\prime}(t)-rK(t,g(t,y))G(t)
+C1G(t)12G2(t)κηty=0,(t,y)A,\displaystyle+C_{1}G(t)-\frac{1}{2}\frac{G^{2}(t)}{\kappa\eta_{t}}y=0,\qquad\qquad\qquad\qquad\qquad(t,y)\in A^{\prime}, (4.31)
gt(t,y)\displaystyle g_{t}(t,y) +(β2K1(t)G(t)rK1(t)G(t))ygy(t,y)+12β2y2gyy\displaystyle+(\beta^{2}-K_{1}(t)G^{\prime}(t)-rK_{1}(t)G(t))yg_{y}(t,y)+\frac{1}{2}\beta^{2}y^{2}g_{yy}
H(t)K(t,g(t,y))G(t)rK(t,g(t,y))G(t)\displaystyle-H^{\prime}(t)-K(t,g(t,y))G^{\prime}(t)-rK(t,g(t,y))G(t)
+C2G(t)=0,(t,y)B.\displaystyle+C_{2}G(t)=0,\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad(t,y)\in B^{\prime}. (4.32)

It should be noted that the coefficients of the above two equations are the same on the common boundary of AA^{\prime} and BB^{\prime}. More specifically, we have the equality C2G=C1G12G2(t)κηtyC_{2}G=C_{1}G-\frac{1}{2}\frac{G^{2}(t)}{\kappa\eta_{t}}y on this boundary.

Considering the condition (4.13)-(iii), and finding the minimizer of the function zV(T,z)+zyz\rightarrow V(T,z)+zy explicitly, we get the following boundary conditions:

{(i)g(t,0)=F,t[0,T],(ii)g(T,y)=(Faa12y2ηT)S,y[0,+).\displaystyle\begin{cases}(i)\;g(t,0)=F,\qquad\qquad\qquad\qquad\qquad\quad t\in[0,T],\\ (ii)\;g(T,y)=(F-\frac{a^{2}_{a_{1}}y}{2\eta_{T}})\vee S,\qquad\qquad y\in[0,+\infty).\end{cases} (4.33)

Here we prove a proposition similar to [9, Prop. 4.18].

Proposition 4.9.

Suppose that the unique viscosity solution VV of (4.11) with the boundary conditions (4.13)-(i)-(ii)-(iii) belongs to the class 𝒞1,3([0,T)×(S,F);)\mathcal{C}^{1,3}([0,T)\times(S,F);\mathbb{R}) and satisfies (4.27). Let gg be defined as above. Then gg is a classical solution of (4.31)-(4.32) with the boundary conditions (4.33). Moreover, gg satisfies (4.29) and (4.30).

Conversely, let g𝒞([0,T]×[0,+);)𝒞1,2([0,T)×(0,+);)g\in\mathcal{C}([0,T]\times[0,+\infty);\mathbb{R})\cap\mathcal{C}^{1,2}([0,T)\times(0,+\infty);\mathbb{R}) be a classical solution of (4.31)-(4.32)-(4.33) and satisfies (4.29) and (4.30). Furthermore, suppose

limy+y2gy(t,y)=0,uniformlyint[0,T),\displaystyle\lim_{y\rightarrow+\infty}y^{2}g_{y}(t,y)=0,\;uniformly\;in\;t\in[0,T), (4.34)
[g(t,)]1isintegrableatS+,t[0,T),\displaystyle[g(t,\cdot)]^{-1}is\;integrable\;at\;S^{+},\quad\qquad t\in[0,T), (4.35)

and

{(i)h(t,z):=tTκηs(C1C2)2𝑑s+ηT(FSaa1)2Sz[g(t,)]1(ζ)𝑑ζ,(t,z)[0,T)×[S,F],(ii)h(T,z):=ηT(Fzaa1)2,z[S,F].\displaystyle\begin{cases}(i)\;h(t,z):=\int^{T}_{t}\kappa\eta_{s}(C_{1}-C_{2})^{2}ds+\eta_{T}(\frac{F-S}{a_{a_{1}}})^{2}\\ \hskip 54.06006pt-\int^{z}_{S}[g(t,\cdot)]^{-1}(\zeta)d\zeta,\qquad\qquad\quad(t,z)\in[0,T)\times[S,F],\\ (ii)\;h(T,z):=\eta_{T}(\frac{F-z}{a_{a_{1}}})^{2},\quad\qquad\qquad\qquad\qquad\qquad z\in[S,F].\end{cases} (4.36)

Then h𝒞([0,T]×[S,F])𝒞0,1([0,T)×(S,F];)𝒞1,3([0,T)×(S,F);)h\in\mathcal{C}([0,T]\times[S,F])\cap\mathcal{C}^{0,1}([0,T)\times(S,F];\mathbb{R})\cap\mathcal{C}^{1,3}([0,T)\times(S,F);\mathbb{R}). Furthermore, It is a classical solution of (4.11) with boundary conditions (4.13)-(i)-(ii)-(iii) and satisfies (4.27).

Proof.

Let the unique viscosity solution VV of (4.11) with boundary conditions (4.13)-(i)-(ii)-(iii) belongs to the class 𝒞1,3([0,T)×(S,F);)\mathcal{C}^{1,3}([0,T)\times(S,F);\mathbb{R}). Then, VV satisfies Equations (4.24)-(4.25) in classical sense. Deriving these equations with respect to zz yield the following equations on AA and BB, respectively.

Vtz\displaystyle V_{tz} +(K1G+rGK1)Vzβ222VzVzz2Vz2VzzzVzz2\displaystyle+(K_{1}G^{\prime}+rGK_{1})V_{z}-\frac{\beta^{2}}{2}\frac{2V_{z}V^{2}_{zz}-V^{2}_{z}V_{zzz}}{V^{2}_{zz}}
+(KG+rGK+HC1G)VzzG2VzVzz2κηt=0,\displaystyle+(KG^{\prime}+rGK+H^{\prime}-C_{1}G)V_{zz}-\frac{G^{2}V_{z}V_{zz}}{2\kappa\eta_{t}}=0, (4.37)
Vtz\displaystyle V_{tz} +(K1G+rGK1)Vzβ222VzVzz2Vz2VzzzVzz2\displaystyle+(K_{1}G^{\prime}+rGK_{1})V_{z}-\frac{\beta^{2}}{2}\frac{2V_{z}V^{2}_{zz}-V^{2}_{z}V_{zzz}}{V^{2}_{zz}}
+(KG+rGK+HC2G)Vzz=0.\displaystyle+(KG^{\prime}+rGK+H^{\prime}-C_{2}G)V_{zz}=0. (4.38)

To shorten the above two formulas, the variables tt and zz, inside the parentheses, have been eliminated.

Since V𝒞1,3([0,T)×(S,F);)V\in\mathcal{C}^{1,3}([0,T)\times(S,F);\mathbb{R}), we have g𝒞1,2([0,T)×(0,+);)g\in\mathcal{C}^{1,2}([0,T)\times(0,+\infty);\mathbb{R}). Now, deriving Equation (4.28) w. r. t. tt, w. r. t. zz and twice w. r. t. zz, we get

Vtz(t,g(t,y))+Vzz(t,g(t,y))gt(t,y)=0,\displaystyle V_{tz}(t,g(t,y))+V_{zz}(t,g(t,y))g_{t}(t,y)=0, (4.39)
Vzz(t,g(t,y))gy(t,y)=1,\displaystyle V_{zz}(t,g(t,y))g_{y}(t,y)=-1, (4.40)
Vzzz(t,g(t,y))gy2(t,y)+Vzz(t,g(t,y))gyy(t,y)=0.\displaystyle V_{zzz}(t,g(t,y))g^{2}_{y}(t,y)+V_{zz}(t,g(t,y))g_{yy}(t,y)=0. (4.41)

Due to the above relations, we obtain Equations (4.31)-(4.32) from Equations (4.37)-(4.38). Furthermore, the boundary conditions (4.33) and the properties (4.29) and (4.30) have been demonstrated in the construction of the function gg.

Conversely, let gg be given as in the statement. The relations (4.33), (4.29) and (4.30) clearly indicate that the function [g(t,)]1[g(t,\cdot)]^{-1} is well-defined on (S,F](S,F] for every t[0,T)t\in[0,T). Furthermore, the definition (4.36) indicates that

h𝒞([0,T]×[S,F])𝒞0,1([0,T)×(S,F];)𝒞1,3([0,T)×(S,F);).h\in\mathcal{C}([0,T]\times[S,F])\cap\mathcal{C}^{0,1}([0,T)\times(S,F];\mathbb{R})\cap\mathcal{C}^{1,3}([0,T)\times(S,F);\mathbb{R}).

Moreover, the conditions (4.33) and definitions (4.36) imply (4.13)-(i)-(ii)-(iii). Deriving the equation (4.36)-(i) with respect to zz and regarding (4.29)-(ii), (4.27) is obtained with hh in place of VV.

Now, determining hzh_{z} using the definition (4.36)-(i), we see that hh satisfies (4.28) with hh in place of VV. Then, the argument of the first part shows that hh satisfies (4.39)-(4.40)-(4.41) with hh in place of VV. Setting z=g(t,y)z=g(t,y), applying backward the argument of the first part of the proof and due to the relation g(t,(0,+))=(S,F)g(t,(0,+\infty))=(S,F), we conclude that hh satisfies Equations (4.37)-(4.38) with hh in place of VV. Hence, by integrating these equations with respect to zz, we get

ht(t,z)\displaystyle h_{t}(t,z) +(H(t)+K(t,z)G(t)+rK(t,z)G(t)C1G(t))hz(t,z)\displaystyle+\left(H^{\prime}(t)+K(t,z)G^{\prime}(t)+rK(t,z)G(t)-C_{1}G(t)\right)h_{z}(t,z)
12β2hz2(t,z)hzz(t,z)14G2(t)hz2(t,z)κηt=C(t),(t,z)A,\displaystyle-\frac{1}{2}\frac{\beta^{2}h^{2}_{z}(t,z)}{h_{zz}(t,z)}-\frac{1}{4}\frac{G^{2}(t)h^{2}_{z}(t,z)}{\kappa\eta_{t}}=C(t),\qquad\quad(t,z)\in A, (4.42)
ht(t,z)\displaystyle h_{t}(t,z) +(H(t)+K(t,z)G(t)+rK(t,z)G(t)C2G(t))hz(t,z)\displaystyle+\left(H^{\prime}(t)+K(t,z)G^{\prime}(t)+rK(t,z)G(t)-C_{2}G(t)\right)h_{z}(t,z)
12β2hz2(t,z)hzz(t,z)+κηt(C1C2)2=C(t),(t,z)B.\displaystyle-\frac{1}{2}\frac{\beta^{2}h^{2}_{z}(t,z)}{h_{zz}(t,z)}+\kappa\eta_{t}(C_{1}-C_{2})^{2}=C^{\prime}(t),\qquad\quad(t,z)\in B. (4.43)

To conclude the claim, we must show that C(t)=C(t)=0C(t)=C^{\prime}(t)=0, for any t[0,T)t\in[0,T). Notice that the common boundary of the subsets AA and BB is a curve from t=0t=0 to t=Tt=T. Since h𝒞1,3([0,T)×(S,F);)h\in\mathcal{C}^{1,3}([0,T)\times(S,F);\mathbb{R}), considering the above two equations on this curve, we conclude that CCC\equiv C^{\prime}. So, it is enough to prove C0C^{\prime}\equiv 0.

Since CC^{\prime} is a time variable function, for any z0(S,F)z_{0}\in(S,F), we can write

C(t)=\displaystyle C^{\prime}(t)= ht(t,z0)12β2hz2(t,z0)hzz(t,z0)+κηt(C1C2)2\displaystyle h_{t}(t,z_{0})-\frac{1}{2}\frac{\beta^{2}h^{2}_{z}(t,z_{0})}{h_{zz}(t,z_{0})}+\kappa\eta_{t}(C_{1}-C_{2})^{2}
+(H(t)+K(t,z0)G(t)+rK(t,z0)G(t)C2G(t))hz(t,z0).\displaystyle+\left(H^{\prime}(t)+K(t,z_{0})G^{\prime}(t)+rK(t,z_{0})G(t)-C_{2}G(t)\right)h_{z}(t,z_{0}).

Integrating both sides of the above equation over the time interval [t,T][t,T] and regarding the definitions (4.36), we have

tTC(s)𝑑s=\displaystyle\int^{T}_{t}C^{\prime}(s)ds= h(T,z0)h(t,z0)β22tThz2(s,z0)hzz(s,z0)𝑑s+tTκηs(C1C2)2𝑑s\displaystyle h(T,z_{0})-h(t,z_{0})-\frac{\beta^{2}}{2}\int^{T}_{t}\frac{h^{2}_{z}(s,z_{0})}{h_{zz}(s,z_{0})}ds+\int^{T}_{t}\kappa\eta_{s}(C_{1}-C_{2})^{2}ds
+tTD(s,z0)hz(s,z0)𝑑s\displaystyle+\int^{T}_{t}D(s,z_{0})h_{z}(s,z_{0})ds
=\displaystyle= ηT(Fz0aa12)2ηT(FSaa12)2+Sz0[g(t,)]1(ζ)𝑑ζ\displaystyle\eta_{T}(\frac{F-z_{0}}{a^{2}_{a_{1}}})^{2}-\eta_{T}(\frac{F-S}{a^{2}_{a_{1}}})^{2}+\int^{z_{0}}_{S}[g(t,\cdot)]^{-1}(\zeta)d\zeta
β22tThz2(s,z0)hzz(s,z0)𝑑s+tTD(s,z0)hz(s,z0)𝑑s,\displaystyle-\frac{\beta^{2}}{2}\int^{T}_{t}\frac{h^{2}_{z}(s,z_{0})}{h_{zz}(s,z_{0})}ds+\int^{T}_{t}D(s,z_{0})h_{z}(s,z_{0})ds, (4.44)

in which

D(s,z)=H(s)+K(s,z)G(s)+rK(s,z)G(s)C2G(s).\displaystyle D(s,z)=H^{\prime}(s)+K(s,z)G^{\prime}(s)+rK(s,z)G(s)-C_{2}G(s). (4.45)

Then, taking z0Sz_{0}\downarrow S in the above formula, we get

tTC(s)𝑑s=limz0Sβ22tThz2(s,z0)hzz(s,z0)𝑑s+limz0StTD(s,z0)hz(s,z0)𝑑s.\displaystyle\int^{T}_{t}C^{\prime}(s)ds=\lim_{z_{0}\downarrow S}-\frac{\beta^{2}}{2}\int^{T}_{t}\frac{h^{2}_{z}(s,z_{0})}{h_{zz}(s,z_{0})}ds+\lim_{z_{0}\downarrow S}\int^{T}_{t}D(s,z_{0})h_{z}(s,z_{0})ds. (4.46)

For a given (s,y0)[0,T)×(0,+)(s,y_{0})\in[0,T)\times(0,+\infty), set z0(s)=g(s,y0)z_{0}(s)=g(s,y_{0}). Due to (4.40) and (4.28), we have

hz2(s,z0(s))hzz(s,z0(s))=y02gy(s,y0).\frac{h^{2}_{z}(s,z_{0}(s))}{h_{zz}(s,z_{0}(s))}=-y^{2}_{0}g_{y}(s,y_{0}).

The relation (4.29)-(ii) implies that z0(s)Sz_{0}(s)\downarrow S, when y0+y_{0}\rightarrow+\infty. Moreover, this convergence is uniform with respect to s[0,T)s\in[0,T); (see [9, Prop. 4.18]). So, we obtain the following convergence due to the condition (4.34)

limz0StThz2(s,z0)hzz(s,z0)𝑑s=limy0+tTy02gy(s,y0)𝑑s=0.\displaystyle\lim_{z_{0}\downarrow S}\int^{T}_{t}\frac{h^{2}_{z}(s,z_{0})}{h_{zz}(s,z_{0})}ds=-\lim_{y_{0}\rightarrow+\infty}\int^{T}_{t}y^{2}_{0}g_{y}(s,y_{0})ds=0. (4.47)

Now, taking zz equal to SS in (4.45) and regarding the formulas in (4.5), (4.6) and (4.7), we get

D(s,S)=\displaystyle D(s,S)= C2FC1SF(s)S(s)er(Ts)\displaystyle\frac{C_{2}F-C_{1}S}{F(s)-S(s)}e^{-r(T-s)}
+(C2FC1S)(C2C1+r(FS))(F(s)S(s))2er(Ts)e2r(Ts)r\displaystyle+\frac{(C_{2}F-C_{1}S)(C_{2}-C_{1}+r(F-S))}{(F(s)-S(s))^{2}}\frac{e^{-r(T-s)}-e^{-2r(T-s)}}{r}
S(C2C1+r(FS))er(Ts)F(s)S(s)\displaystyle-\frac{S(C_{2}-C_{1}+r(F-S))e^{-r(T-s)}}{F(s)-S(s)}
+[C2FC1S](er(Ts)1)r(FS)(FS)(C2C1+r(FS))er(Ts)(F(s)S(s))2\displaystyle+\frac{[C_{2}F-C_{1}S](e^{-r(T-s)}-1)}{r(F-S)}\frac{(F-S)(C_{2}-C_{1}+r(F-S))e^{-r(T-s)}}{(F(s)-S(s))^{2}}
+r[F(s)S(s)]S(FS)FSF(s)S(s)\displaystyle+\frac{r[F(s)-S(s)]S}{(F-S)}\frac{F-S}{F(s)-S(s)}
[C2FC1S]er(Ts)(FS)FSF(s)S(s)\displaystyle-\frac{[C_{2}F-C_{1}S]e^{-r(T-s)}}{(F-S)}\frac{F-S}{F(s)-S(s)}
+[C2FC1S]FSFSF(s)S(s)\displaystyle+\frac{[C_{2}F-C_{1}S]}{F-S}\frac{F-S}{F(s)-S(s)}
C2(FS)F(s)S(s):=e1+e2+e3+e4+e5+e6+e7+e8,\displaystyle-\frac{C_{2}(F-S)}{F(s)-S(s)}:=e_{1}+e_{2}+e_{3}+e_{4}+e_{5}+e_{6}+e_{7}+e_{8},

where H(s)=e1+e2H^{\prime}(s)=e_{1}+e_{2}, K(s,S)G(s)=e3+e4K(s,S)G^{\prime}(s)=e_{3}+e_{4} and rK(s,S)G(s)=e5+e6+e7rK(s,S)G(s)=e_{5}+e_{6}+e_{7}. Clearly e1+e6=0e_{1}+e_{6}=0 and e2+e4=0e_{2}+e_{4}=0. Moreover, regarding the identity

F(s)S(s)=C1C2r+1r(C2C1+r(FS))er(Ts),\displaystyle F(s)-S(s)=\frac{C_{1}-C_{2}}{r}+\frac{1}{r}(C_{2}-C_{1}+r(F-S))e^{-r(T-s)}, (4.48)

we get e3=rS+S(C1C2)F(s)S(s)e_{3}=-rS+\frac{S(C_{1}-C_{2})}{F(s)-S(s)}, which yields

e3+e5+e7+e8=rS+S(C1C2)F(s)S(s)+rS+C2FC1SF(s)S(s)C2(FS)F(s)S(s)=0.e_{3}+e_{5}+e_{7}+e_{8}=-rS+\frac{S(C_{1}-C_{2})}{F(s)-S(s)}+rS+\frac{C_{2}F-C_{1}S}{F(s)-S(s)}-\frac{C_{2}(F-S)}{F(s)-S(s)}=0.

Therefore, we have the identity D(,S)0D(\cdot,S)\equiv 0. Since the function DD is continuous, this identity implies

limzSD(s,z)=0,s[0,T].\displaystyle\lim_{z\downarrow S}D(s,z)=0,\qquad\qquad s\in[0,T].

Furthermore, regarding the definition of the function KK, (4.7), DD is a linear function of the variable zz. Hence, due to the boundedness of the functions in (4.45), the above convergence is uniform with respect to ss on the compact interval [0,T][0,T].

On the other hand, the integrability of [g(s,)]1[g(s,\cdot)]^{-1} at S+S^{+}, Condition (4.35), indicates that the convergence limzShz(s,z)=\lim_{z\downarrow S}h_{z}(s,z)=-\infty is sufficiently slow that its multiplication by a linear function of zz, the integrand D(s,z0)hz(s,z0)D(s,z_{0})h_{z}(s,z_{0}), tends to zero, uniformly over [0,T][0,T], as z0Sz_{0}\downarrow S. This implies the convergence

limz0StTD(s,z0)hz(s,z0)𝑑s=0.\displaystyle\lim_{z_{0}\downarrow S}\int^{T}_{t}D(s,z_{0})h_{z}(s,z_{0})ds=0. (4.49)

Since CC^{\prime} is a continuous function and t[0,T]t\in[0,T] is arbitrary in (4.46), the convergences (4.47) and (4.49) yield the identity C0C^{\prime}\equiv 0. ∎

Corollary 4.10.

Since the function hh given in the above proposition is a classical solution of (4.11) with conditions (4.13)-(i)-(ii)-(iii), it is as a viscosity solution too. Therefore, due to the uniqueness of viscosity solution, it is equal to the value function VV.

Due to the above proposition, to prove the required regularity of the value function VV, we must prove that there is a function gg that satisfies the assumptions of such proposition. This is the claim of the following theorem.

Theorem 4.11.

There exists a unique g𝒞([0,T]×[0,+);)𝒞1,2([0,T)×(0,+);)g\in\mathcal{C}([0,T]\times[0,+\infty);\mathbb{R})\cap\mathcal{C}^{1,2}([0,T)\times(0,+\infty);\mathbb{R}) that is the classical solution of (4.31)-(4.32)-(4.33) and satisfies (4.29), (4.30), (4.34) and (4.35).

Proof.

We write the Equations (4.31)-(4.32) in a unified form which is a linear parabolic equation over the domain [0,T)×(0,+)[0,T)\times(0,+\infty),

gt(t,y)+μ(t,y)gy(t,y)+12β2y2gyy(t,y)q(t)g(t,y)+f(t,y)=0,\displaystyle g_{t}(t,y)+\mu(t,y)g_{y}(t,y)+\frac{1}{2}\beta^{2}y^{2}g_{yy}(t,y)-q(t)g(t,y)+f(t,y)=0, (4.50)

with the boundary conditions

{(i)g(t,0)=F,t[0,T],(ii)g(T,y)=Ψ(y),y[0,+),\displaystyle\begin{cases}(i)\;g(t,0)=F,\qquad\qquad\qquad\qquad\quad\qquad t\in[0,T],\\ (ii)\;g(T,y)=\Psi(y),\qquad\qquad\qquad\qquad\quad y\in[0,+\infty),\end{cases} (4.51)

in which

μ(t,y)=(β2K1(t)G(t)rK1(t)G(t))y,\mu(t,y)=(\beta^{2}-K_{1}(t)G^{\prime}(t)-rK_{1}(t)G(t))y,
q(t)=K1(t)G(t)+rK1(t)G(t),q(t)=K_{1}(t)G^{\prime}(t)+rK_{1}(t)G(t),
f(t,y)=H(t)+(G(t)+rG(t))K2(t)+G(t)(C1G(t)2κηty)C2,f(t,y)=-H^{\prime}(t)+(G^{\prime}(t)+rG(t))K_{2}(t)+G(t)(C_{1}-\frac{G(t)}{2\kappa\eta_{t}}y)\vee C_{2},
Ψ(y)=(Faa12y2ηT)S.\Psi(y)=(F-\frac{a^{2}_{a_{1}}y}{2\eta_{T}})\vee S.

Existence of viscosity solution. At first we show that the above equation has a unique bounded viscosity solution.

In the first step, we find the bounded sub and supersolution of Equation (4.50). See [7, Section 2], for the definition of these types of solutions.

Since the coefficients μ/y\mu/y, qq and ff are bounded functions, by taking their upper and lower bounds, we establish sub and supersolution.

The explicit formula (4.56) indicates that qq is a nonnegative function which means that β2y\beta^{2}y is an upper bound for μ\mu. Moreover, the explicit formula (4.57) and the fact that fff_{\infty}\leq f, which is clear from their definitions, indicate that ff is nonnegative. So, setting q1=sup0tTq(t)q_{1}=\sup_{0\leq t\leq T}q(t), the solution of the following equation with conditions (4.51) can be regarded as a subsolution to the Eq. (4.50)

g¯t(t,y)+β2yg¯y(t,y)+12β2y2g¯yy(t,y)q1g¯(t,y)=0,(t,y)[0,T)×(0,+).\displaystyle\bar{g}_{t}(t,y)+\beta^{2}y\bar{g}_{y}(t,y)+\frac{1}{2}\beta^{2}y^{2}\bar{g}_{yy}(t,y)-q_{1}\bar{g}(t,y)=0,\quad(t,y)\in[0,T)\times(0,+\infty).

This equation with the given conditions has a classical solution which can be represented as

g¯(t,y)=aa122ηTe(β2q1)(Tt)p¯put(t,y)+S[S,F],\bar{g}(t,y)=\frac{a^{2}_{a_{1}}}{2\eta_{T}}e^{(\beta^{2}-q_{1})(T-t)}\bar{p}_{put}(t,y)+S\in[S,F],

where p¯put\bar{p}_{put} is the price of a European put option with strike price 2ηTaa12(FS)\frac{2\eta_{T}}{a^{2}_{a_{1}}}(F-S) in a Black-Scholes market where the volatility of the risky asset equals to β\beta and the spot rate of riskless asset is β2\beta^{2}.

To find a supersolution, setting q2=β2q1q_{2}=\beta^{2}-q_{1}, we consider the following equation with the same conditions as in (4.51).

g^t(t,y)+q2yg^y(t,y)+12β2y2g^yy(t,y)+f0=0,(t,y)[0,T)×(0,+),\displaystyle\hat{g}_{t}(t,y)+q_{2}y\hat{g}_{y}(t,y)+\frac{1}{2}\beta^{2}y^{2}\hat{g}_{yy}(t,y)+f_{0}=0,\quad(t,y)\in[0,T)\times(0,+\infty),

in which f0=supt,yf(t,y)f_{0}=\sup_{t,y}f(t,y). The explicit solution of this equation can be written as

g^(t,y)=\displaystyle\hat{g}(t,y)= aa122ηTeq2(Tt)p^put(t,y)+S+f0(Tt)\displaystyle\frac{a^{2}_{a_{1}}}{2\eta_{T}}e^{q_{2}(T-t)}\hat{p}_{put}(t,y)+S+f_{0}(T-t)
=\displaystyle= (FS)Φ(k(t,y))aa122ηTeq2(Tt)yΦ(k(t,y)βTt)\displaystyle(F-S)\Phi(k(t,y))-\frac{a^{2}_{a_{1}}}{2\eta_{T}}e^{q_{2}(T-t)}y\Phi\left(k(t,y)-\beta\sqrt{T-t}\right)
+S+f0(Tt),\displaystyle+S+f_{0}(T-t), (4.52)

where

k(t,y)=log(aa12y2ηT(FS))β22(Tt)βTt,k(t,y)=\frac{-\log(\frac{a^{2}_{a_{1}}y}{2\eta_{T}(F-S)})-\frac{\beta^{2}}{2}(T-t)}{\beta\sqrt{T-t}},

and Φ\Phi is the cumulative distribution function of a standard normal random variable. Actually, p^put\hat{p}_{put} is the price of a European put option with strike price 2ηTaa12(FS)\frac{2\eta_{T}}{a^{2}_{a_{1}}}(F-S) in a Black-Scholes market where the volatility of the risky asset equals to β\beta, and the spot rate of riskless asset is q2q_{2}. Furthermore, we easily get g^(t,y)[S,F+f0T]\hat{g}(t,y)\in[S,F+f_{0}T].

Now, we use [7, Theorem 8.2], to conclude the comparison principle for the sub and supersolution in the class of bounded upper and lower semicontinuous functions. Although, in such a theorem a bounded domain is considered, the proof can be modified directly for the domain (0,+)(0,+\infty), as it is done in [13, Theorem 9.1] for the real line (,+)(-\infty,+\infty).

Due to the existence of bounded sub and supersolution and the comparison principle, and by applying Perron’s method [7, Theorem 4.1] we can conclude the existence of a unique bounded viscosity solution gviscg_{visc} such that g¯gviscg^\bar{g}\leq g_{visc}\leq\hat{g} over [0,T)×[0,+)[0,T)\times[0,+\infty).

𝒞1,2\mathcal{C}^{1,2} regularity of the solution. For any 0<a<b<+0<a<b<+\infty, consider the Equation (4.50) over [0,T)×(a,b)[0,T)\times(a,b) with the boundary conditions that are set by the viscosity solution

gt(t,y)+μ(t,y)gy(t,y)+12β2y2gyy(t,y)q(t)g(t,y)+f(t,y)=0,\displaystyle g_{t}(t,y)+\mu(t,y)g_{y}(t,y)+\frac{1}{2}\beta^{2}y^{2}g_{yy}(t,y)-q(t)g(t,y)+f(t,y)=0,
g(t,a)=gvisc(t,a),g(t,b)=gvisc(t,b),t[0,T),\displaystyle g(t,a)=g_{visc}(t,a),\qquad\quad g(t,b)=g_{visc}(t,b),\qquad\quad\;t\in[0,T),
g(T,y)=gvisc(T,y),y[0,+).\displaystyle g(T,y)=g_{visc}(T,y),\qquad\qquad\qquad\qquad\qquad\qquad\qquad y\in[0,+\infty).

It is clear that the above equation has a unique viscosity solution which must be equal to gviscg_{visc}. On the other hand, the equation on the given domain has uniformly parabolic property, which implies the existence of a classical solution that belongs to 𝒞1,2([0,T)×(a,b);)\mathcal{C}^{1,2}([0,T)\times(a,b);\mathbb{R}). Since the classical solution is also a viscosity solution, it must coincide with gviscg_{visc}.

Now, from the arbitrariness of aa and bb we conclude that gvisc𝒞1,2([0,T)×(0,+);)g_{visc}\in\mathcal{C}^{1,2}([0,T)\times(0,+\infty);\mathbb{R}). Therefore, gviscg_{visc} is the unique classical solution of (4.50) with boundary conditions (4.51).

Feynman-Kac formula and properties of solution. After proving that Equation (4.50) with conditions (4.51) has a classical solution, we can find, using the Feynman-Kac formula, a probabilistic representation of the solution, which reveals some of its properties

g(t,y)=𝔼[tTetsq(τ)𝑑τf(s,Ysy)𝑑s+etTq(τ)𝑑τΨ(T,YTy)],\displaystyle g(t,y)=\mathbb{E}^{\mathbb{Q}}[\int^{T}_{t}e^{-\int^{s}_{t}q(\tau)d\tau}f(s,Y^{y}_{s})ds+e^{-\int^{T}_{t}q(\tau)d\tau}\Psi(T,Y^{y}_{T})], (4.53)

where the expectation is under the probability measure \mathbb{Q} such that YyY^{y} is an Itô process driven by the equation

{dYsy=μ(s,Ysy)ds+12β2(Ysy)2dWs,s>t,Yty=y,\displaystyle\begin{cases}dY^{y}_{s}=\mu(s,Y^{y}_{s})ds+\frac{1}{2}\beta^{2}(Y^{y}_{s})^{2}dW^{\mathbb{Q}}_{s},\qquad\qquad s>t,\\ Y^{y}_{t}=y,\end{cases}

with WW^{\mathbb{Q}} being a Brownian motion under \mathbb{Q}.

Since ff and Ψ\Psi are decreasing functions of yy, we can conclude from the Feynman-Kac formula that gg is strictly decreasing with respect to yy; the relation (4.30). Hence, the relation (4.51)(i) implies gFg\leq F over [0,T]×[0,+)[0,T]\times[0,+\infty).

Now, we aim to determine the asymptotic behavior of g(t,y)g(t,y) when y+y\rightarrow+\infty. Considering the definition of ff, its restriction to the subset BB^{\prime} is a function of tt and is denoted by f(t)f_{\infty}(t)

f(t):=H(t)+(G(t)+rG(t))K2(t)+G(t)C2.\displaystyle f_{\infty}(t):=-H^{\prime}(t)+(G^{\prime}(t)+rG(t))K_{2}(t)+G(t)C_{2}. (4.54)

Moreover, it is clear that as y+y\rightarrow+\infty, {s[t,T]:(s,Ysy)A}0\mathbb{P}\{\exists s\in[t,T]:(s,Y^{y}_{s})\in A^{\prime}\}\rightarrow 0, for any 0tT0\leq t\leq T. On the other hand, we have limy+(Ψ(T,YTy)=S)=1\lim_{y\rightarrow+\infty}\mathbb{P}(\Psi(T,Y^{y}_{T})=S)=1. Therefore, as y+y\rightarrow+\infty, g(t,y)g(t,y) converges to the integral

g(t):=𝔼[tTetsq(τ)𝑑τf(s)𝑑s+etTq(τ)𝑑τS].\displaystyle g_{\infty}(t):=\mathbb{E}^{\mathbb{Q}}[\int^{T}_{t}e^{-\int^{s}_{t}q(\tau)d\tau}f_{\infty}(s)ds+e^{-\int^{T}_{t}q(\tau)d\tau}S]. (4.55)

To evaluate the above integral, the functions qq and ff_{\infty} must be written explicitly.

From the formula of qq, we get

q(t)=\displaystyle q(t)= K1(t)G(t)+rK1(t)G(t)\displaystyle K_{1}(t)G^{\prime}(t)+rK_{1}(t)G(t)
=\displaystyle= F(t)S(t)FS(FS)(C2C1+r(FS))er(Tt)(F(t)S(t))2+r\displaystyle\frac{F(t)-S(t)}{F-S}\frac{-(F-S)(C_{2}-C_{1}+r(F-S))e^{-r(T-t)}}{(F(t)-S(t))^{2}}+r
=\displaystyle= C1C2F(t)S(t),\displaystyle\frac{C_{1}-C_{2}}{F(t)-S(t)}, (4.56)

where, in the last equality the relation (4.48) is applied. Moreover, regarding the formula of ff_{\infty}, we have

f(t)=\displaystyle f_{\infty}(t)= C2FC1SF(t)S(t)er(Tt)\displaystyle-\frac{C_{2}F-C_{1}S}{F(t)-S(t)}e^{-r(T-t)}
(C2FC1S)(C2C1+r(FS))(F(t)S(t))2er(Tt)e2r(Tt)r\displaystyle-\frac{(C_{2}F-C_{1}S)(C_{2}-C_{1}+r(F-S))}{(F(t)-S(t))^{2}}\frac{e^{-r(T-t)}-e^{-2r(T-t)}}{r}
+(FS)(C2C1+r(FS))er(Tt)(F(t)S(t))2[C2FC1S](er(Tt)1)r(FS)\displaystyle+\frac{-(F-S)(C_{2}-C_{1}+r(F-S))e^{-r(T-t)}}{(F(t)-S(t))^{2}}\frac{[C_{2}F-C_{1}S](e^{-r(T-t)}-1)}{r(F-S)}
+rFSF(t)S(t)[C2FC1S](er(Tt)1)r(FS)\displaystyle+r\frac{F-S}{F(t)-S(t)}\frac{[C_{2}F-C_{1}S](e^{-r(T-t)}-1)}{r(F-S)}
+C2(FS)F(t)S(t)=b1+b2+b3+b4+b5.\displaystyle+\frac{C_{2}(F-S)}{F(t)-S(t)}=b_{1}+b_{2}+b_{3}+b_{4}+b_{5}.

Clearly b2+b3=0b_{2}+b_{3}=0. Therefore, a simple manipulation shows that

f(t)=S(C1C2)F(t)S(t)=Sq(t).\displaystyle f_{\infty}(t)=\frac{S(C_{1}-C_{2})}{F(t)-S(t)}=Sq(t). (4.57)

Hence, (4.55) turns into

g(t)=S𝔼[tTetsq(τ)𝑑τq(s)𝑑s+etTq(τ)𝑑τ].\displaystyle g_{\infty}(t)=S\mathbb{E}^{\mathbb{Q}}[\int^{T}_{t}e^{-\int^{s}_{t}q(\tau)d\tau}q(s)ds+e^{-\int^{T}_{t}q(\tau)d\tau}]. (4.58)

In view of the indefinite integral calculation etsq(τ)𝑑τq(s)𝑑s=etsq(τ)𝑑τ\int e^{-\int^{s}_{t}q(\tau)d\tau}q(s)ds=-e^{-\int^{s}_{t}q(\tau)d\tau}, we obtain

tTetsq(τ)𝑑τq(s)ds=etsq(τ)𝑑τ]Tt=etTq(τ)𝑑τ+1.\int^{T}_{t}e^{-\int^{s}_{t}q(\tau)d\tau}q(s)ds=-e^{-\int^{s}_{t}q(\tau)d\tau}]^{T}_{t}=-e^{-\int^{T}_{t}q(\tau)d\tau}+1.

Therefore, we get the following identity, which indicates (4.29)-(ii),

g(t)=S,t[0,T].\displaystyle g_{\infty}(t)=S,\qquad\qquad t\in[0,T].

Now, we take into account the actual difference g(t,y)Sg(t,y)-S, when yy is very large. Due to the above identity and (4.53), we can write

g(t,y)S=\displaystyle g(t,y)-S= 𝔼[tTetsq(τ)𝑑τ(f(s,Ysy)f(s))𝑑s]\displaystyle\mathbb{E}^{\mathbb{Q}}[\int^{T}_{t}e^{-\int^{s}_{t}q(\tau)d\tau}(f(s,Y^{y}_{s})-f_{\infty}(s))ds]
+𝔼[etTq(τ)𝑑τ(Ψ(T,YTy)S)]\displaystyle+\mathbb{E}^{\mathbb{Q}}[e^{-\int^{T}_{t}q(\tau)d\tau}(\Psi(T,Y^{y}_{T})-S)]
=\displaystyle= d1(t,y)+d2(t,y).\displaystyle d_{1}(t,y)+d_{2}(t,y).

Note that f(s,Ysy)=f(s)f(s,Y^{y}_{s})=f_{\infty}(s) for (s,Ysy)B(s,Y^{y}_{s})\in B^{\prime} and Ψ(T,YTy)=S\Psi(T,Y^{y}_{T})=S when YTy2ηTaa12(FS)Y^{y}_{T}\geq\frac{2\eta_{T}}{a^{2}_{a_{1}}}(F-S). So, by rough estimates of {s[t,T]:(s,Ysy)A}\mathbb{P}\{\exists s\in[t,T]:(s,Y^{y}_{s})\in A^{\prime}\}, which is an exit probability on a finite interval, and {YTy<2ηTaa12(FS)}\mathbb{P}\{Y^{y}_{T}<\frac{2\eta_{T}}{a^{2}_{a_{1}}}(F-S)\}, which is obtained from the log-normal distribution, we obtain the following upper bounds, for large enough yy,

d1(t,y)<c0y2,d2(t,y)<c1y2,d_{1}(t,y)<\frac{c_{0}}{y^{2}},\qquad d_{2}(t,y)<\frac{c_{1}}{y^{2}},

in which c0,c1c_{0},c_{1} are positive constants. Then, for a constant c2c_{2}, that is large enough and greater than c0+c1c_{0}+c_{1}, we get the estimate

|g(t,y)S|<c2y2,(t,y)[0,T)×(0,),\displaystyle\left|g(t,y)-S\right|<\frac{c_{2}}{y^{2}},\qquad\quad(t,y)\in[0,T)\times(0,\infty), (4.59)

which implies the integrability condition (4.35).

The coefficients of Equation (4.50) satisfy the conditions of [21, Theorems 3.1 and 6.1] which implies that our linear parabolic equation, with convex terminal condition Ψ\Psi, preserves the convexity. Therefore, gg is convex in the variable yy.

From the convexity of gg and (4.30), we achieve

y(g(t,2y)g(t,y))y2gy(t,2y)<0,(t,y)[0,T)×(0,+).\displaystyle y(g(t,2y)-g(t,y))\leq y^{2}g_{y}(t,2y)<0,\qquad(t,y)\in[0,T)\times(0,+\infty). (4.60)

The estimate (4.59) indicates g(t,y)g(t,2y)c2y2g(t,y)-g(t,2y)\leq\frac{c_{2}}{y^{2}} or g(t,2y)g(t,y)c2y2g(t,2y)-g(t,y)\geq-\frac{c_{2}}{y^{2}}. So, regarding (4.60), we have

8c2yy2gy(t,y)<0,-\frac{8c_{2}}{y}\leq y^{2}g_{y}(t,y)<0,

which yields the relation (4.34). ∎

4.4 Verification theorem

After proving that the value function is the only classical solution of the Equations (4.24)-(4.25), we can state the verification theorem which concerns the classical solution of (4.11) and gives a way of testing whether the given admissible strategies are optimal. To see a standard proof of the theorem, we refer the reader to [13, Ch. IV, Theorem 3.1].

Theorem 4.12.

(verification theorem)
(a) Let h𝒞([0,T]×[S,F])𝒞1,2([0,T)×(S,F);)h\in\mathcal{C}([0,T]\times[S,F])\cap\mathcal{C}^{1,2}([0,T)\times(S,F);\mathbb{R}) be a classical solution to (4.11) with boundary conditions (4.13)-(i)-(ii)-(iiiiii^{\prime}). Then, for any initial data (t,z)𝒞(t,z)\in\mathcal{C}^{\prime} and admissible strategy π=(π1(),π2())Πad(t,z)\pi=(\pi^{1}(\cdot),\pi^{2}(\cdot))\in\Pi_{ad}(t,z), we have

h(t,z)J(t,z;π1(),π2()).h(t,z)\leq J(t,z;\pi^{1}(\cdot),\pi^{2}(\cdot)).

(b) If there exists π=(π1,(),π2,())Πad(t,z)\pi^{*}=(\pi^{1,*}(\cdot),\pi^{2,*}(\cdot))\in\Pi_{ad}(t,z) such that for any sts\geq t

(πs1,,πs2,)argmin{𝒜h(s,Zs)+κηs(C1πs2,)2},\displaystyle(\pi^{1,*}_{s},\pi^{2,*}_{s})\in{arg}\;{min}\{\mathcal{A}h(s,Z^{*}_{s})+\kappa\eta_{s}(C_{1}-\pi^{2,*}_{s})^{2}\},

in which the operator 𝒜\mathcal{A} is defined in (4.12) and ZsZ^{*}_{s} is the solution to (3.2) corresponding to the strategy π\pi^{*}, with Zt=zZ^{*}_{t}=z, then π\pi^{*} is the optimal strategy and hh is equal to the value function,

h(t,z)=V(t,z)=J(t,z;π1,(),π2,()).h(t,z)=V(t,z)=J(t,z;\pi^{1,*}(\cdot),\pi^{2,*}(\cdot)).

At any point (t,z)[0,T)×[S,F](t,z)\in[0,T)\times[S,F], we establish the closed loop equation to prove the existence and uniqueness of the optimal strategies. At first, due to the formulas (4.20) and (4.21) and using (4.28) and (4.40), we get the following formulas for feedback maps in terms of the function gg

P1(t,z)={(μr)[g(t,)]1(z)gy(t,[g(t,)]1(z))G(t)σ2K(t,z),(t,z)[0,T)×(S,F),0,(t,z)[0,T)×{S,F},\displaystyle P^{1}(t,z)=\begin{cases}-\frac{(\mu-r)[g(t,\cdot)]^{-1}(z)g_{y}(t,[g(t,\cdot)]^{-1}(z))}{G(t)\sigma^{2}K(t,z)},&(t,z)\in[0,T)\times(S,F),\\ 0,&(t,z)\in[0,T)\times\{S,F\},\end{cases} (4.61)
P2(t,z)={(G(t)[g(t,)]1(z)2κηt+C1)C2,(t,z)[0,T)×(S,F),C1,(t,z)[0,T)×{F},C2,(t,z)[0,T)×{S}.\displaystyle P^{2}(t,z)=\begin{cases}(-\frac{G(t)[g(t,\cdot)]^{-1}(z)}{2\kappa\eta_{t}}+C_{1})\vee C_{2},&(t,z)\in[0,T)\times(S,F),\\ C_{1},&(t,z)\in[0,T)\times\{F\},\\ C_{2},&(t,z)\in[0,T)\times\{S\}.\end{cases} (4.62)

The relation (4.34) implies that P1P^{1} and P2P^{2} are bounded and continuous functions on [0,T)×[S,F][0,T)\times[S,F].

For some (t,z)[0,T)×(S,F)(t,z)\in[0,T)\times(S,F), let y=[g(t,)]1(z)y^{*}=[g(t,\cdot)]^{-1}(z) and suppose that Y(;t,y)Y^{*}(\cdot;t,y^{*}) is the solution to

{dYs=(K1(s)G(s)+rK1(s)G(s))YsdsβYsdWs,t<s<T,Yt=y.\displaystyle\begin{cases}dY^{*}_{s}=-(K_{1}(s)G^{\prime}(s)+rK_{1}(s)G(s))Y^{*}_{s}ds-\beta Y^{*}_{s}dW_{s},&\quad t<s<T,\\ Y^{*}_{t}=y^{*}.\end{cases} (4.63)

Now, consider the process

Z(s;t,z)=g(s,Y(s;t,y)),s[t,T].\displaystyle Z^{*}(s;t,z)=g(s,Y^{*}(s;t,y^{*})),\qquad\qquad s\in[t,T]. (4.64)

Due to the definition of YY^{*} and properties of gg, we have

Z(s;t,z)(S,F),s[t,T].\displaystyle Z^{*}(s;t,z)\in(S,F),\qquad\qquad\qquad\forall s\in[t,T]. (4.65)
Theorem 4.13.

(closed loop equation) For any (t,z)[0,T)×(S,F)(t,z)\in[0,T)\times(S,F), Z(;t,z)Z^{*}(\cdot;t,z) solves the following closed loop equation, associated with the feedback maps P1P^{1} and P2P^{2},

{dZs={K(s,Zs)G(s)+G(s)[(P1(s,Zs)(μr)+r)K(s,Zs)P2(s,Zs)]+H(s)}ds+G(s)σP1(s,Zs)K(s,Zs)dWs,t<s<T,Zt=z.\displaystyle\begin{cases}dZ^{*}_{s}=\{K(s,Z^{*}_{s})G^{\prime}(s)+G(s)[(P^{1}(s,Z^{*}_{s})(\mu-r)+r)K(s,Z^{*}_{s})-P^{2}(s,Z^{*}_{s})]\\ \qquad\quad\;+H^{\prime}(s)\}ds+G(s)\sigma P^{1}(s,Z^{*}_{s})K(s,Z^{*}_{s})dW_{s},\qquad\qquad t<s<T,\\ Z^{*}_{t}=z.\end{cases}
Proof.

Regarding the definition (4.64), the dynamics (4.63) and the Equations (4.31), (4.32), and applying Itô’s formula, the following dynamics is obtained

dZs=\displaystyle dZ^{*}_{s}= gs(s,Ys)ds(K1(s)G(s)+rK1(s)G(s))Ysgy(s,Ys)ds\displaystyle g_{s}(s,Y^{*}_{s})ds-(K_{1}(s)G^{\prime}(s)+rK_{1}(s)G(s))Y^{*}_{s}g_{y}(s,Y^{*}_{s})ds
+12β2(Ys)2gyy(s,Ys)dsβYsgy(s,Ys)dWs\displaystyle+\frac{1}{2}\beta^{2}(Y^{*}_{s})^{2}g_{yy}(s,Y^{*}_{s})ds-\beta Y^{*}_{s}g_{y}(s,Y^{*}_{s})dW_{s}
=\displaystyle= (β2Ysgy(s,Ys)+H(s)+K(s,Zs)G(s)+rK(s,Zs)G(s))ds\displaystyle(-\beta^{2}Y^{*}_{s}g_{y}(s,Y^{*}_{s})+H^{\prime}(s)+K(s,Z^{*}_{s})G^{\prime}(s)+rK(s,Z^{*}_{s})G(s))ds
G(s)((12G(s)κηsYs+C1)C2)dsβYsgy(s,Ys)dWs.\displaystyle-G(s)((-\frac{1}{2}\frac{G(s)}{\kappa\eta_{s}}Y^{*}_{s}+C_{1})\vee C_{2})ds-\beta Y^{*}_{s}g_{y}(s,Y^{*}_{s})dW_{s}.

The definition (4.64) indicates Ys=[g(s,)]1(Zs)Y^{*}_{s}=[g(s,\cdot)]^{-1}(Z^{*}_{s}). So, regarding the formulas (4.61), (4.62), a simple manipulation yields the closed loop equation.

Moreover, notice that the closed loop equation coincides with Equation (4.8), if the feedback maps be replaced by (π1,π2)(\pi^{1},\pi^{2}). ∎

Remark 4.14.

Due to formulas (4.61) and (4.62), the closed loop equation admits the solutions Z()FZ^{*}(\cdot)\equiv F and Z()SZ^{*}(\cdot)\equiv S corresponding to the initial values z=Fz=F and z=Sz=S, respectively.

Corollary 4.15.

The boundedness of the maps P1P^{1} and P2P^{2}, the relation (4.65) and the above remark imply that, for any (t,z)[0,T]×[S,F](t,z)\in[0,T]\times[S,F], the following strategy is admissible. Moreover, the optimality is concluded from the verification and the closed loop equation theorems.

πs1={P1(s,Z(s;t,z)),s[t,T),0,s=T,\displaystyle\pi^{1}_{s}=\begin{cases}P^{1}(s,Z^{*}(s;t,z)),&s\in[t,T),\\ 0,&s=T,\end{cases} (4.66)
πs2={P2(s,Z(s;t,z)),s[t,T),0,s=T.\displaystyle\pi^{2}_{s}=\begin{cases}P^{2}(s,Z^{*}(s;t,z)),&s\in[t,T),\\ 0,&s=T.\end{cases} (4.67)

The uniqueness of the optimal strategy is proved in the following.

Proposition 4.16.

At any point (t,z)[0,T]×[S,F](t,z)\in[0,T]\times[S,F], the given above strategy, π=(π1,π2)\pi=(\pi^{1},\pi^{2}), is the only optimal strategy.

Proof.

By contradiction let π=(π1,π2)\pi^{\prime}=(\pi^{1^{\prime}},\pi^{2^{\prime}}) be another optimal strategy at the point (t,z)(t,z). Similar to the argument used in Prop. (4.6), defining Xs:=12(Xs+Xs)X^{*}_{s}:=\frac{1}{2}(X_{s}+X^{\prime}_{s}), in which Xs=X(s;t,z,π1(),π2())X_{s}=X(s;t,z,\pi^{1}(\cdot),\pi^{2}(\cdot)), Xs=X(s;t,z,π1(),π2())X^{\prime}_{s}=X(s;t,z,\pi^{1^{\prime}}(\cdot),\pi^{2^{\prime}}(\cdot)), and

πs1,:=12Xs(πs1Xs+πs1Xs),\pi^{1,*}_{s}:=\frac{1}{2X^{*}_{s}}(\pi^{1}_{s}X_{s}+\pi^{1^{\prime}}_{s}X^{\prime}_{s}),
πs2,:=12(πs2+πs2),\pi^{2,*}_{s}:=\frac{1}{2}(\pi^{2}_{s}+\pi^{2^{\prime}}_{s}),

we have Xs=X(s;t,z,π1,(),π2,())X^{*}_{s}=X(s;t,z,\pi^{1,*}(\cdot),\pi^{2,*}(\cdot)).

Now, due to the strict convexity of the functions x(C1x)2x\rightarrow(C_{1}-x)^{2} and x(Fxaa1)2x\rightarrow(\frac{F-x}{a_{a_{1}}})^{2}, the following strict inequality for the action functionals is concluded, which contradicts the optimality of the strategies π\pi and π\pi^{\prime},

12\displaystyle\frac{1}{2} [J(t,z;π1(),π2())+J(t,z;π1(),π2())]\displaystyle[J(t,z;\pi^{1}(\cdot),\pi^{2}(\cdot))+J(t,z;\pi^{1^{\prime}}(\cdot),\pi^{2^{\prime}}(\cdot))]
=12𝔼[κtTηs(C1πs2)2𝑑s+ηT(FXTaa1)2]\displaystyle=\;\frac{1}{2}\mathbb{E}[\kappa\int^{T}_{t}\eta_{s}(C_{1}-\pi^{2}_{s})^{2}ds+\eta_{T}(\frac{F-X_{T}}{a_{a_{1}}})^{2}]
+12𝔼[κtTηs(C1πs2)2𝑑s+ηT(FXTaa1)2]\displaystyle\;\;\;\;\;+\frac{1}{2}\mathbb{E}[\kappa\int^{T}_{t}\eta_{s}(C_{1}-\pi^{2^{\prime}}_{s})^{2}ds+\eta_{T}(\frac{F-X^{\prime}_{T}}{a_{a_{1}}})^{2}]
>𝔼[κtTηs(C1πs2,)2𝑑s+ηT(FXTaa1)2]=J(t,z;π1,(),π2,()).\displaystyle>\mathbb{E}[\kappa\int^{T}_{t}\eta_{s}(C_{1}-\pi^{2,*}_{s})^{2}ds+\eta_{T}(\frac{F-X^{*}_{T}}{a_{a_{1}}})^{2}]=J(t,z;\pi^{1,*}(\cdot),\pi^{2,*}(\cdot)).

Due to (4.66) and (4.67), the optimal strategy is obtained by applying the feedback maps on the solution of the closed loop equation. So, the uniqueness of the optimal strategy yields the uniqueness of solution of the closed loop equation; see [9, Remark 5.4] for a rigorous proof.

5 Numerical Algorithm

In Section 4, the Neumann boundary condition (4.13)-(iii) is employed to show the regularity of the value function VV. However, due to Theorem 4.8, the value function VV is the unique viscosity solution of (4.11) with conditions (4.13)-(i)-(ii)-(iiiiii^{\prime}). Therefore, to get the numerical approximation of the value function, we can use the Dirichlet condition (4.13)-(iiiiii^{\prime}) instead of the Neumann condition (4.13)-(iii).

5.1 Finite difference method

The finite difference method is applied to discretize Equation (4.11). The time horizon [0,T][0,T] is divided to M=T×52M=T\times 52 subintervals of equal length Δt=152\Delta t=\frac{1}{52}, the length of one week in a year. Moreover, the space interval [S,F][S,F] is discretized as S=z0,z1,z2,,zN+1=FS=z_{0},z_{1},z_{2},\cdots,z_{N+1}=F, whose the lengths of steps are equal to Δz\Delta z.

For the time and the space second derivative, the forward and the central difference schemes are employed, respectively. Moreover, at any node, we employ the forward (backward) scheme for the first space derivative when the function α\alpha is nonnegative (negative). Therefore, denoting V(i,j)=V(ti,zj)V(i,j)=V(t_{i},z_{j}), 0iM,0jN+10\leq i\leq M,0\leq j\leq N+1, the discretization of Equation (4.11) at any node (ti,zj),0iM1,1jN(t_{i},z_{j}),0\leq i\leq M-1,1\leq j\leq N, is given by

V(i+1,j)V(i,j)Δt\displaystyle\frac{V(i+1,j)-V(i,j)}{\Delta t}
+infπ1,π2{\displaystyle\quad\qquad\qquad+\inf_{\pi^{1},\pi^{2}}\{ a(i,j)V(i,j+1)+b(i,j)V(i,j1)\displaystyle a(i,j)V(i,j+1)+b(i,j)V(i,j-1)
(a(i,j)+b(i,j))V(i,j)+κηti(C1π2)2}=0,\displaystyle-\left(a(i,j)+b(i,j)\right)V(i,j)+\kappa\eta_{t_{i}}(C_{1}-\pi^{2})^{2}\}=0,\quad (5.1)

in which

a(i,j)=α(ti,zj)Δz+β(ti,zj)(Δz)2,b(i,j)=β(ti,zj)(Δz)2,whenα(ti,zj)0,a(i,j)=\frac{\alpha(t_{i},z_{j})}{\Delta z}+\frac{\beta(t_{i},z_{j})}{(\Delta z)^{2}},\quad b(i,j)=\frac{\beta(t_{i},z_{j})}{(\Delta z)^{2}},\qquad\quad when\;\;\alpha(t_{i},z_{j})\geq 0,

and

a(i,j)=β(ti,zj)(Δz)2,b(i,j)=β(ti,zj)(Δz)2α(ti,zj)Δz,whenα(ti,zj)<0.a(i,j)=\frac{\beta(t_{i},z_{j})}{(\Delta z)^{2}},\quad b(i,j)=\frac{\beta(t_{i},z_{j})}{(\Delta z)^{2}}-\frac{\alpha(t_{i},z_{j})}{\Delta z},\qquad\quad when\;\;\alpha(t_{i},z_{j})<0.

The above representation clearly implies that the coefficients of V(i,j+1)V(i,j+1) and V(i,j1)V(i,j-1) are nonnegative, which means that our scheme has the positive coefficient property; see [14, Condition 4.1] for the more precise definition of this property.

Remark 5.1.

It should be noted that for the parameters that are considered in the next section and corresponding to the obtained optimal strategies, the function α\alpha is nonnegative at any point.

Moreover, the Lipschitz continuity of the functions α\alpha, β\beta and smoothness property of the loss function satisfy the assumptions of [13, V, Theorem 8.1] which implies the comparison property for Equation (4.11).

It is well known that having the comparison property, a numerical scheme with the positive coefficient property is convergent, if it is stable in ll_{\infty} norm, monotone and consistent; see [3, Theorem 2.1]. We show these properties in the following propositions.

Proposition 5.2.

The discretization (5.1) satisfies ll_{\infty}- stability property,

V(i,)(FSaa1)2+κ(C1C2)2,0iM.\displaystyle\left\|V(i,\cdot)\right\|_{\infty}\leq\left(\frac{F-S}{a_{a_{1}}}\right)^{2}+\kappa(C_{1}-C_{2})^{2},\qquad 0\leq i\leq M. (5.2)
Proof.

By employing the optimal strategies π1,,π2,\pi^{1,*},\pi^{2,*} in the coefficients of (5.1), we have for 0iM10\leq i\leq M-1 and 1jN1\leq j\leq N

V(i,j)=\displaystyle V(i,j)= V(i+1,j)+Δta(i,j)V(i,j+1)+Δtb(i,j)V(i,j1)\displaystyle V(i+1,j)+\Delta ta(i,j)V(i,j+1)+\Delta tb(i,j)V(i,j-1)
Δt(a(i,j)+b(i,j))V(i,j)\displaystyle-\Delta t(a(i,j)+b(i,j))V(i,j)
+Δtκηti(C1π2,)2.\displaystyle+\Delta t\kappa\eta_{t_{i}}(C_{1}-\pi^{2,*})^{2}. (5.3)

So, we have

|V(i,j)|(1+Δt(a(i,j)+b(i,j)))\displaystyle\left|V(i,j)\right|\left(1+\Delta t(a(i,j)+b(i,j))\right)\leq V(i,)Δt(a(i,j)+b(i,j))\displaystyle\left\|V(i,\cdot)\right\|_{\infty}\Delta t\left(a(i,j)+b(i,j)\right)
+V(i+1,j)+Δtκηti(C1C2)2.\displaystyle+V(i+1,j)+\Delta t\kappa\eta_{t_{i}}(C_{1}-C_{2})^{2}. (5.4)

If V(i,j1)=V(i,)=max1jNV(i,j)V(i,j_{1})=\left\|V(i,\cdot)\right\|_{\infty}=max_{1\leq j\leq N}V(i,j), then considering the above inequality for the node (i,j1)(i,j_{1}), we get

V(i,)(1+Δt(a(i,j1)+b(i,j1)))\displaystyle\left\|V(i,\cdot)\right\|_{\infty}\left(1+\Delta t(a(i,j_{1})+b(i,j_{1}))\right)\leq V(i,)Δt(a(i,j1)+b(i,j1))\displaystyle\left\|V(i,\cdot)\right\|_{\infty}\Delta t\left(a(i,j_{1})+b(i,j_{1})\right)
+V(i+1,)+Δtκηti(C1C2)2,\displaystyle+\left\|V(i+1,\cdot)\right\|_{\infty}+\Delta t\kappa\eta_{t_{i}}(C_{1}-C_{2})^{2},

which means

V(i,)V(i+1,)+Δtκηti(C1C2)2.\left\|V(i,\cdot)\right\|_{\infty}\leq\left\|V(i+1,\cdot)\right\|_{\infty}+\Delta t\kappa\eta_{t_{i}}(C_{1}-C_{2})^{2}.

Now, regarding the terminal condition of VV, the bound (5.2) is achieved.

Furthermore, the bound is easily achieved on the upper and lower borders of the boundary. ∎

Remark 5.3.

The infimum in Equation (5.1) indicates that by employing any admissible strategy in the coefficients, the equality in (5.3) turns into an inequality that still yields the inequality (5.4). So, the bound (5.2) holds uniformly over the set of admissible strategies.

If we denote the left hand side of Equation (5.1) by

Gji(V(i,j),V(i,j+1),V(i,j1),V(i+1,j)),G^{i}_{j}\left(V(i,j),V(i,j+1),V(i,j-1),V(i+1,j)\right),

the monotone property is stated in the following.

Proposition 5.4.

The discretization scheme (5.1) is monotone, that is for any εi0,i=1,2,3\varepsilon_{i}\geq 0,\;i=1,2,3 we have

Gji\displaystyle G^{i}_{j} (V(i,j),V(i,j+1)+ε1,V(i,j1)+ε2,V(i+1,j)+ε3)\displaystyle\left(V(i,j),V(i,j+1)+\varepsilon_{1},V(i,j-1)+\varepsilon_{2},V(i+1,j)+\varepsilon_{3}\right)
Gji(V(i,j),V(i,j+1),V(i,j1),V(i+1,j))0.\displaystyle-G^{i}_{j}\left(V(i,j),V(i,j+1),V(i,j-1),V(i+1,j)\right)\geq 0.
Proof.

The coefficients of V(i,j+1)V(i,j+1), V(i,j1)V(i,j-1) and V(i+1,j)V(i+1,j) in (5.1) are all nonnegative. So, the above inequality is obtained directly. ∎

Proposition 5.5.

The discretization scheme (5.1) is consistent, that is for any smooth test function ϕ:[0,T]×[S,F]\phi:[0,T]\times[S,F]\rightarrow\mathbb{R} with bounded derivatives of all orders with respect to tt and zz, and denoting ϕji=ϕ(ti,zj)\phi^{i}_{j}=\phi(t_{i},z_{j}), we have

limΔt,Δz0|(ϕt+infπ1,π1{𝒜ϕ+κηt(C1π2)2})jiGji(ϕji,ϕj+1i,ϕj1i,ϕji+1)|=0.\displaystyle\lim_{\Delta t,\Delta z\rightarrow 0}\left|\left(\phi_{t}+\inf_{\pi^{1},\pi^{1}}\{\mathcal{A}\phi+\kappa\eta_{t}(C_{1}-\pi^{2})^{2}\}\right)^{i}_{j}-G^{i}_{j}(\phi^{i}_{j},\phi^{i}_{j+1},\phi^{i}_{j-1},\phi^{i+1}_{j})\right|=0.
Proof.

Since the coefficients α\alpha and β\beta, and the derivatives of ϕ\phi are all bounded on the domain [0,T]×[S,F][0,T]\times[S,F], the following approximations are concluded by using Taylor series expansion

|(𝒜ϕ)ji(α(i,j)ϕj+1iϕjiΔz+β(i,j)ϕj+1i2ϕji+ϕj1i(Δz)2)|=O(Δz),\left|(\mathcal{A}\phi)^{i}_{j}-\left(\alpha(i,j)\frac{\phi^{i}_{j+1}-\phi^{i}_{j}}{\Delta z}+\beta(i,j)\frac{\phi^{i}_{j+1}-2\phi^{i}_{j}+\phi^{i}_{j-1}}{(\Delta z)^{2}}\right)\right|=O(\Delta z),
|(𝒜ϕ)ji(α(i,j)ϕj1iϕjiΔz+β(i,j)ϕj+1i2ϕji+ϕj1i(Δz)2)|=O(Δz),\left|(\mathcal{A}\phi)^{i}_{j}-\left(-\alpha(i,j)\frac{\phi^{i}_{j-1}-\phi^{i}_{j}}{\Delta z}+\beta(i,j)\frac{\phi^{i}_{j+1}-2\phi^{i}_{j}+\phi^{i}_{j-1}}{(\Delta z)^{2}}\right)\right|=O(\Delta z),

which correspond to the forward and backward schemes of the first order derivative, respectively.

Moreover, clearly we have |(ϕt)jiϕji+1ϕjiΔt|=O(Δt)\left|(\phi_{t})^{i}_{j}-\frac{\phi^{i+1}_{j}-\phi^{i}_{j}}{\Delta t}\right|=O(\Delta t), which concludes the proposition. ∎

5.2 Policy iteration method

Since the coefficients in (5.1) depend on the control variables, applying the implicit time stepping method yields nonlinear equation at each time step. To overcome this difficulty, at each time step, we apply the policy iteration method, in which the optimal strategy and the value function are approximated iteratively.

The value function is known at the terminal time TT. So, we go backward in time and start from the last column t=TΔtt=T-\Delta t inside the domain 𝒞\mathcal{C}^{\prime}. On each column t=tit=t_{i}, we take as the starting point the value function on the next time step, the vector V(ti+Δt,)V(t_{i}+\Delta t,\cdot), and apply the following algorithm.

Policy iteration algorithm:

I) Policy improvement: For the given value function on the nodes (ti,zj),1jN(t_{i},z_{j}),1\leq j\leq N, solve the static optimization problem (5.1) to find optimal investment and consumption strategies π1\pi^{1} and π2\pi^{2}, respectively.

II) Policy evaluation: Employing the strategies π1,π2\pi^{1},\pi^{2} obtained from Step (I) and considering the boundary conditions (4.13)-(i)-(ii)-(iiiiii^{\prime}), solve the linear system (5.1) corresponding to all nodes (ti,zj),1jN(t_{i},z_{j}),1\leq j\leq N, to find new value function, Vnew(ti,)V^{new}(t_{i},\cdot), on the column t=tit=t_{i}.

III) Convergence criterion: If the following inequality does not hold, return to the Step (I)

max1jN|Vnew(ti,zj)Vold(ti,zj)|{max1jN|Vnew(ti,zj)|}×106.\max_{1\leq j\leq N}\left|V^{new}(t_{i},z_{j})-V^{old}(t_{i},z_{j})\right|\leq\{\max_{1\leq j\leq N}\left|V^{new}(t_{i},z_{j})\right|\}\times 10^{-6}.

In the next theorem, we prove that the iteration policy gives a monotone sequence of approximate value functions. Our proof is a modification of the proof given in [26, Theorem 1].

We write the discrete representation (5.1) in the matrix form. Let Vi=V(i,)V^{i}=V(i,\cdot) be the vector represents the value function on t=tit=t_{i} and DiD^{i} the (N+2)×(N+2)(N+2)\times(N+2) tridiagonal matrix in which the first and last rows are zero and

[DiVi]j=\displaystyle[D^{i}V^{i}]_{j}= a(i,j)Vi(j+1)+b(i,j)Vi(j1)\displaystyle a(i,j)V^{i}(j+1)+b(i,j)V^{i}(j-1)
(a(i,j)+b(i,j))Vi(j),1jN.\displaystyle-\left(a(i,j)+b(i,j)\right)V^{i}(j),\qquad\qquad\qquad 1\leq j\leq N.

Then we can rewrite (5.1) for all 0iM10\leq i\leq M-1 as

[IΔtDi]Vi=Vi+1+ΔtEi+GiGi+1,\displaystyle[I-\Delta tD^{i}]V^{i}=V^{i+1}+\Delta tE^{i}+G^{i}-G^{i+1}, (5.5)

in which Ei=κηti(C1π2)2E^{i}=\kappa\eta_{t_{i}}(C_{1}-\pi^{2})^{2} and GiG^{i} is a vector with just one nonzero element stands for the boundary value of VV at (ti,S)(t_{i},S). Notice that (IΔtDi)(I-\Delta tD^{i}) is a tridiagonal matrix in which the diagonal entries are positive, the off-diagonal entries are negative and the row sums are all equal to 1. So, it is an MM-matrix and therefore we have (IΔtDi)10(I-\Delta tD^{i})^{-1}\geq 0.

Theorem 5.6.

For any fixed column t=tit=t_{i}, the policy iteration algorithm gives a monotone sequence of vectors which converges to ViV^{i}.

Proof.

Let W0=Vi+1W^{0}=V^{i+1} and Wk,k1W^{k},k\geq 1 be the approximation of ViV^{i} obtained in Step (II) in the kth iteration. Then, from (5.5) we have

[IΔtDi,k(π1,k,π2,k)]Wk=Vi+1+ΔtEi,k(π1,k,π2,k)+GiGi+1,\displaystyle[I-\Delta tD^{i,k}(\pi^{1,k},\pi^{2,k})]W^{k}=V^{i+1}+\Delta tE^{i,k}(\pi^{1,k},\pi^{2,k})+G^{i}-G^{i+1}, (5.6)

where, for any k1k\geq 1,

(π1,k,π1,k)=argmin(π1,k,π2,k)Πad{Di,k(π1,k,π2,k)Wk1+Ei,k(π1,k,π2,k)}.\displaystyle(\pi^{1,k},\pi^{1,k})=argmin_{(\pi^{1,k},\pi^{2,k})\in\Pi_{ad}}\left\{D^{i,k}(\pi^{1,k},\pi^{2,k})W^{k-1}+E^{i,k}(\pi^{1,k},\pi^{2,k})\right\}. (5.7)

Hence, the coefficients Di,k,Ei,kD^{i,k},E^{i,k} are modified in each iteration.

In view of (5.6) and doing some manipulations, we get

[IΔtDi,k+1](Wk+1Wk)=Δt[(Di,k+1Wk+Ei,k+1)(Di,kWk+Ei,k)].\displaystyle[I-\Delta tD^{i,k+1}](W^{k+1}-W^{k})=\Delta t\left[(D^{i,k+1}W^{k}+E^{i,k+1})-(D^{i,k}W^{k}+E^{i,k})\right]. (5.8)

The construction (5.7) implies that, given WkW^{k}, the coefficients Di,k+1D^{i,k+1} and Ei,k+1E^{i,k+1} yield the minimum value for all elements of the vector (Di,k+1Wk+Ei,k+1)(D^{i,k+1}W^{k}+E^{i,k+1}). Therefor we have for all 0jN+10\leq j\leq N+1

[(Di,k+1Wk+Ei,k+1)(Di,kWk+Ei,k)]j0.\left[(D^{i,k+1}W^{k}+E^{i,k+1})-(D^{i,k}W^{k}+E^{i,k})\right]_{j}\leq 0.

Now, since (IΔtDi,k+1)10(I-\Delta tD^{i,k+1})^{-1}\geq 0, we conclude from (5.8) that the sequence of vectors Wk,k0W^{k},k\geq 0 is decreasing. Furthermore, due to Remark 5.3, the sequence (Wk)k0(W^{k})_{k\geq 0} is uniformly bounded. So, it is convergent.

The construction process of WkW^{k} implies that it converges to ViV^{i}, the unique solution of (5.1), which is an approximation of the value function at t=tit=t_{i}. ∎

6 Simulation Results

For comparison purposes, we assume the same market parameters as in [18] and [12]. So, we assume the interest rate to be r=0.03r=0.03 and the expected return and the volatility of the risky asset μ=0.08\mu=0.08 and σ=0.15\sigma=0.15, respectively, which implies a Sharpe ratio equal to β=0.33\beta=0.33. Furthermore, we consider a retiree with age a0=60a_{0}=60 and initial wealth x0=100x_{0}=100. In addition, the length of the decumulation phase is assumed to be equal to T=15T=15 years, which means that a1=75a_{1}=75. The maximum consumption rate is set to be C1=6.5155C_{1}=6.5155, which equals the payments of a lifetime annuity purchasable at the retirement time, in view of the mortality rate given in this section.

We consider four scenarios for the minimum admissible consumption rate, C2=C1C_{2}=C_{1}, C2=34C1C_{2}=\frac{3}{4}C_{1}, C2=23C1C_{2}=\frac{2}{3}C_{1} and C2=12C1C_{2}=\frac{1}{2}C_{1}. Moreover, the target level F=1.75C1a75F=1.75C_{1}a_{75} and the safety level S=0.5C1a75S=0.5C_{1}a_{75} are considered for the wealth process, which in the literature correspond to the medium level of risk aversion; see [18] and [12]. This means that in our setting, the final annuity that the retiree will receive is at most 1.75 and at least 0.5 times C1C_{1}. It is clear that the higher the risk aversion, the higher the target FF, and the lower the safety level SS.

We should note that our main concern in this work is comparing the different scenarios of the admissible ranges of consumption. So, here the levels FF and SS are fixed. However, our simulation results corresponding to the other levels of risk aversion, which have not been reported here, demonstrate results similar to the ones obtained for the medium level of risk aversion.

To simulate the optimal wealth process, the same stream of 5000 pseudo random numbers is applied to different scenarios. Using the simulated optimal wealth processes when κ=0.5\kappa=0.5, we present the histograms of the final annuities in Figs. 6, 6, 6 and 6 and reveal some percentiles of the optimal wealth amounts in Figs. 6, 6, 12 and 12, and the optimal investment strategies in Figs. 12, 12, 12 and 12.

The results show that when the admissible range of consumption is more restricted the investment is a little more risky and the final annuity is higher, on average. Moreover, the main indication of the graphs and histograms is the big difference between the final annuity and the optimal wealth process that correspond to the fixed consumption scenario and those that correspond to three other scenarios. Actually, we conclude from the results that by assuming a variable consumption rate, although quite restricted to a short range, we get considerably more valuable final annuities and greater optimal wealth amounts.

The percentiles of the optimal consumption rate for κ=0.5\kappa=0.5 and κ=1\kappa=1, are exhibited in Figs. 18-18, and in Figs. 18-18, respectively. We see that compared to C1=6.5155C_{1}=6.5155, the consumption rates are fifty percent greater than 66, and therefore are not much smaller than C1C_{1}. Moreover, as it is expected, the optimal consumption is higher when κ\kappa is greater.

Refer to caption
Figure 1: Final Annuity (C2=12C1C_{2}=\frac{1}{2}C_{1})
Refer to caption
Figure 2: Final Annuity (C2=34C1C_{2}=\frac{3}{4}C_{1})
Refer to caption
Figure 3: Optimal Wealth (C2=12C1C_{2}=\frac{1}{2}C_{1})
Refer to caption
Figure 4: Final Annuity (C2=23C1C_{2}=\frac{2}{3}C_{1})
Refer to caption
Figure 5: Final Annuity (C2=C1C_{2}=C_{1})
Refer to caption
Figure 6: Optimal Wealth (C2=23C1C_{2}=\frac{2}{3}C_{1})
Refer to caption
Figure 7: Optimal Wealth (C2=34C1C_{2}=\frac{3}{4}C_{1})
Refer to caption
Figure 8: Risky Investment (C2=12C1C_{2}=\frac{1}{2}C_{1})
Refer to caption
Figure 9: Risky Investment (C2=34C1C_{2}=\frac{3}{4}C_{1})
Refer to caption
Figure 10: Optimal Wealth (C2=C1C_{2}=C_{1})
Refer to caption
Figure 11: Risky Investment (C2=23C1C_{2}=\frac{2}{3}C_{1})
Refer to caption
Figure 12: Risky Investment (C2=C1C_{2}=C_{1})
Refer to caption
Figure 13: Optimal Consumption (C2=12C1,K=0.5C_{2}=\frac{1}{2}C_{1},\;K=0.5)
Refer to caption
Figure 14: Optimal Consumption (C2=23C1,K=0.5C_{2}=\frac{2}{3}C_{1},\;K=0.5)
Refer to caption
Figure 15: Optimal Consumption (C2=34C1,K=0.5C_{2}=\frac{3}{4}C_{1},\;K=0.5)
Refer to caption
Figure 16: Optimal Consumption (C2=12C1,K=1C_{2}=\frac{1}{2}C_{1},\;K=1)
Refer to caption
Figure 17: Optimal Consumption (C2=23C1,K=1C_{2}=\frac{2}{3}C_{1},\;K=1)
Refer to caption
Figure 18: Optimal Consumption (C2=34C1,K=1C_{2}=\frac{3}{4}C_{1},\;K=1)

For a comprehensive comparison of the outcomes of different scenarios, we take into account the market present value of the cash flows before and after the annuitization. The cash flow before the annuitization consists of withdrawals πt2\pi^{2}_{t} from the fund, for 0tT0\leq t\leq T. Moreover, in the case of death before the annuitization, we also take into account the accumulated wealth at the time of death. The cash flow after the annuitization consists of the constant payments of the lifetime annuity that is purchased by the accumulated wealth at the terminal time TT. Therefore, considering XtX_{t} as the fund value at time tt, the present value of the cash flows is written as

P.V.=0τd15eρtπt2dt+eρτdXτd1{τd15}+τd15τdX15a75eρtdt,\displaystyle P.V.=\int^{\tau_{d}\wedge 15}_{0}e^{-\rho t}\pi^{2}_{t}dt+e^{-\rho\tau_{d}}X_{\tau_{d}}1_{\{\tau_{d}\leq 15\}}+\int^{\tau_{d}}_{\tau_{d}\wedge 15}\frac{X_{15}}{a_{75}}e^{-\rho t}dt,

in which τd=τ~d60\tau_{d}=\tilde{\tau}_{d}-60, where τ~d\tilde{\tau}_{d} denotes the time of death.

For the mortality rate, the Gompertz-Makeham distribution is considered, which for an individual of age tt is given as

νt=A+BCt.\nu_{t}=A+BC^{t}.

The parameters A,BA,B and CC are those considered by the Belgian regulator for the pricing of the lifetime annuities purchased by males, as in [16]. So, we assume A=0.00055845A=0.00055845, B=0.000025670B=0.000025670 and C=1.1011C=1.1011.

Now, assuming that the time of death is independent of the filtration of financial returns, we get

P.V.=015ηt(πt2+ν60+tXt)dt+15Tm60ηtX15a75dt,\displaystyle P.V.=\int^{15}_{0}\eta_{t}\left(\pi^{2}_{t}+\nu_{60+t}X_{t}\right)dt+\int^{T_{m}-60}_{15}\eta_{t}\frac{X_{15}}{a_{75}}dt,

in which Tm=100T_{m}=100 is considered as the maximum lifespan.

Table 1 reveals some statistics of the final annuity and the present value of the cash flows for different admissible ranges of consumption and different weights of the running cost, κ=0.25,0.5,0.75,\kappa=0.25,0.5,0.75, and 1. Furthermore, it reports the mean of the consumptions before the annuitization and the probability that the final annuity is greater than C1C_{1}, the annuity that is purchasable at the retirement time.

As it is expected, when the minimum admissible consumption rate C2C_{2} decreases or equivalently the admissible range of consumption becomes wider, we get more valuable final annuity and higher cash flows. Furthermore, in this case the probability of achieving a final annuity greater than C1C_{1} increases.

We see in Table 1 again the remarkable difference between the outcomes of the fixed consumption rate scenario C2=C1C_{2}=C_{1} and the outcomes that correspond to three other scenarios. Actually, when the consumption rate is fixed, considerably less valuable final annuity is achieved. Furthermore, the present value of the cash flows is remarkably less than the analogous outcomes of the variable consumption rate scenarios. However, since in this case the cash flow before the annuitization is constant, the total cash flows become less deviated.

Table 1: Distribution of final annuities, consumptions and cash flows

C2=12C1C_{2}=\frac{1}{2}C_{1} C2=23C1C_{2}=\frac{2}{3}C_{1} C2=34C1C_{2}=\frac{3}{4}C_{1} C2=C1C_{2}=C_{1}
κ=0.25\kappa=0.25 κ=0.25\kappa=0.25 κ=0.25\kappa=0.25
κ=0.5\kappa=0.5 κ=0.5\kappa=0.5 κ=0.5\kappa=0.5
κ=0.75\kappa=0.75 κ=0.75\kappa=0.75 κ=0.75\kappa=0.75
κ=1\kappa=1 κ=1\kappa=1 κ=1\kappa=1
Mean of FA11footnotemark: 1 9.59 9.43 9.28 5.69
9.08 8.96 8.85
8.77 8.67 8.57
8.54 8.45 8.37
Standard deviation of FA 2.14 2.24 2.33 2.77
2.43 2.50 2.55
2.58 2.63 2.67
2.68 2.71 2.74
Mean of PV22footnotemark: 2 105.21 104.15 103.25 96.17
105.34 104.59 103.92
105.06 104.46 103.91
104.71 104.22 103.76
Standard deviation of PV 13.95 13.77 13.49 12.17
14.11 14.14 14.08
14.12 14.19 14.21
14.16 14.18 14.21
Prob(FA >C1>C_{1}) (%) 89.62 88.10 86.22 37.44
83.72 82.42 80.98
80.10 79.08 77.92
77.42 76.54 75.76
Prob(XT=SX_{T}=S) (%) 1.62 1.94 2.16 6.94
1.66 2.60 2.86
1.44 1.94 3.24
0.94 2.18 3.26
Mean of consumptions 5.7752 5.7466 5.7341 6.5155
5.9869 5.9698 5.9595
6.0873 6.0756 6.0668
6.1470 6.1398 6.1330
1FA=Final Annuity
2 PV=Present Value

For a fixed admissible range of consumption, comparing the outcomes that correspond to different weights of the running cost, we find quite reasonably, finer final annuities (higher mean with lower standard deviation) for smaller weights. Actually, when κ\kappa is smaller, more weight is devoted to the second term of the loss function, a term which is based on the final annuity.

The surprising result for a fixed admissible range of consumption is that the maximum present value of cash flows is attained when κ=0.5\kappa=0.5. This can be interpreted by considering simultaneously the two terms of the loss function. In fact, for a smaller κ\kappa, in which more weight of the loss function is devoted to the final annuity, higher cash flow after the annuitization is expected. On the other hand, for a greater κ\kappa, in which more weight is devoted to the consumption, higher cash flow before the annuitization is expected.

The last rows of the table report the mean of the consumptions. As it is expected, we have on average more consumption for the greater κ\kappa. The surprising point is that when the admissible range of consumption becomes more limited, or C2C_{2} increases, the mean of the consumptions slightly decreases. This phenomenon can be interpreted from the results obtained for the probability of hitting the lower border, Prob{XT=S}Prob\{X_{T}=S\}, since this probability is higher when the range is more limited. Furthermore, whenever the wealth process hits the lower border, afterwards the consumption rate must be at the minimum rate.

7 Conclusions

The optimal investment-consumption problem post-retirement is investigated. Considering a minimum guarantee for the final annuity, a variable consumption rate together with a running cost term of loss function, which is based on the consumption, yield a nonlinear HJB equation on an irregular bounded domain. Using the dual equation, the existence and uniqueness of a classical solution are proved. To obtain a numerical approximation of the solution of the HJB equation, we apply the backward in time implicit scheme of the finite difference method. Our scheme has the positive coefficient property which guarantees convergence to the viscosity solution. To tackle the nonlinearity that appears in the numerical scheme, the policy iteration method is applied.

Four different scenarios for the admissible range of consumption are considered. Our simulation results show that by assuming a variable consumption rate, although quite restricted, a much finer final annuity is achieved. Furthermore, the outcomes corresponding to different weights of the running cost term are compared.

Moreover, by evaluating the present value of the total cash flows, before and after the annuitization, we observe higher values when the admissible range of consumption is wider.

Our goal for future research is to consider a market model with jump-diffusion dynamics, which seems to be a challenging problem.

References

  • [1] P. Albrecht, P. Maurer, Self-annuitization, consumption shortfall in retirement and asset allocation: the annuity benchmark. Journal of Pension Economics and Finance. 1(3), (2002) 269-288.
  • [2] G. Barles, Nonlinear Neumann boundary conditions for quasilinear degenerate elliptic equations and applications, Journal of Differential Equations, 154, (1999)191-224.
  • [3] G. Barles and P. Souganidis, Convergence of approximation schemes for fully nonlinear equations. Asymptotic Analysis, 4 (1991) 271-283.
  • [4] D. Blake, A. J. G. Cairns, and K. Dowd, Pensionmetrics 2: Stochastic pension plan design during the distribution phase. Insurance: Mathematics and Economics. 33(1), (2003) 29-47.
  • [5] J. F. Boulier, S. Huang, G. Taillard, Optimal management under stochastic interest rates: The case of a protected defined contribution pension fund. Insurance: Mathematics and Economics. 28(2), (2001) 173-189.
  • [6] T. Chellathurai, T. Draviam, Dynamic portfolio selection with fixed and/or proportional transaction costs using non-singular stochastic optimal control theory. Journal of Economic Dynamics and Control, 31, (2007) 2168-2195.
  • [7] M. Crandall, H. Ishii, P.L. Lions, User’s guide to viscosity solutions of second order partial differential equations, Bulletin of the American Mathematical Society 27, (1992) 1-76.
  • [8] H. Dadashi, Optimal investment strategy post retirement without ruin possibility: A numerical algorithm, Journal of Computational and Applied Mathematics 363, (2020) 325-336.
  • [9] M. Di Giacinto, S. Federico, F. Gozzi, E. Vigna, Constrained portfolio choices in the decumulation phase of a pension plan, Carlo Alberto Notebooks, 155 (2010).
  • [10] G. Deelstra, M. Grasseli, P.F. Koehl, Optimal investment strategies in the presence of a minimum guarantee. Insurance: Mathematics and Economics 33, (2003) 189-207.
  • [11] S. Federico, P. Gassiat, and F. Gozzi, Utility maximization with current utility on the wealth: regularity of solutions to the HJB equation, Finance and Stochastics 19, (2015) 415-448.
  • [12] M. Di Giacinto, S. Federico, F. Gozzi, E. Vigna, Income drawdown option with minimum guarantee. European Journal of Operational Research, 234(3), (2014) 610-624.
  • [13] W. Fleming, H. Soner, Controlled Markov Processes and Viscosity Solutions. Second Edition. Springer-Verlag, New York, 2006.
  • [14] P. Forsyth, G. Labahn, Numerical methods for controlled Hamilton-Jacobi-Bellman PDEs in finance. Journal of Computational Finance, 11(2), (2007) 1-44.
  • [15] P. Forsyth, L. Wang, Maximal use of central differencing for Hamilton-Jacobi-Bellman PDEs in finance. SIAM Journal on Numerical Analysis, 46(3), (2008) 1580-1601.
  • [16] D. Hainaut, G. Deelstra, Optimal timing for annuitization, based on jump diffusion fund and stochastic mortality. Journal of Economic Dynamics and Control, 44, (2014) 124-146.
  • [17] N.-W. Han, M.-W. Hung, Optimal asset allocation for DC pension plans under inflation. Insurance: Mathematics and Economics 51(1), (2012) 172-181.
  • [18] R. Gerrard, S. Haberman, and E. Vigna, Optimal investment choices post-retirement in a defined contribution pension scheme. Insurance: Mathematics and Economics, 35, (2004) 321-342.
  • [19] R. Gerrard, S. Haberman, and E. Vigna, The management of decumulation risk in a defined contribution environment. North American Actuarial Journal, 10, (2006) 84-110.
  • [20] R. Gerrard, B. Højgaard, and E. Vigna, Choosing the optimal annuitization time post-retirement. Quantitative Finance, 12(7), (2012) 1143-1159.
  • [21] S. Janson, J. Tysk, Preservation of convexity of solutions to parabolic equations. Journal of Differential Equations, 206 (2004) 182-226.
  • [22] I. Karatzas, S. Shreve, Methods of Mathematical Finance. Springer-Verlag, New York, 1998.
  • [23] M.A. Milevsky, K. Moore, and V.R. Young, Optimal Asset Allocation and Ruin-Minimization Annuitization Strategies. Mathematical Finance, 16, (2006) 647-671.
  • [24] M.A. Milevsky, V.R. Young, Annuitization and asset allocation. Journal of Economic Dynamics and Control, 31(9), (2007) 3138-3177.
  • [25] B. Øksendal, Stochastic Differential Equations. 6th Edition. Springer-Verlag, New York, 2003.
  • [26] D. Pooley, P. Forsyth, and K. Vetzal, Numerical Convergence Properties of Option Pricing PDEs with Uncertain Volatility. IMA Journal of Numerical Analysis, 23, (2003) 241-267.
  • [27] M.S. Santos and J. Rust, Convergence properties of policy iteration. SIAM J. Control Optim., 42(6), (2004) 2094-2115.
  • [28] G. Stabile, Optimal timing of annuity purchases: a combined stochastic control and optimal stopping problem. International Journal of Theoretical and Applied Finance, 9(2), (2006) 151-170.
  • [29] E. Vigna, On Efficiency of Mean-Variance based Portfolio Selection in DC Pension Schemes. Quantitative Finance, 14(2), (2014) 237-258.

8 Appendix

Proof of Proposition 4.7

We follow the steps of proof of the same statement in [9, Proposition 4.8]. Since in our problem there are two control variables and the running cost term in the loss function is based on the consumption, our proof is a little different from the corresponding proof in [9]. The steps 2, 3 and 5 in the existing proof can be applied to our problem without essential modifications. So, here we just prove the steps 1 and 4.

Step 1. For any fixed SzFS\leq z\leq F, we prove that the function

tV(t,z)+(C1C3)2t,\displaystyle t\rightarrow V(t,z)+(C_{1}-C_{3})^{2}t, (8.1)

in which C3=C1+(C2C1)FzFSC_{3}=C_{1}+(C_{2}-C_{1})\frac{F-z}{F-S}, is nondecreasing. Consider the function

R:[0,T][S,T],tR(t)=C3r+(zC3r)er(Tt).R:[0,T]\rightarrow[S,T],\qquad t\rightarrow R(t)=\frac{C_{3}}{r}+(z-\frac{C_{3}}{r})e^{-r(T-t)}.

At first, we show that

1(t,R(t))=z,0tT.\displaystyle\mathcal{L}_{1}(t,R(t))=z,\qquad\qquad 0\leq t\leq T. (8.2)

A few manipulations yield

F(t)R(t)\displaystyle F(t)-R(t) =(Fz)[(C1C3)(1er(Tt))r(Fz)+er(Tt)]\displaystyle=(F-z)\left[\frac{(C_{1}-C_{3})(1-e^{-r(T-t)})}{r(F-z)}+e^{-r(T-t)}\right]
=(Fz)[(C1C2)(1er(Tt))r(FS)+er(Tt)],\displaystyle=(F-z)\left[\frac{(C_{1}-C_{2})(1-e^{-r(T-t)})}{r(F-S)}+e^{-r(T-t)}\right],

and

F(t)S(t)\displaystyle F(t)-S(t) =(FS)[(C1C2)(1er(Tt))r(FS)+er(Tt)].\displaystyle=(F-S)\left[\frac{(C_{1}-C_{2})(1-e^{-r(T-t)})}{r(F-S)}+e^{-r(T-t)}\right].

So, we have F(t)R(t)F(t)S(t)=FzFS\frac{F(t)-R(t)}{F(t)-S(t)}=\frac{F-z}{F-S}, and consequently

1(t,R(t))\displaystyle\mathcal{L}_{1}(t,R(t)) =R(t)er(Tt)+[C1+(C2C1)F(t)R(t)F(t)S(t)]1er(Tt)r\displaystyle=R(t)e^{r(T-t)}+\left[C_{1}+(C_{2}-C_{1})\frac{F(t)-R(t)}{F(t)-S(t)}\right]\frac{1-e^{r(T-t)}}{r}
=C3rer(Tt)+(zC3r)+C31er(Tt)r=z.\displaystyle=\frac{C_{3}}{r}e^{r(T-t)}+(z-\frac{C_{3}}{r})+C_{3}\frac{1-e^{r(T-t)}}{r}=z.

Moreover, it can be checked easily that by applying the strategies π1=0\pi^{1}=0 and π2=C3\pi^{2}=C_{3}, the wealth process will have the same dynamics as the function RR. Therefore, if at any time 0tT0\leq t\leq T the wealth amount equals R(t)R(t), then it will remain on the curve {R(s),tsT}\{R(s),t\leq s\leq T\}.

The above observations imply that the wealth process can move from the point (t,z)(t,z) to the point (t,z)(t^{\prime},z) by the strategies which yield the cost amount (C1C3)2(tt)(C_{1}-C_{3})^{2}(t-t^{\prime}). This yields that the function (8.1) is nondecreasing.

Step 2. The function tV(t,z)t\rightarrow V(t,z) is continuous for every SzFS\leq z\leq F.

Step 3. The function VV is continuous on the set [0,T]×[S+ε,F][0,T]\times[S+\varepsilon,F], for any ε>0\varepsilon>0.

Step 4. In this step, we claim the right continuity of the value function VV in the space variable at z=Sz=S, for any fixed 0t0T0\leq t_{0}\leq T. We prove the right continuity of V~(t0,)\tilde{V}(t_{0},\cdot) at x=S(t0)x=S(t_{0}), which trivially implies the claim.

Let at time t0t_{0} the wealth process be equal to x0=S(t0)+εx_{0}=S(t_{0})+\varepsilon, in which ε\varepsilon is a positive constant. By applying the consumption rate π2C2\pi^{2}\equiv C_{2} and the null investment strategy π10\pi^{1}\equiv 0, the wealth process will remain on the curve

L(t)=(x0C2r)er(tt0)+C2r,t0tT.L(t)=\left(x_{0}-\frac{C_{2}}{r}\right)e^{r(t-t_{0})}+\frac{C_{2}}{r},\qquad\quad t_{0}\leq t\leq T.

So, by applying the strategy π2C2\pi^{2}\equiv C_{2} and any admissible investment strategy π1\pi^{1}, the process ert(X(t;t0,x0,π1(),C2)L(t)),t0tT,e^{-rt}\left(X(t;t_{0},x_{0},\pi^{1}(\cdot),C_{2})-L(t)\right),\;t_{0}\leq t\leq T, is a martingale under the risk-neutral measure. Therefore, we get the following equality for the expectation of the terminal wealth under the risk-neutral measure

𝔼~X(T;t0,x0,π1(),C2)=L(T)=S+εer(Tt0),\displaystyle\tilde{\mathbb{E}}X(T;t_{0},x_{0},\pi^{1}(\cdot),C_{2})=L(T)=S+\varepsilon e^{r(T-t_{0})}, (8.3)

which converges to the safety level SS as ε0\varepsilon\rightarrow 0.

Then, by using the argument that is employed in [9, Proposition 4.8, Step 4], we get the following convergence under the actual measure uniformly over all admissible investment strategies

𝔼(FX(T;t0,x0,π1(),C2))2(FS)2asε0.\displaystyle\mathbb{E}\left(F-X(T;t_{0},x_{0},\pi^{1}(\cdot),C_{2})\right)^{2}\rightarrow(F-S)^{2}\quad as\;\varepsilon\rightarrow 0. (8.4)

Moreover, it is clear that when π2C2\pi^{2}\not\equiv C_{2}, the terminal wealth amount is less than X(T;t0,x0,π1(),C2)X(T;t_{0},x_{0},\pi^{1}(\cdot),C_{2}). So, the above convergence holds uniformly over all admissible strategies (π1,π2)Π~ad(t0,x0)(\pi^{1},\pi^{2})\in\tilde{\Pi}_{ad}(t_{0},x_{0}).

Now, we are going to estimate the running cost loss function. We prove the following convergence for an admissible consumption strategy π2,ε\pi^{2,\varepsilon}

(Tt0)(C1C2)2t0T(C1πt2,ε)2𝑑t0,asε0.\displaystyle(T-t_{0})(C_{1}-C_{2})^{2}-\int^{T}_{t_{0}}(C_{1}-\pi^{2,\varepsilon}_{t})^{2}dt\rightarrow 0,\quad as\;\varepsilon\rightarrow 0. (8.5)

Obviously, to prove the above convergence, it is enough to prove

t0T(πt2,εC2)𝑑t0,asε0.\displaystyle\int^{T}_{t_{0}}(\pi^{2,\varepsilon}_{t}-C_{2})dt\rightarrow 0,\quad as\;\varepsilon\rightarrow 0. (8.6)

For an admissible strategy (π1,ε,π2,ε)Π~ad(t0,x0)(\pi^{1,\varepsilon},\pi^{2,\varepsilon})\in\tilde{\Pi}_{ad}(t_{0},x_{0}), let

Y(t)=𝔼~X(t;t0,x0,π1,ε(),π2,ε()),Y(t)=\tilde{\mathbb{E}}X(t;t_{0},x_{0},\pi^{1,\varepsilon}(\cdot),\pi^{2,\varepsilon}(\cdot)),

be the expectation of the wealth process under the risk-neutral measure. Since π2,εC2\pi^{2,\varepsilon}\geq C_{2}, we have L(t)Y(t)L(t)\geq Y(t), for all t0tTt_{0}\leq t\leq T. Regarding this inequality and the dynamics of functions YY and LL, we obtain the following inequality

L(T)Y(T)\displaystyle L(T)-Y(T) =t0Tr(L(s)Y(s))𝑑s+t0T(πs2,εC2)𝑑s\displaystyle=\int^{T}_{t_{0}}r(L(s)-Y(s))ds+\int^{T}_{t_{0}}(\pi^{2,\varepsilon}_{s}-C_{2})ds
t0T(πs2,εC2)𝑑s0.\displaystyle\geq\int^{T}_{t_{0}}(\pi^{2,\varepsilon}_{s}-C_{2})ds\geq 0. (8.7)

From (8.3), we obtain L(T)SL(T)\rightarrow S, as ε0\varepsilon\rightarrow 0. So, the minimum guarantee constraint, X(T;t0,x0,π1,ε(),π2,ε())Sa.s.X(T;t_{0},x_{0},\pi^{1,\varepsilon}(\cdot),\pi^{2,\varepsilon}(\cdot))\geq S\;a.s., and consequently Y(T)SY(T)\geq S, implies that the left hand side tends to zero which concludes the convergence (8.6).

Now, (8.4) and (8.5) imply the claim,

V~(t0,S(t0)+ε)V~(t0,S(t0)),asε0.\tilde{V}(t_{0},S(t_{0})+\varepsilon)\rightarrow\tilde{V}(t_{0},S(t_{0})),\qquad as\;\varepsilon\rightarrow 0.

Step 5. The two variable function (t,x)V(t,x)(t,x)\rightarrow V(t,x) is continuous at the boundary [0,T]×{S}[0,T]\times\{S\}.