
A monotone numerical integration method for mean-variance portfolio optimization under jump-diffusion models

Hanwen Zhang School of Mathematics and Physics, The University of Queensland, St Lucia, Brisbane 4072, Australia, email: hanwen.zhang1@uqconnect.edu.au.    Duy-Minh Dang School of Mathematics and Physics, The University of Queensland, St Lucia, Brisbane 4072, Australia, email: duyminh.dang@uq.edu.au
(May 15, 2023)
Abstract

We develop a highly efficient, easy-to-implement, and strictly monotone numerical integration method for Mean-Variance (MV) portfolio optimization. This method proves very efficient in realistic contexts, which involve jump-diffusion dynamics of the underlying controlled processes, discrete rebalancing, and the application of investment constraints, namely no-bankruptcy and leverage.

A crucial element of the MV portfolio optimization formulation over each rebalancing interval is a convolution integral, which involves a conditional density of the logarithm of the amount invested in the risky asset. Using a known closed-form expression for the Fourier transform of this conditional density, we derive an infinite series representation for the conditional density where each term is strictly positive and explicitly computable. As a result, the convolution integral can be readily approximated through a monotone integration scheme, such as a composite quadrature rule typically available in most programming languages. The computational complexity of our proposed method is an order of magnitude lower than that of existing monotone finite difference methods. To further enhance efficiency, we propose an implementation of this monotone integration scheme via Fast Fourier Transforms, exploiting the Toeplitz matrix structure.

The proposed monotone numerical integration scheme is proven to be both \ell_{\infty}-stable and pointwise consistent, and we rigorously establish its pointwise convergence to the unique solution of the MV portfolio optimization problem. We also intuitively demonstrate that, as the rebalancing time interval approaches zero, the proposed scheme converges to a continuously observed impulse control formulation for MV optimization expressed as a Hamilton-Jacobi-Bellman Quasi-Variational Inequality. Numerical results show remarkable agreement with benchmark solutions obtained through finite differences and Monte Carlo simulation, underscoring the effectiveness of our approach.

Keywords: mean-variance, portfolio optimization, monotonicity, numerical integration method, jump-diffusion, discrete rebalancing

1 Introduction

Long-term investors, such as holders of Defined Contribution plans, are typically motivated by asset allocation strategies which are optimal under multi-period criteria. (The holder of a Defined Contribution plan is effectively responsible for making investment decisions for both (i) the accumulation phase (pre-retirement) of about thirty years or more, and (ii) the decumulation phase (in retirement) of perhaps twenty years.) As a result, multi-period portfolio optimization plays a central role in asset allocation. In particular, originating with [47], mean-variance (MV) portfolio optimization forms the cornerstone of asset allocation ([24]), in part due to its intuitive nature, namely the trade-off between risk (variance) and reward (mean). In multi-period settings, MV portfolio optimization aims to obtain an investment strategy (or control) that maximizes the expected value of the terminal wealth of the portfolio for a given level of risk, as measured by the associated variance of the terminal wealth [83]. In recent years, multi-period MV optimization has received considerable attention in institutional settings, including pension fund and insurance applications; see, for example, [32, 48, 40, 52, 64, 70, 75, 76, 79, 72, 29, 28, 11, 41, 78, 82, 85, 39], among many others.

It is important to distinguish between two categories of optimal investment strategies (optimal controls) for portfolio optimization. The first category, referred to as pre-commitment, typically results in time-inconsistent optimal strategies ([83, 37, 71, 20, 19]). The second category, namely the time-consistent or game-theoretical approach, guarantees the time-consistency of the resulting optimal strategy by imposing a time-consistency constraint ([4, 6, 74, 12, 65]). The time-inconsistency of pre-commitment strategies arises because the variance term in the MV objective is not separable in the sense of dynamic programming (see [4, 71]). However, pre-commitment strategies are typically time-consistent under an alternative induced objective function [63], and hence implementable. The merits and demerits of time-consistent and pre-commitment strategies are also discussed in [72]. In subsequent discussions, unless otherwise stated, both time-consistent and pre-commitment strategies are collectively referred to as strategies or controls.

1.1 Background

In the parametric approach, a parametric stochastic model is postulated, e.g. diffusion dynamics, and is then calibrated to market-observed data. (Recently, data-driven (i.e. non-parametric) methods have been proposed for portfolio optimization under different optimality criteria, including mean-variance [39, 10, 51]. Nonetheless, monotonicity of NN-based methods has not been established.) A key concern about, and perhaps also a criticism against, MV portfolio optimization in a parametric setting is its potential lack of robustness to model misspecification error. This criticism originates from the fact that, in single-period settings, MV portfolio optimization can produce notoriously unstable asset allocation strategies under small changes in the underlying asset parameters ([50, 61, 54, 8]). Nonetheless, in the case of multi-period MV optimization, research findings indicate that, when the risky asset dynamics are allowed to follow pure-diffusion dynamics (e.g. GBM) or any of the standard finite-activity jump-diffusion models commonly encountered in financial settings, such as those considered in this work, the pre-commitment and time-consistent MV outcomes of terminal wealth are generally very robust to model misspecification errors [68].

It is well documented in the finance literature that jumps are often present in the price processes of risky assets (see, for example, [15, 58]). In addition, findings in previous research on MV portfolio optimization (pre-commitment and time-consistent strategies) also indicate that (i) jumps in the price processes of risky assets, such as in the Merton model [49] and the Kou model [36], and (ii) realistic investment constraints, such as no-bankruptcy or leverage, have a substantial impact on the efficient frontiers and optimal investment strategies of MV portfolio optimization [19, 65]. Furthermore, the results of [46] show that the effects of stochastic volatility, with realistic mean-reverting dynamics, are not important for long-term investors with time horizons greater than 10 years.

Furthermore, for multi-period MV optimization, it is documented in the literature that the composition of the risky asset basket remains relatively stable over time, which suggests that the primary question is the split between the overall risky asset basket and the risk-free asset, rather than the exact composition of the risky asset basket; see the available analytical solutions for multi-asset time-consistent MV problems (for example, [81]) as well as pre-commitment MV problems (for example, [37]). Therefore, it is reasonable to consider a well-diversified index, instead of a single stock or a basket of stocks, as is common in the MV literature [66, 69, 68, 21, 67, 65]. This is the modeling approach adopted in this work, resulting in a low-dimensional multi-period MV optimization problem.

1.1.1 Monotonicity and no-arbitrage

In general, since solutions to stochastic optimal control problems, including the MV portfolio optimization problem, are often non-smooth, convergence issues of numerical methods, especially monotonicity considerations, are of primary importance. This is because, in the context of numerical methods for optimal control problems, optimal decisions are determined by comparing numerically computed value functions. Non-monotone schemes could produce numerical solutions that fail to converge to the financially relevant solution, i.e. a violation of the discrete no-arbitrage principle [53, 56, 77].

To illustrate the above point further, consider a generic time-advancement scheme from time-(m-1) to time-m of the form

v_{n}^{m}=\sum_{\ell\in\mathcal{L}_{n}}\omega_{n,\ell}\,v_{\ell}^{m-1}.   (1.1)

Here, \omega_{n,\ell} are the weights and \mathcal{L}_{n} is an index set typically capturing the computational stencil associated with the n-th spatial partition point. This time-advancement scheme is monotone if, for any n-th spatial partition point, we have \omega_{n,\ell}\geq 0 for all \ell\in\mathcal{L}_{n}. Optimal controls at time-m are typically determined by comparing candidates numerically computed from applying intervention to the time-advancement results v_{n}^{m}. Therefore, these candidates need to be approximated using a monotone scheme as well. If interpolation is needed in this step, linear interpolation is commonly chosen, due to its monotonicity. (Other non-monotone interpolation schemes are discussed in, for example, [30, 59].) Loss of monotonicity in the time-advancement may result in v_{n}^{m}<0 even if v_{\ell}^{m-1}\geq 0 for all \ell\in\mathcal{L}_{n}.
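To make this concrete, the minimal Python sketch below (ours, not from the paper; the dense stencil and the example weights are illustrative assumptions) advances a value vector one step via (1.1) and checks non-negativity of the weights; with non-negative weights, non-negative data can never be mapped to a negative value.

    import numpy as np

    def advance_one_step(weights, v_prev):
        # One time-advancement step v^m = W v^{m-1} as in (1.1);
        # row n of `weights` holds omega_{n, ell} over a (dense) stencil L_n.
        return weights @ v_prev

    def is_monotone(weights):
        # Strict monotonicity: every weight omega_{n, ell} must be non-negative.
        return np.all(weights >= 0.0)

    rng = np.random.default_rng(0)
    W = np.abs(rng.normal(size=(5, 5)))        # non-negative weights: a monotone scheme
    W /= W.sum(axis=1, keepdims=True)          # normalized rows, for illustration only
    v_prev = np.abs(rng.normal(size=5))        # non-negative data at time level m-1
    assert is_monotone(W) and np.all(advance_one_step(W, v_prev) >= 0.0)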

1.1.2 Numerical methods

For stochastic optimal control problems with a small number of stochastic factors, the PDE approach is often a natural choice. To the best of our knowledge, finite difference (FD) methods remain the only pointwise convergent methods established for pre-commitment and time-consistent MV portfolio optimization in realistic investment scenarios. These scenarios involve the simultaneous application of various types of investment constraints and modeling assumptions, including jumps in the price processes of risky assets, as highlighted in [19, 65]. These FD methods achieve monotonicity in time-advancement through a positive coefficient finite difference discretization (for the partial derivatives), combined with implicit time-stepping. Despite their effectiveness, finite difference methods present significant computational challenges in multi-period settings with long maturities. In particular, they necessitate time-stepping between rebalancing dates, which often occur annually (i.e., control monitoring dates). This time-stepping requirement introduces errors and substantially increases the computational cost of FD methods.

Fourier-based integration methods frequently rely on the presence of an analytical expression for the Fourier transform of the underlying transition density function, or an associated Green’s function, as highlighted in various research such as [45, 62, 2, 34, 26, 44, 43]. Notably, the Fourier cosine series expansion method [25, 60] can achieve high-order convergence for piecewise smooth problems. However, in cases of optimal control problems, which are usually non-smooth, such high-order convergence should not be anticipated.

When applicable, Fourier-based methods offer unique advantages over FD methods and Monte Carlo simulation. These advantages include the absence of timestepping errors between rebalancing (or control monitoring) dates, and the ability to handle complex underlying dynamics such as jump-diffusion, regime-switching, and stochastic variance in a straightforward manner. However, standard Fourier-based methods, much like Monte Carlo simulations, do have a significant drawback: they can potentially lose monotonicity. This potential loss of monotonicity in the context of variable annuities is discussed in depth in [34, 33].

In more detail, consider g(s,s^{\prime},t_{m}-t_{m-1}) as the underlying (scaled) transition density, or a related Green's function. For Lévy processes, which have independent and stationary increments, g(\cdot) depends on s and s^{\prime} only through their difference, i.e., g(s,s^{\prime},\cdot)=g(s-s^{\prime},\cdot). Thus, the advancement of solutions between control monitoring dates takes the form of a convolution integral as follows

v(s,t_{m-1})=\int_{\mathbb{R}}g\left(s-s^{\prime},t_{m}-t_{m-1}\right)v\left(s^{\prime},t_{m}\right)\mathrm{d}s^{\prime}.   (1.2)

In the case of Lévy processes, even though g(\cdot) is not known analytically, the Lévy-Khintchine formula provides an explicit representation of the Fourier transform (or the characteristic function) of g(\cdot), denoted by G(\cdot). This permits the use of Fourier series expansion to reconstruct the entire integral (1.2), not just the integrand. The approach creates a numerical integration scheme of the form (1.1), with the weights \omega_{n,\ell} typically available in the Fourier domain via G(\cdot). Consequently, the algorithms boil down to the utilization of FFTs, which operate efficiently on most platforms. However, there is no assurance that the weights \omega_{n,\ell} are non-negative for all n and \ell, which can potentially lead to a loss of monotonicity.

As highlighted in [3], the requirement for monotonicity in a numerical scheme can be relaxed. This notion of weak monotonicity was initially explored in [7] and was later examined in great detail in [26, 44, 43, 42] for general control problems in finance, including variable annuities. More specifically, the condition for monotonicity, i.e. \omega_{n,\ell}\geq 0 for all \ell\in\mathcal{L}_{n}, is relaxed to \sum_{\ell\in\mathcal{L}_{n}}|\min(\omega_{n,\ell},0)|\leq\epsilon, with \epsilon>0 being a user-defined tolerance for monotonicity. By projecting the underlying transition density or an associated Green's function onto linear basis functions, this approach allows for full control over potential monotonicity loss via the tolerance \epsilon>0: the potential monotonicity loss can be quantified and restricted to \mathcal{O}(\epsilon), thereby enabling (pointwise) convergence as \epsilon\to 0.
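For concreteness, the quantity on the left-hand side of the weak-monotonicity condition can be computed as in the short sketch below (our own illustrative helper; the example weights are made up).

    import numpy as np

    def monotonicity_loss(weights):
        # Row-wise measure sum_ell |min(omega_{n, ell}, 0)|: zero for a strictly
        # monotone scheme, and at most epsilon for a weakly monotone one.
        return np.abs(np.minimum(weights, 0.0)).sum(axis=1)

    W = np.array([[0.6, 0.5, -1e-9],   # tiny negative weight: weakly monotone row
                  [0.3, 0.4,  0.3]])   # all non-negative: strictly monotone row
    print(monotonicity_loss(W))        # [1e-09, 0.0]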

1.2 Objectives

In general, many industry practitioners find implementing monotone finite difference methods for jump-diffusion models to be complex and time-consuming, particularly when striving to utilize central differencing as much as possible, as proposed in [73]. As is well noted in the literature (e.g. [56, 59]), many seemingly reasonable finite difference discretization schemes can yield incorrect solutions. In addition, while the concept of (strict) monotonicity in numerical schemes is directly tied to the discrete no-arbitrage principle, making it easy to comprehend, weak monotonicity is less transparent, which further hinders its application in practice. Moreover, the convergence analysis of weakly monotone schemes is often complex, potentially introducing additional obstacles to their practical application.

This paper aims to fill the aforementioned research gap through the development of an efficient, easy-to-implement, and monotone numerical integration method for MV portfolio optimization under a realistic setting. This setting involves the simultaneous application of different types of investment constraints and jump-diffusion dynamics for the price processes of risky assets. In our method, the guarantee of the key monotonicity property comes with a trade-off: a modest level of analytical tractability is required for the jump sizes. Essentially, this stipulates that the associated jump-diffusion model should admit a closed-form expression for European vanilla options. This condition, however, conveniently accommodates a number of widely used models in finance, making it a relatively mild requirement in practice. To demonstrate this, we present results for two popular distributions for jump sizes in finance: the normal distribution as put forth by [49], and the asymmetric double-exponential distribution as proposed in [36]. Although we focus on the pre-commitment strategy case, the proposed method can be extended to time-consistent MV optimization in a straightforward manner by enforcing a time-consistency constraint as in [65].

The main contributions of the paper are as follows.

  • (i)

    We present a recursive and localized formulation of pre-commitment MV portfolio optimization under a realistic context that involves the simultaneous application of different types of investment constraints and jump-diffusion dynamics [36, 49]. Over each rebalancing interval, the key component of the formulation is a convolution integral involving a conditional density of the logarithm of the amount invested in the risky asset.

  • (ii)

    Through a known closed-form expression of the Fourier transform of the underlying transition density, we derive an infinite series representation for this density in which all the terms of the series are non-negative and readily computable explicitly. Therefore, the convolution integral can be approximated in a straightforward manner using a monotone integration scheme via a composite quadrature rule. A significant benefit of the proposed method compared to existing monotone finite difference methods is that it offers a computational complexity that is an order of magnitude lower. Utilizing the Toeplitz matrix structure, we propose an efficient implementation of the proposed monotone integration scheme via FFTs.

  • (iii)

    We mathematically demonstrate that the proposed monotone scheme is also \ell_{\infty}-stable and pointwise consistent with the convolution integral formulation. We rigorously prove the pointwise convergence of the scheme as the discretization parameter approaches zero. As the rebalancing time interval approaches zero, we intuitively demonstrate that the proposed scheme converges to a continuously observed impulse control formulation for MV optimization in the form of a Hamilton-Jacobi-Bellman Quasi-Variational Inequality.

  • (iv)

    All numerical experiments are conducted using model parameters calibrated to inflation-adjusted, long-term US market data (89 years), enabling realistic conclusions to be drawn from the results. Numerical experiments demonstrate agreement with benchmark results obtained by the FD method and Monte Carlo simulation of [19].

Although we focus specifically on monotone integration methods for multi-period MV portfolio optimization, our comprehensive and systematic approach could serve as a numerical and convergence analysis framework for the development of similar monotone integration methods for other multi-period or continuously observed control problems in finance.

In Section 2, we describe the underlying dynamics and a multi-period rebalancing framework for MV portfolio optimization. A localization of the pre-commitment MV portfolio optimization problem in the form of a convolution integral, together with appropriate boundary conditions, is presented in Section 3. Also therein, we present an infinite series representation of the transition density. A simple and easy-to-implement monotone numerical integration method via a composite quadrature rule is described in Section 4. In Section 5, we mathematically establish pointwise convergence of the proposed integration method. Section 6 explores possible convergence of the proposed scheme to a Hamilton-Jacobi-Bellman Quasi-Variational Inequality arising from a continuously observed impulse control formulation for MV optimization. Numerical results are given in Section 7. Section 8 concludes the paper and outlines possible future work.

2 Modelling

We consider portfolios consisting of a risk-free asset and a well-diversified stock index (the risky asset). With respect to the risk-free asset, we consider different lending and borrowing rates. Specifically, we denote by r_{b} and r_{\iota} the positive, continuously compounded rates at which the investor can respectively borrow funds or earn on cash deposits (with r_{b}>r_{\iota}). We make the standard assumption that the real world drift rate \mu of the risky asset is strictly greater than r_{\iota}. Since there is only one risky asset, with a constant risk-aversion parameter, it is never MV-optimal to short the stock. Therefore, the amount invested in the risky asset is non-negative for all t\in[0,T], where T>0 denotes the fixed investment time horizon or maturity. In contrast, we do allow short positions in the risk-free asset, i.e. it is possible that the amount invested in the risk-free asset is negative. With this in mind, we denote by B_{t}\equiv B(t) the time-t amount invested in the risk-free asset and by S_{t}\equiv S(t) the natural logarithm of the time-t amount invested in the risky asset (so that e^{S_{t}} is this amount).

For defining the jump-diffusion model dynamics, let \xi be a random variable denoting the jump size. For any functional f, we let f_{t^{-}}:=\lim_{\epsilon\rightarrow 0^{+}}f_{t-\epsilon} and f_{t^{+}}:=\lim_{\epsilon\rightarrow 0^{+}}f_{t+\epsilon}. Informally, t^{-} (resp. t^{+}) denotes the instant of time immediately before (resp. after) the forward time t\in[0,T]. When a jump occurs, we have S_{t}=S_{t^{-}}+\xi.

2.1 Discrete portfolio rebalancing

Define \mathcal{T}_{M} as the set of M predetermined, equally spaced rebalancing times in [0,T],

\mathcal{T}_{M}=\left\{t_{m}\,|\,t_{m}=m\Delta t,\;m=0,\ldots,M-1\right\},\quad\Delta t=T/M.   (2.1)

We adopt the convention that t_{M}=T and that the portfolio is not rebalanced at the end of the investment horizon t_{M}=T. The evolution of the portfolio over a rebalancing interval [t_{m-1},t_{m}], t_{m-1}\in\mathcal{T}_{M}, can be viewed as consisting of the following three steps. Over [t_{m-1},t_{m-1}^{+}], (S_{t},B_{t}) change according to some rebalancing strategy (i.e. an impulse control). Over the time period [t_{m-1}^{+},t_{m}^{-}], there is no intervention by the investor, so (S_{t},B_{t}) are uncontrolled and are assumed to follow some dynamics for all t\in[t_{m-1}^{+},t_{m}^{-}]. Over [t_{m}^{-},t_{m}], the settlement (payment or receipt) of interest due for the time period [t_{m-1},t_{m}] takes place. In the following, we first discuss stochastic modeling for (S_{t},B_{t}) over [t_{m-1}^{+},t_{m}^{-}], then describe the settlement of interest and the modelling of rebalancing strategies using impulse controls.

Over the time period [t_{m-1}^{+},t_{m}^{-}], in the absence of control (investor's intervention according to some control strategy), the amounts in the risk-free and risky assets are assumed to have the following dynamics:

\mathrm{d}B_{t}=\mathcal{R}\left(B_{t}\right)B_{t}\,\mathrm{d}t,\quad\text{where }\mathcal{R}\left(B_{t}\right)=r_{\iota}+(r_{b}-r_{\iota})\mathbb{I}_{\{B_{t}<0\}},   (2.2)
\mathrm{d}S_{t}=\left(\mu-\lambda\kappa-\frac{\sigma^{2}}{2}\right)\mathrm{d}t+\sigma\,\mathrm{d}W_{t}+\mathrm{d}\left(\sum_{\ell=1}^{\pi_{t}}\xi_{\ell}\right),\quad t\in[t_{m-1}^{+},t_{m}^{-}].

Here, as noted earlier, r_{b} and r_{\iota} denote the positive, continuously compounded rates at which the investor can respectively borrow funds or earn on cash deposits (with r_{b}>r_{\iota}), while \mathbb{I}_{\left[A\right]} denotes the indicator function of the event A; \{W_{t}\}_{t\in[0,T]} is a standard Wiener process, and \mu and \sigma are the real world drift rate and the instantaneous volatility, respectively. The jump term \sum_{\ell=1}^{\pi_{t}}\xi_{\ell} is a compound Poisson process. Specifically, \{\pi_{t}\}_{0\leq t\leq T} is a Poisson process with a constant finite jump intensity \lambda\geq 0; and, with \xi being a random variable representing the jump size, \{\xi_{\ell}\}_{\ell=1}^{\infty} are independent and identically distributed (i.i.d.) random variables having the same distribution as \xi. In the dynamics (2.2), \kappa=\mathbb{E}\left[e^{\xi}-1\right], where \mathbb{E}[\cdot] is the expectation operator taken under a suitable measure. The Poisson process \{\pi_{t}\}_{0\leq t\leq T}, the sequence of random variables \{\xi_{\ell}\}_{\ell=1}^{\infty}, and the Wiener process \{W_{t}\}_{0\leq t\leq T} are mutually independent.
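As an illustration of the dynamics (2.2) for the risky asset, the short Python sketch below (ours, not from the paper; all parameter values are placeholders) draws exact samples of the increment of S over one rebalancing interval: a Brownian component with drift plus a compound Poisson sum of jump sizes.

    import numpy as np

    def sample_S_increment(dt, mu, sigma, lam, kappa, jump_sampler, n_paths, rng):
        # Exact samples of S_{t_m^-} - S_{t_{m-1}^+} under (2.2);
        # kappa = E[e^xi - 1], and jump_sampler(k, rng) returns k i.i.d. jump sizes xi.
        drift = (mu - lam * kappa - 0.5 * sigma ** 2) * dt
        diffusion = sigma * np.sqrt(dt) * rng.standard_normal(n_paths)
        n_jumps = rng.poisson(lam * dt, size=n_paths)              # Poisson jump counts
        jump_sum = np.array([jump_sampler(k, rng).sum() for k in n_jumps])
        return drift + diffusion + jump_sum

    # Example: normal (Merton-type) jump sizes with placeholder parameters.
    mu_j, sig_j = -0.05, 0.40
    kappa = np.exp(mu_j + 0.5 * sig_j ** 2) - 1.0                  # E[e^xi] - 1 for normal xi
    jumps = lambda k, rng: rng.normal(mu_j, sig_j, size=k)
    rng = np.random.default_rng(1)
    incr = sample_S_increment(dt=1.0, mu=0.08, sigma=0.15, lam=0.3, kappa=kappa,
                              jump_sampler=jumps, n_paths=10_000, rng=rng)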

We consider two distributions for the random variable \xi, namely the normal distribution [49] and the asymmetric double-exponential distribution [36]. To this end, let p(y) be the probability density function (pdf) of \xi. In the former case, \xi\sim\text{Normal}\left(\widetilde{\mu},\widetilde{\sigma}^{2}\right), so that its pdf is given by

p(y)=\frac{1}{\sqrt{2\pi\widetilde{\sigma}^{2}}}\exp\left\{-\frac{\left(y-\widetilde{\mu}\right)^{2}}{2\widetilde{\sigma}^{2}}\right\}.   (2.3)

Also, in this case, \mathbb{E}[e^{\xi}]=\exp(\widetilde{\mu}+\widetilde{\sigma}^{2}/2), and hence \kappa=\mathbb{E}\left[e^{\xi}-1\right] can be computed accordingly. In the latter case, we consider an asymmetric double-exponential distribution for \xi. Specifically, we consider \xi\sim\text{Asym-Double-Exponential}(q_{1},\eta_{1},\eta_{2}), with q_{1}\in(0,1), \eta_{1}>1, \eta_{2}>0, so that its pdf is given by

p(y)=q_{1}\eta_{1}e^{-\eta_{1}y}\mathbb{I}_{\left[y\geq 0\right]}+q_{2}\eta_{2}e^{\eta_{2}y}\mathbb{I}_{\left[y<0\right]},\quad q_{1}+q_{2}=1.   (2.4)

Here, q_{1} and q_{2}=1-q_{1} are respectively the probabilities of upward and downward jumps. In this case, \mathbb{E}[e^{\xi}]=\frac{q_{1}\eta_{1}}{\eta_{1}-1}+\frac{q_{2}\eta_{2}}{\eta_{2}+1}, so \kappa=\mathbb{E}\left[e^{\xi}-1\right] can be computed accordingly.
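Both expectations above reduce to one-line computations; the sketch below (ours, with placeholder parameter values) evaluates \kappa=\mathbb{E}[e^{\xi}-1] for the two jump-size distributions.

    import numpy as np

    def kappa_normal(mu_j, sig_j):
        # E[e^xi] = exp(mu_j + sig_j^2 / 2) for xi ~ Normal(mu_j, sig_j^2)
        return np.exp(mu_j + 0.5 * sig_j ** 2) - 1.0

    def kappa_double_exponential(q1, eta1, eta2):
        # E[e^xi] = q1*eta1/(eta1 - 1) + q2*eta2/(eta2 + 1); requires eta1 > 1
        q2 = 1.0 - q1
        return q1 * eta1 / (eta1 - 1.0) + q2 * eta2 / (eta2 + 1.0) - 1.0

    print(kappa_normal(-0.05, 0.40))                 # normal (Merton-type) jump sizes
    print(kappa_double_exponential(0.3, 3.0, 2.0))   # double-exponential (Kou-type) jump sizes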

2.2 Impulse controls

Discrete portfolio rebalancing is modelled using the discrete impulse control formulation discussed in, for example, [19, 65, 66], which we now briefly summarize. Let c_{m} denote the impulse applied at rebalancing time t_{m}\in\mathcal{T}_{M}, which corresponds to the amount invested in the risk-free asset according to the investor's intervention at time t_{m}, and let \mathcal{Z} denote the set of admissible impulse values, i.e. c_{m}\in\mathcal{Z} for all t_{m}\in\mathcal{T}_{M}.

Let X_{t}=\left(S_{t},B_{t}\right), t\in\left[0,T\right], be the multi-dimensional underlying process, and let x=(s,b) denote the state of the system. Suppose that at time t_{m}, the state of the system is x=\left(s,b\right)=\left(S\left(t_{m}\right),B\left(t_{m}\right)\right) for some t_{m}\in\mathcal{T}_{M}. We denote by (S_{t_{m}^{+}},B_{t_{m}^{+}})\equiv\left(s^{+}(s,b,c_{m}),b^{+}(s,b,c_{m})\right) the state of the system immediately after the application of the impulse c_{m} at time t_{m}, where

S_{t_{m}^{+}}\equiv s^{+}(s,b,c_{m})=\ln\left(\max(e^{s}+b-c_{m}-\delta,\,e^{s_{-\infty}})\right),\quad B_{t_{m}^{+}}\equiv b^{+}(s,b,c_{m})=c_{m},\quad t_{m}\in\mathcal{T}_{M}.   (2.5)

Here, \delta\geq 0 is a fixed cost. (It is straightforward to include a proportional cost in (2.5) as in [19]; however, to focus on the main advantages of the proposed method, we do not consider a proportional cost in this work.) Since \ln(\cdot) is undefined if e^{s}+b-c_{m}-\delta\leq 0, the amount S_{t_{m}^{+}} becomes \ln\left(\max(e^{s}+b-c_{m}-\delta,\,e^{s_{-\infty}})\right) for a finite s_{-\infty}\ll 0.

Associated with the fixed set of rebalancing times \mathcal{T}_{M}, defined in (2.1), an impulse control \mathcal{C} will be written as the set of impulse values

\mathcal{C}=\left\{c_{m}\,|\,c_{m}\in\mathcal{Z},\,m=0,\ldots,M-1\right\},   (2.6)

and we define \mathcal{C}_{m} to be the subset of the control \mathcal{C} applicable to the set of times \left\{t_{m},\ldots,t_{M-1}\right\},

\mathcal{C}_{m}=\left\{c_{l}\,|\,c_{l}\in\mathcal{Z},\,l=m,\ldots,M-1\right\}\subset\mathcal{C}_{0}\equiv\mathcal{C}.   (2.7)

In a discrete setting, the amount invested in the risk-free asset changes only at rebalancing dates. Specifically, over each time interval \left[t_{m-1},t_{m}\right], m=1,\ldots,M, we suppose that the amount invested in the risk-free asset at time t_{m-1}^{+}, i.e. after rebalancing, is B_{t_{m-1}^{+}}=b. For a test function f(S_{t},B_{t},t) with both S_{t} and B_{t} varying, we model the change in f(S_{t},B_{t},t) with (S_{t},B_{t}=b) for t\in[t_{m-1}^{+},t_{m}^{-}]. Then, the amount in the risk-free asset jumps to be^{R(b)\Delta t} at time t_{m}, reflecting the settlement (payment or receipt) of interest due for the time interval [t_{m-1},t_{m}], m=1,\ldots,M. Here, we note that, although there is no rebalancing at time t_{M}=T, there is still settlement of interest for the interval [t_{M-1},t_{M}].

2.3 Investment constraints

With the time-t state of the system being (s,b), to include the transaction cost, we define the liquidation value W_{\text{liq}}(t)\equiv W_{\text{liq}}(s,b) to be

W_{\text{liq}}(t)\equiv W_{\text{liq}}(s,b)=e^{s}+b-\delta,\quad t\in[0,T].   (2.8)

We strictly enforce two realistic investment constraints on the joint values of S and B, namely a solvency condition and a maximum leverage condition. The solvency condition takes the following form: when W_{\text{liq}}(s,b)\leq 0, we require that the position in the risky asset be liquidated, the total remaining wealth be placed in the risk-free asset, and all subsequent trading activities cease. Specifically, assume that the system is in the state x=\left(s,b\right)\in\Omega^{\infty} at time t_{m}, where t_{m}\in\mathcal{T}_{M} and

\Omega^{\infty}=\left(-\infty,\infty\right)\times\left(-\infty,\infty\right).   (2.9)

We define a solvency region \mathcal{N} and an insolvency or bankruptcy region \mathcal{B} as follows

\mathcal{N}=\left\{\left(s,b\right)\in\Omega^{\infty}:\,W_{\text{liq}}(s,b)>0\right\},\quad\mathcal{B}=\left\{\left(s,b\right)\in\Omega^{\infty}:\,W_{\text{liq}}(s,b)\leq 0\right\},\quad W_{\text{liq}}(s,b)\text{ defined in (2.8)}.   (2.10)

The solvency constraint can then be stated as

\text{If }\left(s,b\right)\in\mathcal{B}\text{ at }t_{m}\;\Rightarrow\;\begin{cases}\text{we require }\left(S_{t_{m}^{+}}=s_{-\infty},\,B_{t_{m}^{+}}=W_{\text{liq}}(s,b)\right)\\ \text{and }S_{t}\text{ remains so }\forall t\in\left[t_{m}^{+},T\right],\end{cases}   (2.11)

where, as noted above, s_{-\infty}\ll 0 and is finite. This effectively means that the investment in the risky asset has to be liquidated, the total wealth is to be placed in the risk-free asset, and all subsequent trading activities must cease.

The maximum leverage constraint specifies that the leverage ratio after rebalancing at t_{m}\in\mathcal{T}_{M}, as stipulated by (2.5), must satisfy

\frac{\exp(S_{t_{m}^{+}})}{\exp(S_{t_{m}^{+}})+B_{t_{m}^{+}}}\leq q_{\max},   (2.12)

for some positive constant q_{\max}, typically in the range \left[1.0,2.0\right]. Given the above solvency and maximum leverage constraints, the set of admissible impulse values \mathcal{Z} is defined as follows

\mathcal{Z}=\begin{cases}\Big\{c_{m}\equiv B_{t_{m}^{+}}\in\mathbb{R}:(S_{t_{m}^{+}},B_{t_{m}^{+}})\text{ via (2.5)}\Big\}&\text{no constraints},\\ \begin{cases}\left\{c_{m}\equiv B_{t_{m}^{+}}\in\mathbb{R}:(S_{t_{m}^{+}},B_{t_{m}^{+}})\text{ via (2.5), s.t.\ }S_{t_{m}^{+}}\geq s_{-\infty}\text{ and (2.12)}\right\}&(s,b)\in\mathcal{N}\\ \left\{c_{m}=W_{\text{liq}}(s,b)\right\}&(s,b)\in\mathcal{B}\end{cases}&\text{solvency \& maximum leverage}\end{cases}

Based on the definition (2.7), the set of admissible impulse controls is given by

\mathcal{A}=\left\{\mathcal{C}_{m}\,|\,\mathcal{C}_{m}\text{ defined in (2.7)},\,m=0,\ldots,M-1\right\}.   (2.13)
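As a concrete reading of (2.5), (2.11) and (2.12), the sketch below (our own illustration; the parameter values are assumptions) maps a state (s,b) and an impulse c to the post-rebalancing state and flags whether the impulse is admissible under the solvency and maximum leverage constraints.

    import numpy as np

    S_NEG_INF = -20.0   # placeholder for the finite, large-negative value s_{-infinity}

    def rebalance(s, b, c, delta=0.0):
        # Post-rebalancing state (S_{t_m^+}, B_{t_m^+}) as in (2.5).
        s_plus = np.log(max(np.exp(s) + b - c - delta, np.exp(S_NEG_INF)))
        return s_plus, c

    def is_admissible(s, b, c, q_max=1.5, delta=0.0):
        # Solvency (2.11) and maximum leverage (2.12) checks for an impulse c.
        w_liq = np.exp(s) + b - delta
        if w_liq <= 0.0:                       # bankruptcy region: forced liquidation
            return np.isclose(c, w_liq)
        s_plus, b_plus = rebalance(s, b, c, delta)
        leverage = np.exp(s_plus) / (np.exp(s_plus) + b_plus)
        return s_plus >= S_NEG_INF and leverage <= q_max

    # All-in-stock impulse c = 0 from a state with e^s = 1 and b = 0.5:
    # post-rebalancing leverage is 1.0 <= q_max = 1.5, so the impulse is admissible.
    print(is_admissible(s=0.0, b=0.5, c=0.0))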

3 Formulation

Let E_{\mathcal{C}_{m}}^{x,t_{m}}\left[W_{\text{liq}}(T)\right] and Var_{\mathcal{C}_{m}}^{x,t_{m}}\left[W_{\text{liq}}(T)\right] respectively denote the mean and variance of the terminal liquidation wealth, given the system state x=(s,b) at time t_{m} for some t_{m}\in\mathcal{T}_{M}, following the control \mathcal{C}_{m}\in\mathcal{A} over [t_{m},T], and assuming the underlying dynamics (2.2). The standard scalarization method for multi-criteria optimization problems in [80] gives the mean-variance (MV) objective as

\sup_{\mathcal{C}_{m}\in\mathcal{A}}\left\{E_{\mathcal{C}_{m}}^{x,t_{m}}\left[W_{\text{liq}}(T)\right]-\rho\cdot Var_{\mathcal{C}_{m}}^{x,t_{m}}\left[W_{\text{liq}}(T)\right]\right\},   (3.1)

where the scalarization parameter \rho>0 reflects the investor's risk aversion level.

3.1 Value function

Dynamic programming cannot be applied directly to (3.1), since the variance term does not satisfy the smoothing property of conditional expectations. The technique of [38, 84] embeds (3.1) in a new optimization problem, often referred to as the embedding problem, which is amenable to dynamic programming techniques. We follow [13, 22] in defining the PCMV optimization problem as the associated embedding MV problem. (For a discussion of the elimination of spurious optimization results when using the embedding formulation, see [23].) Specifically, with \gamma\in\mathbb{R} being the embedding parameter, we define the value function v\left(s,b,t_{m}\right), m=M-1,\ldots,0, as follows

(PCMV_{\Delta t}(t_{m};\gamma)):\quad v\left(s,b,t_{m}\right)=\inf_{\mathcal{C}_{m}\in\mathcal{A}}E_{\mathcal{C}_{m}}^{x,t_{m}}\left[\left(W_{\text{liq}}(T)-\frac{\gamma}{2}\right)^{2}\right],\quad\gamma\in\mathbb{R},\quad m=0,\ldots,M-1,   (3.2)

where W_{\text{liq}}(T) is given in (2.8), subject to the dynamics (2.2) between rebalancing times. We denote by \mathcal{C}_{m}^{*} the optimal control for the problem PCMV_{\Delta t}(t_{m};\gamma), where

\mathcal{C}_{m}^{*}=\left\{c_{m}^{*},\ldots,c_{M-1}^{*}\right\},\quad m=0,\ldots,M-1.   (3.3)

For an impulse value c\in\mathcal{Z}, we define the intervention operator \mathcal{M}(\cdot) applied at t_{m}\in\mathcal{T}_{M} as follows

\mathcal{M}(c)\,v(s,b,t_{m}^{+})=v\left(s^{+}(s,b,c),b^{+}(s,b,c),t_{m}^{+}\right),\quad\text{where }s^{+}(s,b,c)\text{ and }b^{+}(s,b,c)\text{ are given in (2.5)}.   (3.4)

By dynamic programming arguments [55, 57], for a fixed embedding parameter \gamma\in\mathbb{R} and (s,b)\in\Omega^{\infty}, the recursive relationship for the value function v(s,b,t_{m}) in (3.2) is given by

v\left(s,b,t_{m}\right)=\left(W_{\text{liq}}(s,b)-\frac{\gamma}{2}\right)^{2},\qquad m=M,   (3.5a)
v\left(s,b,t_{m}\right)=\min\left\{v\left(s,b,t_{m}^{+}\right),\,\inf_{c\in\mathcal{Z}}\mathcal{M}(c)\,v\left(s,b,t_{m}^{+}\right)\right\},\qquad m=M-1,\ldots,0,   (3.5b)
v\left(s,b,t_{m}^{-}\right)=v\left(s,be^{R(b)\Delta t},t_{m}\right),\qquad m=M,\ldots,1,   (3.5c)
v(s,b,t_{m-1}^{+})=\int_{-\infty}^{\infty}v\left(s^{\prime},b,t_{m}^{-}\right)g(s,s^{\prime};\Delta t)\,\mathrm{d}s^{\prime},\qquad m=M,\ldots,1.   (3.5d)

Here, in (3.5b), the intervention operator \mathcal{M}(\cdot) is given by (3.4), with the \min\{\cdot,\cdot\} operator reflecting the optimal choice between no-rebalancing and rebalancing (which is subject to a fixed cost \delta); (3.5c) reflects the settlement (payment or receipt) of interest due for the time interval [t_{m-1},t_{m}], m=1,\ldots,M. In the integral (3.5d), the function g(s,s^{\prime};\Delta t) denotes the conditional probability density of s^{\prime}, the log of the amount invested in the risky asset at the future time t_{m}^{-}, given the information s at the current time t_{m-1}^{+}, with \Delta t=t_{m}-t_{m-1}. Also, we note that the fact that the amount invested in the risk-free asset does not change in the interval [t_{m-1}^{+},t_{m}^{-}] is reflected in (3.5d), since this amount is kept constant (=b) on both sides of (3.5d).

It can be shown that g(s,s^{\prime};\Delta t) has the form g(s-s^{\prime};\Delta t), and therefore, in (3.5d), the integral takes the form of a convolution of g(\cdot) and v(\cdot,t_{m}^{-}). That is, (3.5d) becomes

v(s,b,t_{m-1}^{+})=\int_{-\infty}^{\infty}v\left(s^{\prime},b,t_{m}^{-}\right)g(s-s^{\prime};\Delta t)\,\mathrm{d}s^{\prime},\quad m=M,\ldots,1.   (3.6)
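The recursion (3.5a)-(3.5d) translates directly into a backward loop over rebalancing dates. The outline below is our own schematic rendering in Python (the function names and the three helper arguments are assumptions, not the paper's code); it is only meant to show the order of the four operations, with linear interpolation used in the intervention step to preserve monotonicity.

    import numpy as np

    def solve_mv_backward(s_grid, b_grid, n_rebalance, dt, gamma, delta,
                          convolve_density, rate_of, impulse_indices):
        # Schematic backward recursion for (3.5a)-(3.5d).
        #   convolve_density(v, dt): applies the s-direction convolution (3.6) column-wise
        #   rate_of(b):              returns R(b), i.e. r_l or r_b depending on the sign of b
        #   impulse_indices(s, b):   indices j' of admissible impulses c = b_grid[j']
        # All three helpers are placeholders for the operators described in the text.
        S, B = np.meshgrid(s_grid, b_grid, indexing="ij")
        v = (np.exp(S) + B - delta - gamma / 2.0) ** 2            # (3.5a), m = M
        for _ in range(n_rebalance):                              # m = M, ..., 1
            v_minus = np.empty_like(v)                            # (3.5c) interest settlement
            for j, b in enumerate(b_grid):                        # (values beyond b_max are
                b_acc = b * np.exp(rate_of(b) * dt)               #  clamped here; the paper
                for n in range(len(s_grid)):                      #  instead uses (3.19))
                    v_minus[n, j] = np.interp(b_acc, b_grid, v[n, :])
            v_plus = convolve_density(v_minus, dt)                # (3.5d) / (3.6)
            v = v_plus.copy()                                     # (3.5b) intervention
            for j, b in enumerate(b_grid):
                for n, s in enumerate(s_grid):
                    for jp in impulse_indices(s, b):
                        c = b_grid[jp]
                        s_plus = np.log(max(np.exp(s) + b - c - delta, np.exp(s_grid[0])))
                        v[n, j] = min(v[n, j], np.interp(s_plus, s_grid, v_plus[:, jp]))
        return v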

Although a closed-form expression for g(s;\Delta t) is not known to exist, its Fourier transform, denoted by G(\cdot;\Delta t), is known in closed form. Specifically, we recall the Fourier transform pair

\mathfrak{F}[g(s;\cdot)]=G(\eta;\cdot)=\int_{-\infty}^{\infty}e^{-i\eta s}g(s;\cdot)\,\mathrm{d}s,\qquad\mathfrak{F}^{-1}[G(\eta;\cdot)]=g(s;\cdot)=\frac{1}{2\pi}\int_{-\infty}^{\infty}e^{i\eta s}G(\eta;\cdot)\,\mathrm{d}\eta.   (3.7)

A closed-form expression for G\left(\eta;\Delta t\right) is given by

G\left(\eta;\Delta t\right)=\exp\left(\Psi\left(\eta\right)\Delta t\right),\quad\text{with}\quad\Psi(\eta)=-\frac{\sigma^{2}\eta^{2}}{2}+\left(\mu-\lambda\kappa-\frac{\sigma^{2}}{2}\right)\left(i\eta\right)-\lambda+\lambda\Gamma\left(\eta\right).   (3.8)

Here, \Gamma\left(\eta\right)=\int_{-\infty}^{\infty}p(y)\,e^{i\eta y}\,\mathrm{d}y, where p(y) is the probability density function of the jump size \xi, given by (2.3) or (2.4).
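The characteristic exponent \Psi and the transform G in (3.8) are straightforward to evaluate numerically. The sketch below (ours; all parameter values are placeholders) does so for both jump-size distributions, using \Gamma(\eta)=\exp(i\eta\widetilde{\mu}-\widetilde{\sigma}^{2}\eta^{2}/2) for normal jump sizes and \Gamma(\eta)=q_{1}\eta_{1}/(\eta_{1}-i\eta)+q_{2}\eta_{2}/(\eta_{2}+i\eta) for double-exponential jump sizes.

    import numpy as np

    def gamma_fn(eta, model, prm):
        # Gamma(eta) = E[exp(i * eta * xi)] for the two jump-size distributions.
        if model == "normal":                                    # pdf (2.3)
            return np.exp(1j * eta * prm["mu_j"] - 0.5 * prm["sig_j"] ** 2 * eta ** 2)
        q1, e1, e2 = prm["q1"], prm["eta1"], prm["eta2"]         # pdf (2.4)
        return q1 * e1 / (e1 - 1j * eta) + (1.0 - q1) * e2 / (e2 + 1j * eta)

    def G(eta, dt, mu, sigma, lam, kappa, model, prm):
        # Closed-form Fourier transform G(eta; dt) = exp(Psi(eta) * dt), as in (3.8).
        psi = (-0.5 * sigma ** 2 * eta ** 2
               + (mu - lam * kappa - 0.5 * sigma ** 2) * 1j * eta
               - lam + lam * gamma_fn(eta, model, prm))
        return np.exp(psi * dt)

    prm = {"mu_j": -0.05, "sig_j": 0.40}                         # placeholder normal jumps
    kappa = np.exp(prm["mu_j"] + 0.5 * prm["sig_j"] ** 2) - 1.0
    eta = np.linspace(-20.0, 20.0, 5)
    print(G(eta, dt=1.0, mu=0.08, sigma=0.15, lam=0.3, kappa=kappa, model="normal", prm=prm))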

3.2 An infinite series representation of g(\cdot)

The proposed monotone integration method depends on an infinite series representation of the probability density function g(\cdot), which is presented in Lemma 3.1.

Lemma 3.1.

Let g(s;\Delta t) and G(\eta;\Delta t) be the Fourier transform pair defined in (3.7), where G(\eta;\Delta t) is given in (3.8). Then g(s;\Delta t)\equiv g\left(s;\Delta t,\infty\right) can be written as

g\left(s;\Delta t,\infty\right)=\frac{1}{\sqrt{4\pi\alpha}}\sum_{k=0}^{\infty}\frac{\left(\lambda\Delta t\right)^{k}}{k!}\int_{-\infty}^{\infty}\cdots\int_{-\infty}^{\infty}\exp\left(\theta-\frac{\left(\beta+s+Y_{k}\right)^{2}}{4\alpha}\right)\left(\prod_{\ell=1}^{k}p(y_{\ell})\right)\mathrm{d}y_{1}\ldots\mathrm{d}y_{k},
\text{where}\quad\alpha=\frac{\sigma^{2}}{2}\Delta t,\quad\beta=\left(\mu-\lambda\kappa-\frac{\sigma^{2}}{2}\right)\Delta t,\quad\theta=-\lambda\Delta t,\quad Y_{k}=\sum_{\ell=1}^{k}y_{\ell},\quad Y_{0}=0,   (3.9)

and p(y) is the PDF of the random variable \xi. When k=0, we have g\left(s;\Delta t,0\right)=\frac{1}{\sqrt{4\pi\alpha}}\exp\left(\theta-\frac{\left(\beta+s\right)^{2}}{4\alpha}\right).

A proof of Lemma 3.1 is given in Appendix A.

The infinite series representation in (3.9) cannot be applied directly for computation, as the k-th term of the series involves a multiple integral with the product \prod_{\ell=1}^{k}p(y_{\ell}), where p(y) denotes the probability density of \xi. To feasibly evaluate this multiple integral analytically, as noted earlier, our method requires a quite modest level of analytical tractability for the random variable \xi. More precisely, we need the associated jump-diffusion model of the form \frac{\mathrm{d}X_{t}}{X_{t^{-}}}=(\mu-\lambda\kappa)\,\mathrm{d}t+\sigma\,\mathrm{d}W_{t}+\mathrm{d}\left(\sum_{\ell=1}^{\pi_{t}}(\xi_{\ell}-1)\right) to yield a closed-form expression for European vanilla options. Arguments in [17, Section 3.4.1] can be invoked to demonstrate this relationship. Thus, the requirement of analytical tractability for the random variable \xi is not overly stringent, thereby broadening the applicability of our approach in practice.

Next, we demonstrate the applicability of our approach for two popular distributions for \xi in finance: when \xi follows a normal distribution [49] or an asymmetric double-exponential distribution [36].

Corollary 3.1.

For the case \xi\sim\text{Normal}\left(\widetilde{\mu},\widetilde{\sigma}^{2}\right), whose PDF is given by (2.3), the infinite series representation of the conditional density g(s;\Delta t,\infty) given in Lemma 3.1 evaluates to

g(s;\Delta t,\infty)=g(s;\Delta t,0)+\sum_{k=1}^{\infty}\Delta g_{k}(s;\Delta t),   (3.10)
\text{where}\quad g(s;\Delta t,0)=\frac{\exp\left(\theta-\frac{\left(\beta+s\right)^{2}}{4\alpha}\right)}{\sqrt{4\pi\alpha}},\quad\text{and}\quad\Delta g_{k}(s;\Delta t)=\frac{\left(\lambda\Delta t\right)^{k}}{k!}\,\frac{\exp\left(\theta-\frac{\left(\beta+s+k\widetilde{\mu}\right)^{2}}{4\alpha+2k\widetilde{\sigma}^{2}}\right)}{\sqrt{4\pi\alpha+2\pi k\widetilde{\sigma}^{2}}},

with \alpha, \beta and \theta given in (3.9).

For the case \xi\sim\text{Asym-Double-Exponential}(q_{1},\eta_{1},\eta_{2}), with q_{1}\in(0,1), \eta_{1}>1, \eta_{2}>0, whose PDF is given by (2.4), the infinite series representation of the conditional density g(s;\Delta t,\infty) given in Lemma 3.1 evaluates to g(s;\Delta t,\infty)=g(s;\Delta t,0)+\sum_{k=1}^{\infty}\Delta g_{k}(s;\Delta t), where g(s;\Delta t,0)=\frac{\exp\left(\theta-\frac{\left(\beta+s\right)^{2}}{4\alpha}\right)}{\sqrt{4\pi\alpha}}, and

\Delta g_{k}(s;\Delta t)=\frac{e^{\theta}}{\sqrt{4\pi\alpha}}\frac{\left(\lambda\Delta t\right)^{k}}{k!}\left[\sum_{\ell=1}^{k}Q_{1}^{k,\ell}\left(\eta_{1}\sqrt{2\alpha}\right)^{\ell}\mathrm{e}^{\eta_{1}\left(\beta+s-s^{\prime}\right)+\eta_{1}^{2}\alpha}\,\mathrm{Hh}_{\ell-1}\left(\eta_{1}\sqrt{2\alpha}+\frac{\beta+s-s^{\prime}}{\sqrt{2\alpha}}\right)\right.
\qquad\left.+\sum_{\ell=1}^{k}Q_{2}^{k,\ell}\left(\eta_{2}\sqrt{2\alpha}\right)^{\ell}\mathrm{e}^{-\eta_{2}\left(\beta+s-s^{\prime}\right)+\eta_{2}^{2}\alpha}\,\mathrm{Hh}_{\ell-1}\left(\eta_{2}\sqrt{2\alpha}-\frac{\beta+s-s^{\prime}}{\sqrt{2\alpha}}\right)\right].   (3.11)

Here, \alpha, \beta and \theta are given in (3.9); Q_{1}^{k,\ell}, Q_{2}^{k,\ell} and \mathrm{Hh}_{\ell} are defined as follows

Q_{1}^{k,\ell}=\sum_{i=\ell}^{k-1}\binom{k-\ell-1}{i-\ell}\binom{k}{i}\left(\frac{\eta_{1}}{\eta_{1}+\eta_{2}}\right)^{i-\ell}\left(\frac{\eta_{2}}{\eta_{1}+\eta_{2}}\right)^{k-i}q_{1}^{i}q_{2}^{k-i},\quad 1\leq\ell\leq k-1,
Q_{2}^{k,\ell}=\sum_{i=\ell}^{k-1}\binom{k-\ell-1}{i-\ell}\binom{k}{i}\left(\frac{\eta_{1}}{\eta_{1}+\eta_{2}}\right)^{k-i}\left(\frac{\eta_{2}}{\eta_{1}+\eta_{2}}\right)^{i-\ell}q_{1}^{k-i}q_{2}^{i},\quad 1\leq\ell\leq k-1,   (3.12)

where q_{1}+q_{2}=1, Q_{1}^{k,k}=q_{1}^{k} and Q_{2}^{k,k}=q_{2}^{k}, and

\mathrm{Hh}_{\ell}(x)=\frac{1}{\ell!}\int_{x}^{\infty}\left(y-x\right)^{\ell}e^{-\frac{1}{2}y^{2}}\,\mathrm{d}y,\quad\text{with }\mathrm{Hh}_{-1}(x)=e^{-x^{2}/2}\text{ and }\mathrm{Hh}_{0}(x)=\sqrt{2\pi}\,\text{NorCDF}(-x).   (3.13)

Here, NorCDF denotes the CDF of the standard normal distribution \mathcal{N}(0,1). For brevity, we omit a straightforward proof for the log-normal case (3.10), which uses Equation (A.3). A proof for the log-double-exponential case (3.11) is given in Appendix C. For this case, we note that the function \mathrm{Hh}_{\ell}(\cdot) can be evaluated very efficiently using the standard normal density function and the standard normal distribution function via the three-term recursion [1]

\ell\,\mathrm{Hh}_{\ell}(x)=\mathrm{Hh}_{\ell-2}(x)-x\,\mathrm{Hh}_{\ell-1}(x),\quad\ell\geq 1.
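As a sanity check on Corollary 3.1, the sketch below (ours; the truncation level K and all parameter values are placeholder choices) evaluates the truncated series g(s;\Delta t,K) for normal jump sizes via (3.10), together with the \mathrm{Hh}_{\ell} recursion needed in the double-exponential case. Every term of the truncated series is non-negative by construction, which is precisely what the monotone quadrature scheme of Section 4 exploits.

    import numpy as np
    from math import factorial
    from scipy.stats import norm

    def g_truncated_normal(s, dt, mu, sigma, lam, mu_j, sig_j, K=15):
        # Truncated series g(s; dt, K) for xi ~ Normal(mu_j, sig_j^2), per (3.10).
        kappa = np.exp(mu_j + 0.5 * sig_j ** 2) - 1.0
        alpha = 0.5 * sigma ** 2 * dt
        beta = (mu - lam * kappa - 0.5 * sigma ** 2) * dt
        theta = -lam * dt
        g = np.zeros_like(np.asarray(s, dtype=float))
        for k in range(K + 1):
            var_k = 4.0 * alpha + 2.0 * k * sig_j ** 2
            g += ((lam * dt) ** k / factorial(k)
                  * np.exp(theta - (beta + s + k * mu_j) ** 2 / var_k)
                  / np.sqrt(np.pi * var_k))
        return g                                               # every term is >= 0

    def Hh_values(ell_max, x):
        # Hh_{-1}(x), ..., Hh_{ell_max}(x) via the three-term recursion above.
        vals = [np.exp(-0.5 * x ** 2), np.sqrt(2.0 * np.pi) * norm.cdf(-x)]
        for ell in range(1, ell_max + 1):
            vals.append((vals[ell - 1] - x * vals[ell]) / ell)  # ell*Hh_ell = Hh_{ell-2} - x*Hh_{ell-1}
        return vals                                            # vals[i] = Hh_{i-1}(x)

    s = np.linspace(-2.0, 2.0, 401)
    dens = g_truncated_normal(s, dt=1.0, mu=0.08, sigma=0.15, lam=0.3, mu_j=-0.05, sig_j=0.40)
    print(dens.sum() * (s[1] - s[0]))   # close to 1 once K and the s-range are large enough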

In the subsequent section, we present a definition of the localized problem to be solved numerically.

3.3 Localization and problem statement

The MV formulation (3.5) is posed on an infinite domain. For the problem statement and convergence analysis of numerical schemes, we define a localized MV portfolio optimization formulation. To this end, with s_{\min}^{\dagger}<s_{\min}<0<s_{\max}<s_{\max}^{\dagger} and -b_{\max}<0<b_{\max}, where |s_{\min}^{\dagger}|, |s_{\min}|, s_{\max}, s_{\max}^{\dagger}, and b_{\max} are sufficiently large, we define the following spatial sub-domains:

\Omega=[s_{\min}^{\dagger},s_{\max}^{\dagger}]\times\left[-b_{\max},b_{\max}\right],\qquad\Omega_{\mathcal{B}}=\left\{\left(s,b\right)\in\Omega\,\backslash\,\Omega_{s_{\max}}\,\backslash\,\Omega_{s_{\min}}:\,W_{\text{liq}}\left(s,b\right)\leq 0\right\},
\Omega_{s_{\max}}=\left[s_{\max},s_{\max}^{\dagger}\right]\times\left[-b_{\max},b_{\max}\right],\qquad\Omega_{b_{\max}}=\left(s_{\min},s_{\max}\right)\times\left(\left[-b_{\max}e^{r_{b}T},-b_{\max}\right)\cup\left(b_{\max},b_{\max}e^{r_{\iota}T}\right]\right),
\Omega_{s_{\min}}=\left[s_{\min}^{\dagger},s_{\min}\right]\times\left[-b_{\max},b_{\max}\right],\qquad\Omega_{\text{in}}=\Omega\,\backslash\,\Omega_{s_{\max}}\,\backslash\,\Omega_{s_{\min}}\,\backslash\,\Omega_{\mathcal{B}}.   (3.14)

We emphasize that we do not actually solve the MV optimization problem in \Omega_{b_{\max}}. However, we may use an approximate value of the solution in \Omega_{b_{\max}}, obtained by means of extrapolation of the computed solution in \Omega_{\text{in}}, to provide any information required by the MV optimization problem in \Omega_{\text{in}}. We also define the following sub-domains:

\Omega_{s_{\max}^{\dagger}}=\left[s_{\max}^{\dagger},s_{\max}^{\ddagger}\right]\times\left[-b_{\max},b_{\max}\right],\qquad\Omega_{s_{\min}^{\dagger}}=\left[s_{\min}^{\ddagger},s_{\min}^{\dagger}\right]\times\left[-b_{\max},b_{\max}\right],
\text{where }s_{\max}^{\ddagger}=s_{\max}-s_{\min}^{\dagger}\text{ and }s_{\min}^{\ddagger}=s_{\min}-s_{\max}^{\dagger}.   (3.15)

The solutions within the sub-domains \Omega_{s_{\min}^{\dagger}} and \Omega_{s_{\max}^{\dagger}} are not required for our purposes. These sub-domains are introduced to ensure the well-defined computation of the conditional probability density function g(\cdot) in the convolution integral (3.6) for the MV optimization problem within \Omega_{\text{in}}. To simplify our discussion, we will adopt a zero-padding convention going forward. This convention assumes that the value functions within these sub-domains are zero for all time t, and we will exclude these sub-domains from further discussions.

Due to rebalancing, the intervention operator \mathcal{M}(\cdot) for \Omega_{\text{in}}, defined in (3.4), may require evaluating a candidate value at a point having s^{+}=\ln(\max(W_{\text{liq}}(s,b)-c,\,e^{s_{-\infty}})), and s^{+} could be outside [s_{\min}^{\dagger},s_{\max}^{\dagger}] if s_{-\infty}<s_{\min}^{\dagger}. Therefore, with |s_{\min}^{\dagger}| selected sufficiently large, we assume s_{-\infty}=s_{\min}^{\dagger}.

We now present equations for the spatial sub-domains defined in (3.14). We note that boundary conditions for s\to-\infty and s\to\infty are obtained from the relevant asymptotic forms e^{s}\to 0 and e^{s}\to\infty, respectively, similar to [19]. This is detailed below.

  • For (s,b,T)\in\Omega\times\{T\}, we apply the terminal condition (3.5a)

    v(s,b,T)=\left(W_{\text{liq}}(s,b)-\frac{\gamma}{2}\right)^{2}.   (3.16)
  • For (s,b,t_{m})\in\Omega\times\mathcal{T}_{M}, m=M-1,\ldots,0, the intervention result (3.5b) is given by

    v\left(s,b,t_{m}\right)=\min\left\{v\left(s,b,t_{m}^{+}\right),\,\inf_{c\in\mathcal{Z}}\mathcal{M}(c)\,v\left(s,b,t_{m}^{+}\right)\right\},   (3.17)

    where the intervention operator \mathcal{M}(\cdot) is defined in (3.4).

  • For (s,b,t_{m}^{-})\in\Omega\times\{t_{m}^{-}\}, m=M,\ldots,1, the settlement of interest (3.5c) is enforced by

    v\left(s,b,t_{m}^{-}\right)=v\left(s,be^{R(b)\Delta t},t_{m}\right),\quad m=M,\ldots,1,\quad\text{where }v\left(s,\cdot,t_{m}\right)\text{ is given in (3.17)}.   (3.18)
  • For (s,b,t_{m}^{+})\in\Omega_{b_{\max}}\times\{t_{m}^{+}\}, where m=M,\ldots,1, we impose the boundary condition

    v\left(s,b,t_{m}^{+}\right)=\left(\frac{b}{b_{\max}}\right)^{2}v\left(s,\operatorname{sgn}(b)\,b_{\max},t_{m}^{+}\right).   (3.19)
  • For (s,b,t_{m-1}^{+})\in\Omega_{s_{\min}}\times\{t_{m-1}^{+}\}, where t_{m-1}\in\mathcal{T}_{M}, from (3.16), we assume that v(s,b,t)\approx A_{0}(t)b^{2} for some unknown function A_{0}(t), which mimics the asymptotic behaviour of the value function as s\to-\infty (or equivalently, e^{s}\to 0). Substituting this asymptotic form into the integral (3.5d) gives the boundary condition

    v(s,b,t_{m-1}^{+})=A_{0}(t_{m}^{-})\,b^{2}\int_{-\infty}^{\infty}g(s-s^{\prime};\Delta t)\,\mathrm{d}s^{\prime}=v(s,b,t_{m}^{-}),   (3.20)

    where v(s,b,t_{m}^{-}) is given by (3.18).

  • For (s,b,t_{m-1}^{+})\in\Omega_{s_{\max}}\times\{t_{m-1}^{+}\}, where t_{m-1}\in\mathcal{T}_{M}, from (3.16), for fixed b, we assume that v(s,b,t)\approx A_{1}(t)e^{2s} for some unknown function A_{1}(t), which mimics the asymptotic behaviour of the value function as s\to\infty (or equivalently, e^{s}\to\infty). We substitute this asymptotic form into the integral (3.5d), noting the infinite series representation of g(\cdot;\Delta t) given in Lemma 3.1, and obtain the corresponding boundary condition (see also the sketch following this list):

    v\left(s,b,t_{m-1}^{+}\right)=v(s,b,t_{m}^{-})\,e^{\left(\sigma^{2}+2\mu+\lambda\kappa_{2}\right)\Delta t},\quad\kappa_{2}=\mathbb{E}\left[\left(\mathrm{e}^{\xi}-1\right)^{2}\right],   (3.21)

    where v(s,b,t_{m}^{-}) is given by (3.18). For a proof, see Appendix B.

  • For (s,b,t_{m-1}^{+})\in\Omega_{\text{in}}\times\{t_{m-1}^{+}\}, where t_{m-1}\in\mathcal{T}_{M}, from the convolution integral (3.6), we have

    v\left(s,b,t_{m-1}^{+}\right)=\int_{s_{\min}^{\dagger}}^{s_{\max}^{\dagger}}v\left(s^{\prime},b,t_{m}^{-}\right)g(s-s^{\prime};\Delta t)\,\mathrm{d}s^{\prime},   (3.22)

    where v\left(s^{\prime},b,t_{m}^{-}\right) is given by (3.18). The conditional density g(\cdot;\Delta t) is given by the infinite series in (3.9) (Lemma 3.1), and is defined on [s_{\min}^{\ddagger},s_{\max}^{\ddagger}].
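The boundary treatments above act on a discrete value array through simple scalings; the sketch below (ours; the grid and parameter values are placeholders) shows (3.19), a quadratic scaling in b for \Omega_{b_{\max}}, and (3.21), a constant multiplicative factor per interval for \Omega_{s_{\max}}.

    import numpy as np

    def apply_bmax_condition(v_at_bmax, b_outside, b_max):
        # (3.19): v(s, b, t_m^+) = (b / b_max)^2 * v(s, sgn(b) * b_max, t_m^+)
        return (b_outside / b_max) ** 2 * v_at_bmax

    def apply_smax_condition(v_minus, dt, mu, sigma, lam, kappa2):
        # (3.21): v(s, b, t_{m-1}^+) = v(s, b, t_m^-) * exp((sigma^2 + 2*mu + lam*kappa2) * dt)
        return v_minus * np.exp((sigma ** 2 + 2.0 * mu + lam * kappa2) * dt)

    # kappa2 = E[(e^xi - 1)^2] for normal jump sizes, with placeholder parameters.
    mu_j, sig_j = -0.05, 0.40
    kappa2 = np.exp(2 * mu_j + 2 * sig_j ** 2) - 2 * np.exp(mu_j + 0.5 * sig_j ** 2) + 1
    print(apply_smax_condition(np.array([1.0, 2.0]), dt=1.0, mu=0.08,
                               sigma=0.15, lam=0.3, kappa2=kappa2))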

In Definition 3.1 below, we formally define the localized MV portfolio optimization problem.

Definition 3.1 (Localized MV portfolio optimization problem).

The MV portfolio optimization problem, with the set of rebalancing times \mathcal{T}_{M} defined in (2.1) and dynamics (2.2) with the PDF p(y) given by (2.3) or (2.4), is defined in \Omega\times\left(\mathcal{T}_{M}\cup\left\{t_{M}\right\}\right) as follows.

At each t_{m-1}\in\mathcal{T}_{M}, the solution to the MV portfolio optimization problem v(s,b,t_{m-1}) is given by (3.17), where v(s,b,t_{m-1}^{+}) satisfies (i) the integral (3.22) in \Omega_{\text{in}}\times\{t_{m-1}^{+}\}, and (ii) the boundary conditions (3.20), (3.21), and (3.19) in \Omega_{s_{\min}}\times\{t_{m-1}^{+}\}, \Omega_{s_{\max}}\times\{t_{m-1}^{+}\}, and \Omega_{b_{\max}}\times\{t_{m-1}^{+}\}, respectively, and (iii) is subject to the terminal condition (3.16) in \Omega\times\{t_{M}\}, with the settlement of interest subject to (3.18) in \Omega\times\{t_{m}^{-}\}.

We introduce a result on the uniform continuity of the solution to the MV portfolio optimization problem.

Proposition 3.1.

The solution v(s,b,t_{m}) to the MV portfolio optimization problem in Definition 3.1 is uniformly continuous within each sub-domain \Omega_{\text{in}}\times\{t_{m}\}, m=M,\ldots,0.

Proof.

This proposition can be proved using mathematical induction on m. For brevity, we outline the key steps below. We first note that the domain \Omega is bounded and T is finite. We observe that if v(s,b,t) is a uniformly continuous function, then \inf_{c\in\mathcal{Z}}\mathcal{M}(c)v(s,b,t), where \mathcal{M}(\cdot) is defined in (3.4), is also uniformly continuous [31, Lemma 2.2]. As such, \min\{v(s,b,t),\inf_{c\in\mathcal{Z}}\mathcal{M}(c)v(s,b,t)\} is also uniformly continuous since \Omega is bounded. Therefore, it follows that if v(s,b,t_{m}^{+}), m=M-1,\ldots,0, is uniformly continuous, then the intervention result v(s,b,t_{m}) obtained in (3.17) is also uniformly continuous. Next, if v(s,b,t_{m}), m=M,\ldots,1, is uniformly continuous, then the interest settlement result v(s,b,t_{m}^{-}) defined in (3.18) is also uniformly continuous. The other key step is to show that, if v(s,b,t_{m}^{-}), m=M,\ldots,1, is uniformly continuous, then the solution v(s,b,t_{m-1}^{+}) for (s,b)\in\Omega_{\text{in}} given by the convolution integral (3.22) is also uniformly continuous. Combining the above three steps with the fact that the terminal condition v(s,b,t_{M}) given in (3.16) is uniformly continuous in (s,b)\in\Omega, with \Omega a bounded domain, gives the desired result. ∎

We conclude this section by emphasizing that the value function may not be continuous across smins_{\min} and smaxs_{\max}. The interior domain Ωin×{tm}\Omega_{\scalebox{0.7}{\text{in}}}\times\{t_{m}\}, m=M1,,0m=M-1,\ldots,0, is the target region where provable pointwise convergence of the proposed numerical method is investigated, which relies on Proposition 3.1.

4 Numerical methods

Given the closed-form expressions of g(ss;Δt)g\left(s-s^{\prime};\Delta t\right), the convolution integral (3.22) is approximated by a discrete convolution which can be efficiently computed via FFTs. For our scheme, the intervals [smin,smin][s^{\dagger}_{\min},s_{\min}] and [smax,smax][s_{\max},s^{\dagger}_{\max}] also serve as padding areas for nodes in Ωin\Omega_{\scalebox{0.7}{\text{in}}}. Without loss of generality, for convenience, we assume that |smin||s_{\min}| and smaxs_{\max} are chosen sufficiently large with

smin=sminsmaxsmin2,andsmax=smax+smaxsmin2.\displaystyle s^{\dagger}_{\min}=s_{\min}-\frac{s_{\max}-s_{\min}}{2},~{}~{}~{}\text{and}~{}~{}~{}s^{\dagger}_{\max}=s_{\max}+\frac{s_{\max}-s_{\min}}{2}. (4.1)

With this in mind, smins_{\min}^{\ddagger} and smaxs_{\max}^{\ddagger}, defined in (3.15), are given by

smin=sminsmax=32(smaxsmin),andsmax=smaxsmin=32(smaxsmin).s_{\min}^{\ddagger}=s^{\dagger}_{\min}-s_{\max}=-\frac{3}{2}\left(s_{\max}-s_{\min}\right),~{}~{}~{}\text{and}~{}~{}~{}s_{\max}^{\ddagger}=s^{\dagger}_{\max}-s_{\min}=\frac{3}{2}\left(s_{\max}-s_{\min}\right).

4.1 Discretization

We discretize the MV portfolio optimization problem defined in Definition 3.1 on the localized domain Ω\Omega as follows.

  • (i)

    We denote by NN (resp. NN^{\dagger} and NN^{\ddagger} ) the number of intervals of a uniform partition of [smin,smax][s_{\min},s_{\max}] (resp. [smin,smax][s_{\min}^{\dagger},s_{\max}^{\dagger}] and [smin,smax][s_{\min}^{\ddagger},s_{\max}^{\ddagger}]). For convenience, we typically choose N=2NN^{\dagger}=2N and N=3NN^{\ddagger}=3N so that only one set of ss-coordinates is needed. We use an equally spaced partition in the ss-direction, denoted by {sn}\{s_{n}\}, where

    sn\displaystyle s_{n} =s^0+nΔs;n=N/2,,N/2,where s^0=smin+smax2=smin+smax2=smin+smax2,\displaystyle=\hat{s}_{0}+n\Delta s;~{}~{}n=-N^{\ddagger}/2,\ldots,N^{\ddagger}/2,~{}~{}\text{where }\hat{s}_{0}=\frac{s_{\min}+s_{\max}}{2}=\frac{s^{\dagger}_{\min}+s^{\dagger}_{\max}}{2}=\frac{s^{\ddagger}_{\min}+s^{\ddagger}_{\max}}{2},
    and Δs\displaystyle\text{and }\Delta s =smaxsminN=smaxsminN=smaxsminN.\displaystyle=\frac{s_{\max}-s_{\min}}{N}~{}=~{}\frac{s^{\dagger}_{\max}-s^{\dagger}_{\min}}{N^{\dagger}}~{}=~{}\frac{s^{\ddagger}_{\max}-s^{\ddagger}_{\min}}{N^{\ddagger}}. (4.2)
  • (ii)

    We use an unequally spaced partition in the bb-direction, denoted by {bj}\{b_{j}\}, where j=0,,Jj=0,\ldots,J, with b0=bminb_{0}=b_{\min}, bJ=bmaxb_{J}=b_{\max}, Δbmax=max0jJ1(bj+1bj)\Delta b_{\max}=\max_{0\leq j\leq J-1}\left(b_{j+1}-b_{j}\right), and Δbmin=min0jJ1(bj+1bj)\Delta b_{\min}=\min_{0\leq j\leq J-1}\left(b_{j+1}-b_{j}\right).
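To make the grid indexing concrete, the following is a minimal Python sketch of the uniform ss-grid (4.2) with N=2NN^{\dagger}=2N and N=3NN^{\ddagger}=3N; it assumes NN is even, and the function name make_s_grid is ours, introduced only for illustration.

import numpy as np

def make_s_grid(s_min, s_max, N):
    """Uniform s-grid (4.2) with N_dag = 2N and N_ddag = 3N; assumes N is even."""
    ds = (s_max - s_min) / N                      # Delta s in (4.2)
    s_hat0 = 0.5 * (s_min + s_max)                # common midpoint of the three intervals
    n = np.arange(-3 * N // 2, 3 * N // 2 + 1)    # n = -N_ddag/2, ..., N_ddag/2
    return s_hat0 + n * ds, ds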

We emphasize that no timestepping is required for the interval [tm1+,tm][t_{m-1}^{+},t_{m}^{-}], tm1𝒯Mt_{m-1}\in\mathcal{T}_{M}. As noted earlier, Δt=T/M\Delta t=T/M is kept constant. We assume that there exists a discretization parameter h>0h>0 such that

Δs=C1h,Δbmax=C2h,Δbmin=C2h,\displaystyle\Delta s=C_{1}h,\quad\Delta b_{\max}=C_{2}h,\quad\Delta b_{\min}=C_{2}^{\prime}h, (4.3)

where the positive constants C1C_{1}, C2C_{2}, C2C_{2}^{\prime} are independent of hh. For convenience, we occasionally use 𝐱n,jm(sn,bj,tm){\bf{x}}_{n,j}^{m}\equiv(s_{n},b_{j},t_{m}) to refer to the reference gridpoint (sn,bj,tm)(s_{n},b_{j},t_{m}), n=N/2,,N/2n=-N^{\dagger}/2,\ldots,N^{\dagger}/2, j=0,,Jj=0,\ldots,J, m=M,,0m=M,\ldots,0. Nodes 𝐱n,jm{\bf{x}}_{n,j}^{m} have (i) n=N/2,,N/2n=-N^{\dagger}/2,\ldots,-N/2, in Ωsmin\Omega_{s_{\min}}, (ii) n=N/2+1,,N/21n=-N/2+1,\ldots,N/2-1, in Ωin\Omega_{\scalebox{0.7}{\text{in}}}, (iii) n=N/2,,N/2n=N/2,\ldots,N^{\dagger}/2, in Ωsmax\Omega_{s_{\max}}, and (iv) n=N/2+1,,N/21n=-N^{\ddagger}/2+1,\ldots,-N^{\dagger}/2-1 and n=N/2+1,,N/21n=N^{\dagger}/2+1,\ldots,N^{\ddagger}/2-1, in the padding sub-domains.

For tm𝒯Mt_{m}\in\mathcal{T}_{M}, we denote by v(sn,bj,t)v(s_{n},b_{j},t) the exact solution at the reference node (sn,bj,t)(s_{n},b_{j},t), where t={tm±,tm}t=\{t_{m}^{\pm},t_{m}\}, and by vh(s,b,t)v_{h}(s,b,t) the approximate solution at an arbitrary point (s,b,t)(s,b,t) obtained using the discretization parameter hh. We refer to the approximate solution at the reference node (sn,bj,t)(s_{n},b_{j},t), where t={tm±,tm}t=\{t_{m}^{\pm},t_{m}\}, as vn,jm±vh(sn,bj,tm±)v_{n,j}^{m\pm}\equiv v_{h}(s_{n},b_{j},t_{m}^{\pm}) and vn,jmvh(sn,bj,tm)v_{n,j}^{m}\equiv v_{h}(s_{n},b_{j},t_{m}). In the event that we need to evaluate vhv_{h} at a point other than a gridpoint of the computational grid, linear interpolation is used. We denote by 𝒵h\mathcal{Z}_{h} the discrete set of admissible impulse values, defined as follows

𝒵h={b0,b1,,bJ}𝒵.\displaystyle\mathcal{Z}_{h}=\left\{b_{0},b_{1},\ldots,b_{J}\right\}\cap\mathcal{Z}. (4.4)

where 𝒵\mathcal{Z} is defined in (2.3), and hh is the discretization parameter. With b+𝒵hb^{+}\in\mathcal{Z}_{h} being an impulse value (a control), applying b+b^{+} at the reference spatial node (sn,bj)(s_{n},b_{j}) results in

sn+=s+(sn,bj,b+) computed by (2.5),bj+=b+(sn,bj,b+)=b+.s^{+}_{n}=s^{+}(s_{n},b_{j},b^{+})\text{ computed by \eqref{eq:S+_B+}},\quad b_{j}^{+}=b^{+}(s_{n},b_{j},b^{+})=b^{+}. (4.5)

For the special case tMt_{M}, as discussed earlier, we only have interest rate payment, but no rebalancing, and therefore, only vn,jMv_{n,j}^{M} and vn,jMv_{n,j}^{M-} are used.

4.2 Numerical schemes

For convenience, we define ={N/2+1,,N/21}\mathbb{N}=\left\{-N/2+1,\ldots,N/2-1\right\}, ={N/2,,N/2}\mathbb{N}^{\dagger}=\left\{-N^{\dagger}/2,\ldots,N^{\dagger}/2\right\} and 𝕁={0,,J}\mathbb{J}=\left\{0,\ldots,J\right\}. Working backward in time, over each time interval [tm1,tm]\left[t_{m-1},t_{m}\right], tm1𝒯Mt_{m-1}\in\mathcal{T}_{M}, there are three key components in solving the MV optimization problem, namely (i) the interest settlement over [tm,tm]\left[t_{m}^{-},t_{m}\right] as given in (3.18); (ii) the time advancement from tmt_{m}^{-} to tm1+t_{m-1}^{+}, as captured by (3.20)-(3.22); and (iii) the intervention action over [tm1,tm1+][t_{m-1},t_{m-1}^{+}] as given in (3.17). We now propose the numerical schemes for these steps.

For (sn,bj,tM)Ω×{T}(s_{n},b_{j},t_{M})\in\Omega\times\{T\}, we impose the terminal condition (3.16) by

vn,jM=(Wliq(sn,bj)γ2)2,n,j𝕁.v_{n,j}^{M}={\color[rgb]{0,0,0}{\left(W_{\text{liq}}(s_{n},b_{j})-\frac{\gamma}{2}\right)^{2},\quad n\in\mathbb{N}^{\dagger},~{}~{}j\in\mathbb{J}.}} (4.6)

For imposing the intervention action (3.17), we solve the optimization problem

vn,jm=min{vn,jm+,minb+𝒵hvh(sn+,b+,tm+)},sn+=s+(sn,bj,b+),n,j𝕁.\displaystyle{\color[rgb]{0,0,0}{v_{n,j}^{m}=\min\left\{v_{n,j}^{m+},\min_{b^{+}\in\mathcal{Z}_{h}}v_{h}(s_{n}^{+},b^{+},t_{m}^{+})\right\},}}\quad s_{n}^{+}=s^{+}(s_{n},b_{j},b^{+}),\quad n\in\mathbb{N}^{\dagger},~{}j\in\mathbb{J}. (4.7)

Here, vh(sn+,b+,tm+)v_{h}(s^{+}_{n},b^{+},t_{m}^{+}) is the approximate solution to the exact solution v(sn+,b+,tm+)v(s_{n}^{+},b^{+},t_{m}^{+}), where b+𝒵hb^{+}\in\mathcal{Z}_{h} and sn+=s+(sn,bj,b+)s^{+}_{n}=s^{+}(s_{n},b_{j},b^{+}) is given by (2.5). The approximation vh(sn+,b+,tm+)v_{h}(s^{+}_{n},b^{+},t_{m}^{+}) is computed by linear interpolation as follows

vh(sn+,b+,tm+)={vm+}(sn+,b+),n,j𝕁.\displaystyle v_{h}(s_{n}^{+},b^{+},t_{m}^{+})=\mathcal{I}\left\{v^{m+}\right\}\left(s^{+}_{n},b^{+}\right),\quad n\in\mathbb{N}^{\dagger},~{}~{}j\in\mathbb{J}. (4.8)

Here, {vm+}()\mathcal{I}\left\{v^{m+}\right\}(\cdot) is a linear interpolation operator acting on the time-tm+t_{m}^{+} discrete solutions {sq,bp,vq,pm+}\left\{s_{q},b_{p},v_{q,p}^{m+}\right\}, qq\in\mathbb{N}^{\dagger} and p𝕁p\in\mathbb{J}. We note that since b+{b0,b1,,bJ}b^{+}\in\left\{b_{0},b_{1},\ldots,b_{J}\right\}, (4.8) boils down to a single dimensional interpolation along the ss-dimension.

Remark 4.1 (Attainability of local minima).

We determine the infimum of the intervention operator in (3.5b) by a linear search over the discrete set of controls 𝒵h\mathcal{Z}_{h} in (4.4), that is, an exhaustive search through all admissible controls. As mentioned in [22], using this approach, we are guaranteed to obtain the global minimum as h0h\rightarrow 0.
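As an illustration, a minimal Python sketch of the intervention step (4.7)-(4.8) with the exhaustive search described in Remark 4.1 is given below. The callable s_plus_fn stands in for the map (2.5), and, purely for simplicity of the sketch, the discrete control set is taken to be the full bb-grid rather than the intersection (4.4); both are our assumptions.

import numpy as np

def intervention_step(s_grid, b_grid, v_plus, s_plus_fn):
    """Sketch of (4.7)-(4.8): exhaustive search over the discrete controls,
    with one-dimensional linear interpolation along the s-direction.

    v_plus    : array of shape (len(s_grid), len(b_grid)) holding v_{q,p}^{m+}
    s_plus_fn : callable (s_n, b_j, b_plus) -> s^+ standing in for (2.5)
    """
    v_new = np.empty_like(v_plus)
    for n, s_n in enumerate(s_grid):
        for j, b_j in enumerate(b_grid):
            best = v_plus[n, j]                    # "do not rebalance" branch of (4.7)
            for p, b_plus in enumerate(b_grid):    # stand-in for Z_h in (4.4)
                s_p = s_plus_fn(s_n, b_j, b_plus)
                best = min(best, np.interp(s_p, s_grid, v_plus[:, p]))
            v_new[n, j] = best
    return v_new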

For the settlement of interest (3.18), linear interpolation/extrapolation is applied to compute vn,jmv_{n,j}^{m-} as follows.

vn,jm={vnm}(bjeR(bj)Δt),n,j𝕁v_{n,j}^{m-}=\mathcal{I}\left\{v^{m}_{n}\right\}\left(b_{j}e^{R(b_{j})\Delta t}\right),\quad n\in\mathbb{N}^{\dagger},~{}~{}j\in\mathbb{J} (4.9)

Here, {vnm}()\mathcal{I}\left\{v^{m}_{n}\right\}(\cdot) is a linear interpolation/extrapolation operator acting on the time-tmt_{m} discrete solutions {bq,vn,qm}\left\{b_{q},v_{n,q}^{m}\right\}, q𝕁q\in\mathbb{J}, where vn,qmv_{n,q}^{m} are given by (4.6) at tm=Tt_{m}=T and by (4.7) at tm,m=M1,,1t_{m},m=M-1,\ldots,1. Note that when (sn,b)Ωbmax\left(s_{n},b\right)\in\Omega_{b_{\max}}, {vnm}(b)\mathcal{I}\left\{v^{m}_{n}\right\}(b) becomes a linear extrapolation operator which imposes the boundary condition (3.19). That is,

v(sn,b,tm)=(bbJ)2v(sn,sgn(b)bJ,tm),(sn,b,tm)Ωbmax×{tm},m=M,,1.\displaystyle v(s_{n},b,t_{m})=\left(\frac{b}{b_{J}}\right)^{2}v\left(s_{n},\operatorname{sgn}(b)b_{J},t_{m}\right),\qquad(s_{n},b,t_{m})\in\Omega_{b_{\max}}\times\left\{t_{m}\right\},\,m=M,\ldots,1. (4.10)

We now consider the time advancement to (sn,bj,tm1+)(ΩsminΩinΩsmax)×{tm1+}(s_{n},b_{j},t_{m-1}^{+})\in\left(\Omega_{s_{\min}}\cup\Omega_{\scalebox{0.7}{\text{in}}}\cup\Omega_{s_{\max}}\right)\times\{t_{m-1}^{+}\}, tm1𝒯Mt_{m-1}\in\mathcal{T}_{M}. The boundary conditions (3.20) and (3.21), for (ΩsminΩsmax)×{tm1+}\left(\Omega_{s_{\min}}\cup\Omega_{s_{\max}}\right)\times\{t_{m-1}^{+}\}, are imposed by

vn,j(m1)+\displaystyle v_{n,j}^{(m-1)+} =\displaystyle= vn,jm,n=N/2,,N/2,j𝕁, and vn,jm is given in (4.9),\displaystyle v_{n,j}^{m-},\qquad\qquad\qquad\quad n=-N^{\dagger}/2,\ldots,-N/2,~{}~{}j\in\mathbb{J},{\text{ and $v_{n,j}^{m-}$ is given in \eqref{eq:recursive_interest**}, }} (4.11)
vn,j(m1)+\displaystyle v_{n,j}^{(m-1)+} =\displaystyle= e(σ2+2μ+λκ2)Δtvn,jm,n=N/2,,N/2,j𝕁, and vn,jm is given in (4.9).\displaystyle e^{\left(\sigma^{2}+2\mu+\lambda\kappa_{2}\right)\Delta t}v_{n,j}^{m-},\quad n=N/2,\ldots,N^{\dagger}/2,~{}j\in\mathbb{J},{\text{ and $v_{n,j}^{m-}$ is given in \eqref{eq:recursive_interest**}}}. (4.12)

In Ωin\Omega_{\scalebox{0.7}{\text{in}}}, we tackle the convolution integral in (3.22), where j𝕁j\in\mathbb{J} is fixed. For simplicity, we adopt the following notational convention: with nn\in\mathbb{N} and ll\in\mathbb{N}^{\dagger}, we let gnl(Δt,)=g(snsl;Δt,){\color[rgb]{0,0,0}{g_{n-l}(\Delta t,\infty)}}=g(s_{n}-s_{l};\Delta t,\infty), where g()g(\cdot) is given by the infinite series (3.9). We also denote by gnl(Δt,K)g_{n-l}(\Delta t,K) an approximation to gnl(Δt,)g_{n-l}(\Delta t,\infty) using the terms k=0,,Kk=0,\ldots,K of the infinite series (3.9). Applying the composite trapezoidal rule to approximate the convolution integral (3.22) gives the approximation in the form of a discrete convolution as follows

vn,j(m1)+=Δsl=N/2N/2ωlgnl(Δt,K)vl,jm,n,j𝕁.\displaystyle v_{n,j}^{(m-1)+}=\Delta s{\color[rgb]{0,0,0}{\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}}}\omega_{l}\,{\color[rgb]{0,0,0}{g_{n-l}(\Delta t,K)}}\,v_{l,j}^{m-},\quad{\color[rgb]{0,0,0}{n\in\mathbb{N},~{}j\in\mathbb{J}.}} (4.13)

where vl,jmv_{l,j}^{m-} are given in (4.9) and ωl=1\omega_{l}=1, l=N/2+1,,N/21l=-N^{\dagger}/2+1,\ldots,N^{\dagger}/2-1, and ωN/2=ωN/2=1/2\omega_{-N^{\dagger}/2}=\omega_{N^{\dagger}/2}=1/2.
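For reference, before the FFT-based implementation of Section 4.3, the scheme (4.13) can be evaluated directly for a fixed jj, as in the following minimal Python sketch; here NN is assumed even, and g_of(k) is a stand-in for the truncated density weight gk(Δt,K)g_{k}(\Delta t,K).

import numpy as np

def advance_interior_direct(ds, g_of, v_j_minus, N):
    """Direct O(N^2) evaluation of the discrete convolution (4.13) for a fixed j.

    g_of      : callable k -> g_k(dt, K), the truncated density weights
    v_j_minus : v_{l,j}^{m-} for l = -N_dag/2, ..., N_dag/2 (length 2N + 1)
    """
    omega = np.ones(2 * N + 1)                     # trapezoidal weights omega_l
    omega[0] = omega[-1] = 0.5
    out = np.empty(N - 1)                          # interior nodes n = -N/2+1, ..., N/2-1
    for i, n in enumerate(range(-N // 2 + 1, N // 2)):
        acc = 0.0
        for k, l in enumerate(range(-N, N + 1)):   # l = -N_dag/2, ..., N_dag/2
            acc += omega[k] * g_of(n - l) * v_j_minus[k]
        out[i] = ds * acc
    return out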

Remark 4.2 (Monotonicity).

We highlight that the conditional density gnl(Δt,)g_{n-l}(\Delta t,\infty) given by the infinite series (3.9) is defined and non-negative for all nn\in\mathbb{N} and ll\in\mathbb{N}^{\dagger} (or, alternatively, for all sn(smin,smax)s_{n}\in(s_{\min},s_{\max}) and sl[smin,smax]s_{l}\in[s_{\min}^{\dagger},s_{\max}^{\dagger}]). Therefore, scheme (4.13) is monotone.

We highlight that for computational purposes, gnl(Δt,)g_{n-l}(\Delta t,\infty), given by the infinite series (3.9), is truncated to gnl(Δt,K)g_{n-l}(\Delta t,K). However, since each term of the series is non-negative, this truncation does not result in loss of monotonicity, which is a key advantage of the proposed approach.

As KK\to\infty, there is no loss of information in the discrete convolution (4.13). For a finite KK, however, there is an error, namely |gnl(Δt,)gnl(Δt,K)|\left|g_{n-l}(\Delta t,\infty)-g_{n-l}(\Delta t,K)\right|, due to truncating the infinite series (3.9). Specifically, this truncation error can be bounded as follows:

|gnl(Δt,)gnl(Δt;K)|\displaystyle\left|g_{n-l}(\Delta t,\infty)-g_{n-l}(\Delta t;K)\right| =|k=K+1(λΔt)kk!eαη2+(β+snsl)(iη)+θ(Γ(η))kdη|\displaystyle=\left|\sum_{k=K+1}^{\infty}\frac{\left(\lambda\Delta t\right)^{k}}{k!}\int_{-\infty}^{\infty}e^{-\alpha\eta^{2}+\left(\beta+s_{n}-s_{l}\right)\left(i\eta\right)+\theta}\,\left(\Gamma\left(\eta\right)\right)^{k}~{}\mathrm{d}\eta\right|
(i)(λΔt)K+1(K+1)!gnl(Δt,)(ii)(λΔt)K+1(K+1)!12πσ2Δt.\displaystyle{\color[rgb]{0,0,0}{~{}{\buildrel(\text{i})\over{\leq}}~{}\frac{\left(\lambda\Delta t\right)^{K+1}}{(K+1)!}\,g_{n-l}(\Delta t,\infty)}}~{}{\buildrel(\text{ii})\over{\leq}}~{}\frac{\left(\lambda\Delta t\right)^{K+1}}{(K+1)!}\,\frac{1}{\sqrt{2\pi\sigma^{2}\Delta t}}. (4.14)

Here, in (i), |(Γ(η))K+1|(p(y)|eiηy|𝑑y)K+11\left|\left(\Gamma\left(\eta\right)\right)^{K+1}\right|\leq\left(\int_{-\infty}^{\infty}p(y)~{}\left|e^{i\eta y}\right|~{}dy\right)^{K+1}\leq 1; in (ii), gnl(Δt,)eθ4παk=0(λΔt)kk!=12πσ2Δtg_{n-l}(\Delta t,\infty)\leq\frac{e^{\theta}}{\sqrt{4\pi\alpha}}\sum\limits_{k=0}^{\infty}\frac{\left(\lambda\Delta t\right)^{k}}{k!}=\frac{1}{\sqrt{2\pi\sigma^{2}\Delta t}}. Therefore, from (4.14), as KK\rightarrow\infty, we have (λΔt)K+1(K+1)!0\frac{\left(\lambda\Delta t\right)^{K+1}}{(K+1)!}\rightarrow 0, resulting in no loss of information. For a given ϵ>0\epsilon>0, we can choose KK such that the error |gnl(Δt,)gnl(Δt,K)|<ϵ\left|g_{n-l}(\Delta t,\infty)-g_{n-l}(\Delta t,K)\right|<\epsilon, for all nn\in\mathbb{N} and ll\in\mathbb{N}^{\dagger}. This can be achieved by enforcing

(λΔt)K+1(K+1)!ϵ2πσ2Δt.\displaystyle\frac{\left(\lambda\Delta t\right)^{K+1}}{(K+1)!}\leq\epsilon\sqrt{2\pi\sigma^{2}\Delta t}. (4.15)

It is straightforward to see that, if ϵ=𝒪(h)\epsilon=\mathcal{O}(h), then K=𝒪(ln(h1))K=\mathcal{O}(\ln(h^{-1})), as h0h\rightarrow 0. For a given ϵ\epsilon, we denote by KϵK_{\epsilon} the smallest KK that satisfies (4.15). We then have

0<gnl(Δt,)gnl(Δt,Kϵ)<ϵ,n,l.0<g_{n-l}(\Delta t,\infty)-g_{n-l}(\Delta t,K_{\epsilon})<\epsilon,\quad{\color[rgb]{0,0,0}{n\in\mathbb{N},~{}l\in\mathbb{N}^{\dagger}}}. (4.16)

This value KϵK_{\epsilon} can be obtained through a simple iterative procedure, as illustrated in Algorithm 4.1.
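A minimal Python sketch of this selection, based solely on the stopping criterion (4.15), is given below; the function name smallest_K is ours, and the same test appears in Algorithm 4.1.

import math

def smallest_K(lam, dt, sigma, eps):
    """Smallest K with (lam*dt)^(K+1)/(K+1)! <= eps*sqrt(2*pi*sigma^2*dt), cf. (4.15)."""
    rhs = eps * math.sqrt(2.0 * math.pi * sigma ** 2 * dt)
    K, term = 0, lam * dt                 # term = (lam*dt)^(K+1)/(K+1)! at K = 0
    while term > rhs:
        K += 1
        term *= lam * dt / (K + 1)        # update to (lam*dt)^(K+1)/(K+1)!
    return K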

4.3 Efficient implementation and algorithms

In this section, we discuss an efficient implementation of the scheme presented above using FFT. For convenience, we define/recall sets of indices: ={N/2+1,,N/21}\mathbb{N}^{\ddagger}=\left\{-N^{\ddagger}/2+1,\ldots,N^{\ddagger}/2-1\right\}, ={N/2,,N/2}\mathbb{N}^{\dagger}=\left\{-N^{\dagger}/2,\ldots,N^{\dagger}/2\right\}, ={N/2+1,,N/21}\mathbb{N}=\left\{-N/2+1,\ldots,N/2-1\right\}, 𝕁={0,,J}\mathbb{J}=\left\{0,\ldots,J\right\}, with N=2NN^{\dagger}=2N and N=N+N=3NN^{\ddagger}=N+N^{\dagger}=3N. For brevity, we adopt the notational convention: for nn\in\mathbb{N} and ll\in\mathbb{N}^{\dagger}, gnlgnl(Δt,K)g_{n-l}\equiv g_{n-l}(\Delta t,K), where KK is chosen by (4.15). To compute the discrete convolution in (4.13) efficiently for a fixed j𝕁j\in\mathbb{J}, we rewrite (4.13) in a matrix-vector product form as follows

[vN/2+1,j(m1)+vN/2+2,j(m1)+vN/21,j(m1)+]vj(m1)+=Δs[gN/2+1gN/2g3N/2+1gN/2+2gN/2+1g3N/2+2g3N/21g3N/22gN/21][gnl]n,l[12vN/2,jmvN/2+1,jmvN/21,jm12vN/2,jm]vjm.\displaystyle\underbrace{\begin{bmatrix}v_{-N/2+1,j}^{(m-1)+}\\ v_{-N/2+2,j}^{(m-1)+}\\ \vdots\\ \vdots\\ v_{N/2-1,j}^{(m-1)+}\end{bmatrix}}_{~{}~{}~{}\scalebox{1.0}{$v_{j}^{(m-1)+}$}}~{}~{}~{}=~{}~{}~{}\Delta s~{}\underbrace{\begin{bmatrix}g_{N/2+1}&g_{N/2}&\dots&g_{-3N/2+1}\\ g_{N/2+2}&g_{N/2+1}&\dots&g_{-3N/2+2}\\ \vdots&\vdots&&\vdots\\ \vdots&\vdots&&\vdots\\ g_{3N/2-1}&g_{3N/2-2}&\dots&g_{-N/2-1}\end{bmatrix}}_{\quad\quad\scalebox{1.1}{$\left[g_{n-l}\right]_{n\in\mathbb{N},\,l\in\mathbb{N}^{\dagger}}$}}\,~{}~{}~{}\underbrace{\begin{bmatrix}\frac{1}{2}v_{-N^{\dagger}/2,j}^{m-}\\ v_{-N^{\dagger}/2+1,j}^{m-}\\ \vdots\\ v_{N^{\dagger}/2-1,j}^{m-}\\ \frac{1}{2}v_{N^{\dagger}/2,j}^{m-}\end{bmatrix}}_{\quad\scalebox{1.0}{$v_{j}^{m-}$}}. (4.17)

Here, in (4.17), the vector vj(m1)+[vn,j(m1)+]nv_{j}^{(m-1)+}\equiv\left[v_{n,j}^{(m-1)+}\right]_{n\in\mathbb{N}} is of size (N1)×1(N-1)\!\times\!1, the matrix [gnl]n,l\left[g_{n-l}\right]_{n\in\mathbb{N},\,l\in\mathbb{N}^{\dagger}} is of size (N1)×(2N+1)(N-1)\!\times\!(2N+1), and the vector vjm[vn,jm]nv_{j}^{m-}\equiv\left[v_{n,j}^{m-}\right]_{n\in\mathbb{N}^{\dagger}} is of size (2N+1)×1(2N+1)\!\times\!1. It is important to note that [gnl]n,l\left[g_{n-l}\right]_{n\in\mathbb{N},\,l\in\mathbb{N}^{\dagger}} is a Toeplitz matrix [9] with constant entries along each diagonal. To compute the matrix-vector product in (4.17) efficiently using FFT, we take advantage of a circular convolution product described below.

  • We first expand the non-square matrix [gnl]n,l\left[g_{n-l}\right]_{n\in\mathbb{N},\,l\in\mathbb{N}^{\dagger}} (of size (N1)×(N+1)(N-1)\!\times\!(N^{\dagger}+1)) into a circulant matrix of size (3N1)×(3N1)(3N-1)\!\times\!(3N-1), denoted by g~\tilde{g} and defined as follows

    g~=[g~1,0g~1,1[gnl]n,lg~0,1g~1,0g~1,1].\displaystyle\tilde{g}=\left[\begin{array}[]{c|c}\tilde{g}^{\prime}_{-1,0}&\tilde{g}^{\prime}_{-1,1}\\ \hline\cr\left[g_{n-l}\right]_{n\in\mathbb{N},\,l\in\mathbb{N}^{\dagger}}&\tilde{g}^{\prime}_{0,1}\\ \hline\cr\tilde{g}^{\prime}_{1,0}&\tilde{g}^{\prime}_{1,1}\end{array}\right]. (4.21)

    Here, g~1,0\tilde{g}^{\prime}_{-1,0}, g~1,0\tilde{g}^{\prime}_{1,0}, g~1,1\tilde{g}^{\prime}_{-1,1}, g~0,1\tilde{g}^{\prime}_{0,1} and g~1,1\tilde{g}^{\prime}_{1,1} are matrices of sizes N×(2N+1)N\!\times\!(2N+1), N×(2N+1)N\!\times\!(2N+1), N×(N2)N\!\times\!(N-2), (N1)×(N2)(N-1)\!\times\!(N-2), and N×(N2)N\!\times\!(N-2), respectively, and are given below

    g~1,0\displaystyle\tilde{g}^{\prime}_{-1,0} =\displaystyle= [gN/2+1gN/2g3N/2+1g3N/21g3N/22gN/2gN/2+2gN/2+1g3N/2+2g3N/2+1g3N/21gN/2+1gN/2gN/21gN/2gN/21gN/22g3N/21],\displaystyle\left[\begin{array}[]{cccccccc}g_{-N/2+1}&g_{-N/2}&\dots&g_{-3N/2+1}&g_{3N/2-1}&g_{3N/2-2}&\dots&g_{N/2}\\ g_{-N/2+2}&g_{-N/2+1}&\dots&g_{-3N/2+2}&g_{-3N/2+1}&g_{3N/2-1}&\dots&g_{N/2+1}\\ \vdots&\vdots&&\vdots&\vdots&\vdots&&\vdots\\ g_{N/2}&g_{N/2-1}&\dots&g_{-N/2}&g_{-N/2-1}&g_{-N/2-2}&\dots&g_{3N/2-1}\end{array}\right], (4.26)
    g~1,0\displaystyle\tilde{g}^{\prime}_{1,0} =\displaystyle= [g3N/2+1g3N/21g3N/22gN/2+1gN/2gN/2g3N/2+2g3N/2+1g3N/21gN/2+2gN/2+1gN/2+1gN/2gN/21gN/22g3N/2+1g3N/21gN/21],\displaystyle\left[\begin{array}[]{cccccccc}g_{-3N/2+1}&g_{3N/2-1}&g_{3N/2-2}&\dots&g_{N/2+1}&g_{N/2}&\dots&g_{-N/2}\\ g_{-3N/2+2}&g_{-3N/2+1}&g_{3N/2-1}&\dots&g_{N/2+2}&g_{N/2+1}&\dots&g_{-N/2+1}\\ \vdots&\vdots&\vdots&&\vdots&\vdots&&\vdots\\ g_{-N/2}&g_{-N/2-1}&g_{-N/2-2}&\dots&g_{-3N/2+1}&g_{3N/2-1}&\dots&g_{N/2-1}\end{array}\right], (4.31)
    g~1,1\displaystyle\tilde{g}^{\prime}_{-1,1} =\displaystyle= [gN/21gN/22gN/2+2gN/2gN/21gN/2+3g3N/22g3N/23gN/2+1],\displaystyle\left[\begin{array}[]{cccc}g_{N/2-1}&g_{N/2-2}&\dots&g_{-N/2+2}\\ g_{N/2}&g_{N/2-1}&\dots&g_{-N/2+3}\\ \vdots&\vdots&&\vdots\\ g_{3N/2-2}&g_{3N/2-3}&\dots&g_{N/2+1}\end{array}\right], (4.36)
    g~0,1\displaystyle\tilde{g}^{\prime}_{0,1} =\displaystyle= [g3N/21g3N/22gN/2+3gN/2+2g3N/2+1g3N/21gN/2+4gN/2+3gN/22gN/23g3N/2+2g3N/2+1],\displaystyle{\color[rgb]{0,0,0}{\left[\begin{array}[]{ccccc}g_{3N/2-1}&g_{3N/2-2}&\dots&g_{N/2+3}&g_{N/2+2}\\ g_{-3N/2+1}&g_{3N/2-1}&\dots&g_{N/2+4}&g_{N/2+3}\\ \vdots&\vdots&&\vdots&\vdots\\ g_{-N/2-2}&g_{-N/2-3}&\dots&g_{-3N/2+2}&g_{-3N/2+1}\end{array}\right],}} (4.41)
    g~1,1\displaystyle\tilde{g}^{\prime}_{1,1} =\displaystyle= [gN/21gN/22g3N/2+2gN/2gN/21g3N/2+3gN/22gN/21gN/2+1].\displaystyle\left[\begin{array}[]{cccc}g_{-N/2-1}&g_{-N/2-2}&\dots&g_{-3N/2+2}\\ g_{-N/2}&g_{-N/2-1}&\dots&g_{-3N/2+3}\\ \vdots&\vdots&&\vdots\\ g_{N/2-2}&g_{N/2-1}&\dots&g_{-N/2+1}\end{array}\right]. (4.46)
  • For fixed j𝕁j\in\mathbb{J}, we construct v~jm\tilde{v}_{j}^{m-} a vector of size (3N1)×1(3N-1)\!\times\!1 by augmenting vector vjmv_{j}^{m-}, defined in (4.17), with zeros as follows

    v~jm=[(vjm), 0, 0,,0]=[12vN/2,jm,vN/2+1,jm,,vN/21,jm,12vN/2,jm, 0, 0,,0].\displaystyle\tilde{v}_{j}^{m-}=\left[(v_{j}^{m-})^{\top},\,0,\,0,\ldots,0\right]^{\top}=\left[\frac{1}{2}v_{-N^{\dagger}/2,j}^{m-},\,v_{-N^{\dagger}/2+1,j}^{m-},\ldots,v_{N^{\dagger}/2-1,j}^{m-},\,\frac{1}{2}v_{N^{\dagger}/2,j}^{m-},\,0,\,0,\ldots,0\right]^{\top}. (4.47)

    Then, (4.17) can be implemented by applying a circulant matrix-vector product to compute an intermediate vector of discrete solutions v~j(m1)+\tilde{v}_{j}^{(m-1)+} as follows

    v~j(m1)+=Δsg~v~jm,j𝕁.\displaystyle\tilde{v}_{j}^{(m-1)+}=\Delta s\,\tilde{g}\,\tilde{v}_{j}^{m-},\qquad j\in\mathbb{J}. (4.48)

    Here, the circulant matrix g~\tilde{g} is given by (4.21), the vector v~jm\tilde{v}_{j}^{m-} is given by (4.47), and the intermediate result v~j(m1)+\tilde{v}_{j}^{(m-1)+} is a vector of size (3N1)×1(3N-1)\!\times\!1, with vj(m1)+v_{j}^{(m-1)+} being the middle N1N-1 elements (the (N+1)(N+1)-th through the (2N1)(2N-1)-th) of v~j(m1)+\tilde{v}_{j}^{(m-1)+}.

  • Observing that a circulant matrix-vector product is equal to a circular convolution product, (4.48) can further be written as a circular convolution product. More specifically, let g~1\tilde{g}_{1} be the first column of the circulant matrix g~\tilde{g} defined in (4.21), which is given by

    g~1\displaystyle\tilde{g}_{1} =\displaystyle= [gN/2+1,gN/2+2,,g3N/21,g3N/2+1,g3N/2+2,,gN/2].\displaystyle\big{[}g_{-N/2+1},\,g_{-N/2+2},\ldots,g_{3N/2-1},\,g_{-3N/2+1},\,g_{-3N/2+2},\ldots,g_{-N/2}\big{]}^{\top}. (4.49)

    The circular convolution product z=xyz=x\ast y is defined componentwise by

    zk=k=N/2+1N/21xkk+1yk,k=N/2+1,,N/21,\displaystyle z_{k^{\prime}}=\sum_{k=-N^{\ddagger}/2+1}^{N^{\ddagger}/2-1}x_{k^{\prime}-k+1}\,y_{k},\quad k^{\prime}=-N^{\ddagger}/2+1,\ldots,N^{\ddagger}/2-1,

    where xx and yy are two sequences with period (N1)(N^{\ddagger}-1) (i.e. xk=xk+(N1)x_{k}=x_{k+(N^{\ddagger}-1)} and yk=yk+(N1)y_{k}=y_{k+(N^{\ddagger}-1)}, kk\in\mathbb{N}^{\ddagger}). Then, (4.48) can be written as the following circular convolution product

    v~j(m1)+=Δsg~v~jm=Δsg~1v~jm,j=0,,J.\displaystyle\tilde{v}_{j}^{(m-1)+}=\Delta s\,\tilde{g}\,\tilde{v}_{j}^{m-}=\Delta s\,\tilde{g}_{1}*\tilde{v}_{j}^{m-},\qquad j=0,\ldots,J. (4.50)
  • The circular convolution product in (4.50) can be computed efficiently using FFT and iFFT as follows

    v~j(m1)+=ΔsFFT1{FFT(v~jm)FFT(g~1)},j=0,,J.\displaystyle\tilde{v}_{j}^{(m-1)+}=\Delta s\,\text{FFT}^{-1}\left\{\text{FFT}(\tilde{v}_{j}^{m-})\circ\text{FFT}(\tilde{g}_{1})\right\},\qquad j=0,\ldots,J. (4.51)
  • Once the vector of intermediate discrete solutions v~j(m1)+v~n,j(m1)+\tilde{v}_{j}^{(m-1)+}\equiv\tilde{v}_{n,j}^{(m-1)+} is computed, we then obtain the vector of discrete solutions [vn,j(m1)+]n\left[v_{n,j}^{(m-1)+}\right]_{n\in\mathbb{N}} (of size (N1)×1(N-1)\!\times\!1) for Ωin\Omega_{\scalebox{0.7}{\text{in}}} by discarding the values v~n,j(m1)+\tilde{v}_{n,j}^{(m-1)+}, nn\in\mathbb{N}^{\ddagger}\setminus\mathbb{N}.
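To illustrate the steps above, the following is a minimal Python/NumPy sketch of the circulant first column (4.49), the zero-padding (4.47), and the FFT-based product (4.51) for one fixed jj. The function names are ours, g_of(k) stands in for the truncated weight gk(Δt,Kϵ)g_{k}(\Delta t,K_{\epsilon}), NN is assumed even, and, as in (4.17), the trapezoidal factors 1/2 are assumed to have been applied to the two end entries of the input vector.

import numpy as np

def first_column(g_of, N):
    """First column g_tilde_1 of the circulant matrix g_tilde, cf. (4.49); length 3N - 1."""
    idx = list(range(-N // 2 + 1, 3 * N // 2)) + list(range(-3 * N // 2 + 1, -N // 2 + 1))
    return np.array([g_of(k) for k in idx])

def advance_interior_fft(ds, g1, v_j_minus):
    """Apply (4.50)-(4.51) for one fixed j and extract the interior values.

    g1        : first column of g_tilde (length 3N - 1)
    v_j_minus : v_{l,j}^{m-}, l = -N_dag/2, ..., N_dag/2 (length 2N + 1), end entries halved
    """
    L = g1.size                                   # 3N - 1
    v_tilde = np.zeros(L)
    v_tilde[: v_j_minus.size] = v_j_minus         # zero-padding, cf. (4.47)
    conv = np.real(np.fft.ifft(np.fft.fft(g1) * np.fft.fft(v_tilde)))   # cf. (4.51)
    N = (L + 1) // 3
    # interior block of (4.17): the (N+1)-th through (2N-1)-th entries of the product
    return ds * conv[N : 2 * N - 1]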

The implementation (4.51) suggests that we compute the weight components of g~1\tilde{g}_{1} only once, and reuse them for the computation over all time intervals. More specifically, for a given user-tolerance ϵ\epsilon, using (4.15), we can compute a sufficiently large number of terms K=KϵK=K_{\epsilon} in the infinite series representation (3.9) for these weights. Then, using Corollary 3.1, these weights for the case ξ\xi following a normal distribution [49] or a double-exponential distribution [36] can be computed only once in the Fourier space, as in (4.51), and reused for all time intervals. This step is described in Algorithm 4.1.

Algorithm 4.1 Computation of weight vector g~1(Δt,Kϵ)\tilde{g}_{1}(\Delta t,K_{\epsilon}) in the Fourier space; ϵ>0\epsilon>0 is a user-defined tolerance.
1:  set k=Kϵ=0k=K_{\epsilon}=0;
2:  compute test=(λΔt)k+1(k+1)!2πσ2Δt\text{test}=\frac{\left(\lambda\Delta t\right)^{k+1}}{(k+1)!\,\sqrt{2\pi\sigma^{2}\Delta t}}; compute gnl(Δt,Kϵ)=g(snsl;Δt,0)g_{n-l}(\Delta t,K_{\epsilon})=g(s_{n}-s_{l};\Delta t,0), n,ln\in\mathbb{N},\,l\in\mathbb{N}^{\dagger}, given in Corollary 3.1;
3:  construct the weight vector g~1(Δt,Kϵ)\tilde{g}_{1}(\Delta t,K_{\epsilon}) using gnl(Δt,Kϵ)g_{n-l}(\Delta t,K_{\epsilon}) as defined in (4.49);
4:  while testϵ\text{test}\geq\epsilon  do
5:     set k=k+1k=k+1, and Kϵ=kK_{\epsilon}=k;
6:      compute test=(λΔt)k+1(k+1)!2πσ2Δt\text{test}=\frac{\left(\lambda\Delta t\right)^{k+1}}{(k+1)!\,\sqrt{2\pi\sigma^{2}\Delta t}};
7:     compute the increments Δgk(snsl;Δt)\Delta g_{k}(s_{n}-s_{l};\Delta t), n,ln\in\mathbb{N},\,l\in\mathbb{N}^{\dagger}, given in Corollary 3.1;
8:     compute gnl(Δt,Kϵ)=gnl(Δt,Kϵ)+Δgk(snsl;Δt)g_{n-l}(\Delta t,K_{\epsilon})=g_{n-l}(\Delta t,K_{\epsilon})+\Delta g_{k}(s_{n}-s_{l};\Delta t), n,ln\in\mathbb{N},\,l\in\mathbb{N}^{\dagger};
9:     construct the weight vector g~1(Δt,Kϵ)\tilde{g}_{1}(\Delta t,K_{\epsilon}) using gnl(Δt,Kϵ)g_{n-l}(\Delta t,K_{\epsilon}) as defined in (4.49);
10:  end while
11:  output weight vector FFT(g~1)\text{FFT}(\tilde{g}_{1});

Putting everything together, in Algorithm 4.2, we present a monotone integration algorithm for MV portfolio optimization.

Algorithm 4.2 A monotone numerical integration algorithm for MV portfolio optimization when ξ\xi follows a normal distribution [49] or a double-exponential distribution [36]; ϵ>0\epsilon>0 is a user-defined tolerance; the embedding parameter γ\gamma\in\mathbb{R} is fixed;
1:  compute weight vector g~1\tilde{g}_{1} using Algorithm 4.1;
2:   initialize vn,jM=(Wliq(sn,bj)γ2)2v_{n,j}^{M}={\color[rgb]{0,0,0}{\left(W_{\text{liq}}(s_{n},b_{j})-\frac{\gamma}{2}\right)^{2}}}, n=N/2,,N/2n=-N^{\dagger}/2,\ldots,N^{\dagger}/2, j=0,,Jj=0,\ldots,J;
3:  for m=M,,1m=M,\ldots,1 do
4:     enforce interest rate payment (4.9) to obtain vn,jmv_{n,j}^{m-}, n=N/2,,N/2n=-N^{\dagger}/2,\ldots,N^{\dagger}/2, j=0,,Jj=0,\ldots,J;
5:      compute vectors of intermediate values v~j(m1)+\tilde{v}_{j}^{(m-1)+}, j=0,,Jj=0,\ldots,J using (4.51);
6:      obtain vectors of discrete solutions [vn,j(m1)+]n\left[v_{n,j}^{(m-1)+}\right]_{n\in\mathbb{N}}, j=0,,Jj=0,\ldots,J by discarding all values v~n,j(m1)+\tilde{v}_{n,j}^{(m-1)+} (Line (5)) where nn\in\mathbb{N}^{\ddagger}\setminus\mathbb{N} ; Ωin\Omega_{\scalebox{0.7}{\text{in}}}
7:     compute vn,j(m1)+v_{n,j}^{(m-1)+}, n=N/2,,N/2n=-N^{\dagger}/2,\ldots,-N/2, j=0,,Jj=0,\ldots,J, using (4.11); Ωsmin\Omega_{s_{\min}}
8:     compute vn,j(m1)+v_{n,j}^{(m-1)+}, n=N/2,,N/2n=N/2,\ldots,N^{\dagger}/2, j=0,,Jj=0,\ldots,J using (4.12); Ωsmax\Omega_{s_{\max}}
9:      solve the optimization problem (4.7) to obtain vn,jm1v_{n,j}^{m-1}, nn\in\mathbb{N}^{\dagger}, j𝕁j\in\mathbb{J}; save the optimal impulse value cn,jm,c_{n,j}^{m,*};
10:  end for
Remark 4.3 (Complexity).

Algorithm 4.2 involves, for m=M,,1m=M,\ldots,1, the following key steps.

  • Compute vn,j(m1)+v_{n,j}^{(m-1)+}, nn\in\mathbb{N}^{\ddagger}, j𝕁j\in\mathbb{J} via the FFT algorithm. The complexity of this step is 𝒪(JNlog2N)=𝒪(1/h2log2(1/h))\mathcal{O}\left(JN^{\ddagger}\log_{2}N^{\ddagger}\right)=\mathcal{O}\left(1/h^{2}\cdot\log_{2}(1/h)\right), where we take into account (4.3).

  • We use an exhaustive search through all admissible controls in 𝒵h\mathcal{Z}_{h} to obtain the global minimum. Each optimization problem is solved by evaluating the objective function 𝒪(1/h)\mathcal{O}(1/h) times. There are 𝒪(1/h2)\mathcal{O}(1/h^{2}) nodes, and 𝒪(1)\mathcal{O}(1) timesteps, giving a total complexity of 𝒪(1/h3)\mathcal{O}(1/h^{3}). This is an order-of-magnitude reduction compared to the complexity of finite difference methods, which is typically 𝒪(1/h4)\mathcal{O}(1/h^{4}) for discrete rebalancing (see [19, Section 6.1]).

4.4 Construction of efficient frontier

We now discuss the construction of the efficient frontier. To this end, we define the auxiliary function u(s,b,tm)=E𝒞mx,tm[WT]u(s,b,t_{m})=E_{\mathcal{C}_{m}^{*}}^{x,t_{m}}\left[W_{T}\right], where 𝒞m\mathcal{C}_{m}^{*}, as defined in (3.3), is the optimal control for the problem PCMVΔt(tm;γ)PCMV_{\Delta t}(t_{m};\gamma) obtained by solving the localized problem in Definition 3.1. Similar to [19, 66, 65], we present a localized problem for u(xm)=u(s,b,tm)u(x^{m})=u(s,b,t_{m}), with xm=(s,b,tm)x^{m}=(s,b,t_{m}) and tm𝒯M{T}t_{m}\in\mathcal{T}_{M}\cup\left\{T\right\}, in the sub-domains (3.3) as below

u(xM)\displaystyle u\left(x^{M}\right)~{} =\displaystyle= Wliq(s,b)ε,xMΩ×{T},\displaystyle~{}W_{\text{liq}}(s,b)-\varepsilon,\qquad\qquad\qquad\qquad\qquad~{}x^{M}\in\Omega\times\left\{T\right\}, (4.52a)
u(xm)\displaystyle u(x^{m})~{} =\displaystyle= (cm)u(xm+),xmΩ×𝒯M,\displaystyle~{}\mathcal{M}(c_{m}^{*})\,u\left(x^{m+}\right),\qquad\qquad\qquad\qquad\quad x^{m}\in\Omega\times\mathcal{T}_{M}, (4.52b)
u(xm)\displaystyle u(x^{m-})~{} =\displaystyle= u(s,beR(b)Δt,tm),xmΩ×{tm},m=M,,1,\displaystyle~{}u\left(s,be^{R(b)\Delta t},t_{m}\right),\qquad\qquad\qquad\qquad x^{m}\in\Omega\times\left\{t_{m}\right\},\quad m=M,\ldots,1, (4.52c)
u(xm)\displaystyle u(x^{m})~{} =\displaystyle= |b|bmaxu(s,sgn(b)bmax,tm),xmΩbmax×{tm},m=M,,1,\displaystyle~{}{\color[rgb]{0,0,0}{\frac{\left|b\right|}{b_{\max}}u\left(s,\operatorname{sgn}(b)b_{\max},t_{m}\right),\qquad\qquad\quad x^{m}\in\Omega_{b_{\max}}\times\left\{t_{m}\right\},\,m=M,\ldots,1,}} (4.52d)
u(x(m1)+)\displaystyle u(x^{(m-1)+})~{} =\displaystyle= {sminsmaxu(s,b,tm)g(ss;Δt)ds,x(m1)+Ωin×{tm1+},tm1𝒯M,u(xm)eμΔt,x(m1)+Ωsmax×{tm1+},tm1𝒯M,u(xm),x(m1)+Ωsmin×{tm1+},tm1𝒯M.}\displaystyle~{}\left\{\small\begin{aligned} &\int_{s_{\min}^{\dagger}}^{s_{\max}^{\dagger}}u\left(s^{\prime},b,t_{m}^{-}\right)g(s-s^{\prime};\Delta t)~{}\mathrm{d}s^{\prime},&x^{(m-1)+}\in\Omega_{\scalebox{0.7}{\text{in}}}\times\left\{t_{m-1}^{+}\right\},\,t_{m-1}\in\mathcal{T}_{M},\\ &u(x^{m-})\,e^{\mu\Delta t},&x^{(m-1)+}\in\Omega_{s_{\max}}\times\left\{t_{m-1}^{+}\right\},\,t_{m-1}\in\mathcal{T}_{M},\\ &u(x^{m-}),&x^{(m-1)+}\in\Omega_{s_{\min}}\times\left\{t_{m-1}^{+}\right\},\,t_{m-1}\in\mathcal{T}_{M}.\end{aligned}\right\} (4.52e)

Here, in (4.52b), cmc_{m}^{*} is the optimal impulse value obtained from solving the value function problem (3.17); (4.52c) is due to the settlement (payment or receipt) of interest due for the time interval [tm1,tm][t_{m-1},t_{m}], m=M,,1m=M,\ldots,1; (4.52d)-(4.52e) are equations for the spatial sub-domains Ωbmax\Omega_{b_{\max}}, Ωin\Omega_{\scalebox{0.7}{\text{in}}}, Ωsmax\Omega_{s_{\max}} and Ωsmin\Omega_{s_{\min}}. The localized problem (4.52) can be solved numerically in a straightforward manner. In particular, at a reference gridpoint (sn,bj)(s_{n},b_{j}), the optimal impulse value cmc_{m}^{*} in (4.52b) becomes cn,jm,c_{n,j}^{m,*}, which is the optimal impulse value obtained from Line (9) of Algorithm 4.2. We emphasize that it may be non-optimal to rebalance, in which case the convention is cn,jm,=bjc_{n,j}^{m,*}=b_{j}. Furthermore, the convolution integral in (4.52e) can be approximated using a scheme similar to (4.13). For brevity, we only provide details of the numerical scheme for Ωsmax\Omega_{s_{\max}} in Appendix B, and omit those of the other schemes for (4.52).

We assume that, given the initial state x=(s,b)x=(s,b) at time t0t_{0} and the positive discretization parameter hh, the efficient frontier (EF), denoted by 𝒴h\mathcal{Y}_{h}, can be traced out using the embedding parameter γ\gamma\in\mathbb{R} as below

𝒴h=γ((Var𝒞0x,t0[WT])h,(E𝒞0x,t0[WT])h)γ.\mathcal{Y}_{h}=\bigcup\limits_{\gamma\in\mathbb{R}}\left(\sqrt{\left(Var_{\mathcal{C}_{0}^{*}}^{x,t_{0}}\left[W_{T}\right]\right)_{h}},\,\left(E_{\mathcal{C}_{0}^{*}}^{x,t_{0}}\left[W_{T}\right]\right)_{h}\right)_{\gamma}. (4.53)

Here, ()h(\cdot)_{h} refers to a discrete approximation to the expression in the brackets. Specifically, for fixed γ\gamma, we let

V0v(s,b,t0)=E𝒞0x,t0[(WTγ2)2]andU0u(s,b,t0)=E𝒞0x,t0[WT].\displaystyle V_{0}\equiv{\color[rgb]{0,0,0}{v(s,b,t_{0})}}=E_{\mathcal{C}_{0}^{*}}^{x,t_{0}}\left[\left(W_{T}-\frac{\gamma}{2}\right)^{2}\right]\quad{\text{and}}\quad U_{0}\equiv{\color[rgb]{0,0,0}{u(s,b,t_{0})}}=E_{\mathcal{C}_{0}^{*}}^{x,t_{0}}\left[W_{T}\right]. (4.54)

Then (Var𝒞0x,t0[WT])h\left(Var_{\mathcal{C}_{0}^{*}}^{x,t_{0}}\left[W_{T}\right]\right)_{h} and (E𝒞0x,t0[WT])h\left(E_{\mathcal{C}_{0}^{*}}^{x,t_{0}}\left[W_{T}\right]\right)_{h} corresponding to γ\gamma in (4.53) are computed as follows

(Var𝒞0x,t0[WT])h=V0+γU0γ24(U0)2and(E𝒞0x,t0[WT])h=U0.\displaystyle\left(Var_{\mathcal{C}_{0}^{*}}^{x,t_{0}}\left[W_{T}\right]\right)_{h}=V_{0}+\gamma U_{0}-\frac{\gamma^{2}}{4}-\left(U_{0}\right)^{2}\quad{\text{and}}\quad\left(E_{\mathcal{C}_{0}^{*}}^{x,t_{0}}\left[W_{T}\right]\right)_{h}=U_{0}. (4.55)
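For a fixed γ\gamma, the mapping from (V0,U0)(V_{0},U_{0}) in (4.54) to a point on the frontier via (4.55) is elementary; a minimal Python sketch is given below, with the function name frontier_point being ours.

import math

def frontier_point(V0, U0, gamma):
    """One EF point from (4.54)-(4.55): V0 = E[(W_T - gamma/2)^2], U0 = E[W_T]."""
    variance = V0 + gamma * U0 - 0.25 * gamma ** 2 - U0 ** 2
    return math.sqrt(variance), U0    # (standard deviation, expected terminal wealth)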

5 Pointwise convergence

In this section, we establish pointwise convergence of the proposed numerical integration method. We start by verifying three properties: \ell_{\infty}-stability, monotonicity, and consistency (with respect to the integral formulation (3.22)). We recall that the infinite series gnl(Δt,)g_{n-l}(\Delta t,\infty) is approximated by gnl(Δt,Kϵ)g_{n-l}(\Delta t,K_{\epsilon}), where ϵ>0\epsilon>0 is a user-defined tolerance, and we have the error bound gnl(Δt,)gnl(Δt,Kϵ)<ϵg_{n-l}(\Delta t,\infty)-g_{n-l}(\Delta t,K_{\epsilon})<\epsilon, as noted in (4.16).

It is straightforward to see that the proposed scheme is monotone since all the weights gnlg_{n-l} are positive. Therefore, we will primarily focus on \ell_{\infty}-stability and consistency of the scheme. We will then show that convergence of our scheme is ensured if KϵK_{\epsilon}\to\infty as h0h\to 0, or equivalently, ϵ0\epsilon\to 0 as h0h\to 0.

For subsequent use, we present a remark about gnl(Δt;Kϵ)g_{n-l}(\Delta t;K_{\epsilon}), nn\in\mathbb{N}, ll\in\mathbb{N}^{\dagger}.

Remark 5.1.

Recalling that g(s,s;Δt)g(s,s;Δt,)g(s,s^{\prime};\Delta t)\equiv g(s,s^{\prime};\Delta t,\infty) is a (conditional) probability density function, for a fixed sn[smin,smax]s_{n}\in[s_{\min},s_{\max}], we have g(sn,s;Δt,)𝑑s=1\displaystyle\int_{\mathbb{R}}g(s_{n},s;\Delta t,\infty)~{}ds=1, hence sminsmaxg(sn,s;Δt,)𝑑s1\int_{s_{\min}^{\dagger}}^{s_{\max}^{\dagger}}g(s_{n},s;\Delta t,\infty)~{}ds\leq 1. Furthermore, applying a quadrature rule to approximate sminsmaxg(sn,s;Δt,)𝑑s\displaystyle{\color[rgb]{0,0,0}{\int_{s_{\min}^{\dagger}}^{s_{\max}^{\dagger}}}}g(s_{n},s;\Delta t,\infty)~{}ds gives rise to an approximation error, denoted by ϵg\epsilon_{g}, defined as follows

ϵg:=|Δsl=N/2N/2ωlgnl(Δt,)sminsmaxg(sn,s;Δt,)𝑑s|.\epsilon_{g}:=\bigg{|}{\color[rgb]{0,0,0}{\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}}}~{}g_{n-l}(\Delta t,\infty)-\int_{s_{\min}^{\dagger}}^{s_{\max}^{\dagger}}g(s_{n},s;\Delta t,\infty)~{}ds\bigg{|}.

It is straightforward to see that ϵg0\epsilon_{g}\to 0 as NN^{\dagger}\to\infty, i.e. as h0h\to 0. Using the above results, recalling the weights ωl\omega_{l}, ll\in\mathbb{N}^{\dagger}, are positive, and the error bound (4.16), we have

0Δsl=N/2N/2ωlgnl(Δt,Kϵ)Δsl=N/2N/2ωlgnl(Δt,)1+ϵg<eϵg.\displaystyle 0~{}\leq~{}\Delta s{\color[rgb]{0,0,0}{\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}}}\omega_{l}~{}g_{n-l}(\Delta t,K_{\epsilon})~{}\leq~{}\Delta s{\color[rgb]{0,0,0}{\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}}}\omega_{l}~{}g_{n-l}(\Delta t,\infty)~{}\leq~{}1+\epsilon_{g}<e^{\epsilon_{g}}. (5.1)

5.1 Stability

Our scheme consists of the following equations: (4.6) for Ω×{T}\Omega\times\{T\}, (4.11) for Ωsmin\Omega_{s_{\min}}, (4.12) for Ωsmax\Omega_{s_{\max}}, and finally (4.13) for Ωin\Omega_{\scalebox{0.7}{\text{in}}}. We start by verifying \ell_{\infty}-stability of our scheme.

Lemma 5.1 (\ell_{\infty}-stability).

Suppose the discretization parameter hh satisfies (4.3). If linear interpolation is used for the intervention action (4.7), then the scheme (4.6), (4.11), (4.12), and (4.13) satisfies the bound suph>0vm<\displaystyle\sup_{h>0}\left\|v^{m}\right\|_{\infty}<\infty for all m=M,,0m=M,\ldots,0, as the discretization parameter h0h\to 0. Here, we have vm=maxn,j|vn,jm|\left\|v^{m}\right\|_{\infty}=\max_{n,j}|v_{n,j}^{m}|, nn\in\mathbb{N}^{\dagger} and j𝕁j\in\mathbb{J}.

Proof of Lemma 5.1.

First, we note that, for any fixed h>0h>0, as given by (4.6), and for a finite γ\gamma, we have vM<\left\|v^{M}\right\|_{\infty}<\infty, since Ω\Omega is a bounded domain. Therefore, we have suph>0vM<\sup_{h>0}\left\|v^{M}\right\|_{\infty}<\infty. Motivated by this observation, to demonstrate \ell_{\infty}-stability of our scheme, we will show that, for a fixed h>0h>0, at any (sn,bj,tm)(s_{n},b_{j},t_{m}), m=M,,0m=M,\ldots,0, we have

|vn,jm|<e(Mm)(ϵg+(2rmax+σ2+2μ+λκ2)Δt)vM,|v_{n,j}^{m}|<e^{(M-m)\left(\epsilon_{g}+\left({\color[rgb]{0,0,0}{2r_{\max}}}+\sigma^{2}+2\mu+\lambda\kappa_{2}\right)\Delta t\right)}\left\|v^{M}\right\|_{\infty}{\color[rgb]{0,0,0}{,}} (5.2)

where (i) ϵg\epsilon_{g} is the error of the quadrature rule discussed in Remark 5.1, (ii) rmax=max{rb,rι}r_{\max}=\max\left\{r_{b},r_{\iota}\right\}, and (iii) κ2=𝔼[(eξ1)2]\kappa_{2}=\mathbb{E}\left[\left(e^{\xi}-1\right)^{2}\right]. In (5.2), the term e(Mm)2rmaxΔte^{(M-m){\color[rgb]{0,0,0}{2r_{\max}}}\Delta t} is a result of the evaluation of vn,jmv_{n,j}^{m-} using (4.10) for nodes near ±bmax\pm b_{\max}. For the rest of the proof, we will show the key inequality (5.2) when h>0h>0 is fixed. The proof follows from a straightforward maximum analysis, since Ω\Omega is a bounded domain. For brevity, we outline only key steps of an induction proof below.

We use induction on mm, m=M1,,0m=M-1,\ldots,0, to show the bound (5.2) for ΩsminΩinΩsmax\Omega_{s_{\min}}\cup\Omega_{\scalebox{0.7}{\text{in}}}\cup\Omega_{s_{\max}}. For the base case, m=M1m=M-1 and thus (5.2) becomes

|vn,jM1|<eϵg+(2rmax+σ2+2μ+λκ2)ΔtvM,n and j𝕁.\displaystyle|v_{n,j}^{M-1}|<e^{\epsilon_{g}+({\color[rgb]{0,0,0}{2r_{\max}}}+\sigma^{2}+2\mu+\lambda\kappa_{2})\Delta t}\left\|v^{M}\right\|_{\infty},\quad{\color[rgb]{0,0,0}{n\in\mathbb{N}^{\dagger}\text{ and }j\in\mathbb{J}}}. (5.3)

For the settlement of interest over ΩsminΩinΩsmax\Omega_{s_{\min}}\cup\Omega_{\scalebox{0.7}{\text{in}}}\cup\Omega_{s_{\max}}, as reflected by (4.9), we have |vn,jM|<e2rmaxΔt|vn,jM||v_{n,j}^{M-}|<e^{{\color[rgb]{0,0,0}{2r_{\max}}}\Delta t}|v_{n,j}^{M}|, nn\in\mathbb{N}^{\dagger} and j𝕁j\in\mathbb{J}. Since |vn,jM|vM|v_{n,j}^{M}|\leq\left\|v^{M}\right\|_{\infty}, it follows that

|vn,jM|<e2rmaxΔtvM.|v_{n,j}^{M-}|<e^{{\color[rgb]{0,0,0}{2r_{\max}}}\Delta t}\left\|v^{M}\right\|_{\infty}. (5.4)

We now turn to \ell_{\infty}-stability of (4.12) (for Ωsmax\Omega_{s_{\max}}). From (4.12), we note that for n{N/2,,N/2}n\in\big{\{}N/2,\ldots,N^{\dagger}/2\big{\}} and j𝕁j\in\mathbb{J},

|vn,j(M1)+|=eΔt(σ2+2μ+λκ2)|vn,jM|(5.4)eΔt(2rmax+σ2+2μ+λκ2)vMeϵg+Δt(2rmax+σ2+2μ+λκ2)vM|v_{n,j}^{(M-1)+}|=e^{\Delta t(\sigma^{2}+2\mu+\lambda\kappa_{2})}|v_{n,j}^{M-}|\overset{\eqref{eq:itner}}{\leq}e^{\Delta t({\color[rgb]{0,0,0}{2r_{\max}}}+\sigma^{2}+2\mu+\lambda\kappa_{2})}\left\|v^{M}\right\|_{\infty}\leq e^{\epsilon_{g}+\Delta t({\color[rgb]{0,0,0}{2r_{\max}}}+\sigma^{2}+2\mu+\lambda\kappa_{2})}\left\|v^{M}\right\|_{\infty} (5.5)

noting eϵg1e^{\epsilon_{g}}\geq 1. Using (5.4), it is trivial that (4.11) (for Ωsmin\Omega_{s_{\min}}) satisfies

|vn,j(M1)+|eϵg+Δt(2rmax+σ2+2μ+λκ2)vM,n{N/2,,N/2},j𝕁.|v_{n,j}^{(M-1)+}|\leq e^{\epsilon_{g}+\Delta t({\color[rgb]{0,0,0}{2r_{\max}}}+\sigma^{2}+2\mu+\lambda\kappa_{2})}\left\|v^{M}\right\|_{\infty},n\in\big{\{}-N^{\dagger}/2,\ldots,-N/2\big{\}},~{}{\color[rgb]{0,0,0}{j\in\mathbb{J}}}. (5.6)

Now, we focus on the timestepping scheme (4.13) (for Ωin\Omega_{\scalebox{0.7}{\text{in}}}). For nn\in\mathbb{N} and j𝕁j\in\mathbb{J}, we have

|vn,j(M1)+|Δsl=N/2N/2ωlgnl(Δt,Kϵ)|vl,jM|\displaystyle|v_{n,j}^{(M-1)+}|\leq\Delta s{\color[rgb]{0,0,0}{\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}}}\omega_{l}\,g_{n-l}(\Delta t,K_{\epsilon})\,|v_{l,j}^{M-}| (5.4)e2rmaxΔtvM(Δsl=N/2N/2ωlgnl(Δt,Kϵ))\displaystyle\overset{\eqref{eq:itner}}{\leq}e^{{\color[rgb]{0,0,0}{2r_{\max}}}\Delta t}\left\|v^{M}\right\|_{\infty}\bigg{(}\Delta s{\color[rgb]{0,0,0}{\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}}}\omega_{l}\,g_{n-l}(\Delta t,K_{\epsilon})\bigg{)}\, (5.7)
(i)eϵg+(2rmax+σ2+2μ+λκ2)ΔtvM.\displaystyle\overset{\text{(i)}}{\leq}e^{\epsilon_{g}+\left({\color[rgb]{0,0,0}{2r_{\max}}}+\sigma^{2}+2\mu+\lambda\kappa_{2}\right)\Delta t}\left\|v^{M}\right\|_{\infty}. (5.8)

Here, (i) is due to (5.1) and (5.5) and (5.6).

Finally, given (5.8), we bound the intervention result |vn,j(M1)||v_{n,j}^{(M-1)}|, nn\in\mathbb{N}^{\dagger} and j𝕁j\in\mathbb{J}, obtained from (4.7). Since linear interpolation is used, the weights for interpolation are non-negative. In addition, due to (5.5), (5.6), and (5.8), the numerical solutions at nodes used for interpolation, namely |vl,j(M1)+||v_{l,j}^{(M-1)+}|, ll\in\mathbb{N}^{\dagger}, are also bounded by

|vl,j(M1)+|eϵg+(2rmax+σ2+2μ+λκ2)ΔtvM.|v_{l,j}^{(M-1)+}|\leq e^{\epsilon_{g}+\left({\color[rgb]{0,0,0}{2r_{\max}}}+\sigma^{2}+2\mu+\lambda\kappa_{2}\right)\Delta t}\left\|v^{M}\right\|_{\infty}.

Therefore, by monotonicity of linear interpolation, which is preserved by the min()\min(\cdot) operators in (4.7), |vn,j(M1)||v_{n,j}^{(M-1)}|, nn\in\mathbb{N}^{\dagger} and j𝕁j\in\mathbb{J}, satisfy (5.3). We have proved the base case (5.3). Similar arguments can be used to show the induction step. This concludes the proof. ∎

5.2 Consistency

In this subsection, we mathematically demonstrate the pointwise consistency of the proposed scheme with respect to the MV optimization in Definition 3.1. Since it is straightforward that (4.6) is consistent with the terminal condition (3.16) (Ω×{T}\Omega\times\{T\}), we primarily focus on the consistency of the scheme on Ω×{tm1}\Omega\times\{t_{m-1}\}, m=M,,1m=M,\ldots,1.

We start by introducing some notational conventions. We use 𝐱=(s,b)Ω{\bf{x}}=(s,b)\in\Omega and 𝐱m(s,b,tm)Ω×{tm}{\bf{x}}^{m}\equiv(s,b,t_{m})\in\Omega\times\{t_{m}\}, m=M,,0m=M,\ldots,0. In addition, for brevity, we use vm(𝐱)v^{m}({\bf{x}}) instead of v(s,b,tm)v(s,b,t_{m}), m=M,,0m=M,\ldots,0. We now write the MV portfolio optimization in Definition 3.1 and the proposed scheme in forms amenable to analysis. Recalling s+(s,b,c)s^{+}(s,b,c) defined in (2.5), over each time interval [tm1,tm][t_{m-1},t_{m}], where m=M,,1m=M,\ldots,1, we write the MV portfolio optimization in Definition 3.1 via an operator 𝒟()\mathcal{D}(\cdot) as follows

vm1(s,b)=𝒟(𝐱m1,vm)\displaystyle v^{m-1}(s,b)=\mathcal{D}\big{(}{\bf{x}}^{m-1},v^{m}\big{)} min{v(s,b,tm1+),infc𝒵(c)v(s,b,tm1+)}\displaystyle\coloneqq{\color[rgb]{0,0,0}{\min\left\{v(s,b,t_{m-1}^{+}),\,\inf_{c\in\mathcal{Z}}\mathcal{M}(c)~{}v(s,b,t_{m-1}^{+})\right\}}}
=min{v(s,b,tm1+),infc𝒵v(s+(s,b,c),c,tm1+)}.\displaystyle={\color[rgb]{0,0,0}{\min\left\{v(s,b,t_{m-1}^{+}),\,\inf_{c\in\mathcal{Z}}v(s^{+}(s,b,c),c,t_{m-1}^{+})\right\}}}. (5.9)

Here, v(s+(s,b,c),c,tm1+)v(s^{+}(s,b,c),c,t_{m-1}^{+}) is given by

v(s+(s,beR(b)Δt,c),c,tm)\displaystyle v\left(s^{+}(s,be^{R(b)\Delta t},c),c,t_{m}\right) (s,b)Ωsmin,\displaystyle\quad\quad(s,b)\in\Omega_{s_{\min}}, (5.10a)
sminsmaxv(s+(s,beR(b)Δt,c),c,tm)g(ss;Δt)𝑑s\displaystyle\displaystyle\int_{s_{\min}^{\dagger}}^{s_{\max}^{\dagger}}v(s^{+}(s^{\prime},be^{R(b)\Delta t},c),c,t_{m})\,g(s-s^{\prime};\Delta t)~{}ds^{\prime} (s,b)Ωin,\displaystyle\quad\quad(s,b)\in\Omega_{\scalebox{0.7}{\text{in}}}, (5.10b)
e(σ2+2μ+λκ2)Δtv(s+(s,beR(b)Δt,c),c,tm)\displaystyle e^{\left(\sigma^{2}+2\mu+\lambda\kappa_{2}\right)\Delta t}v\left(s^{+}(s,be^{R(b)\Delta t},c),c,t_{m}\right) (s,b)Ωsmax.\displaystyle\quad\quad(s,b)\in\Omega_{s_{\max}}. (5.10c)

Similarly, the term v(s,b,tm1+)v(s,b,t_{m-1}^{+}) in (5.9) is defined as follows

v(s,beR(b)Δt,tm)\displaystyle v\left(s,be^{R(b)\Delta t},t_{m}\right) (s,b)Ωsmin,\displaystyle\quad\quad(s,b)\in\Omega_{s_{\min}},
sminsmaxv(s,beR(b)Δt,tm)g(ss;Δt)𝑑s\displaystyle\displaystyle\int_{s_{\min}^{\dagger}}^{s_{\max}^{\dagger}}v(s^{\prime},be^{R(b)\Delta t},t_{m})\,g(s-s^{\prime};\Delta t)~{}ds^{\prime} (s,b)Ωin,\displaystyle\quad\quad(s,b)\in\Omega_{\scalebox{0.7}{\text{in}}},
e(σ2+2μ+λκ2)Δtv(s,beR(b)Δt,tm)\displaystyle e^{\left(\sigma^{2}+2\mu+\lambda\kappa_{2}\right)\Delta t}v\left(s,be^{R(b)\Delta t},t_{m}\right) (s,b)Ωsmax.\displaystyle\quad\quad(s,b)\in\Omega_{s_{\max}}.

Next, we write the proposed scheme at 𝐱n,jm1=(sn,bj,tm1)Ω×{tm1}{\bf{x}}_{n,j}^{m-1}=\left(s_{n},b_{j},t_{m-1}\right)\in\Omega\times\{t_{m-1}\}, m=M,,1m=M,\ldots,1, in an equivalent form via an operator 𝒟h()\mathcal{D}_{h}(\cdot) as follows

vn,jm1\displaystyle v_{n,j}^{m-1} =𝒟h(𝐱n,jm1,{vl,jm}l=N/2N/2)min{vn,j(m1)+,minb+𝒵vh(s+(sn,bj,b+),b+,tm1+)},\displaystyle=\mathcal{D}_{h}\left({\bf{x}}_{n,j}^{m-1},\left\{v_{l,j}^{m}\right\}_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\right)\coloneqq{\color[rgb]{0,0,0}{\min\left\{v_{n,j}^{(m-1)+},\min_{b^{+}\in\mathcal{Z}}v_{h}\left(s^{+}(s_{n},b_{j},b^{+}),\,b^{+},\,t_{m-1}^{+}\right)\right\},}} (5.12)

where vh(s+(sn,bj,b+),b+,tm1+)=𝒞h(𝐱n,j(m1)+,{vl,jm}l=N/2N/21;b+)=v_{h}\left(s^{+}(s_{n},b_{j},b^{+}),\,b^{+},\,t_{m-1}^{+}\right)=\mathcal{C}_{h}\left({\bf{x}}_{n,j}^{(m-1)+},\left\{v_{l,j}^{m}\right\}_{l=-N^{\dagger}/2}^{N^{\dagger}/2-1};b^{+}\right)=\ldots

vh(s+(sn,bjeR(bj)Δt,b+),b+,tm)\displaystyle v_{h}\left(s^{+}(s_{n},b_{j}e^{R(b_{j})\Delta t},b^{+}),\,b^{+},\,t_{m}\right) n=N/2,,N/2,\displaystyle~{}n=-N^{\dagger}/2,\ldots,-N/2, (5.13a)
Δsl=N/2N/2ωlgnl(Δt;Kϵ)vh(s+(sl,bjeR(bj)Δt,b+),b+,tm)\displaystyle\displaystyle\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}\,g_{n-l}(\Delta t;K_{\epsilon})\,~{}v_{h}\left(s^{+}(s_{l},b_{j}e^{R(b_{j})\Delta t},b^{+}),\,b^{+},\,t_{m}\right) n=N/2+1,,N/21,\displaystyle~{}n=-N/2+1,\ldots,N/2-1, (5.13b)
vh(s+(sn,bjeR(bj)Δt,b+),b+,tm)e(σ2+2μ+λκ2)Δt\displaystyle v_{h}\left(s^{+}(s_{n},b_{j}e^{R(b_{j})\Delta t},b^{+}),\,b^{+},\,t_{m}\right)e^{\left(\sigma^{2}+2\mu+\lambda\kappa_{2}\right)\Delta t} n=N/2,,N/2.\displaystyle~{}n=N/2,\ldots,N^{\dagger}/2. (5.13c)

Similarly, the term vn,j(m1)+v_{n,j}^{(m-1)+} in (5.12) is given by

vh(sn,bjeR(bj)Δt,tm)\displaystyle v_{h}\left(s_{n},\,b_{j}e^{R(b_{j})\Delta t},\,t_{m}\right) n=N/2,,N/2,\displaystyle~{}n=-N^{\dagger}/2,\ldots,-N/2,
Δsl=N/2N/2ωlgnl(Δt;Kϵ)vh(sl,bjeR(bj)Δt,tm)\displaystyle\displaystyle\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}\,g_{n-l}(\Delta t;K_{\epsilon})\,~{}v_{h}\left(s_{l},\,b_{j}e^{R(b_{j})\Delta t},\,t_{m}\right) n=N/2+1,,N/21,\displaystyle~{}n=-N/2+1,\ldots,N/2-1,
e(σ2+2μ+λκ2)Δtvh(sn,bjeR(bj)Δttm)\displaystyle e^{\left(\sigma^{2}+2\mu+\lambda\kappa_{2}\right)\Delta t}v_{h}\left(s_{n},b_{j}e^{R(b_{j})\Delta t}\,t_{m}\right) n=N/2,,N/2.\displaystyle~{}n=N/2,\ldots,N^{\dagger}/2.

We now introduce a lemma on local consistency of the proposed scheme.

Lemma 5.2 (Local consistency).

Suppose that (i) the discretization parameter hh satisfies (4.3), and (ii) linear interpolation is used for the intervention action (4.7). For any smooth test function ϕ𝒞(ΩΩbmax×[0,T])\phi\in\mathcal{C}^{\infty}(\Omega\cup\Omega_{b_{\max}}\times[0,T]), with ϕn,jmϕ(𝐱n,jm)\phi_{n,j}^{m}\equiv\phi({\bf{x}}_{n,j}^{m}) and 𝐱n,jm1Ωin×{tm1}{\bf{x}}_{n,j}^{m-1}\in\Omega_{\scalebox{0.7}{\text{in}}}\times\{t_{m-1}\}, m=M,,1m=M,\ldots,1, and for sufficiently small hh and χ\chi, we have

𝒟h(𝐱n,jm1,{ϕl,jm+χ}l=N/2N/21)=𝒟(𝐱n,jm1,ϕm)+(𝐱n,jm1,ϵ,h)+𝒪(χ+h).\displaystyle\mathcal{D}_{h}\left({\bf{x}}_{n,j}^{m-1},\left\{\phi_{l,j}^{m}+\chi\right\}_{l=-N^{\dagger}/2}^{N^{\dagger}/2-1}\right)=\mathcal{D}\left({\bf{x}}_{n,j}^{m-1},~{}\phi^{m}\right)+\mathcal{E}\left({\bf{x}}_{n,j}^{m-1},\epsilon,h\right)+\mathcal{O}\left(\chi+h\right). (5.15)

Here, (𝐱n,j(m1)+,ϵ,h)0\mathcal{E}({\bf{x}}_{n,j}^{(m-1)+},\epsilon,h)\to 0 as ϵ,h0\epsilon,h\to 0. The operators 𝒟()\mathcal{D}\left(\cdot\right) and 𝒟h()\mathcal{D}_{h}(\cdot) are defined in (5.2) and (5.12), respectively, noting that 𝒟h()\mathcal{D}_{h}(\cdot) depends on 𝒞h()\mathcal{C}_{h}(\cdot) given in (5.13).

Proof of Lemma 5.2.

We first consider the operator 𝒞h()\mathcal{C}_{h}\left(\cdot\right) defined in (5.13). For the case (5.13b) of (5.13), 𝒞h(𝐱n,jm1,{ϕl,jm+χ}l=N/2N/2;b+)\mathcal{C}_{h}\left({\bf{x}}_{n,j}^{m-1},\left\{\phi_{l,j}^{m}+\chi\right\}_{l=-N^{\dagger}/2}^{N^{\dagger}/2};b^{+}\right) becomes

𝒞h()\displaystyle\mathcal{C}_{h}\left(\cdot\right) =(i)Δsl=N/2N/2ωlgnl(Δt;Kϵ)(ϕh(s+(sl,bjeR(bj)Δt,b+),b+,tm)+χ)\displaystyle\overset{\text{(i)}}{=}\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}\,g_{n-l}(\Delta t;K_{\epsilon})\,\left(\phi_{h}\left(s^{+}(s_{l},b_{j}e^{R(b_{j})\Delta t},b^{+}),\,b^{+},\,t_{m}\right)+\chi\right)
=(ii)Δsl=N/2N/2ωlgnl(Δt;Kϵ)(ϕ(s+(sl,bjeR(bj)Δt,b+),b+,tm)+𝒪(h2)+χ)\displaystyle\overset{\text{(ii)}}{=}\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}\,g_{n-l}(\Delta t;K_{\epsilon})\,\left(\phi\left(s^{+}(s_{l},b_{j}e^{R(b_{j})\Delta t},b^{+}),\,b^{+},\,t_{m}\right)+\mathcal{O}(h^{2})+\chi\right)
=(iii)Δsl=N/2N/2ωlgnl(Δt;)ϕ(s+(sl,bjeR(bj)Δt,b+),b+,tm)+f+𝒪(h2+χ)\displaystyle\overset{\text{(iii)}}{=}\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}\,g_{n-l}(\Delta t;\infty)\,\phi\left(s^{+}(s_{l},b_{j}e^{R(b_{j})\Delta t},b^{+}),\,b^{+},\,t_{m}\right)+\scalebox{0.9}{$\mathcal{E}_{f}$}+\mathcal{O}\left(h^{2}+\chi\right)
=(iv)sminsmaxϕ(s+(s,bjeR(bj)Δt,b+),b+,tm)g(sns;Δt,)𝑑s=ϕ(s+(sn,bj,b+),b+,tm1+)+f+c+𝒪(h2+χ)\displaystyle\displaystyle\overset{\text{(iv)}}{=}\underbrace{\int_{s^{\dagger}_{\min}}^{s^{\dagger}_{\max}}\phi\left(s^{+}(s^{\prime},b_{j}e^{R(b_{j})\Delta t},b^{+}),\,b^{+},\,t_{m}\right)g(s_{n}-s^{\prime};\Delta t,\infty)~{}ds^{\prime}}_{\text{$\LARGE{=~{}\phi\left(s^{+}(s_{n},b_{j},b^{+}),\,b^{+},\,t_{m-1}^{+}\right)}$}}+\scalebox{0.9}{$\mathcal{E}_{f}$}+\scalebox{0.9}{$\mathcal{E}_{c}$}+\mathcal{O}\left(h^{2}+\chi\right) (5.16)

Here, (i) and (ii) are due to the facts that, for linear interpolation, the constant χ\chi can be completely separated from interpolated values, and the interpolation error is of size 𝒪((Δs)2+(Δbmax)2)=𝒪(h2)\mathcal{O}\left((\Delta s)^{2}+(\Delta b_{\max})^{2}\right)=\mathcal{O}(h^{2}) for sufficiently small hh; (iii) is due to (5.1) and χ\chi being sufficiently small. The errors f\mathcal{E}_{f} and c\mathcal{E}_{c} in (iii) and (iv) are respectively described below.

  • In (iii), ff(𝐱n,jm1,ϵ)\scalebox{0.9}{$\mathcal{E}_{f}$}\equiv\scalebox{0.9}{$\mathcal{E}_{f}$}\left({\bf{x}}_{n,j}^{m-1},\epsilon\right) is the error arising from truncating the infinite series gnl(,Δt,)g_{n-l}(\cdot,\Delta t,\infty), defined in (3.9), to gnl(,Δt,Kϵ)g_{n-l}(\cdot,\Delta t,K_{\epsilon}). Taking into account the fact that ϕ\phi is continuous and hence bounded on the closed domain (ΩΩbmax)×[0,T](\Omega\cup\Omega_{b_{\max}})\times[0,T], together with (4.16) and (5.1), we have fCϵeϵg\mid\scalebox{0.9}{$\mathcal{E}_{f}$}\mid\leq C^{\prime}\epsilon e^{\epsilon_{g}}, where C>0C^{\prime}>0 is a bounded constant independent of ϵ\epsilon.

  • In (iv), cc(𝐱n,jm1,h)\scalebox{0.9}{$\mathcal{E}_{c}$}\equiv\scalebox{0.9}{$\mathcal{E}_{c}$}\left({\bf{x}}_{n,j}^{m-1},h\right) is the error arising from the numerical integration (composite trapezoidal) rule

    c\mathcal{E}_{c}

    =Δsl=N/2N/2ωlgnl(Δt,)ϕ(s+(sl,bjeR(bj)Δt,b+),b+,tm)\displaystyle=\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}g_{n-l}(\Delta t,\infty)\,\phi\left(s^{+}(s_{l},b_{j}e^{R(b_{j})\Delta t},b^{+}),\,b^{+},\,t_{m}\right)
    sminsmaxϕ(s+(s,bjeR(bj)Δt,b+),b+,tm)g(sns;Δt,)𝑑s.\displaystyle\qquad\qquad-\int_{s^{\dagger}_{\min}}^{s^{\dagger}_{\max}}\phi\left(s^{+}(s^{\prime},b_{j}e^{R(b_{j})\Delta t},b^{+}),\,b^{+},\,t_{m}\right)~{}g(s_{n}-s^{\prime};\Delta t,\infty)~{}ds^{\prime}. (5.17)

    Due to the continuity and the boundedness of the integrand, we have c0\scalebox{0.9}{$\mathcal{E}_{c}$}\to 0 as h0h\to 0.

Since c=b+c=b^{+}, ϕ\phi is smooth, and 𝒵\mathcal{Z} is compact, for ϕ(s+(sn,bj,b+),b+,tm1+)\phi\left(s^{+}(s_{n},b_{j},b^{+}),\,b^{+},\,t_{m-1}^{+}\right) given in (5.16), we have

infb+𝒵hϕ(s+(sn,bj,b+),b+,tm1+)\displaystyle\inf_{b^{+}\in\mathcal{Z}_{h}}\phi\left(s^{+}(s_{n},b_{j},b^{+}),\,b^{+},\,t_{m-1}^{+}\right) =infc𝒵ϕ(s+(sn,bj,c),c,tm1+)+𝒪(h)\displaystyle=\inf_{c\in\mathcal{Z}}\phi\left(s^{+}(s_{n},b_{j},c),\,c,\,t_{m-1}^{+}\right)+\mathcal{O}(h)
=infc𝒵(c)ϕ(sn,bj,tm1+)+𝒪(h).\displaystyle=\inf_{c\in\mathcal{Z}}\mathcal{M}(c)\phi(s_{n},b_{j},t_{m-1}^{+})+\mathcal{O}(h). (5.18)

Next, similar to (5.13b), the term vn,j(m1)+v_{n,j}^{(m-1)+} in (5.12) corresponding to n=N/2+1,,N/21n=-N/2+1,\ldots,N/2-1 is written as follows vn,j(m1)+=Δsl=N/2N/2ωlgnl(Δt;Kϵ)(ϕh(sl,bjeR(bj)Δt,tm)+χ)=\displaystyle v_{n,j}^{(m-1)+}=\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}\,g_{n-l}(\Delta t;K_{\epsilon})\,~{}\left(\phi_{h}\left(s_{l},b_{j}e^{R(b_{j})\Delta t},\,t_{m}\right)+\chi\right)=\ldots

\displaystyle\ldots =(5.16)sminsmaxϕ(s,bjeR(bj)Δt,tm)g(sns;Δt,)𝑑s+f+c+𝒪(h2+χ).\displaystyle\overset{\eqref{eq:OO}}{=}\int_{s^{\dagger}_{\min}}^{s^{\dagger}_{\max}}\phi(s^{\prime},b_{j}e^{R(b_{j})\Delta t},t_{m})~{}g(s_{n}-s^{\prime};\Delta t,\infty)~{}ds^{\prime}+\scalebox{0.9}{$\mathcal{E}_{f}$}+\scalebox{0.9}{$\mathcal{E}_{c}$}+\mathcal{O}\left(h^{2}+\chi\right).
=ϕ(sn,bj,tm1+)+f+c+𝒪(h2+χ).\displaystyle=\phi\left(s_{n},b_{j},t_{m-1}^{+}\right)+\scalebox{0.9}{$\mathcal{E}_{f}$}+\scalebox{0.9}{$\mathcal{E}_{c}$}+\mathcal{O}\left(h^{2}+\chi\right). (5.19)

Therefore, using (5.16), (5.19) and (5.18), and letting (𝐱n,jm1,ϵ,h)=f+c\mathcal{E}\left({\bf{x}}_{n,j}^{m-1},\epsilon,h\right)=\scalebox{0.9}{$\mathcal{E}_{f}$}+\scalebox{0.9}{$\mathcal{E}_{c}$}, for n=N/2+1,,N/21n=-N/2+1,\ldots,N/2-1, the operator 𝒟h(𝐱n,jm1,{ϕl,jm+χ}l=N/2N/2)\mathcal{D}_{h}\left({\bf{x}}_{n,j}^{m-1},\left\{\phi_{l,j}^{m}+\chi\right\}_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\right) in (5.15) can be written as

𝒟h()\displaystyle\mathcal{D}_{h}(\cdot) =min{ϕ(sn,bj,tm1+),infc𝒵(c)ϕ(sn,bj,tm1+)}+𝒪(h+χ)+(𝐱n,jm1,ϵ,h)\displaystyle={\color[rgb]{0,0,0}{\min\left\{\phi\left(s_{n},b_{j},t_{m-1}^{+}\right),\inf_{c\in\mathcal{Z}}\mathcal{M}(c)\,\phi(s_{n},b_{j},t_{m-1}^{+})\right\}}}+\mathcal{O}(h+\chi)+\mathcal{E}\left({\bf{x}}_{n,j}^{m-1},\epsilon,h\right)
=𝒟(𝐱n,jm1,{ϕl,jm}l=N/2N/2)+𝒪(h+χ)+(𝐱n,jm1,ϵ,h),\displaystyle=\mathcal{D}\left({\bf{x}}_{n,j}^{m-1},\left\{\phi_{l,j}^{m}\right\}_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\right)+\mathcal{O}(h+\chi)+\mathcal{E}\left({\bf{x}}_{n,j}^{m-1},\epsilon,h\right), (5.20)

which is (5.15) for 𝐱n,jm1Ωin×{tm1}{\bf{x}}_{n,j}^{m-1}\in\Omega_{\scalebox{0.7}{\text{in}}}\times\{t_{m-1}\}, m=M1,,1m=M-1,\ldots,1.

For the cases (5.13a) and (5.13c), 𝒞h(𝐱n,jm1,b+,{ϕl,jm+χ}l=N/2N/2)\mathcal{C}_{h}\left({\bf{x}}_{n,j}^{m-1},b^{+},\left\{\phi_{l,j}^{m}+\chi\right\}_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\right) can be written, respectively, as

{ϕ(s+(sn,bjeR(bj)Δt,b+),b+,tm) and e(σ2+2μ+λκ2)Δtϕ(s+(sn,bjeR(bj)Δt,b+),b+,tm)}+𝒪(χ+h2),\left\{\phi\left(s^{+}(s_{n},b_{j}e^{R(b_{j})\Delta t},b^{+}),\,b^{+},\,t_{m}\right)\text{ and }e^{\left(\sigma^{2}+2\mu+\lambda\kappa_{2}\right)\Delta t}\phi\left(s^{+}(s_{n},b_{j}e^{R(b_{j})\Delta t},b^{+}),\,b^{+},\,t_{m}\right)\right\}+\mathcal{O}(\chi+h^{2}),

where arguments similar to those used for (i)-(ii) of (5.16) are used. Then, using (5.18) on (5.9), following (5.16) and (5.20), we obtain (5.15) for 𝐱n,jm1(ΩsminΩsmax)×{tm1}{\bf{x}}_{n,j}^{m-1}\in\left(\Omega_{s_{\min}}\cup\Omega_{s_{\max}}\right)\times\{t_{m-1}\}, m=M1,,1m=M-1,\ldots,1. ∎

5.3 Main convergence theorem

Given the \ell_{\infty}-stability and consistency of the proposed numerical scheme established in Lemmas 5.1 and 5.2, together with its monotonicity, we now mathematically demonstrate the pointwise convergence of the scheme in Ωin×{tm1}\Omega_{\scalebox{0.7}{\text{in}}}\times\{t_{m-1}\}, m=M,,1m=M,\ldots,1, as h0h\to 0. Here, as noted earlier, we assume that ϵ0\epsilon\to 0 as h0h\to 0. We first need to recall/introduce relevant notation.

We denote by Ωh\Omega^{h} the computational grid parameterized by hh, noting that ΩhΩ\Omega^{h}\to\Omega as h0h\to 0. We define Ωinh\Omega_{\scalebox{0.7}{\text{in}}}^{h} analogously. In general, a generic gridpoint in Ωinh×{tm}\Omega^{h}_{\scalebox{0.7}{\text{in}}}\times\{t_{m}\}, m=M,,0m=M,\ldots,0, is denoted by 𝐱hm=(𝐱h,tm){\bf{x}}_{h}^{m}=({\bf{x}}_{h},t_{m}), whereas an arbitrary point in Ωin×{tm}\Omega_{\scalebox{0.7}{\text{in}}}\times\{t_{m}\} is denoted by 𝐱m=(𝐱,tm){\bf{x}}^{m}=({\bf{x}},t_{m}). The numerical solution at (𝐱h,tm1)({\bf{x}}_{h},t_{m-1}), m=M,,1m=M,\ldots,1, is denoted by vhm1(𝐱h;vhm)v_{h}^{m-1}({\bf{x}}_{h};v_{h}^{m}), where it is emphasized that vhmv_{h}^{m}, which is the time-tmt_{m} numerical solution at the gridpoints, is used for the computation of vhm1v_{h}^{m-1}. The exact solution at an arbitrary point 𝐱m1=(𝐱,tm1)Ωin×{tm1}{\bf{x}}^{m-1}=({\bf{x}},t_{m-1})\in\Omega_{\scalebox{0.7}{\text{in}}}\times\{t_{m-1}\}, m=M,,1m=M,\ldots,1, is denoted by vm1(𝐱;vm)v^{m-1}({\bf{x}};v^{m}), where it is emphasized that vmv^{m}, which is the time-tmt_{m} exact solution in Ω\Omega, is used. More specifically, vhm1(𝐱h;vhm)v_{h}^{m-1}({\bf{x}}_{h};v_{h}^{m}) and vm1(𝐱;vm)v^{m-1}({\bf{x}};v^{m}) are defined via the operators 𝒟h()\mathcal{D}_{h}\left(\cdot\right) and 𝒟()\mathcal{D}(\cdot) as follows

vhm1(𝐱h;vhm)𝒟h(𝐱hm1;{vl,jm}),vm1(𝐱;vm)𝒟(𝐱m1;vm),m=M,,1.v_{h}^{m-1}\big{(}{\bf{x}}_{h};v_{h}^{m}\big{)}\coloneqq\mathcal{D}_{h}\big{(}{\bf{x}}_{h}^{m-1};\{v_{l,j}^{m}\}\big{)},\qquad v^{m-1}\big{(}{\bf{x}};v^{m}\big{)}\coloneqq\mathcal{D}\big{(}{\bf{x}}^{m-1};v^{m}\big{)},\quad m=M,\ldots,1. (5.21)

Here, our convention is that vh(𝐱M1;vhM)=vh(𝐱M1;vM)v_{h}\big{(}{\bf{x}}^{M-1};v_{h}^{M}\big{)}=v_{h}\big{(}{\bf{x}}^{M-1};v^{M}\big{)}.

The pointwise convergence of the proposed scheme is stated in the main theorem below.

Theorem 5.1 (Pointwise convergence).

Suppose that all the conditions for Lemma 5.1 and 5.2 are satisfied. Under the assumption that the infinite series truncation tolerance ϵ0\epsilon\to 0 as h0h\to 0, scheme (5.12) converges pointwise in Ωin×{tm1}\Omega_{\scalebox{0.7}{\text{in}}}\times\{t_{m-1}\}, m{M,,1}m\in\{M,\ldots,1\}, to the unique bounded solution of the MV portfolio optimization in Definition 3.1, i.e. for any m{M,,1}m\in\{M,\ldots,1\}, we have

vm1(𝐱;vm)=lim \Let@\restore@math@cr\default@tag h 0xh x vhm1(𝐱h;vhm), for 𝐱hΩinh,𝐱Ωin.{\color[rgb]{0,0,0}{v^{m-1}\left({\bf{x}};v^{m}\right)}}~{}=~{}\lim_{\vbox{\Let@\restore@math@cr\default@tag\halign{\hfil$\m@th\scriptstyle#$&$\m@th\scriptstyle{}#$\cr h&\to 0\\ {\bf{x}}_{h}&\to{\bf{x}}\crcr}}}v_{h}^{m-1}\left({\bf{x}}_{h};v_{h}^{m}\right),\quad\text{ for }{\bf{x}}_{h}\in\Omega_{\scalebox{0.7}{\text{in}}}^{h},\quad{\bf{x}}\in\Omega_{\scalebox{0.7}{\text{in}}}. (5.22)
Proof of Theorem 5.1.

By Proposition 3.1, there exists (bounded) ϕ𝒞(Ω×[0,T])\phi\in\mathcal{C}^{\infty}(\Omega\times[0,T]) such that, for any h>0h>0,

vϕv+h,in Ω×{tm},m=M,,0.v\leq\phi\leq v+h,\quad\text{in }\Omega\times\{t_{m}\},\quad m=M,\ldots,0. (5.23)

We then define

vhm1(𝐱h;ϕm)𝒟h(𝐱hm1;{ϕl,jm+}),vm1(𝐱;ϕm)𝒟(𝐱m1;ϕm),\displaystyle{\color[rgb]{0,0,0}{v_{h}^{m-1}\big{(}{\bf{x}}_{h};\phi^{m}\big{)}\coloneqq\mathcal{D}_{h}\big{(}{\bf{x}}_{h}^{m-1};\{\phi_{l,j}^{m+}\}\big{)},\quad v^{m-1}\big{(}{\bf{x}};\phi^{m}\big{)}\coloneqq\mathcal{D}\big{(}{\bf{x}}^{m-1};\phi^{m}\big{)},}}

noting our convention that ϕl,jm=ϕ(𝐱l,jm)\phi_{l,j}^{m}={\color[rgb]{0,0,0}{\phi({\bf{x}}_{l,j}^{m})}}. To show (5.22), we will prove by mathematical induction on mm the following result: for any m{M,,1}m\in\{M,\ldots,1\}, and for sequence {𝐱h}h>0\{{\bf{x}}_{h}\}_{h>0} such that 𝐱h𝐱{\bf{x}}_{h}\to{\bf{x}} as h0h\to 0,

|vhm1(𝐱h;vm)vm1(𝐱;vm)|χhm1,χhm1 is bounded h>0 and χhm10 as h0.\left|{\color[rgb]{0,0,0}{v_{h}^{m-1}\left({\bf{x}}_{h};v^{m}\right)-v^{m-1}\left({\bf{x}};v^{m}\right)}}\right|\leq\chi_{h}^{m-1},\quad\chi_{h}^{m-1}\text{ is bounded $\forall h>0$ and }\chi_{h}^{m-1}\to 0\text{ as }h\to 0. (5.24)

In the following proof, we let K1K_{1}, K2K_{2}, and K3K_{3} be generic positive constants independent of hh and ϵ\epsilon, which may take different values from line to line.

Base case m=Mm=M: by (5.23), we can write vMϕMvM+hv^{M}\leq\phi^{M}\leq v^{M}+h. Therefore, by monotonicity of the scheme and (5.1), we have

vhM1(𝐱h;vM)vhM1(𝐱h;ϕM)vhM1(𝐱h;vM+h)vhM1(𝐱h;vM)+K1h.\displaystyle v_{h}^{M-1}\left({\bf{x}}_{h};v^{M}\right)\leq v_{h}^{M-1}\left({\bf{x}}_{h};\phi^{M}\right)\leq v_{h}^{M-1}\left({\bf{x}}_{h};v^{M}+h\right)\leq v_{h}^{M-1}\left({\bf{x}}_{h};v^{M}\right)+K_{1}h. (5.25)

Using (5.25) and the triangle inequality gives

|vhM1(𝐱h;vM)vM1(𝐱;vM)|(5.25)|vhM1(𝐱h;ϕM)vM1(𝐱;ϕM)|+K1h\displaystyle\left|v_{h}^{M-1}\left({\bf{x}}_{h};v^{M}\right)-v^{M-1}\left({\bf{x}};v^{M}\right)\right|\overset{\eqref{eq:uh}}{\leq}\left|v_{h}^{M-1}\big{(}{\bf{x}}_{h};\phi^{M}\big{)}-v^{M-1}({\bf{x}};\phi^{M})\right|+K_{1}h
(i)|vhM1(𝐱h;ϕM)vM1(𝐱h;ϕM)|+|vM1(𝐱h;ϕM)vM1(𝐱;ϕM)|+K1h.\displaystyle\overset{(i)}{\leq}\left|v_{h}^{M-1}\left({\bf{x}}_{h};\phi^{M}\right)-v^{M-1}\left({\bf{x}}_{h};\phi^{M}\right)\right|+\left|v^{M-1}\left({\bf{x}}_{h};\phi^{M}\right)-v^{M-1}\left({\bf{x}};\phi^{M}\right)\right|+K_{1}h. (5.26)

By local consistency established in Lemma 5.2, we have

vhM1(𝐱h;ϕM)vM1(𝐱h;ϕM)=(𝐱hM1,ϵ,h)+𝒪(h).\displaystyle v_{h}^{M-1}\left({\bf{x}}_{h};\phi^{M}\right)-v^{M-1}\left({\bf{x}}_{h};\phi^{M}\right)=\mathcal{E}({\bf{x}}_{h}^{M-1},\epsilon,h)+\mathcal{O}(h). (5.27)

Due to smoothness of ϕ()\phi(\cdot) and regularity of g()g(\cdot), we have

|vM1(𝐱h;ϕM)vM1(𝐱;ϕM)|K1𝐱h𝐱.\displaystyle\left|v^{M-1}\left({\bf{x}}_{h};\phi^{M}\right)-v^{M-1}\left({\bf{x}};\phi^{M}\right)\right|\leq{\color[rgb]{0,0,0}{K_{1}\|{\bf{x}}_{h}-{\bf{x}}\|}}. (5.28)

Therefore, using (5.26), (5.27), (5.28), we can show that

|vhM1(𝐱h;vM)vM1(𝐱;vM)|χhM1,\displaystyle\left|v_{h}^{M-1}\left({\bf{x}}_{h};v^{M}\right)-v^{M-1}\left({\bf{x}};v^{M}\right)\right|\leq\chi_{h}^{M-1}, (5.29)
χhM1=K1h+𝒪(h)+|(𝐱hM1,ϵ,h)|+K2𝐱h𝐱0, as h0,\displaystyle\qquad\qquad\chi_{h}^{M-1}=K_{1}h+\mathcal{O}(h)+|\mathcal{E}({\bf{x}}_{h}^{M-1},\epsilon,h)|+K_{2}\|{\bf{x}}_{h}-{\bf{x}}\|\longrightarrow 0,\text{ as }h\to 0,

noting 𝐱h𝐱{\bf{x}}_{h}\to{\bf{x}} as h0h\to 0, and χhM1\chi_{h}^{M-1} is bounded for all h>0h>0.

Induction hypothesis: assume that, for some m{M,,2}m\in\{M,\ldots,2\}, we have

|vhm1(𝐱h;vhm)vm1(𝐱;vm)|χhm1, where χhm1 is bounded, χhm10 as h0.\left|v_{h}^{m-1}\left({\bf{x}}_{h};v_{h}^{m}\right)-v^{m-1}\left({\bf{x}};v^{m}\right)\right|\leq\chi_{h}^{m-1},\text{ where $\chi_{h}^{m-1}$ is bounded, $\chi_{h}^{m-1}\to 0$ as $h\to 0$}. (5.30)

Induction step: By the triangle inequality, we have |vhm2(𝐱h;vhm1)vm2(𝐱;vm1)|\left|v_{h}^{m-2}\left({\bf{x}}_{h};v_{h}^{m-1}\right)-v^{m-2}\left({\bf{x}};v^{m-1}\right)\right|\leq\ldots

|vhm2(𝐱h;vhm1)vhm2(𝐱h;vm1)|+|vhm2(𝐱h;vm1)vm2(𝐱;vm1)|.\displaystyle\ldots\leq\left|v_{h}^{m-2}\left({\bf{x}}_{h};v_{h}^{m-1}\right)-{\color[rgb]{0,0,0}{v_{h}^{m-2}}}\left({\bf{x}}_{h};v^{m-1}\right)\right|+\left|{\color[rgb]{0,0,0}{v_{h}^{m-2}}}\left({\bf{x}}_{h};v^{m-1}\right)-v^{m-2}\left({{\bf{x}}};v^{m-1}\right)\right|. (5.31)

Now, we examine the first term (5.31). By the induction hypothesis (5.30), |vhm1vm1|χhm1|v_{h}^{m-1}-v^{m-1}|\leq\chi_{h}^{m-1}, where χhm10\chi_{h}^{m-1}\to 0 as h0h\to 0. Therefore, the first term in (5.31) can be bounded by

|vhm2(𝐱h;vhm1)vhm2(𝐱h;vm1)|(i)χh=𝒪(h+χhm1)+|(𝐱hm2,ϵ,h)|0 as h0.\displaystyle\left|v_{h}^{m-2}\left({\bf{x}}_{h};v_{h}^{m-1}\right)-{\color[rgb]{0,0,0}{v_{h}^{m-2}}}\left({\bf{x}}_{h};v^{m-1}\right)\right|\overset{\text{(i)}}{\leq}\chi_{h}^{\prime}={\color[rgb]{0,0,0}{\mathcal{O}(h+\chi_{h}^{m-1})}}+|\mathcal{E}({\bf{x}}_{h}^{m-2},\epsilon,h)|\to 0\text{ as }h\to 0. (5.32)

Here, (i) follows from the local consistency of the numerical scheme established in Lemma 5.2. Next, we focus on the second term. Using the same arguments for the base case m=Mm=M (see (5.29), with MM being replaced by mm), the second term in (5.31) can be bounded by χh′′\chi_{h}^{\prime\prime}, where χh′′0\chi_{h}^{\prime\prime}\to 0 as h0h\to 0. Here, we note that vhm1v_{h}^{m-1} is bounded for all h>0h>0 by Lemma 5.1 on stability. Combining this with (5.32), we have |vhm2(𝐱h;vhm1)vm2(𝐱;vm1)|χhm2\left|v_{h}^{m-2}\left({\bf{x}}_{h};{\color[rgb]{0,0,0}{v_{h}^{m-1}}}\right)-v^{m-2}\left({\bf{x}};v^{m-1}\right)\right|\leq\chi_{h}^{m-2}, where χhm2\chi_{h}^{m-2} is bounded for all h>0h>0, and χhm20\chi_{h}^{m-2}\to 0 as h0h\to 0. This concludes the proof. ∎

Remark 5.2 (Convergence on an infinite domain).

It is possible to develop a numerical scheme that converges to the solution of the theoretical formulation (3.5), in particular to the convolution integral (3.5d), which is posed on an infinite domain. This can be achieved by imposing a requirement on the discretization parameter hh (in addition to the assumption in (4.3)) as follows:

smaxsmin=C3/h, where C3>0 is independent of h.s_{\max}-s_{\min}=C_{3}^{\prime}/h,\text{ where $C_{3}^{\prime}>0$ is independent of $h$}. (5.33)

As such, as h0h\to 0, we have |smin|,smax|s_{\min}|,s_{\max}\to\infty (implying |smin|,smax,|smin|,smax|s_{\min}^{\dagger}|,s_{\max}^{\dagger},|s_{\min}^{\ddagger}|,s_{\max}^{\ddagger}\to\infty as well). It is straightforward to ensure (4.3) and (5.33) simultaneously as hh is refined. For example, with C3=1C_{3}^{\prime}=1, we can quadruple NN^{\dagger} and sextuple NN^{\ddagger} as hh is halved. Nonetheless, with |smin||s_{\min}| and smaxs_{\max} chosen sufficiently large, as in our extensive experiments, numerical solutions in the interior (Ωin\Omega_{\scalebox{0.7}{\text{in}}}) are virtually unaffected by the boundary conditions.

6 Continuously observed impulse control MV portfolio optimization

Recall that Δt=tm+1tm\Delta t=t_{m+1}-t_{m} is the rebalancing time interval. In this section, we intuitively demonstrate that, as Δt0\Delta t\to 0, the proposed numerical scheme converges to the viscosity solution [16, 3] of an impulse control formulation of the continuously rebalanced MV portfolio optimization problem in [19]. A rigorous analysis of convergence to the viscosity solution of this impulse formulation is the topic of a paper in progress.

The impulse formulation proposed in [19] takes the form of a Hamilton-Jacobi-Bellman Quasi-Variational Inequality (HJB-QVI), as follows (boundary conditions omitted for brevity)

max{vtv𝒥vR(b)bvb,vinfc𝒵v(s+(s,b,c),c,t)}=0\displaystyle\max\left\{-v_{t}-\mathcal{L}v-\mathcal{J}v-R(b)bv_{b},\,v-\inf_{c\in\mathcal{Z}}v(s^{+}(s,b,c),c,t)\right\}=0 (s,b,t)𝒩×[0,T),\displaystyle~{}(s,b,t)\in\mathcal{N}\times[0,T), (6.1a)
v(s,b,t)=(Wliq(s,b)γ2)2,\displaystyle v\left(s,b,t\right)~{}=~{}\left({\color[rgb]{0,0,0}{W_{\text{liq}}(s,b)}}-\frac{\gamma}{2}\right)^{2}, t=T,\displaystyle~{}t=T, (6.1b)

where ()\mathcal{L}(\cdot) and 𝒥()\mathcal{J}(\cdot) respectively are the differential and jump operators defined as follows

vσ22vss+(μλκσ22)vsλv,𝒥v=v(s+y,b,t)p(y)dy.\displaystyle\mathcal{L}v\coloneqq\frac{\sigma^{2}}{2}v_{ss}+\left(\mu-\lambda\kappa-\frac{\sigma^{2}}{2}\right)v_{s}-{\color[rgb]{0,0,0}{\lambda v}},\qquad\mathcal{J}v=\int_{-\infty}^{\infty}v(s+y,b,t)p(y)\,\mathrm{d}y.

An \ell_{\infty}-stable and consistent finite difference scheme for the HJB-QVI (6.1) is presented in [19], in which monotonicity is ensured via a positive coefficient method. Therefore, convergence of this finite difference scheme to the viscosity solution of the HJB-QVI is guaranteed [16, 3].

To intuitively see that the proposed scheme (5.12) is consistent in the viscosity sense with the impulse formulation (6.1) in Ωin×{tm1}\Omega_{\scalebox{0.7}{\text{in}}}\times\{t_{m-1}\}, m=M,,1m=M,\ldots,1, we write (5.12) for 𝐱n,jm1Ωin×{tm1}{\color[rgb]{0,0,0}{{\bf{x}}_{n,j}^{m-1}}}\in\Omega_{\scalebox{0.7}{\text{in}}}\times\{t_{m-1}\} via 𝒢h(𝐱n,jm1,{vl,jm}l=N/2N/2)\mathcal{G}_{h}\left({\bf{x}}_{n,j}^{m-1},\left\{v_{l,j}^{m}\right\}_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\right), where

0=𝒢h()max{vn,jm1vn,j(m1)+ΔtA1,vn,jm1minb+𝒵vh(s+(sn,bj,b+),b+,tm1+)A2},0=\mathcal{G}_{h}\left(\cdot\right)\coloneqq\max\left\{\underbrace{\frac{v_{n,j}^{m-1}-v_{n,j}^{(m-1)+}}{\Delta t}}_{A_{1}},\,\underbrace{v_{n,j}^{m-1}-\min_{b^{+}\in\mathcal{Z}}v_{h}\left(s^{+}(s_{n},b_{j},b^{+}),\,b^{+},\,t_{m-1}^{+}\right)}_{A_{2}}\right\}, (6.2)

where vn,j(m1)+v_{n,j}^{(m-1)+} and vh(s+(sn,bj,b+),b+,tm1+)v_{h}\left(s^{+}(s_{n},b_{j},b^{+}),\,b^{+},\,t_{m-1}^{+}\right) are respectively given by

vn,j(m1)+\displaystyle v_{n,j}^{(m-1)+} =Δsl=N/2N/2ωlgnl(Δt;Kϵ)vh(sl,bjeR(bj)Δt,tm),\displaystyle=\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}\,g_{n-l}(\Delta t;K_{\epsilon})\,~{}v_{h}\left(s_{l},b_{j}e^{R(b_{j})\Delta t},t_{m}\right),
vh(s+(sn,bj,b+),b+,tm1+)\displaystyle v_{h}\left(s^{+}(s_{n},b_{j},b^{+}),\,b^{+},\,t_{m-1}^{+}\right) =Δsl=N/2N/2ωlgnl(Δt;Kϵ)vh(s+(sl,bjeR(bj)Δt,b+),b+,tm).\displaystyle=\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}\,g_{n-l}(\Delta t;K_{\epsilon})\,~{}v_{h}\left(s^{+}(s_{l},b_{j}e^{R(b_{j})\Delta t},b^{+}),\,b^{+},\,t_{m}\right).
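For reference, each of the sums above is a composite-quadrature approximation of a convolution integral: the matrix with entries ωlgnl(Δt;Kϵ)\omega_{l}\,g_{n-l}(\Delta t;K_{\epsilon}) is a Toeplitz matrix scaled by a diagonal of quadrature weights, so all entries nn can be evaluated simultaneously with FFTs, which underlies the FFT implementation of the scheme exploiting the Toeplitz structure (cf. Section 8). The sketch below is a minimal illustration of this idea only, assuming the kernel values gk(Δt;Kϵ)g_{k}(\Delta t;K_{\epsilon}) have been precomputed for all required shifts kk; the function and variable names are illustrative, and the treatment of the padded boundary sub-domains is omitted.

```python
import numpy as np

def convolve_fft(g_kernel, omega, values, ds):
    """Evaluate y[n] = ds * sum_l omega[l] * g_kernel[n - l] * values[l], n = 0..N-1, via FFT.

    The matrix (omega[l] * g_kernel[n - l]) is a Toeplitz matrix times a diagonal of
    quadrature weights, so we scale the values by the weights first and then perform
    one zero-padded linear convolution.

    g_kernel : array of length 2*N - 1 holding g_k for shifts k = -(N-1), ..., N-1,
               stored at index k + (N - 1) (assumed precomputed, truncated at K_eps).
    omega, values : arrays of length N (quadrature weights and grid values)."""
    N = len(values)
    L = 3 * N - 2                           # >= full linear-convolution length, no wrap-around
    K = np.fft.rfft(g_kernel, L)            # zero-padded FFTs
    V = np.fft.rfft(omega * values, L)
    full = np.fft.irfft(K * V, L)           # full[m] = sum_l g_kernel[m - l] * (omega*values)[l]
    return ds * full[N - 1:2 * N - 1]       # entry n corresponds to m = n + (N - 1)

# Quick check against the direct double loop on a small random example.
rng = np.random.default_rng(0)
N, ds = 8, 0.1
g = rng.random(2 * N - 1); w = rng.random(N); v = rng.random(N)
direct = ds * np.array([sum(w[l] * g[(n - l) + (N - 1)] * v[l] for l in range(N))
                        for n in range(N)])
assert np.allclose(direct, convolve_fft(g, w, v, ds))
```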

For a smooth test function ϕ\phi and a constant χ\chi, term A1A_{1} in (6.2) of 𝒢h(𝐱n,jm1,{ϕl,jm+χ}l=N/2N/2)\mathcal{G}_{h}\left({\bf{x}}_{n,j}^{m-1},\left\{\phi_{l,j}^{m}+\chi\right\}_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\right) becomes

1Δt(ϕn,jm1Δsl=N/2N/2ωlgnl(Δt;Kϵ)ϕh(sl,bjeR(bj)Δt,tm))B1+χΔt(1Δsl=N/2N/2ωlgnl(Δt;Kϵ))B2.\displaystyle\underbrace{\frac{1}{\Delta t}\left(\phi_{n,j}^{m-1}-\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}\,g_{n-l}(\Delta t;K_{\epsilon})\,~{}\phi_{h}\left(s_{l},b_{j}e^{R(b_{j})\Delta t},\,t_{m}\right)\right)}_{B_{1}}\,+\underbrace{\frac{\chi}{\Delta t}\left(1-\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}\,g_{n-l}(\Delta t;K_{\epsilon})\right)}_{B_{2}}. (6.3)

Now, we first examine the term B1B_{1}. Noting that

ϕh(sl,bjeR(bj)Δt,tm)\displaystyle\phi_{h}\left(s_{l},b_{j}e^{R(b_{j})\Delta t},\,t_{m}\right) =ϕl,jm+R(bj)bj(ϕb)l,jmΔt+𝒪(h2),\displaystyle=\phi_{l,j}^{m}+R(b_{j})b_{j}\,(\phi_{b})_{l,j}^{m}\,\Delta t+\mathcal{O}(h^{2}),
s+(sl,bjeR(bj)Δt,b+)\displaystyle s^{+}(s_{l},b_{j}e^{R(b_{j})\Delta t},b^{+}) =s+(sl,bj,b+)+𝒪(h),\displaystyle=s^{+}(s_{l},b_{j},b^{+})+\mathcal{O}(h),

we have term-B1B_{1} of (6.3) =1Δt(ϕn,jm1Δsl=N/2N/2ωlgnl(Δt;Kϵ)ϕh(sl,bjeR(bj)Δt,tm))==\displaystyle\frac{1}{\Delta t}\left(\phi_{n,j}^{m-1}-\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}\,g_{n-l}(\Delta t;K_{\epsilon})\,~{}\phi_{h}\left(s_{l},b_{j}e^{R(b_{j})\Delta t},\,t_{m}\right)\right)=\ldots

=1Δt(ϕn,jm1Δsl=N/2N/2ωlgnl(Δt;Kϵ)ϕl,jm)Δsl=N/2N/2ωlgnl(Δt;Kϵ)R(bj)bj(ϕb)l,jm+𝒪(h).\displaystyle\ldots=\frac{1}{\Delta t}\left(\phi_{n,j}^{m-1}-\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}\,g_{n-l}(\Delta t;K_{\epsilon})\phi_{l,j}^{m}\right)-\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}\,g_{n-l}(\Delta t;K_{\epsilon})R(b_{j})b_{j}(\phi_{b})_{l,j}^{m}+\mathcal{O}(h). (6.4)

Using techniques similar to those in [44][Lemma 5.4], it is possible to show that

Δsl=N/2N/2ωlgnl(Δt;Kϵ)ϕl,jm=ϕn,jm+Δt[ϕ+𝒥ϕ]n,jm+𝒪(h2)+𝒪(ϵ).\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2}\omega_{l}\,g_{n-l}(\Delta t;K_{\epsilon})\phi_{l,j}^{m}=\phi_{n,j}^{m}+\Delta t\left[\mathcal{L}\phi+\mathcal{J}\phi\right]_{n,j}^{m}+\mathcal{O}(h^{2})+{\color[rgb]{0,0,0}{\mathcal{O}\left(\epsilon\right)}}. (6.5)

By choosing ϵ=𝒪(h2)\epsilon=\mathcal{O}(h^{2}), from (6.5), and noting ϕn,jm1ϕn,jm=(ϕt)n,jmΔt+𝒪((Δt)2)\phi_{n,j}^{m-1}-\phi_{n,j}^{m}=-(\phi_{t})_{n,j}^{m}\Delta t+\mathcal{O}\left((\Delta t)^{2}\right), the first term of (6.4) can be written as

[ϕtϕ𝒥ϕ]n,jm+𝒪(h).\left[-\phi_{t}-\mathcal{L}\phi-\mathcal{J}\phi\right]_{n,j}^{m}+\mathcal{O}\left(h\right). (6.6)

The second term of (6.4) can be simplified, as in [44][Lemma 5.4], to

Δsl=N/2N/21ωlgnl(Δt;Kϵ)R(bj)bj(ϕb)l,jm=R(bj)bj(ϕb)n,jm+𝒪(h).\Delta s\sum_{l=-N^{\dagger}/2}^{N^{\dagger}/2-1}\omega_{l}\,g_{n-l}(\Delta t;K_{\epsilon})R(b_{j})b_{j}(\phi_{b})_{l,j}^{m}=R(b_{j})b_{j}(\phi_{b})_{n,j}^{m}+\mathcal{O}\left(h\right). (6.7)

Putting (6.6)-(6.7) into (6.4), noting that term-B2B_{2} in (6.3) has the form 𝒪(χ)\mathcal{O}(\chi), we have

term A1 in (6.2)=[ϕtϕ𝒥ϕR(b)bϕb]n,jm+𝒪(h)+𝒪(χ).\text{term }A_{1}\text{ in }\eqref{eq:QQ}=\left[-\phi_{t}-\mathcal{L}\phi-\mathcal{J}\phi-R(b)b\phi_{b}\right]_{n,j}^{m}+\mathcal{O}\left(h\right)+{\color[rgb]{0,0,0}{\mathcal{O}\left(\chi\right)}}. (6.8)

Term A2A_{2} in (6.2) can be handled using similar steps as in (5.16)-(5.18). Thus, (6.2) becomes

0=max{[ϕtϕ𝒥ϕR(b)bϕb]n,jm1,[ϕinfc𝒵(c)ϕ]n,jm1}+(𝐱n,j(m1)+,h)+𝒪(χ+h).0=\max\left\{\big{[}-\phi_{t}-\mathcal{L}\phi-\mathcal{J}\phi-R(b)b\phi_{b}\big{]}_{n,j}^{m-1},\,\big{[}\phi-\inf_{c\in\mathcal{Z}}\mathcal{M}(c)\phi\big{]}_{n,j}^{m-1}\right\}+\mathcal{E}\left({\bf{x}}_{n,j}^{(m-1)+},h\right)+\mathcal{O}\left(\chi+h\right).

This shows the local consistency of the proposed scheme with (6.1a), that is,

𝒢h(𝐱n,j(m1),{ϕl,jm+χ}l=N/2N/21)=[ϕl,jm]+(𝐱n,j(m1)+,h)+𝒪(χ+h).\mathcal{G}_{h}\left({\bf{x}}_{n,j}^{(m-1)},\left\{\phi_{l,j}^{m}+\chi\right\}_{l=-N^{\dagger}/2}^{N^{\dagger}/2-1}\right)=\mathcal{F}\left[\phi_{l,j}^{m}\right]+\mathcal{E}\left({\bf{x}}_{n,j}^{(m-1)+},h\right)+\mathcal{O}\left(\chi+h\right). (6.9)

Together with the \ell_{\infty}-stability and monotonicity of the proposed scheme, it is possible to utilize a Barles-Souganidis analysis [3] to show convergence to the viscosity solution of the impulse formulation (6.1) as h0h\to 0. We leave this for future work.

7 Numerical examples

7.1 Empirical data and calibration

To parameterize the underlying asset dynamics, we use the same calibration data and techniques as detailed in [20, 27]. We briefly summarize the empirical data sources. The risky asset data is based on daily total return data (including dividends and other distributions) for the period 1926-2014 from the CRSP’s VWD index666Calculations were based on data from the Historical Indexes 2015©, Center for Research in Security Prices (CRSP), The University of Chicago Booth School of Business. Wharton Research Data Services was used in preparing this article. This service and the data available thereon constitute valuable intellectual property and trade secrets of WRDS and/or its third party suppliers. , which is a capitalization-weighted index of all domestic stocks on major US exchanges. The risk-free rate is based on 3-month US T-bill rates777Data has been obtained from http://research.stlouisfed.org/fred2/series/TB3MS. over the period 1934-2014, and has been augmented with the NBER’s short-term government bond yield data888Obtained from the National Bureau of Economic Research (NBER) website,
http://www.nber.org/databases/macrohistory/contents/chapter13.html.
for 1926-1933 to incorporate the impact of the 1929 stock market crash. Prior to calculations, all time series were inflation-adjusted using data from the US Bureau of Labor Statistics999The annual average CPI-U index, which is based on inflation data for urban consumers, was used - see http://www.bls.gov.cpi .

The calibration of the jump models is based on the thresholding technique of [15, 14], using the approach of [20, 27], which, in contrast to maximum likelihood estimation of jump model parameters, avoids problems such as ill-posedness and multiple local maxima101010If ΔX^i\Delta\hat{X}_{i} denotes the iith inflation-adjusted, detrended log return in the historical risky asset index time series, a jump is identified in period ii if |ΔX^i|>ασ^Δt\left|\Delta\hat{X}_{i}\right|>\alpha\hat{\sigma}\sqrt{\Delta t}, where σ^\hat{\sigma} is an estimate of the diffusive volatility, Δt\Delta t is the time period over which the log return has been calculated, and α\alpha is a threshold parameter used to identify a jump. For both the Merton and Kou models, the parameters in Table 7.2 are based on a value of α=3\alpha=3, which means that a jump is only identified in the historical time series if the absolute value of the inflation-adjusted, detrended log return in that period exceeds 3 standard deviations of the “geometric Brownian motion change”, a highly unlikely event. In the case of GBM, standard maximum likelihood techniques are used. The calibrated parameters are provided in Table 7.2 (reproduced from [65, 66][Table 5.1]).
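As a rough illustration of the thresholding idea described in the footnote above (and not the exact calibration procedure of [15, 14, 20, 27]), the sketch below flags period ii as containing a jump whenever |ΔX^i|>ασ^Δt|\Delta\hat{X}_{i}|>\alpha\hat{\sigma}\sqrt{\Delta t}, re-estimating the diffusive volatility σ^\hat{\sigma} from the non-jump returns at each pass; the detrending step is deliberately crude and all names are illustrative.

```python
import numpy as np

def flag_jumps(log_returns, dt, alpha=3.0, n_iter=5):
    """Flag periods whose (crudely) detrended log return exceeds alpha diffusive
    standard deviations, re-estimating the diffusive volatility from the
    non-jump returns at each pass (a simple thresholding sketch)."""
    x = np.asarray(log_returns) - np.mean(log_returns)     # crude detrending
    jumps = np.zeros(len(x), dtype=bool)
    sigma_hat = np.std(x) / np.sqrt(dt)
    for _ in range(n_iter):
        sigma_hat = np.std(x[~jumps]) / np.sqrt(dt)         # diffusive vol estimate
        new_jumps = np.abs(x) > alpha * sigma_hat * np.sqrt(dt)
        if np.array_equal(new_jumps, jumps):
            break
        jumps = new_jumps
    return jumps, sigma_hat
```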

Ref. level ss-grid (NN) bb-grid (JJ)
0 128 25
1 256 50
2 512 100
3 1024 200
4 2048 400
Table 7.1: Grid refinement levels for convergence analysis; N=2NN^{\dagger}=2N and N=3NN^{\ddagger}=3N.
Parameters Merton Kou
μ\mu (drift) 0.0817 0.0874
σ\sigma (diffusion volatility) 0.1453 0.1452
λ\lambda (jump intensity) 0.3483 0.3483
μ~\widetilde{\mu}(log jump multiplier mean) -0.0700 n/a
σ~\widetilde{\sigma}(log jump multiplier stdev) 0.1924 n/a
q1q_{1} (probability of an up-jump) n/a 0.2903
η1\eta_{1} (exponential parameter up-jump) n/a 4.7941
η2\eta_{2} (exponential parameter down-jump) n/a 5.4349
rbr_{b} (borrowing interest rate) 0.00623 0.00623
rιr_{\iota} (lending interest rate) 0.00623 0.00623
Table 7.2: Calibrated risky and risk-free asset process parameters. Reproduced from [65, 66][Table 5.1].

For all experiments, unless otherwise noted, we use bmax=1000b_{\max}=1000 and initial wealth w0=10w_{0}=10. We set smin=10+ln(w0)s_{\min}=-10+\ln(w_{0}), smax=5+ln(w0)s_{\max}=5+\ln(w_{0}), s=smin=17.5+ln(w0)s_{-\infty}=s_{\min}^{\dagger}=-17.5+\ln(w_{0}), and smax=12.5+ln(w0)s_{\max}^{\dagger}=12.5+\ln(w_{0}), so that smin=22.5s_{\min}^{\ddagger}=-22.5 and smax=22.5s_{\max}^{\ddagger}=22.5.

For the user-defined tolerance ϵ\epsilon used for the truncation of the infinite series representation of g()g(\cdot) in (4.15), we use ϵ=1020\epsilon=10^{-20}, which is satisfied in the discrete rebalancing examples when Kϵ=15K_{\epsilon}=15 (i.e. the number of terms retained in the truncated series of g()g(\cdot) is 15).
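As an illustration of how such a truncation level relates to the tolerance, note that the kk-th term of the series representation of g()g(\cdot) carries the Poisson-type factor eλΔt(λΔt)k/k!e^{-\lambda\Delta t}(\lambda\Delta t)^{k}/k! (cf. (C.1) in Appendix C with θ=λΔt\theta=-\lambda\Delta t), and the remaining factor is bounded, so the tail of the series beyond KK terms is controlled by the corresponding Poisson tail. The sketch below computes the smallest such KK under this (illustrative) tail bound; it is not necessarily the paper's exact criterion in (4.15).

```python
import math

def truncation_level(lam, dt, eps):
    """Smallest K such that a geometric bound on sum_{k>K} e^{-lam*dt}(lam*dt)^k/k!
    falls below eps; the k-th term of the series for g is bounded by a constant
    times this Poisson-type weight (an illustrative criterion only)."""
    mu = lam * dt
    term = math.exp(-mu)                 # Poisson-type weight for k = 0
    k = 0
    while True:
        next_term = term * mu / (k + 1)  # weight for k + 1
        # geometric bound on the tail beyond k, valid once mu < k + 2
        tail_bound = next_term / (1.0 - mu / (k + 2)) if mu < k + 2 else float("inf")
        if tail_bound <= eps:
            return k
        term = next_term
        k += 1

# With the Kou parameters of Table 7.2 (lambda ~ 0.3483), dt = 1 and eps = 1e-20,
# this bound yields K = 15, consistent with K_eps = 15 quoted above.
print(truncation_level(0.3483, 1.0, 1e-20))
```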

7.2 Validation examples

Since the PCMV portfolio optimization problem under a solvency condition (no bankruptcy allowed) and a maximum leverage condition does not admit a known analytical solution, we rely on existing numerical methods, namely (i) finite differences [19, 21] and (ii) Monte Carlo (MC) simulation, to verify our results. For brevity, we will refer to the proposed monotone integration method as the “MI” method, and to the finite difference method of [19, 21] as the “FD” method.

As noted earlier, to the best of our knowledge, the FD methods proposed in [19, 21] are the only existing FD methods for MV optimization under jump-diffusion dynamics with investment constraints. In a discrete rebalancing setting, FD methods typically involve solving, between two consecutive rebalancing times, a Partial Integro Differential Equation (PIDE), where the amount invested in the risky asset z=esz=e^{s} is the independent variable. These FD methods achieve monotonicity in time-advancement through a positive coefficient finite difference discretization method (for the partial derivatives), which is combined with implicit timestepping. Optimal strategies are obtained by solving an optimization problem across rebalancing times. Despite their effectiveness, finite difference methods present significant computational challenges in multi-period settings with very long maturities, as encountered in DC plans. In particular, they necessitate time-stepping between rebalancing dates (i.e., control monitoring dates), which often occur annually. This time-stepping requirement introduces errors and substantially increases the computational cost of FD methods (as noted earlier in Remark 4.3). In the numerical experiments, the FD results are obtained on the same computational domain as those obtained by the MI method, with the number of partition points in the zz- and bb-grids being 20482048 and 400400, respectively, and with 256 timesteps between yearly rebalancing times. (This approximately corresponds to daily timestepping.)

Validation against MC simulation proceeds in two steps. In Step 1, we solve the PCMV problem using the MI method on a relatively fine computational grid: the number of partition points in the ss- and bb-grids are N=1024N=1024 and J=200J=200, respectively. During this step, the optimal controls are stored for each discrete state value and rebalancing time tm𝒯Mt_{m}\in\mathcal{T}_{M}. In Step 2, we carry out Monte Carlo simulations with 10610^{6} paths from t=0t=0 to t=Tt=T following these stored numerical optimal strategies for asset allocation across each tm𝒯Mt_{m}\in\mathcal{T}_{M}, using linear interpolation, if necessary, to determine the controls for a given state value. For Step 2, Euler timestepping is used between consecutive rebalancing times, with a total of 180180 timesteps.
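A minimal sketch of Step 2 under the Merton model is given below, assuming the stored Step-1 controls are exposed as a function controls(t, w) returning the amount to invest in the risky asset (e.g. a linear interpolant of the stored grid controls), that μ\mu is the total expected return with compensator κ=E[eξ]1\kappa=E[e^{\xi}]-1 as in the operators of Section 6, and that rr is the (equal) borrowing/lending rate of Table 7.2; constraint enforcement and other details are simplified relative to the actual experiments, and all names are illustrative.

```python
import numpy as np

def mc_validate(controls, rebal_times, T, n_steps, n_paths, w0,
                mu, sigma, lam, mu_j, sig_j, r, seed=0):
    """Step-2 style forward Monte Carlo: follow stored rebalancing controls and
    use Euler timestepping with Merton-type lognormal jumps in between.
    `controls(t, w)` returns the amount to hold in the risky asset at a
    rebalancing time t given wealth w.  Illustrative only."""
    rng = np.random.default_rng(seed)
    dt = T / n_steps
    kappa = np.exp(mu_j + 0.5 * sig_j**2) - 1.0              # E[e^xi] - 1
    rebal_steps = {int(round(t / dt)) for t in rebal_times}
    risky = np.zeros(n_paths)
    bank = np.full(n_paths, float(w0))
    for step in range(n_steps):
        if step in rebal_steps:                              # rebalance at t_m
            wealth = risky + bank
            risky = np.array([controls(step * dt, w) for w in wealth])
            bank = wealth - risky
        z = rng.standard_normal(n_paths)
        n_jumps = rng.poisson(lam * dt, n_paths)
        jump_mult = np.exp(mu_j * n_jumps
                           + sig_j * np.sqrt(n_jumps) * rng.standard_normal(n_paths))
        # Euler step for the amount in the risky asset: total drift mu, compensated jumps
        risky = risky * (1.0 + (mu - lam * kappa) * dt + sigma * np.sqrt(dt) * z) * jump_mult
        bank *= np.exp(r * dt)
    wealth = risky + bank
    return wealth.mean(), wealth.std()
```

With n_paths equal to 10^6 and n_steps equal to 180, this mirrors the Monte Carlo setup described above.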

7.2.1 Discrete rebalancing

For the discrete rebalancing experiments, we use T={20,30}T=\{20,30\} (years), with Δt=1\Delta t=1 year (yearly rebalancing). The details of the mesh size/timestep refinement levels used are given in Table 7.1.

Table 7.3 presents the numerically computed E𝒞0x0,t0[WT]E_{\mathcal{C}_{0}}^{x_{0},t_{0}}\left[W_{T}\right] and Std𝒞0x,t0[WT]Std_{\mathcal{C}_{0}}^{x,t_{0}}\left[W_{T}\right] under the Kou model obtained at different refinement levels with γ=100\gamma=100, for T=20T=20 and T=30T=30 (years). To provide an estimate of the convergence rate of the proposed MI method, we compute the “Change” as the difference in values between successive refinement levels and the “Ratio” as the ratio of successive changes. For validation purposes, E𝒞0x0,t0[WT]E_{\mathcal{C}_{0}}^{x_{0},t_{0}}\left[W_{T}\right] and Std𝒞0x,t0[WT]Std_{\mathcal{C}_{0}}^{x,t_{0}}\left[W_{T}\right] obtained by the FD method, as well as those obtained by MC simulation, together with 95% confidence intervals (CIs), are also provided. As evident from Table 7.3, the means and standard deviations obtained by the MI method exhibit excellent agreement with those obtained by the FD method and MC simulation.
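For concreteness, the “Change” and “Ratio” columns in Tables 7.3-7.4 (and 7.7-7.8) can be reproduced from a sequence of values across refinement levels as sketched below (the tables report the magnitudes of the changes); a minimal sketch.

```python
def change_and_ratio(values):
    """Given values v_0, v_1, ... across successive refinement levels, return the
    "Change" column (magnitude of the difference between successive levels) and
    the "Ratio" column (ratio of successive changes); a ratio near 2 is consistent
    with approximately first-order convergence when the discretization parameter
    is halved between levels."""
    changes = [abs(v1 - v0) for v0, v1 in zip(values, values[1:])]
    ratios = [c0 / c1 for c0, c1 in zip(changes, changes[1:])]
    return changes, ratios

# Example: the E[W_T] column of Table 7.3 (Kou model, T = 20 years)
vals = [35.7768, 35.9828, 36.1051, 36.1893, 36.2182]
chg, rat = change_and_ratio(vals)
# chg ~ [0.2060, 0.1223, 0.0842, 0.0289];  rat ~ [1.7, 1.5, 2.9]
```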

Results for the Merton jump case with T=20T=20 and T=30T=30 (years) are presented in Table 7.4, and similar observations can be made.

Table 7.3: PCMV under the Kou model with parameters in the Table 7.2; γ=100\gamma=100; solvency constraint applied; maximum leverage constraint applied with qmax=1.5q_{\max}=1.5; T={20,30}T=\{20,30\} years.
TT Method Ref. level E𝒞0x0,t0[WT]E_{\mathcal{C}_{0}}^{x_{0},t_{0}}\left[W_{T}\right] Change Ratio Std𝒞0x,t0[WT]Std_{\mathcal{C}_{0}}^{x,t_{0}}\left[W_{T}\right] Change Ratio
0 35.776835.7768 16.245116.2451
11 35.982835.9828 0.20600.2060 15.780115.7801 0.46500.4650
MI 22 36.105136.1051 0.12230.1223 1.71.7 15.619215.6192 0.16090.1609 2.92.9
33 36.189336.1893 0.08420.0842 1.51.5 15.599515.5995 0.01970.0197 8.28.2
20 44 36.218236.2182 0.02890.0289 2.92.9 15.594215.5942 0.00530.0053 3.73.7
(years) MC 36.199436.1994 15.596515.5965
95%95\% CI [36.1688,36.2299][36.1688,36.2299]
FD 36.220836.2208 15.593815.5938
0 41.985641.9856 14.120714.1207
11 42.162642.1626 0.17700.1770 13.235913.2359 0.88480.8848
MI 22 42.277642.2776 0.11500.1150 1.51.5 12.954412.9544 0.28150.2815 3.13.1
33 42.346242.3462 0.06860.0686 1.71.7 12.877212.8772 0.07720.0772 3.63.6
30 44 42.383442.3834 0.03720.0372 1.81.8 12.856912.8569 0.02030.0203 3.83.8
(years) MC 42.382742.3827 12.851212.8512
95%95\% CI [42.3575,42.4079][42.3575,42.4079]
FD 42.385042.3850 12.852012.8520

Figure 7.1 presents efficient frontiers for the Merton model (Figure 7.1 (a)) and the Kou model (Figure 7.1 (b)) obtained by the MI method at refinement level 3 (N=1024N=1024 and J=200J=200). We observe that the efficient frontiers produced by the MI method agree well with those obtained by the FD method.

Figure 7.1: Efficient frontiers: (a) Merton, T=20T=20, Δt=1\Delta t=1; (b) Kou, T=20T=20, Δt=1\Delta t=1. Parameters in Table 7.2; solvency constraint applied; maximum leverage constraint applied with qmax=1.5q_{\max}=1.5; refinement level 33.
Table 7.4: PCMV under the Merton model with parameters in the Table 7.2; γ=250\gamma=250; liquidate risky asset when insolvency; qmax=3.0q_{\max}=3.0; T={20,30}T=\{20,30\} years.
Method Ref. level E𝒞0x0,t0[WT]E_{\mathcal{C}_{0}}^{x_{0},t_{0}}\left[W_{T}\right] Change Ratio Std𝒞0x,t0[WT]Std_{\mathcal{C}_{0}}^{x,t_{0}}\left[W_{T}\right] Change Ratio
0 72.022572.0225 46.604346.6043
11 73.206373.2063 1.18381.1838 46.459446.4594 0.14490.1449
MI 22 73.738673.7386 0.53230.5323 2.22.2 46.505946.5059 0.04650.0465 3.13.1
33 73.927773.9277 0.18910.1891 2.82.8 46.517246.5172 0.01130.0113 4.14.1
20 44 73.991673.9916 0.06390.0639 3.03.0 46.524746.5247 0.00750.0075 1.51.5
(years) MC 73.904173.9041 46.524946.5249
95%95\% CI [73.8129,73.9953][73.8129,73.9953]
FD 73.9518 46.5288
0 91.797091.7970 42.349342.3493
11 92.970792.9707 1.17371.1737 41.932141.9321 0.41720.4172
MI 22 93.452493.4524 0.48170.4817 2.42.4 41.777441.7774 0.15470.1547 2.72.7
33 93.616793.6167 0.16430.1643 2.92.9 41.726341.7263 0.05110.0511 3.03.0
30 44 93.684493.6844 0.06770.0677 2.42.4 41.719741.7197 0.00660.0066 7.77.7
(years) MC 93.669693.6696 41.775341.7753
95%95\% CI [93.5877,93.7515][93.5877,93.7515]
FD 93.6812 41.7168

7.2.2 Continuous rebalancing

While continuous rebalancing is not the primary focus of this paper, we include numerical results for the continuous rebalancing setting in order to further validate the method. For these experiments, we use T=10T=10 (years), with Δt=T/M\Delta t=T/M years. The details of the mesh size/timestep refinement levels used are given in Table 7.5. The model parameters, taken from [19], are given in Table 7.6.

In these experiments, we also consider the scenario where the dynamics of the risky asset follow a Geometric Brownian Motion (GBM) model; in this pure-diffusion case, jumps are eliminated by setting λ=0\lambda=0. Table 7.7 displays the numerical results for E𝒞0x0,t0[WT]E_{\mathcal{C}_{0}}^{x_{0},t_{0}}\left[W_{T}\right] (expected value) and Std𝒞0x,t0[WT]Std_{\mathcal{C}_{0}}^{x,t_{0}}\left[W_{T}\right] (standard deviation) obtained at various refinement levels for γ=80\gamma=80 and T=10T=10 years under the GBM model. For validation purposes, the expected values and standard deviations obtained using the FD method of [19] are also included in Table 7.7, alongside the MI results. The FD and MI results show excellent agreement.

Ref. level Timesteps MM ss-grid NN bb-grid JJ
0 10 128 25
1 20 256 50
2 40 512 100
3 80 1024 200
4 160 2048 400
Table 7.5: Grid and timestep refinement levels for convergence analysis; N=2NN^{\dagger}=2N and N=3NN^{\ddagger}=3N.
Parameters GBM Merton
μ\mu (drift) 0.15 0.0795487
σ\sigma (diffusion volatility) 0.15 0.1765
λ\lambda (jump intensity) n/a 0.0585046
μ~\widetilde{\mu}(log jump multiplier mean) n/a -0.788325
σ~\widetilde{\sigma}(log jump multiplier stdev) n/a 0.450500
rbr_{b} (borrowing interest rate) 0.04 0.0445
rιr_{\iota} (lending interest rate) 0.04 0.0445
Table 7.6: Calibrated risky and risk-free asset process parameters. Reproduced from Table 7.2 in [22].
Table 7.7: PCMV under the GBM model (no jumps) with parameters in Table 7.6; γ=80\gamma=80; liquidate risky asset when insolvency; qmax=q_{\max}=\infty; T=10T=10 years.
Method Ref. level E𝒞0x0,t0[WT]E_{\mathcal{C}_{0}}^{x_{0},t_{0}}\left[W_{T}\right] Change Ratio Std𝒞0x,t0[WT]Std_{\mathcal{C}_{0}}^{x,t_{0}}\left[W_{T}\right] Change Ratio
0 37.443837.4438 6.25826.2582
11 38.320138.3201 0.87630.8763 5.43935.4393 0.81890.8189
MI 22 38.408438.4084 0.08830.0883 9.99.9 5.12855.1285 0.31080.3108 2.62.6
33 38.468938.4689 0.06050.0605 1.51.5 5.03945.0394 0.08910.0891 3.53.5
44 38.482038.4820 0.01310.0131 4.64.6 4.99994.9999 0.03950.0395 2.32.3
FD 38.4789 5.0888
Table 7.8: PCMV under the Merton model with parameters in the Table 7.6; γ=200\gamma=200; liquidate risky asset when insolvency; qmax=2.0q_{\max}=2.0; T=10T=10 years.
Method Ref. level E𝒞0x0,t0[WT]E_{\mathcal{C}_{0}}^{x_{0},t_{0}}\left[W_{T}\right] Change Ratio Std𝒞0x,t0[WT]Std_{\mathcal{C}_{0}}^{x,t_{0}}\left[W_{T}\right] Change Ratio
0 24.153824.1538 21.510121.5101
11 24.452824.4528 0.29900.2990 22.073822.0738 0.56370.5637
MI 22 24.546824.5468 0.09400.0940 3.23.2 22.165922.1659 0.09210.0921 6.16.1
33 24.585524.5855 0.03870.0387 2.42.4 22.190722.1907 0.02480.0248 3.73.7
44 24.604224.6042 0.01870.0187 2.12.1 22.200522.2005 0.00980.0098 2.52.5
FD 24.6032 22.2024

8 Conclusion

In this study, we present a highly efficient, straightforward-to-implement, and monotone numerical integration method for MV portfolio optimization. The model considered in this paper addresses a practical context that includes a variety of investment constraints, as well as jump-diffusion dynamics that govern the price processes of risky assets. Our method employs an infinite series representation of the transition density, wherein all series terms are strictly positive and explicitly computable. This approach enables us to approximate the convolution integral for time-advancement over each rebalancing interval via a monotone integration scheme. The scheme uses a composite quadrature rule, simplifying the computation significantly. Furthermore, we introduce an efficient implementation of the proposed monotone integration scheme using FFTs, exploiting the structure of Toeplitz matrices. The pointwise convergence of this scheme, as the discretization parameter approaches zero, is rigorously established. Numerical experiments affirm the accuracy of our approach, aligning with benchmark results obtained through the FD method and Monte Carlo simulation, as demonstrated in [19]. Notably, our proposed method offers superior efficiency compared to existing FD methods, owing to its computational complexity being an order of magnitude lower.

Future work includes an investigation of self-exciting jumps for MV optimization, together with a convergence analysis, as Δt0\Delta t\to 0, to a continuously observed impulse control formulation for MV optimization taking the form of an HJB Quasi-Variational Inequality.

References

  • [1] M. Abramowitz and I. A. Stegun. Handbook of mathematical functions. Dover, New York, 1972.
  • [2] J. Alonso-García, O. Wood, and J. Ziveyi. Pricing and hedging guaranteed minimum withdrawal benefits under a general Lévy framework using the COS method. Quantitative Finance, 18:1049–1075, 2018.
  • [3] G. Barles and P.E. Souganidis. Convergence of approximation schemes for fully nonlinear equations. Asymptotic Analysis, 4:271–283, 1991.
  • [4] S. Basak and G. Chabakauri. Dynamic mean-variance asset allocation. Review of Financial Studies, 23:2970–3016, 2010.
  • [5] Edouard Berthe, Duy-Minh Dang, and Luis Ortiz-Gracia. A Shannon wavelet method for pricing foreign exchange options under the Heston multi-factor CIR model. Applied Numerical Mathematics, 136:1–22, 2019.
  • [6] T. Björk and A. Murgoci. A theory of Markovian time-inconsistent stochastic control in discrete time. Finance and Stochastics, (18):545–592, 2014.
  • [7] O. Bokanowski, A. Picarelli, and C. Reisinger. High-order filtered schemes for time-dependent second order HJB equations. ESAIM Mathematical Modelling and Numerical Analysis, 52:69–97, 2018.
  • [8] T. Bourgeron, E. Lezmi, and T. Roncalli. Robust asset allocation for robo-advisors. Working paper, 2018.
  • [9] Wlodzimierz Bryc, Amir Dembo, and Tiefeng Jiang. Spectral measure of large random Hankel, Markov and Toeplitz matrices. Annals of Probability, 34(1):1–38, 2006.
  • [10] Andrew Butler and Roy H Kwon. Data-driven integration of regularized mean-variance portfolios. arXiv preprint arXiv:2112.07016, 2021.
  • [11] Z. Chen, G. Li, and J. Guo. Optimal investment policy in the time consistent mean–variance formulation. Insurance: Mathematics and Economics, 52(2):145–156, March 2013.
  • [12] F. Cong and C.W. Oosterlee. On pre-commitment aspects of a time-consistent strategy for a mean-variance investor. Journal of Economic Dynamics and Control, 70:178–193, 2016.
  • [13] F Cong and CW Oosterlee. On robust multi-period pre-commitment and time-consistent mean-variance portfolio optimization. International Journal of Theoretical and Applied Finance, 20(07):1750049, 2017.
  • [14] R. Cont and C. Mancini. Nonparametric tests for pathwise properties of semi-martingales. Bernoulli, (17):781–813, 2011.
  • [15] R. Cont and P. Tankov. Financial modelling with jump processes. Chapman and Hall / CRC Press, 2004.
  • [16] M. G. Crandall, H. Ishii, and P. L. Lions. User’s guide to viscosity solutions of second order partial differential equations. Bulletin of the American Mathematical Society, 27:1–67, 1992.
  • [17] D. M. Dang, K. R. Jackson, and S. Sues. A dimension and variance reduction Monte Carlo method for pricing and hedging options under jump-diffusion models. Applied Mathematical Finance, 24:175–215, 2017.
  • [18] D. M. Dang and Luis Ortiz-Gracia. A dimension reduction Shannon-wavelet based method for option pricing. Journal of Scientific Computing, 2017. https://doi.org/10.1007/s10915-017-0556-y.
  • [19] D.M. Dang and P.A. Forsyth. Continuous time mean-variance optimal portfolio allocation under jump diffusion: A numerical impulse control approach. Numerical Methods for Partial Differential Equations, 30:664–698, 2014.
  • [20] D.M. Dang and P.A. Forsyth. Better than pre-commitment mean-variance portfolio allocation strategies: A semi-self-financing Hamilton–Jacobi–Bellman equation approach. European Journal of Operational Research, (250):827–841, 2016.
  • [21] D.M. Dang, P.A. Forsyth, and K.R. Vetzal. The 4 percent strategy revisited: a pre-commitment mean-variance optimal approach to wealth management. Quantitative Finance, 17(3):335–351, 2017.
  • [22] Duy-Minh Dang and Peter A Forsyth. Continuous time mean-variance optimal portfolio allocation under jump diffusion: An numerical impulse control approach. Numerical Methods for Partial Differential Equations, 30(2):664–698, 2014.
  • [23] Duy-Minh Dang, Peter A Forsyth, and Yuying Li. Convergence of the embedded mean-variance optimal points with discrete sampling. Numerische Mathematik, 132(2):271–302, 2016.
  • [24] E.J. Elton, M.J. Gruber, S.J. Brown, and W.N. Goetzmann. Modern portfolio theory and investment analysis. Wiley, 9th edition, 2014.
  • [25] F. Fang and C.W. Oosterlee. A novel pricing method for European options based on Fourier-Cosine series expansions. SIAM Journal on Scientific Computing, 31:826–848, 2008.
  • [26] P. A. Forsyth and G. Labahn. ϵ\epsilon-monotone Fourier methods for optimal stochastic control in finance. 2017. Working paper, School of Computer Science, University of Waterloo.
  • [27] P.A. Forsyth and K.R. Vetzal. Dynamic mean variance asset allocation: Tests for robustness. International Journal of Financial Engineering, 4:2, 2017. 1750021 (electronic).
  • [28] P.A. Forsyth and K.R. Vetzal. Optimal asset allocation for retirement saving: Deterministic vs. time consistent adaptive strategies. Applied Mathematical Finance, 26(1):1–37, 2019.
  • [29] P.A. Forsyth, K.R. Vetzal, and G. Westmacott. Management of portfolio depletion risk through optimal life cycle asset allocation. North American Actuarial Journal, 23(3):447–468, 2019.
  • [30] F. N. Fritsch and R. E. Carlson. Monotone piecewise cubic interpolation. SIAM Journal on Numerical Analysis, 17:238–246, 1980.
  • [31] X. Guo and G. Wu. Smooth fit principle for impulse control of multidimensional diffusion processes. SIAM Journal on Control and Optimization, 48(2):594–617, 2009.
  • [32] B. Hojgaard and E. Vigna. Mean-variance portfolio selection and efficient frontier for defined contribution pension schemes. Research Report Series, Department of Mathematical Sciences, Aalborg University, R-2007-13, 2007.
  • [33] Y.T. Huang and Y.K. Kwok. Regression-based Monte Carlo methods for stochastic control models: variable annuities with lifelong guarantees. Quantitative Finance, 16(6):905–928, 2016.
  • [34] Y.T. Huang, P. Zeng, and Y.K. Kwok. Optimal initiation of guaranteed lifelong withdrawal benefit with dynamic withdrawals. SIAM Journal on Financial Mathematics, 8:804–840, 2017.
  • [35] S. G. Kou. A jump diffusion model for option pricing. Management Science, 48:1086–1101, August 2002.
  • [36] S.G. Kou. A jump-diffusion model for option pricing. Management Science, 48(8):1086–1101, 2002.
  • [37] D. Li and W.-L. Ng. Optimal dynamic portfolio selection: multi period mean variance formulation. Mathematical Finance, 10:387–406, 2000.
  • [38] D. Li and W.L. Ng. Optimal Dynamic Portfolio Selection: Multiperiod Mean-Variance Formulation. Mathematical Finance, 10(3):387–406, 2000.
  • [39] Y. Li and P.A. Forsyth. A data-driven neural network approach to optimal asset allocation for target based defined contribution pension plans. Insurance: Mathematics and Economics, (86):189–204, 2019.
  • [40] X. Liang, L. Bai, and J. Guo. Optimal time-consistent portfolio and contribution selection for defined benefit pension schemes under mean-variance criterion. ANZIAM, (56):66–90, 2014.
  • [41] X. Lin and Y. Qian. Time-consistent mean-variance reinsurance-investment strategy for insurers under cev model. Scandinavian Actuarial Journal, (7):646–671, 2016.
  • [42] Y. Lu and D.M. Dang. A pointwise convergent numerical integration method for Guaranteed Lifelong Withdrawal Benefits under stochastic volatility.
  • [43] Y. Lu and D.M. Dang. A semi-Lagrangian ϵ\epsilon-monotone Fourier method for continuous withdrawal GMWBs under jump-diffusion with stochastic interest rate.
  • [44] Y. Lu, D.M. Dang, P.A. Forsyth, and G. Labahn. An ϵ\epsilon-monotone Fourier method for Guaranteed Minimum Withdrawal Benefit (GMWB) as a continuous impulse control problem.
  • [45] X. Luo and P.V. Shevchenko. Valuation of variable annuities with guaranteed minimum withdrawal and death benefits via stochastic control optimization. Insurance: Mathematics and Economics, 62(3):5–15, 2015.
  • [46] K. Ma and P. A. Forsyth. Numerical solution of the Hamilton-Jacobi-Bellman formulation for continuous time mean variance asset allocation under stochastic volatility. Journal of Computational Finance, 20(01):1–37, 2016.
  • [47] H. Markowitz. Portfolio selection. The Journal of Finance, 7(1):77–91, March 1952.
  • [48] F. Menoncin and E. Vigna. Mean-variance target-based optimisation in DC plan with stochastic interest rate. Working paper, Collegio Carlo Alberto, (337), 2013.
  • [49] R.C. Merton. Option pricing when underlying stock returns are discontinuous. Journal of Financial Economics, 3:125–144, 1976.
  • [50] R.O. Michaud and R.O. Michaud. Efficient asset management: A practical guide to stock portfolio optimization and asset allocation. Oxford University Press, 2 edition, 2008.
  • [51] Chendi Ni, Yuying Li, Peter Forsyth, and Ray Carroll. Optimal asset allocation for outperforming a stochastic benchmark target. Quantitative Finance, 22(9):1595–1626, 2022.
  • [52] C.I. Nkeki. Stochastic funding of a defined contribution pension plan with proportional administrative costs and taxation under mean-variance optimization approach. Statistics, optimization and information computing, (2):323–338, 2014.
  • [53] A.M. Oberman. Convergent difference schemes for degenerate elliptic and parabolic equations: Hamilton–Jacobi Equations and free boundary problems. SIAM Journal Numerical Analysis, 44(2):879–895, 2006.
  • [54] S. Perrin and T. Roncalli. Machine Learning Optimization Algorithms and Portfolio Allocation, chapter 8, pages 261–328. Wiley Online Library, 2020.
  • [55] H. Pham. On some recent aspects of stochastic control and their applications. Probability Surveys, 2:506–549, 2005.
  • [56] D.M. Pooley, P.A. Forsyth, and K.R. Vetzal. Numerical convergence properties of option pricing PDEs with uncertain volatility. IMA Journal of Numerical Analysis, 23:241–267, 2003.
  • [57] M. Puterman. Markov Decison Processes: Discrete Stochastic Dynamic Programming. Wiley, New York, 1994.
  • [58] C. Ramezani and Y. Zeng. Maximum likelihood estimation of the double exponential jump-diffusion process. Annals of Finance, 3(4):487–507, 2007.
  • [59] C. Reisinger and P.A. Forsyth. Piecewise constant Policy approximations to Hamilton-Jacobi-Bellman equations. Applied Numerical Mathematics, 103:27–47, 2016.
  • [60] M.J. Ruijter, C.W. Oosterlee, and R.F.T. Aalbers. On the Fourier cosine series expansion (COS) method for stochastic control problems. Numerical Linear Algebra with Applications, 20:598–625, 2013.
  • [61] Y. Sato. Model-free reinforcement learning for financial portfolios: A brief survey. Working paper, 2019.
  • [62] P.V. Shevchenko and X. Luo. A unified pricing of variable annuity guarantees under the optimal stochastic control framework. Risks, 4(3):1–31, 2016.
  • [63] M. Strub, D. Li, and X. Cui. An enhanced mean-variance framework for robo-advising applications. SSRN 3302111, 2019.
  • [64] J. Sun, Z. Li, and Y. Zeng. Precommitment and equilibrium investment strategies for defined contribution pension plans under a jump–diffusion model. Insurance: Mathematics and Economics, (67):158–172, 2016.
  • [65] P. M. Van Staden, D.M. Dang, and P.A. Forsyth. Time-consistent mean-variance portfolio optimization: a numerical impulse control approach. Insurance: Mathematics and Economics, 83(C):9–28, 2018.
  • [66] P. M. Van Staden, D.M. Dang, and P.A. Forsyth. Mean-quadratic variation portfolio optimization: A desirable alternative to time-consistent mean-variance optimization? SIAM Journal on Financial Mathematics, 10(3):815–856, 2019.
  • [67] P. M. Van Staden, D.M. Dang, and P.A. Forsyth. On the distribution of terminal wealth under dynamic mean-variance optimal investment strategies. SIAM Journal on Financial Mathematics, 12(2):566–603, 2021.
  • [68] P. M. Van Staden, D.M. Dang, and P.A. Forsyth. The surprising robustness of dynamic mean-variance portfolio optimization to model misspecification errors. European Journal of Operational Research, 289:774–792, 2021.
  • [69] Pieter M Van Staden, Duy-Minh Dang, and Peter A Forsyth. Practical investment consequences of the scalarization parameter formulation in dynamic mean–variance portfolio optimization. International Journal of Theoretical and Applied Finance, 24(05):2150029, 2021.
  • [70] E. Vigna. On efficiency of mean-variance based portfolio selection in defined contribution pension schemes. Quantitative Finance, 14(2):237–258, 2014.
  • [71] E. Vigna. On time consistency for mean-variance portfolio selection. International Journal of Theoretical and Applied Finance, 23(6), 2020.
  • [72] Elena Vigna. Tail optimality and preferences consistency for intertemporal optimization problems. SIAM Journal on Financial Mathematics, 13(1):295–320, 2022.
  • [73] J. Wang and P.A. Forsyth. Maximal use of central differencing for Hamilton-Jacobi-Bellman PDEs in finance. SIAM Journal on Numerical Analysis, 46:1580–1601, 2008.
  • [74] J. Wang and P.A. Forsyth. Continuous time mean variance asset allocation: A time-consistent strategy. European Journal of Operational Research, 209(2):184–201, 2011.
  • [75] L. Wang and Z. Chen. Nash equilibrium strategy for a DC pension plan with state-dependent risk aversion: A multiperiod mean-variance framework. Discrete Dynamics in Nature and Society, (1-17), 2018.
  • [76] L. Wang and Z. Chen. Stochastic game theoretic formulation for a multi-period DC pension plan with state-dependent risk aversion. Mathematics, 7(108):1–16, 2019.
  • [77] Xavier Warin. Some non-monotone schemes for time dependent Hamilton-Jacobi-Bellman equations in stochastic control. Journal of Scientific Computing, 66(3):1122–1147, 2016.
  • [78] J. Wei and T. Wang. Time-consistent mean-variance asset-liability management with random coefficients. Insurance: Mathematics and Economics, (77):84–96, 2017.
  • [79] H. Wu and Y. Zeng. Equilibrium investment strategy for defined-contribution pension schemes with generalized mean-variance criterion and mortality risk. Insurance: Mathematics and Economics, 64:396–408, 2015.
  • [80] P Le Yu. Cone convexity, cone extreme points, and nondominated solutions in decision problems with multiobjectives. Journal of Optimization Theory and Applications, 14(3):319–377, 1974.
  • [81] Y. Zeng and Z. Li. Optimal time-consistent investment and reinsurance policies for mean-variance insurers. Insurance: Mathematics and Economics, 49(1):145–154, July 2011.
  • [82] H. Zhao, Y. Shen, and Y. Zeng. Time-consistent investment-reinsurance strategy for mean-variance insurers with a defaultable security. Journal of Mathematical Analysis and Applications, 437(2):1036–1057, May 2016.
  • [83] X. Zhou and D. Li. Continuous time mean variance portfolio selection: a stochastic LQ framework. Applied Mathematics and Optimization, 42:19–33, 2000.
  • [84] X.Y. Zhou and D. Li. Continuous-time mean-variance portfolio selection: A stochastic LQ framework. Applied Mathematics and Optimization, 42(1):19–33, 2000.
  • [85] Z. Zhou, H. Xiao, J. Yin, X. Zeng, and L. Lin. Pre-commitment vs. time-consistent strategies for the generalized multi-period portfolio optimization with stochastic cash flows. Insurance: Mathematics and Economics, (68):187–202, 2016.

Appendices

Appendix A Proof of Lemma 3.1

Recalling the inverse Fourier transform 𝔉1[]\mathfrak{F}^{-1}[\cdot] in (3.7) and the closed-form expression for G(η;Δt)G(\eta;\Delta t) in (3.8), we have

g(s;Δt)=12πeαη2+(β+s)(iη)+θeλΓ(η)Δtdη,\displaystyle{\color[rgb]{0,0,0}{g(s;\Delta t)~{}=~{}}}\frac{1}{2\pi}\,\int_{-\infty}^{\infty}e^{-\alpha\eta^{2}+\left(\beta+s\right)\left(i\eta\right)+\theta}\,e^{\lambda\Gamma\left(\eta\right)\Delta t}~{}\mathrm{d}\eta, (A.1)

where α\alpha, β\beta and θ\theta are given in (3.9), and Γ(η)=p(y)eiηy𝑑y\Gamma\left(\eta\right)=\int_{-\infty}^{\infty}p(y)~{}e^{i\eta y}~{}dy. Following the approach developed in [17, 18, 5] , we expand the term eλΓ(η)Δte^{\lambda\Gamma(\eta)\Delta t} in (A.1) in a Taylor series, noting that

(Γ(η))k\displaystyle\left(\Gamma(\eta)\right)^{k} =\displaystyle= (p(y)exp(iηy)dy)k==1kp(y)exp(iηYk)dy1dy2dyk\displaystyle\left(\int_{-\infty}^{\infty}p(y)\exp(i\eta y)\mathrm{d}y\right)^{k}=\int_{-\infty}^{\infty}\ldots\int_{-\infty}^{\infty}\prod_{\ell=1}^{k}p(y_{\ell})\exp\left(i\eta Y_{k}\right)\mathrm{d}y_{1}\mathrm{d}y_{2}\ldots\mathrm{d}y_{k} (A.2)

where p(y)p(y) is the probability density of ξ\xi, and Yk==1kyY_{k}=\sum_{\ell=1}^{k}y_{\ell}, with Y0=0Y_{0}=0, and for k=0k=0, (Γ(η))0=1\left(\Gamma(\eta)\right)^{0}=1. Then, we have

g(s;Δt)\displaystyle g(s;\Delta t) =\displaystyle= 12πk=0(λΔt)kk!eαη2+(β+s)(iη)+θ(Γ(η))kdη,\displaystyle\frac{1}{2\pi}\sum_{k=0}^{\infty}\frac{\left(\lambda\Delta t\right)^{k}}{k!}\int_{-\infty}^{\infty}e^{-\alpha\eta^{2}+\left(\beta+s\right)\left(i\eta\right)+\theta}\,\left(\Gamma\left(\eta\right)\right)^{k}~{}\mathrm{d}\eta, (A.3)
=(i)\displaystyle{\buildrel(\text{i})\over{=}} 14παk=0(λΔt)kk!exp(θ(β+s+Yk)24α)(=1kp(y))dy1dyk,\displaystyle\frac{1}{\sqrt{4\pi\alpha}}\sum_{k=0}^{\infty}\frac{\left(\lambda\Delta t\right)^{k}}{k!}\int_{-\infty}^{\infty}\ldots\int_{-\infty}^{\infty}\exp\left(\theta-\frac{\left(\beta+s+Y_{k}\right)^{2}}{4\alpha}\right)\left(\prod_{\ell=1}^{k}p(y_{\ell})\right)\mathrm{d}y_{1}\ldots\mathrm{d}y_{k}, (A.4)

where the first term of the series corresponds to k=0k=0 and is equal to 14παexp(θ(β+s)24α)\frac{1}{\sqrt{4\pi\alpha}}\exp\left(\theta-\frac{\left(\beta+s\right)^{2}}{4\alpha}\right). Here, in (i), we use Fubini’s theorem and the well-known result eaϕ2bϕdϕ=πaeb2/4a\int_{-\infty}^{\infty}e^{-a\phi^{2}-b\phi}\mathrm{d}\phi=\sqrt{\frac{\pi}{a}}e^{b^{2}/4a}.
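For the Merton model, where ξNormal(μ~,σ~2)\xi\sim\text{Normal}(\widetilde{\mu},\widetilde{\sigma}^{2}) (Table 7.2), the multiple integral in each term of (A.4) can be evaluated in closed form: conditional on kk jumps, YkNormal(kμ~,kσ~2)Y_{k}\sim\text{Normal}(k\widetilde{\mu},k\widetilde{\sigma}^{2}), and the Gaussian-kernel expectation reduces, by Gaussian convolution, to a normal density in ss with mean (β+kμ~)-(\beta+k\widetilde{\mu}) and variance 2α+kσ~22\alpha+k\widetilde{\sigma}^{2}. The sketch below evaluates the resulting truncated series numerically and checks that it integrates to approximately one; it is an illustration consistent with (A.4), not necessarily the exact formula used in the implementation.

```python
import math

def g_merton(s, dt, mu, sigma, lam, mu_j, sig_j, K):
    """Truncated series (A.4) for the Merton model: each term is a Poisson-type
    weight e^theta (lam*dt)^k / k! times a normal density in s with mean
    -(beta + k*mu_j) and variance 2*alpha + k*sig_j^2."""
    alpha = 0.5 * sigma**2 * dt
    kappa = math.exp(mu_j + 0.5 * sig_j**2) - 1.0
    beta = (mu - lam * kappa - 0.5 * sigma**2) * dt
    theta = -lam * dt
    total = 0.0
    for k in range(K + 1):
        weight = math.exp(theta) * (lam * dt)**k / math.factorial(k)
        var = 2.0 * alpha + k * sig_j**2
        mean = -(beta + k * mu_j)
        total += weight * math.exp(-(s - mean)**2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)
    return total

# Sanity check: the truncated density integrates (numerically) to ~ 1 over a wide s-range.
pars = dict(dt=1.0, mu=0.0817, sigma=0.1453, lam=0.3483, mu_j=-0.07, sig_j=0.1924, K=15)
ds, grid = 1e-3, [i * 1e-3 for i in range(-20000, 20001)]
print(ds * sum(g_merton(s, **pars) for s in grid))   # ~ 1.0
```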

Appendix B 𝛀𝒔𝐦𝐚𝐱\boldsymbol{\Omega_{s_{\max}}}: boundary expressions

Recalling the sub-domain definitions in (3.3), we observe that Ωsmax\Omega_{s_{\max}} is the boundary where ss\rightarrow\infty. For fixed bb, noting the terminal condition (3.16), we assume that v(s,b,t)A1(t)e2sv(s\rightarrow\infty,b,t)\approx A_{1}(t)e^{2s} for some unknown function A1(t)A_{1}(t). Using the infinite series representation of g(ss;Δt)g(s-s^{\prime};\Delta t) given in Lemma 3.1 (proof in Appendix A) and the integral (3.5d), we have v(s,b,tm1+)=A1(tm)e2sg(ss;Δt)dsv(s,b,t_{m-1}^{+})=\int_{-\infty}^{\infty}A_{1}(t_{m}^{-})e^{2s^{\prime}}g(s-s^{\prime};\Delta t)~{}\mathrm{d}s^{\prime}~{}\ldots

\displaystyle\ldots =\displaystyle~{}=~{} A1(tm)4παk=1(λΔt)kk!exp(θ(β+ss+Yk)24α+2s)=1kp(y)dy1dykds\displaystyle\int_{-\infty}^{\infty}\frac{A_{1}(t_{m}^{-})}{\sqrt{4\pi\alpha}}\sum_{k=1}^{\infty}\frac{\left(\lambda\Delta t\right)^{k}}{k!}\int_{-\infty}^{\infty}\ldots\int_{-\infty}^{\infty}\exp\left(\theta-\frac{\left(\beta+s-s^{\prime}+Y_{k}\right)^{2}}{4\alpha}+2s^{\prime}\right)\prod_{\ell=1}^{k}p(y_{\ell})\mathrm{d}y_{1}\ldots\mathrm{d}y_{k}~{}\mathrm{d}s^{\prime}
+A1(tm)4παexp(θ(β+ss)24α+2s)ds,\displaystyle\qquad\qquad\qquad\qquad\qquad\qquad+\int_{-\infty}^{\infty}\frac{A_{1}(t_{m}^{-})}{\sqrt{4\pi\alpha}}\exp\left(\theta-\frac{\left(\beta+s-s^{\prime}\right)^{2}}{4\alpha}+2s^{\prime}\right)\mathrm{d}s^{\prime},
=\displaystyle~{}=~{} A1(tm)exp(θ+4α+2(β+s))k=1(λΔt)kk!exp(2Yk)=1kp(y)dy1dyk\displaystyle A_{1}(t_{m}^{-})\exp\left(\theta+4\alpha+2\left(\beta+s\right)\right)\sum_{k=1}^{\infty}\frac{\left(\lambda\Delta t\right)^{k}}{k!}\int_{-\infty}^{\infty}\ldots\int_{-\infty}^{\infty}\exp\left(2Y_{k}\right)\prod_{\ell=1}^{k}p(y_{\ell})\mathrm{d}y_{1}\ldots\mathrm{d}y_{k}
+A1(tm)exp(θ+4α+2(β+s)),\displaystyle\qquad\qquad\qquad\qquad\qquad\qquad+A_{1}(t_{m}^{-})\exp\left(\theta+4\alpha+2\left(\beta+s\right)\right),
=\displaystyle~{}=~{} A1(tm)exp{(2μ+σ22λκλ)Δt+2s}k=0(λΔt)kk!(e2yp(y)dy)k,\displaystyle A_{1}(t_{m}^{-})\exp\left\{\left(2\mu+\sigma^{2}-2\lambda\kappa-\lambda\right)\Delta t+2s\right\}\,\sum_{k=0}^{\infty}\frac{\left(\lambda\Delta t\right)^{k}}{k!}\left(\int_{-\infty}^{\infty}e^{2y}p(y)\mathrm{d}y\right)^{k},
=\displaystyle~{}=~{} A1(tm)e2sexp{(2μ+σ22λκλ)Δt}exp{λ(κ2+2κ+1)Δt},\displaystyle A_{1}(t_{m}^{-})e^{2s}\,\exp\left\{\left(2\mu+\sigma^{2}-2\lambda\kappa-\lambda\right)\Delta t\right\}\,\exp\left\{\lambda\left(\kappa_{2}+2\kappa+1\right)\Delta t\right\},
=\displaystyle~{}=~{} v(s,b,tm)e(σ2+2μ+λκ2)Δt,\displaystyle v(s,b,t_{m}^{-})\,e^{\left(\sigma^{2}+2\mu+\lambda\kappa_{2}\right)\Delta t},

where we use α=σ22Δt\alpha=\frac{\sigma^{2}}{2}\,\Delta t, β=(μλκσ22)Δt\beta=\left(\mu-\lambda\kappa-\frac{\sigma^{2}}{2}\right)\,\Delta t and θ=λΔt\theta=-\lambda\Delta t.
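Before turning to the analogous derivation for the auxiliary function, we note that the identity above can be sanity-checked by Monte Carlo for the Merton model, interpreting the convolution as the conditional expectation of e2Xe^{2X} over a rebalancing interval, where XX is the log increment with total expected return μ\mu, compensator κ=E[eξ]1\kappa=E[e^{\xi}]-1 and κ2=E[e2ξ]2κ1\kappa_{2}=E[e^{2\xi}]-2\kappa-1; a minimal, illustrative sketch:

```python
import numpy as np

def check_second_moment_factor(mu, sigma, lam, mu_j, sig_j, dt, n_paths=10**6, seed=0):
    """Monte Carlo check that E[e^{2X}] = e^{(sigma^2 + 2 mu + lam*kappa_2) dt},
    where X is the log increment over dt under the Merton jump-diffusion,
    kappa = E[e^xi] - 1 and kappa_2 = E[e^{2 xi}] - 2 kappa - 1 (illustrative)."""
    rng = np.random.default_rng(seed)
    kappa = np.exp(mu_j + 0.5 * sig_j**2) - 1.0
    kappa2 = np.exp(2.0 * mu_j + 2.0 * sig_j**2) - 2.0 * kappa - 1.0
    n = rng.poisson(lam * dt, n_paths)
    jumps = mu_j * n + sig_j * np.sqrt(n) * rng.standard_normal(n_paths)
    x = ((mu - lam * kappa - 0.5 * sigma**2) * dt
         + sigma * np.sqrt(dt) * rng.standard_normal(n_paths) + jumps)
    mc_value = np.exp(2.0 * x).mean()
    exact_value = np.exp((sigma**2 + 2.0 * mu + lam * kappa2) * dt)
    return mc_value, exact_value

# With the Merton parameters of Table 7.2, the two returned values agree to
# within Monte Carlo sampling error.
```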

Similarly, for each fixed B(t)=bB(t)=b, we assume the auxiliary linear value function u(s,b,t)A2(t)esu(s\rightarrow\infty,b,t)\approx A_{2}(t)e^{s}, for some unknown function A2(t)A_{2}(t). We have u(s,b,tm1+)=A2(tm)esg(ss;Δt)ds{\color[rgb]{0,0,0}{u(s,b,t_{m-1}^{+})}}=\int_{-\infty}^{\infty}A_{2}(t_{m}^{-})e^{s^{\prime}}g(s-s^{\prime};\Delta t)~{}\mathrm{d}s^{\prime}~{}\ldots

\displaystyle\ldots =\displaystyle~{}=~{} A2(tm)4παk=1(λΔt)kk!exp(θ(β+ss+Yk)24α+s)=1kp(y)dy1dykds\displaystyle\int_{-\infty}^{\infty}\frac{A_{2}(t_{m}^{-})}{\sqrt{4\pi\alpha}}\sum_{k=1}^{\infty}\frac{\left(\lambda\Delta t\right)^{k}}{k!}\int_{-\infty}^{\infty}\ldots\int_{-\infty}^{\infty}\exp\left(\theta-\frac{\left(\beta+s-s^{\prime}+Y_{k}\right)^{2}}{4\alpha}+s^{\prime}\right)\prod_{\ell=1}^{k}p(y_{\ell})\mathrm{d}y_{1}\ldots\mathrm{d}y_{k}~{}\mathrm{d}s^{\prime}
+A2(tm)4παexp(θ(β+ss)24α+s)ds,\displaystyle\qquad\qquad\qquad\qquad\qquad\qquad+\int_{-\infty}^{\infty}\frac{A_{2}(t_{m}^{-})}{\sqrt{4\pi\alpha}}\exp\left(\theta-\frac{\left(\beta+s-s^{\prime}\right)^{2}}{4\alpha}+s^{\prime}\right)\mathrm{d}s^{\prime},
=\displaystyle~{}=~{} A2(tm)exp(θ+α+β+s)k=1(λΔt)kk!exp(Yk)=1kp(y)dy1dyk\displaystyle A_{2}(t_{m}^{-})\exp\left(\theta+\alpha+\beta+s\right)\sum_{k=1}^{\infty}\frac{\left(\lambda\Delta t\right)^{k}}{k!}\int_{-\infty}^{\infty}\ldots\int_{-\infty}^{\infty}\exp\left(Y_{k}\right)\prod_{\ell=1}^{k}p(y_{\ell})\mathrm{d}y_{1}\ldots\mathrm{d}y_{k}
+A2(tm)exp(θ+α+β+s),\displaystyle\qquad\qquad\qquad\qquad\qquad\qquad+A_{2}(t_{m}^{-})\exp\left(\theta+\alpha+\beta+s\right),
=\displaystyle~{}=~{} A2(tm)exp{(μλκλ)Δt+s}k=0(λΔt)kk!(eyp(y)dy)k,\displaystyle A_{2}(t_{m}^{-})\exp\left\{\left(\mu-\lambda\kappa-\lambda\right)\Delta t+s\right\}\,\sum_{k=0}^{\infty}\frac{\left(\lambda\Delta t\right)^{k}}{k!}\left(\int_{-\infty}^{\infty}e^{y}p(y)\mathrm{d}y\right)^{k},
=\displaystyle~{}=~{} A2(tm)esexp{(μλκλ)Δt}exp{λκΔt+λΔt},\displaystyle A_{2}(t_{m}^{-})e^{s}\,\exp\left\{\left(\mu-\lambda\kappa-\lambda\right)\Delta t\right\}\,\exp\left\{\lambda\kappa\Delta t+\lambda\Delta t\right\},
=\displaystyle~{}=~{} u(s,b,tm)eμΔt.\displaystyle{\color[rgb]{0,0,0}{u(s,b,t_{m}^{-})}}\,e^{\mu\,\Delta t}.

Appendix C Proof of 𝒈(𝒔;𝚫𝒕)\boldsymbol{g\left(s;\Delta t\right)} for 𝝃Asym-Double-Exponential(𝒒𝟏,𝜼𝟏,𝜼𝟐)\boldsymbol{\xi\sim\text{Asym-Double-Exponential}(q_{1},\eta_{1},\eta_{2})}

In this case, according to Lemma 3.1, we have g(s;Δt)=g(s;\Delta t)=\ldots

\displaystyle\ldots =exp(θ(β+s)24α)4πα+eθ4παk=1(λΔt)kk!exp((β+s+Yk)24α)=1kp(y)dy1dykEk\displaystyle=\frac{\exp\left(\theta-\frac{\left(\beta+s\right)^{2}}{4\alpha}\right)}{\sqrt{4\pi\alpha}}+\frac{e^{\theta}}{\sqrt{4\pi\alpha}}\sum_{k=1}^{\infty}\frac{\left(\lambda\Delta t\right)^{k}}{k!}\underbrace{\int_{-\infty}^{\infty}\ldots\int_{-\infty}^{\infty}\exp\left(-\frac{\left(\beta+s+Y_{k}\right)^{2}}{4\alpha}\right)\prod_{\ell=1}^{k}p(y_{\ell})\mathrm{d}y_{1}\ldots\mathrm{d}y_{k}}_{E_{k}}
=exp(θ(β+s)24α)4πα+eθ4παk=1(λΔt)kk!Ek.\displaystyle=\frac{\exp\left(\theta-\frac{\left(\beta+s\right)^{2}}{4\alpha}\right)}{\sqrt{4\pi\alpha}}+\frac{e^{\theta}}{\sqrt{4\pi\alpha}}\sum_{k=1}^{\infty}\frac{\left(\lambda\Delta t\right)^{k}}{k!}~{}E_{k}. (C.1)

Here, the term EkE_{k} in (C.1) is clearly non-negative and can be computed as

Ek=exp((β+s+y)24α)pξ^k(y)dy,\displaystyle\displaystyle E_{k}=\int_{-\infty}^{\infty}\exp\left(-\frac{\left(\beta+s+y\right)^{2}}{4\alpha}\right)\,p_{\hat{\xi}_{k}}(y)\mathrm{d}y, (C.2)

where $p_{\hat{\xi}_{k}}(y)$ is the PDF of the random variable $\hat{\xi}_{k}=\sum_{\ell=1}^{k}\xi_{\ell}$, for fixed $k$. To find $p_{\hat{\xi}_{k}}(y)$, the key step is the decomposition of $\hat{\xi}_{k}=\sum_{\ell=1}^{k}\xi_{\ell}$ into sums of i.i.d.\ exponential random variables [35]. More specifically, we have
\begin{align}
\hat{\xi}_{k}=\sum_{\ell=1}^{k}\xi_{\ell}
~\overset{\text{dist.}}{=}~
\begin{cases}
\hat{\xi}_{\ell}^{+}=~~\,\sum_{i=1}^{\ell}\varepsilon_{i}^{+}&\mbox{with probability }Q_{1}^{k,\ell},\quad\ell=1,\ldots,k,\\[2pt]
\hat{\xi}_{\ell}^{-}=-\sum_{i=1}^{\ell}\varepsilon_{i}^{-}&\mbox{with probability }Q_{2}^{k,\ell},\quad\ell=1,\ldots,k.
\end{cases}
\tag{C.3}
\end{align}

Here, $Q_{1}^{k,\ell}$ and $Q_{2}^{k,\ell}$ are given in (3.1), and $\varepsilon_{i}^{+}$ and $\varepsilon_{i}^{-}$ are i.i.d.\ exponential random variables with rates $\eta_{1}$ and $\eta_{2}$, respectively. The PDFs for the two cases in (C.3) are, respectively,
\begin{align}
p_{\hat{\xi}_{\ell}^{+}}(y)=\frac{\mathrm{e}^{-\eta_{1}y}\,y^{\ell-1}\,\eta_{1}^{\ell}}{(\ell-1)!}\quad(y>0),
\qquad\text{and}\qquad
p_{\hat{\xi}_{\ell}^{-}}(y)=\frac{\mathrm{e}^{\eta_{2}y}\,(-y)^{\ell-1}\,\eta_{2}^{\ell}}{(\ell-1)!}\quad(y<0).
\tag{C.4}
\end{align}
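The densities in (C.4) are simply the Erlang (integer-shape Gamma) density of a sum of $\ell$ i.i.d.\ exponentials and its reflection. As a quick check, the sketch below compares $p_{\hat{\xi}_{\ell}^{+}}$ with the Gamma density available in SciPy; the parameter values are illustrative.

```python
import numpy as np
from math import factorial
from scipy.stats import gamma

eta1, ell = 5.0, 3                      # illustrative rate and number of summed exponentials
y = np.linspace(0.01, 3.0, 5)

# p_{xi_hat_ell^+}(y) from (C.4): Erlang(ell, eta1) density on y > 0.
p_formula = np.exp(-eta1 * y) * y**(ell - 1) * eta1**ell / factorial(ell - 1)
p_scipy = gamma.pdf(y, a=ell, scale=1.0 / eta1)

print(np.max(np.abs(p_formula - p_scipy)))   # should be at round-off level
```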

Taking into account (C.3)-(C.4), (C.2) becomes
\begin{align}
E_{k}
&=\sum_{\ell=1}^{k}Q_{1}^{k,\ell}\underbrace{\int_{0}^{\infty}\exp\left(-\frac{\left(\beta+s+y\right)^{2}}{4\alpha}\right)p_{\hat{\xi}_{\ell}^{+}}(y)\,\mathrm{d}y}_{E_{1,\ell}}
+\sum_{\ell=1}^{k}Q_{2}^{k,\ell}\underbrace{\int_{-\infty}^{0}\exp\left(-\frac{\left(\beta+s+y\right)^{2}}{4\alpha}\right)p_{\hat{\xi}_{\ell}^{-}}(y)\,\mathrm{d}y}_{E_{2,\ell}}.
\tag{C.5}
\end{align}
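Before deriving closed-form expressions for $E_{1,\ell}$ and $E_{2,\ell}$, it can be useful to have an independent check of (C.2). The sketch below estimates $E_{k}$ by brute-force Monte Carlo, sampling $\hat{\xi}_{k}$ directly as a sum of $k$ asymmetric double-exponential jumps; the result can later be compared with the closed form obtained from (C.5)-(C.7). All parameter values are illustrative.

```python
import numpy as np

rng = np.random.default_rng(1)

# Illustrative values for the quantities appearing in (C.2).
alpha, beta, s = 0.02, -0.01, 0.1
q1, eta1, eta2, k = 0.4, 5.0, 4.0, 3

# Sample xi_hat_k = xi_1 + ... + xi_k with asymmetric double-exponential xi_i.
n = 1_000_000
signs = np.where(rng.random((n, k)) < q1, 1.0, -1.0)
rates = np.where(signs > 0, eta1, eta2)
xi_hat = np.sum(signs * rng.exponential(1.0 / rates), axis=1)

E_k_mc = np.mean(np.exp(-(beta + s + xi_hat) ** 2 / (4.0 * alpha)))
print("Monte Carlo estimate of E_k:", E_k_mc)
```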

Considering the term $E_{1,\ell}$,
\begin{align*}
E_{1,\ell}=\int_{0}^{\infty}\exp\left(-\frac{\left(\beta+s+y\right)^{2}}{4\alpha}\right)\frac{\mathrm{e}^{-\eta_{1}y}\,y^{\ell-1}\,\eta_{1}^{\ell}}{(\ell-1)!}\,\mathrm{d}y
=\eta_{1}^{\ell}\int_{0}^{\infty}\frac{1}{(\ell-1)!}\,\exp\left(-\frac{\left(\beta+s+y\right)^{2}}{4\alpha}-\eta_{1}y\right)y^{\ell-1}\,\mathrm{d}y.
\end{align*}

Making the change of variable $y_{1}=\frac{\beta+s+y}{\sqrt{2\alpha}}$,
\begin{align*}
E_{1,\ell}
&=\eta_{1}^{\ell}\int_{\frac{\beta+s}{\sqrt{2\alpha}}}^{\infty}\frac{1}{(\ell-1)!}\,\mathrm{e}^{-\frac{1}{2}y_{1}^{2}-\eta_{1}\sqrt{2\alpha}\,y_{1}}\,\mathrm{e}^{\eta_{1}\left(\beta+s\right)}\left(\sqrt{2\alpha}\,y_{1}-\beta-s\right)^{\ell-1}\sqrt{2\alpha}\,\mathrm{d}y_{1}
\\
&=\left(\eta_{1}\sqrt{2\alpha}\right)^{\ell}\mathrm{e}^{\eta_{1}\left(\beta+s\right)}\int_{\frac{\beta+s}{\sqrt{2\alpha}}}^{\infty}\frac{1}{(\ell-1)!}\,\mathrm{e}^{-\frac{1}{2}y_{1}^{2}-\eta_{1}\sqrt{2\alpha}\,y_{1}}\left(y_{1}-\frac{\beta+s}{\sqrt{2\alpha}}\right)^{\ell-1}\mathrm{d}y_{1}.
\end{align*}
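The next substitution relies on completing the square in the exponent; recording this step explicitly shows where the factor $\mathrm{e}^{\eta_{1}^{2}\alpha}$ in (C.6) originates:
\begin{align*}
-\frac{1}{2}y_{1}^{2}-\eta_{1}\sqrt{2\alpha}\,y_{1}
=-\frac{1}{2}\left(y_{1}+\eta_{1}\sqrt{2\alpha}\right)^{2}+\eta_{1}^{2}\alpha .
\end{align*}
The analogous identity with $\eta_{2}$ in place of $\eta_{1}$ (and the opposite sign of the linear term) produces the factor $\mathrm{e}^{\eta_{2}^{2}\alpha}$ in (C.7).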

Making the change of variable $y_{2}=y_{1}+\eta_{1}\sqrt{2\alpha}$,
\begin{align}
E_{1,\ell}
&=\left(\eta_{1}\sqrt{2\alpha}\right)^{\ell}\mathrm{e}^{\eta_{1}\left(\beta+s\right)+\eta_{1}^{2}\alpha}\int_{\frac{\beta+s}{\sqrt{2\alpha}}+\eta_{1}\sqrt{2\alpha}}^{\infty}\frac{1}{(\ell-1)!}\left(y_{2}-\left(\eta_{1}\sqrt{2\alpha}+\frac{\beta+s}{\sqrt{2\alpha}}\right)\right)^{\ell-1}\mathrm{e}^{-\frac{1}{2}y_{2}^{2}}\,\mathrm{d}y_{2}
\tag{C.6}\\
&=\left(\eta_{1}\sqrt{2\alpha}\right)^{\ell}\mathrm{e}^{\eta_{1}\left(\beta+s\right)+\eta_{1}^{2}\alpha}\,\mathrm{Hh}_{\ell-1}\left(\eta_{1}\sqrt{2\alpha}+\frac{\beta+s}{\sqrt{2\alpha}}\right),
\nonumber
\end{align}

where $\mathrm{Hh}_{\ell}$ is defined in (3.13). Similarly, for the term $E_{2,\ell}$,

\begin{align}
E_{2,\ell}
&=\int_{-\infty}^{0}\exp\left(-\frac{\left(\beta+s+y\right)^{2}}{4\alpha}\right)\frac{\mathrm{e}^{\eta_{2}y}\left(-y\right)^{\ell-1}\eta_{2}^{\ell}}{(\ell-1)!}\,\mathrm{d}y
\tag{C.7}\\
&=\left(\eta_{2}\sqrt{2\alpha}\right)^{\ell}\mathrm{e}^{-\eta_{2}\left(\beta+s\right)}\int_{-\infty}^{\frac{\beta+s}{\sqrt{2\alpha}}}\frac{1}{(\ell-1)!}\,\mathrm{e}^{-\frac{1}{2}y_{1}^{2}+\eta_{2}\sqrt{2\alpha}\,y_{1}}\left(-y_{1}+\frac{\beta+s}{\sqrt{2\alpha}}\right)^{\ell-1}\mathrm{d}y_{1}
\nonumber\\
&=\left(\eta_{2}\sqrt{2\alpha}\right)^{\ell}\mathrm{e}^{-\eta_{2}\left(\beta+s\right)+\eta_{2}^{2}\alpha}\int_{-\frac{\beta+s}{\sqrt{2\alpha}}+\eta_{2}\sqrt{2\alpha}}^{\infty}\frac{1}{(\ell-1)!}\left(y_{2}-\left(\eta_{2}\sqrt{2\alpha}-\frac{\beta+s}{\sqrt{2\alpha}}\right)\right)^{\ell-1}\mathrm{e}^{-\frac{1}{2}y_{2}^{2}}\,\mathrm{d}y_{2}
\nonumber\\
&=\left(\eta_{2}\sqrt{2\alpha}\right)^{\ell}\mathrm{e}^{-\eta_{2}\left(\beta+s\right)+\eta_{2}^{2}\alpha}\,\mathrm{Hh}_{\ell-1}\left(\eta_{2}\sqrt{2\alpha}-\frac{\beta+s}{\sqrt{2\alpha}}\right),
\nonumber
\end{align}

where $\mathrm{Hh}_{\ell}$ is defined in (3.13). Combining (C.5), (C.6), and (C.7), together with further simplifications, gives (3.1).
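To illustrate how (C.1) and (C.5)-(C.7) might be assembled in practice, the following sketch evaluates the truncated series for $g(s;\Delta t)$ given $\theta$, $\alpha$, $\beta$ of Lemma 3.1 and the weights $Q_{1}^{k,\ell}$, $Q_{2}^{k,\ell}$ of (3.1), which are assumed to be supplied by the caller (they are not reproduced here). The $\mathrm{Hh}$ functions are evaluated with the standard recursion $n\,\mathrm{Hh}_{n}(x)=\mathrm{Hh}_{n-2}(x)-x\,\mathrm{Hh}_{n-1}(x)$, $\mathrm{Hh}_{-1}(x)=e^{-x^{2}/2}$, $\mathrm{Hh}_{0}(x)=\sqrt{2\pi}\,\Phi(-x)$, which is consistent with the integral form appearing in (C.6) but is not taken from (3.13). The function and parameter names (`hh_values`, `g_series`, `Q1`, `Q2`, `K`) are ours; this is a sketch, not the paper's implementation.

```python
import math


def hh_values(n_max, x):
    """Return [Hh_0(x), ..., Hh_{n_max}(x)] via the recursion
    n*Hh_n(x) = Hh_{n-2}(x) - x*Hh_{n-1}(x), with Hh_{-1}(x) = exp(-x^2/2)
    and Hh_0(x) = sqrt(2*pi)*Phi(-x). The recursion can lose accuracy for
    large x and n; it is used here only as a sketch."""
    prev = math.exp(-0.5 * x * x)                                        # Hh_{-1}(x)
    curr = math.sqrt(2.0 * math.pi) * 0.5 * math.erfc(x / math.sqrt(2.0))  # Hh_0(x)
    vals = [curr]
    for n in range(1, n_max + 1):
        prev, curr = curr, (prev - x * curr) / n
        vals.append(curr)
    return vals


def g_series(s, dt, theta, alpha, beta, lam, eta1, eta2, Q1, Q2, K):
    """Truncated series (C.1), with E_k assembled from (C.5)-(C.7).
    Q1[k][l] and Q2[k][l] (k = 1..K, l = 1..k) are the weights of (3.1),
    assumed supplied by the caller, e.g. as nested dictionaries."""
    root = math.sqrt(2.0 * alpha)
    z1 = eta1 * root + (beta + s) / root          # Hh argument in (C.6)
    z2 = eta2 * root - (beta + s) / root          # Hh argument in (C.7)
    hh1, hh2 = hh_values(K, z1), hh_values(K, z2)

    # Closed forms (C.6) and (C.7) for E_{1,l} and E_{2,l}.
    def E1(l):
        return (eta1 * root) ** l * math.exp(eta1 * (beta + s) + eta1 ** 2 * alpha) * hh1[l - 1]

    def E2(l):
        return (eta2 * root) ** l * math.exp(-eta2 * (beta + s) + eta2 ** 2 * alpha) * hh2[l - 1]

    # k = 0 (no-jump) Gaussian term of (C.1).
    g = math.exp(theta - (beta + s) ** 2 / (4.0 * alpha)) / math.sqrt(4.0 * math.pi * alpha)

    weight = 1.0
    for k in range(1, K + 1):
        weight *= lam * dt / k                    # (lam*dt)^k / k!
        Ek = sum(Q1[k][l] * E1(l) + Q2[k][l] * E2(l) for l in range(1, k + 1))  # (C.5)
        g += math.exp(theta) / math.sqrt(4.0 * math.pi * alpha) * weight * Ek
    return g
```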