This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Optimal Portfolios of Illiquid Assets

T. R. Hurd1, Quentin H. Shao1, Tuan Tran1
1Department of Mathematics & Statistics,
McMaster University, Canada
Abstract

This paper investigates the investment behaviour of a large unregulated financial institution (FI) with CARA risk preferences. It shows how the FI optimizes its trading to account for market illiquidity using an extension of the Almgren-Chriss market impact model of multiple risky assets. This expected utility optimization problem over the set of adapted strategies turns out to have the same solutions as a mean-variance optimization over deterministic trading strategies. That means the optimal adapted trading strategy is both deterministic and time-consistent. It is also found to have an explicit closed form that clearly displays interesting properties. For example, the classic constant Merton portfolio strategy, a particular solution of the frictionless limit of the problem, behaves like an attractor in the space of more general solutions. The main effect of temporary market impact is to slow down the speed of convergence to this constant Merton portfolio. The effect of permanent market impact is to incentivize the FI to buy additional risky assets near the end of the period. This property, that we name the Ponzi property, is related to the creation and bursting of bubbles in the market. The proposed model can be used as a stylized dynamic model of a typical FI in the study of the asset fire sale channel relevant to understanding systemic risk and financial stability.

1 Introduction

Long after such landmark contributions as the Markowitz mean-variance strategy (Markowitz [11]) and the Merton portfolio model introduced in Merton [12], our understanding of optimal portfolio selection has continued to develop. We now have learned how to analyze investment in imperfect markets that have frictions such as transaction costs (Davis and Norman [8], Perold [14]) and price impact (Almgren and Chriss [1], Almgren [3], Schöneborn [18]), and have complex dynamics such as jumps (Cartea and Jaimungal [6], Moazeni et al. [13] and Pham and Tankov [15]). Indeed, this problem has generated hundreds of research papers. Our goal now is to present a solvable model of optimal investment for a large financial institution (FI) in a many-asset setting. It is based on the expected utility maximization criterion, and it accounts for market illiquidity, which means the transaction costs to pay and the fact that trades have a permanent price impact. The underlying investment assets, which may be very illiquid, are assumed to follow Bachelier dynamics, meaning they are modelled by correlated arithmetic Brownian motions. For these assumptions to make financial sense, the optimal strategy should be implemented only over a time horizon [0,T][0,T] short enough that the Bachelier dynamics remains a reasonable approximation (we take as a benchmark T=1/2T=1/2 years in our examples).

The class of optimal strategies we obtain has several remarkable properties. First, the general multidimensional problem has a closed-form solution expressible in terms of a matrix-valued equation that can be efficiently computed with a controllable error. Second, the solution depends on the full range of important parameters: temporary price impact, permanent impact, risk aversion, the initial portfolio weights, the risk free interest rate, and the parameters underlying the Bachelier dynamics. Thirdly, the optimal strategies, which are a priori adapted processes that solve a version of Merton’s problem, turn out to be deterministic over a finite time horizon and to solve a version of the Markowitz mean-variance optimization. This property implies that our investment strategies are fully consistent with dynamic programming, despite being deterministic solutions of a time-inconsistent mean-variance optimization problem.

The aim of this paper is to study the effect of market illiquidity on the behaviour of an FI. Funding illiquidity (see for example Brunnermeier and Pedersen [5]) is the distinct effect that the balance sheet of an FI may experience funding shocks caused by unanticipated withdrawals by depositors. To keep the focus of the paper squarely on market liquidity, funding illiquidity is ruled out by the assumption that deposits are constant and sufficient to support all asset purchases of the FI.

The proposed model and its solution is closely related to some important contributions to the existing literature. Our solutions reduce to the Markowitz optimal portfolios, or equivalently to Merton’s optimal solutions, when permanent and temporary impact are both assumed to be zero. The posed finance problem is inspired by the mean-variance optimal liquidation problem studied by Almgren and Chriss [1], but differs in that there is no constraint placed on the portfolio holdings at the terminal time TT. Finally, under certain initial conditions the FI will seek to liquidate a large position, creating what has been called an asset fire sale. Our strategies extend to this setting and give natural criteria similar to those discussed by Brown et al. [4] that solve the problem of the order in which different assets are liquidated.

2 Optimal Portfolio Strategies

This paper will investigate the investment strategies of a large financial institution (FI) with CARA risk preferences (CARA is short for constant absolute risk aversion that trades continuously over a finite time horizon [0,T][0,T] in a market with imperfect liquidity. This is similar to a problem studied in Zhang [19]. The changes caused by rebalancing a portfolio of a large FI may amount to a large fraction of the total daily volume traded of these assets and significantly impact these assets’ prices. It is well understood that this effect will lead the FI to break large orders into small portions spread over time to reduce market liquidity costs, while still aiming to rebalance its portfolio. By taking additional time to reduce liquidity costs, the FI now faces additional uncertainty in the price of the assets. To handle this delicate balance between liquidity costs and price uncertainty, the FI will be inclined to consider utility optimization.

There are sound economic reasons to optimize using an exponential (CARA) utility function: It leads to a tractable time-consistent strategy where additional information does not provide additional utility, and is similar to the original Mean-Variance optimization of Almgren and Chriss [1]. Since the strategy is only implemented over [0,T][0,T], at time TT the FI will update its information and continue in a similar way to rebalance over the subsequent period. This rebalancing is necessary to account for shortcomings of the model, changes in the balance sheet, and unanticipated events that cause fundamental changes to the parameters of the price dynamics.

A number of simplifications will be assumed about this problem. The total information available to the FI up to any given instant of time tt is modelled by a filtration {t}t0\{{\cal F}_{t}\}_{t\geq 0} on a given probability space (Ω,,)(\Omega,{\cal F},\mathbb{P}). The market consists of one risk-free asset with zero interest rate, and dd risky assets whose true price process is St=(St(1),,St(d))S_{t}=(S^{(1)}_{t},...,S^{(d)}_{t})^{\prime} and whose transaction price process is S~t=(S~t(1),,S~t(d))\tilde{S}_{t}=(\tilde{S}^{(1)}_{t},...,\tilde{S}^{(d)}_{t})^{\prime}. Here and in the following, we adopt matrix notation where MM^{\prime} denotes the matrix transpose of MM. Let us denote the vector of the amounts held in risky assets by (qu)u[t,T](q_{u})_{u\in[t,T]} and the vector of trading rates of the large trader by vu:=q˙u:=dqu/du,u[t,T]v_{u}:=\dot{q}_{u}:=dq_{u}/du,u\in[t,T].

Like Almgren and Chriss [1] and others, we suppose that the price of risky assets follows a dd-dimensional Bachelier model with both linear permanent and linear temporary market impact (parametrized by Λ\Lambda and Γ\Gamma respectively):

dSt\displaystyle dS_{t} =\displaystyle= (Λvt+μ)dt+ΣdBt,\displaystyle(\Lambda v_{t}+\mu)\ dt+\Sigma\ dB_{t}\ ,
S~t\displaystyle\tilde{S}_{t} =\displaystyle= St+Γvt.\displaystyle S_{t}+\Gamma v_{t}\ . (1)

Here, BtB_{t} is a dd-dimensional Brownian motion and Σd×d\Sigma\in\mathbb{R}^{d\times d} is the volatility matrix. The drift term μ=bd+ΛQd\mu=b-d+\Lambda Q\in\mathbb{R}^{d} is assumed to be constant. It takes into account the trending rate bb, dividend rate dd and aggregated permanent market price impact due to external traders QQ.

A more general formulation of the model that does not require linear market impact is certainly possible, and will not change many of the same basic properties. However, the assumption of linear impact leads to significantly more tractable optimal strategies. Moreover, as shown by Gatheral and Schied [9], so-called dynamic arbitrage is ruled out by choosing the permanent impact to be linear. It is further assumed that the permanent and temporary impact matrices Λ\Lambda and Γ\Gamma are symmetric and non-negative definite. The assumption that Γ\Gamma is symmetric is without loss of generality. On the other hand, Λ\Lambda is assumed to be symmetric not for economic reasons but for convenience: when it has an anti-symmetric part, a somewhat more complicated explicit solution is obtainable. Models similar to ours have been studied by Almgren and Lorenz [2], Gatheral and Schied [9] and Schied and Schöneborn [16]. We refer to the review by Hurd et al. [10] for further background and justification of these and other similar models.

2.1 The Merton Problem

Merton’s problem, introduced in Merton [12], aims to determine the strategies followed by utility optimizing investors in continuous time market models. To this end, we now consider the most general portfolio strategy, or control process, that trades within the market impact model (1) over some time interval [s,t]+[s,t]\subset\mathbb{R}_{+}. In our setting, each possible strategy will be simply a dd- dimensional trading rate process v=(vu)u[s,t]v=(v_{u})_{u\in[s,t]} that is adapted to the information filtration {t}\{{\cal F}_{t}\}: We denote the set of such admissible strategies by Πad[s,t]\Pi^{ad}[s,t]. The subclass of deterministic strategies where each value vu,u[s,t]v_{u},u\in[s,t] is s\mathcal{F}_{s} measurable is denoted by Πdet[s,t]\Pi^{det}[s,t].

Given any control process vΠad[0,s]v\in\Pi^{ad}[0,s] for 0<t<s0<t<s, the cash net of debt owed Ct:=CtvC_{t}:=C^{v}_{t} and marked-to-market equity, or assets net of debt owed, Xt:=Xtv:=Ct+qtStX_{t}:=X^{v}_{t}:=C_{t}+q_{t}^{\prime}S_{t} are given by:

Ct\displaystyle C_{t} =\displaystyle= C00tvuS~u𝑑u=C00tvuSu𝑑u0tvuΓvu𝑑u,\displaystyle C_{0}-\int_{0}^{t}v_{u}^{\prime}\tilde{S}_{u}\ du=C_{0}-\int_{0}^{t}v_{u}^{\prime}S_{u}\ du-\int_{0}^{t}v_{u}^{\prime}\Gamma v_{u}\ du\ , (2)
Xt\displaystyle X_{t} =\displaystyle= X0+0tqu𝑑Su0tvuΓvu𝑑u.\displaystyle X_{0}+\int_{0}^{t}q_{u}\ dS_{u}-\int_{0}^{t}v_{u}^{\prime}\Gamma v_{u}\ du\ . (3)

where the second equation is obtained by integration by parts. Note that here and henceforth, the superscript vv that labels processes controlled by vv will be omitted.

The interpretation of (2) and (3) in terms of the firm’s balance sheet is that assets are stochastic due to fluctuations of SS, while the debt, thought of as deposits, is assumed to be constant and sufficient to fund all trades. In other words, we focus on market illiquidity without funding illiquidity. It is consistent with the Principle of Limited Liability that a firm becomes insolvent when its equity XtX_{t} becomes negative. In the following, an insolvent firm with negative equity XT<0X_{T}<0 at a time TT, will be declared to be in default, implying that the laws of bankruptcy will be applied to the firm.

The FI can now try to solve Merton’s optimal problem of a CARA investor with constant absolute risk aversion parameter λ>0\lambda>0 over any period [t,T][t,T]. For each tt, they may express the value function JtJ_{t} achieved in terms of a certainty equivalent value WtW_{t},

Jt:=eλWt:=supvΠad[t,T](𝔼[eλXT|t]).J_{t}:=-e^{-\lambda W_{t}}:=\text{sup}_{v\in\Pi^{ad}[t,T]}\ \left(-\mathbb{E}[-e^{-\lambda X_{T}}|{\cal F}_{t}]\right)\ . (4)

If the supremum exists, it is achieved by adopting an optimal control denoted by v(t)=(vu(t))u[t,T]v^{*}(t)=(v_{u}^{*}(t))_{u\in[t,T]}, which will be an adapted process over [t,T][t,T]. The CARA investment problem in general always satisfies the dynamic programming principle (see Schied et al. [17]), which means that for any stTs\leq t\leq T, vu(t)=vu(s)v_{u}^{*}(t)=v_{u}^{*}(s) for all utu\geq t and

eλWs=supvΠad[s,t](𝔼[eλWt|s]).-e^{-\lambda W_{s}}=\text{sup}_{v\in\Pi^{ad}[s,t]}\ \left(-\mathbb{E}[e^{-\lambda W_{t}}|{\cal F}_{s}]\right)\ . (5)

An investor restricted to deterministic strategies over [t,T][t,T] cannot achieve a higher certainty equivalent value than equation (4). Therefore, if W~t\widetilde{W}_{t} is defined by

eλW~t:=supvΠdet[t,T](𝔼[eλXT|t])-e^{-\lambda\widetilde{W}_{t}}:=\text{sup}_{v\in\Pi^{det}[t,T]}\ \left(-\mathbb{E}[-e^{-\lambda X_{T}}|{\cal F}_{t}]\right) (6)

then W~tWt\widetilde{W}_{t}\leq W_{t}. The first result of this paper, stated next, is that (4) is always optimized by deterministic strategies and therefore Wt=W~tW_{t}=\widetilde{W}_{t} for t0t\geq 0. Moreover, it will be found in subsequent sections that the optimal control and value functions can be expressed in closed forms involving one-dimensional integrals that solve a system of ordinary differential equations of Riccati type. First, however, a note about notation: Because vv^{*} and qq^{*} turn out to be deterministic, we henceforth replace the stochastic process notation vuv_{u} by function notation v(u)v(u) and moreover suppress the dependence on the investment period [t,T][t,T].

Theorem 2.1.

Under the above modelling assumptions, there is a (possibly infinite) maximal time T+{}T^{*}\in\mathbb{R}_{+}\cup\{\infty\}) such that for any finite time horizon [t,T][t,T] with 0tTT0\leq t\leq T\leq T^{*}:

  1. 1.

    The optimal strategy v(u),u[t,T]v^{*}(u),u\in[t,T] exists, is unique and t{\cal F}_{t} measurable, hence deterministic.

  2. 2.

    The value function W~t\widetilde{W}_{t} achieved over [t,T][t,T], when restricted to deterministic strategies, equals WtW_{t}.

  3. 3.

    The value function has the form Wt=Xt+V(Tt,q)W_{t}=X_{t}+V(T-t,q) where V(τ,q),τ=TtV(\tau,q),\tau=T-t solves the non-linear partial differential equation

    τV+qμλ2qΣΣq+14(Λq+qV)Γ1(Λq+qV)=0,V(0,q)=0\hskip-54.2025pt-\partial_{\tau}V+q^{\prime}\mu-\frac{\lambda}{2}q^{\prime}\Sigma\Sigma^{\prime}q+\frac{1}{4}(\Lambda q+\partial_{q}V)^{\prime}\Gamma^{-1}(\Lambda q+\partial_{q}V)=0,\quad V(0,q)=0\ (7)

    on the domain [0,T]×d[0,T]\times\mathbb{R}^{d}.

  4. 4.

    Given initial holdings qq at time tt, the optimal portfolio holdings q(u)q^{*}(u) for u[t,T]u\in[t,T] solves the system of ODEs:

    dqdu=Γ12(qV(Tu,q)+Λq),q(t)=q.\displaystyle\frac{dq}{du}=\frac{\Gamma^{-1}}{2}\bigl{(}\partial_{q}V(T-u,q)^{\prime}+\Lambda q\bigr{)}\ ,\quad q(t)=q\ . (8)

The proof of this theorem is found in the Appendix. As we shall see in Section 3, V(τ,q)V(\tau,q) is a quadratic form in qq with time-dependent coefficients and thus the ODE (8) for qq^{*} is linear and can be solved explicitly.

2.2 Mean, Variance, Probability of Default and Time Consistency

From equations (2) and (3) we can deduce that, if vv is deterministic, then for any 0stT0\leq s\leq t\leq T, the equity XtX_{t} conditioned on s{\cal F}_{s} is normally distributed with mean and variance given by

𝔼[Xt|s]\displaystyle\mathbb{E}[X_{t}|{\cal F}_{s}] =\displaystyle= Xs+st(q(u)(Λv(u)+μ)v(u)Γv(u))𝑑u,\displaystyle X_{s}+\int_{s}^{t}\Bigl{(}q^{\prime}(u)(\Lambda v(u)+\mu)-v^{\prime}(u)\Gamma v(u)\Bigr{)}\ du, (9)
Var[Xt|s]\displaystyle{\mbox{Var}}[X_{t}|{\cal F}_{s}] =\displaystyle= stq(u)ΣΣq(u)𝑑u.\displaystyle\int_{s}^{t}\ q^{\prime}(u)\Sigma\Sigma^{\prime}q(u)\ du\ . (10)

In particular, the fact that XT|tX_{T}|{\cal F}_{t} is always normal implies that

𝔼[eλXT|t]=eλ(𝔼[XT|t]λ2Var[XT|t])\mathbb{E}[e^{-\lambda X_{T}}|{\cal F}_{t}]=e^{-\lambda\left(\mathbb{E}[X_{T}|{\cal F}_{t}]-\frac{\lambda}{2}{\mbox{Var}}[X_{T}|{\cal F}_{t}]\right)} (11)

and hence from (6) and Theorem 2.1 one deduces that

Wt=W~t=supvΠdet[t,T](𝔼[XT|t]λ2Var[XT|t]).W_{t}=\widetilde{W}_{t}=\text{sup}_{v\in\Pi^{det}[t,T]}\left(\mathbb{E}[X_{T}|{\cal F}_{t}]-\frac{\lambda}{2}{\mbox{Var}}[X_{T}|{\cal F}_{t}]\right)\ . (12)

This demonstrates the well-known equality of the certainty equivalent value for CARA optimization with the value function for Markowitz’ mean-variance (M-V) optimization, as well as the coincidence of their optimal strategies, when the optimal equity processes under consideration are all normally distributed.

In practice, the firm’s default probability (DP), meaning the probability that XT<0X_{T}<0, may be preferable to variance as a risk measure for institutional investors, as it gives more information about bad scenarios that need to be controlled. In the Bachelier model, the normality that follows for deterministic strategies implies that over any time horizon [t,T][t,T], the Mean-Variance (M-V) criterion


Problem M-V

WV(t,T,q,x,E)\displaystyle W_{V}(t,T,q,x,E) :=\displaystyle:= minvΠdet[t,T]Var[XT|t]\displaystyle\text{min}_{v\in\Pi^{det}[t,T]}\ \text{Var}[X_{T}|{\cal F}_{t}]
subject to 𝔼[XT|t]=E,\displaystyle\text{subject to }\mathbb{E}[X_{T}|{\cal F}_{t}]=E\ ,

and the Mean-Default Probability (M-DP) criterion


Problem M-DP

WDP(t,T,q,x,E)\displaystyle W_{DP}(t,T,q,x,E) :=\displaystyle:= minvΠdet[t,T][XT<0|t]\displaystyle\text{min}_{v\in\Pi^{det}[t,T]}\ \mathbb{P}[X_{T}<0|{\cal F}_{t}]
subject to 𝔼[XT|t]=E,\displaystyle\text{subject to }\mathbb{E}[X_{T}|{\cal F}_{t}]=E\ ,

are both solved by the same optimal trading strategy when 𝔼[XT|t]=E>0\mathbb{E}[X_{T}|{\cal F}_{t}]=E>0. This is because [XT<0|t]\mathbb{P}[X_{T}<0|{\cal F}_{t}] is strictly increasing in Var[XT|t]\text{Var}[X_{T}|{\cal F}_{t}] as long as 𝔼[XT|t]>0\mathbb{E}[X_{T}|{\cal F}_{t}]>0 is fixed. Moreover, if v(λ)v^{*}(\lambda) denotes the optimizer of (12), and XT(λ)X^{*}_{T}(\lambda) is the optimal equity it achieves, then v(λ)v^{*}(\lambda) also optimizes problems (2.2) provided E=E(λ):=𝔼[XT(λ)|t]E=E(\lambda):=\mathbb{E}[X^{*}_{T}(\lambda)|{\cal F}_{t}], and also (2.2) if in addition E(λ)>0E(\lambda)>0 .

Since Merton’s optimal problem (4) satisfies Bellman’s Dynamic Programming Principle at all times, its optimal strategies are “time-consistent”, which means that the optimal strategies computed for any two periods [t,T][t,T] and [s,T][s,T] always coincide on the intersection [st,T][s\vee t,T]. On the other hand, it is known () that mean-variance optimization is generally time-inconsistent and optimal adapted strategies starting at one time do not usually appear optimal at a later time. Surprisingly, Theorem 2.1 combined with equation 12 implies that in the present context, both the mean-variance and mean-default probability problems (2.2) and (2.2) are in fact time consistent, provided the optimization is restricted to deterministic strategies. The following result summarizes these relationships.

Corollary 2.1.1.

For any fixed time horizon [t,T][t,T], let E(λ)=𝔼[XT(λ)|t]E(\lambda)=\mathbb{E}[X^{*}_{T}(\lambda)|{\cal F}_{t}] be the expected value of equity computed for the optimal strategy v(λ)v^{*}(\lambda) of the CARA investment problem (12) with risk aversion parameter λ\lambda. Let E¯\underline{E} and E¯\overline{E} be the infimum and supremum of E(λ)E(\lambda) when λ\lambda varies over [0,)[0,\infty). Then:

  1. 1.

    For any E(E¯,E¯),E\in(\underline{E},\overline{E}), there exists a unique λ=λ(E)\lambda=\lambda(E) such that E(λ)=EE(\lambda)=E.

  2. 2.

    For all possible values of E(λ)E(\lambda), the deterministic optimal strategies vv^{*} of Problem M-V coincide with the unique adapted optimal strategy v(λ)v^{*}(\lambda) of (12).

  3. 3.

    The optimal strategies computed for any two periods [t,T][t,T] and [s,T][s,T] are time-consistent, meaning they coincide on the intersection [st,T][s\vee t,T].

  4. 4.

    If E(λ)>0E(\lambda)>0, the deterministic optimal strategies vv^{*} of Problem M-V and Problem M-DP also coincide with each other.

3 Explicit Optimal Strategies

We now exploit the tractability of Merton’s problem in the market impact setting to obtain closed formulas (involving matrix algebra and one dimensional integration) for the optimal trading curve of the financial institution (FI). The techniques invoked in this section are closely related to the methods developed for the optimal liquidation problem in .

Proposition 3.1.

Under the modelling assumptions of Theorem 2.1, for any finite TT with TTT\leq T^{*}, the value function Wt:=W(t,T,q,x)=x+V(τ,q),τ=TtW_{t}:=W(t,T,q,x)=x+V(\tau,q),\tau=T-t for any t[0,T]t\in[0,T] has the form

V(τ,q)=qA(τ)q+B(τ)q+C(τ)V(\tau,q)=q^{\prime}A(\tau)q+B(\tau)q+C(\tau) (15)

where A,B,CA,B,C are matrix valued functions of dimension [d,d][d,d], [1,d][1,d] and [1,1][1,1] respectively with AA symmetric. These functions satisfy Riccati-type ODEs for τ>0\tau>0:

Aτ\displaystyle\frac{\partial A}{\partial\tau} \displaystyle- (A+Λ/2)Γ1(A+Λ/2)+λ2ΣΣ=0,A(0)=0,\displaystyle(A+\Lambda/2)^{\prime}\Gamma^{-1}(A+\Lambda/2)+\frac{\lambda}{2}\Sigma\Sigma^{\prime}=0,\quad A(0)=0, (16)
Bτ\displaystyle\frac{\partial B}{\partial\tau} \displaystyle- BΓ1(A+Λ/2)μ=0,B(0)=0,\displaystyle B\Gamma^{-1}(A+\Lambda/2)-\mu^{\prime}=0,\quad B(0)=0, (17)
Cτ\displaystyle\frac{\partial C}{\partial\tau} \displaystyle- 14BΓ1B=0,C(0)=0.\displaystyle\frac{1}{4}B\Gamma^{-1}B^{\prime}=0,\quad C(0)=0. (18)

The next theorem, whose proof is given in Appendix A, solves this system of Riccati equations in closed form in terms of E:=Γ1/2(Λ/2)Γ1/2E:=\Gamma^{-1/2}(\Lambda/2)\Gamma^{-1/2} and the symmetric square root DD of

D2:=λ2Γ1/2ΣΣΓ1/2.D^{2}:=\frac{\lambda}{2}\Gamma^{-1/2}\Sigma\Sigma^{\prime}\Gamma^{-1/2}\ .

It also provides a closed form for the optimal strategy qq^{*}.

Theorem 3.2.
  1. 1.

    The solution of the system of Riccati equations (16)–(18) over the maximal interval [0,T][0,T^{*}] is given by

    A(τ)=Γ12(V(τ)U(τ)1E)Γ12\displaystyle A(\tau)=\Gamma^{\frac{1}{2}}\Bigl{(}V(\tau)U(\tau)^{-1}-E\Bigr{)}\Gamma^{\frac{1}{2}} (19)
    B(τ)=μ¯(EV(τ))U1(τ)Γ12\displaystyle B(\tau)=\overline{\mu}^{\prime}\bigl{(}E-V(\tau)\bigr{)}U^{-1}(\tau)\Gamma^{\frac{1}{2}} (20)
    C(τ)=14μ¯(0τ(EV(s))(U(s)U(s))1(EV(s))𝑑s)μ¯\displaystyle C(\tau)=\frac{1}{4}\overline{\mu}^{\prime}\left(\int_{0}^{\tau}\bigl{(}E-V(s)\bigr{)}\bigl{(}U^{\prime}(s)U(s)\bigr{)}^{-1}(E-V^{\prime}(s))ds\right)\overline{\mu} (21)

    where μ¯:=D2Γ1/2μ\overline{\mu}:=D^{-2}\Gamma^{-1/2}\mu and the matrix valued functions U,VU,V are given by

    U(τ)\displaystyle U(\tau) =cosh(Dτ)D1sinh(Dτ)E\displaystyle=\ {\cosh}(D\tau)-D^{-1}\ {\sinh}(D\tau)E (22)
    V(τ)\displaystyle V(\tau) =sinh(Dτ)D+cosh(Dτ)E.\displaystyle=-\ {\sinh}(D\tau)D+\ {\cosh}(D\tau)E. (23)
  2. 2.

    The maximal time horizon TT^{*} is

    T=inf{τ>0:U(τ) is not invertible}.T^{*}=\ {\inf}\{\tau>0:U(\tau)\mbox{ is not invertible}\}\ .

    TT^{*} is finite if D<ED<E and \infty if D>ED>E.

  3. 3.

    For any (t,T,q,x),(t,T,q,x), the optimal trading curve q(u)q^{*}(u) over the period [t,T][t,T] is

    q(u)\displaystyle q^{*}(u) =\displaystyle= Γ1/2U(Tu)U1(Tt)Γ1/2q\displaystyle\Gamma^{-1/2}U(T-u)U^{-1}(T-t)\Gamma^{1/2}q (25)
    +12Γ1/2U(Tu)tuU1(Tr)Γ1/2B(Tr)𝑑r\displaystyle+\frac{1}{2}\Gamma^{-1/2}U(T-u)\int^{u}_{t}U^{-1}(T-r)\Gamma^{-1/2}B^{\prime}(T-r)dr
  4. 4.

    For any (t,T,q,x),(t,T,q,x), the expected value and variance of the optimal terminal equity are:

    𝔼[XT(λ)|t]\displaystyle\mathbb{E}[X_{T}^{*}(\lambda)|{\cal F}_{t}] =\displaystyle= x+q(A(Tt)λ2L(Tt))q\displaystyle x+q^{\prime}\Bigl{(}A(T-t)-\frac{\lambda}{2}L(T-t)\Bigr{)}q (26)
    +(B(Tt)λ2M(Tt))q+C(Tt)λ2N(Tt),\displaystyle\hskip-50.58878pt+\ \Bigl{(}B(T-t)-\frac{\lambda}{2}M(T-t)\Bigr{)}q+C(T-t)-\frac{\lambda}{2}N(T-t)\ ,
    Var[XT(λ)|t]\displaystyle{\mbox{Var}}[X_{T}^{*}(\lambda)|{\cal F}_{t}] =\displaystyle= qL(Tt)q+M(Tt)q+N(Tt),\displaystyle q^{\prime}L(T-t)q+M(T-t)q+N(T-t)\ , (27)

where formulas for L,M,NL,M,N are given in Appendix A.

In the special case when DD and EE are commuting matrices, these formulas decouple into dd one-dimensional problems, each of which is similar to the single risky asset case we next discuss.

3.1 The Case of a Single Risky Asset

In the single risky asset case, one can verify that the scalar functions A,B,CA,B,C and the optimal trading strategy q(u)q^{*}(u) have comparatively simple formulas obtained by reducing those given in Theorem 3.2. Notice that several distinct possibilities are determined by the relation between D=Σλ2ΓD=\Sigma\sqrt{\frac{\lambda}{2\Gamma}} and E=Λ2Γ.E=\frac{\Lambda}{2\Gamma}.

Proposition 3.3.

In the single asset case,

  1. 1.

    When D>ED>E or 2λΓΣ2>Λ22\lambda\Gamma\Sigma^{2}>\Lambda^{2}, Denote K=tanh1(E/D)K=\tanh^{-1}(E/D) we have U(τ)=cosh(DτK)coshKU(\tau)=\frac{\cosh(D\tau-K)}{\cosh K} The formulas can be rewritten in terms of hyperbolic functions as follows

    A(τ)\displaystyle A(\tau) =\displaystyle= ΓD[tanh(DτK)+tanhK]\displaystyle-\Gamma D\left[\tanh(D\tau-K)+\tanh K\right] (28)
    B(τ)\displaystyle B(\tau) =\displaystyle= μD(sinhKcosh(DτK)+tanh(DτK))\displaystyle\frac{\mu}{D}\left(\frac{\sinh K}{\cosh(D\tau-K)}+\tanh(D\tau-K)\right) (29)
    C(τ)\displaystyle C(\tau) =\displaystyle= μ24ΓD3[(sinh2(DτK)1)(tanh(DτK)+tanhK)Dτ]\displaystyle\frac{\mu^{2}}{4\Gamma D^{3}}\left[(\sinh^{2}(D\tau-K)-1)(\tanh(D\tau-K)+\tanh K)-D\tau\right] (30)
    +\displaystyle+ μ22ΓD2(tanhKsinhKcosh(DτK)).\displaystyle\frac{\mu^{2}}{2\Gamma D^{2}}(\tanh K-\frac{\sinh K}{\cosh(D\tau-K)}). (31)

    The optimal trading strategy is given by

    q(u)\displaystyle q(u)^{*} =cosh(DτuK)cosh(DτtK)q+μ2ΓD2(1cosh(DτuK)cosh(DτtK))\displaystyle=\frac{\cosh(D\tau_{u}-K)}{\cosh(D\tau_{t}-K)}q+\frac{\mu}{2\Gamma D^{2}}\left(1-\frac{\cosh(D\tau_{u}-K)}{\cosh(D\tau_{t}-K)}\right) (33)
    +μsinhKcosh(DτuK)2ΓD2(tanh(DτuK)tanh(DτtK)),\displaystyle+\frac{\mu\sinh K\cosh(D\tau_{u}-K)}{2\Gamma D^{2}}\left(\tanh(D\tau_{u}-K)-\tanh(D\tau_{t}-K)\right),

    where τs=Ts.\tau_{s}=T-s.

  2. 2.

    When D=ED=E or 2λΓΣ2=Λ22\lambda\Gamma\Sigma^{2}=\Lambda^{2}

    A(τ)\displaystyle A(\tau) =\displaystyle= 0\displaystyle 0 (34)
    B(τ)\displaystyle B(\tau) =\displaystyle= μD(eDτ1)\displaystyle\frac{\mu}{D}\left(e^{D\tau}-1\right) (35)
    C(τ)\displaystyle C(\tau) =\displaystyle= μ22DλΣ2[12e2Dτ2eDτ+Dτ+32].\displaystyle\frac{\mu^{2}}{2D\lambda\Sigma^{2}}\left[\frac{1}{2}e^{2D\tau}-2e^{D\tau}+D\tau+\frac{3}{2}\right]. (36)
  3. 3.

    When D<ED<E or 0<2λΓΣ2<Λ20<2\lambda\Gamma\Sigma^{2}<\Lambda^{2}, Denote K=coth1(E/D)K=\coth^{-1}(E/D) we have U(τ)=sinh(DτK)sinhK.U(\tau)=-\frac{\sinh(D\tau-K)}{\sinh K}. The formulas can be rewritten in terms of hyperbolic functions as follows

    A(τ)\displaystyle A(\tau) =\displaystyle= ΓD[coth(DτK)+cothK]\displaystyle-\Gamma D\left[\coth(D\tau-K)+\coth K\right] (37)
    B(τ)\displaystyle B(\tau) =\displaystyle= μD(coshKsinh(DτK)+coth(DτK))\displaystyle\frac{\mu}{D}\left(\frac{-\cosh K}{\sinh(D\tau-K)}+\coth(D\tau-K)\right) (38)
    C(τ)\displaystyle C(\tau) =\displaystyle= μ24ΓD3[(cosh2(DτK)+1)(coth(DτK)+cothK)+Dτ]\displaystyle-\frac{\mu^{2}}{4\Gamma D^{3}}\left[(\cosh^{2}(D\tau-K)+1)(\coth(D\tau-K)+\coth K)+D\tau\right] (39)
    +\displaystyle+ μ22ΓD2(cothK+coshKsinh(DτK)).\displaystyle\frac{\mu^{2}}{2\Gamma D^{2}}(\coth K+\frac{\cosh K}{\sinh(D\tau-K)}). (40)

    The optimal trading strategy is given by

    q(u)\displaystyle q(u)^{*} =sinh(DτuK)sinh(DτtK)q+μ2ΓD2(1sinh(DτuK)sinh(DτtK))\displaystyle=\frac{\sinh(D\tau_{u}-K)}{\sinh(D\tau_{t}-K)}q+\frac{\mu}{2\Gamma D^{2}}\left(1-\frac{\sinh(D\tau_{u}-K)}{\sinh(D\tau_{t}-K)}\right) (42)
    μcoshKsinh(DτuK)2ΓD2(coth(DτuK)coth(DτtK)),\displaystyle-\frac{\mu\cosh K\sinh(D\tau_{u}-K)}{2\Gamma D^{2}}\left(\coth(D\tau_{u}-K)-\coth(D\tau_{t}-K)\right),

    where τs=Ts.\tau_{s}=T-s.

  4. 4.

    When λ=0\lambda=0, we have U(τ)=1EτU(\tau)=1-E\tau and V(τ)=E.V(\tau)=E. Moreover

    A(τ)\displaystyle A(\tau) =Λ2(1U(τ)U(τ))\displaystyle=\frac{\Lambda}{2}(\frac{1-U(\tau)}{U(\tau)}) (43)
    B(τ)\displaystyle B(\tau) =μΓΛ(1U(τ)2U(τ))\displaystyle=\frac{\mu\Gamma}{\Lambda}(\frac{1-U(\tau)^{2}}{U(\tau)}) (44)
    C(τ)\displaystyle C(\tau) =μ2Γ26Λ3(U(τ)4+6U(τ)28U(τ)+3U(τ)).\displaystyle=\frac{\mu^{2}\Gamma^{2}}{6\Lambda^{3}}\left(\frac{-U(\tau)^{4}+6U(\tau)^{2}-8U(\tau)+3}{U(\tau)}\right). (45)

    The optimal trading strategy is given by

    q(u)=U(τu)(qU(τt)+μ4ΓE2(U(τt)+1U(τt)U(τu)1U(τu))).\displaystyle q(u)^{*}=U(\tau_{u})\left(\frac{q}{U(\tau_{t})}+\frac{\mu}{4\Gamma E^{2}}\left(U(\tau_{t})+\frac{1}{U(\tau_{t})}-U(\tau_{u})-\frac{1}{U(\tau_{u})}\right)\right). (46)

The third and fourth cases are the cases where T<T^{*}<\infty, and one finds the solutions become unbounded: limτTA(τ)=\lim_{\tau\rightarrow T^{*}}A(\tau)=\infty. In cases 1 and 2, the solutions are bounded for all τ\tau, and T=T^{*}=\infty.

3.2 Small Perturbations from Merton’s Solution

In his original paper Merton [12], Robert Merton presented the exact solution to the problem of optimal investment in a frictionless market for an asset price that follows a geometric Brownian motion. His solution technique also leads to an exact solution of our present model in the limit of zero market impact, Λ=0,Γ=0\Lambda=0,\Gamma=0, which we will call the “Merton solution”. It is of some interest to consider the explicit general solution from the previous section as a perturbation of the Merton solution, and to investigate the nature of its convergence as market impact goes to zero. We suppose that Λ=ϵΛ1,Γ=ϵΓ1\Lambda=\epsilon\Lambda_{1},\Gamma=\epsilon\Gamma_{1} with small ϵ\epsilon and denote by W(t,T,x,q,ϵ)W(t,T,x,q,\epsilon) the certainty equivalent value function with its dependence on ϵ\epsilon. For simplicity, we confine our attention to the single asset case of the previous section.

The Merton solution over the period [t,T][t,T] with initial conditions Xt=x,qt=qX_{t}=x,q_{t}=q involves an instantaneous trade that incurs no trading cost, to the optimal value qM:=μλΣ2q^{M}:=\frac{\mu}{\lambda\Sigma^{2}}. This portfolio is then held constant. One can show that this strategy achieves the certainty equivalent value function W(t,T,x,q,0)=x+μ2(Tt)2λΣ2W(t,T,x,q,0)=x+\frac{\mu^{2}(T-t)}{2\lambda\Sigma^{2}} which we note is independent of qq.

Now, for small ϵ\epsilon, the general solution of our model is given by Case 1 of Proposition 3.3, which leads to the following perturbative expansion

W(t,T,x,q,ϵ)=W(t,T,x,q,0)+ϵ1/2Q(q)+o(ϵ3/2).\displaystyle W(t,T,x,q,\epsilon)=W(t,T,x,q,0)+\epsilon^{1/2}Q(q)+o(\epsilon^{3/2}). (47)

where

Q(q)\displaystyle Q(q) :=\displaystyle:= Γ1D1q2+μD1q+(2E1)D1.\displaystyle\Gamma_{1}D_{1}q^{2}+\frac{\mu}{D_{1}}q+(2E-1)D_{1}. (48)

Here we define D1=Σλ2Γ1D_{1}=\Sigma\sqrt{\frac{\lambda}{2\Gamma_{1}}} which does not depend on ϵ\epsilon and we have D=ϵ1/2D1D=\epsilon^{-1/2}D_{1}\to\infty as ϵ0.\epsilon\to 0. It is obvious that EE does not depend on ϵ\epsilon either. Thus the value function of our problem converges to the value function of the Merton solution with rate of convergence ϵ1/2.\epsilon^{1/2}.

The optimal holding at the terminal time is given by

qT=qM+(qtqM)coshKcosh(DτK)+EqM1tanh(DτK)D(1tanh2K).\displaystyle q_{T}^{*}=q^{M}+(q_{t}-q^{M})\frac{\cosh K}{\cosh(D\tau-K)}+Eq^{M}\frac{1-\tanh(D\tau-K)}{D(1-\tanh^{2}K)}. (49)

Here τ:=Tt.\tau:=T-t. It is straightforward that limϵ0K=0,\lim_{\epsilon\to 0}K=0, hence limϵ0qT=qM.\lim_{\epsilon\to 0}q_{T}^{*}=q^{M}.

Let A~(τ):=U(τ)V(τ)=Dtanh(DτK),\tilde{A}(\tau):=\frac{U(\tau)}{V(\tau)}=D\ {\tanh}(D\tau-K), the trading rate at the initial time tt is given by

vt\displaystyle v_{t} =limstq˙s\displaystyle=\lim_{s\to t}\dot{q}_{s} (52)
=qtA~(τ)+U(τ)μ2ΓD2[E(D2(A~(τ))2)D2E2A~(τ)U(τ)]\displaystyle=q_{t}\tilde{A}(\tau)+U(\tau)\frac{\mu}{2\Gamma D^{2}}[\frac{E(D^{2}-(\tilde{A}(\tau))^{2})}{D^{2}-E^{2}}-\frac{\tilde{A}(\tau)}{U(\tau)}]
=(qtqM)Dtanh(DτK)+μEcoshK2Γ1D12cosh(DτK).\displaystyle=(q_{t}-q^{M})D\tanh(D\tau-K)+\frac{\mu E\cosh K}{2\Gamma_{1}D_{1}^{2}\cosh(D\tau-K)}.

Note that limϵ0tanh(DτK)=1\lim_{\epsilon\to 0}{\tanh}(D\tau-K)=1 and limϵ0cosh(DτK)=\lim_{\epsilon\to 0}{\cosh}(D\tau-K)=\infty, we have limϵ0vt=±\lim_{\epsilon\to 0}v_{t}=\pm\infty depending on if qt>qMq_{t}>q^{M} or qt<qM,q_{t}<q^{M}, i.e. the optimal strategy is to trade rapidly in the beginning. We then conclude that the optimal trajectory q(u,ϵ)q^{*}(u,\epsilon) converges to an LL–shaped or Γ\Gamma–shaped curve when the market impact tends to zero.

This result implies that when market impact is low, the firm will follow an optimal trading strategy very close to the constant holding strategy of the Merton problem. A more surprising fact is the portfolio which starts at the Merton portfolio will remain constant if permanent impact has Λ1=0\Lambda_{1}=0, and all strategies regardless of initial portfolios will move towards the Merton portfolio for sometime initially. In the following subsection, we will show similar results for the Multi-Asset case.

4 Numerical Investigations

We now consider the investment behaviour of a hypothetical unregulated financial institution, such as a hedge fund or mutual fund. The firm trades a single risky asset, with initial price S0=$100S_{0}=\$100, in a market with a 0%0\% risk free rate of return. They use our CARA optimal investment model to trade over non-overlapping half-year trading periods: we focus here on the period [0,T],T=1/2[0,T],\ T=1/2. The CARA risk aversion parameter λ\lambda is chosen to be consistent with a target default probability of 1%1\% for each period. Thus the firm will trade aggressively to maximize their expected return with a quite high tolerance to the potential of default.

The calibrated parameters of the model given in Table 1 are taken to be fixed at the beginning of the period t=0t=0. Note that the firm uses the Bachelier model only for a short period, and expects to recalibrate at the beginning of the each successive period. Since the risky asset is illiquid, there is market impact related to the velocity of trading and the total amount traded: these are assumed to give the temporary and permanent market impact parameter estimates Γ=$107years/ (units traded)2\Gamma=\$10^{-7}\text{years/ (units traded)}^{2} and Λ=$4108/unit\Lambda=\$4*10^{-8}/\text{unit}.

Balance sheets for a small, medium and large firm will be considered, all with a risky asset-to-equity ratio of 4:14:1. The initial stock holdings are q0=[50000,200000,800000]q_{0}=[50000,200000,800000] from which the initial firm equity and cash net of debt are then determined to be X0=0.25q0S0,C0=0.75q0S0X_{0}=0.25*q_{0}*S_{0},C_{0}=-0.75*q_{0}*S_{0} . In all three cases, as indicated above, the financial institution targets a fixed default probability under the optimal strategy. This is implemented by choosing the internal risk aversion parameter λ\lambda so that the WDP(0,T,q0,X0,E(λ))=0.01W_{DP}(0,T,q_{0},X_{0},E(\lambda))=0.01. Note that even though the optimal strategy does not depend on X0X_{0} for fixed λ\lambda, this specification of λ\lambda depends on X0X_{0}. Thus firms that differ only in X0X_{0} do adopt different investment strategies.

Table 1: Benchmark Parameters
Calibrated Parameter Model Parameter Value
Initial Stock Price S0=$100S_{0}=\$100
Trading Period [0,T],T=0.5[0,T],\ T=0.5 year
20%20\% Annualized Volatility Σ=$20/unit/year\Sigma=\$20/\text{unit}/\sqrt{\text{year}}
4%4\% Annual Growth μ=\mu=4/unit/year
Temporary Market Impact Γ=$107year/ (unit)2\Gamma=\$10^{-7}\text{year/ (unit)}^{2}
Permanent Market Impact Λ=$4108/unit\Lambda=\$4*10^{-8}/\text{unit}
λ\lambda such that Probability of Default =0.0005=0.0005 λ\lambda varies
Initial Holdings q0=[50000,200000,800000]q_{0}=[50000,200000,800000]
Initial Cash net Debt owing C0=0.75q0S0C_{0}=-0.75*q_{0}*S_{0}
Initial Equity X0=0.25q0S0X_{0}=0.25*q_{0}*S_{0}

4.1 The Efficient Frontier

Figure 1 shows for each of the three firms how the expected rate of return on equity (ERR) and default probability (DP) for their CARA/MV/DP optimal strategies depend as λ\lambda varies over the set of feasible values [0,)[0,\infty). These quantities are computed by the formulas

ERR(λ)=1T(E(λ)X01),DP(λ)=N(E(λ)V(λ)){\rm ERR}(\lambda)=\frac{1}{T}(\frac{E(\lambda)}{X_{0}}-1)\ ,\quad{\rm DP}(\lambda)=N\left(-\frac{E(\lambda)}{\sqrt{V(\lambda)}}\right) (53)

where N()N(\cdot) denotes the CDF of the standard normal and E(λ),V(λ)E(\lambda),V(\lambda) are given by (26) and (27) . Such a graph is called an efficient frontier, and it summarizes the results a firm may achieve by adopting different possible risk aversion parameters.

As explained earlier, the three firms each select the optimal investment strategy given by the value λ\lambda that leads to DP(λ)=0.01{\rm DP}(\lambda)=0.01: with the benchmark parameters given in Table 1, the three values they compute are λ=[2.56×107,6.7×108,1.83×108]\lambda=[2.56\times 10^{-7},6.7\times 10^{-8},1.83\times 10^{-8}]. While Figure 1 suggests that, ceteris paribus, larger firms have a lower efficient frontier, this ordering can be made to reverse by increasing the permanent impact parameter.

Refer to caption
Figure 1: The efficient frontier for three firms with parameters given in Table 1, showing their default probability and expected rate of return on equity, when adopting their optimal portfolio with risk aversion parameters λ\lambda varying over [0,)[0,\infty).

4.2 Properties of the Optimal Trading Curve

To better understand the properties of the optimal investment strategies that result from our method, we now investigate how the three hypothetical firms’ optimal trading in the single asset case compare as important model parameters are varied away from the benchmark parameters of Table 1. Figures 2 and 3 summarize the results of four experiments, and show how the firms’ optimal trading strategies over the time period [0,1/2][0,1/2] years change as the asset rate of return, asset volatility, temporary market impact and permanent market impact are made to vary one at a time. In each figure, the red curve denotes the benchmark parametrization, while the other two curves show the result as one specific parameter is varied upwards (blue curve) and downwards (green curves).

One point needs to be reiterated: for each choice of a set of parameters excluding the risk aversion parameter λ\lambda, λ\lambda is computed to ensure that the firm’s default probability (DP) is exactly 1%. Thus each curve in these figures corresponds to a different value of λ\lambda.

Refer to caption
(a)
Refer to caption
(b)
Figure 2: Effects on the benchmark optimal trading curve (red curve) for three firms as one parameter changes upwards (blue curve) and downwards (green curves). (a) shows the effect of changing μ\mu, the mean rate of return of the risky asset; (b) shows the effect of changing Σ\Sigma, the volatility of the risky asset. The vertical axis shows qq, the amount of the risky asset being held at any time during the trading period. Both figures were computed using a trading period of half a year, while maintaining a probability of default of 1%1\% for all trading curves.

The effect on the optimal strategy of varying the asset rate of return μ\mu and volatility Σ\Sigma is shown in Figure 2. It is not a surprise to observe that the optimal strategy will include more of the risky asset as the rate of return is raised, or as the volatility is lowered. There is a threshold value of Σ\Sigma below which the firm switches from sell strategies to buy strategies. Although not shown in the graph, one finds the reverse is the case for μ\mu. Finally, the velocity of selling strategies seems to retain a similar shape over time under these variations. Each of these observations are borne out by more extensive investigations of the dependence on these parameters.

Refer to caption
(a)
Refer to caption
(b)
Figure 3: Effects on the benchmark optimal trading curve (red curve) for three firms as one parameter changes upwards (blue curve) and downwards (green curves). (a) shows the effect of changing the temporary impact parameter Γ\Gamma; (b) shows the effect of changing the permanent impact parameter Λ\Lambda. The vertical axis shows qq, the amount of the risky asset being held at any time during the trading period. Both figures were computed using a trading period of half a year, while maintaining a probability of default of 1%1\% for all trading curves.

In Figure 3a, the main effect of decreasing temporary impact Γ\Gamma is seen to be to move more quickly to the final holding level early in the period. This can be understood as a change in the optimal balance between reducing temporary impact costs and price uncertainty due to the asset volatility. To a lesser extent, one also sees in these examples that the level of the final holdings decreases slightly as Γ\Gamma increases.

The effect of permanent impact Λ\Lambda on the strategy is more subtle. From Figure 3b, a higher permanent impact parameter Λ\Lambda leads to an optimal strategy ending with a higher holding level. It also causes more curvature for the trading strategies, especially towards the closing time where all trading curves seem to have positive slope. Indeed, directly from (8), the general formula for the trading velocity, one verifies that at the close of the period dqduu=T=12Γ1Λ\frac{dq}{du}\mid_{u=T}=\frac{1}{2}{\Gamma^{-1}\Lambda}. This means that as long as Λ\Lambda is positive, every trader holding long positions, whether leveraging up or down, will always end the period by buying more shares. The reason is because permanent impact gives any trader a small opportunity to push the asset prices in a favourable direction at the last moment. We call this the Ponzi property of our market impact model: the gains it implies cannot be converted to cash without bursting the small price bubble the trader has created.

4.3 Small Market Impact

The perturbative analysis of Section 3.2 provides an alternative framework for understanding the effect of permanent and temporary market impact. We investigate the middle-sized firm with q0=2×105q_{0}=2\times 10^{5} and market impact parameters Γ(ϵ)=ϵ×107,Λ(ϵ)=ϵ×4×108\Gamma(\epsilon)=\epsilon\times 10^{-7},\ \Lambda(\epsilon)=\epsilon\times 4\times 10^{-8} for a sequence of values ϵn=10n,n=0,1,\epsilon_{n}=10^{-n},n=0,1,\dots approaching zero. Figure 4(a) shows how the optimal strategies converge for ϵ0\epsilon\to 0 to the constant Merton solution for u(t,T)u\in(t,T), but show rapid transient effects for uu near both endpoints. The small Ponzi effect near u=Tu=T can be turned off by taking Λ(ϵ)=0\Lambda(\epsilon)=0, as shown in Figure 4(b).

These figures suggest that for reasonable parameter values and small market impact, our model will deliver strategies that are effectively similar to the Merton solution. The observed relationship between the optimal strategies and the Merton solution, valid for small market impact, actually remains true for intermediate levels of market impact such as our benchmark parametrizations. One observes in Figures 2 and 3 that all strategies tend to flatten as uu approaches TT, albeit with a small Ponzi effect at the end of the period. It will be well worth studying the extent that the value of the holdings at which the strategy flattens is well approximated by the Merton solution. As the market impact parameters decrease, the flat portion of the curve becomes wider, and closer to the Merton solution.

Refer to caption
(a) Λ=ϵΛ1,Γ=ϵΓ1\Lambda=\epsilon\Lambda_{1},\Gamma=\epsilon\Gamma_{1}
Refer to caption
(b) Λ=0,Γ=ϵΓ1\Lambda=0,\Gamma=\epsilon\Gamma_{1}
Figure 4: The behaviour of the optimal trading strategy for a decreasing sequence of market impact parameters as described in Section 4.3. They show convergence to the constant Merton solution.
Refer to caption
(a) 3 Uncorrelated Assets
Refer to caption
(b) 3 Positively Correlated Assets
Figure 5: The behaviour of the optimal three asset trading strategy in the uncorrelated and correlated cases, when compared to the Merton solution.

An analysis similar to that of Section 3.2 allows us to understand the multi–asset investment problem in the small market impact regime. Figure 5, we used the standard asset parameters as Asset 2, with Asset 1 being the asset with the lower perturbed parameters from the previous cases, and Asset 3 having the higher perturbed parameters from the previous cases. Figure 5(a), compares the uncorrelated case to the Merton solution. Figure 5(b), compares the case of constant pairwise correlation ρ=0.5\rho=0.5 to the Merton solution. In both cases we can see the behaviour similar to the one asset case above. It should be noted that unlike the single asset case, a hedging strategy can be utilized for when multiple assets are available, hence short selling of a illiquid asset class can be optimal.

We observe again in the multi-asset problem that when the market impact is small, the general optimal strategy is close to the Merton solution.

4.4 Bounded Optimal Trading Strategies

We have seen in Section 4.2 that in the single asset case, positive Λ\Lambda creates the Ponzi property that gives any trader an opportunity to push the price in their favour near the end of the period. Case 1 of Proposition 3.3 shows that as long as Λ<2λΓΣ\Lambda<\sqrt{2\lambda\Gamma}\Sigma, the optimal strategies computed over any finite period [0,T][0,T] remain bounded. However, when Λ>2λΓΣ\Lambda>\sqrt{2\lambda\Gamma}\Sigma, Case 3 of Proposition 3.3 implies that for the period [0,T][0,T^{*}] with T=K~/DT^{*}=\tilde{K}/D, the optimal strategy q(u)q^{*}(u) and the value function WW both blow up at u=0u=0.

Similar possibilities arise in the multi-asset investment problem. As Λ\Lambda increases, eventually the matrix function U(t)U(t) becomes singular for some finite t=Tt=T^{*}. Again, one then finds that for the period [0,T][0,T^{*}], the optimal strategy q(u)q^{*}(u) and the value function WW both blow up at u=0u=0.

5 Remarks and Conclusions

The three hypothetical financial institutions studied in Section 4 face a typical investment problem, namely to maximize their return on equity subject to an upper bound on the downside risk, which is defined here as the probability of default. We have presented an analytically tractable version of the optimal portfolio problem that can be justified three different ways: as utility optimization, as mean-variance optimization and as mean-default probability optimization. Numerical evidence shows that the solutions generated by the method have desirable and interesting features. Perhaps most importantly, we have learned that these strategies closely track the classic Merton solution arising in the zero market impact model.

The three benchmark firms have efficient frontiers shown in Figure 1 that quantify by how much their rate of return will increase if they raise their tolerance to default. We have observed that optimal trading strategies that account for market impact tend to move over the trading period toward the Merton solution. If they are initially close to the Merton solution, they will tend to remain close, which means the Merton solution is robust to perturbations. The speed of approach increases as the temporary impact parameter Γ\Gamma decreases. In addition, the main effect of the permanent impact Λ\Lambda is the Ponzi property that is manifested by some amount of buying near the end of the period. This Ponzi effect is typically small, but as Proposition 3.3 shows, it will dominate the character of the solution when Λ\Lambda becomes large enough to cause an asset price bubble.

Left to themselves, there is little incentive for such FIs to limit risk seeking. By choosing a low value of λ\lambda, or equivalently, accepting a high leverage ratio, they can achieve a high rate of return on capital. Since lower temporary impact and higher permanent impact are both relatively more advantageous to larger firms, one has situations where large firms implement aggressive Ponzi style strategies. In scenarios where the assets perform badly, there is a likelihood of serious asset price feedback that may adversely affect other financial institutions holding common assets. Such asset price feedback, both bubbles and bursts, has been identified in the literature, notably Cifuentes et al. [7], as a critical channel of systemic risk, popularly known as the asset fire sale channel. One application of our model, yet to be explored in detail, will be its use to specify the natural behaviour of the banks and financial institutions in a large financial system, and then to see how systemic risk measures are affected by asset fire sales due to market impact. In this systemic risk context, it will also be important to introduce the effects of funding illiquidity by modelling the stochastic nature of deposits.

If large banks were permitted to act in their own self interest without regard to their systemic effects, they would pose an unacceptable threat to financial stability. For that reason, all banks are subjected to a regime of strict financial regulation, of which the most important are limits to their capital asset ratio and liquidity coverage ratio. Under such regulatory constraints, FIs’ investment strategies will differ dramatically from the optimal strategies produced in the present paper. The optimal behaviour of such regulated financial institutions will be the target of future modelling studies.

References

  • Almgren and Chriss [2001] R. Almgren and N. Chriss. Optimal execution of portfolio transactions. Journal of Risk, 3:5–40, 2001.
  • Almgren and Lorenz [2007] R. Almgren and J. Lorenz. Adaptive arrival price. Trading, 2007(1):59–66, 2007.
  • Almgren [2003] R. F. Almgren. Optimal execution with nonlinear impact functions and trading-enhanced risk. Applied Mathematical Finance, 10(1):1–18, 2003.
  • Brown et al. [2010] D. B. Brown, B. I. Carlin, and M. S. Lobo. Optimal portfolio liquidation with distress risk. Management Science, 56(11):1997–2014, 2010.
  • Brunnermeier and Pedersen [2009] M. K. Brunnermeier and L. H. Pedersen. Market liquidity and funding liquidity. Review of Financial Studies, 22(6):2201–2238, 2009.
  • Cartea and Jaimungal [2015] Á. Cartea and S. Jaimungal. Optimal execution with limit and market orders. Quantitative Finance, 15(8):1279–1291, 2015.
  • Cifuentes et al. [2005] R. Cifuentes, G. Ferrucci, and H. S. Shin. Liquidity risk and contagion. Journal of the European Economic Association, 3(2-3):556–566, 2005.
  • Davis and Norman [1990] M. H. Davis and A. R. Norman. Portfolio selection with transaction costs. Mathematics of Operations Research, 15(4):676–713, 1990.
  • Gatheral and Schied [2011] J. Gatheral and A. Schied. Optimal trade execution under geometric brownian motion in the almgren and chriss framework. International Journal of Theoretical and Applied Finance, 14(03):353–368, 2011.
  • Hurd et al. [2016] T. R. Hurd, Q. H. Shao, and T. Q. Tran. Review of portfolio strategies in illiquid markets. available at, June 2016.
  • Markowitz [1952] H. Markowitz. Portfolio selection. The Journal of Finance, 7(1):77–91, 1952.
  • Merton [1969] R. C. Merton. Lifetime portfolio selection under uncertainty: the continuous–time model. Rev. Econom. Statist., 51:247–257, 1969.
  • Moazeni et al. [2013] S. Moazeni, T. F. Coleman, and Y. Li. Optimal execution under jump models for uncertain price impact. Journal of Computational Finance, 16(4):1–44, 2013.
  • Perold [1988] A. F. Perold. The implementation shortfall: Paper versus reality. The Journal of Portfolio Management, 14(3):4–9, 1988.
  • Pham and Tankov [2008] H. Pham and P. Tankov. A model of optimal consumption under liquidity risk with random trading times. Mathematical Finance, 18(4):613–627, 2008.
  • Schied and Schöneborn [2009] A. Schied and T. Schöneborn. Risk aversion and the dynamics of optimal liquidation strategies in illiquid markets. Finance and Stochastics, 13(2):181–204, 2009.
  • Schied et al. [2010] A. Schied, T. Schöneborn, and M. Tehranchi. Optimal basket liquidation for cara investors is deterministic. Applied Mathematical Finance, 17(6):471–489, 2010.
  • Schöneborn [2008] T. Schöneborn. Trade execution in illiquid markets: Optimal stochastic control and multi-agent equilibria. phd thesis. 2008.
  • Zhang [2014] T. Zhang. Nash Equilibria in Market Impact Models: Differential Game, Transient Price Impact and Transaction Costs, PhD Thesis. PhD thesis, Mannheim, Universität Mannheim, Diss., 2014, 2014.

Appendix A Appendix: Proofs of Main Results

Proof of Theorem 2.1 In this proof we fix TT to be finite. The existence of a maximal TT^{*} is a consequence of solving (7), which is analyzed in the proof of Proposition 3.1. The Hamilton-Jacobi-Bellman (HJB) equation associated to (5) arises from the DPP by assuming Markov controls vt=v(t,T,Xt,qt)v_{t}=v(t,T,X_{t},q_{t}) and value function Wt:=W(t,T,Xtv,qtv)W_{t}:=W(t,T,X^{v}_{t},q^{v}_{t}) for deterministic functions v,Wv,W. For simplicity of exposition, we have omitted the potential for dependence on the stock price StS_{t}: the standard verification result used at the end of this argument shows this is consistent.

Under these assumptions, the DPP implies that eλW(t,T,Xtv,qtv)-e^{-\lambda W(t,T,X^{v}_{t},q^{v}_{t})} is a supermartingale for all vv and a martingale for the optimal vv^{*}, which leads to the HJB equation for WW

tW+XWqμ+12qΣΣq[XX2Wλ(XW)2]\displaystyle\partial_{t}W+\partial_{X}Wq^{\prime}\mu+\frac{1}{2}q^{\prime}\Sigma\Sigma^{\prime}q[\partial_{XX}^{2}W-\lambda(\partial_{X}W)^{2}]
+supv[(qW+XWqΛ)vvΓvXW]=0.\displaystyle\hskip 108.405pt+\ {\sup}_{v}[(\partial_{q}W^{\prime}+\partial_{X}Wq^{\prime}\Lambda)v-v^{\prime}\Gamma v\partial_{X}W]=0.
W(T,T,X,q)=X.\displaystyle W(T,T,X,q)=X.

The ansatz W(t,T,X,q)=X+V(t,T,q)W(t,T,X,q)=X+V(t,T,q) leads to the equation for VV

tV+qμλ2qΣΣq+supv[(qV+qΛ)vvΓv]=0.\displaystyle\partial_{t}V+q^{\prime}\mu-\frac{\lambda}{2}q^{\prime}\Sigma\Sigma^{\prime}q+\ {\sup}_{v}[(\partial_{q}V^{\prime}+q^{\prime}\Lambda)v-v^{\prime}\Gamma v]=0.
V(T,T,q)=0.\displaystyle V(T,T,q)=0.

The optimal feedback control is thus v=Γ12(qV+Λq),v^{*}=\frac{\Gamma^{-1}}{2}(\partial_{q}V^{\prime}+\Lambda q), which is independent of XX and the price process, and hence deterministic. Using this control leads to

tV+qμλ2qΣΣq+14(qV+Λq)Γ1(qV+Λq)=0.\partial_{t}V+q^{\prime}\mu-\frac{\lambda}{2}q^{\prime}\Sigma\Sigma^{\prime}q+\frac{1}{4}(\partial_{q}V+\Lambda q)^{\prime}\Gamma^{-1}(\partial_{q}V+\Lambda q)=0\ . (54)

As we will shortly see in the proof of Proposition 3.1, this ODE has a unique smooth solution which is deterministic, over any finite time interval [t,T][t,T] for TT less than a possibly infinite maximal TT^{*}. Therefore, by the classical verification theorem, we have W=W~W=\tilde{W} and the other statements of the theorem follow.


Proof of Proposition 3.1 By Theorem 2.1 , the value function for Merton’s problem over [t,T][t,T] has the form W(t,T,X,q)=X+V(t,T,q),W(t,T,X,q)=X+V(t,T,q), where VV satisfies the ODE (54). This ODE and the form (15) leads to Riccati equations with initial conditions for A,B,CA,B,C

τA\displaystyle\partial_{\tau}A 14(A+A+Λ)Γ1(A+A+Λ)+λ2ΣΣ=0,A(0)=0\displaystyle-\frac{1}{4}(A+A^{\prime}+\Lambda)\Gamma^{-1}(A+A^{\prime}+\Lambda)+\frac{\lambda}{2}\Sigma\Sigma^{\prime}=0,\quad A(0)=0 (55)
τB\displaystyle\partial_{\tau}B 12BΓ1(A+A+Λ)μ=0,B(0)=0\displaystyle-\frac{1}{2}B\Gamma^{-1}(A+A^{\prime}+\Lambda)-\mu^{\prime}=0,\quad B(0)=0 (56)
τC\displaystyle\partial_{\tau}C 14BΓ1B=0,C(0)=0.\displaystyle-\frac{1}{4}B\Gamma^{-1}B^{\prime}=0,\quad C(0)=0. (57)

Notice that if AA is a solution of (16), then so is AA^{\prime}: By the uniqueness theorem for solutions of ODEs, A=AA=A^{\prime} and therefore AA is symmetric.


Proof of Theorem 3.2 Part 1: Note that λ2Γ1/2ΣΣΓ1/2\frac{\lambda}{2}\Gamma^{-1/2}\Sigma\Sigma^{\prime}\Gamma^{-1/2} is positive definite and define DD to be its symmetric square root. If

A~(τ):=Γ1/2(A(τ)+Λ/2)Γ1/2\tilde{A}(\tau):=\Gamma^{-1/2}(A(\tau)+\Lambda/2)\Gamma^{-1/2}

then (16) becomes

τA~A~2+D2=0,A~(0)=E:=Γ1/2(Λ/2)Γ1/2.\partial_{\tau}\tilde{A}-\tilde{A}^{2}+D^{2}=0,\quad\tilde{A}(0)=E:=\Gamma^{-1/2}(\Lambda/2)\Gamma^{-1/2}\ . (58)

One can now check that the solution to (58) has the form A~=VU1,\tilde{A}=VU^{-1}, where U,VU,V satisfy the following linear ODE with terminal condition

[τUτV]=[0𝟙D20]×[UV],[U(0)V(0)]=[𝟙E].\left[\begin{array}[]{c}\partial_{\tau}U\\ \partial_{\tau}V\end{array}\right]=\left[\begin{array}[]{cc}0&-\mathbbm{1}\\ -D^{2}&0\end{array}\right]\times\left[\begin{array}[]{c}U\\ V\end{array}\right],\quad\left[\begin{array}[]{c}U(0)\\ V(0)\end{array}\right]=\left[\begin{array}[]{c}\mathbbm{1}\\ E\end{array}\right]\ .

By block-diagonalization using

Q=[𝟙𝟙DD],Q1=12[𝟙D1𝟙D1]Q=\left[\begin{array}[]{cc}\mathbbm{1}&\mathbbm{1}\\ D&-D\end{array}\right],\quad Q^{-1}=\frac{1}{2}\left[\begin{array}[]{cc}\mathbbm{1}&D^{-1}\\ \mathbbm{1}&-D^{-1}\end{array}\right]

one finds

[0𝟙D20]=Q[D00D]Q1\left[\begin{array}[]{cc}0&-\mathbbm{1}\\ -D^{2}&0\end{array}\right]=Q\left[\begin{array}[]{cc}-D&0\\ 0&D\end{array}\right]Q^{-1}

and therefore, the solution of the matrix ODE is

[U(τ)V(τ)]=Q[eDτ00eDτ]Q1×[𝟙E].\left[\begin{array}[]{c}U(\tau)\\ V(\tau)\end{array}\right]=Q\left[\begin{array}[]{cc}e^{-D\tau}&0\\ 0&e^{D\tau}\end{array}\right]Q^{-1}\times\left[\begin{array}[]{c}\mathbbm{1}\\ E\end{array}\right]\ .

From the explicit forms

U(τ)\displaystyle U(\tau) =cosh(Dτ)sinh(Dτ)D1E\displaystyle=\ {\cosh}(D\tau)-\ {\sinh}(D\tau)D^{-1}E (59)
V(τ)\displaystyle V(\tau) =sinh(Dτ)D+cosh(Dτ)E\displaystyle=-\ {\sinh}(D\tau)D+\ {\cosh}(D\tau)E (60)

one finds A(τ)=Γ1/2(A~(τ)E)Γ1/2A(\tau)=\Gamma^{1/2}(\tilde{A}(\tau)-E)\Gamma^{1/2} where

A~(τ)=[sinh(Dτ)D+cosh(Dτ)E][cosh(Dτ)sinh(Dτ)D1E]1.\tilde{A}(\tau)=[-\ {\sinh}(D\tau)D+\ {\cosh}(D\tau)E][\ {\cosh}(D\tau)-\ {\sinh}(D\tau)D^{-1}E]^{-1}. (61)

The Riccati equation (17) for B can be solved by noting that B~=BΓ1/2\tilde{B}=B\Gamma^{-1/2} solves the ODE

τB~B~A~μΓ1/2=0.\partial_{\tau}\tilde{B}-\tilde{B}\tilde{A}-\mu^{\prime}\Gamma^{-1/2}=0.

Since τU=A~U\partial_{\tau}U=-\tilde{A}U, we find τ(B~U)=(τB~B~A~)U=μΓ1/2U\partial_{\tau}(\tilde{B}U)=(\partial_{\tau}\tilde{B}-\tilde{B}\tilde{A})U=\mu^{\prime}\Gamma^{-1/2}U which can be integrated to give B~(τ)U(τ)=μΓ1/2(0τU(s)𝑑s)\tilde{B}(\tau)U(\tau)=\mu^{\prime}\Gamma^{-1/2}(\int^{\tau}_{0}U(s)ds) and thus

B(τ)=μΓ1/2(0τU(s)𝑑s)U1(τ)Γ1/2.B(\tau)=\mu^{\prime}\Gamma^{-1/2}\left(\int_{0}^{\tau}U(s)ds\right)U^{-1}(\tau)\Gamma^{1/2}\ .

It is straightforward that 0τU(s)𝑑s=D2[EV(τ)]\int^{\tau}_{0}U(s)ds=D^{-2}[E-V(\tau)] which gives the desired formula

B(τ)=μΓ1/2D2[EV(τ)]U1(τ)Γ1/2.B(\tau)=\mu^{\prime}\Gamma^{-1/2}D^{-2}[E-V(\tau)]U^{-1}(\tau)\Gamma^{1/2}\ .

In a similar fashion, one finds

C(τ)\displaystyle C(\tau) =140τB(s)Γ1B(s)𝑑s\displaystyle=\frac{1}{4}\int_{0}^{\tau}B(s)\Gamma^{-1}B^{\prime}(s)ds (63)
=14μ¯(0τ(EV(s))(U(s)U(s))1(EV(s))𝑑s)μ¯,\displaystyle=\frac{1}{4}\overline{\mu}^{\prime}\left(\int_{0}^{\tau}(E-V(s))(U^{\prime}(s)U(s))^{-1}(E-V^{\prime}(s))ds\right)\overline{\mu},

where μ¯:=D2Γ1/2μ.\overline{\mu}:=D^{-2}\Gamma^{-1/2}\mu.

Part 2: This part is straightforward.

Part 3: From part 4 of Theorem 2.1, the optimal control q(u)q^{*}(u) over the period [t,T][t,T] solves

uqΓ1(A(Tu)+Λ/2)q=12Γ1B(Tu)\partial_{u}q-\Gamma^{-1}(A(T-u)+\Lambda/2)q=\frac{1}{2}\Gamma^{-1}B^{\prime}(T-u)

When this linear ODE is multiplied on the left by the integrating factor U1(Tu)Γ1/2U^{-1}(T-u)\Gamma^{1/2}, the left-hand side becomes an exact derivative:

u[U1(Tu)Γ1/2q]=U1(Tu)Γ1/2×12Γ1B(Tu).\partial_{u}\left[U^{-1}(T-u)\Gamma^{1/2}q\right]=U^{-1}(T-u)\Gamma^{1/2}\times\frac{1}{2}\Gamma^{-1}B^{\prime}(T-u)\ .

Integration of this equation over [t,u][t,u] gives

U1(Tu)Γ1/2q(u)U1(Tt)Γ1/2q=12tuU1(Tr)Γ1/2B(Tr)𝑑rU^{-1}(T-u)\Gamma^{1/2}q(u)-U^{-1}(T-t)\Gamma^{1/2}q=\frac{1}{2}\int^{u}_{t}U^{-1}(T-r)\Gamma^{-1/2}B^{\prime}(T-r)dr

which leads to the desired formula.

Part 4: The Variance is calculated directly as follows

Vart(XT)=tTq(s)ΣΣq(s)𝑑s=qL(Tt)q+M(Tt)q+N(Tt).\text{Var}_{t}(X_{T}^{*})=\int_{t}^{T}q^{*}(s)^{\prime}\Sigma\Sigma^{\prime}q(s)^{*}ds=q^{\prime}L(T-t)q+M(T-t)q+N(T-t)\ .

Rewrite q(u)=U~(Tu)U~1(Tt)q+12U~(Tu)I(u)q^{*}(u)={\tilde{U}}{(T-u)}{\tilde{U}}^{-1}{(T-t)}q+\frac{1}{2}{\tilde{U}}{(T-u)}I(u), where U~(Tu):=Γ1/2U(Tu)\tilde{U}(T-u):=\Gamma^{-1/2}U(T-u) and I(u):=tuU~1(Tr)Γ1B(Tr)𝑑rI(u):=\int_{t}^{u}{\tilde{U}}^{-1}(T-r)\Gamma^{-1}B(T-r)dr. Explicit forms for L,M,NL,M,N are calculated as follows.

L(Tt)=(U~1(Tt))(tTU~(Tr)ΣΣU~(Tr)𝑑r)U~1(Tu).\displaystyle L(T-t)=(\tilde{U}^{-1}(T-t)^{\prime})\Bigl{(}\int_{t}^{T}\tilde{U}(T-r)^{\prime}\Sigma\Sigma^{\prime}\tilde{U}(T-r)dr\Bigr{)}\tilde{U}^{-1}(T-u)\ .

By using Fubini’s formula, we have

M(Tt)\displaystyle M^{\prime}(T-t) =tTU~1(Tt)U~(Tr)ΣΣU~(Tr)I(r)𝑑r\displaystyle=\int_{t}^{T}{\tilde{U}^{-1}}{(T-t)^{\prime}}{\tilde{U}}{(T-r)^{\prime}}\Sigma\Sigma^{\prime}{\tilde{U}}{(T-r)}I(r)dr
=tT(sTU~1(Tt)U~(Tr)ΣΣU~(Tr)𝑑r)U~(Ts)1Γ1B(Ts)𝑑s\displaystyle=\int_{t}^{T}\Bigl{(}\int_{s}^{T}{\tilde{U}^{-1}}{(T-t)^{\prime}}{\tilde{U}}{(T-r)^{\prime}}\Sigma\Sigma^{\prime}{\tilde{U}}{(T-r)}dr\Bigr{)}{\tilde{U}}{(T-s)}^{-1}\Gamma^{-1}B(T-s)ds
=U~(Tt)1tTU~(Ts)L(Ts)Γ1B(Ts)𝑑s.\displaystyle={\tilde{U}}{(T-t)^{\prime}}^{-1}\int_{t}^{T}{\tilde{U}}{(T-s)}L(T-s)\Gamma^{-1}B(T-s)ds\ .

Similarly

N(Tt)\displaystyle N(T-t) =14tTI(r)U~(Tr)ΣΣU~(Tr)I(r)𝑑r\displaystyle=\frac{1}{4}\int_{t}^{T}I(r)^{\prime}{\tilde{U}}{(T-r)}^{\prime}\Sigma\Sigma^{\prime}{\tilde{U}}{(T-r)}I(r)dr
=14tT(sTI(r)U~(Tr)ΣΣU~(Tr)𝑑r)U~1(Ts)Γ1B(Ts)𝑑s\displaystyle=\frac{1}{4}\int_{t}^{T}\Bigl{(}\int_{s}^{T}I(r)^{\prime}{\tilde{U}}{(T-r)}^{\prime}\Sigma\Sigma^{\prime}{\tilde{U}}{(T-r)}dr\Bigr{)}{\tilde{U}}^{-1}(T-s)\Gamma^{-1}B(T-s)ds
=14tTM(Ts)Γ1B(Ts)𝑑s.\displaystyle=\frac{1}{4}\int_{t}^{T}M(T-s)\Gamma^{-1}B(T-s)ds\ .