Optimal Portfolios of Illiquid Assets

T. R. Hurd¹, Quentin H. Shao¹, Tuan Tran¹
¹Department of Mathematics & Statistics,
McMaster University, Canada

Abstract

This paper investigates the investment behaviour of a large unregulated financial institution (FI) with CARA risk preferences. It shows how the FI optimizes its trading to account for market illiquidity using an extension of the Almgren-Chriss market impact model to multiple risky assets. This expected utility optimization problem over the set of adapted strategies turns out to have the same solutions as a mean-variance optimization over deterministic trading strategies. That means the optimal adapted trading strategy is both deterministic and time-consistent. It is also found to have an explicit closed form that clearly displays interesting properties. For example, the classic constant Merton portfolio strategy, a particular solution of the frictionless limit of the problem, behaves like an attractor in the space of more general solutions. The main effect of temporary market impact is to slow down the speed of convergence to this constant Merton portfolio. The effect of permanent market impact is to incentivize the FI to buy additional risky assets near the end of the period. This property, which we name the Ponzi property, is related to the creation and bursting of bubbles in the market. The proposed model can be used as a stylized dynamic model of a typical FI in the study of the asset fire sale channel relevant to understanding systemic risk and financial stability.

1 Introduction

Long after such landmark contributions as the Markowitz mean-variance strategy (Markowitz [11]) and the Merton portfolio model introduced in Merton [12], our understanding of optimal portfolio selection has continued to develop. We have now learned how to analyze investment in imperfect markets that have frictions such as transaction costs (Davis and Norman [8], Perold [14]) and price impact (Almgren and Chriss [1], Almgren [3], Schöneborn [18]), and that have complex dynamics such as jumps (Cartea and Jaimungal [6], Moazeni et al. [13] and Pham and Tankov [15]). Indeed, this problem has generated hundreds of research papers. Our goal now is to present a solvable model of optimal investment for a large financial institution (FI) in a many-asset setting. It is based on the expected utility maximization criterion, and it accounts for market illiquidity, meaning both the transaction costs incurred when trading and the permanent price impact of trades. The underlying investment assets, which may be very illiquid, are assumed to follow Bachelier dynamics, meaning they are modelled by correlated arithmetic Brownian motions. For these assumptions to make financial sense, the optimal strategy should be implemented only over a time horizon $[0,T]$ short enough that the Bachelier dynamics remains a reasonable approximation (we take as a benchmark $T=1/2$ years in our examples).

The class of optimal strategies we obtain has several remarkable properties. First, the general multidimensional problem has a closed-form solution expressible in terms of a matrix-valued equation that can be efficiently computed with a controllable error. Second, the solution depends on the full range of important parameters: temporary price impact, permanent impact, risk aversion, the initial portfolio weights, the risk free interest rate, and the parameters underlying the Bachelier dynamics. Third, the optimal strategies, which are a priori adapted processes that solve a version of Merton’s problem, turn out to be deterministic over a finite time horizon and to solve a version of the Markowitz mean-variance optimization. This property implies that our investment strategies are fully consistent with dynamic programming, despite being deterministic solutions of a mean-variance optimization problem, which is in general time-inconsistent.

The aim of this paper is to study the effect of market illiquidity on the behaviour of an FI. Funding illiquidity (see for example Brunnermeier and Pedersen [5]) is the distinct effect that the balance sheet of an FI may experience funding shocks caused by unanticipated withdrawals by depositors. To keep the focus of the paper squarely on market liquidity, funding illiquidity is ruled out by the assumption that deposits are constant and sufficient to support all asset purchases of the FI.

The proposed model and its solution are closely related to some important contributions to the existing literature. Our solutions reduce to the Markowitz optimal portfolios, or equivalently to Merton’s optimal solutions, when permanent and temporary impact are both assumed to be zero. The finance problem posed here is inspired by the mean-variance optimal liquidation problem studied by Almgren and Chriss [1], but differs in that there is no constraint placed on the portfolio holdings at the terminal time $T$ . Finally, under certain initial conditions the FI will seek to liquidate a large position, creating what has been called an asset fire sale. Our strategies extend to this setting and give natural criteria, similar to those discussed by Brown et al. [4], that determine the order in which different assets are liquidated.

2 Optimal Portfolio Strategies

This paper will investigate the investment strategies of a large financial institution (FI) with CARA risk preferences (CARA is short for constant absolute risk aversion) that trades continuously over a finite time horizon $[0,T]$ in a market with imperfect liquidity. This is similar to a problem studied in Zhang [19]. The changes caused by rebalancing the portfolio of a large FI may amount to a large fraction of the total daily volume traded in these assets and significantly impact their prices. It is well understood that this effect will lead the FI to break large orders into small portions spread over time to reduce market liquidity costs, while still aiming to rebalance its portfolio. By taking additional time to reduce liquidity costs, the FI faces additional uncertainty in the price of the assets. To handle this delicate balance between liquidity costs and price uncertainty, the FI will be inclined to consider utility optimization.

There are sound economic reasons to optimize using an exponential (CARA) utility function: It leads to a tractable time-consistent strategy where additional information does not provide additional utility, and is similar to the original Mean-Variance optimization of Almgren and Chriss [1]. Since the strategy is only implemented over $[0,T]$ , at time $T$ the FI will update its information and continue in a similar way to rebalance over the subsequent period. This rebalancing is necessary to account for shortcomings of the model, changes in the balance sheet, and unanticipated events that cause fundamental changes to the parameters of the price dynamics.

A number of simplifications will be assumed about this problem. The total information available to the FI up to any given instant of time $t$ is modelled by a filtration $\{{\cal F}_{t}\}_{t\geq 0}$ on a given probability space $(\Omega,{\cal F},\mathbb{P})$ . The market consists of one risk-free asset with zero interest rate, and $d$ risky assets whose true price process is $S_{t}=(S^{(1)}_{t},...,S^{(d)}_{t})^{\prime}$ and whose transaction price process is $\tilde{S}_{t}=(\tilde{S}^{(1)}_{t},...,\tilde{S}^{(d)}_{t})^{\prime}$ . Here and in the following, we adopt matrix notation where $M^{\prime}$ denotes the matrix transpose of $M$ . Let us denote the vector of the amounts held in risky assets by $(q_{u})_{u\in[t,T]}$ and the vector of trading rates of the large trader by $v_{u}:=\dot{q}_{u}:=dq_{u}/du,u\in[t,T]$ .

Like Almgren and Chriss [1] and others, we suppose that the price of risky assets follows a $d-$ dimensional Bachelier model with both linear permanent and linear temporary market impact (parametrized by $\Lambda$ and $\Gamma$ respectively):

	$\displaystyle dS_{t}$	$\displaystyle=$	$\displaystyle(\Lambda v_{t}+\mu)\ dt+\Sigma\ dB_{t}\ ,$
	$\displaystyle\tilde{S}_{t}$	$\displaystyle=$	$\displaystyle S_{t}+\Gamma v_{t}\ .$		(1)

Here, $B_{t}$ is a $d-$ dimensional Brownian motion and $\Sigma\in\mathbb{R}^{d\times d}$ is the volatility matrix. The drift term $\mu=b-d+\Lambda Q\in\mathbb{R}^{d}$ is assumed to be constant. It takes into account the trending rate $b$ , dividend rate $d$ and aggregated permanent market price impact due to external traders $Q$ .

A more general formulation of the model that does not require linear market impact is certainly possible, and would preserve many of the same basic properties. However, the assumption of linear impact leads to significantly more tractable optimal strategies. Moreover, as shown by Gatheral and Schied [9], so-called dynamic arbitrage is ruled out by choosing the permanent impact to be linear. It is further assumed that the permanent and temporary impact matrices $\Lambda$ and $\Gamma$ are symmetric and non-negative definite. The assumption that $\Gamma$ is symmetric is without loss of generality. On the other hand, $\Lambda$ is assumed to be symmetric not for economic reasons but for convenience: when it has an anti-symmetric part, a somewhat more complicated explicit solution is obtainable. Models similar to ours have been studied by Almgren and Lorenz [2], Gatheral and Schied [9] and Schied and Schöneborn [16]. We refer to the review by Hurd et al. [10] for further background and justification of these and other similar models.

2.1 The Merton Problem

Merton’s problem, introduced in Merton [12], aims to determine the strategies followed by utility optimizing investors in continuous time market models. To this end, we now consider the most general portfolio strategy, or control process, that trades within the market impact model (1) over some time interval $[s,t]\subset\mathbb{R}_{+}$ . In our setting, each possible strategy will be simply a $d-$ dimensional trading rate process $v=(v_{u})_{u\in[s,t]}$ that is adapted to the information filtration $\{{\cal F}_{t}\}$ : We denote the set of such admissible strategies by $\Pi^{ad}[s,t]$ . The subclass of deterministic strategies where each value $v_{u},u\in[s,t]$ is $\mathcal{F}_{s}$ measurable is denoted by $\Pi^{det}[s,t]$ .

Given any control process $v\in\Pi^{ad}[0,s]$ for $0<t<s$ , the cash net of debt owed $C_{t}:=C^{v}_{t}$ and marked-to-market equity, or assets net of debt owed, $X_{t}:=X^{v}_{t}:=C_{t}+q_{t}^{\prime}S_{t}$ are given by:

	$\displaystyle C_{t}$	$\displaystyle=$	$\displaystyle C_{0}-\int_{0}^{t}v_{u}^{\prime}\tilde{S}_{u}\ du=C_{0}-\int_{0}^{t}v_{u}^{\prime}S_{u}\ du-\int_{0}^{t}v_{u}^{\prime}\Gamma v_{u}\ du\ ,$		(2)
	$\displaystyle X_{t}$	$\displaystyle=$	$\displaystyle X_{0}+\int_{0}^{t}q_{u}^{\prime}\ dS_{u}-\int_{0}^{t}v_{u}^{\prime}\Gamma v_{u}\ du\ ,$		(3)

where the second equation is obtained by integration by parts. Note that here and henceforth, the superscript $v$ that labels processes controlled by $v$ will be omitted.

The interpretation of (2) and (3) in terms of the firm’s balance sheet is that assets are stochastic due to fluctuations of $S$ , while the debt, thought of as deposits, is assumed to be constant and sufficient to fund all trades. In other words, we focus on market illiquidity without funding illiquidity. It is consistent with the Principle of Limited Liability that a firm becomes insolvent when its equity $X_{t}$ becomes negative. In the following, an insolvent firm with negative equity $X_{T}<0$ at a time $T$ , will be declared to be in default, implying that the laws of bankruptcy will be applied to the firm.

The FI can now try to solve Merton’s optimal problem of a CARA investor with constant absolute risk aversion parameter $\lambda>0$ over any period $[t,T]$ . For each $t$ , they may express the value function $J_{t}$ achieved in terms of a certainty equivalent value $W_{t}$ ,

J_{t}:=-e^{-\lambda W_{t}}:=\text{sup}_{v\in\Pi^{ad}[t,T]}\ \left(-\mathbb{E}[e^{-\lambda X_{T}}|{\cal F}_{t}]\right)\ .

(4)

If the supremum exists, it is achieved by adopting an optimal control denoted by $v^{*}(t)=(v_{u}^{*}(t))_{u\in[t,T]}$ , which will be an adapted process over $[t,T]$ . The CARA investment problem in general always satisfies the dynamic programming principle (see Schied et al. [17]), which means that for any $s\leq t\leq T$ , $v_{u}^{*}(t)=v_{u}^{*}(s)$ for all $u\geq t$ and

-e^{-\lambda W_{s}}=\text{sup}_{v\in\Pi^{ad}[s,t]}\ \left(-\mathbb{E}[e^{-\lambda W_{t}}|{\cal F}_{s}]\right)\ .

(5)

An investor restricted to deterministic strategies over $[t,T]$ cannot achieve a higher certainty equivalent value than equation (4). Therefore, if $\widetilde{W}_{t}$ is defined by

-e^{-\lambda\widetilde{W}_{t}}:=\text{sup}_{v\in\Pi^{det}[t,T]}\ \left(-\mathbb{E}[e^{-\lambda X_{T}}|{\cal F}_{t}]\right)

(6)

then $\widetilde{W}_{t}\leq W_{t}$ . The first result of this paper, stated next, is that (4) is always optimized by deterministic strategies and therefore $W_{t}=\widetilde{W}_{t}$ for $t\geq 0$ . Moreover, it will be found in subsequent sections that the optimal control and value functions can be expressed in closed forms involving one-dimensional integrals that solve a system of ordinary differential equations of Riccati type. First, however, a note about notation: Because $v^{*}$ and $q^{*}$ turn out to be deterministic, we henceforth replace the stochastic process notation $v_{u}$ by function notation $v(u)$ and moreover suppress the dependence on the investment period $[t,T]$ .

Theorem 2.1.

Under the above modelling assumptions, there is a (possibly infinite) maximal time $T^{*}\in\mathbb{R}_{+}\cup\{\infty\}$ such that for any finite time horizon $[t,T]$ with $0\leq t\leq T\leq T^{*}$ :

1.

The optimal strategy $v^{*}(u),u\in[t,T]$ exists, is unique and ${\cal F}_{t}$ measurable, hence deterministic.
2.

The value function $\widetilde{W}_{t}$ achieved over $[t,T]$ , when restricted to deterministic strategies, equals $W_{t}$ .

3.

The value function has the form $W_{t}=X_{t}+V(T-t,q)$ where $V(\tau,q),\tau=T-t$ solves the non-linear partial differential equation

-\partial_{\tau}V+q^{\prime}\mu-\frac{\lambda}{2}q^{\prime}\Sigma\Sigma^{\prime}q+\frac{1}{4}(\Lambda q+\partial_{q}V)^{\prime}\Gamma^{-1}(\Lambda q+\partial_{q}V)=0,\quad V(0,q)=0

(7)

on the domain $[0,T]\times\mathbb{R}^{d}$ .

4.

Given initial holdings $q$ at time $t$ , the optimal portfolio holdings $q^{*}(u)$ for $u\in[t,T]$ solves the system of ODEs:

$\displaystyle\frac{dq}{du}=\frac{\Gamma^{-1}}{2}\bigl{(}\partial_{q}V(T-u,q)^{\prime}+\Lambda q\bigr{)}\ ,\quad q(t)=q\ .$ (8)

The proof of this theorem is found in the Appendix. As we shall see in Section 3, $V(\tau,q)$ is a quadratic form in $q$ with time-dependent coefficients and thus the ODE (8) for $q^{*}$ is linear and can be solved explicitly.

2.2 Mean, Variance, Probability of Default and Time Consistency

From equations (2) and (3) we can deduce that, if $v$ is deterministic, then for any $0\leq s\leq t\leq T$ , the equity $X_{t}$ conditioned on ${\cal F}_{s}$ is normally distributed with mean and variance given by

	$\displaystyle\mathbb{E}[X_{t}|{\cal F}_{s}]$	$\displaystyle=$	$\displaystyle X_{s}+\int_{s}^{t}\Bigl{(}q^{\prime}(u)(\Lambda v(u)+\mu)-v^{\prime}(u)\Gamma v(u)\Bigr{)}\ du,$		(9)
	$\displaystyle{\mbox{Var}}[X_{t}|{\cal F}_{s}]$	$\displaystyle=$	$\displaystyle\int_{s}^{t}\ q^{\prime}(u)\Sigma\Sigma^{\prime}q(u)\ du\ .$		(10)

In particular, the fact that $X_{T}|{\cal F}_{t}$ is always normal implies that

\mathbb{E}[e^{-\lambda X_{T}}|{\cal F}_{t}]=e^{-\lambda\left(\mathbb{E}[X_{T}|{\cal F}_{t}]-\frac{\lambda}{2}{\mbox{Var}}[X_{T}|{\cal F}_{t}]\right)}

(11)

and hence from (6) and Theorem 2.1 one deduces that

W_{t}=\widetilde{W}_{t}=\text{sup}_{v\in\Pi^{det}[t,T]}\left(\mathbb{E}[X_{T}|{\cal F}_{t}]-\frac{\lambda}{2}{\mbox{Var}}[X_{T}|{\cal F}_{t}]\right)\ .

(12)

This demonstrates the well-known equality of the certainty equivalent value for CARA optimization with the value function for Markowitz’ mean-variance (M-V) optimization, as well as the coincidence of their optimal strategies, when the optimal equity processes under consideration are all normally distributed.
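Identity (11) is the moment generating function of a normal random variable, so it can be confirmed numerically. The following Python sketch compares the closed form with a Simpson-rule quadrature. The function names and the sample values of the mean, variance and risk aversion are illustrative assumptions, not quantities taken from the model.

```python
import math

def cara_expectation_closed(mean, var, risk_av):
    """Right-hand side of (11): exp(-lambda * (mean - lambda/2 * var))."""
    return math.exp(-risk_av * (mean - 0.5 * risk_av * var))

def cara_expectation_numeric(mean, var, risk_av, n=4001, width=10.0):
    """Simpson-rule approximation of E[exp(-lambda X)] for X ~ N(mean, var).

    n must be odd; the integral is truncated at mean +/- width standard deviations.
    """
    sd = math.sqrt(var)
    lo, hi = mean - width * sd, mean + width * sd
    h = (hi - lo) / (n - 1)
    total = 0.0
    for i in range(n):
        x = lo + i * h
        weight = 1 if i in (0, n - 1) else (4 if i % 2 else 2)
        pdf = math.exp(-0.5 * ((x - mean) / sd) ** 2) / (sd * math.sqrt(2.0 * math.pi))
        total += weight * math.exp(-risk_av * x) * pdf
    return total * h / 3.0

# Hypothetical values chosen only for illustration.
closed = cara_expectation_closed(1.0, 0.25, 2.0)
numeric = cara_expectation_numeric(1.0, 0.25, 2.0)
print(closed, numeric)
```

Because the right-hand side depends on the strategy only through the mean and the variance, maximizing expected CARA utility over deterministic strategies is the same as maximizing the mean-variance objective in (12).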

In practice, the firm’s default probability (DP), meaning the probability that $X_{T}<0$ , may be preferable to variance as a risk measure for institutional investors, as it gives more information about bad scenarios that need to be controlled. In the Bachelier model, the normality that follows for deterministic strategies implies that over any time horizon $[t,T]$ , the Mean-Variance (M-V) criterion

Problem M-V

	$\displaystyle W_{V}(t,T,q,x,E)$	$\displaystyle:=$	$\displaystyle\text{min}_{v\in\Pi^{det}[t,T]}\ \text{Var}[X_{T}|{\cal F}_{t}]$
			$\displaystyle\text{subject to }\mathbb{E}[X_{T}|{\cal F}_{t}]=E\ ,$

and the Mean-Default Probability (M-DP) criterion

Problem M-DP

	$\displaystyle W_{DP}(t,T,q,x,E)$	$\displaystyle:=$	$\displaystyle\text{min}_{v\in\Pi^{det}[t,T]}\ \mathbb{P}[X_{T}<0|{\cal F}_{t}]$
			$\displaystyle\text{subject to }\mathbb{E}[X_{T}|{\cal F}_{t}]=E\ ,$

are both solved by the same optimal trading strategy when $\mathbb{E}[X_{T}|{\cal F}_{t}]=E>0$ . This is because $\mathbb{P}[X_{T}<0|{\cal F}_{t}]$ is strictly increasing in $\text{Var}[X_{T}|{\cal F}_{t}]$ as long as $\mathbb{E}[X_{T}|{\cal F}_{t}]>0$ is fixed. Moreover, if $v^{*}(\lambda)$ denotes the optimizer of (12), and $X^{*}_{T}(\lambda)$ is the optimal equity it achieves, then $v^{*}(\lambda)$ also optimizes Problem M-V provided $E=E(\lambda):=\mathbb{E}[X^{*}_{T}(\lambda)|{\cal F}_{t}]$ , and also Problem M-DP if in addition $E(\lambda)>0$ .

Since Merton’s optimal problem (4) satisfies Bellman’s Dynamic Programming Principle at all times, its optimal strategies are “time-consistent”, which means that the optimal strategies computed for any two periods $[t,T]$ and $[s,T]$ always coincide on the intersection $[s\vee t,T]$ . On the other hand, it is known that mean-variance optimization is generally time-inconsistent, and optimal adapted strategies starting at one time do not usually appear optimal at a later time. Surprisingly, Theorem 2.1 combined with equation (12) implies that in the present context, both Problem M-V and Problem M-DP are in fact time-consistent, provided the optimization is restricted to deterministic strategies. The following result summarizes these relationships.

Corollary 2.1.1.

For any fixed time horizon $[t,T]$ , let $E(\lambda)=\mathbb{E}[X^{*}_{T}(\lambda)|{\cal F}_{t}]$ be the expected value of equity computed for the optimal strategy $v^{*}(\lambda)$ of the CARA investment problem (12) with risk aversion parameter $\lambda$ . Let $\underline{E}$ and $\overline{E}$ be the infimum and supremum of $E(\lambda)$ when $\lambda$ varies over $[0,\infty)$ . Then:

1.

For any $E\in(\underline{E},\overline{E}),$ there exists a unique $\lambda=\lambda(E)$ such that $E(\lambda)=E$ .
2.

For all possible values of $E(\lambda)$ , the deterministic optimal strategies $v^{*}$ of Problem M-V coincide with the unique adapted optimal strategy $v^{*}(\lambda)$ of (12).
3.

The optimal strategies computed for any two periods $[t,T]$ and $[s,T]$ are time-consistent, meaning they coincide on the intersection $[s\vee t,T]$ .
4.

If $E(\lambda)>0$ , the deterministic optimal strategies $v^{*}$ of Problem M-V and Problem M-DP also coincide with each other.

3 Explicit Optimal Strategies

We now exploit the tractability of Merton’s problem in the market impact setting to obtain closed formulas (involving matrix algebra and one-dimensional integration) for the optimal trading curve of the financial institution (FI). The techniques invoked in this section are closely related to methods developed for the optimal liquidation problem.

Proposition 3.1.

Under the modelling assumptions of Theorem 2.1, for any finite $T$ with $T\leq T^{*}$ , the value function $W_{t}:=W(t,T,q,x)=x+V(\tau,q),\tau=T-t$ for any $t\in[0,T]$ has the form

V(\tau,q)=q^{\prime}A(\tau)q+B(\tau)q+C(\tau)

(15)

where $A,B,C$ are matrix valued functions of dimension $[d,d]$ , $[1,d]$ and $[1,1]$ respectively with $A$ symmetric. These functions satisfy Riccati-type ODEs for $\tau>0$ :

$\displaystyle\frac{\partial A}{\partial\tau}$	$\displaystyle-$	$\displaystyle(A+\Lambda/2)^{\prime}\Gamma^{-1}(A+\Lambda/2)+\frac{\lambda}{2}\Sigma\Sigma^{\prime}=0,\quad A(0)=0,$	(16)
$\displaystyle\frac{\partial B}{\partial\tau}$	$\displaystyle-$	$\displaystyle B\Gamma^{-1}(A+\Lambda/2)-\mu^{\prime}=0,\quad B(0)=0,$	(17)
$\displaystyle\frac{\partial C}{\partial\tau}$	$\displaystyle-$	$\displaystyle\frac{1}{4}B\Gamma^{-1}B^{\prime}=0,\quad C(0)=0.$	(18)
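The Riccati system can be checked by direct substitution. The following sketch uses only the ansatz (15), the PDE (7) and the symmetry of $A$ , $\Lambda$ and $\Gamma$ ; a dot denotes $\partial/\partial\tau$ , which is shorthand introduced here for the display only.

```latex
% Gradient of the quadratic ansatz V = q'Aq + Bq + C:
\partial_{q}V = 2Aq + B^{\prime}, \qquad
\Lambda q + \partial_{q}V = 2\,(A+\Lambda/2)\,q + B^{\prime}.

% Expansion of the quadratic term in (7):
\tfrac{1}{4}(\Lambda q+\partial_{q}V)^{\prime}\Gamma^{-1}(\Lambda q+\partial_{q}V)
  = q^{\prime}(A+\Lambda/2)^{\prime}\Gamma^{-1}(A+\Lambda/2)\,q
  + B\,\Gamma^{-1}(A+\Lambda/2)\,q
  + \tfrac{1}{4}B\,\Gamma^{-1}B^{\prime}.

% Inserting into (7) and matching the terms of order q'(.)q, (.)q and 1:
\dot{A} = (A+\Lambda/2)^{\prime}\Gamma^{-1}(A+\Lambda/2) - \tfrac{\lambda}{2}\Sigma\Sigma^{\prime}, \qquad
\dot{B} = B\,\Gamma^{-1}(A+\Lambda/2) + \mu^{\prime}, \qquad
\dot{C} = \tfrac{1}{4}B\,\Gamma^{-1}B^{\prime}.
```

These are (16), (17) and (18) respectively, and the initial conditions $A(0)=0$ , $B(0)=0$ , $C(0)=0$ follow from $V(0,q)=0$ .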

The next theorem, whose proof is given in Appendix A, solves this system of Riccati equations in closed form in terms of $E:=\Gamma^{-1/2}(\Lambda/2)\Gamma^{-1/2}$ and the symmetric square root $D$ of

D^{2}:=\frac{\lambda}{2}\Gamma^{-1/2}\Sigma\Sigma^{\prime}\Gamma^{-1/2}\ .

It also provides a closed form for the optimal strategy $q^{*}$ .

Theorem 3.2.

1.

The solution of the system of Riccati equations (16)–(18) over the maximal interval $[0,T^{*}]$ is given by

	$\displaystyle A(\tau)=\Gamma^{\frac{1}{2}}\Bigl{(}V(\tau)U(\tau)^{-1}-E\Bigr{)}\Gamma^{\frac{1}{2}}$		(19)
	$\displaystyle B(\tau)=\overline{\mu}^{\prime}\bigl{(}E-V(\tau)\bigr{)}U^{-1}(\tau)\Gamma^{\frac{1}{2}}$		(20)
	$\displaystyle C(\tau)=\frac{1}{4}\overline{\mu}^{\prime}\left(\int_{0}^{\tau}\bigl{(}E-V(s)\bigr{)}\bigl{(}U^{\prime}(s)U(s)\bigr{)}^{-1}(E-V^{\prime}(s))ds\right)\overline{\mu}$		(21)

where $\overline{\mu}:=D^{-2}\Gamma^{-1/2}\mu$ and the matrix valued functions $U,V$ are given by

	$\displaystyle U(\tau)$		$\displaystyle=\ {\cosh}(D\tau)-D^{-1}\ {\sinh}(D\tau)E$		(22)
	$\displaystyle V(\tau)$		$\displaystyle=-\ {\sinh}(D\tau)D+\ {\cosh}(D\tau)E.$		(23)

2.

The maximal time horizon $T^{*}$ is

$T^{*}=\ {\inf}\{\tau>0:U(\tau)\mbox{ is not invertible}\}\ .$

$T^{*}$ is finite if $D<E$ and $\infty$ if $D>E$ .

3.

For any $(t,T,q,x),$ the optimal trading curve $q^{*}(u)$ over the period $[t,T]$ is

	$\displaystyle q^{*}(u)$	$\displaystyle=$	$\displaystyle\Gamma^{-1/2}U(T-u)U^{-1}(T-t)\Gamma^{1/2}q$
			$\displaystyle+\frac{1}{2}\Gamma^{-1/2}U(T-u)\int^{u}_{t}U^{-1}(T-r)\Gamma^{-1/2}B^{\prime}(T-r)dr$		(25)

4.

For any $(t,T,q,x),$ the expected value and variance of the optimal terminal equity are:

$\displaystyle\mathbb{E}[X_{T}^{*}(\lambda)|{\cal F}_{t}]$	$\displaystyle=$	$\displaystyle x+q^{\prime}\Bigl{(}A(T-t)-\frac{\lambda}{2}L(T-t)\Bigr{)}q$	(26)
		$\displaystyle+\ \Bigl{(}B(T-t)-\frac{\lambda}{2}M(T-t)\Bigr{)}q+C(T-t)-\frac{\lambda}{2}N(T-t)\ ,$
$\displaystyle{\mbox{Var}}[X_{T}^{*}(\lambda)|{\cal F}_{t}]$	$\displaystyle=$	$\displaystyle q^{\prime}L(T-t)q+M(T-t)q+N(T-t)\ ,$	(27)

where formulas for $L,M,N$ are given in Appendix A.

In the special case when $D$ and $E$ are commuting matrices, these formulas decouple into $d$ one-dimensional problems, each of which is similar to the single risky asset case we next discuss.

3.1 The Case of a Single Risky Asset

In the single risky asset case, one can verify that the scalar functions $A,B,C$ and the optimal trading strategy $q^{*}(u)$ have comparatively simple formulas obtained by reducing those given in Theorem 3.2. Notice that several distinct possibilities are determined by the relation between $D=\Sigma\sqrt{\frac{\lambda}{2\Gamma}}$ and $E=\frac{\Lambda}{2\Gamma}.$

Proposition 3.3.

In the single asset case,

When $D>E$ , or equivalently $2\lambda\Gamma\Sigma^{2}>\Lambda^{2}$ , let $K=\tanh^{-1}(E/D)$ . Then $U(\tau)=\frac{\cosh(D\tau-K)}{\cosh K}$ , and the formulas can be rewritten in terms of hyperbolic functions as follows:

$\displaystyle A(\tau)$	$\displaystyle=$	$\displaystyle-\Gamma D\left[\tanh(D\tau-K)+\tanh K\right]$	(28)
$\displaystyle B(\tau)$	$\displaystyle=$	$\displaystyle\frac{\mu}{D}\left(\frac{\sinh K}{\cosh(D\tau-K)}+\tanh(D\tau-K)\right)$	(29)
$\displaystyle C(\tau)$	$\displaystyle=$	$\displaystyle\frac{\mu^{2}}{4\Gamma D^{3}}\left[(\sinh^{2}K-1)(\tanh(D\tau-K)+\tanh K)+D\tau\right]$	(30)
	$\displaystyle+$	$\displaystyle\frac{\mu^{2}}{2\Gamma D^{3}}\left(\tanh K-\frac{\sinh K}{\cosh(D\tau-K)}\right).$	(31)

The optimal trading strategy is given by

	$\displaystyle q^{*}(u)$		$\displaystyle=\frac{\cosh(D\tau_{u}-K)}{\cosh(D\tau_{t}-K)}q+\frac{\mu}{2\Gamma D^{2}}\left(1-\frac{\cosh(D\tau_{u}-K)}{\cosh(D\tau_{t}-K)}\right)$
			$\displaystyle+\frac{\mu\sinh K\cosh(D\tau_{u}-K)}{2\Gamma D^{2}}\left(\tanh(D\tau_{t}-K)-\tanh(D\tau_{u}-K)\right),$		(33)

where $\tau_{s}=T-s.$

When $D=E$ or $2\lambda\Gamma\Sigma^{2}=\Lambda^{2}$

$\displaystyle A(\tau)$	$\displaystyle=$	$\displaystyle 0$	(34)
$\displaystyle B(\tau)$	$\displaystyle=$	$\displaystyle\frac{\mu}{D}\left(e^{D\tau}-1\right)$	(35)
$\displaystyle C(\tau)$	$\displaystyle=$	$\displaystyle\frac{\mu^{2}}{2D\lambda\Sigma^{2}}\left[\frac{1}{2}e^{2D\tau}-2e^{D\tau}+D\tau+\frac{3}{2}\right].$	(36)

When $D<E$ , or equivalently $0<2\lambda\Gamma\Sigma^{2}<\Lambda^{2}$ , let $K=\coth^{-1}(E/D)$ . Then $U(\tau)=-\frac{\sinh(D\tau-K)}{\sinh K}$ , and the formulas can be rewritten in terms of hyperbolic functions as follows:

$\displaystyle A(\tau)$	$\displaystyle=$	$\displaystyle-\Gamma D\left[\coth(D\tau-K)+\coth K\right]$	(37)
$\displaystyle B(\tau)$	$\displaystyle=$	$\displaystyle\frac{\mu}{D}\left(\frac{-\cosh K}{\sinh(D\tau-K)}+\coth(D\tau-K)\right)$	(38)
$\displaystyle C(\tau)$	$\displaystyle=$	$\displaystyle-\frac{\mu^{2}}{4\Gamma D^{3}}\left[(\cosh^{2}K+1)(\coth(D\tau-K)+\coth K)-D\tau\right]$	(39)
	$\displaystyle+$	$\displaystyle\frac{\mu^{2}}{2\Gamma D^{3}}\left(\coth K+\frac{\cosh K}{\sinh(D\tau-K)}\right).$	(40)

The optimal trading strategy is given by

	$\displaystyle q^{*}(u)$		$\displaystyle=\frac{\sinh(D\tau_{u}-K)}{\sinh(D\tau_{t}-K)}q+\frac{\mu}{2\Gamma D^{2}}\left(1-\frac{\sinh(D\tau_{u}-K)}{\sinh(D\tau_{t}-K)}\right)$
			$\displaystyle-\frac{\mu\cosh K\sinh(D\tau_{u}-K)}{2\Gamma D^{2}}\left(\coth(D\tau_{u}-K)-\coth(D\tau_{t}-K)\right),$		(42)

where $\tau_{s}=T-s.$

When $\lambda=0$ , we have $U(\tau)=1-E\tau$ and $V(\tau)=E.$ Moreover

$\displaystyle A(\tau)$	$\displaystyle=\frac{\Lambda}{2}(\frac{1-U(\tau)}{U(\tau)})$	(43)
$\displaystyle B(\tau)$	$\displaystyle=\frac{\mu\Gamma}{\Lambda}(\frac{1-U(\tau)^{2}}{U(\tau)})$	(44)
$\displaystyle C(\tau)$	$\displaystyle=\frac{\mu^{2}\Gamma^{2}}{6\Lambda^{3}}\left(\frac{-U(\tau)^{4}+6U(\tau)^{2}-8U(\tau)+3}{U(\tau)}\right).$	(45)

The optimal trading strategy is given by

\displaystyle q(u)^{*}=U(\tau_{u})\left(\frac{q}{U(\tau_{t})}+\frac{\mu}{4\Gamma E^{2}}\left(U(\tau_{t})+\frac{1}{U(\tau_{t})}-U(\tau_{u})-\frac{1}{U(\tau_{u})}\right)\right).

(46)

The third and fourth cases are the cases where $T^{*}<\infty$ , and one finds the solutions become unbounded: $\lim_{\tau\rightarrow T^{*}}A(\tau)=\infty$ . In cases 1 and 2, the solutions are bounded for all $\tau$ , and $T^{*}=\infty$ .
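As a sanity check on the first case ( $D>E$ ), the closed forms (28) and (29) can be compared with a direct numerical integration of the scalar Riccati equations (16) and (17). The Python sketch below does this with a classical fourth-order Runge-Kutta scheme. The parameter values are dimensionless illustrative assumptions chosen so that $D>E$ ; they are not the calibrated values used later in the paper.

```python
import math

# Illustrative single-asset parameters (assumptions, chosen so that D > E).
GAMMA = 1.0          # temporary impact
LAMBDA = 0.5         # permanent impact
SIGMA = 1.0          # volatility
RISK_AVERSION = 2.0  # CARA parameter lambda
MU = 0.3             # drift

D = SIGMA * math.sqrt(RISK_AVERSION / (2.0 * GAMMA))
E = LAMBDA / (2.0 * GAMMA)
K = math.atanh(E / D)  # only defined when D > E

def a_closed(tau):
    """Closed form (28)."""
    return -GAMMA * D * (math.tanh(D * tau - K) + math.tanh(K))

def b_closed(tau):
    """Closed form (29)."""
    x = D * tau - K
    return (MU / D) * (math.sinh(K) / math.cosh(x) + math.tanh(x))

def riccati_rhs(a, b):
    """Scalar right-hand sides of (16) and (17), solved for dA/dtau and dB/dtau."""
    m = a + LAMBDA / 2.0
    return m * m / GAMMA - 0.5 * RISK_AVERSION * SIGMA ** 2, b * m / GAMMA + MU

def integrate_riccati(tau_end, steps=2000):
    """Integrate (16)-(17) from A(0) = B(0) = 0 with classical RK4."""
    h = tau_end / steps
    a, b = 0.0, 0.0
    for _ in range(steps):
        k1a, k1b = riccati_rhs(a, b)
        k2a, k2b = riccati_rhs(a + 0.5 * h * k1a, b + 0.5 * h * k1b)
        k3a, k3b = riccati_rhs(a + 0.5 * h * k2a, b + 0.5 * h * k2b)
        k4a, k4b = riccati_rhs(a + h * k3a, b + h * k3b)
        a += h * (k1a + 2.0 * k2a + 2.0 * k3a + k4a) / 6.0
        b += h * (k1b + 2.0 * k2b + 2.0 * k3b + k4b) / 6.0
    return a, b

a_num, b_num = integrate_riccati(1.5)
print(abs(a_num - a_closed(1.5)), abs(b_num - b_closed(1.5)))
```

The same comparison could be repeated for $C(\tau)$ by appending the right-hand side of (18) to the integrator.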

3.2 Small Perturbations from Merton’s Solution

In his original paper Merton [12], Robert Merton presented the exact solution to the problem of optimal investment in a frictionless market for an asset price that follows a geometric Brownian motion. His solution technique also leads to an exact solution of our present model in the limit of zero market impact, $\Lambda=0,\Gamma=0$ , which we will call the “Merton solution”. It is of some interest to consider the explicit general solution from the previous section as a perturbation of the Merton solution, and to investigate the nature of its convergence as market impact goes to zero. We suppose that $\Lambda=\epsilon\Lambda_{1},\Gamma=\epsilon\Gamma_{1}$ with small $\epsilon$ and denote by $W(t,T,x,q,\epsilon)$ the certainty equivalent value function with its dependence on $\epsilon$ . For simplicity, we confine our attention to the single asset case of the previous section.

The Merton solution over the period $[t,T]$ with initial conditions $X_{t}=x,q_{t}=q$ involves an instantaneous trade that incurs no trading cost, to the optimal value $q^{M}:=\frac{\mu}{\lambda\Sigma^{2}}$ . This portfolio is then held constant. One can show that this strategy achieves the certainty equivalent value function $W(t,T,x,q,0)=x+\frac{\mu^{2}(T-t)}{2\lambda\Sigma^{2}}$ which we note is independent of $q$ .

Now, for small $\epsilon$ , the general solution of our model is given by Case 1 of Proposition 3.3, which leads to the following perturbative expansion

W(t,T,x,q,\epsilon)=W(t,T,x,q,0)+\epsilon^{1/2}Q(q)+O(\epsilon),

(47)

where

Q(q):=-\Gamma_{1}D_{1}q^{2}+\frac{\mu}{D_{1}}q-\frac{\mu^{2}}{4\Gamma_{1}D_{1}^{3}}=-\Gamma_{1}D_{1}(q-q^{M})^{2}.

(48)

Here we define $D_{1}=\Sigma\sqrt{\frac{\lambda}{2\Gamma_{1}}}$ which does not depend on $\epsilon$ and we have $D=\epsilon^{-1/2}D_{1}\to\infty$ as $\epsilon\to 0.$ It is obvious that $E$ does not depend on $\epsilon$ either. Thus the value function of our problem converges to the value function of the Merton solution with rate of convergence $\epsilon^{1/2}.$

The optimal holding at the terminal time is given by

q_{T}^{*}=q^{M}+(q_{t}-q^{M})\frac{\cosh K}{\cosh(D\tau-K)}+Eq^{M}\frac{\tanh(D\tau-K)+\tanh K}{D(1-\tanh^{2}K)}.

(49)

Here $\tau:=T-t.$ It is straightforward that $\lim_{\epsilon\to 0}K=0,$ hence $\lim_{\epsilon\to 0}q_{T}^{*}=q^{M}.$

Let $\tilde{A}(\tau):=\frac{V(\tau)}{U(\tau)}=-D\ {\tanh}(D\tau-K)$ . The trading rate at the initial time $t$ is given by

$\displaystyle v_{t}$	$\displaystyle=\lim_{s\to t}\dot{q}_{s}$	(52)
	$\displaystyle=q_{t}\tilde{A}(\tau)+U(\tau)\frac{\mu}{2\Gamma D^{2}}[\frac{E(D^{2}-(\tilde{A}(\tau))^{2})}{D^{2}-E^{2}}-\frac{\tilde{A}(\tau)}{U(\tau)}]$
	$\displaystyle=-(q_{t}-q^{M})D\tanh(D\tau-K)+\frac{\mu E\cosh K}{2\Gamma_{1}D_{1}^{2}\cosh(D\tau-K)}.$

Since $\lim_{\epsilon\to 0}{\tanh}(D\tau-K)=1$ and $\lim_{\epsilon\to 0}{\cosh}(D\tau-K)=\infty$ while $D\to\infty$ , we have $\lim_{\epsilon\to 0}v_{t}=-\infty$ if $q_{t}>q^{M}$ and $\lim_{\epsilon\to 0}v_{t}=+\infty$ if $q_{t}<q^{M}$ , i.e. the optimal strategy is to trade rapidly towards $q^{M}$ in the beginning. We then conclude that the optimal trajectory $q^{*}(u,\epsilon)$ converges to an $L$ -shaped or $\Gamma$ -shaped curve when the market impact tends to zero.

This result implies that when market impact is low, the firm will follow an optimal trading strategy very close to the constant holding strategy of the Merton problem. A more surprising fact is that a portfolio which starts at the Merton portfolio will remain constant if the permanent impact has $\Lambda_{1}=0$ , and that all strategies, regardless of the initial portfolio, will initially move towards the Merton portfolio for some time. In the following subsection, we will show similar results for the Multi-Asset case.
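The attraction towards the Merton portfolio can be illustrated numerically. The Python sketch below integrates the single-asset version of the ODE (8), $\dot{q}=\bigl((A+\Lambda/2)q+B/2\bigr)/\Gamma$ , with $A$ and $B$ taken from (28) and (29), for two values of the scaling parameter $\epsilon$ . The function name `optimal_path`, the Runge-Kutta integrator and all parameter values are illustrative assumptions, and the sketch only covers the case $D>E$ .

```python
import math

def optimal_path(q0, horizon, eps, n_steps=2000,
                 gamma1=1.0, lam1=0.5, sigma=1.0, risk_av=2.0, mu=0.3):
    """Holdings q*(u) on a uniform grid of [0, horizon], from ODE (8) with RK4.

    Impact is scaled as Gamma = eps * gamma1, Lambda = eps * lam1.
    All default parameter values are hypothetical.
    """
    gamma, lam_perm = eps * gamma1, eps * lam1
    D = sigma * math.sqrt(risk_av / (2.0 * gamma))
    E = lam_perm / (2.0 * gamma)
    if D <= E:
        raise ValueError("this sketch only covers the case D > E")
    K = math.atanh(E / D)

    def rate(u, q):
        x = D * (horizon - u) - K
        a = -gamma * D * (math.tanh(x) + math.tanh(K))            # (28)
        b = (mu / D) * (math.sinh(K) / math.cosh(x) + math.tanh(x))  # (29)
        return ((a + lam_perm / 2.0) * q + b / 2.0) / gamma       # scalar (8)

    h = horizon / n_steps
    q, path = q0, [q0]
    for i in range(n_steps):
        u = i * h
        k1 = rate(u, q)
        k2 = rate(u + 0.5 * h, q + 0.5 * h * k1)
        k3 = rate(u + 0.5 * h, q + 0.5 * h * k2)
        k4 = rate(u + h, q + h * k3)
        q += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        path.append(q)
    return path

q_merton = 0.3 / (2.0 * 1.0 ** 2)          # q^M = mu / (lambda * Sigma^2)
path_low_impact = optimal_path(1.0, 1.0, eps=0.01)
path_high_impact = optimal_path(1.0, 1.0, eps=1.0)
mid = len(path_low_impact) // 2
print(path_low_impact[mid] - q_merton, path_high_impact[mid] - q_merton)
```

With these assumed values, the low-impact path is much closer to $q^{M}$ by mid-period than the high-impact path, which is the qualitative behaviour described above.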

4 Numerical Investigations

We now consider the investment behaviour of a hypothetical unregulated financial institution, such as a hedge fund or mutual fund. The firm trades a single risky asset, with initial price $S_{0}=\$100$ , in a market with a $0\%$ risk free rate of return. They use our CARA optimal investment model to trade over non-overlapping half-year trading periods: we focus here on the period $[0,T],\ T=1/2$ . The CARA risk aversion parameter $\lambda$ is chosen to be consistent with a target default probability of $1\%$ for each period. Thus the firm will trade aggressively to maximize their expected return with a quite high tolerance to the potential of default.

The calibrated parameters of the model given in Table 1 are taken to be fixed at the beginning of the period $t=0$ . Note that the firm uses the Bachelier model only for a short period, and expects to recalibrate at the beginning of each successive period. Since the risky asset is illiquid, there is market impact related to the velocity of trading and the total amount traded: these are assumed to give the temporary and permanent market impact parameter estimates $\Gamma=\$10^{-7}\text{years/ (units traded)}^{2}$ and $\Lambda=\$4\times 10^{-8}/\text{unit}$ .

Balance sheets for a small, medium and large firm will be considered, all with a risky asset-to-equity ratio of $4:1$. The initial stock holdings are $q_{0}=[50000,200000,800000]$, from which the initial firm equity and cash net of debt are determined to be $X_{0}=0.25\,q_{0}S_{0}$ and $C_{0}=-0.75\,q_{0}S_{0}$. In all three cases, as indicated above, the financial institution targets a fixed default probability under the optimal strategy. This is implemented by choosing the internal risk aversion parameter $\lambda$ so that $W_{DP}(0,T,q_{0},X_{0},E(\lambda))=0.01$. Note that even though the optimal strategy does not depend on $X_{0}$ for fixed $\lambda$, this specification of $\lambda$ depends on $X_{0}$. Thus firms that differ only in $X_{0}$ do adopt different investment strategies.
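The balance-sheet arithmetic above can be sketched in a few lines. The helper name `balance_sheet` and its arguments are illustrative only and do not appear in the paper; the sketch just applies the stated $4:1$ risky asset-to-equity ratio to the three initial holdings.

```python
# Hypothetical helper (not from the paper): balance sheet implied by a
# 4:1 risky-asset-to-equity ratio, i.e. equity = 25% of risky assets.
S0 = 100.0                               # initial stock price in dollars
holdings = [50_000, 200_000, 800_000]    # q0 for the small, medium, large firm

def balance_sheet(q0, s0=S0, equity_fraction=0.25):
    """Return (risky assets, equity X0, cash net of debt C0) in dollars."""
    assets = q0 * s0
    equity = equity_fraction * assets    # X0 = 0.25 * q0 * S0
    cash_net_of_debt = equity - assets   # C0 = X0 - q0 * S0 = -0.75 * q0 * S0
    return assets, equity, cash_net_of_debt

sheets = [balance_sheet(q) for q in holdings]
for q, (assets, equity, cash) in zip(holdings, sheets):
    print(f"q0={q:>7}: assets={assets:>12,.0f}  X0={equity:>12,.0f}  C0={cash:>13,.0f}")
```

The identity $X_{0}=q_{0}S_{0}+C_{0}$ holds by construction, so the negative $C_{0}$ is the net debt that funds three quarters of the risky position.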

Table 1: Benchmark Parameters

Calibrated Parameter	Model Parameter Value
Initial Stock Price	$S_{0}=\$100$
Trading Period	$[0,T],\ T=0.5$ year
$20\%$ Annualized Volatility	$\Sigma=\$20/\text{unit}/\sqrt{\text{year}}$
$4\%$ Annual Growth	$\mu=\$4/\text{unit}/\text{year}$
Temporary Market Impact	$\Gamma=\$10^{-7}\text{year/ (unit)}^{2}$
Permanent Market Impact	$\Lambda=\$4*10^{-8}/\text{unit}$
$\lambda$ such that Probability of Default $=0.01$	$\lambda$ varies
Initial Holdings	$q_{0}=[50000,200000,800000]$
Initial Cash Net of Debt	$C_{0}=-0.75q_{0}S_{0}$
Initial Equity	$X_{0}=0.25q_{0}S_{0}$

4.1 The Efficient Frontier

Figure 1 shows, for each of the three firms, how the expected rate of return on equity (ERR) and default probability (DP) of their CARA/MV/DP optimal strategies vary as $\lambda$ ranges over the set of feasible values $[0,\infty)$. These quantities are computed by the formulas

{\rm ERR}(\lambda)=\frac{1}{T}(\frac{E(\lambda)}{X_{0}}-1)\ ,\quad{\rm DP}(\lambda)=N\left(-\frac{E(\lambda)}{\sqrt{V(\lambda)}}\right)

(53)

where $N(\cdot)$ denotes the CDF of the standard normal and $E(\lambda),V(\lambda)$ are given by (26) and (27) . Such a graph is called an efficient frontier, and it summarizes the results a firm may achieve by adopting different possible risk aversion parameters.
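As an illustration of formula (53), the following sketch evaluates ERR and DP from a given terminal mean and variance. The function names and the numerical inputs `E_example` and `V_example` are placeholders chosen for the example; the paper's actual $E(\lambda)$ and $V(\lambda)$ come from (26) and (27), which are not reproduced here.

```python
import math

def normal_cdf(x):
    """CDF of the standard normal, N(x), via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def expected_rate_of_return(E, X0, T):
    """ERR = (E / X0 - 1) / T, as in (53)."""
    return (E / X0 - 1.0) / T

def default_probability(E, V):
    """DP = N(-E / sqrt(V)), as in (53)."""
    return normal_cdf(-E / math.sqrt(V))

# Illustrative inputs only (NOT computed from the paper's (26)-(27)):
X0, T = 5.0e6, 0.5          # medium firm equity and horizon
E_example = 5.3e6           # assumed mean of terminal wealth
V_example = (2.0e6) ** 2    # assumed variance of terminal wealth

err_example = expected_rate_of_return(E_example, X0, T)
dp_example = default_probability(E_example, V_example)
print(err_example, dp_example)
```

Tracing the pair (DP, ERR) while $\lambda$ sweeps its feasible range is what produces one curve of the efficient frontier.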

As explained earlier, the three firms each select the optimal investment strategy given by the value $\lambda$ that leads to ${\rm DP}(\lambda)=0.01$ : with the benchmark parameters given in Table 1, the three values they compute are $\lambda=[2.56\times 10^{-7},6.7\times 10^{-8},1.83\times 10^{-8}]$ . While Figure 1 suggests that, ceteris paribus, larger firms have a lower efficient frontier, this ordering can be made to reverse by increasing the permanent impact parameter.

Figure 1: The efficient frontier for three firms with parameters given in Table 1, showing their default probability and expected rate of return on equity, when adopting their optimal portfolio with risk aversion parameters $\lambda$ varying over $[0,\infty)$.

4.2 Properties of the Optimal Trading Curve

To better understand the properties of the optimal investment strategies that result from our method, we now investigate how the three hypothetical firms’ optimal trading in the single-asset case compares as important model parameters are varied away from the benchmark parameters of Table 1. Figures 2 and 3 summarize the results of four experiments, and show how the firms’ optimal trading strategies over the time period $[0,1/2]$ years change as the asset rate of return, asset volatility, temporary market impact and permanent market impact are varied one at a time. In each figure, the red curve denotes the benchmark parametrization, while the other two curves show the result as one specific parameter is varied upwards (blue curve) and downwards (green curve).

One point needs to be reiterated: for each choice of a set of parameters excluding the risk aversion parameter $\lambda$ , $\lambda$ is computed to ensure that the firm’s default probability (DP) is exactly 1%. Thus each curve in these figures corresponds to a different value of $\lambda$ .
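The calibration of $\lambda$ to a target default probability can be sketched as a one-dimensional root search. The sketch below is not the paper's computation: it replaces the true $E(\lambda),V(\lambda)$ by a frictionless proxy in which the firm simply holds $q=\mu/(\lambda\Sigma^{2})$ over the whole period (an assumption made here for illustration), and the names `dp_frictionless` and `calibrate_lambda` are hypothetical. It also assumes DP is continuous and decreasing in $\lambda$ on the bracket.

```python
import math

def normal_cdf(x):
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def dp_frictionless(lam, X0, mu=4.0, sigma=20.0, T=0.5):
    """ASSUMPTION: zero market impact, constant holding q = mu / (lam * sigma^2)."""
    q = mu / (lam * sigma ** 2)
    E = X0 + q * mu * T               # mean of terminal wealth
    V = q ** 2 * sigma ** 2 * T       # variance of terminal wealth
    return normal_cdf(-E / math.sqrt(V))

def calibrate_lambda(dp, target=0.01, lo=1e-10, hi=1e-4, iters=200):
    """Bisection on a log scale; requires dp(lo) > target > dp(hi)."""
    for _ in range(iters):
        mid = math.sqrt(lo * hi)      # geometric midpoint
        if dp(mid) > target:
            lo = mid                  # too risky: raise risk aversion
        else:
            hi = mid
    return math.sqrt(lo * hi)

# Medium firm: X0 = 0.25 * 200000 * 100
lam_medium = calibrate_lambda(lambda lam: dp_frictionless(lam, X0=5.0e6))
print(lam_medium)
```

Under this proxy the medium firm's calibrated value comes out at about $6.2\times 10^{-8}$, the same order of magnitude as the benchmark value quoted in Section 4.1; the difference is expected, since the proxy ignores market impact and the initial position.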

The effect on the optimal strategy of varying the asset rate of return $\mu$ and volatility $\Sigma$ is shown in Figure 2. It is not a surprise to observe that the optimal strategy will include more of the risky asset as the rate of return is raised, or as the volatility is lowered. There is a threshold value of $\Sigma$ below which the firm switches from sell strategies to buy strategies. Although not shown in the graph, one finds the reverse is the case for $\mu$. Finally, the velocity of selling strategies seems to retain a similar shape over time under these variations. Each of these observations is borne out by more extensive investigations of the dependence on these parameters.

In Figure 3a, the main effect of decreasing temporary impact $\Gamma$ is seen to be to move more quickly to the final holding level early in the period. This can be understood as a change in the optimal balance between reducing temporary impact costs and price uncertainty due to the asset volatility. To a lesser extent, one also sees in these examples that the level of the final holdings decreases slightly as $\Gamma$ increases.

The effect of permanent impact $\Lambda$ on the strategy is more subtle. From Figure 3b, a higher permanent impact parameter $\Lambda$ leads to an optimal strategy ending with a higher holding level. It also causes more curvature in the trading strategies, especially towards the closing time, where all trading curves seem to have positive slope. Indeed, directly from (8), the general formula for the trading velocity, one verifies that at the close of the period $\frac{dq}{du}\mid_{u=T}=\frac{1}{2}\Gamma^{-1}\Lambda\,q^{*}(T)$. This means that as long as $\Lambda$ is positive, every trader holding a long position, whether leveraging up or down, will always end the period by buying more shares. The reason is that permanent impact gives any trader a small opportunity to push the asset price in a favourable direction at the last moment. We call this the Ponzi property of our market impact model: the gains it implies cannot be converted to cash without bursting the small price bubble the trader has created.
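To give a sense of scale for this closing velocity, here is a worked evaluation with the benchmark impact parameters. It reads the terminal velocity as proportional to the final holding, $\frac{1}{2}\Gamma^{-1}\Lambda\,q^{*}(T)$, which is what the feedback control in the proof of Theorem 2.1 gives once $V(T,T,q)=0$ is imposed; the final holding of $150{,}000$ units is a made-up figure used only for illustration.

```latex
\frac{\Lambda}{2\Gamma}=\frac{4\times 10^{-8}}{2\times 10^{-7}}=0.2\ \text{per year},
\qquad
\left.\frac{dq}{du}\right|_{u=T}=0.2\times q^{*}(T)
\;\approx\;0.2\times 150{,}000=30{,}000\ \text{units per year}.
```

The rate is positive whenever $q^{*}(T)>0$ and scales linearly with $\Lambda/\Gamma$, so under this reading a larger permanent impact or a smaller temporary impact strengthens the end-of-period buying.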

4.3 Small Market Impact

The perturbative analysis of Section 3.2 provides an alternative framework for understanding the effect of permanent and temporary market impact. We investigate the middle-sized firm with $q_{0}=2\times 10^{5}$ and market impact parameters $\Gamma(\epsilon)=\epsilon\times 10^{-7},\ \Lambda(\epsilon)=\epsilon\times 4\times 10^{-8}$ for a sequence of values $\epsilon_{n}=10^{-n},n=0,1,\dots$ approaching zero. Figure 4(a) shows how the optimal strategies converge for $\epsilon\to 0$ to the constant Merton solution for $u\in(t,T)$ , but show rapid transient effects for $u$ near both endpoints. The small Ponzi effect near $u=T$ can be turned off by taking $\Lambda(\epsilon)=0$ , as shown in Figure 4(b).
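A back-of-the-envelope calculation suggests how fast the transient near $u=t$ should shrink along this sequence. It uses the scalar form of the quantity $D$ defined in the proof of Theorem 3.2, $D=\Sigma\sqrt{\lambda/(2\Gamma)}$, and it assumes that $\lambda$ is held at the medium firm's benchmark value $6.7\times 10^{-8}$ for every $\epsilon$, which the text does not state explicitly.

```latex
D(\epsilon)=\Sigma\sqrt{\frac{\lambda}{2\,\Gamma(\epsilon)}}
=20\sqrt{\frac{6.7\times 10^{-8}}{2\,\epsilon\times 10^{-7}}}
\approx 11.6\,\epsilon^{-1/2}\ \text{per year},
\qquad
\frac{1}{D(\epsilon_{n})}\approx 0.086\times 10^{-n/2}\ \text{years},
\qquad
E(\epsilon)=\frac{\Lambda(\epsilon)}{2\,\Gamma(\epsilon)}=0.2\ \text{per year}.
```

Under these assumptions the relaxation time $1/D$ is roughly one month at $\epsilon=1$ and falls by a factor of $\sqrt{10}$ with each step in $n$, while $E$ is unchanged because $\Gamma$ and $\Lambda$ are scaled together.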

These figures suggest that for reasonable parameter values and small market impact, our model will deliver strategies that are effectively similar to the Merton solution. The observed relationship between the optimal strategies and the Merton solution, valid for small market impact, actually remains true for intermediate levels of market impact such as our benchmark parametrizations. One observes in Figures 2 and 3 that all strategies tend to flatten as $u$ approaches $T$, albeit with a small Ponzi effect at the end of the period. It will be well worth studying the extent to which the holding level at which the strategy flattens is well approximated by the Merton solution. As the market impact parameters decrease, the flat portion of the curve becomes wider, and closer to the Merton solution.

An analysis similar to that of Section 3.2 allows us to understand the multi-asset investment problem in the small market impact regime. In Figure 5, Asset 2 has the benchmark asset parameters, Asset 1 has the lower perturbed parameters from the previous experiments, and Asset 3 has the higher perturbed parameters. Figure 5(a) compares the uncorrelated case to the Merton solution, and Figure 5(b) compares the case of constant pairwise correlation $\rho=0.5$ to the Merton solution. In both cases we see behaviour similar to the one-asset case above. It should be noted that, unlike in the single-asset case, hedging strategies become available when multiple assets can be traded, and hence short selling of an illiquid asset class can be optimal.

We observe again in the multi-asset problem that when the market impact is small, the general optimal strategy is close to the Merton solution.

4.4 Bounded Optimal Trading Strategies

We have seen in Section 4.2 that in the single asset case, positive $\Lambda$ creates the Ponzi property that gives any trader an opportunity to push the price in their favour near the end of the period. Case 1 of Proposition 3.3 shows that as long as $\Lambda<\sqrt{2\lambda\Gamma}\Sigma$ , the optimal strategies computed over any finite period $[0,T]$ remain bounded. However, when $\Lambda>\sqrt{2\lambda\Gamma}\Sigma$ , Case 3 of Proposition 3.3 implies that for the period $[0,T^{*}]$ with $T^{*}=\tilde{K}/D$ , the optimal strategy $q^{*}(u)$ and the value function $W$ both blow up at $u=0$ .
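The boundedness condition can be checked numerically for the benchmark firms. In the sketch below, the function names are hypothetical, and the blow-up horizon is computed as the first zero of the scalar function $U(\tau)=\cosh(D\tau)-\sinh(D\tau)E/D$ from the appendix, with $D=\Sigma\sqrt{\lambda/(2\Gamma)}$ and $E=\Lambda/(2\Gamma)$. Identifying that zero with $T^{*}=\tilde{K}/D$, i.e. taking $\tilde{K}=\operatorname{artanh}(D/E)$, is an assumption, since Proposition 3.3 is not reproduced here.

```python
import math

def impact_threshold(lam, gamma, sigma):
    """Critical permanent impact sqrt(2 * lam * gamma) * sigma."""
    return math.sqrt(2.0 * lam * gamma) * sigma

def blow_up_horizon(lam, gamma, sigma, Lambda):
    """First zero of U(tau) = cosh(D tau) - sinh(D tau) * E / D (scalar case).

    Returns math.inf when E <= D, i.e. when Lambda is at or below the threshold.
    ASSUMPTION: this zero is the T* of the text.
    """
    D = sigma * math.sqrt(lam / (2.0 * gamma))
    E = Lambda / (2.0 * gamma)
    if E <= D:
        return math.inf
    return math.atanh(D / E) / D

# Benchmark parameters from Table 1 and the calibrated lambdas of Section 4.1
gamma, sigma, Lambda = 1e-7, 20.0, 4e-8
lambdas = [2.56e-7, 6.7e-8, 1.83e-8]

thresholds = [impact_threshold(lam, gamma, sigma) for lam in lambdas]
for lam, thr in zip(lambdas, thresholds):
    print(f"lambda={lam:.3g}: threshold={thr:.3g}, bounded={Lambda < thr}")

# A deliberately large, purely illustrative permanent impact for the medium firm
print(blow_up_horizon(6.7e-8, gamma, sigma, Lambda=1e-5))
```

For all three benchmark firms the threshold exceeds the benchmark $\Lambda$ by more than an order of magnitude, so the benchmark strategies fall in the bounded case.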

Similar possibilities arise in the multi-asset investment problem. As $\Lambda$ increases, eventually the matrix function $U(t)$ becomes singular for some finite $t=T^{*}$ . Again, one then finds that for the period $[0,T^{*}]$ , the optimal strategy $q^{*}(u)$ and the value function $W$ both blow up at $u=0$ .

5 Remarks and Conclusions

The three hypothetical financial institutions studied in Section 4 face a typical investment problem, namely to maximize their return on equity subject to an upper bound on the downside risk, which is defined here as the probability of default. We have presented an analytically tractable version of the optimal portfolio problem that can be justified in three different ways: as utility optimization, as mean-variance optimization and as mean-default probability optimization. Numerical evidence shows that the solutions generated by the method have desirable and interesting features. Perhaps most importantly, we have learned that these strategies closely track the classic Merton solution arising in the zero market impact model.

The three benchmark firms have efficient frontiers shown in Figure 1 that quantify by how much their rate of return will increase if they raise their tolerance to default. We have observed that optimal trading strategies that account for market impact tend to move over the trading period toward the Merton solution. If they are initially close to the Merton solution, they will tend to remain close, which means the Merton solution is robust to perturbations. The speed of approach increases as the temporary impact parameter $\Gamma$ decreases. In addition, the main effect of the permanent impact $\Lambda$ is the Ponzi property that is manifested by some amount of buying near the end of the period. This Ponzi effect is typically small, but as Proposition 3.3 shows, it will dominate the character of the solution when $\Lambda$ becomes large enough to cause an asset price bubble.

Left to themselves, there is little incentive for such FIs to limit risk seeking. By choosing a low value of $\lambda$ , or equivalently, accepting a high leverage ratio, they can achieve a high rate of return on capital. Since lower temporary impact and higher permanent impact are both relatively more advantageous to larger firms, one has situations where large firms implement aggressive Ponzi style strategies. In scenarios where the assets perform badly, there is a likelihood of serious asset price feedback that may adversely affect other financial institutions holding common assets. Such asset price feedback, both bubbles and bursts, has been identified in the literature, notably Cifuentes et al. [7], as a critical channel of systemic risk, popularly known as the asset fire sale channel. One application of our model, yet to be explored in detail, will be its use to specify the natural behaviour of the banks and financial institutions in a large financial system, and then to see how systemic risk measures are affected by asset fire sales due to market impact. In this systemic risk context, it will also be important to introduce the effects of funding illiquidity by modelling the stochastic nature of deposits.

If large banks were permitted to act in their own self interest without regard to their systemic effects, they would pose an unacceptable threat to financial stability. For that reason, all banks are subjected to a regime of strict financial regulation, of which the most important are limits to their capital asset ratio and liquidity coverage ratio. Under such regulatory constraints, FIs’ investment strategies will differ dramatically from the optimal strategies produced in the present paper. The optimal behaviour of such regulated financial institutions will be the target of future modelling studies.

References

Almgren and Chriss [2001] R. Almgren and N. Chriss. Optimal execution of portfolio transactions. Journal of Risk, 3:5–40, 2001.
Almgren and Lorenz [2007] R. Almgren and J. Lorenz. Adaptive arrival price. Trading, 2007(1):59–66, 2007.
Almgren [2003] R. F. Almgren. Optimal execution with nonlinear impact functions and trading-enhanced risk. Applied Mathematical Finance, 10(1):1–18, 2003.
Brown et al. [2010] D. B. Brown, B. I. Carlin, and M. S. Lobo. Optimal portfolio liquidation with distress risk. Management Science, 56(11):1997–2014, 2010.
Brunnermeier and Pedersen [2009] M. K. Brunnermeier and L. H. Pedersen. Market liquidity and funding liquidity. Review of Financial Studies, 22(6):2201–2238, 2009.
Cartea and Jaimungal [2015] Á. Cartea and S. Jaimungal. Optimal execution with limit and market orders. Quantitative Finance, 15(8):1279–1291, 2015.
Cifuentes et al. [2005] R. Cifuentes, G. Ferrucci, and H. S. Shin. Liquidity risk and contagion. Journal of the European Economic Association, 3(2-3):556–566, 2005.
Davis and Norman [1990] M. H. Davis and A. R. Norman. Portfolio selection with transaction costs. Mathematics of Operations Research, 15(4):676–713, 1990.
Gatheral and Schied [2011] J. Gatheral and A. Schied. Optimal trade execution under geometric Brownian motion in the Almgren and Chriss framework. International Journal of Theoretical and Applied Finance, 14(03):353–368, 2011.
Hurd et al. [2016] T. R. Hurd, Q. H. Shao, and T. Q. Tran. Review of portfolio strategies in illiquid markets. available at, June 2016.
Markowitz [1952] H. Markowitz. Portfolio selection. The Journal of Finance, 7(1):77–91, 1952.
Merton [1969] R. C. Merton. Lifetime portfolio selection under uncertainty: the continuous–time model. Rev. Econom. Statist., 51:247–257, 1969.
Moazeni et al. [2013] S. Moazeni, T. F. Coleman, and Y. Li. Optimal execution under jump models for uncertain price impact. Journal of Computational Finance, 16(4):1–44, 2013.
Perold [1988] A. F. Perold. The implementation shortfall: Paper versus reality. The Journal of Portfolio Management, 14(3):4–9, 1988.
Pham and Tankov [2008] H. Pham and P. Tankov. A model of optimal consumption under liquidity risk with random trading times. Mathematical Finance, 18(4):613–627, 2008.
Schied and Schöneborn [2009] A. Schied and T. Schöneborn. Risk aversion and the dynamics of optimal liquidation strategies in illiquid markets. Finance and Stochastics, 13(2):181–204, 2009.
Schied et al. [2010] A. Schied, T. Schöneborn, and M. Tehranchi. Optimal basket liquidation for CARA investors is deterministic. Applied Mathematical Finance, 17(6):471–489, 2010.
Schöneborn [2008] T. Schöneborn. Trade execution in illiquid markets: Optimal stochastic control and multi-agent equilibria. PhD thesis, 2008.
Zhang [2014] T. Zhang. Nash Equilibria in Market Impact Models: Differential Game, Transient Price Impact and Transaction Costs. PhD thesis, Universität Mannheim, 2014.

Appendix A Proofs of Main Results

Proof of Theorem 2.1: In this proof we fix $T$ to be finite. The existence of a maximal $T^{*}$ is a consequence of solving (7), which is analyzed in the proof of Proposition 3.1. The Hamilton-Jacobi-Bellman (HJB) equation associated to (5) arises from the DPP by assuming Markov controls $v_{t}=v(t,T,X_{t},q_{t})$ and value function $W_{t}:=W(t,T,X^{v}_{t},q^{v}_{t})$ for deterministic functions $v,W$ . For simplicity of exposition, we have omitted the potential for dependence on the stock price $S_{t}$ : the standard verification result used at the end of this argument shows this is consistent.

Under these assumptions, the DPP implies that $-e^{-\lambda W(t,T,X^{v}_{t},q^{v}_{t})}$ is a supermartingale for all $v$ and a martingale for the optimal $v^{*}$ , which leads to the HJB equation for $W$

\begin{aligned}
&\partial_{t}W+\partial_{X}W\,q^{\prime}\mu+\frac{1}{2}q^{\prime}\Sigma\Sigma^{\prime}q\,[\partial_{XX}^{2}W-\lambda(\partial_{X}W)^{2}]\\
&\qquad+\sup_{v}\,[(\partial_{q}W^{\prime}+\partial_{X}W\,q^{\prime}\Lambda)v-v^{\prime}\Gamma v\,\partial_{X}W]=0,\\
&W(T,T,X,q)=X.
\end{aligned}

The ansatz $W(t,T,X,q)=X+V(t,T,q)$ leads to the equation for $V$

	$\displaystyle\partial_{t}V+q^{\prime}\mu-\frac{\lambda}{2}q^{\prime}\Sigma\Sigma^{\prime}q+\ {\sup}_{v}[(\partial_{q}V^{\prime}+q^{\prime}\Lambda)v-v^{\prime}\Gamma v]=0.$
	$\displaystyle V(T,T,q)=0.$

The optimal feedback control is thus $v^{*}=\frac{1}{2}\Gamma^{-1}(\partial_{q}V+\Lambda q),$ which is independent of $X$ and the price process, and hence deterministic. Using this control leads to

\partial_{t}V+q^{\prime}\mu-\frac{\lambda}{2}q^{\prime}\Sigma\Sigma^{\prime}q+\frac{1}{4}(\partial_{q}V+\Lambda q)^{\prime}\Gamma^{-1}(\partial_{q}V+\Lambda q)=0\ .

(54)

As we will shortly see in the proof of Proposition 3.1, this ODE has a unique smooth solution which is deterministic, over any finite time interval $[t,T]$ for $T$ less than a possibly infinite maximal $T^{*}$ . Therefore, by the classical verification theorem, we have $W=\tilde{W}$ and the other statements of the theorem follow.
∎
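The maximization over $v$ in the proof of Theorem 2.1 can be made explicit by completing the square. The shorthand $p$ is introduced here only for this remark, and the step assumes that $\Gamma$ is symmetric positive definite and that $\Lambda$ is symmetric, so that $q^{\prime}\Lambda=(\Lambda q)^{\prime}$.

```latex
p:=\partial_{q}V+\Lambda q,
\qquad
p^{\prime}v-v^{\prime}\Gamma v
=\frac{1}{4}\,p^{\prime}\Gamma^{-1}p
-\Bigl(v-\tfrac{1}{2}\Gamma^{-1}p\Bigr)^{\prime}\Gamma\Bigl(v-\tfrac{1}{2}\Gamma^{-1}p\Bigr).
```

Since the last term is non-positive and vanishes only at $v=\frac{1}{2}\Gamma^{-1}p$, the supremum equals $\frac{1}{4}p^{\prime}\Gamma^{-1}p$, which is the final term of (54).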

Proof of Proposition 3.1: By Theorem 2.1, the value function for Merton’s problem over $[t,T]$ has the form $W(t,T,X,q)=X+V(t,T,q),$ where $V$ satisfies the ODE (54). This ODE and the form (15) lead to Riccati equations with initial conditions for $A,B,C$

\begin{align}
\partial_{\tau}A-\frac{1}{4}(A+A^{\prime}+\Lambda)\Gamma^{-1}(A+A^{\prime}+\Lambda)+\frac{\lambda}{2}\Sigma\Sigma^{\prime}&=0,\quad A(0)=0 \tag{55}\\
\partial_{\tau}B-\frac{1}{2}B\Gamma^{-1}(A+A^{\prime}+\Lambda)-\mu^{\prime}&=0,\quad B(0)=0 \tag{56}\\
\partial_{\tau}C-\frac{1}{4}B\Gamma^{-1}B^{\prime}&=0,\quad C(0)=0. \tag{57}
\end{align}

Notice that if $A$ is a solution of (16), then so is $A^{\prime}$. By the uniqueness theorem for solutions of ODEs, $A=A^{\prime}$ and therefore $A$ is symmetric.

∎

Proof of Theorem 3.2: Part 1: Note that $\frac{\lambda}{2}\Gamma^{-1/2}\Sigma\Sigma^{\prime}\Gamma^{-1/2}$ is positive definite and define $D$ to be its symmetric square root. If

\tilde{A}(\tau):=\Gamma^{-1/2}(A(\tau)+\Lambda/2)\Gamma^{-1/2}

then (16) becomes

\partial_{\tau}\tilde{A}-\tilde{A}^{2}+D^{2}=0,\quad\tilde{A}(0)=E:=\Gamma^{-1/2}(\Lambda/2)\Gamma^{-1/2}\ .

(58)

One can now check that the solution to (58) has the form $\tilde{A}=VU^{-1},$ where $U,V$ satisfy the following linear ODE with initial condition

\left[\begin{array}[]{c}\partial_{\tau}U\\ \partial_{\tau}V\end{array}\right]=\left[\begin{array}[]{cc}0&-\mathbbm{1}\\ -D^{2}&0\end{array}\right]\times\left[\begin{array}[]{c}U\\ V\end{array}\right],\quad\left[\begin{array}[]{c}U(0)\\ V(0)\end{array}\right]=\left[\begin{array}[]{c}\mathbbm{1}\\ E\end{array}\right]\ .

By block-diagonalization using

Q=\left[\begin{array}[]{cc}\mathbbm{1}&\mathbbm{1}\\ D&-D\end{array}\right],\quad Q^{-1}=\frac{1}{2}\left[\begin{array}[]{cc}\mathbbm{1}&D^{-1}\\ \mathbbm{1}&-D^{-1}\end{array}\right]

one finds

\left[\begin{array}[]{cc}0&-\mathbbm{1}\\ -D^{2}&0\end{array}\right]=Q\left[\begin{array}[]{cc}-D&0\\ 0&D\end{array}\right]Q^{-1}

and therefore, the solution of the matrix ODE is

\left[\begin{array}[]{c}U(\tau)\\ V(\tau)\end{array}\right]=Q\left[\begin{array}[]{cc}e^{-D\tau}&0\\ 0&e^{D\tau}\end{array}\right]Q^{-1}\times\left[\begin{array}[]{c}\mathbbm{1}\\ E\end{array}\right]\ .

From the explicit forms

\begin{align}
U(\tau)&=\cosh(D\tau)-\sinh(D\tau)D^{-1}E \tag{59}\\
V(\tau)&=-\sinh(D\tau)D+\cosh(D\tau)E \tag{60}
\end{align}

one finds $A(\tau)=\Gamma^{1/2}(\tilde{A}(\tau)-E)\Gamma^{1/2}$ where

\tilde{A}(\tau)=[-\ {\sinh}(D\tau)D+\ {\cosh}(D\tau)E][\ {\cosh}(D\tau)-\ {\sinh}(D\tau)D^{-1}E]^{-1}.

(61)
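As a sanity check on (59)–(61), the scalar (single-asset) case can be tested numerically: the ratio $\tilde{A}=V/U$ should satisfy $\partial_{\tau}\tilde{A}-\tilde{A}^{2}+D^{2}=0$ with $\tilde{A}(0)=E$. The sketch below uses a central finite difference; the function names and the step size are choices made for this illustration, and the parameter values are the medium firm's benchmark values.

```python
import math

def U(tau, D, E):
    """Scalar version of (59)."""
    return math.cosh(D * tau) - math.sinh(D * tau) * E / D

def V(tau, D, E):
    """Scalar version of (60)."""
    return -math.sinh(D * tau) * D + math.cosh(D * tau) * E

def A_tilde(tau, D, E):
    """Scalar version of (61): A~ = V / U."""
    return V(tau, D, E) / U(tau, D, E)

def riccati_residual(tau, D, E, h=1e-6):
    """Central-difference estimate of d(A~)/d(tau) - A~^2 + D^2."""
    dA = (A_tilde(tau + h, D, E) - A_tilde(tau - h, D, E)) / (2.0 * h)
    a = A_tilde(tau, D, E)
    return dA - a * a + D * D

# Medium firm benchmark: Gamma = 1e-7, lambda = 6.7e-8, Sigma = 20, Lambda = 4e-8
gamma, lam, sigma, Lambda = 1e-7, 6.7e-8, 20.0, 4e-8
D = sigma * math.sqrt(lam / (2.0 * gamma))
E = Lambda / (2.0 * gamma)

residuals = [riccati_residual(tau, D, E) for tau in (0.01, 0.1, 0.25, 0.5)]
print(A_tilde(0.0, D, E), residuals)
```

The residuals should vanish up to finite-difference error, which is small compared with $D^{2}\approx 134$.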

The linear equation (17) for $B$ can be solved by noting that $\tilde{B}=B\Gamma^{-1/2}$ solves the ODE

\partial_{\tau}\tilde{B}-\tilde{B}\tilde{A}-\mu^{\prime}\Gamma^{-1/2}=0.

Since $\partial_{\tau}U=-\tilde{A}U$ , we find $\partial_{\tau}(\tilde{B}U)=(\partial_{\tau}\tilde{B}-\tilde{B}\tilde{A})U=\mu^{\prime}\Gamma^{-1/2}U$ which can be integrated to give $\tilde{B}(\tau)U(\tau)=\mu^{\prime}\Gamma^{-1/2}(\int^{\tau}_{0}U(s)ds)$ and thus

B(\tau)=\mu^{\prime}\Gamma^{-1/2}\left(\int_{0}^{\tau}U(s)ds\right)U^{-1}(\tau)\Gamma^{1/2}\ .

It is straightforward that $\int^{\tau}_{0}U(s)ds=D^{-2}[E-V(\tau)]$ which gives the desired formula

B(\tau)=\mu^{\prime}\Gamma^{-1/2}D^{-2}[E-V(\tau)]U^{-1}(\tau)\Gamma^{1/2}\ .

In a similar fashion, one finds

\begin{align}
C(\tau)&=\frac{1}{4}\int_{0}^{\tau}B(s)\Gamma^{-1}B^{\prime}(s)ds \tag{62}\\
&=\frac{1}{4}\overline{\mu}^{\prime}\left(\int_{0}^{\tau}(E-V(s))(U^{\prime}(s)U(s))^{-1}(E-V^{\prime}(s))ds\right)\overline{\mu}, \tag{63}
\end{align}

where $\overline{\mu}:=D^{-2}\Gamma^{-1/2}\mu.$

Part 2: This part is straightforward.

Part 3: From part 4 of Theorem 2.1, the optimal control $q^{*}(u)$ over the period $[t,T]$ solves

\partial_{u}q-\Gamma^{-1}(A(T-u)+\Lambda/2)q=\frac{1}{2}\Gamma^{-1}B^{\prime}(T-u)

When this linear ODE is multiplied on the left by the integrating factor $U^{-1}(T-u)\Gamma^{1/2}$ , the left-hand side becomes an exact derivative:

\partial_{u}\left[U^{-1}(T-u)\Gamma^{1/2}q\right]=U^{-1}(T-u)\Gamma^{1/2}\times\frac{1}{2}\Gamma^{-1}B^{\prime}(T-u)\ .

Integration of this equation over $[t,u]$ gives

U^{-1}(T-u)\Gamma^{1/2}q(u)-U^{-1}(T-t)\Gamma^{1/2}q=\frac{1}{2}\int^{u}_{t}U^{-1}(T-r)\Gamma^{-1/2}B^{\prime}(T-r)dr

which leads to the desired formula.
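In the single-asset case the integrated equation above can be evaluated directly. With scalar $U,V$ from (59)–(60) and $B(\tau)=\mu\,(E-V(\tau))/(D^{2}U(\tau))$, it reduces to $q^{*}(u)=U(T-u)\bigl[q/U(T-t)+q^{M}\int_{t}^{u}(E-V(T-r))/U(T-r)^{2}\,dr\bigr]$, where $q^{M}=\mu/(\lambda\Sigma^{2})$ is taken here to be the Merton holding (an assumption, since its definition lies outside this section). The sketch below is a hypothetical helper that evaluates this expression with a composite Simpson rule.

```python
import math

def optimal_holding(u, q0, T, mu, sigma, lam, gamma, Lambda, t=0.0, n=2000):
    """Single-asset optimal holding q*(u) on [t, T], by Simpson quadrature.

    ASSUMPTIONS: scalar parameters, q_merton = mu / (lam * sigma^2), n even,
    and U(tau) > 0 on [0, T - t] (the bounded case).
    """
    D = sigma * math.sqrt(lam / (2.0 * gamma))
    E = Lambda / (2.0 * gamma)
    q_merton = mu / (lam * sigma ** 2)

    def U(tau):
        return math.cosh(D * tau) - math.sinh(D * tau) * E / D

    def V(tau):
        return -math.sinh(D * tau) * D + math.cosh(D * tau) * E

    def integrand(r):
        tau = T - r
        return (E - V(tau)) / U(tau) ** 2

    if u == t:
        integral = 0.0
    else:
        h = (u - t) / n
        total = integrand(t) + integrand(u)
        for k in range(1, n):
            total += (4.0 if k % 2 else 2.0) * integrand(t + k * h)
        integral = total * h / 3.0

    return U(T - u) * (q0 / U(T - t) + q_merton * integral)

# Medium firm with benchmark parameters (lambda from Section 4.1)
params = dict(q0=200_000.0, T=0.5, mu=4.0, sigma=20.0,
              lam=6.7e-8, gamma=1e-7, Lambda=4e-8)
path = [(u, optimal_holding(u, **params)) for u in (0.0, 0.1, 0.25, 0.4, 0.5)]
for u, q in path:
    print(f"u={u:.2f}  q*={q:,.0f}")
```

When $\Lambda=0$ the integral can be done by hand and the expression collapses to $q^{M}+(q-q^{M})\cosh(D(T-u))/\cosh(D(T-t))$, which makes the attraction towards the Merton holding explicit and gives a convenient check on the quadrature.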

Part 4: The variance is calculated directly as follows

\text{Var}_{t}(X_{T}^{*})=\int_{t}^{T}q^{*}(s)^{\prime}\Sigma\Sigma^{\prime}q^{*}(s)ds=q^{\prime}L(T-t)q+M(T-t)q+N(T-t)\ .

Rewrite $q^{*}(u)=\tilde{U}(T-u)\tilde{U}^{-1}(T-t)q+\frac{1}{2}\tilde{U}(T-u)I(u)$, where $\tilde{U}(T-u):=\Gamma^{-1/2}U(T-u)$ and $I(u):=\int_{t}^{u}\tilde{U}^{-1}(T-r)\Gamma^{-1}B^{\prime}(T-r)dr$. Explicit forms for $L,M,N$ are calculated as follows.

L(T-t)=(\tilde{U}^{-1}(T-t))^{\prime}\Bigl(\int_{t}^{T}\tilde{U}(T-r)^{\prime}\Sigma\Sigma^{\prime}\tilde{U}(T-r)dr\Bigr)\tilde{U}^{-1}(T-t)\ .

By Fubini’s theorem, we have

\begin{aligned}
M^{\prime}(T-t)&=\int_{t}^{T}(\tilde{U}^{-1}(T-t))^{\prime}\tilde{U}(T-r)^{\prime}\Sigma\Sigma^{\prime}\tilde{U}(T-r)I(r)dr\\
&=\int_{t}^{T}\Bigl(\int_{s}^{T}(\tilde{U}^{-1}(T-t))^{\prime}\tilde{U}(T-r)^{\prime}\Sigma\Sigma^{\prime}\tilde{U}(T-r)dr\Bigr)\tilde{U}^{-1}(T-s)\Gamma^{-1}B^{\prime}(T-s)ds\\
&=(\tilde{U}(T-t)^{\prime})^{-1}\int_{t}^{T}\tilde{U}(T-s)^{\prime}L(T-s)\Gamma^{-1}B^{\prime}(T-s)ds\ .
\end{aligned}

Similarly

\begin{aligned}
N(T-t)&=\frac{1}{4}\int_{t}^{T}I(r)^{\prime}\tilde{U}(T-r)^{\prime}\Sigma\Sigma^{\prime}\tilde{U}(T-r)I(r)dr\\
&=\frac{1}{4}\int_{t}^{T}\Bigl(\int_{s}^{T}I(r)^{\prime}\tilde{U}(T-r)^{\prime}\Sigma\Sigma^{\prime}\tilde{U}(T-r)dr\Bigr)\tilde{U}^{-1}(T-s)\Gamma^{-1}B^{\prime}(T-s)ds\\
&=\frac{1}{4}\int_{t}^{T}M(T-s)\Gamma^{-1}B^{\prime}(T-s)ds\ .
\end{aligned}

∎