
Reducing Obizhaeva-Wang type trade execution problems to LQ stochastic control problems

Julia Ackermann Department of Mathematics & Informatics, University of Wuppertal, Gaußstr. 20, 42119 Wuppertal, Germany. Email: jackermann@uni-wuppertal.de, Phone: +49 (0)202 4395238.    Thomas Kruse Department of Mathematics & Informatics, University of Wuppertal, Gaußstr. 20, 42119 Wuppertal, Germany. Email: tkruse@uni-wuppertal.de, Phone: +49 (0)202 4395239.    Mikhail Urusov Faculty of Mathematics, University of Duisburg-Essen, Thea-Leymann-Str. 9, 45127 Essen, Germany. Email: mikhail.urusov@uni-due.de, Phone: +49 (0)201 1837428.
Abstract

We start with a stochastic control problem where the control process is of finite variation (possibly with jumps) and acts as integrator both in the state dynamics and in the target functional. Problems of this type arise in the stream of literature on optimal trade execution pioneered by Obizhaeva and Wang (models with finite resilience). We consider a general framework where the price impact and the resilience are stochastic processes; both are allowed to have diffusive components. First we continuously extend the problem from processes of finite variation to progressively measurable processes. Then we reduce the extended problem to a linear quadratic (LQ) stochastic control problem. Using the well-developed theory on LQ problems, we describe the solution of the obtained LQ problem and trace it back to the solution of the (extended) initial trade execution problem. Finally, we illustrate our results by several examples. Among other things, the examples exhibit the Obizhaeva-Wang model with random (terminal and moving) targets, the necessity of extending the initial trade execution problem to a reasonably large class of progressively measurable processes (even going beyond semimartingales), and the effects of diffusive components in the price impact process and/or in the resilience process.

Keywords: optimal trade execution; stochastic price impact; stochastic resilience; finite variation stochastic control; continuous extension of cost functional; progressively measurable execution strategy; linear quadratic stochastic control; backward stochastic differential equation.

2020 MSC: Primary: 91G10; 93E20; 60H10. Secondary: 60G99.

Introduction

In the literature on optimal trade execution in illiquid financial markets there arise stochastic control problems where the control is a process of finite variation (possibly with jumps) that acts as integrator both in the state dynamics and in the target functional. For brevity, we use the term finite variation stochastic control for such problems. (Notice that the class of finite variation stochastic control problems contains the class of singular stochastic control problems.) In contrast, for control problems where the state is driven by a controlled stochastic differential equation (SDE) and the control enters as an argument of that SDE and as an argument of the integrand of the target functional, we use the term standard stochastic control problems.

In this article we present a general solution approach to finite variation stochastic control problems that arise in the literature on optimal trade execution. We set up a finite variation stochastic control problem of the type considered in Obizhaeva and Wang [37] and its extensions, e.g., Alfonsi and Acevedo [4], Bank and Fruth [13], Fruth et al. [24] and [25]. We then show how it can be transformed into a standard linear quadratic (LQ) stochastic control problem, which can be solved with the help of state-of-the-art techniques from stochastic optimal control theory. In the introduction we first describe the finite variation stochastic control problem and showcase its usage in finance, before presenting our solution approach, summarizing our main contributions and embedding our paper into the literature.

Finite variation stochastic control problem: As a starting point we consider in this paper the following stochastic control problem. Let $T>0$ and let $(\Omega,\mathcal{F}_{T},(\mathcal{F}_{t})_{t\in[0,T]},P)$ be a filtered probability space satisfying the usual conditions. Let $\xi$ be an $\mathcal{F}_{T}$-measurable random variable and let $\zeta=(\zeta_{s})_{s\in[0,T]}$ be a progressively measurable process, both satisfying suitable integrability assumptions (see (5) below). Further, let $\lambda=(\lambda_{s})_{s\in[0,T]}$ be a bounded progressively measurable process. Let $\gamma=(\gamma_{s})_{s\in[0,T]}$ be a positive Itô process driven by some Brownian motion and $R=(R_{s})_{s\in[0,T]}$ an Itô process driven by a (stochastically) correlated Brownian motion (see (3) and (4) below). Throughout the introduction we fix $t\in[0,T]$, $x,d\in\mathbb{R}$ and denote by $\mathcal{A}^{fv}_{t}(x,d)$ the set of all adapted, càdlàg, finite variation processes $X=(X_{s})_{s\in[t-,T]}$ satisfying $X_{t-}=x$, $X_{T}=\xi$, and appropriate integrability assumptions (see (A1)–(A3) below). To each $X\in\mathcal{A}^{fv}_{t}(x,d)$ we associate a process $D^{X}=(D^{X}_{s})_{s\in[t-,T]}$ satisfying

dD^{X}_{s}=-D^{X}_{s}\,dR_{s}+\gamma_{s}\,dX_{s},\quad s\in[t,T],\quad D^{X}_{t-}=d.  (1)

We consider the finite variation stochastic control problem of minimizing the cost functional

J^{fv}_{t}(x,d,X)=E_{t}\left[\int_{[t,T]}\left(D^{X}_{s-}+\frac{1}{2}\Delta X_{s}\gamma_{s}\right)dX_{s}+\int_{t}^{T}\lambda_{s}\gamma_{s}\left(X_{s}-\zeta_{s}\right)^{2}ds\right]  (2)

over $X\in\mathcal{A}^{fv}_{t}(x,d)$, where $E_{t}[\cdot]$ is a shorthand notation for $E[\cdot\,|\,\mathcal{F}_{t}]$.
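To make the objects just introduced concrete, here is a minimal numerical sketch (our own illustration; all parameter values and the particular strategy are assumptions chosen for concreteness, not taken from the paper). It simulates the deviation dynamics (1) with an Euler scheme and evaluates the cost (2) for a simple finite variation strategy: an initial block trade followed by constant-rate liquidation toward the terminal target $\xi=0$, with moving target $\zeta\equiv 0$.

```python
import numpy as np

rng = np.random.default_rng(0)
T, N = 1.0, 2000
dt = T / N
gamma0, mu, sig = 1.0, 0.05, 0.2   # price impact dynamics (assumed constants)
rho, eta, rbar = 2.0, 0.3, 0.5     # resilience dynamics (assumed constants)
lam = 0.1                          # risk weight lambda (assumed constant)
x0 = 1.0                           # initial position; targets: xi = 0, zeta = 0

dW1 = rng.normal(0.0, np.sqrt(dt), N)
dW2 = rng.normal(0.0, np.sqrt(dt), N)
dWR = rbar * dW1 + np.sqrt(1.0 - rbar**2) * dW2   # correlated driver of R

# strategy: block trade of half the position at time 0, then TWAP to zero
X = np.empty(N + 1)
X[0] = x0 / 2.0                     # position right after the time-0 block
X[1:] = X[0] * (1.0 - np.arange(1, N + 1) * dt / T)

gamma = np.empty(N + 1)
gamma[0] = gamma0
D = np.empty(N + 1)
cost = 0.0

# time-0 block: deviation jumps by gamma*dX; executed at the mid deviation
dX0 = X[0] - x0
D[0] = gamma0 * dX0
cost += (0.0 + 0.5 * gamma0 * dX0) * dX0

for k in range(N):
    dR = rho * dt + eta * dWR[k]
    dX = X[k + 1] - X[k]
    cost += D[k] * dX                        # cost of continuous trading
    cost += lam * gamma[k] * X[k] ** 2 * dt  # risk term with zeta = 0
    D[k + 1] = D[k] - D[k] * dR + gamma[k] * dX
    gamma[k + 1] = gamma[k] * (1.0 + mu * dt + sig * dW1[k])

print(round(cost, 4))
```

The block trade contributes $\Delta X(D_{-}+\tfrac12\gamma\Delta X)$, the absolutely continuous part contributes $\int D\,dX$, and the risk term is integrated along the path; this is only a sketch of the cost of one fixed strategy, not of the optimization itself.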

Financial interpretation: Stochastic control problems with a cost functional of the form (2), or a special case thereof, play a central role in the scientific literature on optimal trade execution problems (see the literature discussion below). Consider an institutional investor who, immediately prior to time $t\in[0,T]$, holds a position $x\in\mathbb{R}$ ($x>0$ meaning a long position of $x$ shares of a stock and $x<0$ a short position of $-x$ shares) in a certain financial asset. The investor trades the asset during the period $[t,T]$ in such a way that at each time $s\in[t-,T]$ the position is given by the value $X_{s}$ of the adapted, càdlàg, finite variation process $X=(X_{s})_{s\in[t-,T]}$ (satisfying $X_{t-}=x$). More precisely, $X_{s-}$ represents the position immediately prior to the trade at time $s$, while $X_{s}$ is the position immediately after that trade. The investor’s goal is to reach the target position

X_{T}=\xi

during the course of the trading period $[t,T]$. Note that we allow $\xi$ to be random to incorporate the possibility that the target position is not known at the beginning of trading but only revealed at the terminal time $T$. Such situations may, for example, be faced by airline companies buying on forward markets the kerosene they need in $T$ months. Their precise demand for kerosene at that future time depends on several factors, such as ticket sales and flight schedules, that are not known today but only gradually learned.

We assume that the market the investor trades in is illiquid, implying that the investor’s trades impact the asset price. To model this effect, we assume (as is typically done in the literature on optimal trade execution) an additive impact on the price. This means that the realized price at which the investor trades at time $r\in[t,T]$ consists of an unaffected price $S^{0}_{r}$ plus a deviation $D^{X}_{r}$ that is caused by the investor’s trades during $[t,r]$. We assume that the unaffected price process $S^{0}=(S^{0}_{r})_{r\in[0,T]}$ is a càdlàg martingale satisfying appropriate integrability conditions. Then integration by parts and the martingale property of $S^{0}$ ensure that the expected trading costs due to $S^{0}$ are given by

E_{t}\left[\int_{[t,T]}S^{0}_{r}\,dX_{r}\right]=E_{t}\left[\xi S_{T}^{0}\right]-xS_{t}^{0}.

Thus, these costs do not depend on the investor’s trading strategy $X$ and are therefore neglected in the sequel (we refer to Remark 2.2 in [1] for a more detailed discussion in the case $\xi=0$). The deviation process $D^{X}$ associated to $X$ is given by (1). Informally speaking, we see from (1) that a trade of size $dX_{s}$ at time $s\in[t,T]$ impacts $D^{X}$ by $\gamma_{s}\,dX_{s}$. So, the factor $\gamma_{s}$ determines how strongly the price reacts to trades, and the process $\gamma$ is therefore called the price impact process. In particular, the fact that $\gamma$ is positive entails that a buy trade $dX_{s}>0$ leads to higher prices whereas a sell trade $dX_{s}<0$ leads to lower prices. The second component $-D^{X}_{s}\,dR_{s}$ in the dynamics (1) describes the behavior of $D^{X}$ when the investor is not trading. Typically, it is assumed that $R$ is an increasing process such that, in the absence of trades, $D^{X}$ reverts to $0$ at relative rate $dR_{s}$. Therefore, $R$ is called the resilience process. We refer to [3] for a discussion of the effects of “negative” resilience, where $R$ might also be decreasing. We highlight that in the present paper we allow $R$ to have a diffusive part. In summary, we note that the deviation prior to a trade of the investor at time $s\in[t,T]$ is given by $D^{X}_{s-}$, whereas it is equal to $D^{X}_{s}=D^{X}_{s-}+\gamma_{s}\Delta X_{s}$ afterwards. We take the mean $D^{X}_{s-}+\frac{1}{2}\gamma_{s}\Delta X_{s}$ of these two values as the realized price per unit, so that the investor’s overall trading costs due to $D^{X}$ amount to $\int_{[t,T]}\left(D^{X}_{s-}+\frac{1}{2}\gamma_{s}\Delta X_{s}\right)dX_{s}$. This describes the first integral on the right-hand side of (2).
Under the assumption that $\lambda$ is nonnegative, the second integral $\int_{t}^{T}\lambda_{s}\gamma_{s}\left(X_{s}-\zeta_{s}\right)^{2}ds$ can be understood as a risk term that penalizes any deviation of the position $X$ from the moving target $\zeta$ in a quadratic way. (The parametrization $\lambda_{s}\gamma_{s}$, $s\in[0,T]$, for the weight is chosen out of mathematical convenience, since it makes some of the following assumptions and results shorter to state. Alternatively, one can use $\tilde{\lambda}_{s}$, $s\in[0,T]$, as a weight and replace $\lambda$ by $\tilde{\lambda}/\gamma$ in the subsequent assumptions and results.) A possible and natural choice would be $\zeta_{s}=E_{s}[\xi]$, $s\in[0,T]$, so that the risk term ensures that any optimal strategy $X$ does not deviate too much from the (expected) target position $\xi$ in the course of the trading period.
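A back-of-the-envelope computation (ours, with assumed constants) makes the role of finite resilience in the first integral of (2) tangible: a single buy of size $Q$ starting from zero deviation costs $\gamma Q^{2}/2$, whereas splitting it into two half-blocks separated by a waiting time $\tau$ is strictly cheaper, because the deviation decays at rate $\rho$ in between.

```python
import math

gamma, rho, Q, tau = 1.0, 2.0, 1.0, 0.5  # assumed illustrative constants

# one block: deviation jumps from 0 to gamma*Q, executed at the mid deviation
cost_one = (0.5 * gamma * Q) * Q

# two half-blocks with exponential decay of the deviation in between
d_after_first = gamma * Q / 2
cost_first = (0.5 * gamma * Q / 2) * (Q / 2)
d_before_second = d_after_first * math.exp(-rho * tau)
cost_second = (d_before_second + 0.5 * gamma * Q / 2) * (Q / 2)

print(cost_first + cost_second, cost_one)
```

The gap between the two costs shrinks to zero as $\rho\tau\to 0$ and grows as $\rho\tau\to\infty$, which is the basic economic mechanism behind order splitting in models with finite resilience.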

Solution approach: The overarching goal of this paper is to show that the finite variation stochastic control problem (2) is equivalent to a standard LQ stochastic control problem (see Corollary 2.3 and Corollary 2.4 below). The derivation of this result is based on the following insights. The first observation is that, in general, the functional (2) does not admit a minimizer in $\mathcal{A}^{fv}_{t}(x,d)$ (see Section 4.3 below for a specific example). In [1] the functional (2) was extended to a set of càdlàg semimartingales $X$, and it was shown that its minimum is attained in this set of semimartingales if and only if a certain process derived from the solution of an associated backward stochastic differential equation (BSDE) can be represented by a càdlàg semimartingale (see Theorem 2.4 in [1]). In this work we go a step further and extend the functional (2) to the set $\mathcal{A}^{pm}_{t}(x,d)$ of progressively measurable processes $X=(X_{s})_{s\in[t-,T]}$ satisfying appropriate integrability conditions (see (A1) below) and the boundary conditions $X_{t-}=x$ and $X_{T}=\xi$. To do so, we first derive alternative representations of the first integral inside the expectation in (2) and of the deviation in (1) that do not involve $X\in\mathcal{A}^{fv}_{t}(x,d)$ as an integrator (see Proposition 1.3). It follows that the resulting alternative representation of $J^{fv}$ (see Proposition 1.4) is well-defined not only on $\mathcal{A}^{fv}_{t}(x,d)$ but even on $\mathcal{A}^{pm}_{t}(x,d)$, and we denote this extended functional by $J^{pm}$ (see Section 1.3). We next introduce a metric on $\mathcal{A}^{pm}_{t}(x,d)$ and prove that $J^{pm}$ is the unique continuous extension of $J^{fv}$ from $\mathcal{A}^{fv}_{t}(x,d)$ to $\mathcal{A}^{pm}_{t}(x,d)$ (see Theorem 1.7). In particular, it follows that the infimum of $J^{fv}$ over $\mathcal{A}^{fv}_{t}(x,d)$ and the infimum of $J^{pm}$ over $\mathcal{A}^{pm}_{t}(x,d)$ coincide.

Next, for a given $X\in\mathcal{A}^{pm}_{t}(x,d)$ we identify the process $\overline{H}^{X}_{s}=\gamma_{s}^{-\frac{1}{2}}D^{X}_{s}-\gamma_{s}^{\frac{1}{2}}X_{s}$, $s\in[t,T]$, as a useful tool in our analysis. Despite $X$ and $D^{X}$ having discontinuous paths in general, the process $\overline{H}^{X}$, which we call the scaled hidden deviation process, is always continuous. Moreover, we show that $\overline{H}^{X}$ can be expressed in feedback form as an Itô process with coefficients that are linear in $\gamma^{-\frac{1}{2}}D^{X}$ and $\overline{H}^{X}$ (see Lemma 1.6). Subsequently, we reinterpret the process $\gamma^{-\frac{1}{2}}D^{X}$ as a control process $u$ and $\overline{H}^{X}$ as the associated state process. Since the cost functional $J^{pm}$ is quadratic in $\overline{H}^{X}$ and $u=\gamma^{-\frac{1}{2}}D^{X}$, we arrive at a standard LQ stochastic control problem (see (22) and (23)) whose minimal costs coincide with the infimum of $J^{pm}$ over $\mathcal{A}^{pm}_{t}(x,d)$ (see Corollary 2.3). Importantly, there is a one-to-one correspondence between square integrable controls $u$ for this standard problem and strategies $X\in\mathcal{A}^{pm}_{t}(x,d)$, which allows us to recover the minimizer $X^{*}\in\mathcal{A}^{pm}_{t}(x,d)$ of $J^{pm}$ from a minimizer $u^{*}$ of the standard problem and vice versa (see Corollary 2.4).
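The continuity of $\overline{H}^{X}$ can be checked directly from (1) (a short computation, included here for the reader's convenience): since $\gamma$ and $R$ are continuous, a jump of $X$ at time $s$ produces $\Delta D^{X}_{s}=\gamma_{s}\Delta X_{s}$, and hence

```latex
\Delta\overline{H}^{X}_{s}
 = \gamma_{s}^{-\frac{1}{2}}\,\Delta D^{X}_{s}-\gamma_{s}^{\frac{1}{2}}\,\Delta X_{s}
 = \gamma_{s}^{-\frac{1}{2}}\gamma_{s}\,\Delta X_{s}-\gamma_{s}^{\frac{1}{2}}\,\Delta X_{s}
 = 0,
 \qquad s\in[t,T].
```

The jumps of the two summands cancel exactly, which is what makes $\overline{H}^{X}$ a convenient continuous state variable.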

We then solve the LQ stochastic control problem in (22) and (23) using techniques provided in the literature on stochastic optimal control theory. More precisely, we apply results from Kohlmann and Tang [34] (in Remark 3.1 we moreover indicate how we could alternatively use results from Sun et al. [40]) to provide conditions that guarantee that an optimal control $u^{*}$ exists and is unique. This optimal control $u^{*}$ in the LQ problem is characterized by two BSDEs: one is a quadratic BSDE of Riccati type; the other one is linear, however with unbounded coefficients (see Theorem 3.3). In Corollary 3.4 we trace everything back and obtain a unique optimal execution strategy in the class of progressively measurable processes in closed form (in terms of the solutions to the mentioned BSDEs).

Summary of our contributions: (a) The Obizhaeva-Wang type finite variation stochastic control problem (1)–(2) is continuously extended to the set $\mathcal{A}^{pm}_{t}(x,d)$ of appropriate progressively measurable processes $X$.

(b) Problem (1)–(2) is rather general. In particular, it includes the following features:

  • Presence of random terminal and moving targets $\xi$ and $(\zeta_{s})$;

  • The price impact is a positive Itô process $(\gamma_{s})$;

  • The resilience is an Itô process $(R_{s})$ acting as an integrator in (1). (To expand on this point, it is worth noting that, in our current parametrization, only processes $(R_{s})$ with dynamics $dR_{s}=\rho_{s}\,ds$, i.e., without a diffusive component, have so far been considered in the literature on optimal trade execution in Obizhaeva-Wang type models. Moreover, in most papers $\rho$ is assumed to be positive, that is, only the case of an increasing $(R_{s})$ was extensively studied previously.)

(c) By introducing the aforementioned scaled hidden deviation process $(\overline{H}^{X}_{s})$ and reinterpreting the process $(\gamma_{s}^{-\frac{1}{2}}D^{X}_{s})$ as a control in an (a priori different) stochastic control problem, the problem extended to $\mathcal{A}^{pm}_{t}(x,d)$ is reduced to an explicitly solvable LQ stochastic control problem. Thus, a unique optimal execution strategy in $\mathcal{A}^{pm}_{t}(x,d)$ is obtained in closed form (in terms of solutions to two BSDEs).

Literature discussion: Finite variation stochastic control problems arise in the group of literature on optimal trade execution in limit order books with finite resilience. The pioneering work of Obizhaeva and Wang [37] (first posted on SSRN in 2005) models the price impact via a block-shaped limit order book, where the impact decays exponentially at a constant rate. This embeds into our model via a price impact process $\gamma$ that is a positive constant and a resilience process $(R_{s})$ given by $R_{s}=\rho s$ with a constant $\rho>0$. Alfonsi et al. [5] study constrained portfolio liquidation in the Obizhaeva-Wang model. Subsequent works within this group of literature either extend this framework in different directions or suggest alternative frameworks with similar features. There is a subgroup of models which include more general limit order book shapes, see Alfonsi et al. [6], Alfonsi and Schied [7], Predoiu et al. [38]. Models in another subgroup extend the exponential decay of the price impact to general decay kernels, see Alfonsi et al. [8], Gatheral et al. [27]. Models with multiplicative price impact are analyzed in Becherer et al. [17, 18]. We mention that in [18] the (multiplicative) deviation is of Ornstein-Uhlenbeck type and incorporates a diffusion term (but this is different from our diffusion term, which results from a diffusive part in the resilience $R$). Superreplication and optimal investment in a block-shaped limit order book model with exponential resilience are discussed in Bank and Dolinsky [11, 12] and in Bank and Voß [16].
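The block-shaped constant-coefficient case of [37] lends itself to a quick numerical sketch (our own illustration with assumed parameters, not code from [37]): after discretizing time, the execution cost of trades $x_{i}$ at times $t_{i}$ is the quadratic form $\frac{1}{2}x^{\top}Mx$ with $M_{ij}=\gamma e^{-\rho|t_{i}-t_{j}|}$, and minimizing it under the liquidation constraint $\sum_{i}x_{i}=X_{0}$ recovers the well-known profile of two larger block trades at the endpoints and constant-rate trading in between.

```python
import numpy as np

# assumed illustrative parameters for the constant-coefficient case
gamma, rho, T, N, X0 = 1.0, 5.0, 1.0, 21, 1.0
t = np.linspace(0.0, T, N)

# cost of trades x_i at times t_i: 0.5 * x^T M x with exponential decay kernel
M = gamma * np.exp(-rho * np.abs(t[:, None] - t[None, :]))

def cost(x):
    return 0.5 * x @ M @ x

# minimize 0.5 x^T M x subject to sum(x) = X0 (closed-form Lagrange solution)
w = np.linalg.solve(M, np.ones(N))
x_opt = X0 * w / w.sum()

x_twap = np.full(N, X0 / N)
print(cost(x_opt), cost(x_twap))
```

Here the constrained minimizer is obtained in closed form via $x^{*}\propto M^{-1}\mathbf{1}$; its shape (equal interior trades, symmetric larger endpoint trades) is the discrete analogue of the optimal strategy in [37].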

The present paper falls into the subgroup of the literature that studies time-dependent (possibly stochastic) price impact $(\gamma_{s})$ and resilience $(R_{s})$ in generalized Obizhaeva-Wang models. In this connection we mention the works Alfonsi and Acevedo [4], Bank and Fruth [13], Fruth et al. [24], where deterministically varying price impact and resilience are considered. Fruth et al. [25] allow for stochastically varying price impact (resilience is still deterministic) and study the arising optimization problem over monotone strategies. Optimal strategies in a discrete-time model with stochastically varying resilience and constant price impact are derived in Siu et al. [39]. In Ackermann et al. [1, 3, 2] both price impact and resilience are stochastic. We now describe the differences from our present paper in more detail. In [2] optimal execution is studied in discrete time via dynamic programming. In [1] the framework is the closest to the one in this paper. Essentially, our current framework is the framework from [1] extended by a risk term with some moving target $(\zeta_{s})$, a possibly non-zero (random) terminal target $\xi$, and a larger class of resilience processes (in [1], as in many previous papers, $(R_{s})$ is assumed to have the dynamics $dR_{s}=\rho_{s}\,ds$, and $(\rho_{s})$ is called the resilience). In [3] the framework is similar to the one in [1], while the aim is to study qualitative effects of “negative” resilience (in the sense that $\rho_{s}\leq 0$, with $(\rho_{s})$ as in the previous sentence). To compare the approach of the present paper with the one in [1], we first recall that in [1] the finite variation stochastic control problem of the type (1)–(2) is extended to allow for càdlàg semimartingale trading strategies $X$, and the resulting optimal execution problem over semimartingales is studied. The approach in [1] is based on (1)–(2) (extended with some additional terms), but this does not work beyond semimartingales, as $X$ acts as integrator there. In contrast, our continuous extension needs to employ essentially different ideas, since we want to consider the set $\mathcal{A}^{pm}_{t}(x,d)$ of progressively measurable strategies (in particular, going beyond semimartingales). This extension is indeed necessary to obtain an optimizer (see the discussion at the end of Section 4.3).

Especially with regard to our extension result we now mention several papers where, in different models with finite resilience, trading strategies are not restricted to be of finite variation. The first instance known to us is Lorenz and Schied [35], who discuss the dependence of optimal trade execution strategies on a drift in the unaffected price. In order to react to non-martingale trends they allow for càdlàg semimartingale trading strategies. Gârleanu and Pedersen [26, Section 1.3] allow for strategies of infinite variation in an infinite horizon portfolio optimization problem under market frictions. Becherer et al. [19] prove a continuous extension result for the gains of a large investor in the Skorokhod $J_{1}$ and $M_{1}$ topologies in the class of predictable strategies with càdlàg paths. As discussed in the previous paragraph in more detail, in [1] the strategies are càdlàg semimartingales. In Horst and Kivman [29] càdlàg semimartingale strategies emerge in the limiting case of a vanishing instantaneous impact parameter, where the initial modeling framework is inspired by Graewe and Horst [28] and Horst and Xia [31].

To complement the preceding discussion from another perspective, we mention Carmona and Webster [22], who examine high-frequency trading in limit order books in general (not necessarily related to optimal trade execution). It is very interesting that one of their conclusions is strong empirical evidence for the infinite variation nature of the trading strategies of high-frequency traders.

Finally, let us mention that, in the context of trade execution problems, risk terms with zero moving target have been included, e.g., in Ankirchner et al. [9], Ankirchner and Kruse [10], Graewe and Horst [28]. Inequality terminal constraints have been considered in Dolinsky et al. [23], and risk terms with general terminal and moving targets appear in the models of, e.g., Bank et al. [14], Bank and Voß [15], Horst and Naujokat [30], Naujokat and Westray [36]. In particular, [10], [15] and [23] consider random terminal targets $\xi$ within trade execution models where position paths are required to be absolutely continuous functions of time. This restriction of the set of position paths entails technical difficulties that make these problems challenging to analyze. In particular, the existence of admissible paths that satisfy the terminal constraint is far from obvious and can in general only be assured under further conditions on $\xi$. Since in our model position paths are allowed to jump at the terminal time, we do not face these challenges in our framework.

The paper is structured as follows. Section 1 is devoted to the continuous extension of our initial trade execution problem to the class of progressively measurable strategies. Section 2 reduces the problem for progressively measurable strategies to a standard LQ stochastic control problem. In Section 3 we present the solution to the obtained LQ problem and trace it back to the solution of the trade execution problem (extended to progressively measurable strategies). In Section 4 we illustrate our results with several examples. Finally, Section 5 contains the proofs together with some auxiliary results needed for them.

1 From finite variation to progressively measurable execution strategies

In this section we first set up the finite variation stochastic control problem (see Section 1.1). In Section 1.2 we then derive alternative representations of the cost functional and the deviation process which do not require the strategies to be of finite variation. We use these results in Section 1.3 to extend the cost functional to progressively measurable strategies. In Section 1.5 we show that this is the unique continuous extension. Section 1.4 introduces the hidden deviation process as a key tool for the proofs of Section 1.5. All proofs of this section are deferred to Section 5.

1.1 The finite variation stochastic control problem

Let $T>0$ and $m\in\mathbb{N}$, $m\geq 2$. We fix a filtered probability space $(\Omega,\mathcal{F}_{T},(\mathcal{F}_{s})_{s\in[0,T]},P)$ satisfying the usual conditions and supporting an $m$-dimensional Brownian motion $(W^{1},\ldots,W^{m})^{\top}$ with respect to the filtration $(\mathcal{F}_{s})$.

We first fix some notation. For $t\in[0,T]$, conditional expectations with respect to $\mathcal{F}_{t}$ are denoted by $E_{t}[\cdot]$. For $t\in[0,T]$ and a càdlàg process $X=(X_{s})_{s\in[t-,T]}$, the jump at time $s\in[t,T]$ is denoted by $\Delta X_{s}=X_{s}-X_{s-}$. We follow the convention that, for $t\in[0,T]$, $r\in[t,T]$ and a càdlàg semimartingale $L=(L_{s})_{s\in[t-,T]}$, jumps of the càdlàg integrator $L$ at time $t$ contribute to integrals of the form $\int_{[t,r]}\ldots dL_{s}$. In contrast, we write $\int_{(t,r]}\ldots dL_{s}$ when we do not include jumps of $L$ at time $t$ in the integral. The notation $\int_{t}^{r}\ldots dL_{s}$ is sometimes used for continuous integrators $L$. For $n\in\mathbb{N}$ and $y\in\mathbb{R}^{n}$ let $\lVert y\rVert_{2}=(\sum_{j=1}^{n}y_{j}^{2})^{\frac{1}{2}}$. For every $t\in[0,T]$ we denote by $L^{1}(\Omega,\mathcal{F}_{t},P)$ the space of all real-valued $\mathcal{F}_{t}$-measurable random variables $Y$ such that $\lVert Y\rVert_{L^{1}}=E[\lvert Y\rvert]<\infty$. For $t\in[0,T]$, let $\mathcal{L}_{t}^{2}=\mathcal{L}^{2}(\Omega\times[t,T],\mathrm{Prog}(\Omega\times[t,T]),dP\times ds|_{[t,T]})$ denote the space of all (equivalence classes of) real-valued progressively measurable processes $u=(u_{s})_{s\in[t,T]}$ such that $\lVert u\rVert_{\mathcal{L}_{t}^{2}}=(E[\int_{t}^{T}u_{s}^{2}\,ds])^{\frac{1}{2}}<\infty$.

The control problem we are about to set up requires as input the real-valued, $\mathcal{F}_{T}$-measurable random variable $\xi$ and the real-valued, progressively measurable processes $\mu=(\mu_{s})_{s\in[0,T]}$, $\sigma=(\sigma_{s})_{s\in[0,T]}$, $\rho=(\rho_{s})_{s\in[0,T]}$, $\eta=(\eta_{s})_{s\in[0,T]}$, $\overline{r}=(\overline{r}_{s})_{s\in[0,T]}$, $\zeta=(\zeta_{s})_{s\in[0,T]}$ and $\lambda=(\lambda_{s})_{s\in[0,T]}$. We suppose that $\mu$, $\sigma$, $\rho$, $\eta$ and $\lambda$ are $dP\times ds|_{[0,T]}$-a.e. bounded. Moreover, we assume that $\overline{r}$ is $[-1,1]$-valued. We define $W^{R}=(W^{R}_{s})_{s\in[0,T]}$ by $dW^{R}_{s}=\overline{r}_{s}\,dW^{1}_{s}+\sqrt{1-\overline{r}_{s}^{2}}\,dW^{2}_{s}$, $s\in[0,T]$, $W^{R}_{0}=0$, and refer to $\overline{r}$ as the correlation process. The processes $\rho$ and $\eta$ give rise to the continuous semimartingale $R=(R_{s})_{s\in[0,T]}$ with

dR_{s}=\rho_{s}\,ds+\eta_{s}\,dW^{R}_{s},\quad s\in[0,T],\quad R_{0}=0,  (3)

which is called the resilience process. We use the processes $\mu$ and $\sigma$ to define the positive continuous semimartingale $\gamma=(\gamma_{s})_{s\in[0,T]}$ by

d\gamma_{s}=\gamma_{s}(\mu_{s}\,ds+\sigma_{s}\,dW^{1}_{s}),\quad s\in[0,T],  (4)

with deterministic initial value $\gamma_{0}>0$. We refer to $\gamma$ as the price impact process. Finally, we assume that $\xi$ and $\zeta$ satisfy the integrability conditions

E[\gamma_{T}\xi^{2}]<\infty\quad\text{and}\quad E\left[\int_{0}^{T}\gamma_{s}\zeta_{s}^{2}\,ds\right]<\infty.  (5)
Remark 1.1.

Note that the components $W^{3},\ldots,W^{m}$ of the Brownian motion are not needed in the dynamics (3) and (4). We introduce these components already here because in Section 3, in order to apply results from the literature on LQ stochastic control, we restrict the present setting a little by assuming that the filtration $(\mathcal{F}_{s})_{s\in[0,T]}$ is generated by $(W^{1},\ldots,W^{m})^{\top}$. The components $W^{3},\ldots,W^{m}$ will therefore serve as further sources of randomness on which the model inputs may depend.

We next introduce the finite variation strategies that we consider in the sequel. Given $t\in[0,T]$ and $d\in\mathbb{R}$, we associate to an adapted, càdlàg, finite variation process $X=(X_{s})_{s\in[t-,T]}$ a process $D^{X}=(D^{X}_{s})_{s\in[t-,T]}$ defined by

dD^{X}_{s}=-D^{X}_{s}\,dR_{s}+\gamma_{s}\,dX_{s},\quad s\in[t,T],\quad D^{X}_{t-}=d.  (6)

If there is no risk of confusion, we sometimes simply write $D$ instead of $D^{X}$ in the sequel. For $t\in[0,T]$ and $x,d\in\mathbb{R}$ we denote by $\mathcal{A}^{fv}_{t}(x,d)$ the set of all adapted, càdlàg, finite variation processes $X=(X_{s})_{s\in[t-,T]}$ satisfying $X_{t-}=x$, $X_{T}=\xi$, and

  (A1) $E\left[\int_{t}^{T}\gamma_{s}^{-1}(D_{s}^{X})^{2}\,ds\right]<\infty$,

  (A2) $E\left[\left(\int_{t}^{T}(D_{s}^{X})^{4}\gamma_{s}^{-2}\eta_{s}^{2}\,ds\right)^{\frac{1}{2}}\right]<\infty$,

  (A3) $E\left[\left(\int_{t}^{T}(D_{s}^{X})^{4}\gamma_{s}^{-2}\sigma_{s}^{2}\,ds\right)^{\frac{1}{2}}\right]<\infty$.

Any element $X\in\mathcal{A}^{fv}_{t}(x,d)$ is called a finite variation execution strategy. The process $D=D^{X}$ defined via (6) is called the associated deviation process.

For $t\in[0,T]$, $x,d\in\mathbb{R}$, $X\in\mathcal{A}^{fv}_{t}(x,d)$ and associated $D^{X}$, the cost functional $J^{fv}$ is given by

J^{fv}_{t}(x,d,X)=E_{t}\left[\int_{[t,T]}D^{X}_{s-}\,dX_{s}+\frac{1}{2}\int_{[t,T]}\Delta X_{s}\gamma_{s}\,dX_{s}+\int_{t}^{T}\lambda_{s}\gamma_{s}\left(X_{s}-\zeta_{s}\right)^{2}ds\right]  (7)

(see the proofs of Proposition 1.4 and Lemma 1.6 for well-definedness). The finite variation stochastic control problem consists of minimizing the cost functional $J^{fv}$ over $X\in\mathcal{A}^{fv}_{t}(x,d)$.

1.2 Alternative representations for the cost functional and the deviation process

For $t\in[0,T]$ we introduce an auxiliary process $\nu=(\nu_{s})_{s\in[t,T]}$. It is defined to be the solution of

d\nu_{s}=\nu_{s}\,d\left(R_{s}+[R]_{s}\right),\quad s\in[t,T],\quad\nu_{t}=1.  (8)

Observe that the inverse is given by

d\nu_{s}^{-1}=-\nu_{s}^{-1}\,dR_{s},\quad s\in[t,T],\quad\nu_{t}^{-1}=1.  (9)
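As a quick consistency check (a standard Itô computation, included here for convenience), one can verify that (9) indeed defines the reciprocal of the solution of (8): since $R$ is continuous, the Itô product rule gives

```latex
d\big(\nu_{s}\,\nu_{s}^{-1}\big)
 = \nu_{s}^{-1}\,d\nu_{s}+\nu_{s}\,d\nu_{s}^{-1}+d[\nu,\nu^{-1}]_{s}
 = \nu_{s}\nu_{s}^{-1}\big(dR_{s}+d[R]_{s}\big)
   -\nu_{s}\nu_{s}^{-1}\,dR_{s}
   -\nu_{s}\nu_{s}^{-1}\,d[R]_{s}
 = 0,
```

where the covariation term comes from the martingale parts $\nu_{s}\eta_{s}\,dW^{R}_{s}$ and $-\nu_{s}^{-1}\eta_{s}\,dW^{R}_{s}$, so that $\nu_{s}\nu_{s}^{-1}=\nu_{t}\nu_{t}^{-1}=1$ on $[t,T]$.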
Remark 1.2.

Let $t\in[0,T]$ and $d\in\mathbb{R}$. With the definition of $\nu$ in (8), it holds for all adapted, càdlàg, finite variation processes $X=(X_{s})_{s\in[t-,T]}$ that the solution $D^{X}=(D^{X}_{s})_{s\in[t-,T]}$ of the linear SDE (6) reads $D_{s}^{X}=\nu_{s}^{-1}(d+\int_{[t,s]}\nu_{r}\gamma_{r}\,dX_{r})$, $s\in[t,T]$.
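In the deterministic special case $\eta\equiv 0$ with $\rho$ and $\gamma$ constant (so that $[R]=0$ and $\nu_{s}=e^{\rho(s-t)}$), the formula of Remark 1.2 can be checked against a direct Euler discretization of (6). The following sketch (ours, with assumed illustrative parameters) does this for a linear liquidation path started at $t=0$:

```python
import numpy as np

# assumed illustrative constants; eta = 0, so [R] = 0 and nu_s = exp(rho * s)
rho, gamma, d0, x0, T, N = 2.0, 1.5, 0.3, 1.0, 1.0, 50000
dt = T / N
s = np.linspace(0.0, T, N + 1)

# closed form from the remark for the linear path X_s = x0 * (1 - s/T):
# D_s = exp(-rho*s)*d0 - (gamma*x0/(rho*T)) * (1 - exp(-rho*s))
D_formula = np.exp(-rho * s) * d0 - (gamma * x0 / (rho * T)) * (1 - np.exp(-rho * s))

# Euler scheme for dD = -D dR + gamma dX with dR = rho ds and dX = -(x0/T) ds
D = np.empty(N + 1)
D[0] = d0
for k in range(N):
    D[k + 1] = D[k] - D[k] * rho * dt - gamma * (x0 / T) * dt

print(float(np.max(np.abs(D - D_formula))))
```

The maximal discrepancy shrinks linearly in the step size, as expected for an Euler scheme applied to this linear equation.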

Proposition 1.3.

Let $t\in[0,T]$ and $x,d\in\mathbb{R}$. Suppose that $X=(X_{s})_{s\in[t-,T]}$ is an adapted, càdlàg, finite variation process with $X_{t-}=x$ and with associated process $D^{X}$ defined by (6). It then holds that

\int_{[t,T]}D^{X}_{s-}\,dX_{s}+\frac{1}{2}\int_{[t,T]}\Delta X_{s}\gamma_{s}\,dX_{s}=\frac{1}{2}\left(\gamma_{T}^{-1}(D^{X}_{T})^{2}-\gamma_{t}^{-1}d^{2}-\int_{t}^{T}(D^{X}_{s})^{2}\nu_{s}^{2}\,d\left(\nu^{-2}_{s}\gamma_{s}^{-1}\right)\right)  (10)

and

DrX=γrXr+νr1(dγtxtrXsd(νsγs)),r[t,T].\begin{split}D^{X}_{r}&=\gamma_{r}X_{r}+\nu_{r}^{-1}\left(d-\gamma_{t}x-\int_{t}^{r}X_{s}d(\nu_{s}\gamma_{s})\right),\quad r\in[t,T].\end{split} (11)
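Representation (11) can be checked numerically in a deterministic toy case: take a constant price impact $\gamma_{s}\equiv\gamma_{0}$ and $R_{s}=\rho s$ (so $[R]=0$ and $\nu_{s}=e^{\rho(s-t)}$) together with a smooth strategy; then (6) becomes the ODE $dD=-\rho D\,ds+\gamma_{0}\,dX$, which can be integrated by an Euler scheme and compared with the right-hand side of (11). All concrete values below are illustrative.

```python
import numpy as np

gamma0, rho, t, T = 2.0, 1.5, 0.0, 1.0
x, d = 5.0, 1.0
n = 200_000
s = np.linspace(t, T, n + 1)
ds = (T - t) / n

X = x + (s - t) ** 2        # a smooth strategy with X_t = x (no initial jump)
nu = np.exp(rho * (s - t))  # solves (8), since [R] = 0 here

# Euler scheme for the deviation dynamics (6): dD = -D dR + gamma dX
D_euler = np.empty(n + 1)
D_euler[0] = d
for k in range(n):
    D_euler[k + 1] = D_euler[k] - D_euler[k] * rho * ds + gamma0 * (X[k + 1] - X[k])

# Representation (11): D_r = gamma X_r + nu_r^{-1} (d - gamma x - int_t^r X d(nu gamma))
d_nugamma = gamma0 * np.diff(nu)
integral = np.concatenate([[0.0], np.cumsum(X[:-1] * d_nugamma)])
D_formula = gamma0 * X + (d - gamma0 * x - integral) / nu

assert np.max(np.abs(D_euler - D_formula)) < 5e-3  # agreement up to discretization error
```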

As a consequence of Proposition 1.3, and relying on (A1)–(A3), we can rewrite the cost functional $J^{fv}$ as follows.⁶ To shorten notation, we introduce the process $\kappa=(\kappa_{s})_{s\in[0,T]}$ defined by

$$\kappa_{s}=\frac{1}{2}\big(2\rho_{s}+\mu_{s}-\sigma_{s}^{2}-\eta_{s}^{2}-2\sigma_{s}\eta_{s}\overline{r}_{s}\big),\quad s\in[0,T]. \tag{12}$$

⁶ Analogues of Proposition 1.4 are present in the literature in other related settings; see, e.g., Lemmas 7.4 and 8.6 in [24] and the proof of Lemma 5.3 in Appendix B of [25]. A small technical point worth noting is that we present a somewhat different proof below. The idea in [24, 25] is to derive an analogue of (10) by applying the substitution $dX_{s}=\gamma_{s}^{-1}(dD^{X}_{s}+D^{X}_{s}dR_{s})$ and then to compute the expectation. Exactly the same idea would also work in our present setting, but it would result in more involved calculations and, moreover, the right-hand side of (10) would then look rather different (though it would, of course, be an equivalent representation). The reason for this is that the process $R$, and hence $D^{X}$, can have nonvanishing quadratic variation. Here we essentially express everything not through $D^{X}$ but rather through $\nu D^{X}$, which has finite variation by Remark 1.2 (as $X$ has finite variation here). This reduces the calculations and yields a somewhat more compact form of (10).
Proposition 1.4.

Let $t\in[0,T]$ and $x,d\in\mathbb{R}$. Suppose that $X\in\mathcal{A}_{t}^{fv}(x,d)$ with associated deviation process $D^{X}$ defined by (6). It then holds that $J^{fv}_{t}(x,d,X)$ in (7) admits the representation

$$J^{fv}_{t}(x,d,X)=\frac{1}{2}E_{t}\left[\gamma_{T}^{-1}(D_{T}^{X})^{2}+\int_{t}^{T}(D^{X}_{s})^{2}\gamma_{s}^{-1}2\kappa_{s}\,ds+\int_{t}^{T}2\lambda_{s}\gamma_{s}\left(X_{s}-\zeta_{s}\right)^{2}ds\right]-\frac{d^{2}}{2\gamma_{t}}\ \text{ a.s.} \tag{13}$$

1.3 Progressively measurable execution strategies

We point out that the right-hand side of (13) is also well-defined for progressively measurable processes $X$ satisfying an appropriate integrability condition and with associated deviation $D$ defined by (11), for which one assumes (A1). This motivates the following extension of the setting from Section 1.1.

For $t\in[0,T]$, $x,d\in\mathbb{R}$ and a progressively measurable process $X=(X_{s})_{s\in[t-,T]}$ such that $\int_{t}^{T}X_{s}^{2}\,ds<\infty$ a.s. and $X_{t-}=x$, we define the process $D^{X}=(D^{X}_{s})_{s\in[t-,T]}$ by

$$D^{X}_{s}=\gamma_{s}X_{s}+\nu_{s}^{-1}\left(d-\gamma_{t}x-\int_{t}^{s}X_{r}\,d(\nu_{r}\gamma_{r})\right),\quad s\in[t,T],\qquad D^{X}_{t-}=d \tag{14}$$

(recall $\nu$ from (8)). Notice that the condition $\int_{t}^{T}X_{s}^{2}\,ds<\infty$ a.s. ensures that the stochastic integral in (14) is well-defined. Again, we sometimes write $D$ instead of $D^{X}$. Further, for $t\in[0,T]$, $x,d\in\mathbb{R}$, let $\mathcal{A}^{pm}_{t}(x,d)$ be the set of (equivalence classes of) progressively measurable processes $X=(X_{s})_{s\in[t-,T]}$ with $X_{t-}=x$ and $X_{T}=\xi$ that satisfy $\int_{t}^{T}X_{s}^{2}\,ds<\infty$ a.s. and such that condition (A1) holds true for $D^{X}$ defined by (14). To be precise, we stress that the equivalence classes for $\mathcal{A}_{t}^{pm}(x,d)$ are understood with respect to the equivalence relation

$$X^{(1)}\sim X^{(2)}\ \text{ means }\ X^{(1)}_{\cdot}=X^{(2)}_{\cdot}\;\;dP\times ds\text{-a.e. on }\Omega\times[t,T],\quad X^{(1)}_{t-}=X^{(2)}_{t-}\,(=x)\ \text{ and }\ X^{(1)}_{T}=X^{(2)}_{T}\,(=\xi). \tag{15}$$

Any element $X\in\mathcal{A}^{pm}_{t}(x,d)$ is called a progressively measurable execution strategy. Again, the process $D=D^{X}$, now defined via (14), is called the associated deviation process. Clearly, we have that $\mathcal{A}^{fv}_{t}(x,d)\subseteq\mathcal{A}^{pm}_{t}(x,d)$.

Given $t\in[0,T]$, $x,d\in\mathbb{R}$, and $X\in\mathcal{A}^{pm}_{t}(x,d)$ with associated $D^{X}$ (see (14)), we define the cost functional $J^{pm}$ by

$$J^{pm}_{t}(x,d,X)=\frac{1}{2}E_{t}\left[\gamma_{T}^{-1}(D_{T}^{X})^{2}+\int_{t}^{T}(D^{X}_{s})^{2}\gamma_{s}^{-1}2\kappa_{s}\,ds+\int_{t}^{T}2\lambda_{s}\gamma_{s}\left(X_{s}-\zeta_{s}\right)^{2}ds\right]-\frac{d^{2}}{2\gamma_{t}}. \tag{16}$$

Observe that we have the following corollary of Propositions 1.3 and 1.4.

Corollary 1.5.

Let $t\in[0,T]$, $x,d\in\mathbb{R}$, and $X\in\mathcal{A}^{fv}_{t}(x,d)$ with associated deviation process $D^{X}$ given by (6). It then holds that $X\in\mathcal{A}^{pm}_{t}(x,d)$, that $D^{X}$ satisfies (14), and that $J^{fv}_{t}(x,d,X)=J^{pm}_{t}(x,d,X)$.

1.4 The hidden deviation process

For $t\in[0,T]$, $x,d\in\mathbb{R}$, and $X\in\mathcal{A}^{pm}_{t}(x,d)$ with associated deviation process $D^{X}$, we define $H^{X}=(H^{X}_{s})_{s\in[t,T]}$ by $H^{X}_{s}=D^{X}_{s}-\gamma_{s}X_{s}$, $s\in[t,T]$. Observe that if the investor followed a finite variation execution strategy $X\in\mathcal{A}^{fv}_{t}(x,d)$ until time $s\in[t,T]$ and then decided to sell $X_{s}$ units of the asset ($X_{s}<0$ means buying) at time $s$, then by (6) the resulting deviation at time $s$ would equal $D^{X}_{s}-\gamma_{s}X_{s}$. The value of $H^{X}_{s}$ hence represents the hypothetical deviation if the investor decides to close the position at time $s\in[t,T]$. We therefore call $H^{X}$ the hidden deviation process. Although $X\in\mathcal{A}^{pm}_{t}(x,d)$ and $D^{X}$ are in general discontinuous, the hidden deviation process $H^{X}$ is always continuous. This can be seen from (14) and the fact that $R$ (hence also $\nu$) and $\gamma$ are continuous. In the case of a finite variation execution strategy $X\in\mathcal{A}^{fv}_{t}(x,d)$, it holds that $dH^{X}_{s}=-D_{s}\,dR_{s}-X_{s}\,d\gamma_{s}$, $s\in[t,T]$. In particular, the infinitesimal change of the hidden deviation is driven by the changes of the resilience process and the price impact process.

For $t\in[0,T]$, $x,d\in\mathbb{R}$, and $X\in\mathcal{A}^{pm}_{t}(x,d)$, we furthermore introduce the scaled hidden deviation⁷ $\overline{H}^{X}=(\overline{H}^{X}_{s})_{s\in[t,T]}$ defined by

$$\overline{H}_{s}^{X}=\gamma_{s}^{-\frac{1}{2}}H^{X}_{s}=\gamma_{s}^{-\frac{1}{2}}D^{X}_{s}-\gamma_{s}^{\frac{1}{2}}X_{s},\quad s\in[t,T]. \tag{17}$$

Also for $H^{X}$ and $\overline{H}^{X}$ we sometimes simply write $H$ and $\overline{H}$, respectively. Note that, due to (14), it holds that $\overline{H}^{X}_{s}=\gamma_{s}^{-\frac{1}{2}}\nu_{s}^{-1}\left(d-\gamma_{t}x-\int_{t}^{s}X_{r}\,d(\nu_{r}\gamma_{r})\right)$, $s\in[t,T]$.

⁷ From the mathematical viewpoint, the scaled hidden deviation plays an extremely important role in what follows. It is, therefore, instructive to see in what kind of units it is measured. The meaning of $X$ is quantity (of shares), while both $D^{X}$ and $\gamma$ are measured in \$. Thus, the scaled hidden deviation $\overline{H}^{X}$ is measured in $\sqrt{\$}$.
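The continuity of $H$ (and $\overline{H}$) can be seen concretely from the integral formula above: it involves $X$ only through an integral against the continuous process $\nu\gamma$, so a jump of $X$ jumps $D^{X}$ but cancels in $D^{X}-\gamma X$. A deterministic toy check (constant $\gamma$, $R_{s}=\rho s$, all values illustrative) with a block trade in the middle of the horizon:

```python
import numpy as np

gamma0, rho = 1.0, 2.0
x, d = 4.0, 0.5
n = 100_000
s = np.linspace(0.0, 1.0, n + 1)

# Strategy with a block sale of one unit at time 0.5
X = x - (s >= 0.5).astype(float)
nu = np.exp(rho * s)

# Deviation via (14), then hidden deviation H = D - gamma X
d_nugamma = gamma0 * np.diff(nu)
integral = np.concatenate([[0.0], np.cumsum(X[:-1] * d_nugamma)])
D = gamma0 * X + (d - gamma0 * x - integral) / nu
H = D - gamma0 * X

k = int(np.argmax(s >= 0.5))   # grid index of the block trade
jump_D = D[k] - D[k - 1]       # D inherits the jump gamma * Delta X = -gamma0
jump_H = H[k] - H[k - 1]       # H is (numerically) continuous across the trade

assert abs(jump_D + gamma0) < 1e-3
assert abs(jump_H) < 1e-3
```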

We next show that the scaled hidden deviation process satisfies a linear SDE and an $L^{2}$-bound. Moreover, we derive a representation of $J^{pm}$ in terms of the scaled hidden deviation process.

Lemma 1.6.

Let $t\in[0,T]$, $x,d\in\mathbb{R}$, and $X\in\mathcal{A}^{pm}_{t}(x,d)$. Then it holds that

$$\begin{split}d\overline{H}^{X}_{s}&=\left(\frac{1}{2}\left(\mu_{s}-\frac{1}{4}\sigma_{s}^{2}\right)\overline{H}_{s}^{X}-\frac{1}{2}\left(2(\rho_{s}+\mu_{s})-\sigma_{s}^{2}-\sigma_{s}\eta_{s}\overline{r}_{s}\right)\gamma_{s}^{-\frac{1}{2}}D^{X}_{s}\right)ds\\&\quad+\left(\frac{1}{2}\sigma_{s}\overline{H}^{X}_{s}-(\sigma_{s}+\eta_{s}\overline{r}_{s})\gamma_{s}^{-\frac{1}{2}}D^{X}_{s}\right)dW^{1}_{s}-\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\,\gamma_{s}^{-\frac{1}{2}}D^{X}_{s}\,dW_{s}^{2},\quad s\in[t,T],\\ \overline{H}^{X}_{t}&=\frac{d}{\sqrt{\gamma_{t}}}-\sqrt{\gamma_{t}}\,x,\end{split} \tag{18}$$

that $E[\sup_{s\in[t,T]}(\overline{H}_{s}^{X})^{2}]<\infty$, and that

$$\begin{split}J^{pm}_{t}(x,d,X)&=\frac{1}{2}E_{t}\bigg[\big(\overline{H}^{X}_{T}+\sqrt{\gamma_{T}}\,\xi\big)^{2}+\int_{t}^{T}2(\kappa_{s}+\lambda_{s})\gamma_{s}^{-1}(D_{s}^{X})^{2}\,ds\bigg]-\frac{d^{2}}{2\gamma_{t}}\\&\quad+E_{t}\bigg[\int_{t}^{T}\left(\lambda_{s}\left(\overline{H}^{X}_{s}+\sqrt{\gamma_{s}}\,\zeta_{s}\right)^{2}-2\lambda_{s}\left(\overline{H}^{X}_{s}+\sqrt{\gamma_{s}}\,\zeta_{s}\right)\gamma_{s}^{-\frac{1}{2}}D_{s}^{X}\right)ds\bigg].\end{split} \tag{19}$$

1.5 Continuous extension of the cost functional

Corollary 1.5 states that, for finite variation execution strategies, the cost functionals $J^{fv}$ and $J^{pm}$ coincide. In this subsection we show that $J^{pm}$ can be considered as an extension of $J^{fv}$ to progressively measurable strategies; i.e., we introduce a metric $\mathbf{d}$ on $\mathcal{A}_{t}^{pm}(x,d)$ and show that $J^{pm}_{t}(x,d,X)$ is continuous in the strategy $X\in\mathcal{A}_{t}^{pm}(x,d)$ (the first part of Theorem 1.7), that $\mathcal{A}^{fv}_{t}(x,d)$ is dense in $\mathcal{A}^{pm}_{t}(x,d)$ (the second part of Theorem 1.7), and that the metric space $(\mathcal{A}_{t}^{pm}(x,d),\mathbf{d})$ is complete (the third part of Theorem 1.7). The first and second parts of Theorem 1.7 mean that, under the metric $\mathbf{d}$, $J_{t}^{pm}(x,d,\cdot)$ is the unique continuous extension of $J_{t}^{fv}(x,d,\cdot)$ from $\mathcal{A}_{t}^{fv}(x,d)$ onto $\mathcal{A}_{t}^{pm}(x,d)$. The third part of Theorem 1.7 means that, under the metric $\mathbf{d}$, $\mathcal{A}_{t}^{pm}(x,d)$ is the largest space where such a continuous extension is uniquely determined by $J_{t}^{fv}(x,d,\cdot)$ on $\mathcal{A}_{t}^{fv}(x,d)$. This is because the completeness of $(\mathcal{A}_{t}^{pm}(x,d),\mathbf{d})$ is equivalent to the following statement: for any metric space $(\widehat{\mathcal{A}}_{t}(x,d),\widehat{\mathbf{d}})$ containing $\mathcal{A}_{t}^{pm}(x,d)$ and such that $\widehat{\mathbf{d}}|_{\mathcal{A}_{t}^{pm}(x,d)^{2}}=\mathbf{d}$, the set $\mathcal{A}_{t}^{pm}(x,d)$ is closed in $\widehat{\mathcal{A}}_{t}(x,d)$.

For $t\in[0,T]$, $x,d\in\mathbb{R}$, and $X,Y\in\mathcal{A}_{t}^{pm}(x,d)$ with associated deviation processes $D^{X}$, $D^{Y}$ defined by (14), we define

$$\mathbf{d}(X,Y)=\left(E\left[\int_{t}^{T}(D_{s}^{X}-D_{s}^{Y})^{2}\gamma_{s}^{-1}\,ds\right]\right)^{\frac{1}{2}}. \tag{20}$$

Identifying any processes that are equal $dP\times ds|_{[t,T]}$-a.e., this is indeed a metric on $\mathcal{A}_{t}^{pm}(x,d)$; see Lemma 5.2.
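In a deterministic setting the expectation in (20) drops out and $\mathbf{d}$ becomes a weighted $L^{2}$-distance between the deviation paths. The sketch below (constant coefficients, illustrative values) computes $\mathbf{d}$ for a few liquidation strategies and checks the metric axioms numerically:

```python
import numpy as np

gamma0, rho = 1.0, 1.0
x, d = 3.0, 0.0
n = 10_000
s = np.linspace(0.0, 1.0, n + 1)
ds = 1.0 / n
nu = np.exp(rho * s)

def deviation(X):
    """Deviation (14) for a deterministic path X (constant gamma, R_s = rho s)."""
    d_nugamma = gamma0 * np.diff(nu)
    integral = np.concatenate([[0.0], np.cumsum(X[:-1] * d_nugamma)])
    return gamma0 * X + (d - gamma0 * x - integral) / nu

def dist(X, Y):
    """Metric (20) in the deterministic case: weighted L^2 distance of deviations."""
    diff2 = (deviation(X) - deviation(Y)) ** 2 / gamma0
    return np.sqrt(np.sum(diff2[:-1]) * ds)   # left Riemann sum of the integral

X1 = x * (1.0 - s)              # linear liquidation
X2 = x * (1.0 - s) ** 2         # front-loaded liquidation
X3 = x * np.cos(np.pi * s / 2)

assert dist(X1, X1) == 0.0                                  # identity of indiscernibles
assert abs(dist(X1, X2) - dist(X2, X1)) < 1e-12             # symmetry
assert dist(X1, X3) <= dist(X1, X2) + dist(X2, X3) + 1e-12  # triangle inequality
```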

Note that, for fixed $t\in[0,T]$ and $x,d\in\mathbb{R}$, we may consider the cost functional (16) as a function $J^{pm}_{t}(x,d,\cdot)\colon(\mathcal{A}_{t}^{pm}(x,d),\mathbf{d})\to(L^{1}(\Omega,\mathcal{F}_{t},P),\lVert\cdot\rVert_{L^{1}})$. Indeed, using (A1), Lemma 1.6, (5), and the boundedness of the input processes, we see that $J^{pm}_{t}(x,d,X)\in L^{1}(\Omega,\mathcal{F}_{t},P)$ for all $X\in\mathcal{A}_{t}^{pm}(x,d)$.

Theorem 1.7.

Let $t\in[0,T]$ and $x,d\in\mathbb{R}$.

(i) Suppose that $X\in\mathcal{A}^{pm}_{t}(x,d)$. For every sequence $(X^{n})_{n\in\mathbb{N}}$ in $\mathcal{A}^{pm}_{t}(x,d)$ with $\lim_{n\to\infty}\mathbf{d}(X^{n},X)=0$ it holds that $\lim_{n\to\infty}\lVert J^{pm}_{t}(x,d,X^{n})-J^{pm}_{t}(x,d,X)\rVert_{L^{1}}=0$.

(ii) For any $X\in\mathcal{A}^{pm}_{t}(x,d)$ there exists a sequence $(X^{n})_{n\in\mathbb{N}}$ in $\mathcal{A}^{fv}_{t}(x,d)$ such that $\lim_{n\to\infty}\mathbf{d}(X^{n},X)=0$. In particular, it holds that

$$\operatorname*{ess\,inf}_{X\in\mathcal{A}^{fv}_{t}(x,d)}J^{fv}_{t}(x,d,X)=\operatorname*{ess\,inf}_{X\in\mathcal{A}^{pm}_{t}(x,d)}J^{pm}_{t}(x,d,X). \tag{21}$$

(iii) For any Cauchy sequence $(X^{n})_{n\in\mathbb{N}}$ in $(\mathcal{A}_{t}^{pm}(x,d),\mathbf{d})$ there exists some $X^{0}\in\mathcal{A}_{t}^{pm}(x,d)$ such that $\lim_{n\to\infty}\mathbf{d}(X^{n},X^{0})=0$.

In Corollary 3.4 below we provide sufficient conditions that ensure that the infimum on the right-hand side of (21) is indeed a minimum.

2 Reduction to a standard LQ stochastic control problem

In this section we recast the problem of minimizing $J^{pm}$ over $X\in\mathcal{A}_{t}^{pm}(x,d)$ as a standard LQ stochastic control problem. All proofs of this section are given in Section 5.

2.1 The first reduction

Note that (19) in Lemma 1.6 shows that for $t\in[0,T]$, $x,d\in\mathbb{R}$, and $X\in\mathcal{A}^{pm}_{t}(x,d)$ the costs $J^{pm}_{t}(x,d,X)$ depend in a quadratic way on $(\overline{H}^{X},\gamma^{-\frac{1}{2}}D^{X})$. Moreover, (18) in Lemma 1.6 ensures that the dynamics of $\overline{H}^{X}$ depend linearly on $(\overline{H}^{X},\gamma^{-\frac{1}{2}}D^{X})$. These two observations suggest to view the minimization problem of $J^{pm}$ over $X\in\mathcal{A}^{pm}_{t}(x,d)$ as a standard LQ stochastic control problem with state process $\overline{H}^{X}$ and control $\gamma^{-\frac{1}{2}}D^{X}$. This motivates the following definitions. For every $t\in[0,T]$, $x,d\in\mathbb{R}$, and $u\in\mathcal{L}_{t}^{2}$, we consider the state process $\widetilde{H}^{u}=(\widetilde{H}^{u}_{s})_{s\in[t,T]}$ defined by

$$\begin{split}d\widetilde{H}^{u}_{s}&=\left(\frac{1}{2}\left(\mu_{s}-\frac{1}{4}\sigma_{s}^{2}\right)\widetilde{H}^{u}_{s}-\frac{1}{2}\left(2(\rho_{s}+\mu_{s})-\sigma_{s}^{2}-\sigma_{s}\eta_{s}\overline{r}_{s}\right)u_{s}\right)ds\\&\quad+\left(\frac{1}{2}\sigma_{s}\widetilde{H}^{u}_{s}-(\sigma_{s}+\eta_{s}\overline{r}_{s})u_{s}\right)dW^{1}_{s}-\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\,u_{s}\,dW_{s}^{2},\quad s\in[t,T],\\ \widetilde{H}^{u}_{t}&=\frac{d}{\sqrt{\gamma_{t}}}-\sqrt{\gamma_{t}}\,x,\end{split} \tag{22}$$

and the cost functional $J$ defined by

$$\begin{split}J_{t}\left(\frac{d}{\sqrt{\gamma_{t}}}-\sqrt{\gamma_{t}}\,x,u\right)&=\frac{1}{2}E_{t}\bigg[\big(\widetilde{H}^{u}_{T}+\sqrt{\gamma_{T}}\,\xi\big)^{2}+\int_{t}^{T}2(\kappa_{s}+\lambda_{s})u_{s}^{2}\,ds\\&\quad\quad\quad+\int_{t}^{T}\left(2\lambda_{s}\left(\widetilde{H}^{u}_{s}+\sqrt{\gamma_{s}}\,\zeta_{s}\right)^{2}-4\lambda_{s}\left(\widetilde{H}^{u}_{s}+\sqrt{\gamma_{s}}\,\zeta_{s}\right)u_{s}\right)ds\bigg].\end{split} \tag{23}$$

Once again we sometimes simply write $\widetilde{H}$ instead of $\widetilde{H}^{u}$. The LQ stochastic control problem is to minimize (23) over the set of admissible controls $\mathcal{L}_{t}^{2}$.

It holds that for every progressively measurable execution strategy $X\in\mathcal{A}_{t}^{pm}(x,d)$ there exists a control $u\in\mathcal{L}_{t}^{2}$ such that the cost functional $J^{pm}$ can be rewritten in terms of $J$ (and $-\frac{d^{2}}{2\gamma_{t}}$). In fact, this is achieved by taking $u=\gamma^{-\frac{1}{2}}D^{X}$, as outlined in the motivation above. We state this as Lemma 2.1.

Lemma 2.1.

Let $t\in[0,T]$ and $x,d\in\mathbb{R}$. Suppose that $X\in\mathcal{A}^{pm}_{t}(x,d)$ with associated deviation $D^{X}$. Define $u=(u_{s})_{s\in[t,T]}$ by $u_{s}=\gamma_{s}^{-\frac{1}{2}}D^{X}_{s}$, $s\in[t,T]$. It then holds that $u\in\mathcal{L}_{t}^{2}$ and that $J^{pm}_{t}(x,d,X)=J_{t}(\frac{d}{\sqrt{\gamma_{t}}}-\sqrt{\gamma_{t}}\,x,u)-\frac{d^{2}}{2\gamma_{t}}$ a.s.

On the other hand, we may also start with $u\in\mathcal{L}_{t}^{2}$ and derive a progressively measurable execution strategy $X\in\mathcal{A}_{t}^{pm}(x,d)$ such that the expected costs match.

Lemma 2.2.

Let $t\in[0,T]$ and $x,d\in\mathbb{R}$. Suppose that $u=(u_{s})_{s\in[t,T]}\in\mathcal{L}_{t}^{2}$ and let $\widetilde{H}^{u}$ be the associated solution of (22). Define $X=(X_{s})_{s\in[t-,T]}$ by $X_{s}=\gamma_{s}^{-\frac{1}{2}}(u_{s}-\widetilde{H}^{u}_{s})$, $s\in[t,T)$, $X_{t-}=x$, $X_{T}=\xi$. It then holds that $X\in\mathcal{A}_{t}^{pm}(x,d)$ and that $J^{pm}_{t}(x,d,X)=J_{t}(\frac{d}{\sqrt{\gamma_{t}}}-\sqrt{\gamma_{t}}\,x,u)-\frac{d^{2}}{2\gamma_{t}}$ a.s.
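The maps in Lemmas 2.1 and 2.2 are inverse to each other: given $X$, set $u=\gamma^{-\frac{1}{2}}D^{X}$; then $\widetilde{H}^{u}=\overline{H}^{X}$ and $\gamma^{-\frac{1}{2}}(u-\widetilde{H}^{u})=\gamma^{-\frac{1}{2}}\big(\gamma^{-\frac{1}{2}}D^{X}-\gamma^{-\frac{1}{2}}D^{X}+\gamma^{\frac{1}{2}}X\big)=X$. This algebra can be checked on a discretized deterministic path (constants illustrative):

```python
import numpy as np

gamma0, rho = 2.0, 1.0
x, d = 3.0, 0.5
n = 10_000
s = np.linspace(0.0, 1.0, n + 1)
nu = np.exp(rho * s)

X = x * (1.0 - s)  # a deterministic liquidation strategy

# Deviation via (14), then control u = gamma^{-1/2} D^X (Lemma 2.1)
d_nugamma = gamma0 * np.diff(nu)
integral = np.concatenate([[0.0], np.cumsum(X[:-1] * d_nugamma)])
D = gamma0 * X + (d - gamma0 * x - integral) / nu
u = D / np.sqrt(gamma0)

# Scaled hidden deviation (17); for this control it coincides with the state of (22)
H_bar = D / np.sqrt(gamma0) - np.sqrt(gamma0) * X

# Lemma 2.2 recovers the strategy: X = gamma^{-1/2} (u - H~^u)
X_back = (u - H_bar) / np.sqrt(gamma0)

assert np.max(np.abs(X_back - X)) < 1e-12  # round trip returns the original strategy
```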

Lemma 2.1 and Lemma 2.2, together with Theorem 1.7, establish the following equivalence of the control problems pertaining to $J^{fv}$, $J^{pm}$, and $J$.

Corollary 2.3.

For $t\in[0,T]$ and $x,d\in\mathbb{R}$ it holds that

$$\operatorname*{ess\,inf}_{X\in\mathcal{A}^{fv}_{t}(x,d)}J^{fv}_{t}(x,d,X)=\operatorname*{ess\,inf}_{X\in\mathcal{A}^{pm}_{t}(x,d)}J^{pm}_{t}(x,d,X)=\operatorname*{ess\,inf}_{u\in\mathcal{L}_{t}^{2}}J_{t}\left(\frac{d}{\sqrt{\gamma_{t}}}-\sqrt{\gamma_{t}}\,x,u\right)-\frac{d^{2}}{2\gamma_{t}}\ \text{ a.s.}$$

Furthermore, Lemma 2.1, Lemma 2.2, and Corollary 2.3 provide a method to obtain an optimal progressively measurable execution strategy, and potentially an optimal finite variation execution strategy, from the standard optimal control problem and vice versa.

Corollary 2.4.

Let $t\in[0,T]$ and $x,d\in\mathbb{R}$.

(i) Suppose that $X^{*}=(X^{*}_{s})_{s\in[t-,T]}\in\mathcal{A}_{t}^{pm}(x,d)$ minimizes $J^{pm}$ over $\mathcal{A}_{t}^{pm}(x,d)$ and let $D^{X^{*}}$ be the associated deviation process. Then $u^{*}=(u^{*}_{s})_{s\in[t,T]}$ defined by $u^{*}_{s}=\gamma_{s}^{-\frac{1}{2}}D^{X^{*}}_{s}$, $s\in[t,T]$, minimizes $J$ over $\mathcal{L}_{t}^{2}$.

(ii) Suppose that $u^{*}=(u^{*}_{s})_{s\in[t,T]}\in\mathcal{L}_{t}^{2}$ minimizes $J$ over $\mathcal{L}_{t}^{2}$ and let $\widetilde{H}^{u^{*}}$ be the associated solution of (22) for $u^{*}$. Then $X^{*}=(X^{*}_{s})_{s\in[t-,T]}$ defined by $X^{*}_{s}=\gamma_{s}^{-\frac{1}{2}}(u^{*}_{s}-\widetilde{H}^{u^{*}}_{s})$, $s\in[t,T)$, $X^{*}_{t-}=x$, $X^{*}_{T}=\xi$, minimizes $J^{pm}$ over $\mathcal{A}_{t}^{pm}(x,d)$.

Moreover, if $X^{*}\in\mathcal{A}_{t}^{fv}(x,d)$ (in the sense that there is an element of $\mathcal{A}_{t}^{fv}(x,d)$ within the equivalence class of $X^{*}$, see (15)), then $X^{*}$ minimizes $J^{fv}$ over $\mathcal{A}_{t}^{fv}(x,d)$.

2.2 Formulation without cross-terms

Note that the last integral in the definition (23) of the cost functional $J$ involves a product between the state process $\widetilde{H}^{u}$ and the control process $u$. A large part of the literature on LQ optimal control considers cost functionals that do not contain such cross-terms. In particular, this applies to [34], whose results we apply in Section 3 below. For this reason we provide in this subsection a reformulation of the control problem (22)–(23) that does not contain cross-terms. In order to carry out the necessary transformation, we need to impose a further condition on our model inputs. We assume that there exists a constant $C\in[0,\infty)$ such that for all $s\in[0,T]$ we have $P$-a.s. that

$$\lvert\lambda_{s}\rvert\leq C\lvert\lambda_{s}+\kappa_{s}\rvert. \tag{24}$$

Note that this assumption ensures that the set $\{\lambda_{s}+\kappa_{s}=0\}$ is a subset of $\{\lambda_{s}=0\}$ (up to a $P$-null set). For this reason, in the sequel we use the following convention.

Convention: Under (24) we always understand $\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}=0$ on the set $\{\lambda_{s}+\kappa_{s}=0\}$.

Now, in order to get rid of the cross-term in (23), we transform for $t\in[0,T]$ any control process $u\in\mathcal{L}_{t}^{2}$ in an affine way to $\hat{u}_{s}=u_{s}-\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}(\widetilde{H}^{u}_{s}+\sqrt{\gamma_{s}}\,\zeta_{s})$, $s\in[t,T]$. This leads to the new controlled state process $\widehat{H}^{\hat{u}}=(\widehat{H}^{\hat{u}}_{s})_{s\in[t,T]}$, which is defined for every $t\in[0,T]$, $x,d\in\mathbb{R}$, and $\hat{u}\in\mathcal{L}_{t}^{2}$ by

$$\begin{split}d\widehat{H}_{s}^{\hat{u}}&=\left(\frac{\mu_{s}}{2}-\frac{1}{8}\sigma_{s}^{2}-\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}\left(\rho_{s}+\mu_{s}-\frac{\sigma_{s}^{2}+\sigma_{s}\eta_{s}\overline{r}_{s}}{2}\right)\right)\widehat{H}_{s}^{\hat{u}}\,ds\\&\quad-\left(\rho_{s}+\mu_{s}-\frac{\sigma_{s}^{2}+\sigma_{s}\eta_{s}\overline{r}_{s}}{2}\right)\hat{u}_{s}\,ds-\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}\left(\rho_{s}+\mu_{s}-\frac{\sigma_{s}^{2}+\sigma_{s}\eta_{s}\overline{r}_{s}}{2}\right)\sqrt{\gamma_{s}}\,\zeta_{s}\,ds\\&\quad+\left(\frac{\sigma_{s}}{2}-\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}(\sigma_{s}+\eta_{s}\overline{r}_{s})\right)\widehat{H}_{s}^{\hat{u}}\,dW_{s}^{1}-(\sigma_{s}+\eta_{s}\overline{r}_{s})\hat{u}_{s}\,dW_{s}^{1}\\&\quad-\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}(\sigma_{s}+\eta_{s}\overline{r}_{s})\sqrt{\gamma_{s}}\,\zeta_{s}\,dW_{s}^{1}-\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\,\widehat{H}_{s}^{\hat{u}}\,dW_{s}^{2}\\&\quad-\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\,\hat{u}_{s}\,dW_{s}^{2}-\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\sqrt{\gamma_{s}}\,\zeta_{s}\,dW_{s}^{2},\quad s\in[t,T],\\ \widehat{H}_{t}^{\hat{u}}&=\frac{d}{\sqrt{\gamma_{t}}}-\sqrt{\gamma_{t}}\,x.\end{split} \tag{25}$$

The meaning of (25) is that we only reparametrize the control ($u\to\hat{u}$) but not the state variable ($\widehat{H}^{\hat{u}}=\widetilde{H}^{u}$); see Lemma 2.5 for the formal statement. For $t\in[0,T]$, $x,d\in\mathbb{R}$, $\hat{u}\in\mathcal{L}_{t}^{2}$ and associated $\widehat{H}^{\hat{u}}$, we define the cost functional $\hat{J}$ by

$$\hat{J}_{t}\left(\frac{d}{\sqrt{\gamma_{t}}}-\sqrt{\gamma_{t}}\,x,\hat{u}\right)=E_{t}\bigg[\frac{1}{2}\big(\widehat{H}^{\hat{u}}_{T}+\sqrt{\gamma_{T}}\,\xi\big)^{2}+\int_{t}^{T}\left(\frac{\lambda_{s}\kappa_{s}}{\lambda_{s}+\kappa_{s}}\left(\widehat{H}^{\hat{u}}_{s}+\sqrt{\gamma_{s}}\,\zeta_{s}\right)^{2}+(\lambda_{s}+\kappa_{s})\hat{u}_{s}^{2}\right)ds\bigg]. \tag{26}$$

This cost functional does not exhibit cross-terms but is equivalent to $J$ of (23) in the sense of the following lemma.

Lemma 2.5.

Assume that (24) holds true. Let $t\in[0,T]$ and $x,d\in\mathbb{R}$.

(i) Suppose that $u\in\mathcal{L}_{t}^{2}$ with associated state process $\widetilde{H}^{u}$ defined by (22). Then $\hat{u}=(\hat{u}_{s})_{s\in[t,T]}$ defined by $\hat{u}_{s}=u_{s}-\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}(\widetilde{H}^{u}_{s}+\sqrt{\gamma_{s}}\,\zeta_{s})$, $s\in[t,T]$, is in $\mathcal{L}_{t}^{2}$, and it holds that $\widehat{H}^{\hat{u}}=\widetilde{H}^{u}$ and $J_{t}(\frac{d}{\sqrt{\gamma_{t}}}-\sqrt{\gamma_{t}}\,x,u)=\hat{J}_{t}(\frac{d}{\sqrt{\gamma_{t}}}-\sqrt{\gamma_{t}}\,x,\hat{u})$.

(ii) Suppose that $\hat{u}\in\mathcal{L}_{t}^{2}$ with associated state process $\widehat{H}^{\hat{u}}$ defined by (25). Then $u=(u_{s})_{s\in[t,T]}$ defined by $u_{s}=\hat{u}_{s}+\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}(\widehat{H}^{\hat{u}}_{s}+\sqrt{\gamma_{s}}\,\zeta_{s})$, $s\in[t,T]$, is in $\mathcal{L}_{t}^{2}$, and it holds that $\widetilde{H}^{u}=\widehat{H}^{\hat{u}}$ and $J_{t}(\frac{d}{\sqrt{\gamma_{t}}}-\sqrt{\gamma_{t}}\,x,u)=\hat{J}_{t}(\frac{d}{\sqrt{\gamma_{t}}}-\sqrt{\gamma_{t}}\,x,\hat{u})$.
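The equality of the running costs behind Lemma 2.5 is a pointwise completing-the-square identity: with $h=\widetilde{H}^{u}_{s}+\sqrt{\gamma_{s}}\,\zeta_{s}$, $a=\frac{\lambda}{\lambda+\kappa}$ and $\hat{u}=u-ah$, one has $(\lambda+\kappa)\hat{u}^{2}+\frac{\lambda\kappa}{\lambda+\kappa}h^{2}=(\lambda+\kappa)u^{2}+\lambda h^{2}-2\lambda hu$, which is exactly one half of the $ds$-integrand of (23). A quick numerical check on random inputs (for illustration only):

```python
import numpy as np

rng = np.random.default_rng(1)
for _ in range(1000):
    lam, kap = rng.uniform(0.0, 5.0, size=2)  # nonnegative with lam + kap > 0 here
    h, u = rng.normal(size=2)                 # h plays the role of H~ + sqrt(gamma) zeta

    a = lam / (lam + kap)
    u_hat = u - a * h                         # transformed control of Lemma 2.5 (i)

    lhs = (lam + kap) * u_hat**2 + lam * kap / (lam + kap) * h**2  # integrand of (26)
    rhs = (lam + kap) * u**2 + lam * h**2 - 2.0 * lam * h * u      # half of (23)'s integrand

    assert abs(lhs - rhs) < 1e-9 * (1.0 + abs(rhs))
```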

As a corollary, we obtain the following link between an optimal control for $\hat{J}$ and an optimal control for $J$.

Corollary 2.6.

Assume that (24) holds true. Let $t\in[0,T]$ and $x,d\in\mathbb{R}$.

(i) Suppose that $u^{*}=(u^{*}_{s})_{s\in[t,T]}\in\mathcal{L}_{t}^{2}$ is an optimal control for $J$, and let $\widetilde{H}^{u^{*}}$ be the solution of (22) for $u^{*}$. Then $\hat{u}^{*}=(\hat{u}^{*}_{s})_{s\in[t,T]}$ defined by $\hat{u}^{*}_{s}=u^{*}_{s}-\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}(\widetilde{H}^{u^{*}}_{s}+\sqrt{\gamma_{s}}\,\zeta_{s})$, $s\in[t,T]$, is an optimal control in $\mathcal{L}_{t}^{2}$ for $\hat{J}$.

(ii) Suppose that $\hat{u}^{*}=(\hat{u}^{*}_{s})_{s\in[t,T]}\in\mathcal{L}_{t}^{2}$ is an optimal control for $\hat{J}$, and let $\widehat{H}^{\hat{u}^{*}}$ be the solution of (25) for $\hat{u}^{*}$. Then $u^{*}=(u^{*}_{s})_{s\in[t,T]}$ defined by $u^{*}_{s}=\hat{u}^{*}_{s}+\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}(\widehat{H}^{\hat{u}^{*}}_{s}+\sqrt{\gamma_{s}}\,\zeta_{s})$, $s\in[t,T]$, is an optimal control in $\mathcal{L}_{t}^{2}$ for $J$.

3 Solving the LQ control problem and the trade execution problem

We now solve the LQ control problem from Section 2 and consequently obtain a solution of the trade execution problem.

Remark 3.1.

The solution approach of [34], which we are about to apply, is built on the tight connection between standard LQ stochastic control problems and Riccati-type BSDEs (BSRDEs). This connection is well known and dates back at least to Bismut (see, e.g., [20] and [21]). The central challenge in this approach is to establish the existence of a solution of the BSRDE. Kohlmann and Tang prove in [34] such results in a general framework which in particular covers our problem formulation in Section 2.2 under appropriate assumptions.

There is a variety of further results in the literature on LQ stochastic control problems that provide existence results for BSRDEs under different sets of assumptions. One specific further possibility is, for example, to use the results of the recent article [40] by Sun et al. in our setting. The set-up of [40] allows for cross-terms in the cost functional and, more interestingly, the results in [40] hold under a uniform convexity assumption on the cost functional, which is a weaker requirement than the usually imposed nonnegativity and positivity assumptions on the coefficients of the cost functional. However, in general, the terminal costs and the running costs in (23) (and also in (26)) contain terms such as $(\widetilde{H}^{u}_{T}+\sqrt{\gamma_{T}}\,\xi)^{2}$ and $\lambda_{s}(\widetilde{H}^{u}_{s}+\sqrt{\gamma_{s}}\,\zeta_{s})^{2}$, which are inhomogeneous. Therefore, the results of [40] are only directly applicable in the special case where $\xi=0$ and at least one of $\lambda$ and $\zeta$ vanishes. A possible route for future research could be to incorporate inhomogeneous control problems as presented in Section 2 into the framework of [40].

Setting in Section 3: In our general setting (see Section 1.1) we additionally assume that the filtration $(\mathcal{F}_{s})_{s\in[0,T]}$ of the filtered probability space $(\Omega,\mathcal{F}_{T},(\mathcal{F}_{s})_{s\in[0,T]},P)$ is the augmented natural filtration of the Brownian motion $(W^{1},\ldots,W^{m})^{\top}$. Furthermore, we set the initial time to $t=0$. We also assume that $\lambda$ and $\kappa=\frac{1}{2}(2\rho+\mu-\sigma^{2}-\eta^{2}-2\sigma\eta\overline{r})$ are nonnegative $dP\times ds|_{[0,T]}$-a.e.⁸

⁸ We stress at this point that the results presented in Sections 1 and 2 are valid for more general filtrations and for processes $\lambda$ and $\kappa$ possibly taking negative values. This opens the way for applying Sections 1 and 2 in other settings in future research.

Remark 3.2.

Note that the assumption of nonnegativity of $\lambda$ and $\kappa$ is necessary to apply the results of [34]. Indeed, [34] requires that $\lambda+\kappa$ (the coefficient in front of $\hat{u}^{2}$ in (26)) and $\frac{\lambda\kappa}{\lambda+\kappa}$ (the coefficient in front of $(\widehat{H}^{\hat{u}}_{s}+\sqrt{\gamma_{s}}\,\zeta_{s})^{2}$ in (26)) are nonnegative and bounded, which implies that $\lambda$ and $\kappa$ have to be nonnegative.

Moreover, we note that nonnegativity of $\lambda$ and $\kappa$ ensures that (24) is satisfied. Further, we observe that the mentioned coefficients $\lambda+\kappa$ and $\frac{\lambda\kappa}{\lambda+\kappa}$ are bounded, as required. Indeed, it clearly holds that $\frac{\lambda\kappa}{\lambda+\kappa}\leq\kappa$, and it remains to recall that $\mu,\sigma,\rho,\eta$, and $\lambda$ are bounded and $\overline{r}$ is $[-1,1]$-valued (see Section 1.1).

Note that the LQ control problem of Section 2.2, which consists of minimizing $\hat{J}$ in (26) with state dynamics given by (25), is of the form considered in [34, (79)-(81)]. The solution can be described by the two BSDEs [34, (9) and (85)]. The first one, [34, (9)], is a Riccati-type BSDE, which in our setting reads

$$\begin{split}dK_{s}&=-\Bigg[\left(\mu_{s}+\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}\bigg(\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}(\sigma_{s}^{2}+2\sigma_{s}\eta_{s}\overline{r}_{s}+\eta_{s}^{2})-2(\rho_{s}+\mu_{s})\bigg)\right)K_{s}\\&\qquad+\left(\sigma_{s}-\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}2(\sigma_{s}+\eta_{s}\overline{r}_{s})\right)L^{1}_{s}-\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}2\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\,L^{2}_{s}+\frac{\lambda_{s}\kappa_{s}}{\lambda_{s}+\kappa_{s}}\\&\qquad-\frac{\left(\left(\rho_{s}+\mu_{s}-\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}(\sigma_{s}^{2}+2\sigma_{s}\eta_{s}\overline{r}_{s}+\eta_{s}^{2})\right)K_{s}+(\sigma_{s}+\eta_{s}\overline{r}_{s})L^{1}_{s}+\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\,L^{2}_{s}\right)^{2}}{\lambda_{s}+\kappa_{s}+(\sigma_{s}^{2}+2\sigma_{s}\eta_{s}\overline{r}_{s}+\eta_{s}^{2})K_{s}}\Bigg]ds\\&\quad+\sum_{j=1}^{m}L_{s}^{j}\,dW_{s}^{j},\quad s\in[0,T],\\ K_{T}&=\frac{1}{2}.\end{split} \tag{27}$$

We call a pair $(K,L)$ with $L=(L^{1},L^{2},\ldots,L^{m})^{\top}$ a solution to BSDE (27) if

(i) $K$ is an adapted, continuous, nonnegative, and bounded process,

(ii) $\lambda+\kappa+(\sigma^{2}+2\sigma\eta\overline{r}+\eta^{2})K>0$ holds $dP\times ds|_{[0,T]}$-a.e.,

(iii) $L^{1},\ldots,L^{m}\in\mathcal{L}_{0}^{2}$, and

(iv) BSDE (27) is satisfied $P$-a.s.

A discussion of this definition is in order. The requirement of nonnegativity and boundedness of $K$ can be explained at this point by the fact that, under mild conditions, such a solution exists (see Theorem 3.3 below). Condition (ii) ensures that there is no problem with division in the driver of (27), where the quantity $\lambda+\kappa+(\sigma^{2}+2\sigma\eta\overline{r}+\eta^{2})K$ appears in the denominator. Moreover, it is worth noting that, for a nonnegative $K$, in our setting we always have $\lambda+\kappa+(\sigma^{2}+2\sigma\eta\overline{r}+\eta^{2})K\geq 0$, as $\sigma^{2}+2\sigma\eta\overline{r}+\eta^{2}=(\sigma+\eta\overline{r})^{2}+\eta^{2}(1-\overline{r}^{2})$. From this we also see that the quantity $\lambda+\kappa+(\sigma^{2}+2\sigma\eta\overline{r}+\eta^{2})K$ can vanish only in "very degenerate" situations. The conclusion is that condition (ii) is quite natural.

To shorten notation, we introduce, for a solution $(K,L)$ of BSDE (27), the process $\theta=(\theta_{s})_{s\in[0,T]}$ by, for $s\in[0,T]$,

\theta_{s}=\frac{\left(\rho_{s}+\mu_{s}-\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}(\sigma_{s}^{2}+2\sigma_{s}\eta_{s}\overline{r}_{s}+\eta_{s}^{2})\right)K_{s}+(\sigma_{s}+\eta_{s}\overline{r}_{s})L^{1}_{s}+\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}L^{2}_{s}}{\lambda_{s}+\kappa_{s}+(\sigma_{s}^{2}+2\sigma_{s}\eta_{s}\overline{r}_{s}+\eta_{s}^{2})K_{s}}. (28)

Next, we consider the second BSDE, [34, (85)], which is linear and in our setting reads

\begin{split}d\psi_{s}&=-\Bigg[\left(\frac{\mu_{s}}{2}-\frac{\sigma_{s}^{2}}{8}-\left(\rho_{s}+\mu_{s}-\frac{\sigma_{s}^{2}+\sigma_{s}\eta_{s}\overline{r}_{s}}{2}\right)\left(\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}+\theta_{s}\right)\right)\psi_{s}\\ &\qquad+\left(\frac{\sigma_{s}}{2}-(\sigma_{s}+\eta_{s}\overline{r}_{s})\left(\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}+\theta_{s}\right)\right)\left(\phi_{s}^{1}+\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}(\sigma_{s}+\eta_{s}\overline{r}_{s})\sqrt{\gamma_{s}}\zeta_{s}K_{s}\right)\\ &\qquad-\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\left(\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}+\theta_{s}\right)\left(\phi_{s}^{2}+\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\sqrt{\gamma_{s}}\zeta_{s}K_{s}\right)\\ &\qquad+\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}\sqrt{\gamma_{s}}\zeta_{s}\left(\left(\rho_{s}+\mu_{s}-\frac{\sigma_{s}^{2}+\sigma_{s}\eta_{s}\overline{r}_{s}}{2}\right)K_{s}+(\sigma_{s}+\eta_{s}\overline{r}_{s})L_{s}^{1}+\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}L_{s}^{2}\right)\\ &\qquad-\frac{\lambda_{s}\kappa_{s}}{\lambda_{s}+\kappa_{s}}\sqrt{\gamma_{s}}\zeta_{s}\Bigg]ds+\sum_{j=1}^{m}\phi_{s}^{j}dW_{s}^{j},\quad s\in[0,T],\\ \psi_{T}&=-\frac{1}{2}\sqrt{\gamma_{T}}\xi.\end{split} (29)

A pair $(\psi,\phi)$ with $\phi=(\phi^{1},\phi^{2},\ldots,\phi^{m})^{\top}$ is called a solution to BSDE (29) if

(i) $\psi$ is an adapted continuous process with $E\left[\sup_{s\in[0,T]}\psi_{s}^{2}\right]<\infty$,

(ii) $\phi$ is progressively measurable with $\int_{0}^{T}\lVert\phi_{s}\rVert_{2}^{2}\,ds<\infty$ $P$-a.s., and

(iii) BSDE (29) is satisfied $P$-a.s.

For a solution $(K,L)$ of BSDE (27) and a corresponding solution $(\psi,\phi)$ of BSDE (29), we define $\theta^{0}=(\theta^{0}_{s})_{s\in[0,T]}$ by

\begin{split}\theta_{s}^{0}&=\Bigg(\bigg(\rho_{s}+\mu_{s}-\frac{\sigma_{s}^{2}+\sigma_{s}\eta_{s}\overline{r}_{s}}{2}\bigg)\psi_{s}+\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}\sqrt{\gamma_{s}}\zeta_{s}(\sigma_{s}^{2}+2\sigma_{s}\eta_{s}\overline{r}_{s}+\eta_{s}^{2})K_{s}\\ &\qquad+(\sigma_{s}+\eta_{s}\overline{r}_{s})\phi^{1}_{s}+\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\phi^{2}_{s}\Bigg)\cdot\left(\lambda_{s}+\kappa_{s}+(\sigma_{s}^{2}+2\sigma_{s}\eta_{s}\overline{r}_{s}+\eta_{s}^{2})K_{s}\right)^{-1},\end{split} (30)

for $s\in[0,T]$. We further introduce, for $x,d\in\mathbb{R}$ and $s\in[0,T]$, the SDE

d\widehat{H}_{s}^{*}=\widehat{H}_{s}^{*}\,d\mathcal{Y}_{s}+d\mathcal{Z}_{s},\quad\widehat{H}_{0}^{*}=\frac{d}{\sqrt{\gamma_{0}}}-\sqrt{\gamma_{0}}x, (31)

where, for $s\in[0,T]$,

\begin{split}d\mathcal{Y}_{s}&=\left(\frac{\mu_{s}}{2}-\frac{\sigma_{s}^{2}}{8}-\bigg(\rho_{s}+\mu_{s}-\frac{\sigma_{s}^{2}+\sigma_{s}\eta_{s}\overline{r}_{s}}{2}\bigg)\bigg(\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}+\theta_{s}\bigg)\right)ds\\ &\quad+\left(\frac{\sigma_{s}}{2}-(\sigma_{s}+\eta_{s}\overline{r}_{s})\bigg(\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}+\theta_{s}\bigg)\right)dW_{s}^{1}-\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\bigg(\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}+\theta_{s}\bigg)dW_{s}^{2},\end{split}

\begin{split}d\mathcal{Z}_{s}&=\bigg(\rho_{s}+\mu_{s}-\frac{\sigma_{s}^{2}+\sigma_{s}\eta_{s}\overline{r}_{s}}{2}\bigg)\bigg(\theta_{s}^{0}-\sqrt{\gamma_{s}}\zeta_{s}\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}\bigg)ds\\ &\quad+(\sigma_{s}+\eta_{s}\overline{r}_{s})\bigg(\theta_{s}^{0}-\sqrt{\gamma_{s}}\zeta_{s}\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}\bigg)dW_{s}^{1}+\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\bigg(\theta_{s}^{0}-\sqrt{\gamma_{s}}\zeta_{s}\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}\bigg)dW_{s}^{2}.\end{split}

We will show that the solution $\widehat{H}^{*}$ of (31) is the optimal state process in the stochastic control problem of minimizing $\hat{J}$ in (26). Notice that $\widehat{H}^{*}$ can easily be expressed in closed form via $\mathcal{Y}$ and $\mathcal{Z}$.

In the next theorem, we summarize the consequences of [34] in our setting to obtain a minimizer of $\hat{J}$ in (26) and a representation of the minimal costs.

Theorem 3.3.

Assume that there exists $\varepsilon\in(0,\infty)$ such that $\lambda+\kappa\geq\varepsilon$ $dP\times ds|_{[0,T]}$-a.e. or $\sigma^{2}+2\sigma\eta\overline{r}+\eta^{2}\geq\varepsilon$ $dP\times ds|_{[0,T]}$-a.e. We then have:

(i) There exists a unique solution $(K,L)$ of BSDE (27). If $\sigma^{2}+2\sigma\eta\overline{r}+\eta^{2}\geq\varepsilon$ $dP\times ds|_{[0,T]}$-a.e., then there exists $c\in(0,\infty)$ such that $P(K_{s}\geq c\text{ for all }s\in[0,T])=1$.

(ii) There exists a unique solution $(\psi,\phi)$ of BSDE (29).

(iii) Let $x,d\in\mathbb{R}$, and let $\widehat{H}^{*}$ be the solution of SDE (31). Then $\hat{u}^{*}=(\hat{u}^{*}_{s})_{s\in[0,T]}$ defined by

\hat{u}^{*}_{s}=\theta_{s}\widehat{H}_{s}^{*}-\theta_{s}^{0},\quad s\in[0,T], (32)

is the unique optimal control in $\mathcal{L}_{0}^{2}$ for $\hat{J}$, and $\widehat{H}^{*}$ is the corresponding state process (i.e., $\widehat{H}^{*}=\widehat{H}^{\hat{u}^{*}}$).

(iv) Let $x,d\in\mathbb{R}$. The costs associated to the optimal control (32) are given by

\begin{split}\inf_{\hat{u}\in\mathcal{L}_{0}^{2}}\hat{J}_{0}\left(\frac{d}{\sqrt{\gamma_{0}}}-\sqrt{\gamma_{0}}x,\hat{u}\right)&=\hat{J}_{0}\left(\frac{d}{\sqrt{\gamma_{0}}}-\sqrt{\gamma_{0}}x,\hat{u}^{*}\right)\\ &=K_{0}\left(\frac{d}{\sqrt{\gamma_{0}}}-\sqrt{\gamma_{0}}x\right)^{2}-2\psi_{0}\left(\frac{d}{\sqrt{\gamma_{0}}}-\sqrt{\gamma_{0}}x\right)+C_{0},\end{split}

where

\begin{split}C_{0}&=\frac{1}{2}E_{0}\left[\gamma_{T}\xi^{2}\right]+E_{0}\left[\int_{0}^{T}K_{s}\frac{\lambda_{s}^{2}}{\left(\lambda_{s}+\kappa_{s}\right)^{2}}\gamma_{s}\zeta_{s}^{2}(\sigma_{s}^{2}+2\sigma_{s}\eta_{s}\overline{r}_{s}+\eta_{s}^{2})ds\right]\\ &\quad+E_{0}\left[\int_{0}^{T}\frac{\lambda_{s}\kappa_{s}}{\lambda_{s}+\kappa_{s}}\gamma_{s}\zeta_{s}^{2}ds\right]-E_{0}\left[\int_{0}^{T}(\theta_{s}^{0})^{2}(\lambda_{s}+\kappa_{s}+(\sigma_{s}^{2}+2\sigma_{s}\eta_{s}\overline{r}_{s}+\eta_{s}^{2})K_{s})ds\right]\\ &\quad+E_{0}\left[\int_{0}^{T}2\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}\sqrt{\gamma_{s}}\zeta_{s}\psi_{s}\bigg(\rho_{s}+\mu_{s}-\frac{\sigma_{s}^{2}+\sigma_{s}\eta_{s}\overline{r}_{s}}{2}\bigg)ds\right]\\ &\quad+E_{0}\left[\int_{0}^{T}2\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}\sqrt{\gamma_{s}}\zeta_{s}\left(\phi_{s}^{1}(\sigma_{s}+\eta_{s}\overline{r}_{s})+\phi_{s}^{2}\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\right)ds\right].\end{split} (33)
Proof.

Observe that the problem in Section 2.2 fits the problem considered in [34, Section 5]. In particular, note that the coefficients in SDE (25) for $\widehat{H}^{\hat{u}}$ and in the cost functional $\hat{J}$ (see (26)) are bounded, and that the inhomogeneities are in $\mathcal{L}_{0}^{2}$. Moreover, we have that $\frac{1}{2}$, $\frac{\lambda\kappa}{\lambda+\kappa}$, and $\lambda+\kappa$ are nonnegative. Furthermore, by assumption, the filtration in this section is generated by the Brownian motion $(W^{1},\ldots,W^{m})^{\top}$.

(i) If $\lambda+\kappa\geq\varepsilon$, this is an immediate consequence of [34, Theorem 2.1]. In the case $\sigma^{2}+2\sigma\eta\overline{r}+\eta^{2}\geq\varepsilon$, this is an application of [34, Theorem 2.2].

(ii) This is due to [34, Theorem 5.1].

(iii) The first part of [34, Theorem 5.2] yields the existence of a unique optimal control $\hat{u}^{*}$, which is given in feedback form by the formula $\hat{u}^{*}=\theta\widehat{H}^{\hat{u}^{*}}-\theta^{0}$. We obtain (31) by plugging this into (25).

(iv) The second part of [34, Theorem 5.2] provides us with the optimal costs. ∎

By an application of Corollary 2.6 and Corollary 2.4, we obtain a solution to the trade execution problem of Section 1.

Corollary 3.4.

Assume that there exists $\varepsilon\in(0,\infty)$ such that $\lambda+\kappa\geq\varepsilon$ $dP\times ds|_{[0,T]}$-a.e. or $\sigma^{2}+2\sigma\eta\overline{r}+\eta^{2}\geq\varepsilon$ $dP\times ds|_{[0,T]}$-a.e. Let $(K,L)$ be the unique solution of BSDE (27), let $(\psi,\phi)$ be the unique solution of BSDE (29), and recall the definitions (28) of $\theta$ and (30) of $\theta^{0}$. Let $x,d\in\mathbb{R}$. Then $X^{*}=(X^{*}_{s})_{s\in[0-,T]}$ defined by

X^{*}_{0-}=x,\quad X^{*}_{T}=\xi,\quad X_{s}^{*}=\gamma_{s}^{-\frac{1}{2}}\left(\bigg(\theta_{s}+\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}-1\bigg)\widehat{H}_{s}^{*}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}-\theta_{s}^{0}\right),\quad s\in[0,T),

with $\widehat{H}^{*}$ from (31), is the unique (up to $dP\times ds|_{[0,T]}$-null sets) optimal execution strategy in $\mathcal{A}_{0}^{pm}(x,d)$ for $J^{pm}$. The associated costs are given by

\inf_{X\in\mathcal{A}_{0}^{pm}(x,d)}J^{pm}_{0}(x,d,X)=J^{pm}_{0}(x,d,X^{*})=K_{0}\left(\frac{d}{\sqrt{\gamma_{0}}}-\sqrt{\gamma_{0}}x\right)^{2}-2\psi_{0}\left(\frac{d}{\sqrt{\gamma_{0}}}-\sqrt{\gamma_{0}}x\right)+C_{0}-\frac{d^{2}}{2\gamma_{0}}

with $C_{0}$ from (33).

Remark 3.5.

(i) Note that BSDE (27) contains neither $\xi$ nor $\zeta$. In particular, the solution component $K$ and the process $\theta$ from (28) do not depend on the choice of $\xi$ or $\zeta$ (although they depend on the choice of $\lambda$). In contrast, BSDE (29) involves both $\xi$ and $\zeta$. If $\xi=0$ and at least one of $\lambda$ and $\zeta$ is equivalent to $0$, then $(\psi,\phi)$ from (29), $\theta^{0}$ from (30), and $C_{0}$ from (33) vanish.

(ii) Under the assumptions of Corollary 3.4 it holds that $K_{0}\leq\frac{1}{2}$. This is a direct consequence of Corollary 3.4 and (i) above. Indeed, choose $\xi=0$ and $\zeta=0$ (by (i) this choice does not affect $K$). Then Corollary 3.4 and (i) show that $J^{pm}_{0}(1,0,X^{*})=K_{0}\gamma_{0}$ for the optimal strategy $X^{*}$ from Corollary 3.4. The suboptimal finite variation execution strategy $X_{0-}=1$, $X_{s}=0$, $s\in[0,T]$, in $\mathcal{A}_{0}^{fv}(1,0)$ incurs costs $J^{pm}_{0}(1,0,X)=\frac{\gamma_{0}}{2}$, and hence $K_{0}\leq\frac{1}{2}$.

(iii) Our present setting essentially\footnote{The word "essentially" relates to different integrability conditions and to the fact that in [1] the formulation is for a continuous local martingale and a general filtration instead of a Brownian motion with Brownian filtration.} includes the one in [1], where we have $\xi=0$, $\lambda=0$, and $\eta=0$ (and, therefore, the processes $\zeta$ and $\overline{r}$ are not needed, cf. (3) and (7)). In this subsetting the finite variation control problem associated with (6)-(7) is extended in [1] to a problem where the control $X$ is a càdlàg semimartingale that acts as integrator in the extended\footnote{The word "extended" relates to the fact that (6) and (7) need to be extended with certain additional terms when allowing for general semimartingale strategies, see [1].} state dynamics of the form (6) and target functional of the form (7). In [1] the existence of an optimal semimartingale strategy, as well as the form of the optimal strategy (when it exists), is characterized in terms of a certain process $\widetilde{\beta}$, which is in turn defined via a solution $(Y,Z,M^{\perp})$ to a certain quadratic BSDE (see (3.2) in [1]). It is worth noting that, in the subsetting with $\xi=0$, $\lambda=0$, and $\eta=0$, all formulas in this section simplify greatly and, in particular, BSDE (27) above is equivalent\footnote{For the sake of fair comparison, we consider the subsetting in [1] where the filtration is generated by $(W^{1},\ldots,W^{m})^{\top}$ and the continuous local martingale $M$ is $W^{1}$.} to BSDE (3.2) in [1]. The relation is $Y=K$, $Z=L^{1}$, $dM^{\perp}_{s}=\sum_{j=2}^{m}L^{j}_{s}\,dW^{j}_{s}$. Further, in that subsetting, our process $\theta$ from (28) reduces to the above-mentioned process $\widetilde{\beta}$ (see (3.5) in [1]), while $(\psi,\phi)$ from (29), $\theta^{0}$ from (30), and $C_{0}$ from (33) vanish.

(iv) It is also instructive to compare Corollary 3.4 above, where we obtain that the control problem extended to $\mathcal{A}_{0}^{pm}(x,d)$ always admits a minimizer, with Theorem 3.4 in [1], where it turns out that an optimal semimartingale strategy can fail to exist. See the discussion at the end of Section 4.3 for a specific example.

On the continuity of optimal position paths: In the setting of [37] optimal position paths $X^{*}$ exhibit jumps (so-called block trades) at times $0$ and $T$ but are continuous on the interior $(0,T)$ (see also Section 4.1 below). An interesting question is whether the continuity on $(0,T)$ prevails in the generalized setting considered in this paper. This is not reasonable to expect when we have the risk term with a "sufficiently irregular" process $\zeta$. And, indeed, we see that the continuity of $X^{*}$ on $(0,T)$ can fail in Example 4.1 below (this is discussed in Remark 4.2). More interestingly, such continuity can already fail even without the risk term (i.e., $\lambda=0$) and with terminal target $\xi=0$. Indeed, consider the setting with $\sigma=0$, $\lambda=0$, $\xi=0$ and non-diffusive resilience process $R$ given by $R_{s}=\rho s$ (with $\rho$ being a deterministic constant). Then it follows from [1, Example 6.2] that continuity of the price impact process $\gamma$ is not sufficient for continuity of optimal position paths $X^{*}$ on $(0,T)$. It is shown there that if the paths of $\gamma$ are absolutely continuous, then a jump of the weak derivative of $\gamma$ on $(0,T)$ already causes $X^{*}$ to jump on $(0,T)$. Moreover, it is possible that the random terminal target position $\xi$ causes the optimal position path $X^{*}$ to jump in $(0,T)$ while all other input processes are continuous. We present an example of this phenomenon in Section 4.2.

A way to obtain sufficient conditions for the continuity of $X^{*}$ on $(0,T)$ consists in combining Corollary 3.4 with path regularity results for BSDEs. Indeed, if the coefficient processes $\rho,\mu,\sigma,\eta,\overline{r},\lambda,\zeta$ are continuous and if one can ensure that the solution components $L^{1},L^{2}$ and $\phi^{1},\phi^{2}$ (which correspond to the martingale representation part of the solution) of BSDE (27) resp. (29) have continuous sample paths, then Corollary 3.4 ensures that $X^{*}$ also has continuous sample paths on $(0,T)$. Results that guarantee continuity of BSDE solutions in a Markovian framework, including the quadratic case, can be found, for example, in [32].

4 Examples

In this section we apply the results from the preceding sections in specific case studies.

4.1 The Obizhaeva-Wang model with random targets

The models developed by Obizhaeva and Wang [37] can be considered as special cases of the model set up in Section 1. Indeed, we obtain the problem of [37, Section 6] by setting $\mu\equiv 0$, $\sigma\equiv 0$, $\eta\equiv 0$, $\overline{r}\equiv 0$, $\lambda\equiv 0$ and choosing $\rho\in(0,\infty)$ and $\xi\in\mathbb{R}$ as deterministic constants.

Example 4.1.

In this example we apply our results (in particular, Corollary 3.4) and provide closed-form solutions (see (38) below) for optimal progressively measurable execution strategies in versions of these problems that allow for general random terminal targets $\xi$ and general running targets $\zeta$.

To this end, let $x,d\in\mathbb{R}$. Suppose that $\mu\equiv 0$, $\sigma\equiv 0$, $\eta\equiv 0$, and $\overline{r}\equiv 0$. Furthermore, assume that $\rho\in(0,\infty)$ and $\lambda\in[0,\infty)$ are deterministic constants. We take some $\xi$ and $\zeta$ as specified in Section 1.1 (in particular, see (5)). Note that the conditions of Theorem 3.3 and Corollary 3.4 hold true, and that $\gamma_{s}=\gamma_{0}$ for all $s\in[0,T]$. In the current setting, BSDE (27) reads

dK_{s}=\bigg(\frac{\rho^{2}}{\rho+\lambda}K_{s}^{2}+\frac{2\lambda\rho}{\rho+\lambda}K_{s}-\frac{\lambda\rho}{\rho+\lambda}\bigg)ds+\sum_{j=1}^{m}L_{s}^{j}dW_{s}^{j},\quad s\in[0,T],\quad K_{T}=\frac{1}{2}. (34)

By Theorem 3.3, there exists a unique solution $(K,L)$. Since the driver and the terminal condition in (34) are deterministic, we obtain that $L\equiv 0$, and hence (34) is in fact a scalar Riccati ODE with constant coefficients. Such an equation can be solved explicitly, and in our situation we obtain in the case $\lambda>0$ that

K_{s}=\frac{1}{2}\,\frac{\lambda\tanh\left(\frac{\sqrt{\lambda}\rho(T-s)}{\sqrt{\lambda+\rho}}\right)+\sqrt{\lambda(\rho+\lambda)}}{(\frac{\rho}{2}+\lambda)\tanh\left(\frac{\sqrt{\lambda}\rho(T-s)}{\sqrt{\lambda+\rho}}\right)+\sqrt{\lambda(\rho+\lambda)}},\quad s\in[0,T],

and in the case $\lambda=0$ that

K_{s}=\frac{1}{2+(T-s)\rho},\quad s\in[0,T]. (35)
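Both closed-form expressions can be checked against (34) numerically. The following sketch (the parameter values $\rho=1.3$, $\lambda=0.7$, $T=2$ are arbitrary choices for illustration, not taken from the paper) verifies the terminal condition $K_T=\frac{1}{2}$ and, via central finite differences, that each formula solves the Riccati ODE:

```python
import math

rho, lam, T = 1.3, 0.7, 2.0  # arbitrary test parameters

def K_pos(s):
    # closed-form solution of (34) in the case lambda > 0
    t = math.tanh(math.sqrt(lam) * rho * (T - s) / math.sqrt(lam + rho))
    return 0.5 * (lam * t + math.sqrt(lam * (rho + lam))) / (
        (rho / 2 + lam) * t + math.sqrt(lam * (rho + lam)))

def K_zero(s):
    # closed-form solution of (34) in the case lambda = 0
    return 1.0 / (2.0 + (T - s) * rho)

def driver(K, lam_):
    # right-hand side of the Riccati ODE dK/ds = driver(K)
    return (rho**2 / (rho + lam_) * K**2
            + 2 * lam_ * rho / (rho + lam_) * K
            - lam_ * rho / (rho + lam_))

h = 1e-6
for K, lam_ in [(K_pos, lam), (K_zero, 0.0)]:
    assert abs(K(T) - 0.5) < 1e-12            # terminal condition K_T = 1/2
    for s in [0.1, 0.5, 1.0, 1.7]:
        dK = (K(s + h) - K(s - h)) / (2 * h)  # central finite difference
        assert abs(dK - driver(K(s), lam_)) < 1e-5
```

The same check with $\lambda=0$ also confirms directly that $dK_s/ds=\rho K_s^2$ for (35).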

The process $\theta$ from (28) is here given by $\theta_{s}=\frac{\rho}{\lambda+\rho}K_{s}$, $s\in[0,T]$. BSDE (29) becomes

d\psi_{s}=\left(\frac{\rho\lambda}{\lambda+\rho}+\rho\theta_{s}\right)\psi_{s}ds+\frac{\rho\lambda}{\lambda+\rho}\sqrt{\gamma_{0}}\zeta_{s}(1-K_{s})ds+\sum_{j=1}^{m}\phi_{s}^{j}dW_{s}^{j},\quad s\in[0,T],\quad\psi_{T}=-\frac{1}{2}\sqrt{\gamma_{0}}\xi. (36)

Again, by Theorem 3.3, there exists a unique solution $(\psi,\phi)$. The solution component $\psi$ is given by

\psi_{s}=\Gamma_{s}^{-1}\sqrt{\gamma_{0}}\left(-\frac{1}{2}\Gamma_{T}E_{s}[\xi]-\frac{\rho\lambda}{\lambda+\rho}E_{s}\left[\int_{s}^{T}\Gamma_{r}(1-K_{r})\zeta_{r}\,dr\right]\right),\quad s\in[0,T],

where

\Gamma_{s}=\exp\left(-\rho\int_{0}^{s}\left(\frac{\lambda}{\lambda+\rho}+\theta_{r}\right)dr\right)=\exp\left(-\frac{\rho}{\lambda+\rho}\left(\lambda s+\rho\int_{0}^{s}K_{r}dr\right)\right),\quad s\in[0,T]. (37)

For the process in (30) it holds that $\theta_{s}^{0}=\frac{\rho}{\lambda+\rho}\psi_{s}$, $s\in[0,T]$. Further, SDE (31) reads

d\widehat{H}_{s}^{*}=-\rho\left(\frac{\lambda}{\lambda+\rho}+\theta_{s}\right)\widehat{H}_{s}^{*}ds+\rho\left(\theta_{s}^{0}-\sqrt{\gamma_{0}}\zeta_{s}\frac{\lambda}{\lambda+\rho}\right)ds,\quad s\in[0,T],\quad\widehat{H}_{0}^{*}=\frac{d}{\sqrt{\gamma_{0}}}-\sqrt{\gamma_{0}}x,

and has the solution

\widehat{H}_{s}^{*}=\Gamma_{s}\left(\frac{d}{\sqrt{\gamma_{0}}}-\sqrt{\gamma_{0}}x+\rho\int_{0}^{s}\Gamma_{r}^{-1}\left(\theta_{r}^{0}-\sqrt{\gamma_{0}}\zeta_{r}\frac{\lambda}{\lambda+\rho}\right)dr\right),\quad s\in[0,T],

with $\Gamma$ from (37). It then follows from Corollary 3.4 that $X^{*}=(X^{*}_{s})_{s\in[0-,T]}$ defined by $X_{0-}^{*}=x$, $X_{T}^{*}=\xi$, and, for $s\in[0,T)$,

\begin{split}X_{s}^{*}&=\gamma_{0}^{-\frac{1}{2}}\left(\left(\theta_{s}-\frac{\rho}{\lambda+\rho}\right)\widehat{H}_{s}^{*}-\theta_{s}^{0}\right)+\zeta_{s}\frac{\lambda}{\lambda+\rho}\\ &=\frac{\rho}{\lambda+\rho}(1-K_{s})\Gamma_{s}\left(x-\frac{d}{\gamma_{0}}+\frac{\rho}{\lambda+\rho}\int_{0}^{s}\Gamma_{r}^{-1}\left(\lambda\zeta_{r}-\frac{\rho}{\sqrt{\gamma_{0}}}\psi_{r}\right)dr\right)+\frac{\rho}{\lambda+\rho}\left(\frac{\lambda}{\rho}\zeta_{s}-\frac{1}{\sqrt{\gamma_{0}}}\psi_{s}\right)\end{split} (38)

is the (up to $dP\times ds|_{[0,T]}$-null sets unique) execution strategy in $\mathcal{A}_{0}^{pm}(x,d)$ that minimizes $J^{pm}$.

Remark 4.2.

From Example 4.1 we see that discontinuities of the target process $\zeta$ can cause jumps of the optimal position path $X^{*}$ in $(0,T)$. Indeed, as $\theta$, $\theta^{0}$, and $\widehat{H}^{*}$ are continuous, it follows from (38) that, in the case $\lambda>0$, paths of the optimal strategy $X^{*}$ inherit discontinuities from $\zeta$ on $(0,T)$ (in particular, $X^{*}$ jumps on $(0,T)$ whenever $\zeta$ does).

In the next example we study the case $\lambda\equiv 0$ in more detail.

Example 4.3.

In the setting of the previous example, suppose that $\lambda\equiv 0$. If the terminal target $\xi\in\mathbb{R}$ is a deterministic constant, then it follows from [37, Proposition 3] that the optimal finite variation execution strategy is given by

X_{s}^{*}=\left(x-\xi-\frac{d}{\gamma_{0}}\right)\frac{1+(T-s)\rho}{2+T\rho}+\xi,\quad s\in[0,T). (39)

So the optimal strategy consists of potential block trades (jumps of $X^{*}$) at times $0$ and $T$ and a continuous linear trading program on $[0,T)$. In the following we analyze how this structure changes when we allow for a random terminal target $\xi$.
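The block-trade-plus-linear-program structure of (39) can be illustrated with a short sketch (the parameter values are arbitrary assumptions; with initial deviation $d=0$ the two block trades have equal size):

```python
# Illustrative sketch of the deterministic Obizhaeva-Wang strategy (39);
# parameter values below are arbitrary assumptions for the example.
rho, T = 2.0, 1.0
x, d, gamma0, xi = 1.0, 0.0, 1.0, 0.0  # position, deviation, impact, target

def X_star(s):
    # optimal position at time s in [0, T); block trades occur at 0 and T
    return (x - xi - d / gamma0) * (1 + (T - s) * rho) / (2 + T * rho) + xi

initial_block = x - X_star(0.0)           # jump at time 0
terminal_block = X_star(T - 1e-12) - xi   # (approximate) jump at time T
slope = (X_star(0.5) - X_star(0.0)) / 0.5 # constant trading rate on [0, T)

# With d = 0 the two block trades coincide, and the interior program is linear:
assert abs(initial_block - terminal_block) < 1e-9
assert abs((X_star(0.25) - X_star(0.0)) / 0.25 - slope) < 1e-9
```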

First recall that the solution of BSDE (34) is given in this case by (35). It follows that $\Gamma$ from (37) simplifies to $\Gamma_{s}=\frac{2+(T-s)\rho}{2+T\rho}$, $s\in[0,T]$. For the solution component $\psi$ of BSDE (36), we thus obtain

\psi_{s}=-\frac{\sqrt{\gamma_{0}}}{2+(T-s)\rho}E_{s}[\xi],\quad s\in[0,T].

The optimal strategy from (38) on $[0,T)$ becomes, for $s\in[0,T)$,

\begin{split}X_{s}^{*}&=(1-K_{s})\Gamma_{s}\left(x-\frac{d}{\gamma_{0}}-\rho\int_{0}^{s}\Gamma_{r}^{-1}\frac{1}{\sqrt{\gamma_{0}}}\psi_{r}dr\right)-\frac{1}{\sqrt{\gamma_{0}}}\psi_{s}\\ &=\left(x-\frac{d}{\gamma_{0}}\right)\frac{1+(T-s)\rho}{2+T\rho}+\rho(1+(T-s)\rho)\int_{0}^{s}\frac{E_{r}[\xi]}{(2+(T-r)\rho)^{2}}dr+\frac{E_{s}[\xi]}{2+(T-s)\rho}.\end{split} (40)

Integration by parts implies that (note that $(E_{r}[\xi])_{r\in[0,T]}$ is a continuous martingale)

\begin{split}\int_{0}^{s}\frac{E_{r}[\xi]}{(2+(T-r)\rho)^{2}}dr&=\int_{0}^{s}E_{r}[\xi]\,d\frac{1}{\rho(2+(T-r)\rho)}\\ &=\frac{E_{s}[\xi]}{\rho(2+(T-s)\rho)}-\frac{E_{0}[\xi]}{\rho(2+T\rho)}-\int_{0}^{s}\frac{1}{\rho(2+(T-r)\rho)}dE_{r}[\xi],\quad s\in[0,T).\end{split}

Substituting this into (40) yields, for $s\in[0,T)$,

\begin{split}X_{s}^{*}&=\left(x-E_{0}[\xi]-\frac{d}{\gamma_{0}}\right)\frac{1+(T-s)\rho}{2+T\rho}+E_{s}[\xi]-\int_{0}^{s}\frac{1+(T-s)\rho}{2+(T-r)\rho}dE_{r}[\xi]\\ &=\left(x-E_{0}[\xi]-\frac{d}{\gamma_{0}}\right)\frac{1+(T-s)\rho}{2+T\rho}+E_{0}[\xi]+\int_{0}^{s}\left(1-\frac{1+(T-s)\rho}{2+(T-r)\rho}\right)dE_{r}[\xi].\end{split}

Finally, we obtain the alternative representation

X_{s}^{*}=\left(x-E_{0}[\xi]-\frac{d}{\gamma_{0}}\right)\frac{1+(T-s)\rho}{2+T\rho}+E_{0}[\xi]+\int_{0}^{s}\frac{1+(s-r)\rho}{2+(T-r)\rho}dE_{r}[\xi],\quad s\in[0,T),

for (40). We see that this optimal strategy $X^{*}\in\mathcal{A}_{0}^{pm}(x,d)$ consists of two additive parts: the first part exactly corresponds to the optimal deterministic strategy in (39) with the deterministic terminal target replaced by the expected terminal target $E_{0}[\xi]$; the second part represents fluctuations around this deterministic strategy which incorporate updates about the random terminal target $\xi$. Note that this stochastic integral vanishes in expectation, although it is not a martingale (indeed, the time $s$ is not only the upper limit of integration but also appears in the integrand).
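The equivalence of (40) and the alternative representation can also be checked along a simulated path. In the sketch below (a toy setup: the martingale $(E_{s}[\xi])$ is modeled, purely as an assumption for illustration, as a Brownian motion started at an arbitrary value $\xi_{0}$, and both the $ds$- and the $dE_{r}[\xi]$-integrals are discretized by left-point sums), the two representations agree up to discretization error:

```python
import random
random.seed(0)

rho, T, N = 1.5, 1.0, 100_000
x, d, gamma0 = 1.0, 0.3, 2.0
xi0 = 0.5            # assumed initial expectation E_0[xi] (toy choice)
dt = T / N

# Toy model: the martingale M_s = E_s[xi] is simulated as a Brownian motion.
M = [xi0]
for _ in range(N):
    M.append(M[-1] + random.gauss(0.0, dt ** 0.5))

def X_via_40(k):
    # representation (40): left-point Riemann sum for the ds-integral, s = k*dt
    s = k * dt
    integral = sum(M[i] / (2 + (T - i * dt) * rho) ** 2 * dt for i in range(k))
    return ((x - d / gamma0) * (1 + (T - s) * rho) / (2 + T * rho)
            + rho * (1 + (T - s) * rho) * integral
            + M[k] / (2 + (T - s) * rho))

def X_via_final(k):
    # alternative representation: left-point Ito sum against increments of M
    s = k * dt
    stoch = sum((1 + (s - i * dt) * rho) / (2 + (T - i * dt) * rho)
                * (M[i + 1] - M[i]) for i in range(k))
    return ((x - xi0 - d / gamma0) * (1 + (T - s) * rho) / (2 + T * rho)
            + xi0 + stoch)

k = N // 2
assert abs(X_via_40(k) - X_via_final(k)) < 1e-2  # agree up to discretization error
```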

4.2 A discontinuous optimal position path for continuous inputs

We now show that the optimal strategy can have jumps inside $(0,T)$ even if all input processes, including $\zeta$, are continuous. To this end, let $x,d\in\mathbb{R}$. Take $\lambda\equiv 0$, $\zeta\equiv 0$, $\eta\equiv 0$, $\overline{r}\equiv 0$, and $\mu\equiv 0$, and assume that $\sigma\in(0,\infty)$ and $\rho\in(\frac{\sigma^{2}}{2},\infty)$ are deterministic constants. Moreover, we will later consider an appropriate random terminal target $\xi$, satisfying the assumptions of Section 1.1, to produce a jump of the optimal strategy.

Note that the conditions of Theorem 3.3 and Corollary 3.4 hold true. In particular, there exists a unique solution $(K,L)$ of BSDE (27), and it is given by $L\equiv 0$ and (compare also with [1, Section 5.2])

K_{s}=\frac{\rho-\frac{\sigma^{2}}{2}}{\sigma^{2}}\,\mathcal{W}\left(\frac{\rho-\frac{\sigma^{2}}{2}}{\sigma^{2}}e^{c_{0}-\frac{\rho^{2}}{\sigma^{2}}s}\right)^{-1},\quad s\in[0,T],

where $\mathcal{W}$ denotes the Lambert $W$ function and $c_{0}=\ln(2)+\frac{1}{\sigma^{2}}(2\rho-\sigma^{2}+\rho^{2}T)$. The process $\theta$ from (28) becomes

\theta_{s}=\frac{\rho K_{s}}{\rho-\frac{\sigma^{2}}{2}+\sigma^{2}K_{s}},\quad s\in[0,T],

and both $\theta$ and $K$ are deterministic, increasing, continuous, $(0,1/2]$-valued functions.
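Note that the terminal condition can be checked by hand: writing $a=\frac{\rho-\sigma^{2}/2}{\sigma^{2}}$, at $s=T$ the argument of $\mathcal{W}$ reduces to $2ae^{2a}$, and since $\mathcal{W}(2ae^{2a})=2a$ we get $K_{T}=a/(2a)=\frac{1}{2}$. A small numerical sketch (an elementary Newton iteration stands in for a library Lambert $W$; the parameter values are arbitrary, chosen with $\rho>\sigma^{2}/2$):

```python
import math

sigma, rho, T = 0.8, 1.0, 2.0   # arbitrary, with rho > sigma**2 / 2
a = (rho - sigma**2 / 2) / sigma**2
c0 = math.log(2.0) + (2 * rho - sigma**2 + rho**2 * T) / sigma**2

def lambert_w(y):
    # principal branch for y > 0 via Newton's method on w * exp(w) = y
    w = math.log(1.0 + y)
    for _ in range(50):
        e = math.exp(w)
        w -= (w * e - y) / (e * (w + 1.0))
    return w

def K(s):
    return a / lambert_w(a * math.exp(c0 - rho**2 / sigma**2 * s))

assert abs(K(T) - 0.5) < 1e-12   # terminal condition K_T = 1/2
# K is increasing and (0, 1/2]-valued, as stated:
assert all(0 < K(s) <= 0.5 and K(s) <= K(s + 0.01) for s in [0.0, 0.5, 1.0, 1.9])
```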

For some $t_{0}\in(0,T)$, let

\xi=-2\gamma_{0}^{-\frac{1}{2}}\left(\sigma\int_{t_{0}}^{T}\Gamma_{s}\theta_{s}ds+\int_{t_{0}}^{T}\Gamma_{s}dW_{s}^{1}\right)\exp\left(-\frac{\sigma}{2}W^{1}_{T}+\frac{3}{8}\sigma^{2}T+\left(\rho-\frac{\sigma^{2}}{2}\right)\int_{0}^{T}\theta_{s}ds\right),

where $\Gamma_{t}=\exp(-\frac{\sigma^{2}}{8}t-(\rho-\frac{\sigma^{2}}{2})\int_{0}^{t}\theta_{s}ds)$, $t\in[0,T]$. Note that $\xi$ is $\mathcal{F}_{T}$-measurable and that $E[\gamma_{T}\xi^{2}]<\infty$. The terminal target $\xi$ here is defined in such a way that the unique solution $(\psi,\phi)$ of BSDE (29) (cf. Theorem 3.3) is given by $\phi^{1}=1_{[t_{0},T]}$, $\phi^{j}\equiv 0$, $j\in\{2,\ldots,m\}$, and

\psi_{t}=\begin{cases}0,&0\leq t<t_{0},\\ \Gamma_{t}^{-1}\left(\sigma\int_{t_{0}}^{t}\Gamma_{s}\theta_{s}ds+\int_{t_{0}}^{t}\Gamma_{s}dW_{s}^{1}\right),&t_{0}\leq t\leq T.\end{cases}

It follows for the process in (30) that

\theta_{t}^{0}=\begin{cases}0,&0\leq t<t_{0},\\ \frac{\left(\rho-\frac{\sigma^{2}}{2}\right)\psi_{t}+\sigma}{\rho-\frac{\sigma^{2}}{2}+\sigma^{2}K_{t}},&t_{0}\leq t\leq T.\end{cases}

We thus have that

\Delta\theta_{t_{0}}^{0}=\frac{\sigma}{\rho-\frac{\sigma^{2}}{2}+\sigma^{2}K_{t_{0}}}>0.

From Corollary 3.4 we obtain the existence of a unique optimal strategy $X^{*}$, with $X_{s}^{*}=\gamma_{s}^{-\frac{1}{2}}((\theta_{s}-1)\widehat{H}_{s}^{*}-\theta_{s}^{0})$, $s\in(0,T)$. Since $\gamma$, $\theta$, and $\widehat{H}^{*}$ (see (31)) are continuous and $\Delta\theta_{t_{0}}^{0}>0$, it holds that $\Delta X_{t_{0}}^{*}=-\gamma_{t_{0}}^{-\frac{1}{2}}\Delta\theta_{t_{0}}^{0}<0$. Hence, the optimal strategy has a jump at $t_{0}\in(0,T)$.

4.3 An example where $J^{fv}$ does not admit a minimizer

Let $x,d\in\mathbb{R}$ with $x\neq\frac{d}{\gamma_{0}}$. Suppose that $\sigma\equiv 0$, $\eta\equiv 0$, $\lambda\equiv 0$, $\overline{r}\equiv 0$, $\zeta\equiv 0$, $\xi=0$. Choose $\mu$ to be a bounded deterministic càdlàg function such that there exists $\delta\in(0,T)$ with $\mu$ having infinite variation on $[0,T-\delta]$, and take $\rho\in\mathbb{R}\setminus\{0\}$ such that there exists $\varepsilon>0$ with $2\rho+\mu\geq\varepsilon$. Note that this corresponds to the setting in [1, Example 6.4]. Moreover, observe that the conditions of Corollary 3.4 are satisfied. In the current setting, BSDE (27) becomes

dK_{s}=\left(-\mu_{s}K_{s}+\frac{2(\rho+\mu_{s})^{2}K_{s}^{2}}{2\rho+\mu_{s}}\right)ds+\sum_{j=1}^{m}L_{s}^{j}dW_{s}^{j},\quad s\in[0,T],\quad K_{T}=\frac{1}{2}.

Its solution is given by $(K,0)$, where (see also $Y$ in [1, Section 6])

K_{s}=e^{\int_{s}^{T}\mu_{r}dr}\left(\int_{s}^{T}\frac{2(\rho+\mu_{r})^{2}}{2\rho+\mu_{r}}e^{\int_{r}^{T}\mu_{l}dl}dr+2\right)^{-1},\quad s\in[0,T],

is a deterministic continuous function of finite variation. We have that

\theta_{s}=\frac{2(\rho+\mu_{s})}{2\rho+\mu_{s}}K_{s},\quad s\in[0,T],

which is the same as $\widetilde{\beta}$ in [1, Example 6.4]. The solution of BSDE (29) is given by $(\psi,\phi)=(0,0)$, and it holds that $\theta^{0}\equiv 0$. Furthermore, (31) reads

dH^s=(μs2(ρ+μs)θs)H^sds,s[0,T],H^0=dγ0γ0x,d\widehat{H}^{*}_{s}=\left(\frac{\mu_{s}}{2}-(\rho+\mu_{s})\theta_{s}\right)\widehat{H}_{s}^{*}ds,\quad s\in[0,T],\quad\widehat{H}_{0}^{*}=\frac{d}{\sqrt{\gamma_{0}}}-\sqrt{\gamma_{0}}x,

and is solved by the continuous deterministic finite-variation function

H^s=(dγ0γ0x)exp(0s(μr2(ρ+μr)θr)𝑑r),s[0,T],\widehat{H}_{s}^{*}=\left(\frac{d}{\sqrt{\gamma_{0}}}-\sqrt{\gamma_{0}}x\right)\exp\left(\int_{0}^{s}\left(\frac{\mu_{r}}{2}-(\rho+\mu_{r})\theta_{r}\right)dr\right),\quad s\in[0,T],

which is nonvanishing due to our assumption x\neq\frac{d}{\gamma_{0}}. (At this point it is easy to explain why we exclude the case x=\frac{d}{\gamma_{0}} in this example: in that case we get \widehat{H}^{*}\equiv 0, and then the optimal strategy is to close the position immediately, i.e., X^{*}_{0-}=x, X^{*}_{s}=0, s\in[0,T], which is always a finite-variation strategy.) By Corollary 3.4, there exists a (up to dP\times ds|_{[0,T]}-null sets) unique minimizer X^{*}=(X^{*}_{s})_{s\in[0-,T]} of J^{pm} in \mathcal{A}_{0}^{pm}(x,d), namely

\begin{split}&X^{*}_{0-}=x,\quad X^{*}_{T}=0,\quad X_{s}^{*}=\gamma_{s}^{-\frac{1}{2}}\left(\theta_{s}-1\right)\widehat{H}_{s}^{*},\quad s\in[0,T).\end{split}

Assume by contradiction that there exists a minimizer X^{0}=(X^{0}_{s})_{s\in[0-,T]} of J^{fv} in \mathcal{A}_{0}^{fv}(x,d). We know from Corollary 2.3 that X^{0} is then also a minimizer of J^{pm} in \mathcal{A}_{0}^{pm}(x,d). It follows that X^{0}=X^{*} dP\times ds|_{[0,T]}-a.e. Since \widehat{H}^{*} is nowhere 0, we obtain that

1+\frac{\gamma^{\frac{1}{2}}X^{0}}{\widehat{H}^{*}}=\theta\quad dP\times ds|_{[0,T]}\text{-a.e.} (41)

Observe that the left-hand side is a process of finite variation. On the other hand, our assumption on \mu easily yields that \theta has infinite variation. This contradiction proves that in the setting of this example, J^{fv} does not admit a minimizer in \mathcal{A}_{0}^{fv}(x,d).
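As a sanity check on the closed form for K, the following sketch verifies numerically that it satisfies the Riccati ODE above (BSDE (27) with L=0). The constant values of \mu and \rho are hypothetical choices for illustration only; in particular, a constant \mu does not produce the infinite-variation effect of this example.

```python
import math

# Hypothetical constant parameters (the example allows a general cadlag mu);
# they satisfy 2*rho + mu > 0 as required.
T, rho, mu = 1.0, 1.0, 0.5

A = 2.0 * (rho + mu) ** 2 / (2.0 * rho + mu)

def K(s):
    # closed-form solution, specialized to constant mu:
    # K_s = e^{mu (T-s)} / ( A (e^{mu (T-s)} - 1)/mu + 2 )
    e = math.exp(mu * (T - s))
    return e / (A * (e - 1.0) / mu + 2.0)

def riccati_rhs(s):
    # right-hand side of dK_s = (-mu K_s + 2(rho+mu)^2 K_s^2 / (2 rho + mu)) ds
    k = K(s)
    return -mu * k + 2.0 * (rho + mu) ** 2 * k ** 2 / (2.0 * rho + mu)

# terminal condition K_T = 1/2
assert abs(K(T) - 0.5) < 1e-12

# dK/ds matches the Riccati right-hand side (central differences)
h = 1e-6
for s in [0.1, 0.3, 0.5, 0.7, 0.9]:
    dK = (K(s + h) - K(s - h)) / (2.0 * h)
    assert abs(dK - riccati_rhs(s)) < 1e-5
```

The same check works for any bounded càdlàg \mu after replacing the exponentials by numerical quadrature of \int_{s}^{T}\mu_{r}dr.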

We can say even more: in this example there does not exist a semimartingale optimal strategy. (Under a semimartingale strategy we formally understand a semimartingale that is an element of \mathcal{A}_{0}^{pm}(x,d).) Indeed, if we had a semimartingale X^{0} as a minimizer, we would still get (41) (with a semimartingale X^{0}). The left-hand side would then be a semimartingale. On the other hand, it is shown in [1, Example 6.4] that there does not exist a semimartingale \beta such that \beta=\theta dP\times ds|_{[0,T]}-a.e. Thus, the cost functional does not have a minimizer in the set of semimartingales, but we are now able to find a minimizer in the set of progressively measurable execution strategies.

4.4 An example with a diffusive resilience

As already mentioned in the introduction, the literature on optimal trade execution in Obizhaeva-Wang type models typically assumes that R is an increasing process. In [1] and [3], R is allowed to have finite variation. Now we consider an example with a truly diffusive R.

Let x,d\in\mathbb{R} with x\neq\frac{d}{\gamma_{0}}. Let \xi=0, \lambda\equiv 0, \zeta\equiv 0, and \mu\equiv 0. Suppose that \overline{r}\in[-1,1] and \eta,\rho,\sigma\in\mathbb{R} are deterministic constants such that \kappa=\frac{1}{2}(2\rho-\sigma^{2}-\eta^{2}-2\sigma\eta\overline{r})>0 and \sigma^{2}+\eta^{2}+2\sigma\eta\overline{r}>0 (in particular, we thus need \rho>0). Note that the assumptions of Corollary 3.4 are satisfied. We moreover remark that the special case \eta=0 corresponds to the setting in [1, Section 5.2]; that is, the difference from [1, Section 5.2] is that we now consider a more general resilience. The Riccati-BSDE (27) becomes

dK_{s}=\frac{(\rho K_{s}+(\sigma+\eta\overline{r})L_{s}^{1}+\eta\sqrt{1-\overline{r}^{2}}L_{s}^{2})^{2}}{(\sigma^{2}+\eta^{2}+2\sigma\eta\overline{r})K_{s}+\kappa}ds-\sigma L_{s}^{1}ds+\sum_{j=1}^{m}L_{s}^{j}dW_{s}^{j},\quad s\in[0,T],\quad K_{T}=\frac{1}{2}.

This has the solution (K,L)=(K,0) with

K_{s}=\frac{\kappa}{\sigma^{2}+\eta^{2}+2\sigma\eta\overline{r}}\mathcal{W}\left(\frac{\kappa}{\sigma^{2}+\eta^{2}+2\sigma\eta\overline{r}}\exp\left(c-\frac{\rho^{2}s}{\sigma^{2}+\eta^{2}+2\sigma\eta\overline{r}}\right)\right)^{-1},\quad s\in[0,T],

and c=\ln(2)+\frac{2\kappa+\rho^{2}T}{\sigma^{2}+\eta^{2}+2\sigma\eta\overline{r}} (compare also with [1, Section 5.2]). We further have that \theta_{s}=\frac{\rho K_{s}}{(\sigma^{2}+\eta^{2}+2\sigma\eta\overline{r})K_{s}+\kappa}, s\in[0,T]. Observe that (\psi,\phi)=(0,0) is the solution of (29) in the present setting and that \theta^{0}\equiv 0 in (30). Moreover, we have that SDE (31) reads

d\widehat{H}_{s}^{*}=\left(-\frac{\sigma^{2}}{8}-\left(\rho-\frac{\sigma^{2}+\sigma\eta\overline{r}}{2}\right)\theta_{s}\right)\widehat{H}_{s}^{*}ds+\left(\frac{\sigma}{2}-(\sigma+\eta\overline{r})\theta_{s}\right)\widehat{H}_{s}^{*}dW_{s}^{1}-\eta\sqrt{1-\overline{r}^{2}}\theta_{s}\widehat{H}_{s}^{*}dW_{s}^{2}

for s\in[0,T], with initial value \widehat{H}_{0}^{*}=\frac{d}{\sqrt{\gamma_{0}}}-\sqrt{\gamma_{0}}x; hence,

\begin{split}\widehat{H}_{s}^{*}&=\left(\frac{d}{\sqrt{\gamma_{0}}}-\sqrt{\gamma_{0}}x\right)\exp\left(-\frac{\sigma^{2}s}{4}-(\rho-\sigma^{2}-\sigma\eta\overline{r})\int_{0}^{s}\theta_{r}dr-\frac{\sigma^{2}+\eta^{2}+2\sigma\eta\overline{r}}{2}\int_{0}^{s}\theta_{r}^{2}dr\right)\\ &\quad\cdot\exp\left(\frac{\sigma}{2}W_{s}^{1}-(\sigma+\eta\overline{r})\int_{0}^{s}\theta_{r}dW_{r}^{1}-\eta\sqrt{1-\overline{r}^{2}}\int_{0}^{s}\theta_{r}dW_{r}^{2}\right),\quad s\in[0,T].\end{split}

It follows from Corollary 3.4 that for s\in[0,T) the optimal execution strategy is given by

\begin{split}X_{s}^{*}&=\left(x-\frac{d}{\gamma_{0}}\right)(1-\theta_{s})\exp\left(-(\rho-\sigma^{2}-\sigma\eta\overline{r})\int_{0}^{s}\theta_{r}dr-\frac{\sigma^{2}+\eta^{2}+2\sigma\eta\overline{r}}{2}\int_{0}^{s}\theta_{r}^{2}dr\right)\\ &\quad\cdot\exp\left(-(\sigma+\eta\overline{r})\int_{0}^{s}\theta_{r}dW_{r}^{1}-\eta\sqrt{1-\overline{r}^{2}}\int_{0}^{s}\theta_{r}dW_{r}^{2}\right).\end{split}

One can show that K and \theta are both continuous, deterministic, increasing, (0,1/2]-valued functions of finite variation. Since \theta<1, the optimal strategy on [0,T) always has the same sign as x-\frac{d}{\gamma_{0}}. Moreover, the optimal strategy is stochastic and has infinite variation, as in [1, Section 5.2]. In contrast to [1, Section 5.2], where the price impact always has infinite variation, we can here set \sigma=0 and choose \eta^{2}\in(0,2\rho). In this case, the price impact \gamma\equiv\gamma_{0} is a deterministic constant, yet the optimal strategy has infinite variation (due to the infinite variation in the resilience R).
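The closed form for K in terms of the Lambert function \mathcal{W} can likewise be checked numerically against BSDE (27) with L=0; the parameter values below are hypothetical and satisfy \kappa>0 and \sigma^{2}+\eta^{2}+2\sigma\eta\overline{r}>0. The Lambert function is implemented here by a simple Newton iteration to keep the sketch self-contained.

```python
import math

# Hypothetical constants satisfying kappa > 0 and Sigma > 0
T, rho, sigma, eta, rbar = 1.0, 1.0, 0.5, 0.3, 0.2
Sigma = sigma**2 + eta**2 + 2.0 * sigma * eta * rbar
kappa = 0.5 * (2.0 * rho - Sigma)
c = math.log(2.0) + (2.0 * kappa + rho**2 * T) / Sigma

def lambert_w(x):
    # principal branch of the Lambert W function for x > 0, via Newton's method
    w = math.log(1.0 + x)
    for _ in range(50):
        ew = math.exp(w)
        w -= (w * ew - x) / (ew * (1.0 + w))
    return w

def K(s):
    A = kappa / Sigma
    return A / lambert_w(A * math.exp(c - rho**2 * s / Sigma))

# terminal condition K_T = 1/2
assert abs(K(T) - 0.5) < 1e-10

# dK/ds = rho^2 K^2 / (Sigma K + kappa), i.e. BSDE (27) with L = 0
h = 1e-6
for s in [0.1, 0.5, 0.9]:
    dK = (K(s + h) - K(s - h)) / (2.0 * h)
    rhs = rho**2 * K(s)**2 / (Sigma * K(s) + kappa)
    assert abs(dK - rhs) < 1e-5
```

The terminal condition follows from \mathcal{W}(y e^{y})=y with y=2\kappa/\Sigma, which is what the choice of c encodes.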

Observe furthermore that by making use of \eta and \overline{r}, we can choose the parameters in the current setting in such a way that \kappa>0 and \sigma^{2}+\eta^{2}+2\sigma\eta\overline{r}>0 are satisfied, but condition (3.1) in [1], i.e., 2\rho-\sigma^{2}>0, is violated.

Comparing with Section 4.3, we remark that in both sections there does not exist an optimal strategy in \mathcal{A}_{0}^{fv}(x,d); as opposed to Section 4.3, however, in the current section there exists a semimartingale optimal strategy.

4.5 Cancellation of infinite variation

We now present an example where the infinite variation in the price impact process \gamma is “cancelled” by the infinite variation in the resilience process R, so that the optimal strategy X^{*} is of finite variation.

Let x,d\in\mathbb{R}, \xi=0, \lambda\equiv 0, \zeta\equiv 0, and \mu\equiv 0. Suppose that \overline{r}=-1 and \rho>0 are deterministic constants, and that \eta and \sigma are progressively measurable, dP\times ds|_{[0,T]}-a.e. bounded processes such that \eta=\sigma dP\times ds|_{[0,T]}-a.e. It then holds dP\times ds|_{[0,T]}-a.e. that \sigma^{2}+\eta^{2}+2\sigma\eta\overline{r}=0 and \kappa=\rho>0. In particular, the assumptions of Corollary 3.4 are satisfied. The BSDE

\begin{split}dK_{s}&=\rho K_{s}^{2}ds-\sigma_{s}L_{s}^{1}ds+\sum_{j=1}^{m}L_{s}^{j}dW_{s}^{j},\quad s\in[0,T],\quad K_{T}=\frac{1}{2},\end{split}

which is BSDE (27) in the present setting, has the solution (K,L)=(K,0) with K_{s}=\frac{1}{2+(T-s)\rho}, s\in[0,T] (cf. Section 4.1). It holds that \theta\equiv K, that (\psi,\phi)=(0,0) is the solution of (29), and that \theta^{0}\equiv 0. It follows that (31) has the solution

\begin{split}\widehat{H}_{s}^{*}&=\left(\frac{d}{\sqrt{\gamma_{0}}}-\sqrt{\gamma_{0}}x\right)\exp\left(-\frac{1}{4}\int_{0}^{s}\sigma_{r}^{2}dr-\rho\int_{0}^{s}K_{r}dr+\frac{1}{2}\int_{0}^{s}\sigma_{r}dW_{r}^{1}\right),\quad s\in[0,T].\end{split}

For the optimal execution strategy from Corollary 3.4 we then compute that

X_{s}^{*}=\left(x-\frac{d}{\gamma_{0}}\right)\frac{1+(T-s)\rho}{2+T\rho},\quad s\in[0,T).

The optimal strategy in the current setting with general stochastic \sigma=\eta and negative correlation \overline{r}=-1 is thus the same as in the Obizhaeva-Wang setting \sigma=0=\eta (cf. [37, Proposition 3]; see also [1, Section 4.2]). In particular, the optimal strategy is deterministic and of finite variation, although the price impact \gamma and the resilience R are both stochastic and of infinite variation (at least if \sigma=\eta is nonvanishing).
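The closed forms of this section are simple enough to check directly; the following sketch (with hypothetical numerical values for x, d/\gamma_{0}, \rho, T) verifies that K solves dK_{s}=\rho K_{s}^{2}ds with K_{T}=1/2 and that X^{*} interpolates linearly between the initial block trade at time 0 and the terminal jump to 0 at time T.

```python
# Hypothetical parameter values for illustration
T, rho = 1.0, 2.0
x, d_over_gamma0 = 1.0, 0.25   # initial position and d / gamma_0

def K(s):
    # explicit solution of BSDE (27) in the present setting
    return 1.0 / (2.0 + (T - s) * rho)

# K solves dK = rho * K^2 ds with K_T = 1/2
assert K(T) == 0.5
h = 1e-6
for s in [0.2, 0.5, 0.8]:
    dK = (K(s + h) - K(s - h)) / (2.0 * h)
    assert abs(dK - rho * K(s)**2) < 1e-6

def X_star(s):
    # deterministic optimal strategy on [0, T)
    return (x - d_over_gamma0) * (1.0 + (T - s) * rho) / (2.0 + T * rho)

# initial block trade at time 0, then a linear trade-out; jump to 0 at T
assert X_star(0.0) < x
assert X_star(T) == (x - d_over_gamma0) / (2.0 + T * rho)
```

Note that X^{*} is affine in s, so between the two block trades the optimal execution proceeds at a constant rate, as in the classical Obizhaeva-Wang solution.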

We finally remark that this setting does not reduce to the Obizhaeva-Wang setting \sigma=0=\eta. Indeed, while the optimal strategies for \sigma=0=\eta and for general stochastic \sigma=\eta with correlation \overline{r}=-1 coincide, this is not true for the associated deviation processes. In general, it holds that

D_{s}^{X^{*}}=-\gamma_{0}\left(x-\frac{d}{\gamma_{0}}\right)\frac{1}{2+T\rho}\exp\left(\int_{0}^{s}\eta_{r}dW_{r}^{1}-\frac{1}{2}\int_{0}^{s}\eta_{r}^{2}dr\right),\quad s\in[0,T),

which for a nonvanishing \eta and x\neq\frac{d}{\gamma_{0}} has infinite variation, whereas it is constant in the Obizhaeva-Wang setting (take \eta=0).

5 Proofs

In this section, we provide the proofs for the results presented in Section 1 and Section 2. We furthermore state and prove some auxiliary results that are used in the proofs of the main results.

For reference in several proofs, note that the order book height, i.e., the inverse of the price impact, has dynamics

d\gamma_{s}^{-1}=\gamma_{s}^{-1}\left(-(\mu_{s}-\sigma_{s}^{2})ds-\sigma_{s}dW^{1}_{s}\right),\quad s\in[0,T]. (42)

We moreover observe that by Itô’s lemma it holds that

\begin{split}d\gamma_{s}^{\frac{1}{2}}&=\gamma_{s}^{\frac{1}{2}}\left(\frac{1}{2}\mu_{s}-\frac{1}{8}\sigma_{s}^{2}\right)ds+\frac{1}{2}\gamma_{s}^{\frac{1}{2}}\sigma_{s}dW^{1}_{s},\quad s\in[0,T],\end{split} (43)
\begin{split}d\gamma_{s}^{-\frac{1}{2}}&=\gamma_{s}^{-\frac{1}{2}}\left(-\frac{1}{2}\mu_{s}+\frac{3}{8}\sigma_{s}^{2}\right)ds-\frac{1}{2}\gamma_{s}^{-\frac{1}{2}}\sigma_{s}dW^{1}_{s},\quad s\in[0,T].\end{split} (44)
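The drift and volatility coefficients in (42)-(44) are all instances of Itô's formula applied to a power \gamma^{p}: for d\gamma_{s}=\gamma_{s}(\mu_{s}ds+\sigma_{s}dW^{1}_{s}) one gets d(\gamma_{s}^{p})=\gamma_{s}^{p}\left(\left(p\mu_{s}+\frac{p(p-1)}{2}\sigma_{s}^{2}\right)ds+p\sigma_{s}dW^{1}_{s}\right). A minimal check of the three coefficient patterns with exact rational arithmetic:

```python
from fractions import Fraction as F

def ito_power_coeffs(p):
    # d(gamma^p) = gamma^p * ((a*mu + b*sigma^2) ds + v*sigma dW^1)
    # for d gamma = gamma (mu ds + sigma dW^1); returns (a, b, v)
    a = p
    b = p * (p - 1) / 2
    return a, b, p

# (43): p = 1/2  gives drift (1/2) mu - (1/8) sigma^2, volatility (1/2) sigma
assert ito_power_coeffs(F(1, 2)) == (F(1, 2), F(-1, 8), F(1, 2))
# (44): p = -1/2 gives drift -(1/2) mu + (3/8) sigma^2, volatility -(1/2) sigma
assert ito_power_coeffs(F(-1, 2)) == (F(-1, 2), F(3, 8), F(-1, 2))
# (42): p = -1   gives drift -(mu - sigma^2), volatility -sigma
assert ito_power_coeffs(F(-1, 1)) == (-1, 1, -1)
```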
Proof of 1.3.

Observe that integration by parts implies that for all s[t,T]s\in[t,T]

\begin{split}d(\nu_{s}D_{s})&=\nu_{s}dD_{s}+D_{s}d\nu_{s}+d[\nu,D]_{s}\\ &=-\nu_{s}D_{s}dR_{s}+\nu_{s}\gamma_{s}dX_{s}+\nu_{s}D_{s}dR_{s}+\nu_{s}D_{s}d[R]_{s}+d[\nu,D]_{s}\\ &=\nu_{s}\gamma_{s}dX_{s}+\nu_{s}D_{s}d[R]_{s}+d[\nu,D]_{s}.\end{split}

Since d[\nu,D]_{s}=\nu_{s}d[R,D]_{s}=-\nu_{s}D_{s}d[R]_{s}, s\in[t,T], it follows that the process \widetilde{D}_{s}=\nu_{s}D_{s}, s\in[t,T], \widetilde{D}_{t-}=d, satisfies

\begin{split}d\widetilde{D}_{s}=d(\nu_{s}D_{s})&=\nu_{s}\gamma_{s}dX_{s},\quad s\in[t,T].\end{split} (45)

In particular, \widetilde{D} is of finite variation. The facts that \Delta D_{s}=\gamma_{s}\Delta X_{s}, s\in[t,T], and d\widetilde{D}_{s}=\nu_{s}\gamma_{s}dX_{s}, s\in[t,T], imply that

\begin{split}\int_{[t,T]}\left(2D_{s-}+\Delta X_{s}\gamma_{s}\right)dX_{s}&=\int_{[t,T]}\left(2D_{s-}+\Delta D_{s}\right)dX_{s}=\int_{[t,T]}\left(2D_{s-}+\Delta D_{s}\right)\gamma_{s}^{-1}\nu_{s}^{-1}d\widetilde{D}_{s}\\ &=\int_{[t,T]}\nu_{s}^{-2}\gamma_{s}^{-1}\left(2\widetilde{D}_{s-}+\Delta\widetilde{D}_{s}\right)d\widetilde{D}_{s}=\int_{[t,T]}\varphi_{s}\,d(\widetilde{D}^{2}_{s}),\end{split} (46)

where we denote \varphi_{s}=\nu_{s}^{-2}\gamma_{s}^{-1}, s\in[t,T], and, in the last equality, we use that d(\widetilde{D}^{2}_{s})=(2\widetilde{D}_{s-}+\Delta\widetilde{D}_{s})\,d\widetilde{D}_{s}, as \widetilde{D} has finite variation. Summing up, (46) yields

\begin{split}\int_{[t,T]}D_{s-}dX_{s}+\frac{1}{2}\int_{[t,T]}\Delta X_{s}\gamma_{s}dX_{s}&=\frac{1}{2}\left(\widetilde{D}_{T}^{2}\varphi_{T}-\widetilde{D}_{t-}^{2}\varphi_{t}-\int_{t}^{T}\widetilde{D}_{s}^{2}d\varphi_{s}\right)\\ &=\frac{1}{2}\left(\gamma_{T}^{-1}D_{T}^{2}-\gamma_{t}^{-1}d^{2}-\int_{t}^{T}D_{s}^{2}\nu_{s}^{2}d\left(\nu_{s}^{-2}\gamma_{s}^{-1}\right)\right).\end{split}

In order to show (11), we first obtain from (45) and integration by parts that

\begin{split}\nu_{r}D_{r}-d&=\nu_{r}\gamma_{r}X_{r}-\gamma_{t}x-\int_{[t,r]}X_{s}d(\nu_{s}\gamma_{s})-\int_{[t,r]}d[\nu\gamma,X]_{s},\quad r\in[t,T].\end{split}

This implies that D_{r}=\gamma_{r}X_{r}+\nu_{r}^{-1}(d-\gamma_{t}x-\int_{t}^{r}X_{s}d(\nu_{s}\gamma_{s})), r\in[t,T]. ∎
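In the special case of a constant price impact and a deterministic, absolutely continuous resilience (so that \nu_{s}=e^{R_{s}-R_{t}}; this toy setting is an assumption made only for this illustration), the representation of D just obtained can be compared against a direct Euler scheme for dD_{s}=-D_{s}dR_{s}+\gamma_{s}dX_{s}:

```python
import math

# Toy deterministic data (hypothetical): constant gamma, R_s = rho*s so that
# nu_s = exp(rho*(s - t)), and a smooth liquidation path X with X(t) = x.
t, T, rho, gamma = 0.0, 1.0, 2.0, 1.5
x, d = 1.0, 0.3
N = 100_000
ds = (T - t) / N

def X(s):
    return x * (1.0 - s / T) ** 2   # hypothetical smooth strategy, X(T) = 0

def nu(s):
    return math.exp(rho * (s - t))

def D_formula(r, n_steps=20_000):
    # D_r = gamma*X_r + nu_r^{-1} (d - gamma*x - int_t^r X d(nu*gamma)),
    # with d(nu*gamma) = gamma*rho*nu ds here; midpoint rule for the integral
    h = (r - t) / n_steps
    integral = 0.0
    for i in range(n_steps):
        s = t + (i + 0.5) * h
        integral += X(s) * gamma * rho * nu(s) * h
    return gamma * X(r) + (d - gamma * x - integral) / nu(r)

# Euler scheme for dD = -D dR + gamma dX with D_{t-} = d (no jump: X(t) = x)
D = d + gamma * (X(t) - x)
s = t
for _ in range(N):
    dX = X(s + ds) - X(s)
    D += -D * rho * ds + gamma * dX
    s += ds

assert abs(D - D_formula(T)) < 1e-3
```

The two computations agree up to discretization error, which illustrates the formula without relying on the stochastic calculus of the proof.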

Proof of 1.4.

We first consider the integrator \nu^{-2}\gamma^{-1} on the right-hand side of (10). It holds by integration by parts and (9) that for all s\in[t,T]

\begin{split}d(\nu_{s}^{-2}\gamma_{s}^{-1})&=\nu_{s}^{-1}d(\gamma_{s}^{-1}\nu_{s}^{-1})+\gamma_{s}^{-1}\nu_{s}^{-1}d\nu_{s}^{-1}+d[\nu^{-1},\gamma^{-1}\nu^{-1}]_{s}\\ &=2\nu_{s}^{-1}\gamma_{s}^{-1}d\nu_{s}^{-1}+\nu_{s}^{-2}d\gamma_{s}^{-1}+\nu_{s}^{-1}d[\gamma^{-1},\nu^{-1}]_{s}+d[\nu^{-1},\gamma^{-1}\nu^{-1}]_{s}\\ &=-2\nu_{s}^{-2}\gamma_{s}^{-1}dR_{s}+\nu_{s}^{-2}d\gamma_{s}^{-1}-\nu_{s}^{-2}d[\gamma^{-1},R]_{s}+d[\nu^{-1},\gamma^{-1}\nu^{-1}]_{s}.\end{split}

Note that for all s\in[t,T] we have

\begin{split}d[\nu^{-1},\gamma^{-1}\nu^{-1}]_{s}&=-\nu_{s}^{-1}d[R,\gamma^{-1}\nu^{-1}]_{s}=-\nu_{s}^{-1}d\left[R,\int_{t}^{\cdot}\gamma^{-1}d\nu^{-1}+\int_{t}^{\cdot}\nu^{-1}d\gamma^{-1}\right]_{s}\\ &=-\nu_{s}^{-1}\gamma_{s}^{-1}d[R,\nu^{-1}]_{s}-\nu_{s}^{-2}d[R,\gamma^{-1}]_{s}=\nu_{s}^{-2}\gamma_{s}^{-1}d[R]_{s}-\nu_{s}^{-2}d[R,\gamma^{-1}]_{s}.\end{split}

It hence follows for all s\in[t,T] that

\begin{split}d(\nu_{s}^{-2}\gamma_{s}^{-1})&=-2\nu_{s}^{-2}\gamma_{s}^{-1}dR_{s}+\nu_{s}^{-2}d\gamma_{s}^{-1}-2\nu_{s}^{-2}d[\gamma^{-1},R]_{s}+\nu_{s}^{-2}\gamma_{s}^{-1}d[R]_{s}.\end{split}

Plugging this into (10) from 1.3, we obtain that

\begin{split}&\int_{[t,T]}D_{s-}dX_{s}+\frac{1}{2}\int_{[t,T]}\Delta X_{s}\gamma_{s}dX_{s}\\ &=\frac{1}{2}\left(\gamma_{T}^{-1}D_{T}^{2}-\gamma_{t}^{-1}d^{2}-\int_{t}^{T}D_{s}^{2}\left(d\gamma_{s}^{-1}+\gamma_{s}^{-1}d[R]_{s}-2\gamma_{s}^{-1}dR_{s}-2d[\gamma^{-1},R]_{s}\right)\right).\end{split} (47)

We further have by (3) and (42) that for all s\in[t,T]

\begin{split}&d\gamma_{s}^{-1}+\gamma_{s}^{-1}d[R]_{s}-2\gamma_{s}^{-1}dR_{s}-2d[\gamma^{-1},R]_{s}\\ &=-\gamma_{s}^{-1}(\mu_{s}-\sigma_{s}^{2})ds-\gamma_{s}^{-1}\sigma_{s}dW^{1}_{s}+\gamma_{s}^{-1}\eta_{s}^{2}ds-2\gamma_{s}^{-1}\rho_{s}ds-2\gamma_{s}^{-1}\eta_{s}dW_{s}^{R}+2\gamma_{s}^{-1}\sigma_{s}\eta_{s}\overline{r}_{s}ds\\ &=-\gamma_{s}^{-1}\left(2\rho_{s}+\mu_{s}-\sigma_{s}^{2}-\eta_{s}^{2}-2\sigma_{s}\eta_{s}\overline{r}_{s}\right)ds-\gamma_{s}^{-1}\sigma_{s}dW^{1}_{s}-2\gamma_{s}^{-1}\eta_{s}dW_{s}^{R}.\end{split} (48)

It follows from assumption (A1) and the boundedness of the input processes that

\begin{split}E\left[\left\lvert\int_{t}^{T}D_{s}^{2}\gamma_{s}^{-1}\left(2\rho_{s}+\mu_{s}-\sigma_{s}^{2}-\eta_{s}^{2}-2\sigma_{s}\eta_{s}\overline{r}_{s}\right)ds\right\rvert\right]<\infty.\end{split}

The Burkholder-Davis-Gundy inequality together with assumption (A3) shows that it holds for some constant c\in(0,\infty) that

\begin{split}E\left[\sup_{r\in[t,T]}\left\lvert\int_{t}^{r}D_{s}^{2}\gamma_{s}^{-1}\sigma_{s}dW^{1}_{s}\right\rvert\right]&\leq cE\left[\left(\int_{t}^{T}D_{s}^{4}\gamma_{s}^{-2}\sigma_{s}^{2}ds\right)^{\frac{1}{2}}\right]<\infty.\end{split}

We therefore have that E_{t}[\int_{t}^{T}D_{s}^{2}\gamma_{s}^{-1}\sigma_{s}dW^{1}_{s}]=0. Similarly, assumption (A2) implies that E_{t}[\int_{t}^{T}2D_{s}^{2}\gamma_{s}^{-1}\eta_{s}dW_{s}^{R}]=0. It thus follows from (47), (48), and (12) that

E_{t}\left[\int_{[t,T]}D_{s-}dX_{s}+\frac{1}{2}\int_{[t,T]}\Delta X_{s}\gamma_{s}dX_{s}\right]=\frac{1}{2}E_{t}\left[\gamma_{T}^{-1}D_{T}^{2}+\int_{t}^{T}D_{s}^{2}\gamma_{s}^{-1}2\kappa_{s}ds\right]-\frac{d^{2}}{2\gamma_{t}}.

By definition (7) of J^{fv}, this proves (13). ∎

The dynamics that we compute in the following lemma are used in the proofs of Lemma 1.6 and Lemma 5.5.

Lemma 5.1.

Let t\in[0,T], x,d\in\mathbb{R}. Assume that X=(X_{s})_{s\in[t-,T]} is a progressively measurable process such that \int_{t}^{T}X_{s}^{2}ds<\infty a.s. For \alpha_{s}=\gamma_{s}^{-\frac{1}{2}}\nu_{s}^{-1}, s\in[t,T], and \beta_{s}=d-\gamma_{t}x-\int_{t}^{s}X_{r}d(\nu_{r}\gamma_{r}), s\in[t,T], it then holds for all s\in[t,T] that

\begin{split}&d(\alpha_{s}\beta_{s})\\ &=-\gamma_{s}^{\frac{1}{2}}X_{s}\Bigg{(}\big{(}\mu_{s}+\rho_{s}+\eta_{s}^{2}+\sigma_{s}\eta_{s}\overline{r}_{s}\big{)}ds+\big{(}\sigma_{s}+\eta_{s}\overline{r}_{s}\big{)}dW_{s}^{1}+\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}dW_{s}^{2}\Bigg{)}\\ &\quad+\alpha_{s}\beta_{s}\Bigg{(}\Big{(}-\rho_{s}-\frac{1}{2}\mu_{s}+\frac{3}{8}\sigma_{s}^{2}+\frac{1}{2}\sigma_{s}\eta_{s}\overline{r}_{s}\Big{)}ds+\Big{(}-\eta_{s}\overline{r}_{s}-\frac{1}{2}\sigma_{s}\Big{)}dW_{s}^{1}-\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}dW_{s}^{2}\Bigg{)}\\ &\quad+\gamma_{s}^{\frac{1}{2}}X_{s}\left(\frac{3}{2}\eta_{s}\sigma_{s}\overline{r}_{s}+\frac{1}{2}\sigma_{s}^{2}+\eta_{s}^{2}\right)ds.\end{split} (49)
Proof.

Integration by parts implies that

d(\alpha_{s}\beta_{s})=-\alpha_{s}X_{s}d(\nu_{s}\gamma_{s})+\beta_{s}d(\gamma_{s}^{-\frac{1}{2}}\nu_{s}^{-1})-X_{s}d[\gamma^{-\frac{1}{2}}\nu^{-1},\nu\gamma]_{s},\quad s\in[t,T]. (50)

Furthermore, it holds by integration by parts, (8), (3) and (4) that for all s\in[t,T]

\begin{split}d(\nu_{s}\gamma_{s})&=\nu_{s}d\gamma_{s}+\gamma_{s}\nu_{s}dR_{s}+\gamma_{s}\nu_{s}d[R]_{s}+\nu_{s}d[R,\gamma]_{s}\\ &=\nu_{s}\gamma_{s}\mu_{s}ds+\nu_{s}\gamma_{s}\sigma_{s}dW^{1}_{s}+\nu_{s}\gamma_{s}\rho_{s}ds+\nu_{s}\gamma_{s}\eta_{s}\overline{r}_{s}dW_{s}^{1}+\nu_{s}\gamma_{s}\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}dW_{s}^{2}\\ &\quad+\nu_{s}\gamma_{s}\eta_{s}^{2}ds+\nu_{s}\gamma_{s}\sigma_{s}\eta_{s}\overline{r}_{s}ds\\ &=\nu_{s}\gamma_{s}\Bigg{(}\big{(}\mu_{s}+\rho_{s}+\eta_{s}^{2}+\sigma_{s}\eta_{s}\overline{r}_{s}\big{)}ds+\big{(}\sigma_{s}+\eta_{s}\overline{r}_{s}\big{)}dW_{s}^{1}+\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}dW_{s}^{2}\Bigg{)}.\end{split} (51)

Also by integration by parts, and using (9), (3) and (44), we obtain for all s\in[t,T] that

\begin{split}&d(\gamma_{s}^{-\frac{1}{2}}\nu_{s}^{-1})=-\gamma_{s}^{-\frac{1}{2}}\nu_{s}^{-1}dR_{s}+\nu_{s}^{-1}d\gamma_{s}^{-\frac{1}{2}}-\nu_{s}^{-1}d[R,\gamma^{-\frac{1}{2}}]_{s}\\ &=-\gamma_{s}^{-\frac{1}{2}}\nu_{s}^{-1}\rho_{s}ds-\gamma_{s}^{-\frac{1}{2}}\nu_{s}^{-1}\eta_{s}\overline{r}_{s}dW_{s}^{1}-\gamma_{s}^{-\frac{1}{2}}\nu_{s}^{-1}\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}dW_{s}^{2}\\ &\quad+\gamma_{s}^{-\frac{1}{2}}\nu_{s}^{-1}\left(-\frac{1}{2}\mu_{s}+\frac{3}{8}\sigma_{s}^{2}\right)ds-\frac{1}{2}\gamma_{s}^{-\frac{1}{2}}\nu_{s}^{-1}\sigma_{s}dW_{s}^{1}+\frac{1}{2}\gamma_{s}^{-\frac{1}{2}}\nu_{s}^{-1}\sigma_{s}\eta_{s}\overline{r}_{s}ds\\ &=\alpha_{s}\Bigg{(}\Big{(}-\rho_{s}-\frac{1}{2}\mu_{s}+\frac{3}{8}\sigma_{s}^{2}+\frac{1}{2}\sigma_{s}\eta_{s}\overline{r}_{s}\Big{)}ds+\Big{(}-\eta_{s}\overline{r}_{s}-\frac{1}{2}\sigma_{s}\Big{)}dW_{s}^{1}-\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}dW_{s}^{2}\Bigg{)}.\end{split} (52)

It follows from (51) and (52) for all s\in[t,T] that

\begin{split}d[\gamma^{-\frac{1}{2}}\nu^{-1},\nu\gamma]_{s}&=\gamma_{s}^{\frac{1}{2}}\Big{(}-\eta_{s}\overline{r}_{s}-\frac{1}{2}\sigma_{s}\Big{)}\big{(}\sigma_{s}+\eta_{s}\overline{r}_{s}\big{)}ds-\gamma_{s}^{\frac{1}{2}}\eta_{s}^{2}(1-\overline{r}_{s}^{2})ds\\ &=-\gamma_{s}^{\frac{1}{2}}\left(\frac{3}{2}\eta_{s}\sigma_{s}\overline{r}_{s}+\frac{1}{2}\sigma_{s}^{2}+\eta_{s}^{2}\right)ds.\end{split} (53)

We then plug (51), (52) and (53) into (50), which yields (49). ∎

Proof of Lemma 1.6.

We denote \alpha_{s}=\gamma_{s}^{-\frac{1}{2}}\nu_{s}^{-1}, s\in[t,T], and \beta_{s}=d-\gamma_{t}x-\int_{t}^{s}X_{r}d(\nu_{r}\gamma_{r}), s\in[t,T]. It then holds that \overline{H}_{s}=\alpha_{s}\beta_{s}, s\in[t,T]. We use Lemma 5.1 and substitute -\gamma^{\frac{1}{2}}X=\overline{H}-\gamma^{-\frac{1}{2}}D in (49) to obtain for all s\in[t,T] that

\begin{split}&d\overline{H}_{s}\\ &=\big{(}\overline{H}_{s}-\gamma_{s}^{-\frac{1}{2}}D_{s}\big{)}\Bigg{(}\left(\mu_{s}+\rho_{s}-\frac{1}{2}\sigma_{s}\eta_{s}\overline{r}_{s}-\frac{1}{2}\sigma_{s}^{2}\right)ds+\big{(}\sigma_{s}+\eta_{s}\overline{r}_{s}\big{)}dW_{s}^{1}+\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}dW_{s}^{2}\Bigg{)}\\ &\quad+\overline{H}_{s}\Bigg{(}\Big{(}-\rho_{s}-\frac{1}{2}\mu_{s}+\frac{3}{8}\sigma_{s}^{2}+\frac{1}{2}\sigma_{s}\eta_{s}\overline{r}_{s}\Big{)}ds+\Big{(}-\eta_{s}\overline{r}_{s}-\frac{1}{2}\sigma_{s}\Big{)}dW_{s}^{1}-\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}dW_{s}^{2}\Bigg{)}\\ &=-\gamma_{s}^{-\frac{1}{2}}D_{s}\Bigg{(}\left(\mu_{s}+\rho_{s}-\frac{1}{2}\sigma_{s}\eta_{s}\overline{r}_{s}-\frac{1}{2}\sigma_{s}^{2}\right)ds+\big{(}\sigma_{s}+\eta_{s}\overline{r}_{s}\big{)}dW_{s}^{1}+\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}dW_{s}^{2}\Bigg{)}\\ &\quad+\overline{H}_{s}\Bigg{(}\left(\frac{1}{2}\mu_{s}-\frac{1}{8}\sigma_{s}^{2}\right)ds+\frac{1}{2}\sigma_{s}dW_{s}^{1}\Bigg{)}.\end{split}

This proves the dynamics in (18).

In particular, \overline{H} satisfies an SDE that is linear in \overline{H} and \gamma^{-\frac{1}{2}}D. Furthermore, boundedness of \rho,\mu,\sigma,\eta,\overline{r} implies that the coefficients of the SDE are bounded. Since moreover E[\int_{t}^{T}(\gamma_{s}^{-\frac{1}{2}}D_{s})^{2}ds]<\infty by assumption (A1) and \overline{H}_{t}=\gamma_{t}^{-\frac{1}{2}}d-\gamma_{t}^{\frac{1}{2}}x (cf. (17)) is square integrable, we have that E[\sup_{s\in[t,T]}\overline{H}_{s}^{2}]<\infty (see, e.g., [41, Theorem 3.2.2 and Theorem 3.3.1]).

We next prove that cost functional (16) admits representation (19). To this end, note that by (17) it holds for all s\in[t,T] that

\gamma_{s}\left(X_{s}-\zeta_{s}\right)^{2}=\left(\gamma^{-\frac{1}{2}}_{s}D_{s}-\overline{H}_{s}-\gamma_{s}^{\frac{1}{2}}\zeta_{s}\right)^{2}=\gamma_{s}^{-1}D_{s}^{2}-2\gamma_{s}^{-\frac{1}{2}}D_{s}\left(\overline{H}_{s}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\right)+\left(\overline{H}_{s}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\right)^{2}.

Due to assumption (5) on \zeta and E[\sup_{s\in[t,T]}\overline{H}_{s}^{2}]<\infty, we have that E_{t}[\int_{t}^{T}(\overline{H}_{s}+\gamma_{s}^{\frac{1}{2}}\zeta_{s})^{2}ds]<\infty. This, assumption (A1), and the Cauchy–Schwarz inequality imply that also E_{t}[\int_{t}^{T}\lvert\gamma_{s}^{-\frac{1}{2}}D_{s}(\overline{H}_{s}+\gamma_{s}^{\frac{1}{2}}\zeta_{s})\rvert ds]<\infty. Since \lambda is bounded, we conclude that

\begin{split}E_{t}\left[\int_{t}^{T}\lambda_{s}\gamma_{s}\left(X_{s}-\zeta_{s}\right)^{2}ds\right]&=E_{t}\left[\int_{t}^{T}\lambda_{s}\gamma_{s}^{-1}D_{s}^{2}ds\right]+E_{t}\left[\int_{t}^{T}\lambda_{s}\left(\overline{H}_{s}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\right)^{2}ds\right]\\ &\quad-2E_{t}\left[\int_{t}^{T}\lambda_{s}\gamma_{s}^{-\frac{1}{2}}D_{s}\left(\overline{H}_{s}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\right)ds\right],\end{split} (54)

where all conditional expectations are well-defined and finite. Moreover, (17) implies that \gamma_{T}^{-\frac{1}{2}}D_{T}=\overline{H}_{T}+\gamma_{T}^{\frac{1}{2}}X_{T}, and thus \gamma_{T}^{-1}D_{T}^{2}=(\overline{H}_{T}+\sqrt{\gamma_{T}}\xi)^{2}. Inserting this and (54) into (16), we obtain (19). ∎

Lemma 5.2.

Let t\in[0,T] and x,d\in\mathbb{R}. Then, (20) defines a metric on \mathcal{A}_{t}^{pm}(x,d) (identifying any processes that are equal dP\times ds|_{[t,T]}-a.e.).

Proof.

Note first that it holds for all X,Y\in\mathcal{A}_{t}^{pm}(x,d) that \mathbf{d}(X,Y)\geq 0, and that \mathbf{d}(X,Y) is finite due to (A1). Symmetry of \mathbf{d} is obvious. The triangle inequality follows from the Cauchy–Schwarz inequality.

Let X,Y\in\mathcal{A}_{t}^{pm}(x,d) with associated deviation processes D^{X},D^{Y}.

If X=Y dP\times ds|_{[t,T]}-a.e., then \gamma^{-\frac{1}{2}}D^{X}=\gamma^{-\frac{1}{2}}D^{Y} dP\times ds|_{[t,T]}-a.e., and thus \mathbf{d}(X,Y)=(E[\int_{t}^{T}(\gamma_{s}^{-\frac{1}{2}}D_{s}^{X}-\gamma_{s}^{-\frac{1}{2}}D_{s}^{Y})^{2}ds])^{\frac{1}{2}}=0.

For the other direction, suppose that \mathbf{d}(X,Y)=0. This implies that \gamma^{-\frac{1}{2}}D^{X}-\gamma^{-\frac{1}{2}}D^{Y}=0 dP\times ds|_{[t,T]}-a.e. By the definition of D^{X} and D^{Y}, multiplication by \nu\gamma^{\frac{1}{2}} further yields that \nu_{s}\gamma_{s}(X_{s}-Y_{s})=\int_{t}^{s}(X_{r}-Y_{r})d(\nu_{r}\gamma_{r}) dP\times ds|_{[t,T]}-a.e. Observe that \nu\gamma>0 and consider the stochastic integral equation

K_{s}=\int_{t}^{s}K_{r}\nu_{r}^{-1}\gamma_{r}^{-1}d(\nu_{r}\gamma_{r}),\quad s\in[t,T]. (55)

Define L=(L_{s})_{s\in[0,T]} by L_{0}=0,

dL_{s}=\big{(}\mu_{s}+\rho_{s}+\eta_{s}^{2}+\sigma_{s}\eta_{s}\overline{r}_{s}\big{)}ds+\big{(}\sigma_{s}+\eta_{s}\overline{r}_{s}\big{)}dW_{s}^{1}+\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}dW_{s}^{2},\quad s\in[0,T].

It then follows from (51) that (55) can be written as K_{s}=\int_{t}^{s}K_{r}dL_{r}, s\in[t,T]. This equation has the unique solution K\equiv 0. We therefore conclude that X=Y dP\times ds|_{[t,T]}-a.e. ∎

We now prepare the proof of 1.7. The next result on the scaled hidden deviation is helpful in 1.7 in order to show convergence of the cost functional.

Lemma 5.3.

Let t\in[0,T], x,d\in\mathbb{R}, and X\in\mathcal{A}^{pm}_{t}(x,d) with associated deviation D and scaled hidden deviation \overline{H}. Suppose in addition that (X^{n})_{n\in\mathbb{N}} is a sequence in \mathcal{A}^{pm}_{t}(x,d) such that \lim_{n\to\infty}E[\int_{t}^{T}(D_{s}^{n}-D_{s})^{2}\gamma_{s}^{-1}ds]=0 for the associated deviation processes D^{n}, n\in\mathbb{N}. It then holds for the associated scaled hidden deviation processes \overline{H}^{n}, n\in\mathbb{N}, that \lim_{n\to\infty}E[\sup_{s\in[t,T]}(\overline{H}_{s}^{n}-\overline{H}_{s})^{2}]=0.

Proof.

Define \delta\overline{H}^{n}=\overline{H}^{n}-\overline{H}, n\in\mathbb{N}, and let for n\in\mathbb{N}, s\in[t,T], z\in\mathbb{R}

\begin{split}b^{n}_{s}(z)&=-\frac{1}{2}\Big{(}2(\rho_{s}+\mu_{s})-\sigma_{s}^{2}-\sigma_{s}\eta_{s}\overline{r}_{s}\Big{)}\big{(}\gamma^{-\frac{1}{2}}_{s}D^{n}_{s}-\gamma^{-\frac{1}{2}}_{s}D_{s}\big{)}+\frac{1}{2}\left(\mu_{s}-\frac{1}{4}\sigma_{s}^{2}\right)z,\\ a_{s}^{n}(z)&=\bigg{(}-(\sigma_{s}+\eta_{s}\overline{r}_{s})\big{(}\gamma^{-\frac{1}{2}}_{s}D^{n}_{s}-\gamma^{-\frac{1}{2}}_{s}D_{s}\big{)}+\frac{1}{2}\sigma_{s}z,-\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\big{(}\gamma^{-\frac{1}{2}}_{s}D^{n}_{s}-\gamma^{-\frac{1}{2}}_{s}D_{s}\big{)}\bigg{)}.\end{split}

In view of (18), it then holds for all n\in\mathbb{N} that

d(\delta\overline{H}^{n}_{s})=b_{s}^{n}(\delta\overline{H}_{s}^{n})ds+a_{s}^{n}(\delta\overline{H}_{s}^{n})d\begin{pmatrix}W_{s}^{1}\\ W_{s}^{2}\end{pmatrix},\quad s\in[t,T],\quad\delta\overline{H}_{t}^{n}=0.

Linearity of b^{n}, a^{n}, n\in\mathbb{N}, and boundedness of \mu,\rho,\sigma,\eta,\overline{r} imply that there exists c_{1}\in(0,\infty) such that for all n\in\mathbb{N} and all z_{1},z_{2}\in\mathbb{R} it holds dP\times ds|_{[t,T]}-a.e. that

\lvert b^{n}(z_{1})-b^{n}(z_{2})\rvert+\lVert a^{n}(z_{1})-a^{n}(z_{2})\rVert_{2}\leq\frac{1}{2}\left\lvert\mu-\frac{1}{4}\sigma^{2}\right\rvert\lvert z_{1}-z_{2}\rvert+\frac{1}{2}\lvert\sigma\rvert\lvert z_{1}-z_{2}\rvert\leq c_{1}\lvert z_{1}-z_{2}\rvert.

By boundedness of \mu,\rho,\sigma,\eta,\overline{r} and Jensen's inequality, there exists c_{2}\in(0,\infty) such that for all n\in\mathbb{N},

E[(tT|bsn(0)|𝑑s)2]+E[tTasn(0)22𝑑s]c2E[tT(DsnDs)2γs1𝑑s].E\left[\left(\int_{t}^{T}\lvert b_{s}^{n}(0)\rvert ds\right)^{2}\right]+E\left[\int_{t}^{T}\lVert a^{n}_{s}(0)\rVert_{2}^{2}ds\right]\leq c_{2}E\left[\int_{t}^{T}(D_{s}^{n}-D_{s})^{2}\gamma_{s}^{-1}ds\right].

For instance, [41, Theorem 3.2.2] (see also [41, Theorem 3.4.2]) now implies that there exists $c_{3}\in(0,\infty)$ such that for all $n\in\mathbb{N}$

\[
\begin{split}
E\left[\sup_{s\in[t,T]}\lvert\overline{H}_{s}^{n}-\overline{H}_{s}\rvert^{2}\right]&\leq c_{3}\,E\left[\left(\int_{t}^{T}\lvert b_{s}^{n}(0)\rvert\,ds\right)^{2}+\int_{t}^{T}\lVert a^{n}_{s}(0)\rVert_{2}^{2}\,ds\right]\\
&\leq c_{2}c_{3}\,E\left[\int_{t}^{T}(D_{s}^{n}-D_{s})^{2}\gamma_{s}^{-1}\,ds\right].
\end{split}
\]

The claim follows from the assumption that $\lim_{n\to\infty}E[\int_{t}^{T}(D_{s}^{n}-D_{s})^{2}\gamma_{s}^{-1}\,ds]=0$. ∎
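The moment estimate above can be probed numerically. The following Euler-Maruyama sketch is purely illustrative and not part of the paper's argument: the constant coefficients `c`, `sigma` and the constant inhomogeneities `f`, `g` are hypothetical stand-ins for the bounded coefficients of the proof, and the scalar process `delta` plays the role of $\delta\overline{H}^{n}$, whose drift and diffusion are affine with the inhomogeneity proportional to the deviation error.

```python
import numpy as np

# Illustrative sketch (assumptions: constant coefficients c, sigma and constant
# inhomogeneities f, g; none of these names are from the paper). The difference
# process solves
#   d(delta_s) = (f + c*delta_s) ds + (g + sigma*delta_s) dW_s,  delta_0 = 0,
# and E[sup_{s<=T} delta_s^2] is controlled by the squared inhomogeneity, in
# analogy with the c2*c3 bound of Lemma 5.3.
T, n_steps, n_paths = 1.0, 200, 2000
dt = T / n_steps
c, sigma = 0.5, 0.3  # bounded drift/diffusion slopes

def sup_sq_moment(f, g):
    """Monte Carlo estimate of E[sup_{s<=T} delta_s^2] (fixed seed)."""
    rng = np.random.default_rng(0)
    delta = np.zeros(n_paths)
    run_max = np.zeros(n_paths)
    for _ in range(n_steps):
        dW = rng.normal(0.0, np.sqrt(dt), size=n_paths)
        delta = delta + (f + c * delta) * dt + (g + sigma * delta) * dW
        run_max = np.maximum(run_max, delta**2)
    return run_max.mean()

# No inhomogeneity: delta stays at 0, matching a vanishing right-hand side.
assert sup_sq_moment(0.0, 0.0) == 0.0
# The scheme is linear in (f, g), so the sup-moment scales quadratically.
m_big, m_small = sup_sq_moment(0.1, 0.1), sup_sq_moment(0.01, 0.01)
assert m_small < m_big
assert np.isclose(m_big, 100.0 * m_small)
```

With identical Brownian increments the map $(f,g)\mapsto\delta$ is linear, which is what makes the quadratic scaling exact here; the lemma's conclusion is the limit statement as the input error tends to $0$.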

In order to establish the existence of an appropriate approximating sequence in Theorem 1.7, we rely on Lemma 5.4 below. For its statement and for the proof of the second part of Theorem 1.7, we introduce a process $Z=(Z_{s})_{s\in[0,T]}$ defined by

\[
Z_{s}=\exp\left(-\int_{0}^{s}\left(\frac{1}{2}\sigma_{r}+\eta_{r}\overline{r}_{r}\right)dW^{1}_{r}-\int_{0}^{s}\eta_{r}\sqrt{1-\overline{r}_{r}^{2}}\,dW_{r}^{2}\right),\quad s\in[0,T].
\tag{56}
\]

Observe that by Itô’s lemma, $Z$ solves the SDE

\[
\begin{split}
&dZ_{s}=\frac{Z_{s}}{2}\left(\left(\frac{1}{2}\sigma_{s}+\eta_{s}\overline{r}_{s}\right)^{2}+\eta_{s}^{2}(1-\overline{r}_{s}^{2})\right)ds-Z_{s}\left(\frac{1}{2}\sigma_{s}+\eta_{s}\overline{r}_{s},\,\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\right)d\begin{pmatrix}W^{1}_{s}\\ W_{s}^{2}\end{pmatrix},\\
&s\in[0,T],\qquad Z_{0}=1.
\end{split}
\tag{57}
\]
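For the reader's convenience, the Itô computation behind (57) can be spelled out. Writing $Y$ for the stochastic integral in the exponent of (56), so that $Z=e^{Y}$, Itô's lemma gives

\[
dZ_{s}=Z_{s}\,dY_{s}+\frac{1}{2}Z_{s}\,d[Y]_{s}
=\frac{Z_{s}}{2}\Big(\Big(\frac{1}{2}\sigma_{s}+\eta_{s}\overline{r}_{s}\Big)^{2}+\eta_{s}^{2}(1-\overline{r}_{s}^{2})\Big)ds-Z_{s}\Big(\frac{1}{2}\sigma_{s}+\eta_{s}\overline{r}_{s}\Big)dW^{1}_{s}-Z_{s}\,\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\,dW^{2}_{s},
\]

which is (57) written componentwise.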
Lemma 5.4.

Let $t\in[0,T]$ and let $u=(u_{s})_{s\in[t,T]}\in\mathcal{L}_{t}^{2}$. Then there exists a sequence $(v^{n})_{n\in\mathbb{N}}$ of bounded càdlàg finite variation processes such that

\[
\lim_{n\to\infty}E\left[\int_{t}^{T}\left(\frac{u_{s}}{Z_{s}}-v_{s}^{n}\right)^{2}Z_{s}^{2}\,ds\right]=0.
\]

In particular, for the sequence of processes $(u^{n})_{n\in\mathbb{N}}$ defined by $u^{n}=v^{n}Z$, $n\in\mathbb{N}$, it holds for all $n\in\mathbb{N}$ that $u^{n}$ is a càdlàg semimartingale with $E[\sup_{s\in[t,T]}\lvert u_{s}^{n}\rvert^{p}]<\infty$ for any $p\geq 2$ (in particular, $u^{n}\in\mathcal{L}_{t}^{2}$), and that $\lim_{n\to\infty}E[\int_{t}^{T}(u_{s}-u_{s}^{n})^{2}\,ds]=0$.

Proof.

Define $A=(A_{s})_{s\in[0,T]}$ by $A_{s}=\int_{0}^{s}Z_{r}^{2}\,dr$, $s\in[0,T]$. Moreover, let $v=(v_{s})_{s\in[t,T]}$ be defined by $v_{s}=\frac{u_{s}}{Z_{s}}$, $s\in[t,T]$. We verify the assumptions of Lemma 2.7 in Section 3.2 of [33]. The process $A$ is continuous, adapted, and nondecreasing. Note that boundedness of $\sigma$, $\eta$, and $\overline{r}$ implies that the coefficients of (57) are bounded. It follows for any $p\geq 2$ that $E[\sup_{s\in[0,T]}\lvert Z_{s}\rvert^{p}]<\infty$ (see, e.g., [41, Theorem 3.4.3]), and hence $E[A_{T}]=E[\int_{0}^{T}Z_{r}^{2}\,dr]<\infty$. Since $u\in\mathcal{L}_{t}^{2}$, the process $v$ is progressively measurable and satisfies $E[\int_{t}^{T}v_{s}^{2}\,dA_{s}]=E[\int_{t}^{T}u_{s}^{2}\,ds]<\infty$. Thus, Lemma 2.7 in Section 3.2 of [33] applies and yields a sequence $(\hat{v}^{n})_{n\in\mathbb{N}}$ of (càglàd) simple processes $\hat{v}^{n}=(\hat{v}_{s}^{n})_{s\in[t,T]}$, $n\in\mathbb{N}$, such that $\lim_{n\to\infty}E[\int_{t}^{T}(v_{s}-\hat{v}_{s}^{n})^{2}\,dA_{s}]=0$. Define $v_{s}^{n}(\omega)=\lim_{r\downarrow s}\hat{v}_{r}^{n}(\omega)$, $s\in[t,T)$, $\omega\in\Omega$, $n\in\mathbb{N}$, and $v_{T}^{n}=0$, $n\in\mathbb{N}$. Then $(v^{n})_{n\in\mathbb{N}}$ is a sequence of bounded càdlàg finite variation processes such that $\lim_{n\to\infty}E[\int_{t}^{T}(v_{s}-v_{s}^{n})^{2}\,dA_{s}]=0$. Note that for each $n\in\mathbb{N}$, the process $u^{n}=(u_{s}^{n})_{s\in[t,T]}$ defined by $u^{n}_{s}=v^{n}_{s}Z_{s}$, $s\in[t,T]$, is càdlàg. Since $v^{n}$ is bounded for all $n\in\mathbb{N}$ and $E[\sup_{s\in[0,T]}\lvert Z_{s}\rvert^{p}]<\infty$ for any $p\geq 2$, we have that $E[\sup_{s\in[t,T]}\lvert u_{s}^{n}\rvert^{p}]$ is finite for all $n\in\mathbb{N}$ and any $p\geq 2$.
It furthermore holds that $E[\int_{t}^{T}(u_{s}-u_{s}^{n})^{2}\,ds]=E[\int_{t}^{T}(v_{s}-v_{s}^{n})^{2}\,dA_{s}]\to 0$ as $n\to\infty$. ∎
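The approximation step via [33, Section 3.2, Lemma 2.7] can be illustrated on a deterministic toy path. In the sketch below (our construction, not the paper's), $v$ is a fixed continuous path, $dA_s=ds$, and freezing $v$ at the left endpoint of each subinterval produces càdlàg step functions whose $L^2$ error vanishes under refinement:

```python
import numpy as np

# Illustrative sketch (assumptions: v is the deterministic path sin(2*pi*s)
# standing in for v = u/Z, and dA_s = ds), in the spirit of Lemma 5.4.
s = np.linspace(0.0, 1.0, 10_001)
v = np.sin(2 * np.pi * s)

def l2_error(n_pieces):
    """L^2(ds) error when v is frozen at the left endpoint of each piece."""
    knots = np.floor(s * n_pieces) / n_pieces   # left endpoint of each piece
    v_step = np.sin(2 * np.pi * knots)          # cadlag step approximation
    return float(np.sqrt(np.mean((v - v_step) ** 2)))

errors = [l2_error(n) for n in (4, 16, 64)]
# Refining the partition drives the L^2 error to 0, as in the lemma.
assert errors[0] > errors[1] > errors[2]
assert errors[2] < 0.1
```

For a merely progressively measurable $v$ the argument is of course measure-theoretic rather than pathwise, which is why the lemma routes through simple processes.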

For the part of Theorem 1.7 on completeness of $(\mathcal{A}_{t}^{pm}(x,d),\mathbf{d})$ we show how to construct an execution strategy $X^{0}\in\mathcal{A}_{t}^{pm}(x,d)$ from a square integrable process $u^{0}$ and a process $H^{0}$ satisfying SDE (18) (with $u^{0}$ in place of $\gamma^{-\frac{1}{2}}D$). This result is also crucial for Lemma 2.2.

Lemma 5.5.

Let $t\in[0,T]$ and $x,d\in\mathbb{R}$. Suppose that $u^{0}=(u^{0}_{s})_{s\in[t,T]}\in\mathcal{L}_{t}^{2}$, and let $H^{0}=(H^{0}_{s})_{s\in[t,T]}$ be given by $H^{0}_{t}=\frac{d}{\sqrt{\gamma_{t}}}-\sqrt{\gamma_{t}}\,x$,

\[
\begin{split}
dH^{0}_{s}&=\left(\frac{1}{2}\left(\mu_{s}-\frac{1}{4}\sigma_{s}^{2}\right)H^{0}_{s}-\frac{1}{2}\left(2(\rho_{s}+\mu_{s})-\sigma_{s}^{2}-\sigma_{s}\eta_{s}\overline{r}_{s}\right)u^{0}_{s}\right)ds\\
&\quad+\left(\frac{1}{2}\sigma_{s}H^{0}_{s}-(\sigma_{s}+\eta_{s}\overline{r}_{s})u^{0}_{s}\right)dW^{1}_{s}-\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\,u^{0}_{s}\,dW_{s}^{2},\quad s\in[t,T].
\end{split}
\tag{58}
\]

Define $X^{0}=(X^{0}_{s})_{s\in[t-,T]}$ by $X^{0}_{s}=\gamma_{s}^{-\frac{1}{2}}(u^{0}_{s}-H^{0}_{s})$, $s\in[t,T)$, $X^{0}_{t-}=x$, $X^{0}_{T}=\xi$. Then $X^{0}\in\mathcal{A}_{t}^{pm}(x,d)$, and the associated deviation process $D^{0}=(D^{0}_{s})_{s\in[t-,T]}$ satisfies $D^{0}=\gamma X^{0}+\gamma^{\frac{1}{2}}H^{0}$.

Proof.

First, $X^{0}$ is progressively measurable and has initial value $X^{0}_{t-}=x$ and terminal value $X^{0}_{T}=\xi$. Furthermore, it holds that

\[
\int_{t}^{T}(X_{s}^{0})^{2}\,ds\leq 2\int_{t}^{T}\gamma_{s}^{-1}(u_{s}^{0})^{2}\,ds+2\int_{t}^{T}\gamma_{s}^{-1}(H^{0}_{s})^{2}\,ds<\infty\quad\text{a.s.}
\]

since $\gamma$ and $H^{0}$ have a.s. continuous paths and $E[\int_{t}^{T}(u_{s}^{0})^{2}\,ds]<\infty$. We are therefore able to define $D^{0}$ by (14). Moreover, denote $\alpha_{s}=\gamma_{s}^{-\frac{1}{2}}\nu_{s}^{-1}$, $s\in[t,T]$, and $\beta_{s}=d-\gamma_{t}x-\int_{t}^{s}X^{0}_{r}\,d(\nu_{r}\gamma_{r})$, $s\in[t,T]$. It follows from Lemma 5.1 and $-\gamma_{s}^{\frac{1}{2}}X^{0}_{s}=H^{0}_{s}-u^{0}_{s}$, $s\in[t,T)$, that for all $s\in[t,T]$

\[
\begin{split}
&d(\alpha_{s}\beta_{s})\\
&=(H^{0}_{s}-u^{0}_{s})\Bigg(\Big(\mu_{s}+\rho_{s}-\frac{1}{2}\sigma_{s}\eta_{s}\overline{r}_{s}-\frac{1}{2}\sigma_{s}^{2}\Big)ds+\big(\sigma_{s}+\eta_{s}\overline{r}_{s}\big)dW_{s}^{1}+\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\,dW_{s}^{2}\Bigg)\\
&\quad+\alpha_{s}\beta_{s}\Bigg(\Big(-\rho_{s}-\frac{1}{2}\mu_{s}+\frac{3}{8}\sigma_{s}^{2}+\frac{1}{2}\sigma_{s}\eta_{s}\overline{r}_{s}\Big)ds+\Big(-\eta_{s}\overline{r}_{s}-\frac{1}{2}\sigma_{s}\Big)dW_{s}^{1}-\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\,dW_{s}^{2}\Bigg).
\end{split}
\]

We combine this with

\[
\begin{split}
dH^{0}_{s}&=-u^{0}_{s}\Bigg(\Big(\mu_{s}+\rho_{s}-\frac{1}{2}\sigma_{s}\eta_{s}\overline{r}_{s}-\frac{1}{2}\sigma_{s}^{2}\Big)ds+(\sigma_{s}+\eta_{s}\overline{r}_{s})\,dW_{s}^{1}+\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\,dW_{s}^{2}\Bigg)\\
&\quad+H^{0}_{s}\Bigg(\Big(\frac{1}{2}\mu_{s}-\frac{1}{8}\sigma_{s}^{2}\Big)ds+\frac{1}{2}\sigma_{s}\,dW_{s}^{1}\Bigg),\quad s\in[t,T],
\end{split}
\]

to obtain for all $s\in[t,T]$ that

\[
\begin{split}
d(\alpha_{s}\beta_{s}-H^{0}_{s})&=(\alpha_{s}\beta_{s}-H^{0}_{s})\Bigg(\Big(-\rho_{s}-\frac{1}{2}\mu_{s}+\frac{3}{8}\sigma_{s}^{2}+\frac{1}{2}\sigma_{s}\eta_{s}\overline{r}_{s}\Big)ds+\Big(-\eta_{s}\overline{r}_{s}-\frac{1}{2}\sigma_{s}\Big)dW_{s}^{1}\\
&\qquad\qquad\qquad\qquad-\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\,dW_{s}^{2}\Bigg).
\end{split}
\tag{59}
\]
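Equation (59) says that $d(\alpha_{s}\beta_{s}-H^{0}_{s})=(\alpha_{s}\beta_{s}-H^{0}_{s})\,dN_{s}$ for an auxiliary continuous semimartingale $N$ (our notation), so, with $\mathcal{E}$ denoting the stochastic exponential, its unique solution is

\[
\alpha_{s}\beta_{s}-H^{0}_{s}=(\alpha_{t}\beta_{t}-H^{0}_{t})\,\mathcal{E}(N)_{s},\qquad
N_{s}=\int_{t}^{s}\Big(-\rho_{r}-\frac{1}{2}\mu_{r}+\frac{3}{8}\sigma_{r}^{2}+\frac{1}{2}\sigma_{r}\eta_{r}\overline{r}_{r}\Big)dr-\int_{t}^{s}\Big(\eta_{r}\overline{r}_{r}+\frac{1}{2}\sigma_{r}\Big)dW^{1}_{r}-\int_{t}^{s}\eta_{r}\sqrt{1-\overline{r}_{r}^{2}}\,dW^{2}_{r};
\]

in particular, the process vanishes identically as soon as it vanishes at time $t$.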

Note that $\alpha_{t}\beta_{t}=\gamma_{t}^{-\frac{1}{2}}d-\gamma_{t}^{\frac{1}{2}}x=H^{0}_{t}$. Since (59) is a linear SDE with initial value $\alpha_{t}\beta_{t}-H^{0}_{t}=0$, we conclude that $\alpha_{s}\beta_{s}-H^{0}_{s}=0$, $s\in[t,T]$, and hence $H^{0}_{s}=\gamma^{-\frac{1}{2}}_{s}\nu_{s}^{-1}(d-\gamma_{t}x-\int_{t}^{s}X^{0}_{r}\,d(\nu_{r}\gamma_{r}))$, $s\in[t,T]$. This implies that $D^{0}=\gamma X^{0}+\gamma^{\frac{1}{2}}H^{0}$, i.e., $D^{0}_{s}=\gamma_{s}^{\frac{1}{2}}u^{0}_{s}$, $s\in[t,T)$, and $D^{0}_{T}=\gamma_{T}\xi+\gamma_{T}^{\frac{1}{2}}H^{0}_{T}$. The fact that $E[\int_{t}^{T}(u_{s}^{0})^{2}\,ds]<\infty$ then immediately yields that (A1) holds. This proves that $X^{0}\in\mathcal{A}_{t}^{pm}(x,d)$. ∎

We are finally able to prove Theorem 1.7.

Proof of Theorem 1.7.

(i) Denote by $D$, $D^{n}$, $n\in\mathbb{N}$, the deviation processes associated to $X$, $X^{n}$, $n\in\mathbb{N}$, and let $\overline{H}$ and $\overline{H}^{n}$, $n\in\mathbb{N}$, be the scaled hidden deviation processes. By Lemma 1.6 it holds for all $n\in\mathbb{N}$ that

\[
\begin{split}
\left\lvert J^{pm}_{t}(x,d,X^{n})-J^{pm}_{t}(x,d,X)\right\rvert&=\Bigg\lvert\frac{1}{2}E_{t}\left[\int_{t}^{T}\gamma_{s}^{-1}\left((D_{s}^{n})^{2}-D_{s}^{2}\right)2(\kappa_{s}+\lambda_{s})\,ds\right]\\
&\quad-2E_{t}\left[\int_{t}^{T}\lambda_{s}\gamma_{s}^{-\frac{1}{2}}\left(D_{s}^{n}\big(\overline{H}_{s}^{n}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\big)-D_{s}\big(\overline{H}_{s}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\big)\right)ds\right]\\
&\quad+E_{t}\left[\int_{t}^{T}\lambda_{s}\left(\big(\overline{H}_{s}^{n}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\big)^{2}-\big(\overline{H}_{s}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\big)^{2}\right)ds\right]\\
&\quad+\frac{1}{2}E_{t}\left[(\overline{H}_{T}^{n}+\gamma_{T}^{\frac{1}{2}}\xi)^{2}-(\overline{H}_{T}+\gamma_{T}^{\frac{1}{2}}\xi)^{2}\right]\Bigg\rvert.
\end{split}
\]

Boundedness of $\lambda,\overline{r},\rho,\mu,\eta$, and $\sigma$ implies (recall also (12)) that there exists some $c\in(0,\infty)$ such that for all $n\in\mathbb{N}$ it holds that

\[
\begin{split}
&E\left[\left\lvert J^{pm}_{t}(x,d,X^{n})-J^{pm}_{t}(x,d,X)\right\rvert\right]\\
&\leq E\left[\left\lvert(\overline{H}_{T}^{n}+\gamma_{T}^{\frac{1}{2}}\xi)^{2}-(\overline{H}_{T}+\gamma_{T}^{\frac{1}{2}}\xi)^{2}\right\rvert\right]+c\,E\left[\int_{t}^{T}\left\lvert\gamma_{s}^{-1}\left((D_{s}^{n})^{2}-D_{s}^{2}\right)\right\rvert ds\right]\\
&\quad+c\,E\left[\int_{t}^{T}\left\lvert\gamma_{s}^{-\frac{1}{2}}\left(D_{s}^{n}\big(\overline{H}_{s}^{n}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\big)-D_{s}\big(\overline{H}_{s}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\big)\right)\right\rvert ds\right]\\
&\quad+c\,E\left[\int_{t}^{T}\left\lvert\big(\overline{H}_{s}^{n}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\big)^{2}-\big(\overline{H}_{s}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\big)^{2}\right\rvert ds\right].
\end{split}
\tag{60}
\]

We treat the terminal costs first. It holds for all $n\in\mathbb{N}$ that

\[
\begin{split}
E\left[\left\lvert(\overline{H}_{T}^{n}+\gamma_{T}^{\frac{1}{2}}\xi)^{2}-(\overline{H}_{T}+\gamma_{T}^{\frac{1}{2}}\xi)^{2}\right\rvert\right]&=E\left[\left\lvert(\overline{H}_{T}^{n})^{2}+2\overline{H}_{T}^{n}\gamma_{T}^{\frac{1}{2}}\xi-\overline{H}_{T}^{2}-2\overline{H}_{T}\gamma_{T}^{\frac{1}{2}}\xi\right\rvert\right]\\
&\leq E\left[\left\lvert(\overline{H}_{T}^{n})^{2}-\overline{H}_{T}^{2}\right\rvert\right]+2E\left[\left\lvert(\overline{H}^{n}_{T}-\overline{H}_{T})\gamma_{T}^{\frac{1}{2}}\xi\right\rvert\right]\\
&\leq E\left[\left\lvert(\overline{H}_{T}^{n})^{2}-\overline{H}_{T}^{2}\right\rvert\right]+2\left(E\left[(\overline{H}^{n}_{T}-\overline{H}_{T})^{2}\right]\right)^{\frac{1}{2}}\left(E\left[\gamma_{T}\xi^{2}\right]\right)^{\frac{1}{2}}.
\end{split}
\]

From

\[
\lim_{n\to\infty}E\left[\int_{t}^{T}(D_{s}^{n}-D_{s})^{2}\gamma_{s}^{-1}\,ds\right]=0
\tag{61}
\]

(cf. (20)) and Lemma 5.3 we have that

\[
\lim_{n\to\infty}E\left[\sup_{s\in[t,T]}\lvert\overline{H}_{s}^{n}-\overline{H}_{s}\rvert^{2}\right]=0.
\tag{62}
\]

Since furthermore $E[\gamma_{T}\xi^{2}]<\infty$, we obtain that $\lim_{n\to\infty}E[\lvert(\overline{H}_{T}^{n}+\gamma_{T}^{\frac{1}{2}}\xi)^{2}-(\overline{H}_{T}+\gamma_{T}^{\frac{1}{2}}\xi)^{2}\rvert]=0$. The second term in (60) converges to $0$ by (61). For the third term in (60) we have for all $n\in\mathbb{N}$ that

\[
\begin{split}
&E\left[\int_{t}^{T}\left\lvert\gamma_{s}^{-\frac{1}{2}}\left(D_{s}^{n}\big(\overline{H}_{s}^{n}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\big)-D_{s}\big(\overline{H}_{s}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\big)\right)\right\rvert ds\right]\\
&\leq E\left[\int_{t}^{T}\left(\big\lvert\overline{H}_{s}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\big\rvert\,\lvert D_{s}^{n}-D_{s}\rvert\,\gamma_{s}^{-\frac{1}{2}}+\gamma_{s}^{-\frac{1}{2}}\lvert D_{s}^{n}\rvert\,\lvert\overline{H}_{s}^{n}-\overline{H}_{s}\rvert\right)ds\right]\\
&\leq\left(E\left[\int_{t}^{T}\big(\overline{H}_{s}+\gamma_{s}^{\frac{1}{2}}\zeta_{s}\big)^{2}\,ds\right]\right)^{\frac{1}{2}}\left(E\left[\int_{t}^{T}(D_{s}^{n}-D_{s})^{2}\gamma_{s}^{-1}\,ds\right]\right)^{\frac{1}{2}}\\
&\quad+\left(E\left[\int_{t}^{T}\gamma_{s}^{-1}(D_{s}^{n})^{2}\,ds\right]\right)^{\frac{1}{2}}T^{\frac{1}{2}}\left(E\left[\sup_{s\in[t,T]}\lvert\overline{H}_{s}^{n}-\overline{H}_{s}\rvert^{2}\right]\right)^{\frac{1}{2}}.
\end{split}
\tag{63}
\]

By Lemma 1.6 and (5) it holds that $E[\int_{t}^{T}(\overline{H}_{s}+\gamma_{s}^{\frac{1}{2}}\zeta_{s})^{2}\,ds]<\infty$. Moreover, due to (61), $E[\int_{t}^{T}\gamma_{s}^{-1}(D_{s}^{n})^{2}\,ds]$ is uniformly bounded in $n\in\mathbb{N}$. It thus follows from (61), (62), and (63) that the third term in (60) converges to $0$ as $n\to\infty$. The last term in (60) converges to $0$ by (5) and (62). This proves claim (i).

(ii) Suppose that $X\in\mathcal{A}^{pm}_{t}(x,d)$. Let $u=(u_{s})_{s\in[t,T]}$ be defined by $u_{s}=\gamma_{s}^{-\frac{1}{2}}D_{s}$, $s\in[t,T]$, where $D$ denotes the deviation associated to $X$. Then $u$ is a progressively measurable process and, due to assumption (A1), it holds that $E[\int_{t}^{T}u_{s}^{2}\,ds]<\infty$. By Lemma 5.4 there exists a sequence of bounded càdlàg finite variation processes $(v^{n})_{n\in\mathbb{N}}$ such that $\lim_{n\to\infty}E[\int_{t}^{T}(\frac{u_{s}}{Z_{s}}-v_{s}^{n})^{2}Z_{s}^{2}\,ds]=0$, where $Z$ is defined in (56). Set $u^{n}=v^{n}Z$, $n\in\mathbb{N}$. This is a sequence of càdlàg semimartingales in $\mathcal{L}_{t}^{2}$ that satisfies $\lim_{n\to\infty}\lVert u-u^{n}\rVert_{\mathcal{L}_{t}^{2}}=0$. Moreover, it holds for all $n\in\mathbb{N}$ and any $p\geq 2$ that $E[\sup_{s\in[t,T]}\lvert u_{s}^{n}\rvert^{p}]<\infty$. For each $u^{n}$, $n\in\mathbb{N}$, let $H^{n}=(H^{n}_{s})_{s\in[t,T]}$ be the solution of (58). We then define a sequence of càdlàg semimartingales $X^{n}=(X^{n}_{s})_{s\in[t-,T]}$, $n\in\mathbb{N}$, by $X^{n}_{s}=\gamma_{s}^{-\frac{1}{2}}(u_{s}^{n}-H_{s}^{n})$, $s\in[t,T)$, $X_{t-}^{n}=x$, $X_{T}^{n}=\xi$. By Lemma 5.5 we have for all $n\in\mathbb{N}$ that $X^{n}\in\mathcal{A}_{t}^{pm}(x,d)$ and that $D^{n}=\gamma X^{n}+\gamma^{\frac{1}{2}}H^{n}$ for the associated deviation process $D^{n}=(D^{n}_{s})_{s\in[t-,T]}$. It follows for all $n\in\mathbb{N}$ that $D_{s}^{n}=\gamma_{s}^{\frac{1}{2}}u_{s}^{n}$, $s\in[t,T)$. Therefore, it holds for all $n\in\mathbb{N}$ that

\[
\mathbf{d}(X^{n},X)=\left(E\left[\int_{t}^{T}(D_{s}^{n}-D_{s})^{2}\gamma_{s}^{-1}\,ds\right]\right)^{\frac{1}{2}}=\left(E\left[\int_{t}^{T}(u_{s}^{n}-u_{s})^{2}\,ds\right]\right)^{\frac{1}{2}}.
\]

Due to $\lim_{n\to\infty}\lVert u-u^{n}\rVert_{\mathcal{L}_{t}^{2}}=0$, we thus have that $\lim_{n\to\infty}\mathbf{d}(X^{n},X)=0$. We next show that $X^{n}$ has finite variation for each $n\in\mathbb{N}$. To this end, we observe that for all $n\in\mathbb{N}$ and $s\in[t,T)$ it holds by integration by parts that

\[
dX_{s}^{n}=\gamma_{s}^{-\frac{1}{2}}\,d(u_{s}^{n}-H_{s}^{n})+(u_{s}^{n}-H_{s}^{n})\,d\gamma_{s}^{-\frac{1}{2}}+d[\gamma^{-\frac{1}{2}},u^{n}-H^{n}]_{s}.
\tag{64}
\]

Again by integration by parts, and using (57), we have for all $n\in\mathbb{N}$ and $s\in[t,T]$ that

\[
\begin{split}
du_{s}^{n}&=v_{s}^{n}\,dZ_{s}+Z_{s}\,dv_{s}^{n}+d[v^{n},Z]_{s}\\
&=\frac{1}{2}u_{s}^{n}\bigg(\Big(\frac{1}{2}\sigma_{s}+\eta_{s}\overline{r}_{s}\Big)^{2}+\eta_{s}^{2}(1-\overline{r}_{s}^{2})\bigg)ds-u_{s}^{n}\Big(\frac{1}{2}\sigma_{s}+\eta_{s}\overline{r}_{s}\Big)dW_{s}^{1}-u_{s}^{n}\eta_{s}\sqrt{1-\overline{r}_{s}^{2}}\,dW_{s}^{2}+Z_{s}\,dv_{s}^{n}.
\end{split}
\]

This and (58) yield for all $n\in\mathbb{N}$ and $s\in[t,T]$ that

\[
\begin{split}
\gamma_{s}^{-\frac{1}{2}}\,d(u_{s}^{n}-H_{s}^{n})&=\gamma_{s}^{-\frac{1}{2}}\left(\rho_{s}+\mu_{s}+\frac{1}{2}\eta_{s}^{2}-\frac{3}{8}\sigma_{s}^{2}\right)u_{s}^{n}\,ds-\gamma_{s}^{-\frac{1}{2}}\left(\frac{1}{2}\mu_{s}-\frac{1}{8}\sigma_{s}^{2}\right)H_{s}^{n}\,ds\\
&\quad+\gamma_{s}^{-\frac{1}{2}}\frac{1}{2}\sigma_{s}(u_{s}^{n}-H_{s}^{n})\,dW_{s}^{1}+\gamma_{s}^{-\frac{1}{2}}Z_{s}\,dv_{s}^{n}.
\end{split}
\tag{65}
\]

Moreover, it follows from (44) for all $n\in\mathbb{N}$ and $s\in[t,T]$ that

\[
(u_{s}^{n}-H_{s}^{n})\,d\gamma_{s}^{-\frac{1}{2}}=(u_{s}^{n}-H_{s}^{n})\gamma_{s}^{-\frac{1}{2}}\left(-\frac{1}{2}\mu_{s}+\frac{3}{8}\sigma_{s}^{2}\right)ds-(u_{s}^{n}-H_{s}^{n})\gamma_{s}^{-\frac{1}{2}}\frac{1}{2}\sigma_{s}\,dW_{s}^{1}.
\tag{66}
\]

We combine (64), (65), and (66) to obtain for all $n\in\mathbb{N}$ and $s\in(t,T)$ that

\[
dX_{s}^{n}=\gamma_{s}^{-\frac{1}{2}}u_{s}^{n}\left(\rho_{s}+\frac{1}{2}\mu_{s}+\frac{1}{2}\eta_{s}^{2}\right)ds-\gamma_{s}^{-\frac{1}{2}}H_{s}^{n}\,\frac{1}{4}\sigma_{s}^{2}\,ds+\gamma_{s}^{-\frac{1}{2}}Z_{s}\,dv_{s}^{n}+d[\gamma^{-\frac{1}{2}},u^{n}-H^{n}]_{s}.
\]

Since $v^{n}$ has finite variation for all $n\in\mathbb{N}$, this representation shows that $X^{n}$ has finite variation for all $n\in\mathbb{N}$ as well. Note that for all $n\in\mathbb{N}$, by Lemma 1.3, the process (6) associated to the càdlàg finite variation process $X^{n}$ is nothing but $D^{n}$. Since $\eta$ is bounded, there exists $c\in(0,\infty)$ such that for all $n\in\mathbb{N}$

\[
E\left[\left(\int_{t}^{T}(D_{s}^{n})^{4}\gamma_{s}^{-2}\eta_{s}^{2}\,ds\right)^{\frac{1}{2}}\right]=E\left[\left(\int_{t}^{T}(u_{s}^{n})^{4}\eta_{s}^{2}\,ds\right)^{\frac{1}{2}}\right]\leq c\,E\Big[\sup_{s\in[t,T]}(u_{s}^{n})^{2}\Big]<\infty.
\]

This implies (A2). Similarly, by boundedness of $\sigma$, we obtain (A3). We thus conclude that $X^{n}\in\mathcal{A}_{t}^{fv}(x,d)$ for all $n\in\mathbb{N}$.

(iii) Let $(X^{n})_{n\in\mathbb{N}}$ be a Cauchy sequence in $(\mathcal{A}_{t}^{pm}(x,d),\mathbf{d})$. For $n\in\mathbb{N}$ we denote by $D^{n}$ the deviation process associated to $X^{n}$. Then $(\gamma^{-\frac{1}{2}}D^{n})_{n\in\mathbb{N}}$ is a Cauchy sequence in $(\mathcal{L}_{t}^{2},\lVert\cdot\rVert_{\mathcal{L}_{t}^{2}})$. Since $(\mathcal{L}_{t}^{2},\lVert\cdot\rVert_{\mathcal{L}_{t}^{2}})$ is complete (see, e.g., Lemma 2.2 in Section 3.2 of [33]), there exists $u^{0}\in\mathcal{L}_{t}^{2}$ such that $\lim_{n\to\infty}\lVert\gamma^{-\frac{1}{2}}D^{n}-u^{0}\rVert_{\mathcal{L}_{t}^{2}}=0$. Define $X^{0}=(X^{0}_{s})_{s\in[t-,T]}$ by $X_{t-}^{0}=x$, $X_{T}^{0}=\xi$, $X_{s}^{0}=\gamma_{s}^{-\frac{1}{2}}(u_{s}^{0}-H_{s}^{0})$, $s\in[t,T)$, where $H^{0}$ is given by (58). By Lemma 5.5 it holds that $X^{0}\in\mathcal{A}_{t}^{pm}(x,d)$. We furthermore obtain from Lemma 5.5 that the associated deviation satisfies $D^{0}=\gamma X^{0}+\gamma^{\frac{1}{2}}H^{0}$. By definition of $X^{0}$, this yields $\gamma_{s}^{-\frac{1}{2}}D_{s}^{0}=u_{s}^{0}$, $s\in[t,T)$. It follows that

\[
\mathbf{d}(X^{n},X^{0})=\left(E\left[\int_{t}^{T}(\gamma_{s}^{-\frac{1}{2}}D^{n}_{s}-\gamma_{s}^{-\frac{1}{2}}D_{s}^{0})^{2}\,ds\right]\right)^{\frac{1}{2}}=\lVert\gamma^{-\frac{1}{2}}D^{n}-u^{0}\rVert_{\mathcal{L}_{t}^{2}},
\]

and hence $\lim_{n\to\infty}\mathbf{d}(X^{n},X^{0})=0$. ∎

Proof of Lemma 2.1.

By definition of $u$ we have that $u$ is progressively measurable and, due to assumption (A1), satisfies $E[\int_{t}^{T}u_{s}^{2}\,ds]<\infty$; hence, $u\in\mathcal{L}_{t}^{2}$.

Let $\overline{H}_{s}=\gamma_{s}^{-\frac{1}{2}}D_{s}-\gamma_{s}^{\frac{1}{2}}X_{s}$, $s\in[t,T]$, be the scaled hidden deviation (17) associated to $X$. We can substitute $u=\gamma^{-\frac{1}{2}}D$ in the cost functional (19) and also in the dynamics (18) of $\overline{H}$. Observe that $\overline{H}$ follows the same dynamics as the state process $\widetilde{H}$ associated to $u$ (see (22)), and that $\overline{H}_{t}=\frac{d}{\sqrt{\gamma_{t}}}-\sqrt{\gamma_{t}}\,x=\widetilde{H}_{t}$. Therefore, $\overline{H}$ and $\widetilde{H}$ coincide, which completes the proof. ∎

Proof of Lemma 2.2.

It follows from Lemma 5.5 that $X\in\mathcal{A}_{t}^{pm}(x,d)$. Moreover, we have from Lemma 5.5 that the associated deviation satisfies $D=\gamma X+\gamma^{\frac{1}{2}}\widetilde{H}$, i.e., $D_{s}=\gamma_{s}^{\frac{1}{2}}u_{s}$, $s\in[t,T)$, and that $\widetilde{H}$ is the scaled hidden deviation of $X$. It thus holds that $J^{pm}_{t}(x,d,X)$ is given by (19). In the definition (23) of $J$, we may replace $u$ under the integrals with respect to the Lebesgue measure by $\gamma^{-\frac{1}{2}}D$. This shows that $J^{pm}_{t}(x,d,X)=J_{t}(\frac{d}{\sqrt{\gamma_{t}}}-\sqrt{\gamma_{t}}x,u)-\frac{d^{2}}{2\gamma_{t}}$. ∎

Proof of Lemma 2.5.

(i) We have that $\hat{u}$ is progressively measurable. Furthermore, the facts that $E[\int_{t}^{T}u_{s}^{2}\,ds]<\infty$, $E[\sup_{s\in[t,T]}\widetilde{H}_{s}^{2}]<\infty$, $E[\int_{0}^{T}\gamma_{s}\zeta_{s}^{2}\,ds]<\infty$, and (24) imply that $E[\int_{t}^{T}\hat{u}_{s}^{2}\,ds]<\infty$. Hence, $\hat{u}\in\mathcal{L}_{t}^{2}$. Substituting $u_{s}=\hat{u}_{s}+\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}(\widetilde{H}_{s}+\sqrt{\gamma_{s}}\zeta_{s})$, $s\in[t,T]$, in (22) leads to (25). For the cost functional, observe that

\[
\begin{split}
&\frac{1}{2}\Big(2\lambda_{s}(\widetilde{H}_{s}+\sqrt{\gamma_{s}}\zeta_{s})^{2}-4\lambda_{s}(\widetilde{H}_{s}+\sqrt{\gamma_{s}}\zeta_{s})u_{s}\Big)+(\kappa_{s}+\lambda_{s})u_{s}^{2}\\
&=\lambda_{s}(\widetilde{H}_{s}+\sqrt{\gamma_{s}}\zeta_{s})^{2}-(\lambda_{s}+\kappa_{s})\frac{\lambda_{s}^{2}}{(\lambda_{s}+\kappa_{s})^{2}}(\widetilde{H}_{s}+\sqrt{\gamma_{s}}\zeta_{s})^{2}\\
&\quad+(\lambda_{s}+\kappa_{s})\left(u_{s}-\frac{\lambda_{s}}{\lambda_{s}+\kappa_{s}}(\widetilde{H}_{s}+\sqrt{\gamma_{s}}\zeta_{s})\right)^{2}\\
&=\frac{\lambda_{s}\kappa_{s}}{\lambda_{s}+\kappa_{s}}\left(\widehat{H}_{s}+\sqrt{\gamma_{s}}\zeta_{s}\right)^{2}+(\lambda_{s}+\kappa_{s})\hat{u}_{s}^{2},\quad s\in[t,T].
\end{split}
\tag{67}
\]
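The completion of the square in (67) is elementary but easy to mistype; a quick numerical spot-check (purely illustrative, with `h` standing for the bracket $\widetilde{H}_{s}+\sqrt{\gamma_{s}}\zeta_{s}$ and `lam`, `kap` for $\lambda_{s},\kappa_{s}>0$) confirms the identity pointwise:

```python
import random

# Illustrative spot-check of the completion-of-square identity (67);
# h, u, lam, kap are hypothetical sample values, not the paper's processes.
random.seed(0)
for _ in range(1000):
    h = random.uniform(-5.0, 5.0)
    u = random.uniform(-5.0, 5.0)
    lam = random.uniform(0.1, 5.0)
    kap = random.uniform(0.1, 5.0)
    lhs = 0.5 * (2.0 * lam * h**2 - 4.0 * lam * h * u) + (kap + lam) * u**2
    u_hat = u - lam / (lam + kap) * h          # the substituted control
    rhs = lam * kap / (lam + kap) * h**2 + (lam + kap) * u_hat**2
    assert abs(lhs - rhs) < 1e-9
```

Expanding $(\lambda+\kappa)\hat{u}^{2}$ and cancelling the cross term is exactly the algebra hidden in the second equality of (67).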

(ii) Note that (25) is an SDE that is linear in $\widehat{H}$, $\hat{u}$, and $\sqrt{\gamma}\zeta$. Furthermore, boundedness of $\rho,\mu,\sigma,\eta,\overline{r}$ and (24) imply that the coefficients of the SDE are bounded. Since moreover $E[\int_{t}^{T}((\hat{u}_{s})^{2}+\gamma_{s}\zeta_{s}^{2})\,ds]<\infty$ and $\widehat{H}_{t}$ is square integrable, we know that $E[\sup_{s\in[t,T]}\widehat{H}_{s}^{2}]<\infty$ (see, e.g., [41, Theorems 3.2.2 and 3.3.1]). We can thus argue similarly to (i) that $u\in\mathcal{L}_{t}^{2}$. A substitution of $\hat{u}$ in (25) yields (22). A reverse version of the argument in (67) proves equality of the cost functionals. ∎


Acknowledgement: We thank Dirk Becherer, Tiziano De Angelis, Miryana Grigorova, Martin Herdegen, and Yuri Kabanov for inspiring discussions. We are grateful to the associate editor and two anonymous referees for constructive comments and suggestions that helped us improve the manuscript.

References

  • [1] J. Ackermann, T. Kruse, and M. Urusov. Càdlàg semimartingale strategies for optimal trade execution in stochastic order book models. Finance Stoch., 25(4):757–810, 2021.
  • [2] J. Ackermann, T. Kruse, and M. Urusov. Optimal trade execution in an order book model with stochastic liquidity parameters. SIAM Journal on Financial Mathematics, 12(2):788–822, 2021.
  • [3] J. Ackermann, T. Kruse, and M. Urusov. Self-exciting price impact via negative resilience in stochastic order books. Annals of Operations Research, DOI:10.1007/s10479-022-04973-00, 2022.
  • [4] A. Alfonsi and J. I. Acevedo. Optimal execution and price manipulations in time-varying limit order books. Applied Mathematical Finance, 21(3):201–237, 2014.
  • [5] A. Alfonsi, A. Fruth, and A. Schied. Constrained portfolio liquidation in a limit order book model. Banach Center Publ, 83:9–25, 2008.
  • [6] A. Alfonsi, A. Fruth, and A. Schied. Optimal execution strategies in limit order books with general shape functions. Quantitative Finance, 10(2):143–157, 2010.
  • [7] A. Alfonsi and A. Schied. Optimal trade execution and absence of price manipulations in limit order book models. SIAM Journal on Financial Mathematics, 1(1):490–522, 2010.
  • [8] A. Alfonsi, A. Schied, and A. Slynko. Order book resilience, price manipulation, and the positive portfolio problem. SIAM Journal on Financial Mathematics, 3(1):511–533, 2012.
  • [9] S. Ankirchner, M. Jeanblanc, and T. Kruse. BSDEs with singular terminal condition and a control problem with constraints. SIAM Journal on Control and Optimization, 52(2):893–913, 2014.
  • [10] S. Ankirchner and T. Kruse. Optimal position targeting with stochastic linear-quadratic costs. Advances in mathematics of finance, 104:9–24, 2015.
  • [11] P. Bank and Y. Dolinsky. Continuous-time duality for superreplication with transient price impact. Ann. Appl. Probab., 29(6):3893–3917, 2019.
  • [12] P. Bank and Y. Dolinsky. Scaling limits for super-replication with transient price impact. Bernoulli, 26(3):2176–2201, 2020.
  • [13] P. Bank and A. Fruth. Optimal order scheduling for deterministic liquidity patterns. SIAM Journal on Financial Mathematics, 5(1):137–152, 2014.
  • [14] P. Bank, H. M. Soner, and M. Voß. Hedging with temporary price impact. Math. Financ. Econ., 11(2):215–239, 2017.
  • [15] P. Bank and M. Voß. Linear quadratic stochastic control problems with stochastic terminal constraint. SIAM Journal on Control and Optimization, 56(2):672–699, 2018.
  • [16] P. Bank and M. Voß. Optimal investment with transient price impact. SIAM J. Financial Math., 10(3):723–768, 2019.
  • [17] D. Becherer, T. Bilarev, and P. Frentrup. Optimal asset liquidation with multiplicative transient price impact. Applied Mathematics & Optimization, 78(3):643–676, 2018.
  • [18] D. Becherer, T. Bilarev, and P. Frentrup. Optimal liquidation under stochastic liquidity. Finance and Stochastics, 22(1):39–68, 2018.
  • [19] D. Becherer, T. Bilarev, and P. Frentrup. Stability for gains from large investors’ strategies in M1M_{1}/J1J_{1} topologies. Bernoulli, 25(2):1105–1140, 2019.
  • [20] J.-M. Bismut. Linear quadratic optimal stochastic control with random coefficients. SIAM Journal on Control and Optimization, 14(3):419–444, 1976.
  • [21] J.-M. Bismut. Contrôle des systemes linéaires quadratiques: applications de l’intégrale stochastique. In Séminaire de Probabilités XII, pages 180–264. Springer, 1978.
  • [22] R. Carmona and K. Webster. The self-financing equation in limit order book markets. Finance Stoch., 23(3):729–759, 2019.
  • [23] Y. Dolinsky, B. Gottesman, and O. Gurel-Gurevich. A note on costs minimization with stochastic target constraints. Electronic Communications in Probability, 25, 2020.
  • [24] A. Fruth, T. Schöneborn, and M. Urusov. Optimal trade execution and price manipulation in order books with time-varying liquidity. Math. Finance, 24(4):651–695, 2014.
  • [25] A. Fruth, T. Schöneborn, and M. Urusov. Optimal trade execution in order books with stochastic liquidity. Math. Finance, 29(2):507–541, 2019.
  • [26] N. Gârleanu and L. H. Pedersen. Dynamic portfolio choice with frictions. Journal of Economic Theory, 165:487–516, 2016.
  • [27] J. Gatheral, A. Schied, and A. Slynko. Transient linear price impact and Fredholm integral equations. Mathematical Finance: An International Journal of Mathematics, Statistics and Financial Economics, 22(3):445–474, 2012.
  • [28] P. Graewe and U. Horst. Optimal trade execution with instantaneous price impact and stochastic resilience. SIAM Journal on Control and Optimization, 55(6):3707–3725, 2017.
  • [29] U. Horst and E. Kivman. Optimal trade execution under small market impact and portfolio liquidation with semimartingale strategies. Preprint, arXiv:2103.05957, 2021.
  • [30] U. Horst and F. Naujokat. When to cross the spread? Trading in two-sided limit order books. SIAM J. Financial Math., 5(1):278–315, 2014.
  • [31] U. Horst and X. Xia. Multi-dimensional optimal trade execution under stochastic resilience. Finance and Stochastics, 23(4):889–923, 2019.
  • [32] P. Imkeller and G. Dos Reis. Path regularity and explicit convergence rate for BSDE with truncated quadratic growth. Stochastic Processes and their Applications, 120(3):348–379, 2010.
  • [33] I. Karatzas and S. E. Shreve. Brownian motion and stochastic calculus. Graduate texts in mathematics; 113. Springer, New York, 2nd edition, 1991.
  • [34] M. Kohlmann and S. Tang. Global adapted solution of one-dimensional backward stochastic Riccati equations, with application to the mean–variance hedging. Stochastic Process. Appl., 97(2):255–288, 2002.
  • [35] C. Lorenz and A. Schied. Drift dependence of optimal trade execution strategies under transient price impact. Finance and Stochastics, 17(4):743–770, 2013.
  • [36] F. Naujokat and N. Westray. Curve following in illiquid markets. Math. Financ. Econ., 4(4):299–335, 2011.
  • [37] A. A. Obizhaeva and J. Wang. Optimal trading strategy and supply/demand dynamics. Journal of Financial Markets, 16:1–32, 2013.
  • [38] S. Predoiu, G. Shaikhet, and S. Shreve. Optimal execution in a general one-sided limit-order book. SIAM Journal on Financial Mathematics, 2(1):183–212, 2011.
  • [39] C. C. Siu, I. Guo, S.-P. Zhu, and R. J. Elliott. Optimal execution with regime-switching market resilience. J. Econom. Dynam. Control, 101:17–40, 2019.
  • [40] J. Sun, J. Xiong, and J. Yong. Indefinite stochastic linear-quadratic optimal control problems with random coefficients: Closed-loop representation of open-loop optimal controls. Ann. Appl. Probab., 31(1):460 – 499, 2021.
  • [41] J. Zhang. Backward Stochastic Differential Equations. Probability Theory and Stochastic Modelling; 86. Springer, New York, 1st edition, 2017.