Optimal execution of equity with geometric price process
Gerardo Hernandez-del-Valle
Statistics Department, Columbia University
1255 Amsterdam Ave. Room 1005, New York, N.Y., 10027.
gerardo@stat.columbia.edu and Carlos G. Pacheco-Gonzalez
Mathematics Department, CINVESTAV-IPN
Av. Instituto Polit cnico Nacional #2508, Col. San Pedro Zacatenco, México, D.F., C.P. 07360
cpacheco@math.cinvestav.mx
(Date: October 14, 2009)
Abstract.
In this paper we derive the Markowitz-optimal, deterministic-execution trajectory for a trader who wishes to buy or sell a large position of a share which evolves as a geometric Brownian motion in contrast to the arithmetic model which prevails in the existing literature. Our calculations include a general temporary impact, rather than a specific function. Additionally, we point out—under our setting—what are the necessary ingredients to tackle the problem with adaptive execution trajectories. We provide a couple of examples which illustrate the results. We would like to stress the fact that in this paper we use understandable user-friendly techniques.
2000 Mathematics Subject Classification:
Primary: 91B28, 49K15
The research of this author was partially supported by Algorithmic Trading Management LLC
1. Introduction
The problem of optimal execution is a very general problem in which a trader who wishes to buy or sell a large position of a given asset —for instance wheat, shares, derivatives, etc.—is confronted with the dilemma of executing
slowly or as quick as possible. In the first case he/she would be exposed to volatility, and in the second to the laws of offer and demand.
Thus the trader most hedge between the market impact (due to his trade) and the volatility (due to the market).
The key ingredients to study this optimization problem are: (1) The modeling of the asset—which is typically modeled as a geometric Brownian process. (2) The modeling of the so-called market impact which heuristically suggests the existence of an instantaneous impact—so-called temporary—and a cumulative component referred to as permanent. And finally (3) one should establish a criteria of optimality.
The main aim of this paper is to study and characterize the so-called Markowitz-optimal open-loop execution trajectory, in terms of nonlinear second order, ordinary differential equations (Theorems 3.2 and 3.6). The above is done for a trader who wishes to buy or sell a large position of shares which evolve as a geometric Brownian motion (although in the existing literature it is considered an arithmetic Brownian motion for this problem). In this paper we only deal with deterministic strategies, also called open loop controls; however, we point out—in Section 5—the key ingredients to address the problem with adaptive strategies, also termed Markovian controls (work in progress).
The main motivation of this work is on one hand economic, but on the other the effect of market impact in the valuation of contingent claims, and its connection with optimal execution of derivatives. Intuitively, for this kind of problem, one would expect to consider adaptive strategies to tackle the questions, although it seems natural to first understand the deterministic case.
An important element in our analysis, will be the use of a linear stochastic differential equation, first used—to our knowledge—by Brennan and Schwartz (1980) in their study of interest rates.
The problem of minimizing expected overall liquidity costs has been analyzed using different market models by Bertsimas and Lo (1998), Obizhaeva and Wang, and Alfonsi et al. (2007a,2007b), just to mention a few. However, these approaches miss the volatility risk associated with time delay. Instead, Almgren and Chriss (1999,2000), suggested studying and solving a mean-variance optimization for sales revenues in the class of deterministic strategies. Further, on Almgren and Lorenz (2007) allowed for intertemporal updating and proved that this can strictly improve the mean-variance performance. Nevertheless, in Schied and Schöneborn (2007), the authors study the original problem of expected utility maximization with CARA utility functions. Their main result states that for CARA investors there is surprisingly no added utility from allowing for intertemporal updating of strategies. Finally, we mention that the Hamilton-Jacobi-Bellman approach has also recently been studied in Forsyth (2009).
Our paper is organized as follows: In Section 2 we introduce our model, assumptions and auxiliary results. Namely through a couple of subsequent Propositions we characterize and compute—by use of a Brennan-Schwartz type process (3)—the moments of certain random variable that is relevant in our optimization problem. Section 3 is devoted to deriving and proving the characterization of our optimal trading strategies and the optimal value function as well. After that, in Section 4, we present a couple of examples in order to exemplify the procedure derived in Section 3. We first compare Almgren and Chriss trajectory with ours, and in Example 2 we use a temporary market impact to the power as suggested in the empirical study of Almgren et. al.’s (2005). We conclude the paper, in Section 5, by pointing out the key ingredients in the study of adaptive execution strategies.
2. Auxiliary Results
In this section, we describe the dynamics of the asset , the so-called market impact, and we introduce the Brennan-Schwartz process. This process will allow us not only to compute the moments of the optimization argument, but also to represent it in terms of an SDE. For the remainder of this section, let be a fixed and differentiable function for .
2.1. The model.
Let the price of the share of a given company evolve as a geometric process, where the random component is standard Brownian motion, i.e.
and where and represent respectively the permanent—which accumulates over time—and instantaneous temporary impact. Thus, the future effective price per share due to our trade can be modelled as
(1)
where is an estimable parameter.
Remark 2.1.
(a) Note that that process , where is as in (1), coincides precisely with the “standard” notion of both permanent and temporary impact. That is, if we model the price process as arithmetic Brownian motion:
then
Hence, the first term in the right-hand side of the equality is accumulating over time, on the other hand, the second term is not.
(b) Next, observe that the process can be thought of as a control, which in turn may be:
(1)
an admissible process which is adapted to the natural filtration of the associated process (1) is called a feedback control,
(2)
an admissible process which can be written in the form for some measurable map is called Markovian control, notice that any Markovian control is a feedback control,
(3)
a deterministic processes of the family of admissible controls are called open loop controls.
(c) In this paper we will only deal with open loop controls, yet to study feedback controls it will become quite useful to introduce the so-called Brennan-Schwartz process, introduced in the next subsection. The reason being, that to derive the Hamilton-Jacobi-Bellman equation—see Section 5—it is more convenient to have a diffusion instead of an average of a diffusion.
2.2. Averaged geometric and Brennan-Schwartz processes
In order to study our problem we will introduce the following averaged geometric Brownian process:
Thus, if represents the number of shares bought or sold at time at price , then represents the total amount spent or earned by the trader up to time .
To compute the moments of we will use the following linear non-homogeneous stochastic differential equation
(3)
which has been used for instance by Brennan and Schwartz (1980) in the modeling of interest rates, by Kawaguchi and Morimoto (2007) in environmental economics, and which may also be used to study the density of averaged geometric Brownian motion [see for instance, Linetsky (2004)].
In general, its usefulness is due to the fact that one may construct a Brennan-Schwartz process which satisfies
where stands for equality in distribution.
In this paper, it is used, together with Itô’s lemma to show that , which alternatively will allow us to compute the second moment of in terms of an iterated integral, indeed:
Proposition 2.2.
Let processes , , and be as in (1), (2), and (3), respectively, then
and
Proof.
By Itô’s lemma
Furthermore, for the second moment of it follows that
as claimed.
∎
Remark 2.3.
The previous proposition may be derived directly from the integration by parts formula. Yet, this characterization will be useful in the study, for instance, of the optimal trading schedule of derivatives or in the determination of Markovian controls.
2.3. Moments of
Now, by Proposition 2.2, it is straightforward to compute the first two moments of which will be used to solve our optimal execution problem.
In this section we derive a Markowitz-optimal open-loop trading strategy, Theorem 3.2 and 3.6, employing the auxiliary results derived in the previous section.
3.1. Execution shortfall
If the size of the trade is “relatively” small we would expect the market impact to be negligible, that is, the trader should execute immediately. Thus, it seems natural to compare the actual total gains (losses) with the impact-free quantity by introducing the so-called
execution shortfall defined as
(6)
If we use Markowitz optimization criterion, then our problem is equivalent to finding the trading trajectory which minimizes
simultaneously the expected shortfall given a fixed risk-aversion level characterized by the volatility of :
(7)
In fact, if then (7) has a unique solution, which may be represented in the following integral form:
Proposition 3.1.
Suppose that the permanent impact is linear, i.e.
for some as suggested by Almgren et. al. (2005) empirical study. Let
Suppose that . Then, the open-loop trading schedule is determined by a function which solves the following system
(10)
with and .
Proof.
If , that is, if we only wish to minimize expected execution shortfall, then:
and thus equation (10) follows from the Euler-Lagrange equation [see for instance Gelfand and Fomin (2000)].
∎
Example 3.3.
Let and both temporary and permanent impact be linear, i.e. , , hence:
Thus, if one wishes to minimize the expected execution shortfall one should execute according to:
given that , and .
Remark 3.4.
Note that as increases the client is willing to be more exposed to risk. This idea is equivalent to saying that he/she is willing to increase the speed of execution. Hence we have that for , the optimal trajectory will dominate :
In other words we may decompose . This last observation together with our constraint , , lead to the following two facts:
(11)
(12)
Remark 3.5.
The previous remark and equation (9) suggest a 2 step procedure to find the optimal trajectory . Namely, first find and given that information solve for
Theorem 3.6.
The optimal differentiable trajectory which solves (9) is given by , where is given in Theorem 3.3 and satisfies for :
where .
Proof.
The idea is to follow the derivation of the Euler-Lagrange equation [see for instance, Gelfand and Fomin (2000)], but in this case, the unknown function will play the role of the perturbation. Thus, it is essential the fact that .
Let
where
and
then
if we set
and by the integration by parts formula we have that
Let us compute
which yields
But observe that when we have
Now, given that and from the fundamental lemma of Calculus of variations we have that:
with the constraint that for and .
or
as claimed.
∎
Example 3.7.
(cont.) With linear and temporary impact as before, that is and . You may find the optimal trading trajectory for arbitrary by first determining , using Theorem 3.2, next you determine by use of Theorem 3.6. That is, letting:
Theorem 3.6 states that satisfies the following identity
where
4. Examples
4.1. Example
The first example we want to study is the case in which both the temporary and the permanent impact are linear as in Examples 3.3 and 3.7. The motivation is to compare our results with those obtained by Almgren and Chriss (2000). In this example we have set and . The solutions have been numerically calculated using Theorems 3.2 and 3.6 and then plotted in Figures 1 and 2.
It may be observed that, as one would expect, the solutions are not the same and in fact—under the present conditions—our strategy dominates Almgren and Chriss.
4.2. Example
The next example we want to study is the case in which the permanent impact is some power less than 1. Namely , and the permanent is linear. First from Theorem 3.2 we have that
and is the solution to:
and . In particular, with and as in the previous example we have plotted our result in Figure 3. Note that the sublinear impact has increased the speed of execution. Next, we computed the Markowitz-optimal open-loop trajectory, with by first computing as described in Theorem 3.6.
Remark 4.1.
The previous exercise suggests that if one chooses the temporary impact to be sub-linear, the solution—with all the other parameters fixed—will always dominate its linear counterpart. On the other hand, if the temporary impact is super-linear, the solution will be dominated by its linear counterpart.
A natural question is: What is the correct form of given our model?
5. Remarks on Markovian controls
As pointed out in the introduction, we have only dealt with differentiable deterministic controls—also known as open loop controls. Furthermore, our criteria of optimal is in the Markowitz sense. A couple of natural and reasonable question arise: how can we study feedback controls? How can we optimize with respect to general utility functions? Namely, given some utility function we want to find a trajectory such that
where is the set of admissible controls and is the execution shortfall. But is in general a very difficult creature to characterize, unless you observe that you may construct a diffusion with the following dynamics:
that is equal in distribution to , i.e.
Thus
and now you may proceed to derive the Hamilton-Jacobi-Bellman equation. It is precisely this question, which the authors are investigating presently.
References
[1] A. Alfonsi, A. Schied and A. Schulz (2007a) Optimal execution strategies in limit order books with general shape functions, Preprint, QP Lab and TU Berlin.
[2] A. Alfonsi, A. Schied and A. Schulz (2007b) Constrained portfolio liquidation in a limit order book model Preprint, QP Lab and TU Berlin.
[3] R. Almgren and N. Chriss (1999) Value under liquidation, Risk.
[4] R. Almgren and N. Chriss (2000) Optimal Execution of Portfolio Transactions, J. Risk. 3 (2).
[5] R. Almgren, C. Thum , E. Hauptmann, and H. Li (2005) Direct Estimation of Equity Market Impact
[6] D. Bertsimas and D. Lo (1998) Optimal control of execution costs, Journal of Financial Markets. 1.
[7] M.J. Brennan, and E. Schwartz (1979) A Continuous Time Approach to the Pricing of Bonds, J. of Banking and Finance. 3.
[8] P.A. Forsyth (2009) Hamilton Jacobi Bellman Approach to Optimal Trade Schedule, Preprint.
[9] I.M. Gelfand, and S.V. Fomin (2000) Calculus of Variations New York: Dover Publications.
[10] K. Kawaguchi, and H. Morimoto (2007) Long-run average welfare in a pollution accumulation model, J. of Economic Dynamics and Control, 31.
[11] V. Linetsky (2004) Spectral Expansions for Asian (Average Price) Options, Operations Research, 52, pp.856–867.
[12] A. Obizhaeva, and J. Wang (2005) Optimal Trading Strategy and Supply/Demand Dynamics. J. Financial Markets
[13] A. Schied, and T. Schöneborn (2007) Optimal portfolio liquidation for CARA investors, Preprint, QP Lab and TU Berlin
Figure 1. The graph is plotted with Mathematica. For the blue line (upper line) represents our optimal trading trajectory, the red line is Almgren and Chriss.Figure 2. The graph is plotted with Mathematica. The upper red line is with , the middle blue line is with and the lower yellow line is the case of arithmetic Brownian motion.Figure 3. The graph is plotted with Mathematica. The upper red line represents the trading trajectory with sub-linear temporary impact. The lower blue line represents the trajectory with linear temporary impact. Figure 4. The graph is plotted with Mathematica. The upper blue line represents the trading trajectory with sub-linear temporary impact and . The lower red line represents the trajectory with sub-linear temporary impact and .