This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

thu]Department of Automation, Tsinghua University, Beijing 100084, P. R. China

Optimal Investment with Herd Behaviour Using Rational Decision Decomposition

Huisheng Wang\arefthu and H. Vicky Zhao\arefthu [ whs22@mail.tsinghua.edu.cn, vzhao@tsinghua.edu.cn
Abstract

In this paper, we study the optimal investment problem considering the herd behaviour between two agents, including one leading expert and one following agent whose decisions are influenced by those of the leading expert. In the objective functional of the optimal investment problem, we introduce the average deviation term to measure the distance between the two agents’ decisions and use the variational method to find its analytical solution. To theoretically analyze the impact of the following agent’s herd behaviour on his/her decision, we decompose his/her optimal decision into a convex linear combination of the two agents’ rational decisions, which we call the rational decision decomposition. Furthermore, we define the weight function in the rational decision decomposition as the following agent’s investment opinion to measure the preference of his/her own rational decision over that of the leading expert. We use the investment opinion to quantitatively analyze the impact of the herd behaviour, the following agent’s initial wealth, the excess return, and the volatility of the risky asset on the optimal decision. We validate our analyses through numerical experiments on real stock data. This study is crucial to understanding investors’ herd behaviour in decision-making and designing effective mechanisms to guide their decisions.

keywords:
Herd behaviour, Investment opinion, Optimal investment, Rational decision decomposition, Variational method

1 Introduction

There exists a category of leading experts in the financial markets, such as financial analysts, fund managers, or other professionals, who share their decisions through channels like social media and websites [1]. As these leading experts’ decisions are often considered valuable investment guides, investors, referred to as agents below, tend to mimic their decisions [2]. Additionally, agents may have an intrinsic preference for conformity, and lean towards the decisions of the other agents to build consensus and trust [3]. This phenomenon is commonly known as herd behaviour in behavioural finance [4, 5, 6]. Prior studies have qualitatively verified the existence of herd behaviour in agents’ decisions in financial markets [7, 8, 9, 10, 11]. However, to quantitatively analyze the influence of herd behaviour on agents’ decisions, it is necessary to employ mathematical models based on the optimal investment theory.

Optimal investment theory provides a theoretical framework to study how agents dynamically adjust their investment decisions throughout the investment period to maximize the return and minimize the volatility using the stochastic optimal control theory [12, 13, 14, 15, 16, 17, 18]. The Merton problem is one of the classical optimal investment problems [19], where the agent allocates a certain amount of money to a risky asset at each moment to maximize the expected utility of the terminal wealth, i.e., the wealth at the end of the investment period. There have been many quantitative studies analyzing the impact of interactions among agents on the optimal investment. The previous studies in [20, 21, 22, 23, 24, 25, 26, 27] examined the impact of relative wealth and consumption among agents on their decisions, where the objective was a functional of each agent’s excess wealth or consumption relative to other agents during the investment period. However, to the best of our knowledge, few studies have investigated how herd behaviour affects agents’ decisions on the allocation of risky assets in the Merton problem.

In this paper, we study an optimal investment problem involving two agents, one leading expert, and one following agent whose decisions are influenced by those of the leading expert, and we quantitatively analyze the impact of herd behaviour on the following agent’s decision. The prior work in [28] used the traditional Euclidean distance to quantitatively measure the disparity between agents’ decisions. In this paper, we incorporate an exponential decay term into the traditional Euclidean distance to reflect the fact that agents often give less weight to future rewards when making decisions [29], and we call the modified Euclidean distance the average deviation. We introduce the average deviation into the objective functional of the traditional Merton problem and employ the variational method to analytically solve the stochastic optimal control problem and find the following agent’s optimal decision.

Note that with the above analytical solution of the following agent’s optimal decision, it is theoretically difficult to directly examine the influence of herd behaviour due to its complicated expression. To address this problem, we introduce a method called rational decision decomposition to transform the following agent’s optimal decision into a convex linear combination of the two agents’ rational decisions, i.e., the optimal decisions in the traditional Merton problem, where the agents make independent decisions without the influence of other agents. Furthermore, we define the weight function as the following agent’s investment opinion, which indicates the extent to which the following agent’s decision aligns with his/her own rational decision. We use the following agent’s investment opinion to quantitatively analyze the impact of the herd behaviour, the following agent’s initial wealth, the excess return, and the volatility of the risky asset on the optimal decision.

The rest of this paper is organized as follows. We introduce the average deviation to quantify the disparity between the two agents’ decisions, and present the dual-agent optimal investment problem considering herd behaviour in Section 2. We derive the analytical solution of the following agent’s optimal decision using the variational method in Section 3, and quantitatively analyze the impact of herd behaviour on the following agent’s optimal decisions using the rational decision decomposition in Section 4. In Section 5, we conduct numerical experiments on a real stock dataset to validate our analyses. Section 6 is the conclusion.

2 Problem Formulation

2.1 The Traditional Merton Problem

Following the prior work in [19], we consider the problem of an agent investing in the period 𝒯:=[0,T]\mathcal{T}:=[0,T] in a financial market consisting of a risk-free asset and a risky asset. Let (Ω,,{(t)}t𝒯,)(\Omega,\mathscr{F},\{\mathscr{F}(t)\}_{t\in\mathcal{T}},\mathbb{P}) be a complete filtered probability space on which a standard Brownian motion {W(t)}t𝒯\{W(t)\}_{t\in\mathcal{T}} is defined. The price process of the risk-free asset {B(t)}t𝒯\{B(t)\}_{t\in\mathcal{T}} and the risky asset {S(t)}t𝒯\{S(t)\}_{t\in\mathcal{T}} are given by

dB(t)\displaystyle\mathrm{d}B(t) =rB(t)dtand\displaystyle=rB(t)\mathrm{d}t\quad\mbox{and} (1)
dS(t)\displaystyle\mathrm{d}S(t) =μS(t)dt+σS(t)dW(t),\displaystyle=\mu S(t)\mathrm{d}t+\sigma S(t)\mathrm{d}W(t), (2)

respectively, where rr is the interest rate of the risk-free asset, and μ\mu and σ\sigma are the appreciation rate and the volatility of the risky asset. We define v=μr>0v=\mu-r>0 as the excess return rate of the risky asset. Let xx denote the agent’s initial wealth, and let P(t)P(t) denote the amount of the risky asset held by the agent at time tt. According to [19], the agent’s wealth process {X(t)}t𝒯\{X(t)\}_{t\in\mathcal{T}} can be expressed as

dX(t)=[rX(t)+vP(t)]dt+σP(t)dW(t),\mathrm{d}X(t)=[rX(t)+vP(t)]\mathrm{d}t+\sigma P(t)\mathrm{d}W(t), (3)

subject to X(0)=xX(0)=x. In the traditional Merton problem, the agent determines {P(t)}t𝒯\{P(t)\}_{t\in\mathcal{T}} to maximize the expected utility of the terminal wealth 𝔼ϕ(X(T))\mathbb{E}\phi(X(T)). The utility function ϕ(X(T))\phi(X(T)) satisfies the characteristics of diminishing marginal returns and concavity [30]. In this paper, we consider the Constant Absolute Risk Aversion form [30]:

ϕ(X(T))=1αeαX(T),\phi(X(T))=-\frac{1}{\alpha}\mathrm{e}^{-\alpha X(T)}, (4)

where α>0\alpha>0 is referred to as the risk aversion coefficient. As the risk aversion coefficient α\alpha increases, the utility function becomes more sensitive to changes in the terminal wealth.

In summary, the traditional Merton problem becomes

Problem 1. sup{P(t)}t𝒯𝒰𝔼ϕ(X(T))\displaystyle\sup_{\{P(t)\}_{t\in\mathcal{T}}\in\mathcal{U}}\mathbb{E}\phi(X(T))
s.t. dX(t)=[rX(t)+vP(t)]dt+σP(t)dW(t),\displaystyle\mathrm{d}X(t)=[rX(t)+vP(t)]\mathrm{d}t+\sigma P(t)\mathrm{d}W(t),
X(0)=x,\displaystyle X(0)=x,

which is a stochastic optimal control problem. In Problem 1, 𝒰\mathcal{U} is the set of admissible decisions, which is a subset of 1:={{u(t)}t𝒯|𝔼0T|u(t)|dt<+}\mathcal{L}^{1}:=\left\{\{u(t)\}_{t\in\mathcal{T}}\middle|\mathbb{E}\int_{0}^{T}|u(t)|\mathrm{d}t<+\infty\right\}.

According to [19], the agent’s optimal decision {P¯(t)}t𝒯\{\bar{P}(t)\}_{t\in\mathcal{T}} in Problem 1 is given by

P¯(t)=vασ2er(tT),t𝒯.\bar{P}(t)=\frac{v}{\alpha\sigma^{2}}\mathrm{e}^{r(t-T)},t\in\mathcal{T}. (5)

We call (5) the agent’s rational decision. According to (5), the rational amount of the risky asset is proportional to the excess return rate vv and inversely proportional to the volatility σ\sigma and the risk aversion coefficient α\alpha.

2.2 Optimal Investment with Herd Behaviour

Given the traditional Merton problem, we study the optimal investment problem with consideration of herd behaviour. The problem involves two agents, A1 and A2, where A1 is the following agent and A2 is the leading expert. We denote α1\alpha_{1} and α2\alpha_{2} as their risk aversion coefficients, respectively. We assume that A2’s decision follows the form of the rational decision {P¯2(t)}t𝒯\{\bar{P}_{2}(t)\}_{t\in\mathcal{T}} as in (5), and A1’s decision {P1(t)}t𝒯\{P_{1}(t)\}_{t\in\mathcal{T}} is unilaterally influenced by A2’s decision. Considering the herd behaviour, A1 aims to maximize his/her expected utility of the terminal wealth while minimizing the distance between his/her decision and A2’s decision. The prior work in [28] used the Euclidean distance to measure the distance between two decisions, i.e., 120T[P1(t)P¯2(t)]2dt\frac{1}{2}\int_{0}^{T}[P_{1}(t)-\bar{P}_{2}(t)]^{2}\mathrm{d}t in continuous-time format. In this paper, we introduce an exponential decay term eρr(Tt)\mathrm{e}^{\rho r(T-t)} to reflect the fact that agents often give less weight to future rewards when making decisions, and we call the modified Euclidean distance the average deviation.

Definition 1.

The average deviation between A1 and A2’s decisions {P1(t)}t𝒯\{P_{1}(t)\}_{t\in\mathcal{T}} and {P¯2(t)}t𝒯\{\bar{P}_{2}(t)\}_{t\in\mathcal{T}} is defined as:

D[P1P¯2]=120Teρr(Tt)[P1(t)P¯2(t)]2dt,D[P_{1}\|\bar{P}_{2}]=\frac{1}{2}\int_{0}^{T}\mathrm{e}^{\rho r(T-t)}[P_{1}(t)-\bar{P}_{2}(t)]^{2}\mathrm{d}t, (6)

where ρ0\rho\geqslant 0 is the decay rate.

A higher decay rate implies that deviations occurring later carry less weight. When ρ=0\rho=0, deviations for all times are equally weighted, and (6) becomes the same as the traditional Euclidean distance in [28]. Let {X1(t)}t𝒯\{X_{1}(t)\}_{t\in\mathcal{T}} denote A1’s wealth process.

With Definition 1, we define the following optimal investment problem for A1’s decision:

Problem 2. sup{P1(t)}t𝒯𝒰𝔼ϕ(X1(T))θD[P1P¯2]\displaystyle\sup_{\{P_{1}(t)\}_{t\in\mathcal{T}}\in\mathcal{U}}\mathbb{E}\phi(X_{1}(T))-\theta D[P_{1}\|\bar{P}_{2}]
s.t. dX1(t)=[rX1(t)+vP1(t)]dt+σP1(t)dW(t),\displaystyle\mathrm{d}X_{1}(t)=[rX_{1}(t)+vP_{1}(t)]\mathrm{d}t+\sigma P_{1}(t)\mathrm{d}W(t),
X1(0)=x1,\displaystyle X_{1}(0)=x_{1},

where the herd coefficient θ>0\theta>0 is to address the tradeoff between the two different objectives, i.e., maximizing the expected utility of the terminal wealth 𝔼ϕ(X1(T))\mathbb{E}\phi(X_{1}(T)) and minimizing the average deviation D[P1P¯2]D[P_{1}\|\bar{P}_{2}]. When θ=0\theta=0, A1’s optimal decision is entirely independent of A2’s decision, and Problem 2 degenerates into the traditional Merton Problem, i.e., limθ0P1(t)=P¯1(t)\lim_{\theta\to 0}P_{1}^{*}(t)=\bar{P}_{1}(t) for all t𝒯t\in\mathcal{T}. As θ\theta approaches infinity, A1’s optimal decision is equal to A2’s decision, i.e., limθ+P1(t)=P¯2(t)\lim_{\theta\to+\infty}P_{1}^{*}(t)=\bar{P}_{2}(t) for all t𝒯t\in\mathcal{T}.

3 Solution to the Optimal Investment Problem

Next, we use the variational method to solve Problem 2. For the convenience of expression, we define ϑ:=θα1σ2\vartheta:=\frac{\theta}{\alpha_{1}\sigma^{2}} and ϱ:=2ρ\varrho:=2-\rho as the modified herd coefficient and decay rate, respectively.

Theorem 1.

The optimal decision {P1(t)}t𝒯\{P_{1}^{*}(t)\}_{t\in\mathcal{T}} in Problem 2 is given by

P1(t)=ηα2σ2eϱr(Tt)+θηα1σ2eϱr(Tt)+θvα2σ2er(tT),t𝒯,P_{1}^{*}(t)=\frac{\eta\alpha_{2}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+\theta}{\eta\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+\theta}\cdot\frac{v}{\alpha_{2}\sigma^{2}}\mathrm{e}^{r(t-T)},t\in\mathcal{T}, (7)

where η\eta is the integral constant calculated by Algorithm 1.

Input: Interest rate: rr; Excess return rate: vv; Volatility: σ\sigma; Initial wealth x1x_{1}; Risk aversion coefficients: α1,α2\alpha_{1},\alpha_{2}; Investment period: TT; Modified herd coefficient: ϑ\vartheta; Modified decay rate: ϱ\varrho; Tolerance: ε\varepsilon.
Output: Integral constant: η\eta.
1 η0=exp{α1x1erTv2T2σ2}\eta_{0}=\exp\left\{-\alpha_{1}x_{1}\mathrm{e}^{rT}-\frac{v^{2}T}{2\sigma^{2}}\right\}, Δη0=+\Delta\eta_{0}=+\infty, k=0k=0;
2while Δηkε\Delta\eta_{k}\geqslant\varepsilon do
3      ηk+1=η0exp{0Tϑ2v2(α1/α21)2dt2σ2[ηkeϱr(Tt)+ϑ]2}\eta_{k+1}=\eta_{0}\exp\left\{\int_{0}^{T}\frac{\vartheta^{2}v^{2}\left(\alpha_{1}/\alpha_{2}-1\right)^{2}\mathrm{d}t}{2\sigma^{2}\left[\eta_{k}\mathrm{e}^{\varrho r(T-t)}+\vartheta\right]^{2}}\right\};
4      Δηk+1=|ηk+1ηk|\Delta\eta_{k+1}=\left|\eta_{k+1}-\eta_{k}\right|;
5      kk+1k\leftarrow k+1;
6 end while
ηηk\eta\approx\eta_{k} (|ηηk|<ε\left|\eta-\eta_{k}\right|<\varepsilon).
Algorithm 1 Iterative Method of the Integral Constant η\eta.
Proof.

First, by solving (3), A1’s terminal wealth is

X1(T)=x1erT+0Ter(Tt)P1(t)[vdt+σdW(t)].X_{1}(T)=x_{1}\mathrm{e}^{rT}+\int_{0}^{T}\mathrm{e}^{r(T-t)}P_{1}(t)[v\mathrm{d}t+\sigma\mathrm{d}W(t)]. (8)

Therefore, the terminal wealth X1(T)X_{1}(T) is a normal random variable whose expectation and variance are

𝔼X1(T)\displaystyle\mathbb{E}X_{1}(T) =x1erT+v0Ter(Tt)P1(t)dtand\displaystyle=x_{1}\mathrm{e}^{rT}+v\int_{0}^{T}\mathrm{e}^{r(T-t)}P_{1}(t)\mathrm{d}t\quad\mbox{and} (9)
𝔻X1(T)\displaystyle\mathbb{D}X_{1}(T) =σ20Te2r(Tt)P12(t)dt,\displaystyle=\sigma^{2}\int_{0}^{T}\mathrm{e}^{2r(T-t)}P_{1}^{2}(t)\mathrm{d}t, (10)

respectively. Because the set of admissible decisions 𝒰\mathcal{U} is a subset of 1\mathcal{L}^{1}, we have 𝔼X1(t)<\mathbb{E}X_{1}(t)<\infty and 𝔻X1(t)<\mathbb{D}X_{1}(t)<\infty. Because ϕ(X1(T))\phi(X_{1}(T)) is an exponential function of X1(T)X_{1}(T), it is a lognormal random variable whose expectation is

𝔼ϕ(X1(T))=1α1exp{α1𝔼X1(T)+α122𝔻X1(T)}.\mathbb{E}\phi(X_{1}(T))=-\frac{1}{\alpha_{1}}\exp\left\{-\alpha_{1}\mathbb{E}X_{1}(T)+\frac{\alpha_{1}^{2}}{2}\mathbb{D}X_{1}(T)\right\}. (11)

Therefore, the objective functional of Problem 2 can be expressed as a functional of {P1(t)}t𝒯\{P_{1}(t)\}_{t\in\mathcal{T}}, i.e.,

J[P1]\displaystyle J[P_{1}] :=1α1exp{α1x1erTα1v0Ter(Tt)P1(t)dt\displaystyle:=-\frac{1}{\alpha_{1}}\exp\left\{-\alpha_{1}x_{1}\mathrm{e}^{rT}-\alpha_{1}v\int_{0}^{T}\mathrm{e}^{r(T-t)}P_{1}(t)\mathrm{d}t\right.
+α12σ220Te2r(Tt)P12(t)dt}\displaystyle+\left.\frac{\alpha_{1}^{2}\sigma^{2}}{2}\int_{0}^{T}\mathrm{e}^{2r(T-t)}P_{1}^{2}(t)\mathrm{d}t\right\}
θ20Teρr(Tt)[P1(t)P¯2(t)]2dt.\displaystyle-\frac{\theta}{2}\int_{0}^{T}\mathrm{e}^{\rho r(T-t)}[P_{1}(t)-\bar{P}_{2}(t)]^{2}\mathrm{d}t. (12)

According to the variational method, the first and second variations are

δJ\displaystyle\delta J =exp{α1x1erTα1v0Ter(Tt)P1(t)dt\displaystyle=\exp\left\{-\alpha_{1}x_{1}\mathrm{e}^{rT}-\alpha_{1}v\int_{0}^{T}\mathrm{e}^{r(T-t)}P_{1}(t)\mathrm{d}t\right.
+α12σ220Te2r(Tt)P12(t)dt}\displaystyle+\left.\frac{\alpha_{1}^{2}\sigma^{2}}{2}\int_{0}^{T}\mathrm{e}^{2r(T-t)}P_{1}^{2}(t)\mathrm{d}t\right\}
0T[ver(Tt)α1σ2e2r(Tt)P1(t)]δP1(t)dt\displaystyle\cdot\int_{0}^{T}\left[v\mathrm{e}^{r(T-t)}-\alpha_{1}\sigma^{2}\mathrm{e}^{2r(T-t)}P_{1}(t)\right]\delta P_{1}(t)\mathrm{d}t
θ0Teρr(Tt)[P1(t)P¯2(t)]δP1(t)dtand\displaystyle-\theta\int_{0}^{T}\mathrm{e}^{\rho r(T-t)}[P_{1}(t)-\bar{P}_{2}(t)]\delta P_{1}(t)\mathrm{d}t\quad\mbox{and} (13)
δ2J\displaystyle\delta^{2}J =exp{α1x1erTα1v0Ter(Tt)P1(t)dt\displaystyle=-\exp\left\{-\alpha_{1}x_{1}\mathrm{e}^{rT}-\alpha_{1}v\int_{0}^{T}\mathrm{e}^{r(T-t)}P_{1}(t)\mathrm{d}t\right.
+α12σ220Te2r(Tt)P12(t)dt}α1{0T[ver(Tt)\displaystyle+\left.\frac{\alpha_{1}^{2}\sigma^{2}}{2}\int_{0}^{T}\mathrm{e}^{2r(T-t)}P_{1}^{2}(t)\mathrm{d}t\right\}\cdot\alpha_{1}\left\{\int_{0}^{T}\left[v\mathrm{e}^{r(T-t)}\right.\right.
α1σ2e2r(Tt)P1(t)]δP1(t)dt}2\displaystyle-\left.\left.\alpha_{1}\sigma^{2}\mathrm{e}^{2r(T-t)}P_{1}(t)\right]\delta P_{1}(t)\mathrm{d}t\right\}^{2}
0T[exp{α1x1erTα1v0Ter(Tt)P1(t)dt\displaystyle-\int_{0}^{T}\left[\exp\left\{-\alpha_{1}x_{1}\mathrm{e}^{rT}-\alpha_{1}v\int_{0}^{T}\mathrm{e}^{r(T-t)}P_{1}(t)\mathrm{d}t\right.\right.
+α12σ220Te2r(Tt)P12(t)dt}α1σ2e2r(Tt)\displaystyle+\left.\left.\frac{\alpha_{1}^{2}\sigma^{2}}{2}\int_{0}^{T}\mathrm{e}^{2r(T-t)}P_{1}^{2}(t)\mathrm{d}t\right\}\cdot\alpha_{1}\sigma^{2}\mathrm{e}^{2r(T-t)}\right.
+θeρr(Tt)][δP1(t)]2dt<0.\displaystyle+\left.\theta\mathrm{e}^{\rho r(T-t)}\right][\delta P_{1}(t)]^{2}\mathrm{d}t<0. (14)

According to the Jacobi condition, the sufficient and necessary condition for the functional supremum is given by δJ=0\delta J=0 for {P1(t)}t𝒯\{P_{1}^{*}(t)\}_{t\in\mathcal{T}}. Therefore, we have

P1(t)=ηα2σ2eϱr(Tt)+θηα1σ2eϱr(Tt)+θP¯2(t),t𝒯.P_{1}^{*}(t)=\frac{\eta\alpha_{2}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+\theta}{\eta\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+\theta}\cdot\bar{P}_{2}(t),t\in\mathcal{T}. (15)

Given {P¯2(t)}t𝒯\{\bar{P}_{2}(t)\}_{t\in\mathcal{T}} as in (5), we have (7), where the integral constant is defined as

η\displaystyle\eta :=α1𝔼ϕ(X1(T))\displaystyle:=-\alpha_{1}\mathbb{E}\phi(X_{1}^{*}(T))
=exp{α1xerTα1v0Ter(Tt)P1(t)dt\displaystyle=\exp\left\{-\alpha_{1}x\mathrm{e}^{rT}-\alpha_{1}v\int_{0}^{T}\mathrm{e}^{r(T-t)}P_{1}^{*}(t)\mathrm{d}t\right.
+α12σ220Te2r(Tt)P12(t)dt}.\displaystyle+\left.\frac{\alpha_{1}^{2}\sigma^{2}}{2}\int_{0}^{T}\mathrm{e}^{2r(T-t)}P_{1}^{*2}(t)\mathrm{d}t\right\}. (16)

From (16), η\eta is proportional to the utility of the optimal terminal wealth X1(T)X_{1}^{*}(T), the terminal wealth under the optimal decision {P1(t)}t𝒯\{P_{1}^{*}(t)\}_{t\in\mathcal{T}}, and the weight is α1<0-\alpha_{1}<0. Therefore, the integral constant η\eta is a decreasing function of the expected utility of the terminal wealth 𝔼ϕ(X1(T))\mathbb{E}\phi(X_{1}(T)). By substituting (7) into (16), we have

η=exp{α1x1erTv2T2σ2+0Tϑ2v2(α1/α21)2dt2σ2[ηeϱr(Tt)+ϑ]2}.\eta=\exp\left\{-\alpha_{1}x_{1}\mathrm{e}^{rT}-\frac{v^{2}T}{2\sigma^{2}}+\int_{0}^{T}\frac{\vartheta^{2}v^{2}\left(\alpha_{1}/\alpha_{2}-1\right)^{2}\mathrm{d}t}{2\sigma^{2}\left[\eta\mathrm{e}^{\varrho r(T-t)}+\vartheta\right]^{2}}\right\}. (17)

However, (17) has no closed-form solution. Thus, we propose Algorithm 1 to calculate its numerical solution. ∎

According to the Fixed Point Theorem, we prove the convergence of Algorithm 1 in Theorem 2.

Theorem 2.

Algorithm 1 converges to η\eta if

ϑ2v2(α1α2)2η¯α22σ2η¯31e2ϱrT2ϱr1,\frac{\vartheta^{2}v^{2}\left(\alpha_{1}-\alpha_{2}\right)^{2}\overline{\eta}}{\alpha_{2}^{2}\sigma^{2}\underline{\eta}^{3}}\cdot\frac{1-\mathrm{e}^{-2\varrho rT}}{2\varrho r}\leqslant 1, (18)

where the lower bound is η¯=exp{α1x1erTv2T2σ2}\underline{\eta}=\exp\left\{-\alpha_{1}x_{1}\mathrm{e}^{rT}-\frac{v^{2}T}{2\sigma^{2}}\right\}, and the upper bound is η¯=η¯exp{0Tϑ2v2(α1/α21)2dt2σ2[η¯eϱr(Tt)+ϑ]2}\overline{\eta}=\underline{\eta}\exp\left\{\int_{0}^{T}\frac{\vartheta^{2}v^{2}\left(\alpha_{1}/\alpha_{2}-1\right)^{2}\mathrm{d}t}{2\sigma^{2}\left[\underline{\eta}\mathrm{e}^{\varrho r(T-t)}+\vartheta\right]^{2}}\right\}.

Proof.

We define the following iteration function:

f(ξ):=η¯exp{0Tϑ2v2(α1/α21)2dt2σ2[ξeϱr(Tt)+ϑ]2},f(\xi):=\underline{\eta}\exp\left\{\int_{0}^{T}\frac{\vartheta^{2}v^{2}\left(\alpha_{1}/\alpha_{2}-1\right)^{2}\mathrm{d}t}{2\sigma^{2}\left[\xi\mathrm{e}^{\varrho r(T-t)}+\vartheta\right]^{2}}\right\}, (19)

which satisfies f(η)=ηf(\eta)=\eta, i.e., η\eta is a fixed point of the iteration function ff. To prove the convergence, we need to find a self-mapping interval, and then prove that the iteration function satisfies the Lipschitz continuity condition on the self-mapping interval. First, it is obvious that f(ξ)η¯f(\xi)\geqslant\underline{\eta} for all ξ[η¯,η¯]\xi\in[\underline{\eta},\overline{\eta}] because the exponential term in (19) is greater than 11. Second, it can be easily proved that f(ξ)f(\xi) is a monotonically decreasing function with respect to ξ\xi for all ξ[η¯,η¯]\xi\in[\underline{\eta},\overline{\eta}], so we have f(ξ)f(η¯)=η¯f(\xi)\leqslant f(\underline{\eta})=\overline{\eta} for all ξ[η¯,η¯]\xi\in[\underline{\eta},\overline{\eta}]. Therefore, there is always f(ξ)[η¯,η¯]f(\xi)\in[\underline{\eta},\overline{\eta}] for all ξ[η¯,η¯]\xi\in[\underline{\eta},\overline{\eta}], i.e., f:[η¯,η¯][η¯,η¯]f:[\underline{\eta},\overline{\eta}]\mapsto[\underline{\eta},\overline{\eta}]. It should be noted that f(ξ)f(\xi) is a continuously differentiable function on [η¯,η¯][\underline{\eta},\overline{\eta}]. To check the Lipschitz continuity condition, note that

|f(ξ)|\displaystyle|f^{\prime}(\xi)| =f(ξ)0Tϑ2v2(α1/α21)2eϱr(Tt)dtσ2[ξeϱr(Tt)+ϑ]3\displaystyle=f(\xi)\int_{0}^{T}\frac{\vartheta^{2}v^{2}\left(\alpha_{1}/\alpha_{2}-1\right)^{2}\mathrm{e}^{\varrho r(T-t)}\mathrm{d}t}{\sigma^{2}\left[\xi\mathrm{e}^{\varrho r(T-t)}+\vartheta\right]^{3}}
<f(ξ)0Tϑ2v2(α1/α21)2dtξ3σ2e2ϱr(Tt)\displaystyle<f(\xi)\int_{0}^{T}\frac{\vartheta^{2}v^{2}\left(\alpha_{1}/\alpha_{2}-1\right)^{2}\mathrm{d}t}{\xi^{3}\sigma^{2}\mathrm{e}^{2\varrho r(T-t)}}
=ϑ2v2(α1α2)2f(ξ)ξ3α22σ21e2ϱrT2ϱr.\displaystyle=\frac{\vartheta^{2}v^{2}\left(\alpha_{1}-\alpha_{2}\right)^{2}f(\xi)}{\xi^{3}\alpha_{2}^{2}\sigma^{2}}\cdot\frac{1-\mathrm{e}^{-2\varrho rT}}{2\varrho r}. (20)

If (18) holds, then we have

|f(ξ)|<ϑ2v2(α1α2)2η¯α22σ2η¯31e2ϱrT2ϱr1|f^{\prime}(\xi)|<\frac{\vartheta^{2}v^{2}\left(\alpha_{1}-\alpha_{2}\right)^{2}\overline{\eta}}{\alpha_{2}^{2}\sigma^{2}\underline{\eta}^{3}}\cdot\frac{1-\mathrm{e}^{-2\varrho rT}}{2\varrho r}\leqslant 1 (21)

for all ξ[η¯,η¯]\xi\in[\underline{\eta},\overline{\eta}]. Therefore, Algorithm 1 converges when the initial value ξ[η¯,η¯]\xi\in[\underline{\eta},\overline{\eta}]. Because the initial value in Algorithm 1 is η0=η¯[η¯,η¯]\eta_{0}=\underline{\eta}\in[\underline{\eta},\overline{\eta}], Algorithm 1 converges to η\eta. ∎

4 Impact of Herd Behaviour on Optimal Decision

4.1 Rational Decision Decomposition and Investment Opinion

Theorem 1 and Algorithm 1 provide the analytical solution for A1’s optimal decision in Problem 2. However, due to the complexity of the expression of (7), it is complicated to theoretically analyze the impact of herd behaviour on A1’s optimal decision. Because the objective functional in Problem 2 is a weighted average of the expected utility of terminal wealth 𝔼ϕ(X1(T))\mathbb{E}\phi(X_{1}(T)) and the average deviation D[P1P¯2]D[P_{1}\|\bar{P}_{2}], an intuitive notion is that A1’s optimal decision is a function of both A1 and A2’s rational decisions. In Theorem 3, we prove that A1’s optimal decision is a convex linear combination of the two agents’ rational decisions, which we call the rational decision decomposition.

Theorem 3.

The optimal decision {P1(t)}t𝒯\{P_{1}^{*}(t)\}_{t\in\mathcal{T}} in Problem 2 is a convex linear combination of the two agents’ rational decisions {P¯1(t)}t𝒯\{\bar{P}_{1}(t)\}_{t\in\mathcal{T}} and {P¯2(t)}t𝒯\{\bar{P}_{2}(t)\}_{t\in\mathcal{T}}, i.e.,

P1(t)=Z1(t)P¯1(t)+[1Z1(t)]P¯2(t),t𝒯,P_{1}^{*}(t)=Z_{1}(t)\bar{P}_{1}(t)+[1-Z_{1}(t)]\bar{P}_{2}(t),t\in\mathcal{T}, (22)

where the weight function {Z1(t)}t𝒯\{Z_{1}(t)\}_{t\in\mathcal{T}} is given by

Z1(t)=ηeϱr(Tt)ηeϱr(Tt)+ϑ,t𝒯.Z_{1}(t)=\frac{\eta\mathrm{e}^{\varrho r(T-t)}}{\eta\mathrm{e}^{\varrho r(T-t)}+\vartheta},t\in\mathcal{T}. (23)
Proof.

By combining (5), (15) and (22), we can obtain

Z1(t)\displaystyle Z_{1}(t) =P1(t)P¯2(t)P¯1(t)P¯2(t)=ηα2σ2eϱr(Tt)+θηα1σ2eϱr(Tt)+θP¯2(t)P¯2(t)α2α1P¯2(t)P¯2(t)\displaystyle=\frac{P_{1}^{*}(t)-\bar{P}_{2}(t)}{\bar{P}_{1}(t)-\bar{P}_{2}(t)}=\frac{\frac{\eta\alpha_{2}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+\theta}{\eta\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+\theta}\cdot\bar{P}_{2}(t)-\bar{P}_{2}(t)}{\frac{\alpha_{2}}{\alpha_{1}}\bar{P}_{2}(t)-\bar{P}_{2}(t)}
=ηα1σ2eϱr(Tt)ηα1σ2eϱr(Tt)+θ=ηeϱr(Tt)ηeϱr(Tt)+ϑ,\displaystyle=\frac{\eta\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}}{\eta\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+\theta}=\frac{\eta\mathrm{e}^{\varrho r(T-t)}}{\eta\mathrm{e}^{\varrho r(T-t)}+\vartheta}, (24)

for all t𝒯t\in\mathcal{T}, which is (23). From (23), the range of investment opinion is (0,1)(0,1), which indicates that A1’s optimal decision is a convex linear combination of the two agents’ rational decisions. ∎

From (22), {Z1(t)}t𝒯\{Z_{1}(t)\}_{t\in\mathcal{T}} quantitatively describes the extent to which A1 adheres to his/her own rational decision, reflecting A1’s opinion on how good his/her own rational decision is, and thereby influencing the amount of the risky asset. Therefore, we define it as A1’s investment opinion. A smaller Z1(t)Z_{1}(t) indicates that A1’s decision is more inclined to that of A2.

To analyze the temporal evolution of herd behaviour and its impact on A1’s decision, it is essential to examine the dynamics of A1’s investment opinion. In Theorem 4, we derive the differential equation of the investment opinion.

Theorem 4.

The differential equation of the investment opinion in (23) is given by

Z˙1(t)\displaystyle\dot{Z}_{1}(t) =ϱrZ1(t)[1Z1(t)],t𝒯,\displaystyle=-\varrho rZ_{1}(t)[1-Z_{1}(t)],t\in\mathcal{T}, (25)
Z1(T)\displaystyle Z_{1}(T) =ηη+ϑ.\displaystyle=\frac{\eta}{\eta+\vartheta}. (26)
Proof.

According to (23), we have

Z˙1(t)\displaystyle\dot{Z}_{1}(t) =ϱrηeϱr(Tt)ηeϱr(Tt)+ϑ+ϱrη2e2ϱr(Tt)[ηeϱr(Tt)+ϑ]2\displaystyle=-\varrho r\frac{\eta\mathrm{e}^{\varrho r(T-t)}}{\eta\mathrm{e}^{\varrho r(T-t)}+\vartheta}+\varrho r\frac{\eta^{2}\mathrm{e}^{2\varrho r(T-t)}}{[\eta\mathrm{e}^{\varrho r(T-t)}+\vartheta]^{2}}
=ϱrZ1(t)[1Z1(t)],\displaystyle=-\varrho rZ_{1}(t)[1-Z_{1}(t)], (27)

which is (25). By substituting TT into (23), we have (26). ∎

In the following, we reframe Problem 2 from the perspective of the investment opinion and show that Problem 2 is essentially equivalent to the following problem where A1 determines his/her optimal investment opinion.

Problem 3. sup{Z1(t)}t𝒯𝒰𝔼ϕ(X1(T))λI[Z1]\displaystyle\sup_{\{Z_{1}(t)\}_{t\in\mathcal{T}}\in\mathcal{U}}\mathbb{E}\phi(X_{1}(T))-\lambda I[Z_{1}]
s.t. dX1(t)=[rX1(t)+vP1(t)]dt+σP1(t)dW(t),\displaystyle\mathrm{d}X_{1}(t)=[rX_{1}(t)+vP_{1}(t)]\mathrm{d}t+\sigma P_{1}(t)\mathrm{d}W(t),
P1(t)=Z1(t)P¯1(t)+[1Z1(t)]P¯2(t),\displaystyle P_{1}(t)=Z_{1}(t)\bar{P}_{1}(t)+[1-Z_{1}(t)]\bar{P}_{2}(t),
X1(0)=x1,\displaystyle X_{1}(0)=x_{1},

where I[Z1]=120Teϱr(tT)Z12(t)dtI[Z_{1}]=\frac{1}{2}\int_{0}^{T}\mathrm{e}^{\varrho r(t-T)}Z_{1}^{2}(t)\mathrm{d}t and λ\lambda is a constant.

Theorem 5.

When λ=ϑv2(α1α2)2α22σ2\lambda=\frac{\vartheta v^{2}(\alpha_{1}-\alpha_{2})^{2}}{\alpha_{2}^{2}\sigma^{2}}, Problem 2 and Problem 3 are equivalent, i.e., the optimal solutions {P1(t)}t𝒯\{P_{1}^{*}(t)\}_{t\in\mathcal{T}} and {Z1(t)}t𝒯\{Z_{1}(t)\}_{t\in\mathcal{T}} are both given by (7) and (23).

Proof.

Substituting (22) into Problem 2, we have

λ\displaystyle\lambda =θD[P1P¯2]I[Z1]=θ0Teρr(Tt)[P1(t)P¯2(t)]2dt0Teϱr(tT)Z12(t)dt\displaystyle=\frac{\theta D[P_{1}\|\bar{P}_{2}]}{I[Z_{1}]}=\frac{\theta\int_{0}^{T}\mathrm{e}^{\rho r(T-t)}[P_{1}(t)-\bar{P}_{2}(t)]^{2}\mathrm{d}t}{\int_{0}^{T}\mathrm{e}^{\varrho r(t-T)}Z_{1}^{2}(t)\mathrm{d}t}
=θ0Teρr(Tt)[P¯1(t)P¯2(t)]2Z12(t)dt0Teϱr(tT)Z12(t)dt\displaystyle=\frac{\theta\int_{0}^{T}\mathrm{e}^{\rho r(T-t)}[\bar{P}_{1}(t)-\bar{P}_{2}(t)]^{2}Z_{1}^{2}(t)\mathrm{d}t}{\int_{0}^{T}\mathrm{e}^{\varrho r(t-T)}Z_{1}^{2}(t)\mathrm{d}t}
=θv2(α1α2)2α12α22σ4=ϑv2(α1α2)2α22σ2.\displaystyle=\frac{\theta v^{2}(\alpha_{1}-\alpha_{2})^{2}}{\alpha_{1}^{2}\alpha_{2}^{2}\sigma^{4}}=\frac{\vartheta v^{2}(\alpha_{1}-\alpha_{2})^{2}}{\alpha_{2}^{2}\sigma^{2}}. (28)

In this case, Problem 2 and Problem 3 are equivalent. ∎

4.2 Parameters’ Influence on the Optimal Decision

With Theorem 3 and Theorem 4, we can quantitatively analyze the impact of herd behaviour on A1’s optimal decision.

4.2.1 Decay rate

When 0ρ<20\leqslant\rho<2, we have Z˙1(t)<0\dot{Z}_{1}(t)<0 for all t𝒯t\in\mathcal{T}, which indicates a monotonic decrease in A1’s investment opinion over time. Consequently, A1’s decision converges to that of A2 over time. This is because later deviations exert a significant impact when the decay rate is relatively small.

When ρ=2\rho=2, we have Z˙1(t)=0\dot{Z}_{1}(t)=0 and Z1(t)=ηη+ϑZ_{1}(t)=\frac{\eta}{\eta+\vartheta} for all t𝒯t\in\mathcal{T}. Therefore, A1’s optimal decision is a time-invariant convex linear combination of the two agents’ rational decisions, i.e., P1(t)=ηη+ϑP¯1(t)+ϑη+ϑP¯2(t)P_{1}^{*}(t)=\frac{\eta}{\eta+\vartheta}\bar{P}_{1}(t)+\frac{\vartheta}{\eta+\vartheta}\bar{P}_{2}(t) for all t𝒯t\in\mathcal{T}.

When ρ>2\rho>2, we have Z˙1(t)>0\dot{Z}_{1}(t)>0 for all t𝒯t\in\mathcal{T}, which indicates that A1’s investment opinion displays a monotonically increasing trend over time, while the level of convergence gradually diminishes. This is due to the relatively large decay rate of the average deviation, where later deviations have less impact. Hence, A1’s decision increasingly tends toward his/her own rational decision, diverging from that of A2.

4.2.2 Herd coefficient

Theorem 6.

The investment opinion Z1(t)Z_{1}(t) is a decreasing function of θ\theta with Z1(t)θ<0\frac{\partial Z_{1}(t)}{\partial\theta}<0.

Proof.

According to (17), we have

η=η¯exp{0Tv2(α1/α21)2dt2σ2[ηϑα1σ2eϱr(Tt)+1]2}.\eta=\underline{\eta}\exp\left\{\int_{0}^{T}\frac{v^{2}\left(\alpha_{1}/\alpha_{2}-1\right)^{2}\mathrm{d}t}{2\sigma^{2}\left[\frac{\eta}{\vartheta}\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+1\right]^{2}}\right\}. (29)

Assume η\eta is a decreasing function of ϑ\vartheta. As ϑ\vartheta increases, the left-hand side of (29) will decrease, while the right-hand side of (29) will increase, which is contradictory. Therefore, η\eta must be an increasing function of ϑ\vartheta, i.e., ηϑ>0\frac{\partial\eta}{\partial\vartheta}>0. Let y:=ηϑy:=\frac{\eta}{\vartheta}, and we have

η=η¯exp{0Tv2(α1/α21)2dt2σ2[yα1σ2eϱr(Tt)+1]2}.\eta=\underline{\eta}\exp\left\{\int_{0}^{T}\frac{v^{2}\left(\alpha_{1}/\alpha_{2}-1\right)^{2}\mathrm{d}t}{2\sigma^{2}\left[y\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+1\right]^{2}}\right\}. (30)

It can be easily proved that yη<0\frac{\partial y}{\partial\eta}<0. Therefore, we have yθ=yηηϑϑθ<0\frac{\partial y}{\partial\theta}=\frac{\partial y}{\partial\eta}\cdot\frac{\partial\eta}{\partial\vartheta}\cdot\frac{\partial\vartheta}{\partial\theta}<0. According to (23), we have

Z1(t)\displaystyle Z_{1}(t) =yα1σ2eϱr(Tt)yα1σ2eϱr(Tt)+1and\displaystyle=\frac{y\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}}{y\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+1}\quad\mbox{and} (31)
Z1(t)y\displaystyle\frac{\partial Z_{1}(t)}{\partial y} =α1σ2eϱr(Tt)[yα1σ2eϱr(Tt)+1]2>0.\displaystyle=\frac{\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}}{[y\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+1]^{2}}>0. (32)

Therefore, we have Z1(t)θ=Z1(t)yyθ<0\frac{\partial Z_{1}(t)}{\partial\theta}=\frac{\partial Z_{1}(t)}{\partial y}\cdot\frac{\partial y}{\partial\theta}<0. ∎

Theorem 6 indicates that the investment opinion is a decreasing function of the herd coefficient θ\theta. Therefore, the following agent’s decision aligns more closely with that of the leading expert when the herd coefficient is larger.

4.2.3 Initial wealth, excess return rate and volatility

Next, we analyze the influence of other parameters, including the following agent’s initial wealth x1x_{1}, the excess return rate vv, and the volatility σ\sigma of the risky asset. We first study the influence of these parameters on the integral constant η\eta.

Theorem 7.

The integral constant η\eta is a decreasing function of x1x_{1} with ηx1<0\frac{\partial\eta}{\partial x_{1}}<0. When α1α2[0,2]\frac{\alpha_{1}}{\alpha_{2}}\in\left[0,2\right], η\eta is a decreasing function of vv with ηv<0\frac{\partial\eta}{\partial v}<0. When α1α2[133,1+33]\frac{\alpha_{1}}{\alpha_{2}}\in\left[1-\frac{\sqrt{3}}{3},1+\frac{\sqrt{3}}{3}\right], etaeta is an increasing function of σ\sigma with ησ>0\frac{\partial\eta}{\partial\sigma}>0.

Proof.

According to (17), we have

ηx1=ηα1erT1+η0Tϑ2v2(α1/α21)2eϱr(Tt)σ2[ηeϱr(Tt)+ϑ]3dt<0.\frac{\partial\eta}{\partial x_{1}}=-\frac{\eta\alpha_{1}\mathrm{e}^{rT}}{1+\eta\int_{0}^{T}\frac{\vartheta^{2}v^{2}(\alpha_{1}/\alpha_{2}-1)^{2}\mathrm{e}^{\varrho r(T-t)}}{\sigma^{2}[\eta\mathrm{e}^{\varrho r(T-t)}+\vartheta]^{3}}\mathrm{d}t}<0. (33)

When α1α2[0,2]\frac{\alpha_{1}}{\alpha_{2}}\in\left[0,2\right], we have (α1α21)2<1(\frac{\alpha_{1}}{\alpha_{2}}-1)^{2}<1 and

ηv\displaystyle\frac{\partial\eta}{\partial v} =η{vTσ2+0Tϑ2v(α1/α21)2σ2[ηeϱr(Tt)+ϑ]2dt}1+η0Tϑ2v2(α1/α21)2eϱr(Tt)σ2[ηeϱr(Tt)+ϑ]3dt\displaystyle=\frac{\eta\left\{-\frac{vT}{\sigma^{2}}+\int_{0}^{T}\frac{\vartheta^{2}v(\alpha_{1}/\alpha_{2}-1)^{2}}{\sigma^{2}[\eta\mathrm{e}^{\varrho r(T-t)}+\vartheta]^{2}}\mathrm{d}t\right\}}{1+\eta\int_{0}^{T}\frac{\vartheta^{2}v^{2}(\alpha_{1}/\alpha_{2}-1)^{2}\mathrm{e}^{\varrho r(T-t)}}{\sigma^{2}[\eta\mathrm{e}^{\varrho r(T-t)}+\vartheta]^{3}}\mathrm{d}t}
<ηvTσ2[(α1/α21)21]1+η0Tϑ2v2(α1/α21)2eϱr(Tt)σ2[ηeϱr(Tt)+ϑ]3dt<0.\displaystyle<\frac{\frac{\eta vT}{\sigma^{2}}[(\alpha_{1}/\alpha_{2}-1)^{2}-1]}{1+\eta\int_{0}^{T}\frac{\vartheta^{2}v^{2}(\alpha_{1}/\alpha_{2}-1)^{2}\mathrm{e}^{\varrho r(T-t)}}{\sigma^{2}[\eta\mathrm{e}^{\varrho r(T-t)}+\vartheta]^{3}}\mathrm{d}t}<0. (34)

When α1α2[133,1+33]\frac{\alpha_{1}}{\alpha_{2}}\in\left[1-\frac{\sqrt{3}}{3},1+\frac{\sqrt{3}}{3}\right], we have (α1α21)2<13(\frac{\alpha_{1}}{\alpha_{2}}-1)^{2}<\frac{1}{3} and

ησ\displaystyle\frac{\partial\eta}{\partial\sigma} =η{v2Tσ30Tθ2v2(α1/α21)2σ3[ηα1σ2eϱr(Tt)+θ]2dt}1+η0Tθ2v2(α1/α21)2α1eϱr(Tt)[ηα1σ2eϱr(Tt)+θ]3dt\displaystyle=\frac{\eta\left\{\frac{v^{2}T}{\sigma^{3}}-\int_{0}^{T}\frac{\theta^{2}v^{2}(\alpha_{1}/\alpha_{2}-1)^{2}}{\sigma^{3}[\eta\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+\theta]^{2}}\mathrm{d}t\right\}}{1+\eta\int_{0}^{T}\frac{\theta^{2}v^{2}(\alpha_{1}/\alpha_{2}-1)^{2}\alpha_{1}\mathrm{e}^{\varrho r(T-t)}}{[\eta\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+\theta]^{3}}\mathrm{d}t}
η0T2θ2v2(α1/α21)2ηα1eϱr(Tt)σ[ηα1σ2eϱr(Tt)+θ]3dt1+η0Tθ2v2(α1/α21)2α1eϱr(Tt)[ηα1σ2eϱr(Tt)+θ]3dt\displaystyle-\frac{\eta\int_{0}^{T}\frac{2\theta^{2}v^{2}(\alpha_{1}/\alpha_{2}-1)^{2}\eta\alpha_{1}\mathrm{e}^{\varrho r(T-t)}}{\sigma[\eta\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+\theta]^{3}}\mathrm{d}t}{1+\eta\int_{0}^{T}\frac{\theta^{2}v^{2}(\alpha_{1}/\alpha_{2}-1)^{2}\alpha_{1}\mathrm{e}^{\varrho r(T-t)}}{[\eta\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+\theta]^{3}}\mathrm{d}t}
>ηvTσ2[13(α1/α21)2]1+η0Tθ2v2(α1/α21)2α1eϱr(Tt)[ηα1σ2eϱr(Tt)+θ]3dt>0.\displaystyle>\frac{\frac{\eta vT}{\sigma^{2}}[1-3(\alpha_{1}/\alpha_{2}-1)^{2}]}{1+\eta\int_{0}^{T}\frac{\theta^{2}v^{2}(\alpha_{1}/\alpha_{2}-1)^{2}\alpha_{1}\mathrm{e}^{\varrho r(T-t)}}{[\eta\alpha_{1}\sigma^{2}\mathrm{e}^{\varrho r(T-t)}+\theta]^{3}}\mathrm{d}t}>0. (35)

So far, we have finished the proof of Theorem 7. ∎

Given Theorem 7, we can further analyze the impact of these parameters on the investment opinion.

Theorem 8.

The investment opinion Z1(t)Z_{1}(t) is a decreasing function of x1x_{1} with Z1(t)x1<0\frac{\partial Z_{1}(t)}{\partial x_{1}}<0. When α1α2[0,2]\frac{\alpha_{1}}{\alpha_{2}}\in\left[0,2\right], Z1(t)Z_{1}(t) is a decreasing function of vv with Z1(t)v<0\frac{\partial Z_{1}(t)}{\partial v}<0. When α1α2[133,1+33]\frac{\alpha_{1}}{\alpha_{2}}\in\left[1-\frac{\sqrt{3}}{3},1+\frac{\sqrt{3}}{3}\right], Z1(t)Z_{1}(t) is an increasing function of σ\sigma with Z1(t)σ>0\frac{\partial Z_{1}(t)}{\partial\sigma}>0.

Proof.

According to (23) and (25), we can prove that

Z1(t)η=ϑeϱr(Tt)[ηeϱr(Tt)+ϑ]2>0.\frac{\partial Z_{1}(t)}{\partial\eta}=\frac{\vartheta\mathrm{e}^{\varrho r(T-t)}}{[\eta\mathrm{e}^{\varrho r(T-t)}+\vartheta]^{2}}>0. (36)

Therefore, we have Z1(t)x1=Z1(t)ηηx1<0\frac{\partial Z_{1}(t)}{\partial x_{1}}=\frac{\partial Z_{1}(t)}{\partial\eta}\cdot\frac{\partial\eta}{\partial x_{1}}<0. When α1α2[0,2]\frac{\alpha_{1}}{\alpha_{2}}\in\left[0,2\right], we have Z1(t)v=Z1(t)ηηv<0\frac{\partial Z_{1}(t)}{\partial v}=\frac{\partial Z_{1}(t)}{\partial\eta}\cdot\frac{\partial\eta}{\partial v}<0. When α1α2[133,1+33]\frac{\alpha_{1}}{\alpha_{2}}\in\left[1-\frac{\sqrt{3}}{3},1+\frac{\sqrt{3}}{3}\right], we have Z1(t)σ=Z1(t)ηησ>0\frac{\partial Z_{1}(t)}{\partial\sigma}=\frac{\partial Z_{1}(t)}{\partial\eta}\cdot\frac{\partial\eta}{\partial\sigma}>0. ∎

From Theorem 8, we can draw the following conclusions.

The following agent’s investment opinion is a decreasing function of his/her initial wealth x1x_{1}, indicating that when the following agent has a higher initial wealth, the investment opinion Z1(t)Z_{1}(t) is smaller, and from (22), his/her decision tends to converge more to that of the leading expert. This can also be intuitively explained using the diminishing margin feature of the utility function. As the initial wealth increases, the additional utility gained from terminal wealth diminishes. Therefore, the following agent pays more attention to reducing the disparity from the leading expert’s decision, causing the convergence of the two agents’ decisions.

Similarly, when the two agents’ risk aversion coefficients are close, a higher excess return rate vv and lower volatility σ\sigma result in a smaller investment opinion from Theorem 8. Therefore, from (23), the following agent’s decision converges more to that of the leading expert. This can be intuitively explained as follows. With a larger vv and a smaller σ\sigma, the risky asset achieves a higher expected utility of terminal wealth. Thus, from the diminishing margin feature of the utility function, the following agent’s decision tends to converge more to that of the leading expert.

5 Numerical Experiments

In this section, we conduct numerical experiments to validate our analyses in Section 3 and Section 4.

Refer to caption
Figure 1: Optimal decisions and investment opinions with different decay rates.

5.1 Parameter Settings

We choose a temporal span covering fifty years from January 1972 to December 2022, i.e., T=50T=50. During this investment period, we collect daily closing prices of the Dow Jones Industrial Average to represent the prices of the risky asset. Utilizing parameter estimation of the geometric Brownian motion, we determine the expected return μ\mu to be 0.070.07 and volatility σ\sigma to be 0.170.17 for the risky asset. We use the daily average of the interest rates of U.S. 1-Year Treasury Bills in 2022 and 2023 to represent the risk-free interest rate rr, which is approximately equal to 0.040.04. From the prior work in [31], we set α1=0.2\alpha_{1}=0.2 and α2=0.4\alpha_{2}=0.4 as the risk aversion coefficients for the following agent and leading expert, respectively. We set the modified herd coefficient ϑ\vartheta to values of 1800\frac{1}{800}, 1400\frac{1}{400}, 1200\frac{1}{200}, and 1100\frac{1}{100}. We set the following agent’s initial wealth as X1,0=0X_{1,0}=0. We observe the same trend for other values of the parameters.

5.2 Optimal Decision

We consider three cases with different decay rates: ρ=0\rho=0, ρ=2\rho=2, and ρ=4\rho=4. Experiment results are shown in Fig. 1. It can be seen that A1’s optimal decision consistently remains within the region bounded by the rational decisions of the two agents {P¯1(t)}t𝒯\{\bar{P}_{1}(t)\}_{t\in\mathcal{T}} and {P¯2(t)}t𝒯\{\bar{P}_{2}(t)\}_{t\in\mathcal{T}}. Furthermore, the optimal decision progressively approaches A2’s rational decision {P¯2(t)}t𝒯\{\bar{P}_{2}(t)\}_{t\in\mathcal{T}} as the herd coefficient increases. When ρ=0\rho=0, the optimal decision increasingly converges towards A2’s rational decision {P¯2(t)}t𝒯\{\bar{P}_{2}(t)\}_{t\in\mathcal{T}}, and the investment opinion decreases over time. When ρ=2\rho=2, the optimal decision maintains a constant distance from the rational decisions of the two agents, and the investment opinion remains constant. When ρ=4\rho=4, the optimal decision increasingly converges towards A1’s rational decision {P¯1(t)}t𝒯\{\bar{P}_{1}(t)\}_{t\in\mathcal{T}}, and the investment opinion increases over time. These observations agree with our analyses of the optimal decision and investment opinion in Theorem 3.

5.3 Parameters’ Influence on the Optimal Decision

Next, we study the parameters’ influence on A1’s optimal decision. As shown in Fig. 1, when ϑ\vartheta increases, the investment opinion Z1(t)Z_{1}(t) decreases. Therefore, Z1(t)Z_{1}(t) is a decreasing function of the modified herd coefficient ϑ\vartheta. Because θ=α1σ2ϑ\theta=\alpha_{1}\sigma^{2}\vartheta, Z1(t)Z_{1}(t) is also a decreasing function of the herd coefficient θ\theta, which validates the correctness of Theorem 6.

Keeping other parameters constant and ensuring α1α2[133,1+33]\frac{\alpha_{1}}{\alpha_{2}}\in\left[1-\frac{\sqrt{3}}{3},1+\frac{\sqrt{3}}{3}\right], we plot the curves of the integral constant η\eta with respect to the initial wealth xx, excess return rate μ\mu, and volatility σ\sigma. As shown in Fig. 2, η\eta increases with the decrease of xx and vv, and increases with the increase of σ\sigma, thereby validating the correctness of Theorem 7.

6 Conclusion

In this paper, we formulate a dual-agent optimal investment problem considering herd behaviour and introduce the average deviation term in the traditional Merton problem’s objective functional to measure the distance between the two agents’ decisions. We obtain the analytical solution using the variational method and quantitatively analyze the impact of herd behaviour on the following agent’s optimal decision using the rational decision decomposition. Furthermore, we introduce the concept of investment opinion to quantify the following agent’s preference for his/her own rational decision over that of the leading expert. We validate our analyses through numerical experiments on real stock data.

Refer to caption
(a) Initial wealth
Refer to caption
(b) Excess return rate
Refer to caption
(c) Volatility
Figure 2: Parameters’ influence on the integral constant η\eta.

References

  • [1] A. Bodnaruk and A. Simonov, “Do financial experts make better investment decisions?” Journal of Financial Intermediation, vol. 24, no. 4, pp. 514–536, 2015.
  • [2] J. R. Brown, Z. Ivković, P. A. Smith, and S. Weisbenner, “Neighbors matter: Causal community effects and stock market participation,” Journal of Finance, vol. 63, no. 3, pp. 1509–1531, 2008.
  • [3] H. Hong, J. D. Kubik, and J. C. Stein, “Social interaction and stock-market participation,” Journal of Finance, vol. 59, no. 1, pp. 137–163, 2004.
  • [4] V. M. Eguiluz and M. G. Zimmermann, “Transmission of information and herd behavior: An application to financial markets,” Physical Review Letters, vol. 85, no. 26, p. 5659, 2000.
  • [5] S. Alfarano, T. Lux, and F. Wagner, “Estimation of agent-based models: The case of an asymmetric herding model,” Computational Economics, vol. 26, pp. 19–49, 2005.
  • [6] W. Hu, G. Liu, W. Zhang, and T. Wu, “Study on random trading behavior, herd behavior and asset price volatility,” in 2016 Chinese Control and Decision Conference (CCDC).   IEEE, 2016, pp. 3157–3163.
  • [7] E. C. Chang, J. W. Cheng, and A. Khorana, “An examination of herd behavior in equity markets: An international perspective,” Journal of Banking & Finance, vol. 24, no. 10, pp. 1651–1679, 2000.
  • [8] T. T.-L. Chong, X. Liu, and C. Zhu, “What explains herd behavior in the Chinese stock market?” Journal of Behavioral Finance, vol. 18, no. 4, pp. 448–456, 2017.
  • [9] K. Yu, Y. Li, and Z. Cao, “Detection and estimation of herd behavior in futures market,” in 2019 International Conference on Economic Management and Model Engineering (ICEMME).   IEEE, 2019, pp. 213–223.
  • [10] P. Wanidwaranan and C. Padungsaksawasdi, “Unintentional herd behavior via the Google search volume index in international equity markets,” Journal of International Financial Markets, Institutions and Money, vol. 77, p. 101503, 2022.
  • [11] S. Zhou and X. Liu, “Internet postings and investor herd behavior: evidence from China’s open-end fund market,” Humanities and Social Sciences Communications, vol. 9, no. 1, pp. 1–11, 2022.
  • [12] H. M. Markowitz, “Portfolio selection,” Journal of Finance, vol. 7, no. 1, p. 77, 1952.
  • [13] P. A. Samuelson, “Lifetime portfolio selection by dynamic stochastic programming,” Stochastic optimization models in finance, pp. 517–524, 1975.
  • [14] G. C. Calafiore, “Multi-period portfolio optimization with linear control policies,” Automatica, vol. 44, no. 10, pp. 2463–2473, 2008.
  • [15] L. Yang, Y. Zheng, and J. Shi, “Risk-sensitive stochastic control with applications to an optimal investment problem under correlated noises,” in 2019 Chinese Control Conference (CCC).   IEEE, 2019, pp. 1356–1363.
  • [16] G. Li, Z. Huang, and Z. Wu, “A kind of optimal investment problem under inflation and uncertain exit time,” in 2022 Chinese Control Conference (CCC).   IEEE, 2022, pp. 1739–1744.
  • [17] A. Aljalal and B. Gashi, “Optimal investment in a market with random interest rate for borrowing: an explicit closed-form solution,” in 2022 International Conference on Control, Decision and Information Technologies (CoDIT), vol. 1.   IEEE, 2022, pp. 764–769.
  • [18] N. Alasmi and B. Gashi, “Optimal investment in a market with borrowing and a combined interest rate model,” in 2023 International Conference on Control, Decision and Information Technologies (CoDIT).   IEEE, 2023, pp. 01–06.
  • [19] R. C. Merton, “Lifetime portfolio selection under uncertainty: The continuous-time case,” The Review of Economics and Statistics, pp. 247–257, 1969.
  • [20] J. Gali, “Keeping up with the Joneses: Consumption externalities, portfolio choice, and asset prices,” Journal of Money, Credit and Banking, vol. 26, no. 1, pp. 1–8, 1994.
  • [21] C. Gollier, “Misery loves company: Equilibrium portfolios with heterogeneous consumption externalities,” International Economic Review, vol. 45, no. 4, pp. 1169–1192, 2004.
  • [22] B. Lauterbach and H. Reisman, “Keeping up with the Joneses and the home bias,” European Financial Management, vol. 10, no. 2, pp. 225–234, 2004.
  • [23] J.-P. Gómez, “The impact of keeping up with the Joneses behavior on asset prices and portfolio choice,” Finance Research Letters, vol. 4, no. 2, pp. 95–103, 2007.
  • [24] J.-P. Gómez, R. Priestley, and F. Zapatero, “Implications of keeping-up-with-the-Joneses behavior for the equilibrium cross section of stock returns: International evidence,” Journal of Finance, vol. 64, no. 6, pp. 2703–2737, 2009.
  • [25] N. Roussanov, “Diversification and its discontents: Idiosyncratic and entrepreneurial risk in the quest for social status,” Journal of Finance, vol. 65, no. 5, pp. 1755–1788, 2010.
  • [26] D. Garcia and G. Strobl, “Relative wealth concerns and complementarities in information acquisition,” The Review of Financial Studies, vol. 24, no. 1, pp. 169–207, 2011.
  • [27] M. Levy and H. Levy, “Keeping up with the Joneses and optimal diversification,” Journal of Banking & Finance, vol. 58, pp. 29–38, 2015.
  • [28] J. M. Merigo and M. Casanovas, “Induced aggregation operators in the Euclidean distance and its application in financial decision making,” Expert Systems with Applications, vol. 38, no. 6, pp. 7603–7608, 2011.
  • [29] S. Frederick, G. Loewenstein, and T. O’donoghue, “Time discounting and time preference: A critical review,” Journal of Economic Literature, vol. 40, no. 2, pp. 351–401, 2002.
  • [30] J. W. Pratt, “Risk aversion in the small and in the large,” in Uncertainty in economics.   Elsevier, 1978, pp. 59–79.
  • [31] K. Yuen, H. Yang, and K. Chu, “Estimation in the constant elasticity of variance model,” British Actuarial Journal, vol. 7, no. 2, pp. 275–292, 2001.