Mean Field Games of Major-Minor Agents with Recursive Functionals

Jianhui Huang Department of Applied Mathematics, The Hong Kong Polytechnic University, Hong Kong, China. Email: james.huang@polyu.edu.hk. This author’s research is partially supported by RGC Grant PolyU 15301119, 15307621, N PolyU504/19, NSFC 12171407 and KKZT. Wenqiang Li Corresponding author. School of Mathematics and Statistics, Shandong University, Weihai 264209, P.R. China. Email: wenqiangli@sdu.edu.cn. This author’s research is supported by the NSF of P.R. China (No. 12101537, 12271304). Harry Zheng Department of Mathematics, Imperial College, London SW72BZ, UK. Email:h.zheng@imperial.ac.uk. This author is partially supported by EPSRC (UK) grant (EP/V008331/1).

Abstract. This paper investigates a novel class of mean field games involving a major agent and numerous minor agents, where the agents’ functionals are recursive with nonlinear backward stochastic differential equation (BSDE) representations. We term these games “recursive major-minor” (RMM) problems. Our RMM modeling is quite general, as it employs empirical (state, control) averages to define the weak couplings in both the functionals and dynamics of all agents, regardless of their status as major or minor. We construct an auxiliary limiting problem of the RMM by a novel unified structural scheme combining a bilateral perturbation with a mixed hierarchical recomposition. This scheme has its own merits as it can be applied to analyze more complex coupling structures than those in the current RMM. Subsequently, we derive the corresponding consistency condition and explore asymptotic RMM equilibria. Additionally, we examine the RMM problem in specific linear-quadratic settings for illustrative purposes.

Keywords. Backward stochastic differential equation, Controlled large population system, Exchangeable decomposition, Major and minor agents, Mean field game, Recursive functional.

MSC2020 subject classifications. 93E20, 60H10, 60K35.

1 Introduction

Mean field game (MFG) theory was independently introduced by [27] and [24] from different perspectives, serving as an effective methodology for analyzing controlled large population (LP) systems. Typically, an LP system comprises a large number of agents that interact through their empirical distribution or averages. These interactions are weak in the sense that the degree of coupling among agents diminishes rapidly as the number of agents tends to infinity. A core element of MFG theory is the construction of a limiting auxiliary problem, under which all agents can be largely disentangled, allowing some decentralized approximate equilibrium to be characterized through a consistency matching. As such, MFG analysis can significantly reduce the dimension of the controlled LP system that each agent needs to analyze, greatly simplifying the related numerical analysis. A substantial body of research has been dedicated to MFG theory, yielding fruitful outcomes. A partial list of literature relevant to the current work includes [5, 6, 8, 11, 22, 23, 32, 34]. This paper focuses on a new class of MFG problems with the following weakly-coupled LP system including a major $\mathcal{A}_{0}$ and multiple minors $\{\mathcal{A}_{i}\}_{i=1}^{N}$, whose states $X^{0}$ and $\{X^{i}\}_{i=1}^{N}$ satisfy the following forward stochastic differential equations (SDEs)

\left\{\begin{aligned} dX_{t}^{0}=&b^{0}(t,X_{t}^{0},{u_{t}^{0}};X_{t}^{(N)},{u_{t}^{(N)}})dt+{\sigma^{0}(t,X_{t}^{0},{u_{t}^{0}};X_{t}^{(N)},{u_{t}^{(N)}})}dW^{0}_{t},\\ dX_{t}^{i}=&{b(t,X_{t}^{i},u_{t}^{i};X_{t}^{0},u_{t}^{0};X_{t}^{(N)},u_{t}^{(N)})dt+\sigma(t,X_{t}^{i},u_{t}^{i};X_{t}^{0},u_{t}^{0};X_{t}^{(N)},u_{t}^{(N)})dW_{t}^{i}},\end{aligned}\right.

(1.1)

with initial conditions $X_{0}^{0}=x_{0}^{0}\in\mathbb{R}^{n},\ X_{0}^{i}=x_{0}\in\mathbb{R}^{n}.$ Here, $X_{\cdot}^{(N)}=\frac{1}{N}\sum_{i=1}^{N}X_{\cdot}^{i},\ u_{\cdot}^{(N)}=\frac{1}{N}\sum_{i=1}^{N}u_{\cdot}^{i}$ stand for the (forward) state-average and control-average of all minors, respectively, and $W=(W^{0},W^{1},\cdots,W^{N})^{\top}$ is an $(N+1)$-dimensional standard Brownian motion on a complete probability space $(\Omega,\mathcal{F},{\mathbb{P}})$, where $W^{i}$ is the idiosyncratic noise for $\mathcal{A}_{i}$ while $W^{0}$ is the common noise. The agents $\{\mathcal{A}_{k}\}_{k=0}^{N}$ aim to maximize the recursive-type functionals $\{J_{k}\}_{k=0}^{N}$ given by

\begin{aligned} J_{0}(u^{0};u^{1},\cdots,u^{N})=&\Gamma^{0}(Y_{0}^{0})+\mathbb{E}\int_{0}^{T}g^{0}(t,\Theta_{t}^{0},u_{t}^{0};\Theta_{t}^{(N)},u_{t}^{(N)})dt,\\ J_{i}(u^{i};u^{-i},u^{0})=&\Gamma(Y_{0}^{i})+\mathbb{E}\int_{0}^{T}g(t,\Theta_{t}^{i},u_{t}^{i};\Theta_{t}^{0},u_{t}^{0};\Theta_{t}^{(N)},u_{t}^{(N)})dt,\quad i=1,\cdots,N,\end{aligned}

(1.2)

where $u^{-i}=\{\cdots,u^{i-1},u^{i+1},\cdots\}$ is the control profile excluding that of $\mathcal{A}_{i}$; $\Theta_{\cdot}^{(N)}=(X_{\cdot}^{(N)},Y_{\cdot}^{(N)},Z_{\cdot}^{(N)})$ is the state-average triple, with $Y^{(N)}_{\cdot}=\frac{1}{N}\sum_{i=1}^{N}Y_{\cdot}^{i}$ as the recursive state average and $Z^{(N)}_{\cdot}=\frac{1}{N}\sum_{i=1}^{N}Z_{\cdot}^{i}$ as the intensity state average (see [12]); the state triples $\{\Theta_{\cdot}^{k}=(X_{\cdot}^{k},Y_{\cdot}^{k},{Z}_{\cdot}^{k})\}_{k=0}^{N}$ satisfy (1.1) and the following backward SDE (BSDE), motivated by recursive utilities in economics [15, 28, 29]:

\left\{\begin{aligned} -dY_{t}^{0}=&f^{0}(t,\Theta_{t}^{0},u_{t}^{0};\Theta_{t}^{(N)},u_{t}^{(N)})dt-Z_{t}^{0,0}dW_{t}^{0}-\sum_{j=1}^{N}Z_{t}^{0,j}dW_{t}^{j},\\ -dY_{t}^{i}=&f(t,\Theta_{t}^{i},u_{t}^{i};\Theta_{t}^{0},u_{t}^{0};\Theta_{t}^{(N)},u_{t}^{(N)})dt-Z_{t}^{i,0}dW^{0}_{t}-Z_{t}^{i,i}dW^{i}_{t}-\sum_{j=1,j\neq i}^{N}Z_{t}^{i,j}dW_{t}^{j},\\ Y_{T}^{0}=&\Phi^{0}(X_{T}^{0},X_{T}^{(N)})+\xi^{0},\ Y_{T}^{i}=\Phi(X_{T}^{i},X_{T}^{0},X_{T}^{(N)})+\xi^{i},\end{aligned}\right.

(1.3)

where for $i=1,\cdots,N$ , $Z^{i}=(Z^{i,0},Z^{i,i},{\{Z^{i,j}\}_{j\neq i,j=1}^{N}})$ and $(Z^{i,0},Z^{i,i})$ is the principal intensity component, and $\{Z^{i,j}\}_{j\neq i}$ the marginal components. Since remainder terms $\sum_{j=1}^{N}Z_{t}^{0,j}dW_{t}^{j}$ and $\sum_{j=1,j\neq i}^{N}Z_{t}^{i,j}dW_{t}^{j}$ vanish as $N\rightarrow\infty$ (see Remark 3.1), we focus on the principal terms and set $Z^{(N)}_{\cdot}=\frac{1}{N}\sum_{i=1}^{N}(Z^{i,0}_{\cdot},Z^{i,i}_{\cdot})$ as averages on principal intensities.
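To make the forward part (1.1) of the model concrete, the following is a minimal Euler-Maruyama sketch of a scalar toy instance. The linear coefficients, the constant controls and the function name `simulate_forward` are illustrative assumptions and are not taken from the paper; the sketch only shows how the major's state, the minors' states and the state average $X^{(N)}$ are coupled at each time step.

```python
import numpy as np

def simulate_forward(N=200, T=1.0, steps=100, seed=0):
    """Euler-Maruyama for a toy linear instance of (1.1), scalar states (n = 1)."""
    rng = np.random.default_rng(seed)
    dt = T / steps
    x0 = 1.0                        # major state X^0, initial value x_0^0 (assumed)
    x = np.full(N, 0.5)             # minor states X^1..X^N, common initial value x_0
    u0, u = 0.1, np.full(N, -0.2)   # hypothetical constant controls
    avg_path = [x.mean()]
    for _ in range(steps):
        x_bar, u_bar = x.mean(), u.mean()           # state and control averages
        dW0 = rng.normal(0.0, np.sqrt(dt))          # major's noise W^0
        dW = rng.normal(0.0, np.sqrt(dt), size=N)   # minors' noises W^1..W^N
        # assumed b^0 = -x0 + u0 + 0.5*x_bar + 0.1*u_bar, sigma^0 = 0.3
        x0_new = x0 + (-x0 + u0 + 0.5 * x_bar + 0.1 * u_bar) * dt + 0.3 * dW0
        # assumed b = -x + u + 0.5*x0 + 0.2*u0 + 0.5*x_bar + 0.1*u_bar, sigma = 0.2
        x = x + (-x + u + 0.5 * x0 + 0.2 * u0 + 0.5 * x_bar + 0.1 * u_bar) * dt + 0.2 * dW
        x0 = x0_new
        avg_path.append(x.mean())
    return x0, x, np.array(avg_path)
```

Calling `simulate_forward()` returns the terminal major state, the terminal minor states and the path of the state average; the backward components $(Y,Z)$ of (1.3) are deliberately left out of this sketch.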

We refer to (1.1)-(1.3) as the recursive major-minor (RMM) problem. We defer its detailed assumptions to Section 2 and first highlight its modeling features.

Modeling features. (i) The RMM model delves into the interaction between a major $\mathcal{A}_{0}$ and a large number of minors $\{\mathcal{A}_{i}\}_{i=1}^{N}$. Traditional MFG studies assume that agents are all “minor” or “negligible”, meaning that an individual agent’s action cannot significantly impact the behavior of the population at a macro scale. The associated MFGs are thus referred to as “symmetric” because it suffices to examine a representative agent, provided agents are homogeneous and hence statistically exchangeable. In contrast, our RMM explores asymmetric interactions where agents have varying decisional capacities. A major agent may significantly influence the population’s behavior through her own decisions, whereas numerous minor agents can only affect the population through collective actions. This model is more realistic than the homogeneous minor setting, as it captures a range of diversified interaction mechanisms; see [5, 11, 23, 32, 34].

(ii) The RMM model further posits that objectives of all agents are represented recursively through nonlinear BSDEs, such as (1.3), with non-additive drivers $f^{0}$ or $f$ . The inclusion of recursive functionals in MFG studies is motivated by their advantageous decision-theoretic properties, especially in the current LP context featuring complex decision couplings. Indeed, recursive functionals are well-suited for decision theory due to their capability to explain various observed non-standard decision behaviors, such as the separation of inter-temporal substitution and risk aversion. Consistently, recursive functionals extend classical expected functionals (see [4, 7, 16]), which are relevant to a special class of BSDEs with additive drivers.

(iii) The RMM problem restricts its weak coupling to empirical averages, refraining from discussing more extensive empirical measures or distributions. Despite this limitation, the weak coupling of RMM remains quite general, as it is integrated into both the dynamics and payoff functionals of the major $\mathcal{A}_{0}$ and all minors $\{\mathcal{A}_{i}\}_{i=1}^{N}$, encompassing elements from both the state and the control. Moreover, due to recursive functionals, the state average is enriched by including not only $X^{(N)}$ on the objective (forward) states, but also $(Y^{(N)},Z^{(N)})$ on the (recursive and intensity) states reflecting the subjective averaged-out beliefs. Specifically, the intensity coupling $Z^{(N)}$ characterizes some average of risk (ambiguity) aversion across all agents.

Literature comparison. [10, 23] and [32] introduced the major-minor MFG within a linear-quadratic-Gaussian (LQG) framework on finite and infinite horizons, respectively. They employed augmented Riccati equations to characterize consistency conditions. [34] extended these major-minor studies to a nonlinear setting using the stochastic Hamilton-Jacobi-Bellman (HJB) approach, where the weak coupling is restrictive; for instance, the major’s state cannot enter the dynamics of the minors. Besides, [5] investigated a class of major-minor MFG problems also through the stochastic HJB approach. [8] studied major-minor MFGs via master equations, with agents using closed-loop controls. Recently, [6] investigated a class of MFG problems with asymmetric information between major and minor agents. [11] explored nonlinear major-minor MFGs with general weak couplings, allowing the major’s state to enter the dynamics of the minor agents. Additionally, the limiting control problem of the major agent there incorporates an endogenous mean-field term, based on an approximation through a two-agent non-zero-sum game. A forward type of maximum principle was utilized in this context.

Our paper distinguishes itself from aforementioned works by its focus on a nonlinear major-minor interaction with recursive functionals, and associated methodology of a backward-forward type of stochastic maximum principle. Our RMM modeling is particularly noteworthy for its introduction and detailed analysis of the weak couplings of the backward (recursive and intensity) state-average $(Y^{(N)},Z^{(N)})$ originating from the recursive functional. As previously mentioned, these couplings hold significant decision-making impacts and, to the best of our knowledge, have not been systematically addressed in the MFG literature. Consequently, the maximum principle we adopt and the consistency condition we derive take unique forms that differ from those in [11]. Additionally, we apply our general nonlinear outcomes to specific RMM problems in LQG settings. Our LQG-RMM studies not only recap and extend existing results on the forward MFG studies, but also provide new insights into its backward counterpart.

Another relevant work is [7], which also explored the major-minor interaction and recursive functionals. However, it is framed on a weak formulation, substantially different from our strong formulation. Specifically, the approach of [7] originates from a variant of the Girsanov transformation and optimization of a Hamiltonian function, whereas our analysis is rooted in a refined backward-forward stochastic maximum principle. More significantly, all minors in [7] are cooperative, and the associated MFG thus encompasses a two-layer mixed structure: all (cooperative) minor agents form a mean-field team (rather than a game) in an inner layer, while in an outer layer, the interacting major agent and a representative minor agent induce a two-person, non-mean-field game. Although also termed a MFG, the model in [7] is essentially a hybrid of a mean-field team and a non-mean-field-type two-person game, which contrasts with the decision structure we investigate. Consequently, the consistency matching, a central step in MFG analysis, is not applicable in [7] at all. In contrast, our work is distinguished by a novel scheme to auxiliary control construction and consistency matching, as detailed below. Furthermore, the limiting equilibria in [7] are characterized as saddle points, which are remarkably different from our non-zero-sum setup. Additionally, the admissible controls in [7] take feedback forms and are compact, unlike our open-loop and unbounded convex admissibility.

A unified structural scheme. Last but not least, as the core element in MFG, the auxiliary problem of the RMM is formulated through a novel structural scheme, which incorporates a bilateral perturbation and a hierarchical recomposition. This scheme facilitates a more incisive auxiliary construction through a sequential network, enabling a clear-cut realization of a complex mixture involving two minor agents, one exogenous and the other endogenous, alongside the endogenous major agent. This scheme is new in the MFG literature and distinguishes our work from previous studies, where auxiliary constructions are based on heuristic arguments. More importantly, such a scheme offers a unified methodology to tackle more complex LP interactions for which heuristic arguments are no longer tractable, for instance, when the LP system consists of heterogeneous agents with varying beliefs on model uncertainties.

Contributions. (i) We introduce a new class of RMM problems featuring major-minor asymmetric interactions and recursive objectives. (ii) We present a novel unified structural scheme to construct its pivotal auxiliary problem. (iii) We derive a new class of mean-field type of forward-backward SDEs (FBSDEs) to characterize the consistency condition of the RMM problem. (iv) We examine LQG studies in the RMM context in detail to gain deeper insights.

The remainder of this paper is organized as follows. Section 2 introduces basic assumptions of the RMM problem. Section 3 presents a unified structural scheme for the RMM problem, including a bilateral perturbation and a mixed triple-agent two-layer analysis. Section 4 studies the auxiliary control and the associated consistency condition (CC) of the RMM problem, and verifies its approximate Nash equilibrium. Section 5 is devoted to some LQG-RMM problems. Section 6 concludes. Some technical proofs and heavy notation are given in the Appendix.

2 Preliminary

For $i=0,1,\cdots,N$, let $\mathbb{F}^{i}=\{{\mathcal{F}}_{t}^{i}\}$ be the complete filtration generated by the Brownian motion $W^{i}$; namely, ${\mathcal{F}}_{t}^{i}=\sigma(W^{i}_{s},0\leq s\leq t)\vee\mathcal{N}_{\mathbb{P}}$ with $\mathcal{N}_{\mathbb{P}}$ the set of all $\mathbb{P}$-null sets in $\mathcal{F}$. Then, $\mathbb{F}=\{{\mathcal{F}}_{t}\}_{0\leq t\leq T}=\{\sigma(\bigcup_{i=0}^{N}{\mathcal{F}}_{t}^{i})\}_{0\leq t\leq T}$ denotes the centralized information generated by the Brownian motion $W$, and $\mathbb{F}^{i,0}=\{{\mathcal{F}}^{i,0}_{t}\}_{0\leq t\leq T}=\{\sigma({\mathcal{F}}_{t}^{i}\bigcup{\mathcal{F}}_{t}^{0})\}_{0\leq t\leq T}$ denotes the decentralized information for a generic minor agent $\mathcal{A}_{i}$, $i=1,\cdots,N$. Let $U_{0}\subset\mathbb{R}^{k_{0}}$ and $U\subset\mathbb{R}^{k}$ be two convex sets.

Definition 2.1.

$u^{0}$ is a centralized (resp. decentralized) admissible control for $\mathcal{A}_{0}$ , if it is an $\mathbb{F}$ -adapted (resp. $\mathbb{F}^{0}$ -adapted) $U_{0}$ -valued process with ${||u||_{L^{2}}:=}\mathbb{E}\int_{0}^{T}|u_{t}|^{2}dt<\infty$ . Similarly, for $i=1,2,\cdots,N$ , a $U$ -valued process $u^{i}$ is called a centralized (resp. decentralized) admissible control for $\mathcal{A}_{i}$ , if it is $\mathbb{F}$ -adapted (resp. $\mathbb{F}^{i,0}$ -adapted) with $||u^{i}||_{L^{2}}<\infty$ . Let $\mathcal{U}_{0}^{c}$ and $\mathcal{U}^{c}$ (resp. $\mathcal{U}_{0}^{d}$ and $\mathcal{U}_{i}^{d}$ ) be the set of all centralized (resp. decentralized) controls for $\mathcal{A}_{0}$ and $\mathcal{A}_{i}$ , respectively.

Definition 2.2.

For any $\varepsilon>0$, we say that an $(N+1)$-tuple of admissible controls $(u^{0,*},u^{1,*},\cdots,u^{N,*})\in\mathcal{U}^{c}_{0}\times\mathcal{U}^{c}\times\cdots\times\mathcal{U}^{c}$ (resp. $\in\mathcal{U}^{d}_{0}\times\mathcal{U}^{d}_{1}\times\cdots\times\mathcal{U}^{d}_{N}$) depending on $\varepsilon$ is a centralized (resp. decentralized) approximate $\varepsilon$-Nash equilibrium, if for all $(u^{0},u^{1},\cdots,u^{N})\in\mathcal{U}^{c}_{0}\times\mathcal{U}^{c}\times\cdots\times\mathcal{U}^{c}$ (resp. $\in\mathcal{U}^{d}_{0}\times\mathcal{U}^{d}_{1}\times\cdots\times\mathcal{U}^{d}_{N}$), we have

$\displaystyle J_{0}(u^{0,*},u^{1,*},\cdots,u^{N,*})\geq J_{0}(u^{0},u^{1,*},\cdots,u^{N,*})-\varepsilon,$
$\displaystyle J_{i}(u^{0,*},u^{1,*},\cdots,u^{N,*})\geq J_{i}(u^{0,*},\cdots,u^{i-1,*},u^{i},u^{i+1,*},\cdots,u^{N,*})-\varepsilon,\ i=1,\cdots,N.$
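The two inequalities of Definition 2.2 can be checked mechanically in a finite toy game. The sketch below is only an illustration of the $\varepsilon$-Nash condition: the helper `is_eps_nash`, the three-agent payoff with a mean-field type coupling, and the finite strategy grid are hypothetical and have nothing to do with the continuous-time RMM payoffs.

```python
def is_eps_nash(payoff, strategies, profile, eps):
    """Check the eps-Nash condition: no unilateral deviation gains more than eps.

    payoff(k, profile) -> float is agent k's payoff (to be maximized);
    strategies[k] is the finite list of options available to agent k.
    """
    for k, options in enumerate(strategies):
        base = payoff(k, profile)
        for alt in options:
            deviated = profile[:k] + (alt,) + profile[k + 1:]
            # violation of J_k(u*) >= J_k(deviation) - eps
            if payoff(k, deviated) > base + eps:
                return False
    return True

def payoff(k, profile):
    """Hypothetical payoff: agent k wants to track half of the population average."""
    avg = sum(profile) / len(profile)
    return -(profile[k] - 0.5 * avg) ** 2

strategies = [[0.0, 0.5, 1.0]] * 3
exact = is_eps_nash(payoff, strategies, (0.0, 0.0, 0.0), 0.0)
approx = is_eps_nash(payoff, strategies, (0.5, 0.0, 0.0), 0.2)
```

In this toy game the all-zero profile passes the check with $\varepsilon=0$, whereas the profile $(0.5,0,0)$ passes only once $\varepsilon$ exceeds the gain that the first agent can obtain by deviating.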

The exact Nash equilibrium corresponds to the case when $\varepsilon=0$ . Now we impose some assumptions on the following coefficients of (1.1)-(1.3) of the RMM problem:

		$\displaystyle(b^{0},\sigma^{0}):\ \ [0,T]\times\mathbb{R}^{n}\times U_{0}\times\mathbb{R}^{n}\times U\rightarrow\mathbb{R}^{n},\quad(b,\sigma):\ \ [0,T]\times\mathbb{R}^{n}\times U\times\mathbb{R}^{n}\times U_{0}\times\mathbb{R}^{n}\times U\rightarrow\mathbb{R}^{n},$
		$\displaystyle(f^{0},g^{0}):\ \ [0,T]\times\mathbb{R}^{n+m+m}\times U_{0}\times\mathbb{R}^{n+m+2m}\times U\rightarrow\mathbb{R}^{m}\times\mathbb{R},$
		$\displaystyle(f,g):\ \ [0,T]\times\mathbb{R}^{n+m+2m}\times U\times\mathbb{R}^{n+m+m}\times U_{0}\times\mathbb{R}^{n+m+2m}\times U\rightarrow\mathbb{R}^{m}\times\mathbb{R},$
		$\displaystyle\Phi^{0}:\ \mathbb{R}^{n+n}\rightarrow{\mathbb{R}^{m}},\quad\ \ \Phi:\ \mathbb{R}^{n+n+n}\rightarrow{\mathbb{R}^{m}},\quad\ \ {(\Gamma^{0},\Gamma):\ \mathbb{R}^{m}\rightarrow\mathbb{R}\times\mathbb{R}},$

Assumption (A1) (i) $(b^{0},\sigma^{0})$ and $(b,\sigma)$ are continuously differentiable in $(x^{0},u^{0};x^{(N)},u^{(N)})$ and $(x,u;x^{0},u^{0};$ $x^{(N)},u^{(N)})$ , respectively. All the derivatives of $(b^{0},\sigma^{0},b,\sigma)$ are bounded.
(ii) $(f^{0},g^{0})$ and $(f,g)$ are continuously differentiable in $(\theta^{0},\theta^{(N)})$ and $(\theta,\theta^{0},\theta^{(N)})$ , respectively, where $\theta^{0}=(x^{0},y^{0},z^{0},u^{0}),\ \theta^{(N)}=(x^{(N)},y^{(N)},z^{(N)},u^{(N)}),\ \theta=(x,y,z,u).$ $\Phi^{0}$ , $\Phi$ and $(\Gamma^{0},\Gamma)$ are continuously differentiable in $(x^{0},x^{(N)})$ , $(x,x^{0},x^{(N)})$ and $y$ , respectively.
(iii) All derivatives of $(f^{0},f,\Phi^{0},\Phi)$ are bounded. The derivatives of $g^{0},g$ and $(\Gamma^{0},\Gamma)$ are bounded by $C(1+|\theta^{0}|+|\theta^{(N)}|)$ , $C(1+|\theta|+|\theta^{0}|+|\theta^{(N)}|)$ and $C(1+|y|)$ , respectively, for some $C>0$ .
(iv) $\xi^{0}$ and $\xi^{i}$ , $1\leq i\leq N$ , are square-integrable, $\mathcal{F}_{T}^{0}$ - and $\mathcal{F}_{T}^{i,0}$ -measurable, respectively. Moreover, $\xi^{1},\cdots,\xi^{N}$ are independent and identically distributed conditionally on $\mathcal{F}_{T}^{0}$ .
(v) $(b^{0},\sigma^{0},b,\sigma;f^{0},f,g^{0},g)$ are uniformly continuous in $t$ .

For each admissible $(N+1)$ -tuple $\{u^{j}\}_{j=0}^{N}$ , the coupled SDE system (1.1) (resp. BSDE (1.3)) admits a unique solution tuple $\{X^{j}\}_{j=0}^{N}$ (resp. $\{Y^{j},Z^{j}\}_{j=0}^{N}$ ) under (A1), and the recursive-type payoffs $\{J_{i}\}_{i=0}^{N}$ are well-defined.

3 A unified structural scheme of the RMM problem

For a fixed $N$, if each agent has access to centralized information about all agents, including their realized instantaneous states and adopted controls, the RMM becomes a classical but high-dimensional $(N+1)$-agent game. Existence or uniqueness of its exact Nash equilibria can be ensured under certain mild but high-dimensional conditions, including semi-continuity, coercivity and concavity of $\{J_{i}\}_{i=0}^{N}$, or compactness of the admissible control sets. The open-loop (exact) equilibrium, denoted as $\{u^{i,*}\}_{i=0}^{N}\in{\mathcal{U}_{0}^{c}}\times\prod_{i=1}^{N}\mathcal{U}_{i}^{c}$, can be further characterized through a system of $N+1$ stationary conditions. However, this route to exact equilibria is feasible only in theory and becomes impractical due to the curse of dimensionality when $N$ is large.

MFG theory offers one way to construct near-optimal decentralized strategies as an approximation to the exact Nash equilibria. A key challenge in MFG is the construction of an auxiliary problem for dimension reduction. Previous MFG studies have constructed auxiliary problems intuitively based on heuristic arguments, which are effective only when the underlying coupling structure is not overly complex, for instance, when all agents are symmetric as in [14, 22, 24, 27], or when an asymmetric dominant major agent is included as in [5, 6, 8, 10, 11, 23, 32, 34]. However, heuristic analysis becomes infeasible for LP systems with more intricate couplings. One reason is that it fails to effectively configure a complex logic network in which various representative agents are mutually connected by an “exogenous-endogenous” relation. As an alternative, we propose a structural scheme that not only accounts for the full generality of the weak coupling in RMM problems, but also lays a unified foundation for analyzing more general and complex LP couplings. In the current RMM context, this scheme yields a bilateral perturbation and a triple-agent two-layer game, as discussed below.

3.1 A bilateral perturbation: the major agent

Letting $\varepsilon=0$ in Definition 2.2, $\mathcal{A}_{0}$ faces the optimization problem $\mathop{\rm sup}_{u^{0}\in\mathcal{U}_{0}^{c}}J_{0}(u^{0},u^{1,*},\cdots,u^{N,*}),$ assuming that all minors $\{\mathcal{A}_{i}\}_{i=1}^{N}$ implement the exact Nash equilibrium strategies $\{u^{i,*}\}_{i=1}^{N}$. When $\mathcal{A}_{0}$ adopts a perturbed centralized control $u^{0}\in\mathcal{U}_{0}^{c}$ instead of the exact $u^{0,*}$, the state system (1.1) becomes

\left\{\begin{aligned} d{X}_{t}^{0,\dagger}=&b^{0}(t,{X}_{t}^{0,\dagger},u_{t}^{0};X_{t}^{(N),\dagger},u_{t}^{(N),\ast})dt+\sigma^{0}(t,{X}_{t}^{0,\dagger},u_{t}^{0};X_{t}^{(N),\dagger},u_{t}^{(N),\ast})dW_{t}^{0},\\ dX_{t}^{i,\dagger}=&b(t,X_{t}^{i,\dagger},u_{t}^{i,*};{X}_{t}^{0,\dagger},u_{t}^{0};X_{t}^{(N),\dagger},u_{t}^{(N),*})dt+\sigma(t,X_{t}^{i,\dagger},u_{t}^{i,*};{X}_{t}^{0,\dagger},u_{t}^{0};X_{t}^{(N),\dagger},u_{t}^{(N),*})dW_{t}^{i},\end{aligned}\right.

(3.1)

with ${X}_{0}^{0,\dagger}=x_{0}^{0},\ X_{0}^{i,\dagger}=x_{0},\ i=1,\cdots,N.$ Here, $X_{\cdot}^{(N),\dagger}:=\frac{1}{N}\sum_{i=1}^{N}X_{\cdot}^{i,\dagger}$ is a quasi-realized state average, with the superscript “$\dagger$” emphasizing its dependence on the major’s perturbed $u^{0}$; whereas $u_{\cdot}^{(N),\ast}:=\frac{1}{N}\sum_{i=1}^{N}u_{\cdot}^{i,*}$ is the exact-realized control average, which depends only on the exact strategies, so the superscript “$\ast$” is still applied. This is essentially an open-loop feature.

By “quasi-realized,” we mean that the states are not exactly those realized when all agents apply their exact strategies; instead, they are “quasi-exact” since only $\mathcal{A}_{0}$ deviates from her exact strategy by adopting a perturbed control. Indeed, an exact Nash equilibrium $\{u_{t}^{i,*}\}_{i=1}^{N}$, in the open-loop sense, is defined directly on the basic inputs $(t,\omega)$ rather than on the “intermediate” states. Thus, a perturbed $u^{0}$ adopted by $\mathcal{A}_{0}$ does not change the controls of the minors. This is very different from the closed-loop case, in which a perturbed $u^{0}$ changes the major’s state and hence alters the implementation of $u^{i,*}$ constructed on the realized states. Using the notation

		$\displaystyle Y_{t}^{(N),\dagger}=\frac{1}{N}\sum_{i=1}^{N}Y_{t}^{i,\dagger},\quad Z_{t}^{(N),\dagger}=\frac{1}{N}\sum_{i=1}^{N}Z_{t}^{i,\dagger}=\frac{1}{N}\sum_{i=1}^{N}(Z_{t}^{i,\dagger,0},Z_{t}^{i,\dagger,i},\{Z_{t}^{i,\dagger,j}\}_{j\neq i})^{\top},$
		$\displaystyle\Theta_{t}^{j,\dagger}=(X_{t}^{j,\dagger},Y_{t}^{j,\dagger},Z_{t}^{j,\dagger}),\ j=0,1,\cdots,N,\quad\Theta_{t}^{(N),\dagger}=(X_{t}^{(N),\dagger},Y_{t}^{(N),\dagger},Z_{t}^{(N),\dagger}),$

and similar to (3.1), we can get the following quasi-realized coupled BSDEs

\left\{\begin{aligned} -dY_{t}^{0,\dagger}=&{f^{0}\big{(}t,\Theta_{t}^{0,\dagger},u_{t}^{0};\Theta_{t}^{(N),\dagger},u_{t}^{(N),\ast}\big{)}}dt-Z_{t}^{0,\dagger,0}dW_{t}^{0}-\sum_{j=1}^{N}Z_{t}^{0,\dagger,j}dW_{t}^{j},\\ -dY_{t}^{i,\dagger}=&f\big{(}t,{\Theta_{t}^{i,\dagger},u_{t}^{i,*};\Theta_{t}^{0,\dagger},u_{t}^{0};\Theta_{t}^{(N),\dagger},u_{t}^{(N),*}}\big{)}dt{-Z_{t}^{i,\dagger,0}dW_{t}^{0}-Z_{t}^{i,\dagger,i}dW_{t}^{i}}-\sum_{j=1,j\neq i}^{N}Z_{t}^{i,\dagger,j}dW_{t}^{j},\\ Y_{T}^{0,\dagger}=&\Phi^{0}\big{(}X_{T}^{0,\dagger},X_{T}^{(N),\dagger}\big{)}+\xi^{0},\quad Y_{T}^{i,\dagger}=\Phi\big{(}X_{T}^{i,\dagger};X_{T}^{0,\dagger},X_{T}^{(N),\dagger}\big{)}+\xi^{i},\quad i=1,\cdots,N.\\ \end{aligned}\right.

(3.2)
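The open-loop feature behind (3.1)-(3.2) can be illustrated numerically: for a fixed noise sample, a perturbation of the major's control leaves an open-loop minor control unchanged while still moving the minor's quasi-realized state, whereas a feedback control built on the major's state would change. The sketch below uses one minor, scalar states and hypothetical linear coefficients; the function `minor_control_path` and both control rules are assumptions made purely for illustration.

```python
import numpy as np

def minor_control_path(u0_level, feedback, steps=50, T=1.0, seed=1):
    """Return the minor's control path and terminal state for a constant major control."""
    rng = np.random.default_rng(seed)   # same seed => same noise sample omega
    dt = T / steps
    dW0 = rng.normal(0.0, np.sqrt(dt), steps)
    dW1 = rng.normal(0.0, np.sqrt(dt), steps)
    x0, x1, w1 = 1.0, 0.5, 0.0
    controls = []
    for k in range(steps):
        t = k * dt
        if feedback:
            u1 = -0.5 * x0               # closed loop: built on the realized major state
        else:
            u1 = np.sin(t) + 0.1 * w1    # open loop: a function of (t, omega) only
        controls.append(u1)
        # assumed toy dynamics for the major and the minor
        x0_next = x0 + (-x0 + u0_level) * dt + 0.3 * dW0[k]
        x1 = x1 + (-x1 + u1 + 0.5 * x0) * dt + 0.2 * dW1[k]
        x0 = x0_next
        w1 += dW1[k]
    return np.array(controls), x1

open_a, state_a = minor_control_path(0.0, feedback=False)
open_b, state_b = minor_control_path(0.5, feedback=False)
```

Here `open_a` and `open_b` coincide although `state_a` and `state_b` differ, which mirrors the distinction between the exact-realized control average and the quasi-realized state average; with `feedback=True` the two control paths would no longer agree.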

We aim to analyze the asymptotic limit as $N\rightarrow\infty$ with $u_{t}^{(N),*}\rightarrow\overline{u}_{t}^{*}\in\mathcal{F}_{t}^{0}.$ Then we take the limit of (3.1), (3.2) and the related recursive payoff for the major agent. By the continuity of coefficients $b^{0}$ and $b$ , $\lim_{N\rightarrow\infty}b^{0}(t,{X}_{t}^{0,\dagger},u_{t}^{0};X_{t}^{(N),\dagger},u_{t}^{(N),\ast})=b^{0}(t,\mathbb{X}_{t}^{0,\dagger},u_{t}^{0};\overline{\alpha}_{t}^{\dagger},\overline{u}_{t}^{*}),$ and

\begin{aligned} &\lim_{N\rightarrow\infty}\frac{1}{N}\sum_{i=1}^{N}b(t,X_{t}^{i,\dagger},u_{t}^{i,*};{X}_{t}^{0,\dagger},u_{t}^{0};X_{t}^{(N),\dagger},u_{t}^{(N),*})\\ =&\lim_{N\rightarrow\infty}\frac{1}{N}\sum_{i=1}^{N}\left[{\lim_{N\rightarrow\infty}\frac{1}{N}\sum_{i=1}^{N}}b\left(t,a;{X}_{t}^{0,\dagger},u_{t}^{0};X_{t}^{(N),\dagger},u_{t}^{(N),*}\right)\right]_{a=(X_{t}^{i,\dagger},u_{t}^{i,*})}\\ =&\lim_{N\rightarrow\infty}\frac{1}{N}\sum_{i=1}^{N}b(t,X_{t}^{i,\dagger},u_{t}^{i,*};{\mathbb{X}}_{t}^{0,\dagger},u_{t}^{0};\overline{\alpha}_{t}^{{\dagger}},\overline{u}_{t}^{*})={\mathbb{E}_{t}\Big{[}b(t,{\mathbb{X}}_{t}^{1,\dagger},u_{t}^{1,*};\mathbb{X}_{t}^{0,\dagger},u_{t}^{0};\overline{\alpha}_{t}^{{\dagger}},\overline{u}_{t}^{*})\Big{]}},\end{aligned}

where ${\mathbb{X}}_{t}^{0,\dagger}=\lim_{N\rightarrow\infty}{X}_{t}^{0,\dagger},\ {\mathbb{X}}_{t}^{1,\dagger}=\lim_{N\rightarrow\infty}{X}_{t}^{1,\dagger},\ \overline{\alpha}_{t}^{{\dagger}}=\lim_{N\rightarrow\infty}X_{t}^{(N),\dagger}$ are the associated limiting quantities, and $\mathbb{E}_{t}[\cdot]$ denotes the conditional expectation on the common information $\mathbb{F}^{0}$ . Similarly, taking the limit on (3.1), we get the asymptotic limit of the major agent:

\left\{\begin{aligned} d\mathbb{X}_{t}^{0,\dagger}=&b^{0}(t,\mathbb{X}_{t}^{0,\dagger},u_{t}^{0};{\overline{\alpha}}_{t}^{\dagger},\overline{u}_{t}^{*})dt+\sigma^{0}(t,\mathbb{X}_{t}^{0,\dagger},u_{t}^{0};{\overline{\alpha}}_{t}^{\dagger},\overline{u}_{t}^{*})dW_{t}^{0},\quad\mathbb{X}_{0}^{0,\dagger}=x_{0}^{0},\\ d{\overline{\alpha}}_{t}^{\dagger}=&\mathbb{E}_{t}\Big{[}b(t,{\mathbb{X}}_{t}^{1,\dagger},u_{t}^{1,*};\mathbb{X}_{t}^{0,\dagger},u_{t}^{0};{\overline{\alpha}}_{t}^{\dagger},\overline{u}_{t}^{*})\Big{]}dt,\quad\overline{\alpha}_{0}^{\dagger}=x_{0},\end{aligned}\right.

(3.3)

where ${\mathbb{X}}^{1,\dagger}$ denotes the state of the representative minor agent, say, $\mathcal{A}_{1}$:

d{\mathbb{X}}_{t}^{1,\dagger}=b(t,{\mathbb{X}}_{t}^{1,\dagger},u_{t}^{1,*};\mathbb{X}_{t}^{0,\dagger},u_{t}^{0};{{\overline{\alpha}}_{t}^{\dagger}},\overline{u}_{t}^{*})dt+\sigma(t,{\mathbb{X}}_{t}^{1,\dagger},u_{t}^{1,*};\mathbb{X}_{t}^{0,\dagger},u_{t}^{0};{{\overline{\alpha}}_{t}^{\dagger}},\overline{u}_{t}^{*})dW_{t}^{1},\quad{\mathbb{X}}_{0}^{1,\dagger}=x_{0}.

(3.4)

Comparing (3.3) and (3.4), we have ${\overline{\alpha}}_{t}^{\dagger}=\mathbb{E}_{t}[{\mathbb{X}}^{1,\dagger}_{t}]$ by noting $\overline{u}_{t}^{*}=\mathbb{E}_{t}[u_{t}^{1,*}]$ and the solution uniqueness of the second equation in (3.3). Then the limiting state $\mathbb{X}^{0,\dagger}$ of the major satisfies

\left\{\begin{aligned} d\mathbb{X}_{t}^{0,\dagger}=&b^{0}\big{(}t,\mathbb{X}_{t}^{0,\dagger},u_{t}^{0};\mathbb{E}_{t}[{\mathbb{X}}^{1,\dagger}_{t}],\mathbb{E}_{t}[{u}_{t}^{1,*}]\big{)}dt+\sigma^{0}\big{(}t,\mathbb{X}_{t}^{0,\dagger},u_{t}^{0};\mathbb{E}_{t}[{\mathbb{X}}^{1,\dagger}_{t}],\mathbb{E}_{t}[{u}_{t}^{1,*}]\big{)}dW_{t}^{0},\\ d{\mathbb{X}}_{t}^{1,\dagger}=&b\big{(}t,{\mathbb{X}}_{t}^{1,\dagger},u_{t}^{1,*};\mathbb{X}_{t}^{0,\dagger},u_{t}^{0};\mathbb{E}_{t}[{\mathbb{X}}^{1,\dagger}_{t}],\mathbb{E}_{t}[{u}_{t}^{1,*}]\big{)}dt+\sigma\big{(}t,{\mathbb{X}}_{t}^{1,\dagger},u_{t}^{1,*};\mathbb{X}_{t}^{0,\dagger},u_{t}^{0};\mathbb{E}_{t}[{\mathbb{X}}^{1,\dagger}_{t}],\mathbb{E}_{t}[{u}_{t}^{1,*}]\big{)}dW_{t}^{1},\end{aligned}\right.

(3.5)

with initial conditions $\mathbb{X}_{0}^{0,\dagger}=x_{0}^{0},\ {\mathbb{X}}_{0}^{1,\dagger}=x_{0}.$ Taking the limit in BSDE (3.2), similarly to (3.5), we obtain

\left\{\begin{aligned} -d\mathbb{Y}_{t}^{0,\dagger}=&f^{0}\Big{(}t,{\Pi_{t}^{0,\dagger},u_{t}^{0};\mathbb{E}_{t}[\Pi_{t}^{1,\dagger}],\overline{u}_{t}^{*}}\Big{)}dt-\mathbb{Z}_{t}^{0,\dagger}dW_{t}^{0},\\ -d{\mathbb{Y}}_{t}^{1,\dagger}=&f(t,{\Pi_{t}^{1,\dagger},u_{t}^{1,*};\Pi_{t}^{0,\dagger},u_{t}^{0};\mathbb{E}_{t}[\Pi_{t}^{1,\dagger}],\mathbb{E}_{t}[{u}_{t}^{1,*}]})dt-{{\mathbb{Z}}_{t}^{1,\dagger,0}dW_{t}^{0}-{\mathbb{Z}}_{t}^{1,\dagger,1}dW_{t}^{1}},\\ \mathbb{Y}_{T}^{0,\dagger}=&\Phi^{0}\big{(}\mathbb{X}_{T}^{0,\dagger},\mathbb{E}_{T}[{\mathbb{X}}^{1,\dagger}_{T}]\big{)}+\xi^{0},\quad{\mathbb{Y}}_{T}^{1,\dagger}=\Phi\big{(}{\mathbb{X}}_{T}^{1,\dagger};\mathbb{X}_{T}^{0,\dagger},\mathbb{E}_{T}[{\mathbb{X}}^{1,\dagger}_{T}]\big{)}+\xi^{1},\end{aligned}\right.

(3.6)

where $\Pi_{t}^{k,\dagger}=\lim_{N\rightarrow\infty}\Theta_{t}^{k,\dagger}=(\mathbb{X}_{t}^{k,\dagger},\mathbb{Y}_{t}^{k,\dagger},\mathbb{Z}_{t}^{k,\dagger})$ , $k=0,1.$
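The passage from the finite-$N$ average $X^{(N),\dagger}$ to the limit $\overline{\alpha}^{\dagger}$ in (3.3) can be checked numerically in a linear toy case. The sketch below drives the finite-$N$ system and the limiting pair with the same path of $W^{0}$ and records the largest gap between the empirical state average and the limiting mean-field term. The linear coefficients, the zero controls and the function name `average_vs_limit` are assumptions made for illustration only.

```python
import numpy as np

def average_vs_limit(N, T=1.0, steps=200, seed=0):
    """Max gap over the time grid between the N-agent state average and its limit."""
    rng = np.random.default_rng(seed)
    dt = T / steps
    x0, x = 1.0, np.full(N, 0.5)    # finite-N system: major state and minor states
    x0_lim, alpha = 1.0, 0.5        # limiting pair, analogue of (3.3)
    gap = 0.0
    for _ in range(steps):
        dW0 = rng.normal(0.0, np.sqrt(dt))          # shared by both systems
        dW = rng.normal(0.0, np.sqrt(dt), size=N)   # idiosyncratic noises
        x_bar = x.mean()
        # assumed b^0 = -x0 + 0.3*x_bar, sigma^0 = 0.3 (controls set to zero)
        x0_new = x0 + (-x0 + 0.3 * x_bar) * dt + 0.3 * dW0
        # assumed b = -x + 0.5*x0 + 0.5*x_bar, sigma = 0.2
        x = x + (-x + 0.5 * x0 + 0.5 * x_bar) * dt + 0.2 * dW
        x0 = x0_new
        # limit: the average is replaced by alpha, whose drift is the conditional mean of b
        x0_lim_new = x0_lim + (-x0_lim + 0.3 * alpha) * dt + 0.3 * dW0
        alpha = alpha + (-alpha + 0.5 * x0_lim + 0.5 * alpha) * dt
        x0_lim = x0_lim_new
        gap = max(gap, abs(x.mean() - alpha))
    return gap

gap_large = average_vs_limit(N=2000)
```

In this linear setting the averaged idiosyncratic noise has standard deviation of order $N^{-1/2}$, so the recorded gap should be small for large $N$; the limiting term keeps its randomness only through the common path of $W^{0}$, in line with $\overline{\alpha}_{t}^{\dagger}=\mathbb{E}_{t}[\mathbb{X}_{t}^{1,\dagger}]$.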

Remark 3.1.

The remainder terms $\sum_{j=1}^{N}Z_{t}^{0,\dagger,j}dW_{t}^{j}$ and $\sum_{j=1,j\neq i}^{N}Z_{t}^{i,\dagger,j}dW_{t}^{j}$ in BSDE (3.2) vanish as $N\rightarrow\infty$. For the sake of presentation, we may omit these remainder terms hereafter.

Finally, for the given $\mathbb{F}$ -adapted ${u}^{1,*}$ , we construct the following auxiliary problem for $\mathcal{A}_{0}$ associated with (3.6) and (3.5):

\displaystyle\mathop{\rm sup}_{u^{0}\in\mathcal{U}_{0}^{c}}\Big{\{}\Gamma^{0}(\mathbb{Y}_{0}^{0,\dagger})+\mathbb{E}\big{[}\int_{0}^{T}g^{0}(t,\Pi_{t}^{0,\dagger},u_{t}^{0};\mathbb{E}_{t}[\Pi_{t}^{1,\dagger}],\overline{u}_{t}^{*})dt\big{]}\Big{\}}.

(3.7)

An optimal control of (3.7), if it exists, depends on the given controls $\big{(}{u}^{1,*},\bar{u}^{*}\big{)}$.

3.2 A bilateral perturbation: a representative minor agent

We now turn to a representative minor $\mathcal{A}_{1}$. By Definition 2.2, $\mathcal{A}_{1}$ confronts an optimization problem under the assumption that $\mathcal{A}_{0}$ implements the exact Nash strategy $u^{0,*}\in\mathcal{U}_{0}^{c}$ and $\mathcal{A}_{j}$ ($j\geq 2$) implement $u^{j,*}\in\mathcal{U}^{c}$. If $\mathcal{A}_{1}$ applies a perturbed $u^{1}\in\mathcal{U}^{c}$, the states and functionals of $\mathcal{A}_{0}$ and $\mathcal{A}_{j}$ ($j\geq 2$) become

\left\{\begin{aligned} dX_{t}^{j,*}=&b(t,X_{t}^{j,*},u_{t}^{j,*};X_{t}^{0,*},u_{t}^{0,*};X_{t}^{(N),*},u_{t}^{(N),*})dt+\sigma(t,X_{t}^{j,*},u_{t}^{j,*};X_{t}^{0,*},u_{t}^{0,*};X_{t}^{(N),*},u_{t}^{(N),*})dW_{t}^{j},\\ -dY_{t}^{j,*}=&f(t,{\Theta_{t}^{j,*},u_{t}^{j,*};\Theta_{t}^{0,*},u_{t}^{0,*};\Theta_{t}^{(N),*},u_{t}^{(N),*})dt-Z_{t}^{j,*,0}dW_{t}^{0}-Z_{t}^{j,*,j}dW_{t}^{j}},\\ dX_{t}^{0,*}=&b^{0}(t,X_{t}^{0,*},u_{t}^{0,*};{X_{t}^{(N),*},u_{t}^{(N),*}})dt+\sigma^{0}(t,X_{t}^{0,*},u_{t}^{0,*};{X_{t}^{(N),*},u_{t}^{(N),*}})dW_{t}^{0},\\ -dY_{t}^{0,*}=&f^{0}(t,\Theta_{t}^{0,*},u_{t}^{0,*};\Theta_{t}^{(N),*},u_{t}^{(N),*})dt-Z_{t}^{0,*}dW_{t}^{0},\end{aligned}\right.

(3.8)

with $X_{0}^{0,*}=x_{0}^{0},\ Y_{T}^{0,*}=\Phi^{0}(X_{T}^{0,*},X_{T}^{(N),*})+\xi^{0},\ X_{0}^{j,*}=x_{0},\ Y_{T}^{j,*}=\Phi(X_{T}^{j,*};X_{T}^{0,*},X_{T}^{(N),*})+\xi^{j}.$ With a slight abuse of notation (the limits being the same), we denote $\Upsilon_{\cdot}^{(N),*}:=\frac{1}{N-1}\sum_{j=2}^{N}\Upsilon_{\cdot}^{j,*},\ \Upsilon=X,Y,Z,u,$ and $\{\Theta_{t}^{k,*}=(X_{t}^{k,*},Y_{t}^{k,*},Z_{t}^{k,*})\}_{k=0}^{N};\ \ \Theta_{t}^{(N),*}=(X_{t}^{(N),*},Y_{t}^{(N),*},Z_{t}^{(N),*}).$ Note that $(\Theta^{0,*},\Theta^{j,*},\Theta^{(N),*})$ are independent of $\mathcal{A}_{1}$ and are therefore exogenous for $\mathcal{A}_{1}$ . The state of $\mathcal{A}_{1}$ under $u^{1}$ satisfies

\displaystyle dX_{t}^{1,\ddagger}=b(t,X_{t}^{1,\ddagger},u_{t}^{1};X_{t}^{0,*},u_{t}^{0,*};X_{t}^{(N),*},u_{t}^{(N),*})dt+\sigma(t,X_{t}^{1,\ddagger},u_{t}^{1};X_{t}^{0,*},u_{t}^{0,*};X_{t}^{(N),*},u_{t}^{(N),*})dW_{t}^{1}.

The quasi-realized state $X_{\cdot}^{1,\ddagger}$ of $\mathcal{A}_{1}$ carries the superscript “ $\ddagger$ ” to indicate its dependence on the perturbed control $u^{1}$ , whereas the exact-realized state-control average $(X_{\cdot}^{(N),\ast},u_{\cdot}^{(N),\ast})$ depends only on the exact strategies and therefore retains the superscript “ $\ast$ ”. The value $Y^{1,\ddagger}_{0}$ of $\mathcal{A}_{1}$ , as affected by $u^{1}$ , is determined by

\left\{\begin{aligned} -dY_{t}^{1,\ddagger}=&f(t,\Theta_{t}^{1,\ddagger},u_{t}^{1};\Theta_{t}^{0,*},u_{t}^{0,*};\Theta_{t}^{(N),*},u_{t}^{(N),*})dt-Z_{t}^{1,\ddagger,0}dW_{t}^{0}-Z_{t}^{1,\ddagger,1}dW_{t}^{1},\\ Y_{T}^{1,\ddagger}=&\Phi(X_{T}^{1,\ddagger};X_{T}^{0,*},X_{T}^{(N),*})+\xi^{1},\end{aligned}\right.

(3.9)

with $\Theta_{t}^{1,\ddagger}=(X_{t}^{1,\ddagger},Y_{t}^{1,\ddagger},Z_{t}^{1,\ddagger})$ . Next, similarly to the analysis in Subsection 3.1, we obtain from (3.8) the following coupled mean-field FBSDE for $j\geq 2$ (noting $\mathbb{E}_{t}[u_{t}^{j,*}]=\overline{u}_{t}^{*}$ ):

\left\{\begin{aligned} d\mathbb{X}_{t}^{0,*}=&b^{0}\big{(}t,\mathbb{X}_{t}^{0,*},u_{t}^{0,*};\mathbb{E}_{t}[\mathbb{X}^{j,*}_{t}],\mathbb{E}_{t}[u_{t}^{j,*}]\big{)}dt+\sigma^{0}\big{(}t,\mathbb{X}_{t}^{0,*},u_{t}^{0,*};\mathbb{E}_{t}[\mathbb{X}^{j,*}_{t}],\mathbb{E}_{t}[u_{t}^{j,*}]\big{)}dW_{t}^{0},\\ d\mathbb{X}_{t}^{j,*}=&b\big{(}t,\mathbb{X}_{t}^{j,*},u_{t}^{j,*};\mathbb{X}_{t}^{0,*},u_{t}^{0,*};\mathbb{E}_{t}[\mathbb{X}^{j,*}_{t}],\mathbb{E}_{t}[u_{t}^{j,*}]\big{)}dt+\sigma\big{(}t,\mathbb{X}_{t}^{j,*},u_{t}^{j,*};\mathbb{X}_{t}^{0,*},u_{t}^{0,*};\mathbb{E}_{t}[\mathbb{X}^{j,*}_{t}],\mathbb{E}_{t}[u_{t}^{j,*}]\big{)}dW_{t}^{j},\\ -d\mathbb{Y}_{t}^{0,*}=&f^{0}\Big{(}t,{\Pi_{t}^{0,*},u_{t}^{0,*};\mathbb{E}_{t}[\Pi^{j,*}_{t}],\mathbb{E}_{t}[{u}^{j,*}_{t}]}\Big{)}dt-\mathbb{Z}_{t}^{0,*}dW_{t}^{0},\\ -d\mathbb{Y}_{t}^{j,*}=&f\Big{(}t,{\Pi_{t}^{j,*},u_{t}^{j,*};\Pi_{t}^{0,*},u_{t}^{0,*};\mathbb{E}_{t}[\Pi^{j,*}_{t}],\mathbb{E}_{t}[{u}^{j,*}_{t}]\Big{)}dt-\mathbb{Z}_{t}^{j,*,0}dW_{t}^{0}-\mathbb{Z}_{t}^{j,*,j}dW_{t}^{j}},\\ \mathbb{X}_{0}^{0,*}=&x_{0}^{0},\ \mathbb{X}_{0}^{j,*}=x_{0},\ \mathbb{Y}_{T}^{0,*}=\Phi^{0}\Big{(}\mathbb{X}_{T}^{0,*},\mathbb{E}_{T}[\mathbb{X}^{j,*}_{T}]\Big{)}+\xi^{0},\ \mathbb{Y}_{T}^{j,*}=\Phi\Big{(}\mathbb{X}_{T}^{j,*};\mathbb{X}_{T}^{0,*},\mathbb{E}_{T}[\mathbb{X}^{j,*}_{T}]\Big{)}+\xi^{j},\\ \end{aligned}\right.

(3.10)

by setting $\Pi_{\cdot}^{i,*}=(\mathbb{X}^{i,*}_{\cdot},\mathbb{Y}^{i,*}_{\cdot},\mathbb{Z}^{i,*}_{\cdot})=\lim_{N\rightarrow\infty}(X^{i,*}_{\cdot},Y^{i,*}_{\cdot},Z^{i,*}_{\cdot}),\ i\neq 1.$ Then, by (3.2) and (3.9), when $\mathcal{A}_{1}$ applies $u^{1}$ , the limits $\mathbb{X}^{1,\ddagger}(:=\lim_{N\rightarrow\infty}X^{1,\ddagger})$ and $\mathbb{Y}^{1,\ddagger}_{t}(:=\lim_{N\rightarrow\infty}Y_{t}^{1,\ddagger})$ satisfy

\left\{\begin{aligned} d\mathbb{X}_{t}^{1,\ddagger}=&b\Big{(}t,\mathbb{X}_{t}^{1,\ddagger},u_{t}^{1};\mathbb{X}_{t}^{0,*},u_{t}^{0,*};\mathbb{E}_{t}[\mathbb{X}^{j,*}_{t}],\mathbb{E}_{t}[u_{t}^{j,*}]\Big{)}dt+\sigma\Big{(}t,\mathbb{X}_{t}^{1,\ddagger},u_{t}^{1};\mathbb{X}_{t}^{0,*},u_{t}^{0,*};\mathbb{E}_{t}[\mathbb{X}^{j,*}_{t}],\mathbb{E}_{t}[u_{t}^{j,*}]\Big{)}dW_{t}^{1},\\ -d\mathbb{Y}^{1,\ddagger}_{t}=&f\Big{(}t,{\Pi^{1,\ddagger}_{t},u_{t}^{1};\Pi_{t}^{0,*},u_{t}^{0,*};\mathbb{E}_{t}[\Pi^{j,*}_{t}],\mathbb{E}_{t}[{u}^{j,*}_{t}]}\Big{)}dt{-\mathbb{Z}^{1,\ddagger,0}_{t}dW_{t}^{0}-\mathbb{Z}^{1,\ddagger,1}_{t}dW_{t}^{1}},\\ \mathbb{X}_{0}^{1,\ddagger}=&x_{0},\quad\mathbb{Y}^{1,\ddagger}_{T}=\Phi\Big{(}\mathbb{X}^{1,\ddagger}_{T};\mathbb{X}_{T}^{0,*},\mathbb{E}_{T}[\mathbb{X}^{j,*}_{T}]\Big{)}+\xi^{1},\end{aligned}\right.

(3.11)

where $\Pi_{t}^{1,\ddagger}=(\mathbb{X}_{t}^{1,\ddagger},\mathbb{Y}_{t}^{1,\ddagger},\mathbb{Z}_{t}^{1,\ddagger})$ . Along with (3.11), the objective of $\mathcal{A}_{1}$ is to maximize

\Gamma(\mathbb{Y}_{0}^{1,\ddagger})+\mathbb{E}\Big{[}\int_{0}^{T}g(t,\Pi_{t}^{1,\ddagger},u_{t}^{1};\Pi_{t}^{0,*},u_{t}^{0,*};\mathbb{E}_{t}[\Pi^{j,*}_{t}],\mathbb{E}_{t}[{u}^{j,*}_{t}])dt\Big{]}.

(3.12)

Note that an optimal control of $\mathcal{A}_{1}$ , if it exists, depends on ${u}^{0,*}$ and $\bar{u}^{*}$ .
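The exogeneity of the population averages from the standpoint of a single minor agent can be illustrated numerically. The following is a minimal sketch and not the RMM dynamics: it assumes a hypothetical deterministic, scalar, linear population $\dot{x}_{i}=-x_{i}+a\,\bar{x}+u_{i}$ (no noise, no major agent, no recursive functional), perturbs only the control of agent 1, and measures how far the average over the remaining agents moves. The function name and all parameter values are illustrative assumptions.

```python
def others_average_shift(n_agents, delta=1.0, a=0.5, horizon=1.0, n_steps=100):
    """Shift of the average state of agents 2..N when only agent 1's control changes.

    Toy assumption (not from the paper): dx_i/dt = -x_i + a * mean(x) + u_i,
    x_i(0) = 0, discretized by an explicit Euler scheme.
    """
    dt = horizon / n_steps

    def run(u1):
        # Agent 1 (index 0) uses the control u1; all other agents use 0.
        controls = [u1] + [0.0] * (n_agents - 1)
        states = [0.0] * n_agents
        for _ in range(n_steps):
            mean_state = sum(states) / n_agents
            states = [x + dt * (-x + a * mean_state + u)
                      for x, u in zip(states, controls)]
        # Average over the agents other than agent 1.
        return sum(states[1:]) / (n_agents - 1)

    # Compare the perturbed run with the unperturbed baseline.
    return abs(run(delta) - run(0.0))
```

In this linear toy model the shift returned by `others_average_shift` is proportional to $1/N$, which is the sense in which the averages over $j\geq 2$ may be treated as exogenous for $\mathcal{A}_{1}$ as $N\rightarrow\infty$.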

3.3 A hierarchical recomposition

We now apply a hierarchical recomposition to construct the desired auxiliary problem, which takes the form of a mixed two-layer game in the RMM context. To this end, we combine the perturbed system (3.5)-(3.7) on the side of $\mathcal{A}_{0}$ , indexed by $``{\dagger}"$ , with (3.11)-(3.12) on the side of the representative minor agent $\mathcal{A}_{1}$ , indexed by $``{\ddagger}"$ . This yields an (extended) state $({X}^{0,\dagger},{X}^{1,\dagger};{X}^{{1,}\ddagger},{X}^{0,\ddagger},{X}^{j,\ddagger})$ , labeled by $j\geq 2$ :

\left\{\begin{aligned} \quad d{X}_{t}^{0,\dagger}=&b^{0}\big{(}t,{X}_{t}^{0,\dagger},{{u_{t}^{0}}};\mathbb{E}_{t}[{{X}}^{1,\dagger}_{t}],\mathbb{E}_{t}[{u_{t}^{1}}]\big{)}dt+\sigma^{0}\big{(}t,{X}_{t}^{0,\dagger},{{u_{t}^{0}}};\mathbb{E}_{t}[{{X}}^{1,\dagger}_{t}],\mathbb{E}_{t}[{u_{t}^{1}}]\big{)}dW^{0}_{t},\\ d{{X}}_{t}^{1,\dagger}=&b\big{(}t,{{X}}_{t}^{1,\dagger},{u_{t}^{1}};{X}_{t}^{0,\dagger},{u_{t}^{0}};\mathbb{E}_{t}[{{X}}^{1,\dagger}_{t}],\mathbb{E}_{t}[{u_{t}^{1}}]\big{)}dt+\sigma\big{(}t,{{X}}_{t}^{1,\dagger},{u_{t}^{1}};{X}_{t}^{0,\dagger},{u_{t}^{0}};\mathbb{E}_{t}[{{X}}^{1,\dagger}_{t}],\mathbb{E}_{t}[{u_{t}^{1}}]\big{)}dW_{t}^{1},\\ \quad d{X}_{t}^{{1,}\ddagger}=&b\big{(}t,{X}_{t}^{{1},\ddagger},{u_{t}^{1}};{{X}_{t}^{0,\ddagger},{u_{t}^{0}};\mathbb{E}_{t}[{X}^{j,\ddagger}_{t}],\mathbb{E}_{t}[{u_{t}^{j}}]}\big{)}dt+\sigma\big{(}t,{X}_{t}^{{1},\ddagger},{u_{t}^{1}};{{X}_{t}^{0,\ddagger},{u_{t}^{0}};\mathbb{E}_{t}[{X}^{j,\ddagger}_{t}],\mathbb{E}_{t}[{u_{t}^{j}}]}\big{)}dW_{t}^{1},\\ \quad d{X}_{t}^{0,\ddagger}=&b^{0}\big{(}t,{X}_{t}^{0,\ddagger},{u_{t}^{0}};\mathbb{E}_{t}[{X}^{j,\ddagger}_{t}],\mathbb{E}_{t}[{u_{t}^{j}}]\big{)}dt+\sigma^{0}\big{(}t,{X}_{t}^{0,\ddagger},{u_{t}^{0}};\mathbb{E}_{t}[{X}^{j,\ddagger}_{t}],\mathbb{E}_{t}[{u_{t}^{j}}]\big{)}dW_{t}^{0},\\ \quad d{X}_{t}^{j,\ddagger}=&b\big{(}t,{X}_{t}^{j,\ddagger},{u_{t}^{j}};{X}_{t}^{0,\ddagger},{u_{t}^{0}};\mathbb{E}_{t}[{X}^{j,\ddagger}_{t}],\mathbb{E}_{t}[{u_{t}^{j}}]\big{)}dt+\sigma\big{(}t,{X}_{t}^{j,\ddagger},{u_{t}^{j}};{X}_{t}^{0,\ddagger},{u_{t}^{0}};\mathbb{E}_{t}[{X}^{j,\ddagger}_{t}],\mathbb{E}_{t}[{u_{t}^{j}}]\big{)}dW_{t}^{j},\\ {X}_{0}^{0,\dagger}=&{X}_{0}^{0,\ddagger}=x^{0}_{0},\quad{{X}}_{0}^{1,\dagger}={X}_{0}^{1,\ddagger}={X}_{0}^{j,\ddagger}=x_{0}.\end{aligned}\right.

(3.13)

Indeed, the first and second states $({X}^{0,\dagger},{{X}}^{1,\dagger})$ come from (3.5), the third ${X}^{1,\ddagger}$ from (3.11), and the last two $({X}^{0,\ddagger},{X}^{j,\ddagger})$ from (3.10), after replacing the exact centralized controls $(u^{0,*},u^{1,*},u^{j,*})$ with decentralized ones $(u^{0},u^{1},u^{j}).$ With the extended state (3.13), three agents arise: a follower $\mathcal{A}_{0}$ using the decentralized control $u^{0}$ , a follower $\mathcal{A}_{1}$ using $u^{1}$ , and a leader $\mathcal{A}_{j}$ using $u^{j},$ for some $j\geq 2.$ Recall that $u^{0}$ is $\mathbb{F}^{0}$ -adapted and $u^{1}$ (resp. $u^{j}$ ) is $\mathbb{F}^{1,0}$ - (resp. $\mathbb{F}^{j,0}$ -) adapted. Therefore, the follower $\mathcal{A}_{0}$ directly affects all five components, the follower $\mathcal{A}_{1}$ directly affects the first three $({X}^{0,\dagger},{{X}}^{1,\dagger},{X}^{1,\ddagger}),$ while the leader $\mathcal{A}_{j}$ directly affects the last three $({X}^{1,\ddagger},{X}^{0,\ddagger},{X}^{j,\ddagger}).$ On the other hand, all five components are coupled through their dependence on the control triple $(u^{0},u^{1},u^{j}).$ In this sense, $\mathcal{A}_{1}$ and $\mathcal{A}_{j}$ also influence the whole 5-tuple (indirectly); all components are endogenous and none is redundant.

We can now formulate limiting recursive functionals. Specifically, the follower $\mathcal{A}_{0}$ aims to

{\mathop{\rm sup}\limits_{u^{0}\in\mathcal{U}^{d}_{0}}}\Big{\{}\Gamma^{0}({Y}_{0}^{0,\dagger})+\mathbb{E}\big{[}\int_{0}^{T}g^{0}(t,\Theta_{t}^{0,\dagger},u_{t}^{0};\mathbb{E}_{t}[{\Theta}_{t}^{1,\dagger}],\mathbb{E}_{t}[u_{t}^{1}])dt\big{]}\Big{\}},

(3.14)

where $({Y}^{0,\dagger},{Z}^{0,\dagger};{Y}^{1,\dagger},{Z}^{1,\dagger})$ is the solution of the coupled mean-field BSDE

\left\{\begin{aligned} -d{Y}_{t}^{0,\dagger}=&f^{0}\Big{(}t,{\Theta_{t}^{0,\dagger},u_{t}^{0};\mathbb{E}_{t}[{\Theta}_{t}^{1,\dagger}],\mathbb{E}_{t}[{u}_{t}^{1}]}\Big{)}dt-{Z}_{t}^{0,\dagger}dW_{t}^{0},\\ -d{{Y}}_{t}^{1,\dagger}=&f(t,{\Theta_{t}^{1,\dagger},u_{t}^{1};\Theta_{t}^{0,\dagger},u_{t}^{0};\mathbb{E}_{t}[\Theta_{t}^{1,\dagger}],\mathbb{E}_{t}[{u}_{t}^{1}])dt-{{Z}}_{t}^{1,\dagger,0}dW_{t}^{0}-{{Z}}_{t}^{1,\dagger,1}dW_{t}^{1}},\\ {Y}_{T}^{0,\dagger}=&\Phi^{0}\big{(}{X}_{T}^{0,\dagger},\mathbb{E}_{T}[{{X}}^{1,\dagger}_{T}]\big{)}+\xi^{0},\quad{{Y}}_{T}^{1,\dagger}=\Phi\big{(}{{X}}_{T}^{1,\dagger};{X}_{T}^{0,\dagger},\mathbb{E}_{T}[{{X}}^{1,\dagger}_{T}]\big{)}+\xi^{1}.\end{aligned}\right.

(3.15)

The aim of the follower $\mathcal{A}_{1}$ is

{\mathop{\rm sup}_{u^{1}\in\mathcal{U}_{1}^{d}}}\Big{\{}\Gamma({Y}_{0}^{1,\ddagger})+\mathbb{E}\Big{[}\int_{0}^{T}g(t,\Theta_{t}^{1,\ddagger},u_{t}^{1};\Theta_{t}^{0,\ddagger},u_{t}^{0};\mathbb{E}_{t}[\Theta^{j,\ddagger}_{t}],\mathbb{E}_{t}[{u}^{j}_{t}])dt\Big{]}\Big{\}},

(3.16)

where $({Y}^{1,\ddagger},{Z}^{1,\ddagger})$ is the solution of the BSDE

\left\{\begin{aligned} -d{Y}^{1,\ddagger}_{t}=&f\Big{(}t,\Theta^{1,\ddagger}_{t},u_{t}^{1};\Theta_{t}^{0,\ddagger},u_{t}^{0};\mathbb{E}_{t}[\Theta^{j,\ddagger}_{t}],\mathbb{E}_{t}[{u}^{j}_{t}]\Big{)}dt-{Z}^{1,\ddagger,0}_{t}dW_{t}^{0}-{Z}^{1,\ddagger,1}_{t}dW_{t}^{1},\\ {Y}^{1,\ddagger}_{T}=&\Phi\Big{(}{X}^{1,\ddagger}_{T};{X}_{T}^{0,\ddagger},\mathbb{E}_{T}[{X}^{j,\ddagger}_{T}]\Big{)}+\xi^{1},\end{aligned}\right.

(3.17)

with the exogenous processes $({Y}^{0,\ddagger},{Z}^{0,\ddagger};{Y}^{j,\ddagger},{Z}^{j,\ddagger})$ given by

\left\{\begin{aligned} -d{Y}_{t}^{0,\ddagger}=&f^{0}\Big{(}t,\Theta_{t}^{0,\ddagger},u_{t}^{0};\mathbb{E}_{t}[\Theta^{j,\ddagger}_{t}],\mathbb{E}_{t}[{u}^{j}_{t}]\Big{)}dt-{Z}_{t}^{0,\ddagger}dW_{t}^{0},\\ -d{Y}_{t}^{j,\ddagger}=&f\Big{(}t,\Theta_{t}^{j,\ddagger},u_{t}^{j};\Theta_{t}^{0,\ddagger},u_{t}^{0};\mathbb{E}_{t}[\Theta^{j,\ddagger}_{t}],\mathbb{E}_{t}[{u}^{j}_{t}]\Big{)}dt-{Z}_{t}^{j,\ddagger,0}dW_{t}^{0}-{Z}_{t}^{j,\ddagger,j}dW_{t}^{j},\\ {Y}_{T}^{0,\ddagger}=&\Phi^{0}\Big{(}{X}_{T}^{0,\ddagger},\mathbb{E}_{T}[{X}^{j,\ddagger}_{T}]\Big{)}+\xi^{0},\quad{Y}_{T}^{j,\ddagger}=\Phi\Big{(}{X}_{T}^{j,\ddagger};{X}_{T}^{0,\ddagger},\mathbb{E}_{T}[{X}^{j,\ddagger}_{T}]\Big{)}+\xi^{j}.\\ \end{aligned}\right.

(3.18)

And the leader $\mathcal{A}_{j}$ aims to minimize the following quadratic deviation functional

{\inf_{u^{j}\in\mathcal{U}^{d}_{j}}||u^{j}-u^{1}||_{L^{2}}^{W^{0}}:=\mathbb{E}\int_{0}^{T}|\mathbb{E}_{t}[u^{j}_{t}]-\mathbb{E}_{t}[u^{1}_{t}]|^{2}dt.}

(3.19)

Last, we identify mixed leader-follower-Nash interactions among $\mathcal{A}_{0},\mathcal{A}_{1},\mathcal{A}_{j}$ as follows:

• As the unique leader, at the beginning $t=0,$ $\mathcal{A}_{j}$ (for some $j\geq 2$ ) announces an (open-loop) control $u^{j}$ on the whole horizon $[0,T]$ to both followers $\mathcal{A}_{0}$ and $\mathcal{A}_{1}$ .
• Given the announced $u^{j}|_{[0,T]},$ $\mathcal{A}_{0}$ and $\mathcal{A}_{1}$ simultaneously seek a Nash equilibrium $(u^{0},u^{1})$ . In particular, $\mathcal{A}_{0}$ and $\mathcal{A}_{1}$ optimize the functionals (3.14) and (3.16), respectively.
• Anticipating the best responses $u^{0}$ and $u^{1}$ , parameterized by the a priori announced $u^{j}$ , the leader takes this leader-follower interaction into account to minimize the quadratic deviation cost (3.19). This minimum can be reached via a fixed-point argument or a consistency condition.

Given the above interactions, the mixed triple-agent game is solved by the following steps.
(1) First, formulate and solve an auxiliary control problem for $\mathcal{A}_{0}$ in terms of the control $u^{0}$ and the state $({X}^{0,\dagger},{{X}}^{1,\dagger},{Y}^{0,\dagger},{Z}^{0,\dagger},{Y}^{1,\dagger},{Z}^{1,\dagger})$ , with a generic admissible control $u^{1}$ of $\mathcal{A}_{1}$ fixed. We denote the optimal control by $\tilde{u}^{0}=\tilde{u}^{0}[u^{1}]$ to show its dependence on $u^{1}$ .
(2) Second, for given ${u}^{0}$ and pre-announced $u^{j}$ , formulate and solve an auxiliary control problem for $\mathcal{A}_{1}$ in terms of the control $u^{1}$ and the state $({X}^{1,\ddagger},{X}^{0,\ddagger},{X}^{j,\ddagger},{Y}^{1,\ddagger},{Z}^{1,\ddagger},{Y}^{0,\ddagger},{Z}^{0,\ddagger},{Y}^{j,\ddagger},{Z}^{j,\ddagger}).$ We denote the optimal control by $\tilde{u}^{1}=\tilde{u}^{1}[u^{j},{u}^{0}]$ to show its dependence on $u^{j}$ and $u^{0}.$
(3) Last, find the Nash equilibrium of $\mathcal{A}_{0}$ and $\mathcal{A}_{1}$ , say $(\hat{u}^{0}[u^{j}],\hat{u}^{1}[u^{j}])$ , which depends on the pre-announced $u^{j}$ , and solve the optimization of the leader by matching $\mathbb{E}_{t}[u^{j}_{t}]=\mathbb{E}_{t}\big{[}\hat{u}_{t}^{1}[u^{j}_{t}]\big{]}.$
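The logic of steps (1)-(3) can be sketched in a static scalar toy game. This is only an illustration under strong assumptions that are not part of the RMM model: the controls are real numbers, $\mathcal{A}_{0}$ maximizes the hypothetical payoff $-(u^{0}-\alpha u^{1})^{2}$, $\mathcal{A}_{1}$ maximizes $-(u^{1}-\beta u^{0}-\gamma m-\delta)^{2}$ where $m$ stands for the announced mean of $u^{j}$, and the leader matches $m$ with the resulting $\hat{u}^{1}[m]$. All function names and the parameters $\alpha,\beta,\gamma,\delta$ are hypothetical.

```python
def best_response_major(u1, alpha):
    """Step (1): toy best response of A_0 to a fixed control u1 of A_1."""
    return alpha * u1


def best_response_minor(u0, uj_mean, beta, gamma, delta):
    """Step (2): toy best response of A_1 to u0 and the announced mean of u^j."""
    return beta * u0 + gamma * uj_mean + delta


def follower_nash(uj_mean, alpha, beta, gamma, delta):
    """Nash equilibrium of (A_0, A_1) for a given announcement of the leader.

    Solves u0 = alpha * u1 and u1 = beta * u0 + gamma * uj_mean + delta,
    assuming alpha * beta != 1.
    """
    u1 = (gamma * uj_mean + delta) / (1.0 - alpha * beta)
    return alpha * u1, u1


def solve_consistency(alpha, beta, gamma, delta, tol=1e-12, max_iter=10_000):
    """Step (3): iterate the announcement until it matches A_1's response."""
    uj_mean = 0.0
    for _ in range(max_iter):
        u0, u1 = follower_nash(uj_mean, alpha, beta, gamma, delta)
        if abs(u1 - uj_mean) < tol:
            return u0, u1, uj_mean
        uj_mean = u1  # the leader re-announces the followers' response
    raise RuntimeError("fixed-point iteration did not converge")
```

In this toy setting the iteration is a contraction whenever $|\gamma/(1-\alpha\beta)|<1$, and the matched announcement is $m=\delta/(1-\alpha\beta-\gamma)$; at the fixed point the leader's deviation cost is zero.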

Remark 3.2.

(i) When the weak-coupling of the RMM does not include the control-average of all minors, the term $\mathbb{E}_{t}[{X}^{j,\ddagger}_{t}]$ in the third and fifth equations of (3.13) can be replaced by some off-line process. The last equation in (3.13) thus becomes redundant. In fact, the inclusion of the control-average of all minors necessitates the introduction of additional dynamics to account for the averages of the physical, recursive and intensity states.

(ii) Furthermore, if we exclude the consideration of recursive functionals (namely, $\Gamma^{0}=\Gamma\equiv 0$ ), the mixed triple-agent game will simplify to a two-agent nonzero-sum game, as in [11].

(iii) The introduction of the additional $\mathcal{A}_{j}$ facilitates a clearer specification of the exogenous processes, as discussed in Subsection 3.2 from the standpoint of $\mathcal{A}_{1}$ . Otherwise, two exogenous processes would need to be introduced to replace $(\mathbb{E}_{t}[\mathbb{X}_{t}^{j,*}],\mathbb{E}_{t}[u_{t}^{j,*}])$ simultaneously, accompanied by an associated fixed-point analysis. The overall analysis would thus become complicated, given that the consistency matching already involves another fixed-point analysis. In contrast, our formulation of the mixed triple-agent game, particularly the introduction of the virtual $\mathcal{A}_{j}$ , clearly elucidates the intricate exogenous-endogenous relationships.

3.4 A unified structural scheme

Our bilateral perturbations and mixed hierarchical recomposition in Sections 3.2 and 3.3 indeed suggest a unified scheme to analyze general LP systems with more complex couplings.

We start with a generic LP system characterized by a key 5-tuple $(\mathcal{I},\mathcal{A},J,\mathcal{D},\mathcal{C}_{0})$ , where (i) $\mathcal{I}$ is the index set of all agents, enumerated as $\mathcal{A}:=\{\mathcal{A}_{i}\}_{i\in\mathcal{I}}$ ; (ii) $J:=\{J_{i}\}_{i\in\mathcal{I}}$ is the set of weak-coupled functionals to be optimized by $\mathcal{A}_{i},i\in\mathcal{I},$ respectively; (iii) $\mathcal{D}:=\{u_{i}\}_{i\in\mathcal{I}}$ denotes the associated decision profile of $\mathcal{A}$ ; (iv) $\mathcal{C}_{0}:=\{\mathcal{I}_{k}\}_{k\in\Theta}$ is the maximal coalition structure on $\mathcal{I}$ , which is structurally determined by $J,\mathcal{D}$ and will be elaborated later. Typically, $\mathcal{I}=\mathcal{I}_{1}:=\{1,\cdots,N\}$ , $\mathcal{I}_{2}:=\{1,\cdots,k\}\times\mathcal{I}_{1}$ , or $\mathcal{I}_{3}:=\{1,\cdots,k\}\cup\mathcal{I}_{1}$ represent, respectively, the classical LP systems with homogeneous agents, $k$ classes of heterogeneous agents, or $k$ major agents with homogeneous minors. For a second-order countable or a continuum infinity of agents, we can let $\mathcal{I}_{4}:=\mathcal{I}_{1}\times\mathcal{I}_{1}$ or $\mathcal{I}_{5}=[0,1].$ Given $(\mathcal{I},J,\mathcal{D},\mathcal{C}_{0})$ , the structural scheme consists of the following three steps.

Step 1: Exchangeable decomposition. Essentially, the MFG is an effective dimension-reduction analysis relying on exchangeability across agents, and it shares a similar spirit with the well-known symmetric games in the deterministic context. The exchangeability of a controlled $(\mathcal{I},\mathcal{A},J,\mathcal{D},\mathcal{C}_{0})$ can be characterized asymptotically, as $|\mathcal{I}|\longrightarrow+\infty$ , by the so-called coalition structure. This refers to a partition of the index set $\mathcal{I}$ (namely, $\sum_{k\in\Theta}\mathcal{I}_{k}=\mathcal{I}$ , where $\sum$ denotes the union of disjoint sets) such that, for every $k\in\Theta$ , all agents $\{\mathcal{A}_{j}|j\in\mathcal{I}_{k}\}$ form an exchangeable sub-class, in the sense that

		$\displaystyle(\mathcal{I},J,\mathcal{D},\mathcal{C}_{0})=(\sum_{k\in\Theta}\mathcal{I}_{k},\ \ \sum_{j\in\mathcal{I}_{k}}\sum_{k\in\Theta}J_{j,k},\ \ \sum_{j\in\mathcal{I}_{k}}\sum_{k\in\Theta}u_{j,k},\ \ \mathcal{C}_{0})$
		$\displaystyle=(\mathcal{I}_{k}\cup\sum_{k^{\prime}\neq k\in\Theta}\mathcal{I}_{k^{\prime}},\ \ \{J_{j,k}\}_{j\in\mathcal{I}_{k}}\cup\sum_{j\in\mathcal{I}_{k^{\prime}}}\sum_{k^{\prime}\neq k\in\Theta}J_{j,k^{\prime}},\ \ \{u_{j,k}\}_{j\in\mathcal{I}_{k}}\cup\sum_{j\in\mathcal{I}_{k^{\prime}}}\sum_{k^{\prime}\neq k\in\Theta}u_{j,k^{\prime}},\ \ \mathcal{C}_{0})$
		$\displaystyle\Longleftrightarrow_{\mathcal{E}}(\widetilde{\mathcal{I}_{k}}\cup\sum_{k^{\prime}\neq k\in\Theta}\mathcal{I}_{k^{\prime}},\ \ \ \widetilde{\{J_{j,k}\}}_{j\in\mathcal{I}_{k}}\cup\sum_{j\in\mathcal{I}_{k^{\prime}}}\sum_{k^{\prime}\neq k\in\Theta}J_{j,k^{\prime}},\ \ \ \widetilde{\{u_{j,k}\}}_{j\in\mathcal{I}_{k}}\cup\sum_{j\in\mathcal{I}_{k^{\prime}}}\sum_{k^{\prime}\neq k\in\Theta}u_{j,k^{\prime}},\ \ \ \mathcal{C}_{0})$
		$\displaystyle:=(\widetilde{\mathcal{I}},\widetilde{J},\widetilde{\mathcal{D}},\mathcal{C}_{0})$

where $\left(\widetilde{\mathcal{I}_{k}},\widetilde{\{J_{j,k}\}}_{j\in\mathcal{I}_{k}},\widetilde{\{u_{j,k}\}}_{j\in\mathcal{I}_{k}}\right)$ is a simultaneous permutation on the sub-index set $\mathcal{I}_{k}$ and the associated intersections on $J$ and $\mathcal{D}$ ; $``\Longleftrightarrow_{\mathcal{E}}"$ denotes the equivalence relation in terms of game equilibrium. That is, the set of Nash equilibria is invariant under finite permutations on $\mathcal{I}_{k}$ when passing from $(\mathcal{I},J,\mathcal{D},\mathcal{C}_{0})$ to $(\widetilde{\mathcal{I}},\widetilde{J},\widetilde{\mathcal{D}},\mathcal{C}_{0})$ . Roughly, this means that the simultaneous optimizations faced by $\{\mathcal{A}_{j}\}_{j\in\mathcal{I}_{k}}$ are endowed with identical probabilistic structures. Moreover, under a large symmetric assumption as in [9], the equivalence $\mathcal{E}$ should be element-wise and thus transitive. Therefore, there exists a maximal coalition structure, denoted by $\mathcal{C}_{0}$ , provided the set of coalition structures is non-empty. Here “maximal” means the coarsest partition among all coalition structures, which yields the largest exchangeable decomposition and hence the greatest dimension reduction. In fact, this maximal coalition structure can be constructed through the saturated sets of the equivalence relation, as the union of all $\mathcal{E}$ -equivalence classes ([18]).

A trivial coalition structure is $\{\{e\}|e\in\mathcal{I}\},$ the set of all singletons. In this case, $|\Theta|=|\mathcal{I}|$ and there is no dimension reduction. The MFG is applicable for non-trivial coalition structures in that $|\Theta|=O(1)$ as $|\mathcal{I}|\longrightarrow+\infty$ ; or conversely, for at least one $k\in\Theta,$ $|\mathcal{I}_{k}|=O(|\mathcal{I}|).$ For example, $|\mathcal{I}_{k}|=O(|\mathcal{I}|^{\alpha})$ for some $\alpha>0$ . For LP systems with $\mathcal{I}_{1,2,3},$ $\mathcal{C}_{0}=\mathcal{I}_{1},(\{1\}\times\mathcal{I}_{1},\cdots,\{k\}\times\mathcal{I}_{1}),(\{1\},\cdots,\{k\},\mathcal{I}_{1}),$ with associated $|\Theta|=1,k,k+1$ , respectively. For these cases, heuristic arguments are still feasible for constructing the auxiliary control problem. However, they break down when the coupling structures assume more complex forms, such as a non-standard bridge configuration; in this case, the identification of $\mathcal{C}_{0}$ and the associated decomposition becomes necessary to yield a unified and systematic analysis.
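For the classical index sets, the maximal coalition structure can be illustrated by grouping agents into equivalence classes. The following is a minimal sketch under an explicit assumption: exchangeability is encoded by a hypothetical `agent_type` label supplied by the user (standing in for the functional, dynamics and information of an agent), rather than being derived from $J$ and $\mathcal{D}$ as in the text. The function names are illustrative.

```python
from collections import defaultdict


def maximal_coalition(agents, agent_type):
    """Group agents into the classes of the equivalence relation 'same type'.

    `agent_type` is a hypothetical stand-in for the data that makes two
    agents exchangeable; the returned list is the coarsest partition whose
    blocks contain agents of one type only.
    """
    classes = defaultdict(list)
    for agent in agents:
        classes[agent_type(agent)].append(agent)
    return list(classes.values())


def major_minor_system(k, n):
    """Index set I_3: k distinct major agents plus n homogeneous minors."""
    agents = [("major", i) for i in range(1, k + 1)]
    agents += [("minor", i) for i in range(1, n + 1)]
    # Each major agent is its own type; all minors share a single type.
    return agents, lambda a: a if a[0] == "major" else "minor"


def heterogeneous_system(k, n):
    """Index set I_2: k classes with n exchangeable agents in each class."""
    agents = [(c, i) for c in range(1, k + 1) for i in range(1, n + 1)]
    return agents, lambda a: a[0]
```

For instance, `maximal_coalition(*major_minor_system(3, 50))` has $k+1=4$ blocks and `maximal_coalition(*heterogeneous_system(3, 50))` has $k=3$ blocks, independently of the number of minors, in line with $|\Theta|=O(1)$.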

Step 2: Representative multilateral perturbation. Given the maximal coalition structure $\mathcal{C}_{0}$ , one can select a representative agent, denoted by $\mathcal{A}^{\text{rep}}_{k},k\in\Theta$ , from each exchangeable sub-class $\{\mathcal{A}_{j}|j\in\mathcal{I}_{k}\}.$ This selection yields a representative collection $\mathcal{R}:=\{\mathcal{A}^{\text{rep}}_{k}\}_{k\in\Theta}$ that is dimension-reduced, since $|\Theta|=O(1)$ . In fact, with a slight abuse of notation, we may write $\mathcal{R}={\mathcal{A}}/{\Theta}$ for the quotient space whose elements are the equivalence classes of $\mathcal{E}$ . Then, a multilateral perturbation can be introduced on the side of each $\mathcal{A}^{\text{rep}}_{k}$ separately, by assuming that all other representatives keep their equilibrium strategies. Depending on the weak-coupling mechanism structured by $\{{J_{j}}\}_{j\in\mathcal{I}}$ , each perturbation $u^{\text{rep}}_{k}$ is transmitted throughout the whole LP system $\{\mathcal{A}_{j}\}_{j\in\mathcal{I}}=\sum_{k\in\Theta}\{{\mathcal{A}_{j}}\}_{j\in\mathcal{I}_{k}}$ across all exchangeable sub-classes. A typical transmission, in an open-loop and dynamic setting, can be sketched by the following channel via the weak-coupling of the state-average:

u^{\text{rep}}_{k}\Longrightarrow x^{\text{rep}}_{k}(u^{\text{rep}}_{k})\Longrightarrow\left(x^{(N)}_{k}(x^{\text{rep}}_{k}(u^{\text{rep}}_{k})),\ \ x^{(N)}_{k^{\prime}\neq k}(x^{\text{rep}}_{k}(u^{\text{rep}}_{k}))\right)\Longrightarrow J_{j^{\prime}\neq j}(x^{(N)}_{k},x^{(N)}_{k^{\prime}\neq k})\Longrightarrow\cdots

Along such a transmission, the influence of a representative $\mathcal{A}^{\text{rep}}_{k}$ on the controlled LP system can be completely quantified, which is essentially equivalent to computing the Fréchet differential of $J_{k}^{\text{rep}}$ with respect to $u^{\text{rep}}_{k}$ .

Step 3: Hierarchical recomposition. The multilateral perturbation involves approximations to completely quantify all LP weak-couplings asymptotically, thereby leading to a variety of limiting quantities. Due to the exchangeability, these quantities typically take the form of conditional expectations with respect to the tail sigma-algebra, as per de Finetti's theorem. However, unlike those in classical McKean-Vlasov control problems, these expectations exhibit distinct modes in their realized degrees of controllability, contingent upon their hierarchical positions within the entire weak-coupled structure. For instance, in the RMM, three modes emerge: exact-realized (exact limit), quasi-realized (semi-exact), and null-realized, as indicated by (3.5) and (3.8). In the current RMM or simpler coupled setups, these modes can be ordered using an exogenous-endogenous relation, where the “exogenous” exact-realized mode dominates the “endogenous” null-realized one. However, in more complex forms of LP couplings, these modes cannot be fully encapsulated by a binary ordering alone; instead, they form a more intricate directed graph network, which is challenging to study by heuristic arguments alone.

Our resolution is a hierarchical recomposition, similar to the well-studied structural function built through the so-called path sets in reliability theory ([1]). In fact, the reliability of any system is equivalent to that of a serial (sequential) arrangement of parallel subsystems. Likewise, for a generic LP system, all modes identified in Step 2 can be stratified into sequential layers with leader-follower-type hierarchies, and each layer consists of parallel Nash-type nodes with simultaneous decisions. This stratification, similar to the construction of the structural function, is indeed applicable to any LP system, such as those with non-classical intermediate couplings, akin to the bridge systems in reliability analysis. Such a stratification enables a unified and more clear-cut construction of the desired auxiliary problem by recomposing all modes across hierarchical layers, combined with the corresponding variants of fixed-point matching. For example, the aforementioned three modes in the RMM yield two layers, thereby enabling the construction of the auxiliary problem via a triple-agent leader-follower-Nash game. Incidentally, the multilateral perturbation plays a role similar to the synthesis of all minimal path sets in reliability analysis, as both aim to quantify all transmission channels within a given system.
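The stratification into sequential layers can be pictured as a layering of a directed acyclic graph. The sketch below is only an illustration, assuming that the leader-follower relation is given explicitly as a hypothetical dictionary `leads` (the text does not prescribe such a data structure): agents whose leaders have all been placed form the next layer and are treated as simultaneous, Nash-type decision makers.

```python
def stratify(leads):
    """Split agents into sequential layers, leaders first.

    `leads` maps an agent to the set of agents it leads (an assumed,
    acyclic relation). An agent enters a layer once all of its leaders
    sit in earlier layers; agents within a layer decide simultaneously.
    """
    agents = set(leads) | {f for followers in leads.values() for f in followers}
    leaders_of = {agent: set() for agent in agents}
    for leader, followers in leads.items():
        for follower in followers:
            leaders_of[follower].add(leader)

    layers, placed = [], set()
    while len(placed) < len(agents):
        layer = sorted(a for a in agents - placed if leaders_of[a] <= placed)
        if not layer:
            raise ValueError("leader-follower relation contains a cycle")
        layers.append(layer)
        placed.update(layer)
    return layers
```

For the triple-agent game of Subsection 3.3, `stratify({"A_j": {"A_0", "A_1"}})` gives two layers: the leader $\mathcal{A}_{j}$ on top, and the followers $\mathcal{A}_{0},\mathcal{A}_{1}$ in a common Nash layer.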

Attainability. For classical LP systems, leader-type agents at higher layers in the hierarchical recomposition often engage in a fixed-point analysis, as demonstrated by (3.19) for the RMM, where $\mathcal{A}_{j}$ aligns the announced $u^{j}$ with the resultant $u^{1}(u^{j})$ in terms of their conditional expectations $\mathbb{E}_{t}u^{j},\mathbb{E}_{t}u^{1}$ . In contrast to follower-type agents at lower layers, who confront more regular control problems (see (3.16)), the fixed-point analysis for leaders can indeed degenerate, as the minimal deviation is trivially attained at zero, provided $\mathbb{E}_{t}u^{j}=\mathbb{E}_{t}u^{1}$ . Consequently, the choice of norm $\phi(\cdot)$ on the deviation $\mathbb{E}_{t}u^{j}-\mathbb{E}_{t}u^{1}(u^{j})$ becomes immaterial. Specifically, the introduction of a quadratic deviation in the $L^{2}$ -norm, $\phi(\cdot)=||\cdot||_{L^{2}}^{W^{0}}$ as in (3.19), is merely formal. In fact, the top layer, characterized by a high degree of realized controllability, presents a tradeoff in that its remaining control capability is more prone to degeneration. However, in LP systems with more complex couplings, nonclassical hierarchical layers may emerge and the relevant analysis, particularly of the top layers, may no longer be degenerate. This is particularly true for LP systems with nested or asymmetric information ([3, 25]), those with heterogeneous robustness beliefs, and those with bridging-intermediate-type couplings. In such instances, the optimal deviation norms are not trivially attained at zero, necessitating the replacement of the fixed-point analysis with some non-trivial optimization problems.

Summary. Steps 1-3 constitute a unified structural scheme to analyze more general LP systems, particularly those with non-classical coupling structures. In fact, for classical LP systems with only minor (homogeneous or heterogeneous) agents, auxiliary problems can be constructed by straightforward heuristic arguments. The main reason is that their coalition structures are relatively simple, so there is no need to invoke Steps 2-3. A more complex but still classical LP system is one involving a single major agent, for which the auxiliary problem can still be constructed heuristically but requires an exogenous-endogenous relation to handle the additional couplings induced by the major agent. Our RMM has fairly general weak-couplings, especially those of the recursive state pairs, which motivates us to introduce the structural scheme, especially Steps 2-3, to fully quantify the complexities of all resultant perturbation transmissions and to construct the auxiliary problem hierarchically. For non-classical LP systems, such as those with nested or asymmetric information or those featuring bridge-type couplings, heuristic arguments are no longer feasible for constructing auxiliary problems. For example, even for LP systems with $k$ majors and $K$ classes of heterogeneous minors, a heuristic construction necessitates an exogenous-endogenous structure represented by a directed graph with $k\times K$ edges. By contrast, the structural scheme simplifies this to a sequential flow with only $k+K$ binary relations.

4 Approximated Nash equilibrium of the RMM problem

4.1 Auxiliary control problem for the follower $\mathcal{A}_{0}$

Now we fix the follower $\mathcal{A}_{1}$ ’s control $u^{1}$ and solve the auxiliary problem of $\mathcal{A}_{0}$ . The auxiliary problem for $\mathcal{A}_{0}$ is formulated by (3.14), associated with the mean-field BSDE (3.15) and the McKean-Vlasov SDE (3.13) (its first two components). The related Hamiltonian is defined as

H_{0}=\langle p^{0},b^{0}\rangle+\langle p,b\rangle+\langle q^{00},\sigma^{0}\rangle+\langle q^{11},\sigma\rangle-l^{0}\cdot f^{0}-l\cdot f+g^{0},

where $(b^{0},\sigma^{0})=(b^{0},\sigma^{0})(t,x_{0},u_{0};\bar{x},\bar{u}),$ $(b,\sigma)=(b,\sigma)(t,x,u;x_{0},u_{0};\bar{x},\bar{u})$ , $(f^{0},g^{0})=(f^{0},g^{0})(t,x_{0},y_{0},z_{0},u_{0};\bar{x},\bar{y},\bar{z},\bar{u})$ , $f=f(t,x,y,z,u;x_{0},y_{0},z_{0},u_{0};\bar{x},\bar{y},\bar{z},\bar{u})$ . Using the stochastic maximum principle ([2]), we get the following Hamiltonian system

\left\{\begin{aligned} d{X}_{t}^{0,*}=&\partial_{p_{0}}H_{0}(t)dt+\partial_{q_{00}}H_{0}(t)dW_{t}^{0},\ d{{X}}_{t}^{1,*}=\partial_{p}H_{0}(t)dt+\partial_{q_{11}}H_{0}(t)dW_{t}^{1},\\ -d{Y}_{t}^{0,*}=&-\partial_{l^{0}}H_{0}(t)dt-{Z}_{t}^{0,*}dW_{t}^{0},\ -d{{Y}}_{t}^{1,*}=-\partial_{l}H_{0}(t)dt-{{Z}}_{t}^{1,*,0}dW_{t}^{0}-{{Z}}_{t}^{1,*,1}dW_{t}^{1},\\ dL_{t}^{0}=&-\partial_{y_{0}}H_{0}(t)dt-\partial_{z_{0}}H_{0}(t)dW^{0}_{t},\\ dL_{t}=&-\big{[}\partial_{y}H_{0}(t)+\mathbb{E}_{t}[\partial_{\bar{y}}H_{0}(t)]\big{]}dt-\big{[}\partial_{z}H_{0}(t)+\mathbb{E}_{t}[\partial_{\bar{z}}H_{0}(t)]\big{]}d(W^{0},W^{1})_{t},\\ -dP_{t}^{0}=&\partial_{x_{0}}H_{0}(t)dt-Q_{t}^{00}dW^{0}_{t}-Q_{t}^{01}dW_{t}^{1},\\ -dP_{t}=&\big{[}\partial_{x}H_{0}(t)+\mathbb{E}_{t}[\partial_{\bar{x}}H_{0}(t)]\big{]}dt-Q_{t}^{10}dW^{0}_{t}-Q_{t}^{11}dW_{t}^{1},\\ \end{aligned}\right.

(4.1)

where ${X}_{0}^{0,*}=x_{0}^{0}$ , $X_{0}^{1,*}=x_{0},$ ${Y}_{T}^{0,*}=\Phi^{0}\big{(}{X}_{T}^{0,*},\mathbb{E}_{T}[{{X}}^{1,*}_{T}]\big{)}+\xi^{0}$ , $Y_{T}^{1,*}=\Phi\big{(}{{X}}_{T}^{1,*};{X}_{T}^{0,*},\mathbb{E}_{T}[{{X}}^{1,*}_{T}]\big{)}+\xi^{1},$ $L_{0}^{0}=-\Gamma^{0}_{y}(Y_{0}^{0,*}),$ $L_{0}=0$ , $P_{T}^{0}=-L_{T}^{0}\cdot\partial_{x_{0}}\Phi^{0}-L_{T}\cdot\partial_{x_{0}}\Phi,$ $P_{T}=-L_{T}\cdot\partial_{x}\Phi-\mathbb{E}_{T}[L_{T}^{0}\cdot\partial_{\bar{x}}\Phi^{0}+L_{T}\cdot\partial_{\bar{x}}\Phi]$ ; $(L^{0},L;P^{0},P,Q^{00},Q^{01},Q^{10},Q^{11})$ are the associated adjoint processes and

H_{0}(t):=H_{0}(t,\Theta_{t}^{0,*},u_{t}^{0,*};\Theta_{t}^{1,*},u_{t}^{1};\mathbb{E}_{t}[\Theta_{t}^{1,*}],\mathbb{E}_{t}[u_{t}^{1}];L_{t}^{0},L_{t};P_{t}^{0},P_{t},Q_{t}^{00},Q_{t}^{11}),

(4.2)

assuming that $u^{0,*}$ is an optimal control and $(\Theta^{0,*},\Theta^{1,*})=(X^{0,*},Y^{0,*},Z^{0,*};X^{1,*},Y^{1,*},Z^{1,*})$ is the associated optimal state trajectory. Then we have the following result.

Proposition 4.1.

Let (A1) be in force. Moreover, we assume that
(1) There exists a unique maximizer of the Hamiltonian $H_{0}$ as a function of $u_{0}$ (denoted by $\widehat{u}^{0}$ );
(2) The function $H_{0}$ is convex in $(x_{0},x,y_{0},y,z_{0},z,u_{0})$ .
If $(\Theta^{0,*},\Theta^{1,*};L^{0},L;P^{0},Q^{00},Q^{01};P,Q^{10},Q^{11})$ solves system (4.1), the optimal control of $\mathcal{A}_{0}$ is

\widetilde{u}_{t}^{0}=\mathbb{E}_{t}\big{[}\widehat{u}^{0}\big{(}t,\Theta_{t}^{0,*},\Theta_{t}^{1,*},\mathbb{E}_{t}[\Theta_{t}^{1,*}];L^{0}_{t},L_{t},P^{0}_{t},Q^{00}_{t},P_{t},Q^{11}_{t};u^{1}_{t},\mathbb{E}_{t}[u^{1}_{t}]\big{)}\big{]}.

4.2 Auxiliary control problem for the follower $\mathcal{A}_{1}$

In this subsection, we fix $u^{j},u^{0}$ , the controls adopted by the leader $\mathcal{A}_{j}$ and the follower $\mathcal{A}_{0}$ , respectively. Consider the control problem of the follower $\mathcal{A}_{1}$ associated with BSDE (3.17), SDE (3.13) and the functional (3.16). Given $(u^{0},u^{j}),$ the BSDE (3.18) and the last two equations in (3.13) become exogenous processes. The Hamiltonian functional is $H(t,x_{0},y_{0},z_{0},u_{0};x,y,z,u;\bar{x},\bar{y},\bar{z},\bar{u};l,p,q_{11})$ $=\langle p,b\rangle+\langle q_{11},\sigma\rangle-l\cdot f+g,$ and the related Hamiltonian system takes the following form

\left\{\begin{aligned} d{{X}}_{t}^{1,\ddagger}=&\partial_{p}H(t)dt+\partial_{q_{11}}H(t)dW_{t}^{1},\quad{X}_{0}^{1,\ddagger}=x_{0},\\ -d{{Y}}_{t}^{1,\ddagger}=&-\partial_{l}H(t)dt-{{Z}}_{t}^{1,\ddagger,0}dW_{t}^{0}-{{Z}}_{t}^{1,\ddagger,1}dW_{t}^{1},\ Y_{T}^{1,\ddagger}=\Phi\big{(}{{X}}_{T}^{1,\ddagger};{X}_{T}^{0,\ddagger},\mathbb{E}_{T}[{{X}}^{1,\ddagger}_{T}]\big{)}+\xi^{1},\\ dL_{t}^{\ddagger}=&-\partial_{y}H(t)dt-\partial_{z}H(t)d(W^{0},W^{1})_{t},\quad L_{0}^{\ddagger}=-\Gamma_{y}(Y_{0}^{1,\ddagger}),\\ -dP_{t}^{\ddagger}=&\partial_{x}H(t)dt-Q_{t}^{10,\ddagger}dW^{0}_{t}-Q_{t}^{11,\ddagger}dW_{t}^{1},\quad P_{T}^{\ddagger}=-L_{T}^{\ddagger}\cdot\partial_{{x}}\Phi({X}_{T}^{1,\ddagger};{X}_{T}^{0,\ddagger},\mathbb{E}_{T}[{X}_{T}^{j,\ddagger}]),\end{aligned}\right.

(4.3)

where the last two equations are the associated adjoint equations and

H(t):=H\big{(}t,{\Theta_{t}^{0}},u_{t}^{0};{\Theta_{t}^{1}},u_{t}^{1,*};{\mathbb{E}_{t}[\Theta_{t}^{1}]},\mathbb{E}_{t}[u_{t}^{1,*}];L_{t}^{\ddagger},P_{t}^{\ddagger},Q^{11,\ddagger}_{t}\big{)}.

(4.4)

Then from the stochastic maximum principle for FBSDE system (e.g. [21]), we have

Proposition 4.2.

Under (A1), assume that the FBSDE (4.3) admits a unique solution and
(1) There exists a unique maximizer of the Hamiltonian $H$ as a function of $u$ (denoted by $\widehat{u}$ );
(2) The function $H$ is convex in $(x_{0},x,y_{0},y,z_{0},z,u)$ .
Then the optimal control of $\mathcal{A}_{1}$ is given by $\begin{aligned} \widetilde{u}_{t}^{1}=\widehat{u}\Big{(}&t,\Theta_{t}^{0,\ddagger},\Theta_{t}^{1,\ddagger},\mathbb{E}_{t}[\Theta^{j,\ddagger}_{t}];L^{\ddagger}_{t},P_{t}^{\ddagger},Q^{11,\ddagger}_{t};\mathbb{E}_{t}[u^{j}_{t}],u^{0}_{t}\Big{)}.\end{aligned}$
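To make the maximizer $\widehat{u}$ concrete, the following minimal sketch anticipates the scalar linear-quadratic setting of Section 5. It assumes that the $u$-dependent part of $H=\langle p,b\rangle+\langle q_{11},\sigma\rangle-l\cdot f+g$ reduces to $p\,b_{2}u-l\,f_{4}u-R|u-c|^{2}$, where $c$ is a hypothetical placeholder for the exogenous target built from the frozen controls. The function names and the parameter `c` are illustrative assumptions, not objects defined in the text.

```python
def lq_hamiltonian(u, p, l, c, b2, f4, R):
    """u-dependent part of an assumed scalar LQ Hamiltonian.

    H(u) = p*b2*u - l*f4*u - R*(u - c)**2, with c treated as exogenous.
    """
    return p * b2 * u - l * f4 * u - R * (u - c) ** 2


def lq_maximizer(p, l, c, b2, f4, R):
    """Closed-form argmax of lq_hamiltonian in u.

    Requires R > 0, so that H is strictly concave in u and the
    first-order condition p*b2 - l*f4 - 2*R*(u - c) = 0 is sufficient.
    """
    if R <= 0:
        raise ValueError("R must be positive")
    return c + (p * b2 - l * f4) / (2.0 * R)


# Illustrative values only (not taken from the paper).
u_hat = lq_maximizer(p=1.0, l=0.5, c=0.2, b2=2.0, f4=1.0, R=4.0)
```

Under this assumed shape, the maximizer is affine in the adjoint variables $(p,l)$, which is the structure one expects of a feedback strategy in the linear-quadratic case.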

4.3 Consistency condition system

For the sake of presentation, hereafter we may write a bar on top of a random variable (or process) to denote its conditional expectation with respect to $\mathbb{F}^{0}$ , for example, $\overline{u}_{t}^{j}=\mathbb{E}_{t}[u_{t}^{j}]$ . We impose the following consistency conditions on the followers $\mathcal{A}_{0},\mathcal{A}_{1}$ and the leader $\mathcal{A}_{j}$ :

\widehat{u}\big{[}\overline{u}_{t}^{j};\mathbb{E}_{t}[\widehat{u}^{0}[u_{t}^{1},\overline{u}_{t}^{1}]]\big{]}=u_{t}^{1},\quad\overline{u}^{j}_{t}=\overline{u}^{1}_{t},\quad t\in[0,T].

(4.5)

By (4.5) and the solution uniqueness of SDE (3.13) and BSDEs (3.15), (3.17)-(3.18), we have

\Theta^{0,\dagger}_{t}=\Theta^{0,\ddagger}_{t},\ \ \ \Theta^{1,\dagger}_{t}=\Theta^{1,\ddagger}_{t},\ \ \ \overline{\Theta}_{t}^{1,\dagger}=\overline{\Theta}_{t}^{j,\ddagger},\ t\in[0,T].

(4.6)

Noting (4.6), and by Propositions 4.1 and 4.2, we may introduce the following assumption:
Assumption (A2) Suppose that there exists a pair of deterministic continuous functions $({\varphi}^{0},{\varphi}):[0,T]\times\mathbb{R}^{3n+6m}\times\mathbb{R}^{3m}\times\mathbb{R}^{6n}\times U\rightarrow U^{0}\times U,$ satisfying the following conditions

\left\{\begin{aligned} \varphi^{0}=&\widehat{u}^{0}\Big{(}t,\theta^{0},\theta,\bar{\theta},l^{0},l,p^{0},q^{00},p,q^{11},\bar{u};\varphi\Big{)},\\ \varphi=&\widehat{u}\Big{(}t,\theta^{0},\theta,\bar{\theta};l^{\ddagger},p^{\ddagger},q^{11,\ddagger},\bar{u};\varphi^{0}\Big{)},\ t\in[0,T],\end{aligned}\right.

where $\widehat{u}^{0}$ and $\widehat{u}$ are the mappings given in Propositions 4.1 and 4.2.
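The coupled system in (A2) is a fixed-point problem for two best-response maps. As a toy illustration only, suppose both maps are scalar and affine, $u^{0}=a_{0}+c_{0}u^{1}$ and $u^{1}=a_{1}+c_{1}u^{0}$; the coefficients $a_{0},c_{0},a_{1},c_{1}$ are hypothetical and do not appear in the text. The sketch below solves this toy system by iteration and by direct elimination.

```python
def solve_best_response_fixed_point(a0, c0, a1, c1, tol=1e-12, max_iter=10_000):
    """Iterate u0 = a0 + c0*u1, u1 = a1 + c1*u0 until the update is below tol.

    The iteration is a contraction when |c0*c1| < 1 (hypothetical linear maps).
    """
    u0, u1 = 0.0, 0.0
    for _ in range(max_iter):
        new_u0 = a0 + c0 * u1
        new_u1 = a1 + c1 * new_u0
        if abs(new_u0 - u0) + abs(new_u1 - u1) < tol:
            return new_u0, new_u1
        u0, u1 = new_u0, new_u1
    raise RuntimeError("fixed-point iteration did not converge")


def closed_form_fixed_point(a0, c0, a1, c1):
    """Direct solution of the same 2x2 linear system; needs c0*c1 != 1."""
    det = 1.0 - c0 * c1
    if det == 0.0:
        raise ValueError("degenerate system: c0*c1 == 1")
    return (a0 + c0 * a1) / det, (a1 + c1 * a0) / det


# Illustrative coefficients only.
pair_iter = solve_best_response_fixed_point(1.0, 0.5, 2.0, -0.4)
pair_exact = closed_form_fixed_point(1.0, 0.5, 2.0, -0.4)
```

In this toy case a unique pair exists exactly when $c_{0}c_{1}\neq 1$; the general nonlinear case is what (A2) postulates.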

Under (A2), by the measurable selection theorem, there exists a measurable $\psi$ with

\psi_{t}=\mathbb{E}_{t}[\varphi(t,{\Theta_{t}^{0,\dagger},\Theta_{t}^{1,\dagger},\overline{\Theta}^{1,\dagger}_{t}};L^{0}_{t},L_{t},L_{t}^{\ddagger},P^{0}_{t},Q^{00}_{t},P_{t},Q^{11}_{t},P_{t}^{\ddagger},Q^{11,\ddagger}_{t};\psi_{t})].

(4.7)

Combining (4.7) and (A2), we can denote

\begin{aligned} &\Psi^{0}(t,\Theta_{t}^{0,\dagger},\Theta_{t}^{1,\dagger};L^{0}_{t},L_{t},L_{t}^{\ddagger},P^{0}_{t},Q^{00}_{t},P_{t},Q^{11}_{t},P_{t}^{\ddagger},Q^{11,\ddagger}_{t})\\ &=\mathbb{E}_{t}[\varphi^{0}(t,{\Theta_{t}^{0,\dagger},\Theta_{t}^{1,\dagger},\overline{\Theta}^{1,\dagger}_{t}};L^{0}_{t},L_{t},L_{t}^{\ddagger},P^{0}_{t},Q^{00}_{t},P_{t},Q^{11}_{t},P_{t}^{\ddagger},Q^{11,\ddagger}_{t};\psi_{t})].\end{aligned}

(4.8)

Similarly, $\Psi(t,\cdots)=\varphi(t,{\Theta_{t}^{0,\dagger},\Theta_{t}^{1,\dagger},\overline{\Theta}^{1,\dagger}_{t}};\cdots;\psi_{t}).$ Plugging the mappings $(\Psi^{0},\Psi)$ into (4.1) and (4.3), we obtain

\left\{\begin{aligned} d{X}_{t}^{0}=&b^{0}\big{(}t,{X}_{t}^{0},u_{t}^{0,*};\mathbb{E}_{t}[{X}^{1}_{t}],\mathbb{E}_{t}[{u}_{t}^{1,*}]\big{)}dt+\sigma^{0}\big{(}t,{X}_{t}^{0},u_{t}^{0,*};\mathbb{E}_{t}[{X}^{1}_{t}],\mathbb{E}_{t}[{u}_{t}^{1,*}]\big{)}dW_{t}^{0},\\ d{{X}}_{t}^{1}=&b\big{(}t,X_{t}^{1},u_{t}^{1,*};{X}_{t}^{0},u_{t}^{0,*};\mathbb{E}_{t}[X^{1}_{t}],\mathbb{E}_{t}[u_{t}^{1,*}]\big{)}dt+\sigma\big{(}t,X_{t}^{1},u_{t}^{1,*};{X}_{t}^{0},u_{t}^{0,*};\mathbb{E}_{t}[X^{1}_{t}],\mathbb{E}_{t}[u_{t}^{1,*}]\big{)}dW_{t}^{1},\\ dL_{t}^{0}=&-\partial_{y_{0}}H_{0}(t)dt-\partial_{z_{0}}H_{0}(t)dW^{0}_{t},\ \ \ \ \quad\quad dL_{t}^{\ddagger}=-\partial_{y}H(t)dt-\partial_{z}H(t)d(W^{0},W^{1})^{\top}_{t},\\ dL_{t}=&-\big{[}\partial_{y}H_{0}(t)+\mathbb{E}_{t}[\partial_{\bar{y}}H_{0}(t)]\big{]}dt-\big{[}\partial_{z}H_{0}(t)+\mathbb{E}_{t}[\partial_{\bar{z}}H_{0}(t)]\big{]}d(W^{0},W^{1})_{t}^{\top},\\ -d{Y}_{t}^{0}=&f^{0}\big{(}t,{{\Theta}_{t}^{0},u_{t}^{0,*};\mathbb{E}_{t}[{\Theta}^{1}_{t}],\mathbb{E}_{t}[{{u}_{t}^{1,*}}]}\big{)}dt-{Z}_{t}^{0}dW_{t}^{0},\\ -d{{Y}}_{t}^{1}=&f\big{(}t,{\Theta_{t}^{1},{u_{t}^{1,*}};\Theta_{t}^{0},u_{t}^{0,*};\mathbb{E}_{t}[\Theta^{1}_{t}],\mathbb{E}_{t}[{u}_{t}^{1,*}]\big{)}dt-{{Z}}_{t}^{1,0}dW_{t}^{0}-{{Z}}_{t}^{1,1}dW_{t}^{1}},\\ -dP_{t}^{0}=&\partial_{x_{0}}H_{0}(t)dt-Q_{t}^{00}dW^{0}_{t}-Q_{t}^{01}dW_{t}^{1},\quad\quad-dP_{t}=\big{[}\partial_{x}H_{0}(t)+\mathbb{E}_{t}[\partial_{\bar{x}}H_{0}(t)]\big{]}dt-Q_{t}^{10}dW^{0}_{t}-Q_{t}^{11}dW_{t}^{1},\\ -dP_{t}^{\ddagger}=&\partial_{x}H(t)dt-Q_{t}^{10,\ddagger}dW^{0}_{t}-Q_{t}^{11,\ddagger}dW_{t}^{1},\\ \end{aligned}\right.

(4.9)

with the mixed initial-terminal condition

\begin{aligned} &{X}_{0}^{0}=x_{0}^{0},\ \ {X}_{0}^{1}=x_{0},\ \ L_{0}^{0}=-\Gamma_{y}^{0}(Y_{0}^{0}),\ \ L_{0}^{\ddagger}=-\Gamma_{y}(Y_{0}^{1}),\ \ L_{0}=0,\\ &{Y}_{T}^{0}=\Phi^{0}\big{(}{X}_{T}^{0},\mathbb{E}_{T}[{X}^{1}_{T}]\big{)}+\xi^{0},\ \ {Y}_{T}^{1}=\Phi\big{(}{X}_{T}^{1};{X}_{T}^{0},\mathbb{E}_{T}[{X}^{1}_{T}]\big{)}+\xi^{1},\ \ P_{T}^{\ddagger}=-L_{T}^{\ddagger}\cdot\partial_{x}\Phi,\\ &P_{T}^{0}=-L_{T}^{0}\cdot\partial_{x_{0}}\Phi^{0}-L_{T}\cdot\partial_{x_{0}}\Phi,\ \ P_{T}=-L_{T}\cdot\partial_{x}\Phi-\mathbb{E}_{T}[L_{T}^{0}\cdot\partial_{\bar{x}}\Phi^{0}+L_{T}\cdot\partial_{\bar{x}}\Phi].\end{aligned}

Here $H_{0}(t)$ (resp. $H(t)$ ) is given by (4.2) (resp. (4.4)) with $u^{1}$ (resp. $u^{0}$ ) replaced by $u^{1,*}$ (resp. $u^{0,*}$ ), and

\begin{aligned} u_{t}^{0,*}=&\Psi^{0}(t,\Theta_{t}^{0,\dagger},\Theta_{t}^{1,\dagger};L^{0}_{t},L_{t},L_{t}^{\ddagger},P^{0}_{t},Q^{00}_{t},P_{t},Q^{11}_{t},P_{t}^{\ddagger},Q^{11,\ddagger}_{t}),\\ u_{t}^{1,*}=&\Psi(t,\Theta_{t}^{0,\dagger},\Theta_{t}^{1,\dagger};L^{0}_{t},L_{t},L_{t}^{\ddagger},P^{0}_{t},Q^{00}_{t},P_{t},Q^{11}_{t},P_{t}^{\ddagger},Q^{11,\ddagger}_{t}).\end{aligned}

The following result is a direct consequence of the preceding analysis.

Corollary 4.1.

Suppose that the assumptions of Propositions 4.1 and 4.2 hold. If FBSDE (4.9) admits a solution, then $(u^{0,*},u^{1,*},u^{j,*})$ is an equilibrium of the triple-agent mixed leader-follower-Nash game, where $u^{j,*}\in\mathcal{U}_{j}^{d}$ satisfies $\overline{u}^{1,*}_{t}=\overline{u}^{j,*}_{t}$ for all $t\in[0,T]$ .

4.4 $\varepsilon$ -Nash equilibrium of the RMM problem

Proposition 4.3 yields an approximate Nash equilibrium for the RMM problem. To verify this, we first introduce a technical assumption that is common in the control literature [11]:
Assumption (A3) (i) The diffusion coefficients $\sigma^{0},\sigma$ are independent of $(u^{0},u)$ , if applicable.
(ii) The maximizers $\widehat{u}^{0}$ and $\widehat{u}$ in Proposition 4.1 and 4.2 are independent of $(Z^{0},Z^{1})$ .
(iii) The system (4.9) has a unique solution $(\Theta_{t}^{0},\Theta_{t}^{1};L_{t}^{0},L_{t},L_{t}^{\ddagger};P^{0}_{t},P_{t},$ $P^{\ddagger}_{t},Q^{00}_{t},Q_{t}^{01},Q_{t}^{10},Q_{t}^{11},$ $Q^{10,\ddagger}_{t},Q^{11,\ddagger}_{t})$ , where $L^{0},L$ and $L^{\ddagger}$ are $\mathbb{F}^{0}$ -adapted.
(iv) There exists a random decoupling field $\eta:[0,T]\times\Omega\times\mathbb{R}^{2n+3m}\rightarrow\mathbb{R}^{2m+3n}$ such that

(Y^{0}_{t},Y^{1}_{t},P^{0}_{t},P_{t},P^{\ddagger}_{t})=\eta(t,X_{t}^{0},X_{t}^{1},L_{t}^{0},L_{t},L_{t}^{\ddagger}),\ \text{a.s.},

where $\eta(\cdot,x^{0},x^{1},l^{0},l,l^{\ddagger})$ is $\mathbb{F}^{0}$ -adapted for each $(x^{0},x^{1},l^{0},l,l^{\ddagger})\in\mathbb{R}^{2n+3m}$ , and $\eta(t,\cdots)$ is Lipschitz continuous in all its arguments, uniformly in $t$ .

The decoupling field is a key tool for studying the well-posedness of FBSDEs; see [13, 30, 31]. Proposition 5.1 presents a sufficient condition ensuring the existence of a decoupling field for the LQG-RMM. By (A3-i), $\widehat{u}^{0}$ and $\widehat{u}$ in Propositions 4.1 and 4.2 are independent of $(Q^{00},Q^{11},Q^{11,\ddagger})$ , and the same holds for $\varphi^{0},\varphi$ in (A2), $\psi$ in (4.7), $\Psi^{0}$ and $\Psi$ in (4.8). With a slight abuse of notation, we write

\displaystyle\chi(t,X^{0}_{t},X^{1}_{t})=\chi(t,X^{0}_{t},X^{1}_{t};L_{t}^{0},L_{t},L_{t}^{\ddagger},\eta(t,X^{0}_{t},X^{1}_{t},L_{t}^{0},L_{t},L_{t}^{\ddagger})),\ \chi=\Psi^{0},\Psi.

Then it follows from (4.8) and (A3-iii) that $\Psi^{0}(t,X_{t}^{0},X_{t}^{1})$ and $\Psi(t,\cdot,\cdot)$ are $\mathbb{F}^{0}$ -adapted.

Assumption (A4) Both $\Psi^{0}$ and $\Psi$ are Lipschitz in the state variable $(x^{0},x)$ , uniformly in $t$ , and $||\Psi^{0}(t,0,0)||_{L^{2}}+||\Psi(t,0,0)||_{L^{2}}<\infty.$

(A3) and (A4) are commonly adopted in the MFG literature (e.g., (A7) and (A8) in [11]). Indeed, (A3) and (A4) are satisfied for the LQG-RMM problem in Section 5. Applying the feedback control pair $(\Psi^{0},\Psi)$ to (1.1) and (1.3), we get the following forward-backward system

\left\{\begin{aligned} dX_{t}^{0,N}=&b^{0}\big{(}t,X_{t}^{0,N},u_{t}^{0,N};X_{t}^{(N)},{u_{t}^{(N)}}\big{)}dt+{\sigma^{0}\big{(}t,X_{t}^{0,N};X_{t}^{(N)},{u_{t}^{(N)}}\big{)}}dW^{0}_{t},\ X_{0}^{0,N}=x_{0}^{0},\\ dX_{t}^{i,N}=&b\big{(}t,X_{t}^{i,N},u_{t}^{i,N};X_{t}^{0,N},u_{t}^{0,N};X_{t}^{(N)},u_{t}^{(N)}\big{)}dt\\ &+\sigma\big{(}t,X_{t}^{i,N},X_{t}^{0,N};X_{t}^{(N)},u_{t}^{(N)}\big{)}dW_{t}^{i},\quad X_{0}^{i,N}=x_{0},\\ -dY_{t}^{0,N}=&f^{0}\big{(}t,\Theta_{t}^{0,N},u_{t}^{0,N};\Theta_{t}^{(N)},u_{t}^{(N)}\big{)}dt-Z_{t}^{0,N}dW_{t}^{0},\ Y_{T}^{0,N}=\Phi^{0}(X_{T}^{0,N},X_{T}^{(N)})+\xi^{0},\\ -dY_{t}^{i,N}=&f\big{(}t,\Theta_{t}^{i,N},u_{t}^{i,N};\Theta_{t}^{0,N},u_{t}^{0,N};\Theta_{t}^{(N)},u_{t}^{(N)}\big{)}dt-Z_{t}^{i,N}d(W^{0},W^{i})_{t},\\ Y_{T}^{i,N}=&\Phi(X_{T}^{i,N},X_{T}^{0,N},X_{T}^{(N)})+\xi^{i},\end{aligned}\right.

(4.10)

where $\Xi_{\cdot}^{(N)}=\frac{1}{N}\sum_{i=1}^{N}\Xi_{\cdot}^{i,N},\ \Xi=X,Y,Z,\ u_{\cdot}^{(N)}=\frac{1}{N}\sum_{i=1}^{N}u_{\cdot}^{i,N}$ , $u_{t}^{0,N}=\Psi^{0}(t,X_{t}^{0,N},X_{t}^{1,N}),$ $u_{t}^{i,N}=\Psi(t,X_{t}^{0,N};X_{t}^{i,N}).$ The payoff functionals (1.2) now become

\begin{aligned} J_{0}^{N}=&\Gamma^{0}(Y_{0}^{0,N})+\mathbb{E}\Big{[}\int_{0}^{T}g^{0}\big{(}t,\Theta_{t}^{0,N},u_{t}^{0,N};\Theta_{t}^{(N)},u_{t}^{(N)}\big{)}dt\Big{]},\\ J_{i}^{N}=&\Gamma(Y_{0}^{i,N})+\mathbb{E}\Big{[}\int_{0}^{T}g\big{(}t,\Theta_{t}^{i,N},u_{t}^{i,N};\Theta_{t}^{0,N},u_{t}^{0,N};\Theta_{t}^{(N)},u_{t}^{(N)}\big{)}dt\Big{]}.\end{aligned}

(4.11)

To ease the notation, hereafter we may write $(J_{0}^{N},J_{i}^{N})$ instead of $(J_{0}^{N},J_{i}^{N})(u^{0},u^{1},\cdots,u^{N})$ .

Theorem 4.1.

Assume that (A1)-(A4) hold. The feedback control $\big{(}u^{0,N}_{t},u^{1,N}_{t},\cdots,u^{N,N}_{t}\big{)}$ is an $\varepsilon_{N}$ -Nash equilibrium for the RMM problem, where $\varepsilon_{N}\leq\frac{C}{\sqrt{N}}$ .

The proof of Theorem 4.1 relies on a tailor-made propagation of chaos analysis in the same spirit as [11], but invokes additional estimates on BSDEs. We defer it to Appendix A.1.
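The rate $C/\sqrt{N}$ is the usual rate at which an empirical average of $N$ independent samples approaches its mean. The following sketch illustrates only this elementary mechanism, not the argument of Appendix A.1; the i.i.d. standard normal samples, the function name and the sample sizes are assumptions made for illustration.

```python
import math
import random


def rms_error_of_empirical_mean(n_agents, n_trials=1000, seed=0):
    """Monte Carlo estimate of sqrt(E|X^(N) - E[X]|^2) for i.i.d. N(0,1) samples.

    For this toy model the exact value is 1/sqrt(n_agents).
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_trials):
        mean = sum(rng.gauss(0.0, 1.0) for _ in range(n_agents)) / n_agents
        total += mean * mean
    return math.sqrt(total / n_trials)


# Quadrupling the number of agents should roughly halve the error.
err_100 = rms_error_of_empirical_mean(100)
err_400 = rms_error_of_empirical_mean(400)
```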

5 Linear-quadratic-Gaussian cases

This section studies the RMM problem in some linear-quadratic-Gaussian (LQG) cases (LQG-RMM), where the agents’ states evolve by the following linear SDEs: for $1\leq i\leq N$ ,

\left\{\begin{aligned} dX_{t}^{0}=&\big{(}b_{1}^{0}X_{t}^{0}+b_{2}^{0}{u_{t}^{0}}+b_{3}^{0}X_{t}^{(N)}+b_{4}^{0}{u_{t}^{(N)}}\big{)}dt+\sigma_{0}dW^{0}_{t},\quad X_{0}^{0}=x_{0}^{0},\\ dX_{t}^{i}=&\big{(}b_{1}X_{t}^{i}+b_{2}u_{t}^{i}+b_{3}X_{t}^{0}+b_{4}u_{t}^{0}+b_{5}X_{t}^{(N)}+b_{6}u_{t}^{(N)}\big{)}dt+\sigma dW_{t}^{i},\quad X_{0}^{i}=x_{0},\end{aligned}\right.

(5.1)

and the recursive functionals assume the following quadratic forms

\begin{aligned} J_{0}^{N}=&-\gamma_{0}\|Y_{0}^{0}\|^{2}-\mathbb{E}\Big{[}\int_{0}^{T}\big{(}Q_{0}\|X_{t}^{0}-\mu^{0}_{1}\cdot X_{t}^{(N)}\|^{2}+R_{0}\|u_{t}^{0}-\mu^{0}_{2}\cdot u_{t}^{(N)}\|^{2}\big{)}dt\Big{]},\\ J_{i}^{N}=&-\gamma\|Y_{0}^{i}\|^{2}-\mathbb{E}\Big{[}\int_{0}^{T}\big{(}Q\|X_{t}^{i}-\mu_{1}\cdot X_{t}^{(N)}-\mu_{2}\cdot X_{t}^{0}\|^{2}+R\|u_{t}^{i}-\mu_{3}\cdot u_{t}^{(N)}-\mu_{4}\cdot u_{t}^{0}\|^{2}\big{)}dt\Big{]},\end{aligned}

(5.2)

where $(Y^{0},Z^{0};Y^{i},Z^{i})$ satisfy the following coupled linear BSDE system:

\left\{\begin{aligned} -dY_{t}^{0}=&\big{(}f_{1}^{0}X_{t}^{0}+f_{2}^{0}Y_{t}^{0}+f_{3}^{0}Z_{t}^{0}+f_{4}^{0}u_{t}^{0}+f_{5}^{0}X_{t}^{(N)}+f_{6}^{0}Y_{t}^{(N)}+f_{7}^{0}Z_{t}^{(N),0}+f_{8}^{0}u_{t}^{(N)}\big{)}dt-Z_{t}^{0}dW_{t}^{0},\\ -dY_{t}^{i}=&\big{(}f_{1}X_{t}^{i}+f_{2}Y_{t}^{i}+f_{3}Z_{t}^{i,0}+f_{4}u_{t}^{i}+f_{5}X_{t}^{0}+f_{6}Y_{t}^{0}+f_{7}Z_{t}^{0}+f_{8}u_{t}^{0}+f_{9}X_{t}^{(N)}\\ &+f_{10}Y_{t}^{(N)}+f_{11}Z_{t}^{(N),0}+f_{12}u_{t}^{(N)}\big{)}dt-Z_{t}^{i,0}dW_{t}^{0}-Z_{t}^{i,i}dW_{t}^{i},\\ Y_{T}^{0}=&\Phi_{1}^{0}X_{T}^{0}+\Phi_{2}^{0}X_{T}^{(N)}+\xi^{0},\quad Y_{T}^{i}=\Phi_{1}X_{T}^{i}+\Phi_{2}X_{T}^{0}+\Phi_{3}X_{T}^{(N)}+\xi^{i}.\end{aligned}\right.

(5.3)

with $Y_{t}^{(N)}=\frac{1}{N}\sum_{i=1}^{N}Y_{t}^{i},Z_{t}^{(N),0}=\frac{1}{N}\sum_{i=1}^{N}Z_{t}^{i,0}$ . To simplify the analysis, we assume that $n=m=k^{0}=k=1$ ; all coefficients are constants with nonnegative $(\gamma_{0},\gamma,Q_{0},Q)$ , positive $(R_{0},R)$ , and $\mu_{3}+\mu^{0}_{2}\mu_{4}\neq 1$ . The forward-backward LQG setting of (5.1)-(5.3) is strongly motivated by various practical applications, see [19, 26, 33] for more details. We introduce the following notations:

\begin{aligned} &\mathbf{X}_{t}=(X^{0}_{t},{X}^{1}_{t})^{\top},\ \mathbf{L}_{t}=({L}_{t}^{0},{L}_{t},{L}^{\ddagger}_{t})^{\top},\ \mathbf{Y}_{t}=(Y_{t}^{0},{Y}_{t}^{1})^{\top},\ \mathbf{P}_{t}=({P}^{0}_{t},{P}_{t},{P}^{\ddagger}_{t})^{\top},\\ &\mathbf{Z}_{t}=(Z_{t}^{0},{Z}_{t}^{1,0})^{\top},\ \mathbf{Q}_{t}=({Q}^{00}_{t},{Q}^{10}_{t},{Q}^{10,\ddagger}_{t})^{\top},\ \mathbf{Z}_{t}^{1}=(0,{Z}_{t}^{1,1})^{\top},\ \mathbf{Q}_{t}^{1}=({Q}^{01}_{t},{Q}^{11}_{t},{Q}^{11,\ddagger}_{t})^{\top}.\end{aligned}

(5.4)
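To see how the empirical averages $X^{(N)}$ and $u^{(N)}$ enter the linear dynamics (5.1), the following sketch simulates the $N+1$ scalar states by an Euler-Maruyama scheme. The linear feedbacks $u^{0}=k_{0}X^{0}$ and $u^{i}=kX^{i}$, the gains `k0`, `k`, and all numerical values are hypothetical placeholders; they are not the equilibrium strategies derived in this section.

```python
import math
import random


def simulate_lq_states(N, T, steps, coef0, coef, sigma0, sigma, x00, x0,
                       k0=0.0, k=0.0, seed=0):
    """Euler-Maruyama for (5.1) with hypothetical feedbacks u0 = k0*X0, ui = k*Xi.

    coef0 = (b1^0, b2^0, b3^0, b4^0), coef = (b1, ..., b6); all scalars.
    Returns the terminal states (X0_T, [X1_T, ..., XN_T]).
    """
    rng = random.Random(seed)
    dt = T / steps
    sq = math.sqrt(dt)
    X0, X = x00, [x0] * N
    for _ in range(steps):
        u0 = k0 * X0
        u = [k * xi for xi in X]
        x_bar = sum(X) / N  # empirical state average X^(N)
        u_bar = sum(u) / N  # empirical control average u^(N)
        drift0 = (coef0[0] * X0 + coef0[1] * u0
                  + coef0[2] * x_bar + coef0[3] * u_bar)
        new_X0 = X0 + drift0 * dt + sigma0 * sq * rng.gauss(0.0, 1.0)
        new_X = []
        for xi, ui in zip(X, u):
            drift = (coef[0] * xi + coef[1] * ui + coef[2] * X0
                     + coef[3] * u0 + coef[4] * x_bar + coef[5] * u_bar)
            new_X.append(xi + drift * dt + sigma * sq * rng.gauss(0.0, 1.0))
        X0, X = new_X0, new_X
    return X0, X
```

With all noise switched off and identical initial states, the minor agents remain identical, so $X^{(N)}=X^{i}$ along the whole path; this gives a simple sanity check of the coupling terms.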

By Propositions 4.1 and 4.2 and (4.5), the equilibrium strategy $(u^{0,*},u^{1,*},u^{j,*})$ now reads

\begin{aligned} u_{t}^{0,*}=&{a_{1}^{0}}^{\top}\mathbb{E}_{t}[\mathbf{P}_{t}]+{a_{3}^{0}}^{\top}\mathbb{E}_{t}[\mathbf{L}_{t}],\quad u_{t}^{1,*}=a_{1}^{\top}\mathbb{E}_{t}[\mathbf{P}_{t}]+a_{3}^{\top}\mathbb{E}_{t}[\mathbf{L}_{t}]+a_{7}^{\top}\mathbf{P}_{t}+a_{8}^{\top}\mathbf{L}_{t},\\ \mathbb{E}_{t}[{u}_{t}^{j,*}]=&({a}_{1}+a_{7})^{\top}\mathbb{E}_{t}[\mathbf{P}_{t}]+({a}_{3}+a_{8})^{\top}\mathbb{E}_{t}[\mathbf{L}_{t}].\end{aligned}

(5.5)

Combining with (4.9), we get the following FBSDE

\left\{\begin{aligned} d\mathbf{X}_{t}=&\Big{(}\mathbb{A}_{1}\mathbf{X}_{t}+\mathbb{A}_{2}\mathbb{E}_{t}[\mathbf{X}_{t}]+\mathbb{B}_{1}\mathbf{P}_{t}+\mathbb{B}_{2}\mathbb{E}_{t}[\mathbf{P}_{t}]+\mathbb{C}_{1}\mathbf{L}_{t}+\mathbb{C}_{2}\mathbb{E}_{t}[\mathbf{L}_{t}]\Big{)}dt\\ &+(\sigma_{0},0)^{\top}dW_{t}^{0}+(0,\sigma)^{\top}dW_{t}^{1},\ \mathbf{X}_{0}=({x}_{0}^{0},x_{0})^{\top},\\ d\mathbf{L}_{t}=&\Big{(}\mathbb{C}_{3}\mathbf{L}_{t}+\mathbb{C}_{4}\mathbb{E}_{t}[\mathbf{L}_{t}]\Big{)}dt+\Big{(}\mathbb{C}_{5}\mathbf{L}_{t}+\mathbb{C}_{6}\mathbb{E}_{t}[\mathbf{L}_{t}]\Big{)}dW_{t}^{0},\ \mathbf{L}_{0}=\rho\mathbf{Y}_{0},\\ -d\mathbf{Y}_{t}=&\Big{(}\mathbb{A}_{3}\mathbf{X}_{t}+\mathbb{A}_{4}\mathbb{E}_{t}[\mathbf{X}_{t}]+\mathbb{D}_{1}\mathbf{Y}_{t}+\mathbb{D}_{2}\mathbb{E}_{t}[\mathbf{Y}_{t}]+\mathbb{F}_{1}\mathbf{Z}_{t}+\mathbb{F}_{2}\mathbb{E}_{t}[\mathbf{Z}_{t}]\\ &+\mathbb{B}_{3}\mathbf{P}_{t}+\mathbb{B}_{4}\mathbb{E}_{t}[\mathbf{P}_{t}]+\mathbb{C}_{7}\mathbf{L}_{t}+\mathbb{C}_{8}\mathbb{E}_{t}[\mathbf{L}_{t}]\Big{)}dt-\mathbf{Z}_{t}dW_{t}^{0}-\mathbf{Z}_{t}^{1}dW_{t}^{1},\\ -d\mathbf{P}_{t}=&\Big{(}\mathbb{A}_{5}\mathbf{X}_{t}+\mathbb{A}_{6}\mathbb{E}_{t}[\mathbf{X}_{t}]+\mathbb{B}_{5}\mathbf{P}_{t}+\mathbb{B}_{6}\mathbb{E}_{t}[\mathbf{P}_{t}]+\mathbb{C}_{9}\mathbf{L}_{t}+\mathbb{C}_{10}\mathbb{E}_{t}[\mathbf{L}_{t}]\Big{)}dt-\mathbf{Q}_{t}dW_{t}^{0}-\mathbf{Q}_{t}^{1}dW_{t}^{1},\\ \mathbf{Y}_{T}=&\mathbb{A}_{7}\mathbf{X}_{T}+\mathbb{A}_{8}\mathbb{E}_{T}[\mathbf{X}_{T}]+(\xi^{0},\xi)^{\top},\ \mathbf{P}_{T}=\mathbb{C}_{11}\mathbf{L}_{T}+\mathbb{C}_{12}\mathbb{E}_{T}[\mathbf{L}_{T}].\end{aligned}\right.

(5.6)

For the sake of presentation, we defer the definitions of $\{a_{i}^{0}\}_{i=1,3}$ , $\{a_{i}\}_{i=1,3,7,8}$ and the matrices (vectors) of (5.5)-(5.6) to Appendix A.2. System (5.6) is a fully coupled FBSDE with mixed initial-terminal conditions, and its well-posedness can be discussed through the following steps. Setting $\overline{M}_{t}=\mathbb{E}_{t}[M_{t}]$ for $M=\mathbf{X},\mathbf{Y},\mathbf{Z},\mathbf{L},\mathbf{P},\mathbf{Q},$ and taking conditional expectations in (5.6), we obtain

\left\{\begin{aligned} d\overline{\mathbf{X}}_{t}=&[(\mathbb{A}_{1}+\mathbb{A}_{2})\overline{\mathbf{X}}_{t}+(\mathbb{B}_{1}+\mathbb{B}_{2})\overline{\mathbf{P}}_{t}+(\mathbb{C}_{1}+\mathbb{C}_{2})\overline{\mathbf{L}}_{t}]dt+(\sigma_{0},0)^{\top}dW_{t}^{0},\quad\overline{\mathbf{X}}_{0}=({x}_{0}^{0},x_{0})^{\top},\\ d\overline{\mathbf{L}}_{t}=&(\mathbb{C}_{3}+\mathbb{C}_{4})\overline{\mathbf{L}}_{t}dt+(\mathbb{C}_{5}+\mathbb{C}_{6})\overline{\mathbf{L}}_{t}dW_{t}^{0},\quad\overline{\mathbf{L}}_{0}=\rho\overline{\mathbf{Y}}_{0},\\ -d\overline{\mathbf{Y}}_{t}=&[(\mathbb{A}_{3}+\mathbb{A}_{4})\overline{\mathbf{X}}_{t}+(\mathbb{D}_{1}+\mathbb{D}_{2})\overline{\mathbf{Y}}_{t}+(\mathbb{F}_{1}+\mathbb{F}_{2})\overline{\mathbf{Z}}_{t}+(\mathbb{B}_{3}+\mathbb{B}_{4})\overline{\mathbf{P}}_{t}+(\mathbb{C}_{7}+\mathbb{C}_{8})\overline{\mathbf{L}}_{t}]dt-\overline{\mathbf{Z}}_{t}dW_{t}^{0},\\ -d\overline{\mathbf{P}}_{t}=&[(\mathbb{A}_{5}+\mathbb{A}_{6})\overline{\mathbf{X}}_{t}+(\mathbb{B}_{5}+\mathbb{B}_{6})\overline{\mathbf{P}}_{t}+(\mathbb{C}_{9}+\mathbb{C}_{10})\overline{\mathbf{L}}_{t}]dt-\overline{\mathbf{Q}}_{t}dW_{t}^{0},\\ \overline{\mathbf{Y}}_{T}=&(\mathbb{A}_{7}+\mathbb{A}_{8})\overline{\mathbf{X}}_{T}+(\xi^{0},\mathbb{E}_{T}[\xi])^{\top},\quad\overline{\mathbf{P}}_{T}=(\mathbb{C}_{11}+\mathbb{C}_{12})\overline{\mathbf{L}}_{T}.\end{aligned}\right.

(5.7)

Next, we consider the following ODE and BSDE

\left\{\begin{aligned} &\dot{S}_{t}+{S}_{t}{\mathbb{H}_{1}}+\mathbb{H}_{2}S_{t}+S_{t}{\mathbb{H}_{3}}S_{t}+(I+S_{t}\widetilde{\rho})\mathbb{H}_{4}(I+S_{t}\widetilde{\rho})^{-1}\big{(}S_{t}\mathbb{H}_{5}+S_{t}\mathbb{H}_{6}S_{t}\big{)}+\mathbb{H}_{7}=0,\\ &{S}_{T}=(I-\mathbb{G}_{1})^{-1}\mathbb{G}_{2},\end{aligned}\right.

(5.8)

\left\{\begin{aligned} -d\Upsilon_{t}=&\Big{[}\Big{(}S_{t}\mathbb{H}_{3}+\mathbb{H}_{2}+(I+S_{t}\widetilde{\rho})\mathbb{H}_{4}(I+S_{t}\widetilde{\rho})^{-1}S_{t}\mathbb{H}_{6}\Big{)}\Upsilon_{t}+(I+S_{t}\widetilde{\rho})\mathbb{H}_{4}(I+S_{t}\widetilde{\rho})^{-1}\upsilon_{t}\\ &+(S_{t}+I)\mathbb{H}_{4}(I+S_{t}\widetilde{\rho})^{-1}S_{t}(\sigma_{0},0,0,0,0)^{\top}\Big{]}dt-\upsilon_{t}dW_{t}^{0},\\ \Upsilon_{T}=&(I-\mathbb{G}_{1})^{-1}(\xi^{0},\mathbb{E}_{T}[\xi],0,0,0)^{\top}.\end{aligned}\right.

(5.9)

Again, we defer the definitions of $\{\mathbb{H}_{k}\}_{k=1}^{7}$ and $\mathbb{G}_{1},\mathbb{G}_{2}$ to Appendix A.2. The well-posedness of (5.8) may be obtained as in Theorem 4.6 of [19]. We refrain from presenting these conditions in detail, as they are rather technical and would cause an unnecessary digression. Instead, we directly assume that

(A5) (5.8) admits a unique $\mathbb{R}^{5\times 5}$ -valued solution $S$ with bounded $[I+S_{t}\widetilde{\rho}]^{-1},\ \text{a.e.}\ t.$

Under (A5), BSDE (5.9) admits a unique $\mathbb{F}^{0}$ -adapted solution $(\Upsilon,\upsilon)$ by the standard BSDE solvability arguments. Moreover, we have the following result:

Lemma 5.1.

Let (A5) hold. Then the linear FBSDE (5.7) has a unique $\mathbb{F}^{0}$ -adapted solution $(\overline{\mathbf{X}},\overline{\mathbf{L}},\overline{\mathbf{Y}},\overline{\mathbf{Z}},\overline{\mathbf{P}},\overline{\mathbf{Q}})$ with the following relations:

(\overline{\mathbf{Y}}_{t},\overline{\mathbf{P}}_{t})^{\top}=(I+S_{t}\widetilde{\rho})^{-1}S_{t}(\overline{\mathbf{X}}_{t},\overline{\mathbf{L}}_{t})^{\top}+(I+S_{t}\widetilde{\rho})^{-1}\Upsilon_{t},

(5.10)

\begin{aligned} (\overline{\mathbf{Z}}_{t},\overline{\mathbf{Q}}_{t})^{\top}=&(I+S_{t}\widetilde{\rho})^{-1}(S_{t}\mathbb{H}_{5}+S_{t}\mathbb{H}_{6}S_{t})\big{(}I-\widetilde{\rho}(I+S_{t}\widetilde{\rho})^{-1}S_{t}\big{)}(\overline{\mathbf{X}}_{t},\overline{\mathbf{L}}_{t})^{\top}\\ &-(I+S_{t}\widetilde{\rho})^{-1}(S_{t}\mathbb{H}_{5}+S_{t}\mathbb{H}_{6}S_{t})\widetilde{\rho}(I+S_{t}\widetilde{\rho})^{-1}\Upsilon_{t}\\ &+(I+S_{t}\widetilde{\rho})^{-1}S_{t}\mathbb{H}_{6}\Upsilon_{t}+(I+S_{t}\widetilde{\rho})^{-1}(S_{t}(\sigma_{0},0,0,0,0)^{\top}+\upsilon_{t}).\end{aligned}

Its proof is based on a standard linear transformation decoupling method (e.g., [17]) and we omit its details here. We now assume

(A6) $\det\{(0,I)e^{\mathcal{A}t}(0,I)^{\top}\}>0,\ \forall\ t\in[0,T],$ where $\mathcal{A}=\left(\begin{array}[]{cc}b_{1}&\frac{1}{2}R^{-1}|b_{2}|^{2}\\ 2Q&-b_{1}\end{array}\right)$ .

Proposition 5.1.

Let (A5)-(A6) hold. Then the Hamiltonian system (5.6) has a unique solution $(\mathbf{X},\mathbf{L},\mathbf{Y},\mathbf{Z},\mathbf{Z}^{1},\mathbf{P},\mathbf{Q},\mathbf{Q}^{1})$ (see (5.4) for notation), where $(X^{0},\mathbf{L},Y^{0},Z^{0},P^{0},Q^{00},P,Q^{10})$ are $\mathbb{F}^{0}$ -adapted, $(X^{1},Y^{1},Z^{10},Z^{11},P^{\ddagger},Q^{10,\ddagger},Q^{11,\ddagger})$ are $\mathbb{F}^{0,1}$ -adapted, and $Q^{01}=Q^{11}\equiv 0$ . Moreover, the following relation holds $P_{t}^{\ddagger}=\Sigma_{t}X_{t}^{1}+p_{t},\ t\in[0,T]$ , where $\Sigma$ and $p$ are given by

\Sigma_{t}=-\Big{[}(0,I)e^{\mathcal{A}(T-t)}(0,I)^{\top}\Big{]}^{-1}(0,I)e^{\mathcal{A}(T-t)}(I,0)^{\top},

(5.11)

\displaystyle p_{t}=\mathbb{E}_{t}\Big{[}-L_{T}^{\ddagger}\Phi_{1}\Pi_{T}+\int_{t}^{T}[(\Sigma_{s}\mathbf{\Lambda}_{1}+\Sigma_{s}\mathbf{\Lambda}_{2}(I+S_{s}\widetilde{\rho})^{-1}S_{s}+\mathbf{\Lambda}_{4})(\overline{\mathbf{X}}_{s},\mathbf{L}_{s})^{\top}+\Sigma_{s}\mathbf{\Lambda}_{2}(I+S_{s}\widetilde{\rho})^{-1}\Upsilon_{s}]\Pi_{s}ds\Big{]},

(5.12)

with $\Pi_{s}=e^{\int_{t}^{s}(\frac{1}{2}R^{-1}|b_{2}|^{2}\Sigma_{r}+b_{1})dr},\ \ s\in[t,T]$ , and $\mathbf{\Lambda}_{1},\mathbf{\Lambda}_{2}$ and $\mathbf{\Lambda}_{4}$ are given in Appendix A.2.
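In the scalar case $n=1$ assumed in this section, formula (5.11) can be evaluated in closed form. Since $\mathcal{A}$ is traceless, $\mathcal{A}^{2}=\omega^{2}I$ with $\omega^{2}=b_{1}^{2}+Qb_{2}^{2}/R$, so that $e^{\mathcal{A}s}=\cosh(\omega s)I+\omega^{-1}\sinh(\omega s)\mathcal{A}$. The sketch below uses this identity; it is our own elementary computation, offered as a check rather than as part of the proof. A direct differentiation suggests that the resulting $\Sigma$ satisfies the Riccati equation $\dot{\Sigma}_{t}+2b_{1}\Sigma_{t}+\frac{1}{2}R^{-1}b_{2}^{2}\Sigma_{t}^{2}-2Q=0$, $\Sigma_{T}=0$. The function name and numerical values are illustrative.

```python
import math


def sigma_riccati(t, T, b1, b2, Q, R):
    """Scalar evaluation of (5.11) with A = [[b1, b2^2/(2R)], [2Q, -b1]].

    Uses exp(A*s) = cosh(w*s)*I + (sinh(w*s)/w)*A, w^2 = b1^2 + Q*b2^2/R,
    which holds because A is a traceless 2x2 matrix.
    """
    if R <= 0 or Q < 0:
        raise ValueError("need R > 0 and Q >= 0")
    s = T - t
    w = math.sqrt(b1 * b1 + Q * b2 * b2 / R)
    # sinh(w*s)/w, replaced by its limit s when w == 0
    sh = math.sinh(w * s) / w if w > 0 else s
    ch = math.cosh(w * s)
    e22 = ch - b1 * sh   # (0,1) exp(A*s) (0,1)^T, the quantity appearing in (A6)
    e21 = 2.0 * Q * sh   # (0,1) exp(A*s) (1,0)^T
    return -e21 / e22


# Illustrative parameters only.
sigma_at_0 = sigma_riccati(0.0, 1.0, b1=0.3, b2=1.0, Q=1.0, R=2.0)
```

In this scalar sketch, $|b_{1}|\leq\omega$ gives $\cosh(\omega s)-b_{1}\omega^{-1}\sinh(\omega s)\geq e^{-\omega s}>0$ for $s\geq 0$, so the positivity required in (A6) holds whenever $Q\geq 0$ and $R>0$.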

We defer the proof of Proposition 5.1 to the Appendix. Combining (5.5), (5.10) and (5.12), the equilibrium strategies for $\mathcal{A}_{0}$ and $\mathcal{A}_{1}$ are, respectively,

\begin{aligned} u_{t}^{0,*}=&(0,{a_{1}^{0}}^{\top})(\overline{\mathbf{Y}}_{t},\overline{\mathbf{P}}_{t})^{\top}+{a_{3}^{0}}^{\top}{\mathbf{L}}_{t}=A_{t}^{0,1}(X_{t}^{0},\mathbb{E}_{t}[X_{t}^{1}])^{\top}+M_{t}^{0},\\ u_{t}^{1,*}=&(0,a_{1}^{\top})(\overline{\mathbf{Y}}_{t},\overline{\mathbf{P}}_{t})^{\top}+(a_{3}+a_{8})^{\top}\mathbf{L}_{t}+\frac{1}{2}R^{-1}b_{2}P_{t}^{\ddagger}\\ =&A_{t}^{1}(X_{t}^{0},\mathbb{E}_{t}[X_{t}^{1}])^{\top}+\frac{1}{2}R^{-1}b_{2}\Sigma_{t}X_{t}^{1}+\frac{1}{2}R^{-1}b_{2}\mathbb{E}_{t}\Big{[}\int_{t}^{T}C_{s}^{1}(X_{s}^{0},\mathbb{E}_{s}[X_{s}^{1}])^{\top}\Pi_{s}ds\Big{]}+M_{t},\end{aligned}

where

\begin{aligned} &(A_{t}^{0,1},A_{t}^{0,2}):=(0,{a_{1}^{0}}^{\top})(I+S_{t}\widetilde{\rho})^{-1}S_{t},\quad M_{t}^{0}:=A_{t}^{0,2}\mathbf{L}_{t}+(0,{a_{1}^{0}}^{\top})(I+S_{t}\widetilde{\rho})^{-1}\Upsilon_{t}+{a_{3}^{0}}^{\top}\overline{\mathbf{L}}_{t},\\ &(A_{t}^{1},A_{t}^{2}):=(0,a_{1}^{\top})(I+S_{t}\widetilde{\rho})^{-1}S_{t},\quad(C_{t}^{1},C_{t}^{2}):=\Sigma_{t}\mathbf{\Lambda}_{1}+\Sigma_{t}\mathbf{\Lambda}_{2}(I+S_{t}\widetilde{\rho})^{-1}S_{t}+\mathbf{\Lambda}_{4},\\ &M_{t}:=A_{t}^{2}\mathbf{L}_{t}+(0,a_{1}^{\top})(I+S_{t}\widetilde{\rho})^{-1}\Upsilon_{t}+(a_{3}+a_{8})^{\top}\mathbf{L}_{t}\\ &\qquad+\frac{1}{2}R^{-1}b_{2}\mathbb{E}_{t}\Big{[}-L_{T}^{\ddagger}\Phi_{1}\Pi_{T}+\int_{t}^{T}[C_{s}^{2}\mathbf{L}_{s}+\Sigma_{s}\mathbf{\Lambda}_{2}(I+S_{s}\widetilde{\rho})^{-1}\Upsilon_{s}]\Pi_{s}ds\Big{]}.\end{aligned}

Note that both $M_{t}^{0}$ and $M_{t}$ are $\mathbb{F}^{0}$ -adapted. By Theorem 4.1, we have the following result.

Theorem 5.1.

Let (A5)-(A6) hold. Then

\begin{aligned} \Big{(}&A_{t}^{0,1}(X_{t}^{0,N},\mathbb{E}_{t}[X_{t}^{1,N}])^{\top}+M_{t}^{0},\ \ \ A_{t}^{1}(X_{t}^{0,N},\mathbb{E}_{t}[X_{t}^{1,N}])^{\top}\\ &+\frac{1}{2}R^{-1}b_{2}\Sigma_{t}X_{t}^{i,N}+\frac{1}{2}R^{-1}b_{2}\mathbb{E}_{t}\Big{[}\int_{t}^{T}C_{s}^{1}(X_{s}^{0,N},\mathbb{E}_{s}[X_{s}^{1,N}])^{\top}\Pi_{s}ds\Big{]}+M_{t}\Big{)},\ 1\leq i\leq N,\end{aligned}

(5.13)

is an $\varepsilon_{N}$ -Nash equilibrium strategy for LQG-RMM with $\varepsilon_{N}=O(\frac{1}{\sqrt{N}})$ ; $(X^{0,N},X^{1,N},$ $\cdots,X^{N,N})$ is the solution of McKean-Vlasov SDE (5.1) by applying the equilibrium (5.13).

5.1 Forward LQG-RMM

This subsection studies the special case $\gamma_{0}=\gamma=0$ in (5.1), (5.2). In this case, all functionals are still quadratic but involve only the forward state, and the RMM reduces to the classical forward major-minor game in [11]. Although such a forward setting is not novel, our LQG-RMM is still novel in the generality of its weak couplings. Now (5.8) and (5.9) become

\left\{\begin{aligned} &\dot{S}_{t}+{S}_{t}(\mathbb{A}_{1}+\mathbb{A}_{2})+(\mathbb{B}_{5}+\mathbb{B}_{6})S_{t}+S_{t}\mathbb{B}_{2}S_{t}+(\mathbb{A}_{5}+\mathbb{A}_{6})=0,\\ &-d\Upsilon_{t}=(S_{t}\mathbb{B}_{2}+\mathbb{B}_{5}+\mathbb{B}_{6})\Upsilon_{t}dt,\ \ \ \ S_{T}=\Upsilon_{T}=0.\end{aligned}\right.

It is easy to check that if

\det\{(0,I)e^{\widehat{\mathcal{A}}t}(0,I)^{\top}\}>0,\ \forall\ t\in[0,T],\ \ \ \text{with}\ \widehat{\mathcal{A}}=\left(\begin{array}[]{cc}\mathbb{A}_{1}+\mathbb{A}_{2}&\mathbb{B}_{2}\\ -\mathbb{A}_{5}-\mathbb{A}_{6}&-\mathbb{B}_{5}-\mathbb{B}_{6}\end{array}\right),

(5.14)

then (A5) can be ensured and $S$ admits the following representation

S_{t}=-\Big{[}(0,I)e^{\widehat{\mathcal{A}}(T-t)}(0,I)^{\top}\Big{]}^{-1}(0,I)e^{\widehat{\mathcal{A}}(T-t)}(I,0)^{\top}=:\left(\begin{array}[]{ccc}S_{t}^{11}&S_{t}^{12}\\ S_{t}^{21}&S_{t}^{22}\\ S_{t}^{31}&S_{t}^{32}\\ \end{array}\right).

(5.15)

By the solution uniqueness, $\Upsilon_{t}\equiv 0,$ so $M_{t}^{0}=M_{t}\equiv 0,\ t\in[0,T]$ (noting $L^{0}=L=L^{\ddagger}\equiv 0$ ). Then

Corollary 5.1.

Let (A6) and (5.14) hold. An $\varepsilon_{N}$ -Nash equilibrium strategy for the forward LQG-RMM problem takes the following form

\begin{aligned} \Big{(}&A_{t}^{0,1}(X_{t}^{0,N},\mathbb{E}_{t}[X_{t}^{1,N}])^{\top},\ A_{t}^{1}(X_{t}^{0,N},\mathbb{E}_{t}[X_{t}^{1,N}])^{\top}\\ &+\frac{1}{2}R^{-1}b_{2}\Sigma_{t}X_{t}^{i,N}+\frac{1}{2}R^{-1}b_{2}\mathbb{E}_{t}\Big{[}\int_{t}^{T}C_{s}^{1}(X_{s}^{0,N},\mathbb{E}_{s}[X_{s}^{1,N}])^{\top}\Pi_{s}ds\Big{]}\Big{)},\quad 1\leq i\leq N,\end{aligned}

where $A_{t}^{0,1}={a_{1}^{0}}^{\top}(S_{t}^{11},S_{t}^{21},S_{t}^{31})^{\top},\ A_{t}^{1}=a_{1}^{\top}(S_{t}^{11},S_{t}^{21},S_{t}^{31})^{\top},\ C_{t}^{1}=\Sigma_{t}(b_{3},b_{5})+\Sigma_{t}\mathbf{\Lambda}_{3}S_{t}+(2Q\mu_{2},2Q\mu_{1}),$ $\mathbf{\Lambda}_{3}$ is given in Subsection 6.2; $S_{t}$ and $\Sigma_{t}$ are given by (5.15) and (5.11), respectively.

Example 5.1.

Consider a forward LQG-RMM with $\mu^{0}_{2}=\mu_{3}=\mu_{4}=0$ and $b_{4}^{0}=b_{4}=b_{6}=0$ (no weak-coupling of the control-average). The $\varepsilon_{N}$ -Nash equilibrium by Corollary 5.1 becomes

\displaystyle\Big{(}\frac{1}{2}R_{0}^{-1}b_{2}^{0}S_{t}^{11}X_{t}^{0,N}+\frac{1}{2}R_{0}^{-1}b_{2}^{0}S_{t}^{21}\mathbb{E}_{t}[X_{t}^{1,N}],\ \ \ \frac{1}{2}R^{-1}b_{2}\Sigma_{t}X_{t}^{i,N}+k_{t}\Big{)},\ 1\leq i\leq N,

where $k_{t}=\frac{1}{2}R^{-1}b_{2}\mathbb{E}_{t}\Big{[}\int_{t}^{T}C_{s}^{1}(X_{s}^{0,N},\mathbb{E}_{s}[X_{s}^{1,N}])^{\top}\Pi_{s}ds\Big{]}$ and $C_{t}^{1}=\Sigma_{t}(b_{3},b_{5})+(2Q\mu_{2},2Q\mu_{1}).$ This result recovers Theorem 5.1 in [11] with a subtle difference: $\varepsilon_{N}=O(N^{-\frac{1}{d+4}})$ in [11], while our $\varepsilon_{N}=O(N^{-\frac{1}{2}})$ . This is mainly due to the modeling differences in the weak couplings: [11] considers the empirical distribution, while we focus on the more special empirical average. As a tradeoff, we obtain an explicit expression for $k_{t}$ , while [11] only shows its existence.
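To give a feeling for the gap between the two rates, the following short computation compares $N^{-\frac{1}{d+4}}$, the rate quoted from [11], with the $N^{-\frac{1}{2}}$ rate of Theorem 4.1. The multiplicative constants are ignored, and the values of $N$ and $d$ are arbitrary illustrative choices.

```python
def rate_empirical_distribution(N, d):
    """Rate N^(-1/(d+4)) quoted from [11]; constants are ignored."""
    return N ** (-1.0 / (d + 4))


def rate_empirical_average(N):
    """Rate N^(-1/2) from Theorem 4.1; constants are ignored."""
    return N ** (-0.5)


# Illustrative comparison for N = 10^4 agents and d = 1.
slow = rate_empirical_distribution(10**4, 1)
fast = rate_empirical_average(10**4)
```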

5.2 Backward LQG-RMM

In this subsection, we set $Q_{0}=Q=0$ , $f_{1}^{0}=f_{5}^{0}=f_{1,5,9}=0$ , $\Phi_{1}^{0}=\Phi_{2}^{0}=\Phi_{1,2,3}=0$ in (5.2) and (5.3). The LQG-RMM is now purely “backward”, without the forward state (5.1). The backward LQG setting has found broad applications in areas such as optimal investment, recursive utility and hedging (e.g., [20]). Related MFG studies of this setting have also been well addressed (see [14, 22]). However, the backward LQG-RMM still seems novel in the literature. It can capture the features of the large investor (see [16]) and relative performance ([14]), both of which are well motivated in financial studies. In this case, (5.8) reads (noting that $S$ is $\mathbb{R}^{2\times 3}$ -valued)

\begin{aligned} &\dot{S}_{t}+{S}_{t}(\mathbb{C}_{3}+\mathbb{C}_{4}+\rho\mathbb{C}_{7}+\rho\mathbb{C}_{8})+\widetilde{\mathbb{C}}S_{t}+S_{t}\widetilde{\mathbb{B}}S_{t}+(I+S_{t}\rho)(\mathbb{F}_{1}+\mathbb{F}_{2})(I+S_{t}\rho)^{-1}\cdot\\ &\big{(}S_{t}(\mathbb{C}_{5}+\mathbb{C}_{6})+S_{t}(\mathbb{C}_{5}+\mathbb{C}_{6})\rho S_{t}\big{)}+\mathbb{C}_{7}+\mathbb{C}_{8}=0,\ {S}_{T}=0,\end{aligned}

(5.16)

and the linear BSDE (5.9) now becomes

\left\{\begin{aligned} -d\Upsilon_{t}=&\Big{[}\Big{(}S_{t}\widetilde{\mathbb{B}}+\widetilde{\mathbb{C}}+(I+S_{t}\rho)(\mathbb{F}_{1}+\mathbb{F}_{2})(I+S_{t}\rho)^{-1}S_{t}(\mathbb{C}_{5}+\mathbb{C}_{6})\rho\Big{)}\Upsilon_{t}\\ &+(I+S_{t}\rho)(\mathbb{F}_{1}+\mathbb{F}_{2})(I+S_{t}\rho)^{-1}\upsilon_{t}\Big{]}dt-\upsilon_{t}dW_{t}^{0},\ \Upsilon_{T}=(\xi^{0},\mathbb{E}_{T}[\xi^{1}])^{\top},\end{aligned}\right.

(5.17)

where $\widetilde{\mathbb{C}}=\mathbb{C}_{7}\rho+\mathbb{C}_{8}\rho+\mathbb{D}_{1}+\mathbb{D}_{2},$ $\widetilde{\mathbb{B}}=(\mathbb{C}_{3}+\mathbb{C}_{4})\rho+\rho\widetilde{\mathbb{C}}$ . Moreover, it follows from (5.11)-(5.12) that $\Sigma_{t}=p_{t}\equiv 0$ . Besides, by (5.7) and Lemma 5.1, $\mathbf{L}_{t}=(L_{t}^{0},L_{t},L_{t}^{\ddagger})^{\top}$ satisfies

\displaystyle d\left(\begin{array}[]{c}{L}_{t}^{0}\\ L_{t}\end{array}\right)=(\mathbb{D}_{1}+\mathbb{D}_{2})^{\top}\left(\begin{array}[]{c}{L}_{t}^{0}\\ L_{t}\end{array}\right)dt+(\mathbb{F}_{1}+\mathbb{F}_{2})^{\top}\left(\begin{array}[]{c}{L}_{t}^{0}\\ L_{t}\end{array}\right)dW_{t}^{0},\ dL_{t}^{\ddagger}=f_{2}L_{t}^{\ddagger}dt+f_{3}L_{t}^{\ddagger}dW_{t}^{0},

(5.18)

with the initial condition $\mathbf{L}_{0}=\rho(I+S_{0}\rho)^{-1}S_{0}\mathbf{L}_{0}+\rho(I+S_{0}\rho)^{-1}\Upsilon_{0}.$ In particular, if $S_{0}=0$ , then $\mathbf{L}_{0}=\rho\Upsilon_{0}$ ; and if $\det(S_{0}^{\top}S_{0})\neq 0$ , then $\mathbf{L}_{0}=(S_{0}^{\top}S_{0})^{-1}S_{0}^{\top}(I+S_{0}\rho)S_{0}\rho(I+S_{0}\rho)^{-1}\Upsilon_{0}.$ Since (A6) can be readily verified, the following result follows directly from Theorem 5.1.

Corollary 5.2.

Under (A5), assume that $(S,\Upsilon,\upsilon)$ and $\mathbf{L}$ solve (5.16)-(5.17) and (5.18), respectively. Then an $\varepsilon_{N}$ -Nash equilibrium $\Big{(}M_{t}^{0},M_{t},\cdots,M_{t}\Big{)}$ for the backward LQG-RMM is given by

	$\displaystyle M_{t}^{0}=$	$\displaystyle{a_{3}^{0}}^{\top}\mathbf{L}_{t}={a_{3}^{0}}^{\top}\Big{(}e^{\mathbb{H}t+\mathbb{K}W_{t}^{0}}(L_{0}^{0},L_{0}),e^{(f_{2}-\frac{1}{2}\|f_{3}\|^{2})t+f_{3}W_{t}^{0}}L_{0}^{\ddagger}\Big{)}^{\top},$
	$\displaystyle M_{t}=$	$\displaystyle(a_{3}+a_{8})^{\top}\mathbf{L}_{t}=(a_{3}+a_{8})^{\top}\Big{(}e^{\mathbb{H}t+\mathbb{K}W_{t}^{0}}(L_{0}^{0},L_{0}),e^{(f_{2}-\frac{1}{2}\|f_{3}\|^{2})t+f_{3}W_{t}^{0}}L_{0}^{\ddagger}\Big{)}^{\top},$		(5.19)

with $\mathbb{H}=(\mathbb{D}_{1}+\mathbb{D}_{2})^{\top}-\frac{1}{2}(\mathbb{F}_{1}+\mathbb{F}_{2})^{\top}(\mathbb{F}_{1}+\mathbb{F}_{2}),\ \mathbb{K}=(\mathbb{F}_{1}+\mathbb{F}_{2})^{\top}.$
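As a quick consistency check of the last component in (5.19), the following sketch verifies the closed form of $L^{\ddagger}$ directly from its equation in (5.18) by Itô's formula. Treating $f_{2},f_{3}$ as scalar constants and $W^{0}$ as one-dimensional is an assumption made here only for illustration, and $\mathcal{E}_{t}$ is notation introduced for this sketch.

```latex
% Sketch under the assumption of scalar constants f_2, f_3 and a one-dimensional W^0.
% \mathcal{E}_t is a hypothetical shorthand, not notation from the paper.
\[
\begin{aligned}
\mathcal{E}_{t}&:=e^{(f_{2}-\frac{1}{2}f_{3}^{2})t+f_{3}W_{t}^{0}},\\
d\mathcal{E}_{t}&=\mathcal{E}_{t}\Big[\big(f_{2}-\tfrac{1}{2}f_{3}^{2}\big)dt+f_{3}\,dW_{t}^{0}\Big]
                 +\tfrac{1}{2}f_{3}^{2}\,\mathcal{E}_{t}\,dt
               =f_{2}\mathcal{E}_{t}\,dt+f_{3}\mathcal{E}_{t}\,dW_{t}^{0},\\
L_{t}^{\ddagger}&=\mathcal{E}_{t}L_{0}^{\ddagger}
\quad\text{solves}\quad
dL_{t}^{\ddagger}=f_{2}L_{t}^{\ddagger}dt+f_{3}L_{t}^{\ddagger}dW_{t}^{0}.
\end{aligned}
\]
```

By uniqueness for linear SDEs this is the solution with initial value $L_{0}^{\ddagger}$ . In particular, if $f_{3}=0$ this component is deterministic.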

We have the following observations on (5.19):
(1) Both $M^{0}$ and $M$ are $\mathbb{F}^{0}$ -adapted, and depend on the common noise $W^{0}$ only through $e^{\mathbb{K}W_{t}^{0}}$ and $e^{f_{3}W_{t}^{0}}$ . If the driver of the BSDE (5.3) is independent of the intensity state $z$ (namely, $f_{3}^{0}=f_{7}^{0}=f_{3,7,11}=0$ , so $\mathbb{K}=0$ ), the equilibrium for each agent becomes deterministic.
(2) Unlike the forward LQG-RMM (Corollary 5.1), whose equilibrium is $\mathbb{F}^{i,0}$ -adapted, the equilibrium $(M_{t})_{t\in[0,T]}$ in the backward case is $\mathbb{F}^{0}$ -adapted; hence the idiosyncratic information driven by the individual noises $\{W^{i}\}_{i=1}^{N}$ plays no role in the equilibrium. This is mainly because the driver of the BSDE (5.3) is now linear and independent of the principal intensity state $Z^{i,i}$ .
(3) If the major agent is absent, the equilibrium for each (minor) agent becomes $M_{t}=(a_{3}+a_{8})^{\top}(0,0,L_{t}^{\ddagger})^{\top}$ . Comparing this with $M_{t}$ in (5.19), one sees that the term $(a_{3}+a_{8})^{\top}(L_{t}^{0},L_{t},0)^{\top}$ captures the influence of $\mathcal{A}_{0}$ .

Remark 5.1.

(i) When $f_{3}^{0}=f_{3}+f_{11},f_{7}^{0}=f_{7}=0$ ( $\mathbb{K}^{\top}=f_{3}^{0}I$ ), (5.16) can be solved as

S_{t}=-\Big{[}(0,I_{2})e^{\mathcal{B}(T-t)}(0,I)^{\top}\Big{]}^{-1}(0,I)e^{\mathcal{B}(T-t)}(I,0)^{\top},\ t\in[0,T],

(5.20)

where $\mathcal{B}=\left(\begin{array}[]{cc}\widehat{\mathbb{C}}&\widetilde{\mathbb{B}}+f_{3}^{0}(\mathbb{C}_{5}+\mathbb{C}_{6})\rho\\ -(\mathbb{C}_{7}+\mathbb{C}_{8})&-\widetilde{\mathbb{C}}\end{array}\right)$ , $\widehat{\mathbb{C}}=\mathbb{C}_{3}+\mathbb{C}_{4}+\rho(\mathbb{C}_{7}+\mathbb{C}_{8})+f_{3}^{0}(\mathbb{C}_{5}+\mathbb{C}_{6})$ and (A5) is satisfied when $\det\{(0,I)e^{\mathcal{B}t}(0,I)^{\top}\}>0,\ \forall\ t\in[0,T].$

(ii) If $f_{3}^{0}=f_{7}^{0}=f_{3,7,11}=0$ , $\mu^{0}_{2}=\mu_{3}=\mu_{4}=0$ and $f_{8}^{0}=f_{8}=f_{12}=0$ , there is no weak coupling through the control average, and the $\varepsilon_{N}$ -Nash equilibrium is given by $M_{t}^{0}=-\frac{1}{2}\big{(}R_{0}^{-1}f_{4}^{0},R_{0}^{-1}f_{8}\big{)}e^{\mathbb{H}t}(l_{0}^{0},l_{0})^{\top},\ M_{t}=-\frac{1}{2}R^{-1}f_{4}e^{f_{2}t}l_{0}^{\ddagger}.$ Moreover, when the major agent is absent, the above result recovers that of [22] (Theorem 3.1).

We present two concrete examples with more explicit representations for $\big{(}M_{t}^{0},M_{t},\cdots,M_{t}\big{)}$ .

Example 5.2.

Consider the backward LQG-RMM problem with: for $1\leq i\leq N,$

\displaystyle J_{0}^{N}=-\frac{1}{2}|Y_{0}^{0}|^{2}-\frac{1}{2}\mathbb{E}\Big{[}\int_{0}^{T}|u_{t}^{0}-\mu^{0}_{2}\cdot u_{t}^{(N)}|^{2}dt\Big{]},\ J_{i}^{N}=-\frac{1}{2}|Y_{0}^{i}|^{2}-\frac{1}{2}\mathbb{E}\Big{[}\int_{0}^{T}|u_{t}^{i}-\mu_{3}\cdot u_{t}^{(N)}-\mu_{4}\cdot u_{t}^{0}|^{2}dt\Big{]},

\left\{\begin{aligned} -dY_{t}^{0}=&\Big{[}Z_{t}^{0}+\big{(}u_{t}^{0}-\mu^{0}_{2}u_{t}^{(N)}\big{)}\Big{]}dt-Z_{t}^{0}dW_{t}^{0},\quad Y_{T}^{0}=\xi^{0},\\ -dY_{t}^{i}=&\Big{[}f_{2}Y_{t}^{i}+{(1-f_{2})Z_{t}^{i,0}}-f_{2}Y_{t}^{(N)}+f_{2}Z_{t}^{(N,0)}+\big{(}u_{t}^{i}-\mu_{4}u_{t}^{0}-\mu_{3}u_{t}^{(N)}\big{)}\Big{]}dt\\ &-Z_{t}^{i,0}dW_{t}^{0}-Z_{t}^{i,i}dW_{t}^{i},\quad Y_{T}^{i}=\xi^{i}.\end{aligned}\right.

Therefore, the major and each minor agent are weakly-coupled through their control-average along with an identical relative performance parameter. $\mathcal{B}$ in Remark 5.1 now reads as

\mathcal{B}=\left(\begin{array}[]{ccccc}0&\mu_{4}&-\mu^{0}_{2}&0&-\mu^{0}_{2}\\ 0&1&0&0&0\\ 0&0&0&0&0\\ 1&-\mu_{4}&\mu^{0}_{2}&1&\mu^{0}_{2}\\ 0&0&1&0&1\\ \end{array}\right).

Since $\mathcal{B}^{n}=\mathcal{B}$ for $n\geq 1$ , we have $\det\{(0,I)e^{\mathcal{B}t}(0,I)^{\top}\}=\det\left(\begin{array}[]{cc}e^{t}&\mu^{0}_{2}(e^{t}-1)\\ 0&e^{t}\end{array}\right)=e^{2t}>0,\ \forall\ t\in[0,T]$ ; hence (A5) holds. From (5.20),

S_{t}=-\left(\begin{array}[]{ccc}1-e^{-(T-t)}&-\mu_{4}(1-e^{-(T-t)})&\mu^{0}_{2}e^{-(T-t)}(1-e^{-(T-t)})\\ 0&0&1-e^{-(T-t)}\end{array}\right),\ t\in[0,T].
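The passage from the idempotence of $\mathcal{B}$ to the displayed $S_{t}$ can be sketched as follows. Here $\tau=T-t$ is shorthand introduced for this sketch, and the blocks are read with $I=I_{2}$ in $(0,I)$ and $I=I_{3}$ in $(I,0)$ , which is our reading of the dimensions in (5.20).

```latex
% Sketch: \mathcal{B}^n = \mathcal{B} for n >= 1, so the exponential series collapses.
% \tau = T - t is shorthand used only in this sketch.
\[
\begin{aligned}
e^{\mathcal{B}\tau}
 &=I+\sum_{n\geq 1}\frac{\tau^{n}}{n!}\mathcal{B}^{n}=I+(e^{\tau}-1)\mathcal{B},\\
(0,I)e^{\mathcal{B}\tau}(0,I)^{\top}
 &=\begin{pmatrix}e^{\tau}&\mu_{2}^{0}(e^{\tau}-1)\\ 0&e^{\tau}\end{pmatrix},
\qquad
(0,I)e^{\mathcal{B}\tau}(I,0)^{\top}
 =(e^{\tau}-1)\begin{pmatrix}1&-\mu_{4}&\mu_{2}^{0}\\ 0&0&1\end{pmatrix},\\
\Big[(0,I)e^{\mathcal{B}\tau}(0,I)^{\top}\Big]^{-1}
 &=\begin{pmatrix}e^{-\tau}&-\mu_{2}^{0}e^{-\tau}(1-e^{-\tau})\\ 0&e^{-\tau}\end{pmatrix}.
\end{aligned}
\]
% Multiplying the inverse by the lower-left block and using
% (1-e^{-\tau}) - (1-e^{-\tau})^2 = e^{-\tau}(1-e^{-\tau})
% gives the three columns of -S_t:
\[
-S_{t}=\begin{pmatrix}
1-e^{-\tau}&-\mu_{4}(1-e^{-\tau})&\mu_{2}^{0}e^{-\tau}(1-e^{-\tau})\\
0&0&1-e^{-\tau}
\end{pmatrix}.
\]
```

The determinant of the upper-triangular block is $e^{2\tau}$ , which is the quantity used to check (A5).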

We calculate from (5.19) that, for $t\in[0,T],$

	$\displaystyle M_{t}^{0}=$	$\displaystyle-(1-\mu_{3}-\mu^{0}_{2}\mu_{4})^{-1}e^{-\frac{1}{2}t+W_{t}^{0}}\mathbb{E}\Big{[}e^{-\frac{3}{2}T+W_{T}^{0}}\Big{(}(1-\mu_{3})\xi^{0}+\mu^{0}_{2}(e^{f_{2}t}-1+e^{-T}+\mu_{3}-\mu_{3}e^{-T})\xi^{1}\Big{)}\Big{]},$
	$\displaystyle M_{t}=$	$\displaystyle-(1-\mu_{3}-\mu^{0}_{2}\mu_{4})^{-1}e^{-\frac{1}{2}t+W_{t}^{0}}\mathbb{E}\Big{[}e^{-\frac{3}{2}T+W_{T}^{0}}\Big{(}\mu_{4}\xi^{0}+(e^{f_{2}t}-\mu_{4}\mu^{0}_{2}+\mu_{4}\mu^{0}_{2}e^{-T})\xi^{1}\Big{)}\Big{]}.$

Remark 5.2.

(1) The equilibrium $(M^{0},M)$ depends linearly on the terminal conditions $(\xi^{0},\xi^{1})$ , with coefficients determined by the parameters $(\mu_{3},\mu^{0}_{2},f_{2},T,\mu_{4})$ . (2) When $\mu^{0}_{2}=0$ , the major agent’s payoff is independent of the control average of the minor agents, so her Nash strategy $M^{0}$ no longer depends on $\xi^{1}$ , but may still be influenced by the minor agents provided $\mu_{3}\neq 0$ .

Example 5.3.

Consider the backward LQG-RMM problem with:

\displaystyle J_{0}^{N}=-\gamma_{0}|Y_{0}^{0}|^{2}-\mathbb{E}\Big{[}\int_{0}^{T}R_{0}|u_{t}^{0}|^{2}dt\Big{]},\ J_{i}^{N}=-\gamma|Y_{0}^{i}|^{2}-\mathbb{E}\Big{[}\int_{0}^{T}R|u_{t}^{i}|^{2}dt\Big{]},\ 1\leq i\leq N,

\left\{\begin{aligned} -dY_{t}^{0}=&\Big{(}f_{6}^{0}Y_{t}^{(N)}+f_{8}^{0}u_{t}^{(N)}\Big{)}dt-Z_{t}^{0}dW_{t}^{0},\quad Y_{T}^{0}=\xi^{0},\\ \ -dY_{t}^{i}=&\Big{(}f_{2}Y_{t}^{i}+f_{8}u_{t}^{0}-f_{2}Y_{t}^{(N)}+f_{12}u_{t}^{(N)}\Big{)}dt-Z_{t}^{i,0}dW_{t}^{0}-Z_{t}^{i,i}dW_{t}^{i},\ Y_{T}^{i}=\xi^{i},\ 1\leq i\leq N.\end{aligned}\right.

The major agent and each minor agent interact through the coupling $(Y^{(N)},u^{(N)})$ . Moreover, each minor agent is also directly influenced by the major agent’s control $u^{0}$ . $\mathcal{B}$ in Remark 5.1 becomes

\mathcal{B}=\left(\begin{array}[]{ccccc}0&0&0&0&2\gamma_{0}f_{6}^{0}\\ f_{6}^{0}&0&0&2\gamma_{0}f_{6}^{0}&0\\ 0&0&0&0&0\\ 0&0&0&0&-f_{6}^{0}\\ 0&0&0&0&0\\ \end{array}\right).

Now, $\mathcal{B}^{2}=0$ , and $\det\{(0,I_{2})e^{\mathcal{B}t}(0,I_{2})^{\top}\}=\det\left(\begin{array}[]{cc}1&-f_{6}^{0}t\\ 0&1\end{array}\right)=1>0$ , $\forall\ t$ , so (A5) holds. Moreover, $S_{t}\equiv 0$ by (5.20), and $M_{t}^{0}=-R_{0}^{-1}f_{8}f_{6}^{0}\gamma_{0}\Big{(}\mathbb{E}[\xi^{0}]+f_{6}^{0}T\mathbb{E}[\xi^{1}]\Big{)}t,\ M_{t}\equiv 0$ by (5.19).
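For the reader's convenience, the step from $\mathcal{B}^{2}=0$ to $S_{t}\equiv 0$ can be sketched as follows, with $\tau=T-t$ as shorthand introduced here and the blocks read off from the last two rows of the matrix $\mathcal{B}$ displayed above.

```latex
% Sketch: \mathcal{B}^2 = 0, so the exponential series stops after the linear term.
% \tau = T - t is shorthand used only in this sketch.
\[
\begin{aligned}
e^{\mathcal{B}\tau}&=I+\tau\mathcal{B},\\
(0,I_{2})e^{\mathcal{B}\tau}(0,I_{2})^{\top}
 &=\begin{pmatrix}1&-f_{6}^{0}\tau\\ 0&1\end{pmatrix},
\qquad
(0,I_{2})e^{\mathcal{B}\tau}(I_{3},0)^{\top}
 =\tau\begin{pmatrix}0&0&0\\ 0&0&0\end{pmatrix}=0,\\
S_{t}&=-\Big[(0,I_{2})e^{\mathcal{B}\tau}(0,I_{2})^{\top}\Big]^{-1}\cdot 0=0 .
\end{aligned}
\]
```

The lower-left block vanishes because rows 4 and 5 of $\mathcal{B}$ have zero entries in their first three columns.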

6 Conclusions

This paper studies a new class of recursive major-minor (RMM) games featuring: (1) recursive functionals with nonlinear BSDE representations; (2) comprehensive and general weak couplings. We propose a novel structural scheme to construct the auxiliary problem, a key step towards the desired $\varepsilon$ -Nash equilibrium. In the RMM context, this scheme consists of a bilateral perturbation and a mixed triple-agent leader-follower-Nash analysis. In contrast to the heuristic arguments in most of the MFG literature, this scheme lays down a unified game-theoretic foundation for analyzing more complex LP coupling structures, such as those with heterogeneous robust beliefs or with coalition interactions arising from nested information. We plan to address them in future work.

Appendix A Appendix

A.1 Proof of Theorem 4.1

For a fixed $N$ , it suffices to verify the $\varepsilon_{N}$ -Nash equilibrium property of $(u^{0,N},u^{1,N},\cdots,u^{N,N})$ for $\mathcal{A}_{0}$ . The verification for $\{\mathcal{A}_{i}\}_{i=1}^{N}$ is analogous, so we omit the details. For this purpose, we consider the following limiting processes of (4.10) and (4.11):

\left\{\begin{aligned} dX_{t}^{0}=&b^{0}(t,X_{t}^{0},u_{t}^{0,*};\overline{X}_{t}^{i},\overline{u}_{t}^{i,*})dt+{\sigma^{0}(t,{X}_{t}^{0};\overline{X}_{t}^{i},\overline{u}_{t}^{i,*})}dW^{0}_{t},\quad X_{0}^{0}=x_{0}^{0},\\ dX_{t}^{i}=&b(t,X_{t}^{i},{u}_{t}^{i,*};X_{t}^{0},u_{t}^{0,*};\overline{X}_{t}^{i},\overline{u}_{t}^{i,*})dt+\sigma(t,X_{t}^{i},X_{t}^{0};\overline{X}_{t}^{i},\overline{u}_{t}^{i,*})dW_{t}^{i},\quad X_{0}^{i}=x_{0},\\ -dY_{t}^{0}=&f^{0}(t,\Theta_{t}^{0},u_{t}^{0,*};\overline{\Theta}_{t}^{i},\overline{u}_{t}^{i,*})dt-Z_{t}^{0}dW_{t}^{0},\quad Y_{T}^{0}=\Phi^{0}(X_{T}^{0},\overline{X}_{T}^{i})+\xi^{0},\\ -dY_{t}^{i}=&f(t,\Theta_{t}^{i},{u}_{t}^{i,*};\Theta_{t}^{0},u_{t}^{0,*};\overline{\Theta}_{t}^{i},\overline{u}_{t}^{i,*})dt-Z_{t}^{i}d(W^{0},W^{i})_{t},\quad Y_{T}^{i}=\Phi(X_{T}^{i},X_{T}^{0},\overline{X}_{T}^{i})+\xi^{i},\\ \end{aligned}\right.

and $J_{0}=\Gamma^{0}(Y_{0}^{0})+\mathbb{E}[\int_{0}^{T}g^{0}(t,\Theta_{t}^{0},{u}_{t}^{0,*};\overline{\Theta}_{t}^{i},\overline{u}_{t}^{i,*})dt],$ where $u_{t}^{0,*}=\Psi^{0}(t,X_{t}^{0},X_{t}^{1}),\ u_{t}^{i,*}=\Psi(t,X_{t}^{0};X_{t}^{i}).$ By the Burkholder-Davis-Gundy inequality and standard convergence estimates for SDEs (e.g., Theorem 10.1.7 in [35]), for $0\leq s\leq T,$ $j=0,1,\cdots,N,$ we have

		$\displaystyle\mathbb{E}[\mathop{\rm sup}_{0\leq t\leq s}\|X_{t}^{j,N}-X_{t}^{j}\|^{2}]\leq C\int_{0}^{s}\sum_{k=0,1,i}\mathbb{E}[\mathop{\rm sup}_{0\leq t\leq r}\|X_{t}^{k,N}-X_{t}^{k}\|^{2}]dr$
		$\displaystyle+C\int_{0}^{s}\mathbb{E}\Big{[}\Big{\|}\frac{1}{N}\sum_{i=1}^{N}X_{r}^{i,N}-\mathbb{E}_{r}[X_{r}^{i}]\Big{\|}^{2}\Big{]}dr+C\int_{0}^{s}\mathbb{E}\Big{[}\Big{\|}\frac{1}{N}\sum_{i=1}^{N}\Psi(r,X_{r}^{0,N},X_{r}^{i,N})-\mathbb{E}_{r}[\Psi(r,X_{r}^{0},X_{r}^{i})]\Big{\|}^{2}\Big{]}dr$
		$\displaystyle\leq C\int_{0}^{s}\mathbb{E}[\mathop{\rm sup}_{0\leq t\leq r}\|X_{t}^{0,N}-X_{t}^{0}\|^{2}]dr+C\int_{0}^{s}\mathop{\rm sup}_{1\leq i\leq N}\mathbb{E}[\mathop{\rm sup}_{0\leq t\leq r}\|X_{t}^{i,N}-X_{t}^{i}\|^{2}]dr+\frac{C}{N}.$

It follows from Gronwall's inequality that

\mathop{\rm sup}_{0\leq i\leq N}\mathbb{E}[\mathop{\rm sup}_{0\leq t\leq T}|X_{t}^{i,N}-X_{t}^{i}|^{2}]\leq{\frac{C}{N}}.

(A.1)
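The Gronwall step leading to (A.1) can be sketched as follows. The function $\varphi_{N}$ is notation introduced here for illustration only, and $C$ denotes a generic constant independent of $N$ that may change from line to line.

```latex
% Sketch of the Gronwall argument; \varphi_N is hypothetical notation, not from the paper.
\[
\begin{aligned}
\varphi_{N}(s)&:=\sup_{0\leq j\leq N}\mathbb{E}\Big[\sup_{0\leq t\leq s}\|X_{t}^{j,N}-X_{t}^{j}\|^{2}\Big],
 \qquad 0\leq s\leq T,\\
\varphi_{N}(s)&\leq C\int_{0}^{s}\varphi_{N}(r)\,dr+\frac{C}{N}
 \qquad\text{(taking the supremum over $j$ in the preceding estimate)},\\
\varphi_{N}(T)&\leq\frac{C}{N}\,e^{CT}.
\end{aligned}
\]
```

Since $T$ is fixed, the factor $e^{CT}$ is absorbed into the generic constant, which gives the rate $\frac{C}{N}$ in (A.1).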

Applying Itô’s formula to $|Y_{t}^{0,N}-Y_{t}^{0}|^{2}$ and $|Y_{t}^{i,N}-Y_{t}^{i}|^{2}$ , we have

		$\displaystyle\mathbb{E}\big{[}\|Y_{t}^{j,N}-Y_{t}^{j}\|^{2}+\int_{t}^{T}\|Z_{s}^{j,N}-Z_{s}^{j}\|^{2}ds\big{]}\leq C\int_{t}^{T}\mathbb{E}[\|Y_{s}^{j,N}-Y_{s}^{j}\|^{2}]ds+C\mathbb{E}[\|X_{T}^{j,N}-X_{T}^{j}\|^{2}]+C_{j}\mathbb{E}[\|X_{T}^{0,N}-X_{T}^{0}\|^{2}]$
		$\displaystyle+C\mathbb{E}\big{[}\big{\|}\frac{1}{N}\sum_{i=1}^{N}X_{T}^{i,N}-\mathbb{E}_{T}[X_{T}^{i}]\big{\|}^{2}\big{]}+C\int_{t}^{T}\sum_{k=0,1,i}\mathbb{E}[\|X_{s}^{k,N}-X_{s}^{k}\|^{2}]ds+C_{j}\mathbb{E}\big{[}\int_{t}^{T}(\|Y_{s}^{0,N}-Y_{s}^{0}\|^{2}+\|Z_{s}^{0,N}-Z_{s}^{0}\|^{2})ds\big{]}$

		$\displaystyle+C\int_{t}^{T}\mathbb{E}\Big{[}\Big{\|}\frac{1}{N}\sum_{i=1}^{N}X_{s}^{i,N}-\mathbb{E}_{s}[X_{s}^{i}]\Big{\|}^{2}\Big{]}ds+C\int_{t}^{T}\mathbb{E}\Big{[}\Big{\|}\frac{1}{N}\sum_{i=1}^{N}\Psi(r,X_{r}^{0,N},X_{r}^{i,N})-\mathbb{E}_{r}[\Psi(r,X_{r}^{0},X_{r}^{i})]\Big{\|}^{2}\Big{]}dr$		(A.2)
		$\displaystyle+C\int_{t}^{T}\mathbb{E}\Big{[}\Big{\|}\frac{1}{N}\sum_{i=1}^{N}Y_{s}^{i,N}-\mathbb{E}_{s}[Y_{s}^{i}]\Big{\|}^{2}\Big{]}ds+C\int_{t}^{T}\mathbb{E}\Big{[}\Big{\|}\frac{1}{N}\sum_{i=1}^{N}Z_{s}^{i,N}-\mathbb{E}_{s}[Z_{s}^{i}]\Big{\|}^{2}\Big{]}ds$
		$\displaystyle\leq C\int_{t}^{T}\mathbb{E}[\|Y_{s}^{i,N}-Y_{s}^{i}\|^{2}]ds+C\mathbb{E}[\|X_{T}^{j,N}-X_{T}^{j}\|^{2}]+C_{j}\mathbb{E}[\|X_{T}^{0,N}-X_{T}^{0}\|^{2}]+C\int_{t}^{T}\sum_{k=0,1,i}\mathbb{E}[\|X_{s}^{k,N}-X_{s}^{k}\|^{2}]ds$
		$\displaystyle+C_{j}\mathbb{E}\big{[}\int_{t}^{T}(\|Y_{s}^{0,N}-Y_{s}^{0}\|^{2}+\|Z_{s}^{0,N}-Z_{s}^{0}\|^{2})ds\big{]}+\frac{C}{N},$

where $C_{0}=0$ and $\{C_{j}\}_{j=1}^{N}$ are positive constants. Combining (A.1) and (A.2),

\mathop{\rm sup}_{0\leq i\leq N}\Big{\{}\mathop{\rm sup}_{0\leq t\leq T}\mathbb{E}[|Y_{t}^{i,N}-Y_{t}^{i}|^{2}]+\mathbb{E}[\int_{0}^{T}|Z_{t}^{i,N}-Z_{t}^{i}|^{2}dt]\Big{\}}\leq\frac{C}{N}.

As to the functionals,

		$\displaystyle\|J^{N}_{0}-J_{0}\|\leq\|\Gamma^{0}(Y_{0}^{0,N})-\Gamma^{0}(Y_{0}^{0})\|+\mathbb{E}\Big{[}\int_{0}^{T}\Big{\|}g^{0}\Big{(}t,\Theta_{t}^{0,N},\Psi^{0}(t,X_{t}^{0,N},X_{t}^{1,N});\Theta_{t}^{(N)},u_{t}^{(N)}\Big{)}$
		$\displaystyle-g^{0}\Big{(}t,\Theta_{t}^{0},\Psi^{0}(t,X_{t}^{0},X_{t}^{1});\overline{\Theta}_{t}^{i},\mathbb{E}_{t}\big{[}\Psi(t,X_{t}^{0},X_{t}^{i})\big{]}\Big{)}\Big{\|}dt\Big{]}$
		$\displaystyle\leq C(1+\|Y_{0}^{0,N}\|+\|Y_{0}^{0}\|)\cdot\|Y_{0}^{0,N}-Y_{0}^{0}\|$
		$\displaystyle+C\mathbb{E}\Big{[}\int_{0}^{T}\big{(}1+\|\Theta_{t}^{0,N}\|+\|\Theta_{t}^{0}\|+\|\Theta_{t}^{(N)}\|+\|\overline{\Theta}_{t}^{i}\|+\|u_{t}^{(N)}\|+\|\mathbb{E}_{t}[\Psi(t,X_{t}^{0},X_{t}^{1})]\|\big{)}$
		$\displaystyle\big{(}\|\Theta_{t}^{0,N}-\Theta_{t}^{0}\|+\|\Theta_{t}^{(N)}-\overline{\Theta}_{t}^{i}\|+\|u_{t}^{(N)}-\mathbb{E}_{t}[\Psi(t,X_{t}^{0},X_{t}^{1})]\|\big{)}dt\Big{]}$
		$\displaystyle\leq C(1+\|Y_{0}^{0,N}\|+\|Y_{0}^{0}\|)\cdot\|Y_{0}^{0,N}-Y_{0}^{0}\|$
		$\displaystyle+C\Big{(}\mathbb{E}\Big{[}\int_{0}^{T}\big{(}1+\|\Theta_{t}^{0,N}\|^{2}+\|\Theta_{t}^{0}\|^{2}+\|\Theta_{t}^{(N)}\|^{2}+\|\overline{\Theta}_{t}^{i}\|^{2}+\|u_{t}^{(N)}\|^{2}+\|\Psi(t,X_{t}^{0},X_{t}^{1})\|^{2}\big{)}dt\Big{]}\Big{)}^{\frac{1}{2}}\cdot$
		$\displaystyle\Big{(}\mathbb{E}\Big{[}\int_{0}^{T}\big{(}\|\Theta_{t}^{0,N}-\Theta_{t}^{0}\|^{2}+\|\Theta_{t}^{(N)}-\overline{\Theta}_{t}^{i}\|^{2}+\|u_{t}^{(N)}-\mathbb{E}_{t}[\Psi(t,X_{t}^{0},X_{t}^{1})]\|^{2}\big{)}dt\Big{]}\Big{)}^{\frac{1}{2}}\leq\frac{C}{\sqrt{N}}.$

We now study the unilateral deviation of $\mathcal{A}_{0}$ from the strategy $u_{t}^{0,N}=\Psi^{0}(t,X_{t}^{0,N},X_{t}^{1,N})$ . Assume that $\mathcal{A}_{0}$ adopts a different control $u^{0}\in\mathcal{U}_{d}^{0}$ while $\{\mathcal{A}_{i}\}_{i=1}^{N}$ continue to apply $\{u_{t}^{i,N}\}_{i=1}^{N}$ . The resulting perturbed states, denoted by $(\widehat{X}_{t}^{i,N})_{i\geq 0}$ , satisfy

\left\{\begin{aligned} d\widehat{X}_{t}^{0,N}=&b^{0}\big{(}t,\widehat{X}_{t}^{0,N},u^{0}_{t};\widehat{X}_{t}^{(N)},{u_{t}^{(N)}}\big{)}dt+{\sigma^{0}\big{(}t,\widehat{X}_{t}^{0,N};\widehat{X}_{t}^{(N)},{u_{t}^{(N)}}\big{)}}dW^{0}_{t},\quad\widehat{X}_{0}^{0,N}=x_{0}^{0},\\ d\widehat{X}_{t}^{i,N}=&b\big{(}t,\widehat{X}_{t}^{i,N},u_{t}^{i,*,N};\widehat{X}_{t}^{0,N},u_{t}^{0};\widehat{X}_{t}^{(N)},u_{t}^{(N)}\big{)}dt+\sigma\big{(}t,\widehat{X}_{t}^{i,N},\widehat{X}_{t}^{0,N};\widehat{X}_{t}^{(N)},u_{t}^{(N)}\big{)}dW_{t}^{i},\quad\widehat{X}_{0}^{i,N}=x_{0},\\ \end{aligned}\right.

and the related limiting processes are given by

\left\{\begin{aligned} d\widehat{X}_{t}^{0}=&b^{0}\big{(}t,\widehat{X}_{t}^{0},u^{0}_{t};\mathbb{E}_{t}[\widehat{X}_{t}^{i}],\overline{u}_{t}^{i,N}\big{)}dt+{\sigma^{0}\big{(}t,\widehat{X}_{t}^{0};\mathbb{E}_{t}[\widehat{X}_{t}^{i}],\overline{u}_{t}^{i,N}\big{)}}dW^{0}_{t},\quad\widehat{X}_{0}^{0}=x_{0}^{0},\\ d\widehat{X}_{t}^{i}=&b\big{(}t,\widehat{X}_{t}^{i},{u}_{t}^{i,N};\widehat{X}_{t}^{0},u_{t}^{0};\mathbb{E}_{t}[\widehat{X}_{t}^{i}],\overline{u}_{t}^{i,N}\big{)}dt+\sigma\big{(}t,\widehat{X}_{t}^{i},\widehat{X}_{t}^{0};\mathbb{E}_{t}[\widehat{X}_{t}^{i}],\overline{u}_{t}^{i,N}\big{)}dW_{t}^{i},\quad\widehat{X}_{0}^{i}=x_{0}.\end{aligned}\right.

Similar to (A.1), $\mathop{\rm sup}_{0\leq i\leq N}\mathbb{E}[\mathop{\rm sup}_{0\leq t\leq T}|\widehat{X}_{t}^{i,N}-\widehat{X}_{t}^{i}|^{2}]\leq\frac{C}{N}.$ By the same estimates as those for $\|J_{0}^{N}-J_{0}\|$ above,

|\widehat{J}^{N}_{0}-\widehat{J}_{0}|\leq\frac{C}{\sqrt{N}}.

(A.3)

Since $\big{(}u^{0,*}_{t},u^{1,*}_{t},u_{t}^{j,*}\big{)}$ is an equilibrium strategy of the limiting triple-agent game problem, it is clear that $\widehat{J}_{0}\leq{J}_{0}$ . Combining this with the estimate $\|J_{0}^{N}-J_{0}\|\leq\frac{C}{\sqrt{N}}$ above and (A.3), we get the desired result for $\mathcal{A}_{0}$ .
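The final combination can be spelled out as the following chain. This is a sketch in which $\widehat{J}_{0}^{N}$ and $J_{0}^{N}$ denote the functional of $\mathcal{A}_{0}$ in the $N$ -agent game under the deviation $u^{0}$ and under the candidate strategy, respectively, and $C$ is the generic constant from the two estimates, assumed here not to depend on $N$ .

```latex
% Sketch: (A.3), then optimality in the limiting game, then the estimate of |J_0^N - J_0|.
\[
\widehat{J}_{0}^{N}
\;\leq\;\widehat{J}_{0}+\frac{C}{\sqrt{N}}
\;\leq\;J_{0}+\frac{C}{\sqrt{N}}
\;\leq\;J_{0}^{N}+\frac{2C}{\sqrt{N}},
\qquad\text{so that}\qquad
\varepsilon_{N}=\frac{2C}{\sqrt{N}}=O\big(N^{-\frac{1}{2}}\big).
\]
```

The first inequality uses (A.3), the second the equilibrium property $\widehat{J}_{0}\leq J_{0}$ , and the third the estimate of $\|J_{0}^{N}-J_{0}\|$ .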

A.2 Some notations in Section 5

1. Constants in (5.5): $a=\frac{1}{2}(1-\mu_{3}-\mu^{0}_{2}\mu_{4})^{-1}$ and

\begin{array}[]{llll}&{a_{1}^{0}}^{\top}=a\big{(}(1-\mu_{3})R_{0}^{-1}b_{2}^{0},(1-\mu_{3})R_{0}^{-1}b_{4},\mu^{0}_{2}R^{-1}b_{2}\big{)},&{a_{3}^{0}}^{\top}=-a\big{(}(1-\mu_{3})R_{0}^{-1}f_{4}^{0},(1-\mu_{3})R_{0}^{-1}f_{8},\mu^{0}_{2}R^{-1}f_{4}\big{)},\\ &a_{1}^{\top}=a\big{(}\mu_{4}R_{0}^{-1}b_{2}^{0},\mu_{4}R_{0}^{-1}b_{4},(\mu_{3}+\mu_{4}\mu^{0}_{2})R^{-1}b_{2}\big{)},&a_{3}^{\top}=-\big{(}a\mu_{4}R_{0}^{-1}f_{4}^{0},a\mu_{4}R_{0}^{-1}f_{8},(a-\frac{1}{2})R^{-1}f_{4}\big{)},\\ &a_{7}^{\top}=\frac{1}{2}R^{-1}b_{2}(0,0,1),&a_{8}^{\top}=-\frac{1}{2}R^{-1}f_{4}(0,0,1).\\ \end{array}

2. Matrices in equation (5.6):

\mathbb{A}_{1}=\left(\begin{array}[]{cc}b_{1}^{0}&0\\ b_{3}&b_{1}\\ \end{array}\right),\ \mathbb{A}_{2}=\left(\begin{array}[]{cc}0&b_{3}^{0}\\ 0&b_{5}\\ \end{array}\right),\ \mathbb{A}_{3}=\left(\begin{array}[]{cc}f_{1}^{0}&0\\ f_{5}&f_{1}\\ \end{array}\right),\ \mathbb{A}_{4}=\left(\begin{array}[]{cc}0&f_{5}^{0}\\ 0&f_{9}\\ \end{array}\right),\ \mathbb{A}_{5}=\left(\begin{array}[]{cc}-2Q_{0}&0\\ 2\mu^{0}_{1}Q_{0}&0\\ 2\mu_{2}Q&-2Q\\ \end{array}\right),\

\mathbb{A}_{6}=\left(\begin{array}[]{cc}0&2\mu^{0}_{1}Q_{0}\\ 0&-2|\mu^{0}_{1}|^{2}Q_{0}\\ 0&2\mu_{1}Q\\ \end{array}\right),\ \mathbb{A}_{7}=\left(\begin{array}[]{cc}\Phi_{1}^{0}&0\\ \Phi_{2}&\Phi_{1}\\ \end{array}\right),\ \mathbb{A}_{8}=\left(\begin{array}[]{ccccc}0&\Phi_{2}^{0}\\ 0&\Phi_{3}\\ \end{array}\right),\ \mathbb{B}_{1}=\left(\begin{array}[]{c}0\\ b_{2}a_{7}^{\top}\\ \end{array}\right),\

\mathbb{B}_{2}=\left(\begin{array}[]{c}b_{2}^{0}{a_{1}^{0}}^{\top}+b_{4}^{0}{a}_{1}^{\top}+b_{4}^{0}{a}_{7}^{\top}\\ (b_{2}+b_{6})a_{1}^{\top}+b_{4}{a_{1}^{0}}^{\top}+b_{6}{a}_{7}^{\top}\\ \end{array}\right),\ \mathbb{B}_{3}=\left(\begin{array}[]{c}0\\ f_{4}a_{7}^{\top}\\ \end{array}\right),\ \mathbb{B}_{4}=\left(\begin{array}[]{c}f_{4}^{0}{a_{1}^{0}}^{\top}+f_{8}^{0}(a_{1}+a_{7})^{\top}\\ f_{4}a_{1}^{\top}+f_{8}{a_{1}^{0}}^{\top}+f_{12}(a_{1}+a_{7})^{\top}\\ \end{array}\right),\

\mathbb{B}_{5}=\left(\begin{array}[]{ccc}b_{1}^{0}&b_{3}&0\\ 0&b_{1}&0\\ 0&0&b_{1}\end{array}\right),\ \mathbb{B}_{6}=\left(\begin{array}[]{ccc}0&0&0\\ b_{3}^{0}&b_{5}&0\\ 0&0&0\end{array}\right),\ \mathbb{C}_{1}=\left(\begin{array}[]{c}0\\ b_{2}a_{8}^{\top}\\ \end{array}\right),\ \mathbb{C}_{2}=\left(\begin{array}[]{c}b_{2}^{0}{a_{3}^{0}}^{\top}+b_{4}^{0}{a}_{3}^{\top}+b_{4}^{0}{a}_{8}^{\top}\\ (b_{2}+b_{6})a_{3}^{\top}+b_{4}{a_{3}^{0}}^{\top}+b_{6}{a}_{8}^{\top}\\ \end{array}\right),\

\mathbb{C}_{3}=\left(\begin{array}[]{ccc}f_{2}^{0}&f_{6}&0\\ 0&f_{2}&0\\ 0&0&f_{2}\\ \end{array}\right),\ \mathbb{C}_{4}=\left(\begin{array}[]{ccccc}0&0&0\\ f_{6}^{0}&f_{10}&0\\ 0&0&0\\ \end{array}\right),\ \mathbb{C}_{5}=\left(\begin{array}[]{ccc}f_{3}^{0}&f_{7}&0\\ 0&f_{3}&0\\ 0&0&f_{3}\\ \end{array}\right),\ \mathbb{C}_{6}=\left(\begin{array}[]{ccccc}0&0&0\\ f_{7}^{0}&f_{11}&0\\ 0&0&0\\ \end{array}\right),\

\mathbb{C}_{7}=\left(\begin{array}[]{c}0\\ f_{4}a_{8}^{\top}\\ \end{array}\right),\ \mathbb{C}_{8}=\left(\begin{array}[]{c}f_{4}^{0}{a_{3}^{0}}^{\top}+f_{8}^{0}(a_{3}+a_{8})^{\top}\\ f_{4}a_{3}^{\top}+f_{8}{a_{3}^{0}}^{\top}+f_{12}(a_{3}+a_{8})^{\top}\\ \end{array}\right),\ \mathbb{C}_{9}=\left(\begin{array}[]{ccccc}-f_{1}^{0}&-f_{5}&0\\ 0&-f_{1}&0\\ 0&0&-f_{1}\\ \end{array}\right),\

\mathbb{C}_{10}=\left(\begin{array}[]{ccccc}0&0&0\\ -f_{5}^{0}&-f_{9}&0\\ 0&0&0\\ \end{array}\right),\ \mathbb{C}_{11}=\left(\begin{array}[]{ccc}-\Phi_{1}^{0}&-\Phi_{2}&0\\ 0&-\Phi_{1}&0\\ 0&0&-\Phi_{1}\\ \end{array}\right),\ \mathbb{C}_{12}=\left(\begin{array}[]{ccc}0&0&0\\ -\Phi_{2}^{0}&-\Phi_{3}&0\\ 0&0&0\\ \end{array}\right),\

\mathbb{D}_{1}=\left(\begin{array}[]{cc}f_{2}^{0}&0\\ f_{6}&f_{2}\\ \end{array}\right),\ \mathbb{D}_{2}=\left(\begin{array}[]{cc}0&f_{6}^{0}\\ 0&f_{10}\\ \end{array}\right),\ \mathbb{F}_{1}=\left(\begin{array}[]{cc}f_{3}^{0}&0\\ f_{7}&f_{3}\\ \end{array}\right),\ \mathbb{F}_{2}=\left(\begin{array}[]{cc}0&f_{7}^{0}\\ 0&f_{11}\\ \end{array}\right),\ \mathbb{\rho}=\left(\begin{array}[]{cc}2\gamma_{0}&0\\ 0&0\\ 0&2\gamma\\ \end{array}\right).

3. The $5\times 5$ block matrices in equations (5.8) and (5.9):

\mathbb{H}_{1}=\left(\begin{array}[]{cc}\mathbb{A}_{1}+\mathbb{A}_{2}&\mathbb{C}_{1}+\mathbb{C}_{2}\\ \rho(\mathbb{A}_{3}+\mathbb{A}_{4})&\rho(\mathbb{C}_{7}+\mathbb{C}_{8})\\ \end{array}\right),\ \mathbb{H}_{2}=\left(\begin{array}[]{cc}(\mathbb{C}_{7}+\mathbb{C}_{8})\rho+\mathbb{D}_{1}+\mathbb{D}_{2}&\mathbb{B}_{3}+\mathbb{B}_{4}\\ (\mathbb{C}_{9}+\mathbb{C}_{10})\rho&\mathbb{B}_{5}+\mathbb{B}_{6}\\ \end{array}\right),\

\mathbb{H}_{3}=\left(\begin{array}[]{cc}(\mathbb{C}_{1}+\mathbb{C}_{2})\rho&\mathbb{B}_{1}+\mathbb{B}_{2}\\ (\mathbb{C}_{3}+\mathbb{C}_{4})\rho+\rho(\mathbb{C}_{7}+\mathbb{C}_{8})\rho+\rho(\mathbb{D}_{1}+\mathbb{D}_{2})&\rho(\mathbb{B}_{3}+\mathbb{B}_{4})\\ \end{array}\right),\ \mathbb{H}_{4}=\left(\begin{array}[]{cc}\mathbb{F}_{1}+\mathbb{F}_{2}&0\\ 0&0\\ \end{array}\right),\

\mathbb{H}_{5}=\left(\begin{array}[]{cc}0&0\\ 0&\mathbb{C}_{5}+\mathbb{C}_{6}\\ \end{array}\right),\ \mathbb{H}_{6}=\left(\begin{array}[]{cc}0&0\\ (\mathbb{C}_{5}+\mathbb{C}_{6})\rho&0\\ \end{array}\right),\ \mathbb{H}_{7}=\left(\begin{array}[]{cc}\mathbb{A}_{3}+\mathbb{A}_{4}&\mathbb{C}_{7}+\mathbb{C}_{8}\\ \mathbb{A}_{5}+\mathbb{A}_{6}&\mathbb{C}_{9}+\mathbb{C}_{10}\\ \end{array}\right),\

\mathbb{G}_{1}=\left(\begin{array}[]{cc}0&0\\ (\mathbb{C}_{11}+\mathbb{C}_{12})\rho&0\\ \end{array}\right),\ \mathbb{G}_{2}=\left(\begin{array}[]{cc}\mathbb{A}_{7}+\mathbb{A}_{8}&0\\ 0&\mathbb{C}_{11}+\mathbb{C}_{12}\\ \end{array}\right),\ \widetilde{\rho}=\left(\begin{array}[]{cc}0&0\\ \rho&0\\ \end{array}\right).\

4. Constant vectors in equation (5.12):

	$\displaystyle\mathbf{\Lambda}_{1}=$	$\displaystyle\big{(}b_{3},b_{5},-[(b_{2}+b_{6})\mu_{4}+b_{4}(1-\mu_{3})]aR_{0}^{-1}f_{4}^{0},-[(b_{2}+b_{6})\mu_{4}+b_{4}(1-\mu_{3})]aR_{0}^{-1}f_{8},-[b_{2}+b_{4}\mu^{0}_{2}+b_{6}]aR^{-1}f_{4}\big{)},$
	$\displaystyle\mathbf{\Lambda}_{2}=$	$\displaystyle\big{(}0,0,\mathbf{\Lambda}_{3}\big{)},\ \mathbf{\Lambda}_{4}=(2Q\mu_{2},2Q\mu_{1},0,0,-f_{1}),$
	$\displaystyle\mathbf{\Lambda}_{3}=$	$\displaystyle\big{(}[b_{2}\mu_{4}+b_{4}(1-\mu_{3})+b_{6}\mu_{4}]aR_{0}^{-1}b_{2}^{0},[b_{2}\mu_{4}+b_{4}(1-\mu_{3})+b_{6}\mu_{4}]aR_{0}^{-1}b_{4},[b_{2}(\mu_{3}+\mu_{4}\mu^{0}_{2})+b_{4}\mu^{0}_{2}+b_{6}]aR^{-1}b_{2}\big{)}.$

A.3 Proof of Proposition 5.1

The uniqueness part follows by standard arguments, so we focus on the existence part. By Lemma 6.1, we can substitute the solution $(\overline{\mathbf{X}}_{t},\overline{\mathbf{L}}_{t})^{\top}=(\overline{X}^{0}_{t},\overline{X}^{1}_{t},\overline{L}_{t}^{0},\overline{L}_{t},\overline{L}^{\ddagger}_{t})^{\top},$ $(\overline{\mathbf{Y}}_{t},\overline{\mathbf{P}}_{t})^{\top}=(\overline{Y}_{t}^{0},\overline{Y}_{t}^{1},\overline{P}^{0}_{t},\overline{P}_{t},\overline{P}^{\ddagger}_{t})^{\top},\ (\overline{\mathbf{Z}}_{t},\overline{\mathbf{Q}}_{t})^{\top}=(\overline{Z}_{t}^{0},\overline{Z}_{t}^{1},\overline{Q}^{00}_{t},\overline{Q}^{10}_{t},\overline{Q}^{10,\ddagger}_{t})^{\top}$ of (5.7) into (5.6). Then, a solution of (5.6) can be constructed in the following three steps.

Step 1. The construction of $(X^{0},Y^{0},Z^{0})$ and $(L^{0},L,L^{\ddagger};P^{0},Q^{00},Q^{01};P,Q^{10},Q^{11})$ .

The triple $(X^{0},Y^{0},Z^{0})$ (if it exists) should be $\mathbb{F}^{0}$ -adapted, so we define $X^{0}_{t}=\overline{X}^{0}_{t},\ Y^{0}_{t}=\overline{Y}^{0}_{t},\ Z^{0}_{t}=\overline{Z}^{0}_{t}.$ From (5.6), the adjoint equations (except those of $(P^{\ddagger},Q^{10,\ddagger},Q^{11,\ddagger})$ ) satisfy

\left\{\begin{aligned} dL_{t}^{0}=&[f_{2}^{0}L_{t}^{0}+f_{6}L_{t}]dt+[f_{3}^{0}L_{t}^{0}+f_{7}L_{t}]dW^{0}_{t},\quad L_{0}^{0}=2\gamma_{0}Y_{0}^{0},\\ dL_{t}=&\big{[}f_{2}L_{t}+f_{10}\overline{L}_{t}+f_{6}^{0}\overline{L}_{t}^{0}\big{]}dt+\big{[}f_{3}L_{t}+f_{11}\overline{L}_{t}+f_{7}^{0}\overline{L}_{t}^{0}\big{]}dW_{t}^{0},\quad L_{0}=0,\\ dL_{t}^{\ddagger}=&f_{2}L_{t}^{\ddagger}dt+f_{3}L_{t}^{\ddagger}dW_{t}^{0},\quad\ L_{0}^{\ddagger}=2\gamma\overline{Y}_{0}^{1},\\ \end{aligned}\right.

\left\{\begin{aligned} -d{P_{t}^{0}}=&\Big{[}b_{1}^{0}{P_{t}^{0}}+b_{3}{P_{t}}-f_{1}^{0}L_{t}^{0}-f_{5}L_{t}-2Q_{0}\big{(}{X_{t}^{0}}-\mu^{0}_{1}\overline{X}_{t}^{1}\big{)}\Big{]}dt-{Q_{t}^{00}}dW^{0}_{t}-{Q_{t}^{01}}dW_{t}^{1},\\ -d{P_{t}}=&\Big{[}b_{1}{P_{t}}-f_{1}L_{t}+b_{5}\overline{P}_{t}+b_{3}^{0}\overline{P}_{t}^{0}-f_{5}^{0}\overline{L}_{t}^{0}-f_{9}\overline{L}_{t}+2\mu^{0}_{1}Q_{0}\big{(}X_{t}^{0}-\mu^{0}_{1}\overline{X}_{t}^{1}\big{)}\Big{]}dt\\ &-{Q_{t}^{10}}dW^{0}_{t}-{Q_{t}^{11}}dW_{t}^{1},\\ {P_{T}^{0}}=&-L_{T}^{0}\Phi_{1}^{0}-L_{T}\Phi_{2},\quad{P_{T}}=-L_{T}\Phi_{1}-\overline{L}_{T}^{0}\Phi_{2}^{0}-\overline{L}_{T}\Phi_{3}.\end{aligned}\right.

(A.4)

(A.4) is a linear decoupled FBSDE with $L^{2}$ nonhomogeneous terms; thus it admits a unique solution $(L^{0},L,L^{\ddagger};P^{0},Q^{00},Q^{01};P,Q^{10},Q^{11})$ . Moreover, comparing (A.4) with (5.7), we have

L_{t}^{0}=\overline{L}_{t}^{0},\ L_{t}=\overline{L}_{t},\ L_{t}^{\ddagger}=\overline{L}^{\ddagger}_{t},\ P_{t}^{0}=\overline{P}_{t}^{0},\ Q^{00}_{t}=\overline{Q}_{t}^{00},Q^{01}_{t}=0,\ P_{t}=\overline{P}_{t},\ Q^{10}_{t}=\overline{Q}_{t}^{10},Q^{11}_{t}=0,\ t\in[0,T].

Step 2. The construction of $(X^{1},P^{\ddagger},Q^{10,\ddagger},Q^{11,\ddagger})$ .

It follows from the system (5.6) that the 4-tuple $(X^{1},P^{\ddagger},Q^{10,\ddagger},Q^{11,\ddagger})$ satisfies

\left\{\begin{aligned} dX_{t}^{1}=&\Big{[}b_{1}X_{t}^{1}+b_{2}a_{7}P_{t}^{\ddagger}+\tilde{b}_{t}\Big{]}dt+\sigma dW_{t}^{1},\ {{X}_{0}^{1}}=x_{0},\\ -d{{P}_{t}^{\ddagger}}=&\Big{[}b_{1}{P_{t}^{\ddagger}}-2Q{X_{t}^{1}}+\tilde{f}_{t}\Big{]}dt-{Q_{t}^{10,\ddagger}}dW^{0}_{t}-{Q_{t}^{11,\ddagger}}dW_{t}^{1},\ {P_{T}^{\ddagger}}=-L_{T}^{\ddagger}\Phi_{1},\end{aligned}\right.

(A.5)

where $\tilde{b}_{t}=\mathbf{\Lambda}_{1}\overline{\mathbf{X}}_{t}+\mathbf{\Lambda}_{2}\overline{\mathbf{Y}}_{t}=[\mathbf{\Lambda}_{1}+\mathbf{\Lambda}_{2}(I+S_{t}\rho)^{-1}S_{t}]\overline{\mathbf{X}}_{t}+\mathbf{\Lambda}_{2}(I+S_{t}\rho)^{-1}\Upsilon_{t}$ and $\tilde{f}_{t}=\mathbf{\Lambda}_{4}\overline{\mathbf{X}}_{t}$ are $\mathbb{F}^{0}$ -adapted. Next we introduce the following Riccati equation and BSDE

\left\{\begin{aligned} &\dot{\Sigma}_{t}+2b_{1}\Sigma_{t}+\frac{1}{2}R^{-1}|b_{2}|^{2}|\Sigma_{t}|^{2}-2Q=0,\ \ \ \Sigma_{T}=0,\\ &-dp_{t}=[(\frac{1}{2}R^{-1}|b_{2}|^{2}\Sigma_{t}+b_{1})p_{t}+\Sigma_{t}\tilde{b}_{t}+\tilde{f}_{t}]dt-q_{t}^{0}dW_{t}^{0}-q_{t}^{1}dW_{t}^{1},\ \ \ p_{T}=-L_{T}^{\ddagger}\Phi_{1},\end{aligned}\right.

Under (A6), it follows from Theorem 4.3 in [31] (see p.48) that $\Sigma$ takes the form (5.11). Since $\Sigma$ is bounded, the above linear BSDE has a unique $\mathbb{F}^{0}$ -adapted solution $(p,q^{0},0)$ satisfying (5.12). By the relation $P_{t}^{\ddagger}=\Sigma_{t}X_{t}^{1}+p_{t},$ one can show the well-posedness of (A.5).
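To see where the Riccati equation and the BSDE for $p$ come from, the decoupling relation $P_{t}^{\ddagger}=\Sigma_{t}X_{t}^{1}+p_{t}$ can be checked by Itô's formula. The sketch below assumes the scalar case and reads the term $b_{2}a_{7}P_{t}^{\ddagger}$ in (A.5) as $\frac{1}{2}R^{-1}|b_{2}|^{2}P_{t}^{\ddagger}$ , which is our interpretation based on the definition of $a_{7}$ in A.2.

```latex
% Sketch (scalar case; b_2 a_7 P^{\ddagger} is read as (1/2) R^{-1} |b_2|^2 P^{\ddagger}, an assumption).
% Differentiate the ansatz P^{\ddagger} = \Sigma X^1 + p using the forward equation in (A.5):
\[
\begin{aligned}
dP_{t}^{\ddagger}
 &=\Big[\dot{\Sigma}_{t}X_{t}^{1}
   +\Sigma_{t}\Big(b_{1}X_{t}^{1}+\tfrac{1}{2}R^{-1}|b_{2}|^{2}(\Sigma_{t}X_{t}^{1}+p_{t})+\tilde{b}_{t}\Big)\Big]dt
   +\Sigma_{t}\sigma\,dW_{t}^{1}+dp_{t},\\
dP_{t}^{\ddagger}
 &=-\Big[b_{1}(\Sigma_{t}X_{t}^{1}+p_{t})-2QX_{t}^{1}+\tilde{f}_{t}\Big]dt
   +Q_{t}^{10,\ddagger}dW_{t}^{0}+Q_{t}^{11,\ddagger}dW_{t}^{1}.
\end{aligned}
\]
% Matching the coefficients of X^1 in the drift gives the Riccati equation,
% and the remaining drift terms give the equation for p:
\[
\begin{aligned}
&\dot{\Sigma}_{t}+2b_{1}\Sigma_{t}+\tfrac{1}{2}R^{-1}|b_{2}|^{2}|\Sigma_{t}|^{2}-2Q=0,\\
&-dp_{t}=\Big[\big(\tfrac{1}{2}R^{-1}|b_{2}|^{2}\Sigma_{t}+b_{1}\big)p_{t}+\Sigma_{t}\tilde{b}_{t}+\tilde{f}_{t}\Big]dt
  -q_{t}^{0}dW_{t}^{0}-q_{t}^{1}dW_{t}^{1},\\
&Q_{t}^{10,\ddagger}=q_{t}^{0},\qquad Q_{t}^{11,\ddagger}=\Sigma_{t}\sigma+q_{t}^{1}.
\end{aligned}
\]
```

The terminal conditions $\Sigma_{T}=0$ and $p_{T}=-L_{T}^{\ddagger}\Phi_{1}$ then reproduce $P_{T}^{\ddagger}=-L_{T}^{\ddagger}\Phi_{1}$ . With $q^{1}=0$ this gives $Q_{t}^{11,\ddagger}=\Sigma_{t}\sigma$ .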

Step 3. The construction of $(Y^{1},Z^{1,0},Z^{1,1})$ .

Denote $f_{t}=f_{1}X_{t}^{1}+f_{5}X_{t}^{0}+f_{6}Y_{t}^{0}+f_{7}Z_{t}^{0}+f_{9}\overline{X}_{t}^{1}+f_{10}\overline{Y}_{t}^{1}+{f_{11}\overline{Z}_{t}^{1,0}}+f_{4}a_{7}P_{t}^{\ddagger}+(f_{4}a_{1}+f_{8}a_{1}^{0}+f_{12}a_{1}+f_{12}a_{7})^{\top}\overline{\mathbf{P}}_{t}+[(f_{4}+f_{12})(a_{3}+a_{8})+f_{8}{a}_{3}^{0}]^{\top}\mathbf{L}_{t}+f_{4}a_{7}^{\top}\mathbf{P}_{t}.$ Then $(Y^{1},Z^{1,0},Z^{1,1})$ is the unique $\mathbb{F}^{0,1}$ -adapted solution of the following BSDE

\displaystyle-dY_{t}^{1}=\Big{[}f_{2}Y_{t}^{1}+{f_{3}Z_{t}^{1,0}}+f_{t}\Big{]}dt-{Z_{t}^{1,0}dW_{t}^{0}-Z_{t}^{1,1}dW_{t}^{1}},\ Y_{T}^{1}=\Phi_{1}X_{T}^{1}+\Phi_{2}X_{T}^{0}+\Phi_{3}\overline{X}_{T}^{1}+\xi^{1}.

Finally, combining the above three steps, we obtain a solution of the system (5.6).

References

[1] A. Agrawal and R. E. Barlow, A survey of network reliability and domination theory, Operations Research, 32 (1984), pp. 478–492.
[2] D. Andersson and B. Djehiche, A maximum principle for SDEs of mean-field type, Applied Mathematics & Optimization, 63 (2011), pp. 341–356.
[3] R. J. Aumann, M. Maschler, and R. E. Stearns, Repeated games with incomplete information, MIT Press, 1995.
[4] J. Aurand and Y.-J. Huang, Mortality and healthcare: A stochastic control analysis under Epstein–Zin preferences, SIAM Journal on Control and Optimization, 59 (2021), pp. 4051–4080.
[5] A. Bensoussan, M. H. Chau, and S. C. Yam, Mean field games with a dominating player, Applied Mathematics & Optimization, 74 (2016), pp. 91–128.
[6] P. Bergault, P. Cardaliaguet, and C. Rainer, Mean field games in a Stackelberg problem with an informed major player, SIAM Journal on Control and Optimization, 62 (2024), pp. 1737–1765.
[7] R. Buckdahn, J. Li, and S. Peng, Nonlinear stochastic differential games involving a major player and a large number of collectively acting minor agents, SIAM Journal on Control and Optimization, 52 (2014), pp. 451–492.
[8] P. Cardaliaguet, M. Cirant, and A. Porretta, Remarks on Nash equilibria in mean field game models with a major player, Proceedings of the American Mathematical Society, 148 (2020), pp. 4241–4255.
[9] R. Carmona and F. Delarue, Extensions for volume I, Probabilistic Theory of Mean Field Games with Applications I: Mean Field FBSDEs, Control, and Games, (2018), pp. 619–680.
[10] R. Carmona and P. Wang, An alternative approach to mean field game with major and minor players, and applications to herders impacts, Applied Mathematics & Optimization, 76 (2017), pp. 5–27.
[11] R. A. Carmona and X. Zhu, A probabilistic approach to mean field games with major and minor players, Annals of Applied Probability, 26 (2016), pp. 1535–1580.
[12] Z. Chen and L. Epstein, Ambiguity, risk, and asset returns in continuous time, Econometrica, 70 (2002), pp. 1403–1443.
[13] F. Delarue, On the existence and uniqueness of solutions to FBSDEs in a non-degenerate case, Stochastic Processes and Their Applications, 99 (2002), pp. 209–286.
[14] K. Du, J. Huang, and Z. Wu, Linear quadratic mean-field-game of backward stochastic differential systems, Mathematical Control and Related Fields, 8 (2018), pp. 653–678.
[15] D. Duffie and L. G. Epstein, Stochastic differential utility, Econometrica, 60 (1992), pp. 353–394.
[16] N. El Karoui, S. Peng, and M. C. Quenez, A dynamic maximum principle for the optimization of recursive utilities under constraints, Annals of Applied Probability, 11 (2001), pp. 664–693.
[17] X. Feng, Y. Hu, and J. Huang, Backward Stackelberg differential game with constraints: a mixed terminal-perturbation and linear-quadratic approach, SIAM Journal on Control and Optimization, 60 (2022), pp. 1488–1518.
[18] Z. Hellman and Y. J. Levy, Measurable selection for purely atomic games, Econometrica, 87 (2019), pp. 593–629.
[19] M. Hu, S. Ji, and X. Xue, Optimization under rational expectations: A framework of fully coupled forward-backward stochastic linear quadratic systems, Mathematics of Operations Research, 48 (2023), pp. 1767–1790.
[20] Y. Hu, J. Huang, and W. Li, Backward stochastic differential equations with conditional reflection and related recursive optimal control problems, SIAM Journal on Control and Optimization, 62 (2024), pp. 2557–2589.
[21] J. Huang, W. Li, and H. Zhao, A class of optimal control problems of forward–backward systems with input constraint, Journal of Optimization Theory and Applications, 199 (2023), pp. 1050–1084.
[22] J. Huang, S. Wang, and Z. Wu, Backward mean-field linear-quadratic-Gaussian (LQG) games: full and partial information, IEEE Transactions on Automatic Control, 61 (2016), pp. 3784–3796.
[23] M. Huang, Large-population LQG games involving a major player: the Nash certainty equivalence principle, SIAM Journal on Control and Optimization, 48 (2010), pp. 3318–3353.
[24] M. Huang, R. P. Malhamé, and P. E. Caines, Large population stochastic dynamic games: closed-loop McKean-Vlasov systems and the Nash certainty equivalence principle, Communications in Information and Systems, 6 (2006), pp. 221–252.
[25] E. Kamenica, Bayesian persuasion and information design, Annual Review of Economics, 11 (2019), pp. 249–272.
[26] X.-I. Kartala, N. Englezos, and A. N. Yannacopoulos, Future expectations modeling, random coefficient forward–backward stochastic differential equations, and stochastic viscosity solutions, Mathematics of Operations Research, 45 (2020), pp. 403–433.
[27] J.-M. Lasry and P.-L. Lions, Mean field games, Japanese Journal of Mathematics, 2 (2007), pp. 229–260.
[28] A. Lazrak, Generalized stochastic differential utility and preference for information, Annals of Applied Probability, 14 (2004), pp. 2149–2175.
[29] A. Lazrak and M. C. Quenez, A generalized stochastic differential utility, Mathematics of Operations Research, 28 (2003), pp. 154–180.
[30] J. Ma, Z. Wu, D. Zhang, and J. Zhang, On well-posedness of forward-backward SDEs-a unified approach, Annals of Applied Probability, 25 (2015), pp. 2168–2214.
[31] J. Ma and J. Yong, Forward-Backward Stochastic Differential Equations and Their Applications, Lecture Notes in Mathematics, no. 1702, Springer Science & Business Media, 1999.
[32] Y. Ma and M. Huang, Linear quadratic mean field games with a major player: The multi-scale approach, Automatica, 113 (2020), p. 108774.
[33] M. Miller and P. Weller, Stochastic saddlepoint systems stabilization policy and the stock market, Journal of Economic Dynamics and Control, 19 (1995), pp. 279–302.
[34] M. Nourian and P. E. Caines, $\varepsilon$-Nash mean field game theory for nonlinear stochastic dynamical systems with major and minor agents, SIAM Journal on Control and Optimization, 51 (2013), pp. 3302–3331.
[35] S. T. Rachev and L. Rüschendorf, Mass Transportation Problems: Applications, Springer Science & Business Media, 2006.

$$
\begin{aligned}
&+C\int_{t}^{T}\mathbb{E}\Big[\Big\|\frac{1}{N}\sum_{i=1}^{N}X_{s}^{i,N}-\mathbb{E}_{s}[X_{s}^{i}]\Big\|^{2}\Big]ds+C\int_{t}^{T}\mathbb{E}\Big[\Big\|\frac{1}{N}\sum_{i=1}^{N}\Psi(r,X_{r}^{0,N},X_{r}^{i,N})-\mathbb{E}_{r}[\Psi(r,X_{r}^{0},X_{r}^{i})]\Big\|^{2}\Big]dr\\
&+C\int_{t}^{T}\mathbb{E}\Big[\Big\|\frac{1}{N}\sum_{i=1}^{N}Y_{s}^{i,N}-\mathbb{E}_{s}[Y_{s}^{i}]\Big\|^{2}\Big]ds+C\int_{t}^{T}\mathbb{E}\Big[\Big\|\frac{1}{N}\sum_{i=1}^{N}Z_{s}^{i,N}-\mathbb{E}_{s}[Z_{s}^{i}]\Big\|^{2}\Big]ds\\
&\leq C\int_{t}^{T}\mathbb{E}\big[\|Y_{s}^{i,N}-Y_{s}^{i}\|^{2}\big]ds+C\,\mathbb{E}\big[\|X_{T}^{j,N}-X_{T}^{j}\|^{2}\big]+C_{j}\,\mathbb{E}\big[\|X_{T}^{0,N}-X_{T}^{0}\|^{2}\big]+C\int_{t}^{T}\sum_{k=0,1,i}\mathbb{E}\big[\|X_{s}^{k,N}-X_{s}^{k}\|^{2}\big]ds\\
&\quad+C_{j}\,\mathbb{E}\Big[\int_{t}^{T}\big(\|Y_{s}^{0,N}-Y_{s}^{0}\|^{2}+\|Z_{s}^{0,N}-Z_{s}^{0}\|^{2}\big)ds\Big]+\frac{C}{N},
\end{aligned}
\tag{A.2}
$$

$$
\begin{aligned}
\|J^{N}_{0}-J_{0}\|
&\leq\|\Gamma^{0}(Y_{0}^{0,N})-\Gamma^{0}(Y_{0}^{0})\|+\mathbb{E}\Big[\int_{0}^{T}\Big\|g^{0}\Big(t,\Theta_{t}^{0,N},\Psi^{0}(t,X_{t}^{0,N},X_{t}^{1,N});\Theta_{t}^{(N)},u_{t}^{(N)}\Big)\\
&\qquad-g^{0}\Big(t,\Theta_{t}^{0},\Psi^{0}(t,X_{t}^{0},X_{t}^{1});\overline{\Theta}_{t}^{i},\mathbb{E}_{t}\big[\Psi(t,X_{t}^{0},X_{t}^{i})\big]\Big)\Big\|dt\Big]\\
&\leq C\big(1+\|Y_{0}^{0,N}\|+\|Y_{0}^{0}\|\big)\cdot\|Y_{0}^{0,N}-Y_{0}^{0}\|\\
&\quad+C\,\mathbb{E}\Big[\int_{0}^{T}\big(1+\|\Theta_{t}^{0,N}\|+\|\Theta_{t}^{0}\|+\|\Theta_{t}^{(N)}\|+\|\overline{\Theta}_{t}^{i}\|+\|u_{t}^{(N)}\|+\|\mathbb{E}_{t}[\Psi(t,X_{t}^{0},X_{t}^{1})]\|\big)\\
&\qquad\cdot\big(\|\Theta_{t}^{0,N}-\Theta_{t}^{0}\|+\|\Theta_{t}^{(N)}-\overline{\Theta}_{t}^{i}\|+\|u_{t}^{(N)}-\mathbb{E}_{t}[\Psi(t,X_{t}^{0},X_{t}^{1})]\|\big)dt\Big]\\
&\leq C\big(1+\|Y_{0}^{0,N}\|^{2}+\|Y_{0}^{0}\|^{2}\big)\cdot\|Y_{0}^{0,N}-Y_{0}^{0}\|^{2}\\
&\quad+C\Big(\mathbb{E}\Big[\int_{0}^{T}\big(1+\|\Theta_{t}^{0,N}\|^{2}+\|\Theta_{t}^{0}\|^{2}+\|\Theta_{t}^{(N)}\|^{2}+\|\overline{\Theta}_{t}^{i}\|^{2}+\|u_{t}^{(N)}\|^{2}+\|\Psi(t,X_{t}^{0},X_{t}^{1})\|^{2}\big)dt\Big]\Big)^{\frac{1}{2}}\\
&\qquad\cdot\Big(\mathbb{E}\Big[\int_{0}^{T}\big(\|\Theta_{t}^{0,N}-\Theta_{t}^{0}\|^{2}+\|\Theta_{t}^{(N)}-\overline{\Theta}_{t}^{i}\|^{2}+\|u_{t}^{(N)}-\mathbb{E}_{t}[\Psi(t,X_{t}^{0},X_{t}^{1})]\|^{2}\big)dt\Big]\Big)^{\frac{1}{2}}\leq\frac{C}{\sqrt{N}}.
\end{aligned}
$$
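
The passage from the product form to the square-root form in the integral term above rests on the Cauchy–Schwarz inequality on $\Omega\times[0,T]$, together with an elementary bound on the square of a finite sum. As a sketch, with generic nonnegative square-integrable processes $a_{t}$, $b_{t}$ and nonnegative reals $x_{1},\dots,x_{n}$ (symbols introduced here only for illustration, not notation from the paper):

$$
\mathbb{E}\Big[\int_{0}^{T}a_{t}b_{t}\,dt\Big]\leq\Big(\mathbb{E}\Big[\int_{0}^{T}a_{t}^{2}\,dt\Big]\Big)^{\frac{1}{2}}\Big(\mathbb{E}\Big[\int_{0}^{T}b_{t}^{2}\,dt\Big]\Big)^{\frac{1}{2}},\qquad\Big(\sum_{k=1}^{n}x_{k}\Big)^{2}\leq n\sum_{k=1}^{n}x_{k}^{2}.
$$

Taking $a_{t}$ to be the sum of the growth terms and $b_{t}$ the sum of the difference terms, and absorbing the factor $n$ into $C$, gives the displayed bound. The rate $C/\sqrt{N}$ then presumably follows once the second factor is of order $1/N$ before taking the square root, which is the kind of control an estimate such as (A.2) is meant to supply.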