
Projective Embedding of Dynamical Systems:
uniform mean field equations

F. Caravelli, Los Alamos National Laboratory
T-Division (T-4), Condensed Matter & Complex Systems,
Los Alamos, New Mexico 87545, USA
caravelli@lanl.gov
F. L. Traversa
MemComputing Inc., 9909 Huennekens Street, Suite 110, San Diego, California 92121, USA
M. Bonnin, F. Bonani, Politecnico di Torino
Department of Electronics and Telecommunication, Corso Duca degli Abruzzi 24, 10129 Turin, Italy
Abstract

We study embeddings of continuous dynamical systems in larger dimensions via projector operators. We call this technique PEDS, projective embedding of dynamical systems, as the stable fixed points of the original system dynamics are recovered via projection from the higher dimensional space. In this paper we provide a general definition and prove that for a particular type of rank-1 projector operator, the uniform mean field projector, the equations of motion become a mean field approximation of the dynamical system. While in general the embedding depends on a specified variable ordering, this is not the case for the uniform mean field projector. In addition, we prove that the original stable and saddle-node fixed points retain this feature in the embedding dynamics, while unstable fixed points become saddles. Direct applications of PEDS include non-convex optimization and machine learning.

keywords:
projective embedding, projector operators, dynamical systems, fixed points, PEDS

1 Introduction

The past decades witnessed an increased interest in physics- or neuro-inspired algorithms for the analysis of dynamical systems, with the main area of application being problems that can be mapped onto optimization ones, whether continuous or discrete [2, TraversaSOLG, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14]. Among the most important neuro-inspired algorithms we mention neural networks, which have received a large amount of attention given their wide applicability and remarkable achievements: this is an active area of research at the boundary between complex systems, neuromorphic computing and nonlinear dynamics, dating back at least to Turing [15]. In the study of neural networks, one of the most important open problems is the acceleration of the training phase, a problem connected to the roughness of the energy landscape [16, 17]. Network training is one of the most difficult tasks, requiring in general huge computational power and a vast number of samples. Many algorithms attempt to modify the energy function to reduce the time spent on saddle points [18, 19]. Changing the landscape is, however, challenging in general, as it requires some a priori knowledge of which type of local extrema should be modified. Thus, finding valuable alternatives and/or generalizations of gradient descent has been a topic of intense study. In addition, analog models of computation are an active area of research [20] with several applications.

From the point of view of a dynamical system, however, there are not many strategies one can employ. A possibility, incidentally the one we explore in this paper, is to increase the dimensionality of the system, preserving some properties of the original dynamical system while aiming at a trade-off between convergence optimality and the curse of dimensionality. The basic rationale for this strategy is that, by increasing the number of dimensions, more pathways become available for the system to reach a certain target point. As a simple example, consider a one dimensional barrier between two minima of a potential: following gradients, one could never move from one local minimum to the other, while in a higher dimensional system pathways around that confinement barrier are, at least in principle, possible.

The technique we propose here is inspired by recent results in the context of memristive circuits [21, 22, 23, 24, 25, 26]. In circuits, Kirchhoff laws are manifestations of the conservation of physical quantities such as charge or energy. Mathematically, these can be expressed via the introduction of projection operators, i.e. matrices \boldsymbol{\Omega} satisfying the constraint \boldsymbol{\Omega}^{2}=\boldsymbol{\Omega}, and directly connected to circuit topology. For instance, for a resistive circuit made of identical unitary resistances in series with impressed voltage generators, Ohm's law for the network can be expressed as

\vec{i}=\boldsymbol{\Omega}\vec{v}, \qquad (1)

where \vec{v} is the collection of voltage generators connected in series to each resistance, while \vec{i} contains the branch currents. The underlying assumption of (1) is that the voltage generators v_{i} are in series with the resistances, while the circuit can be represented as a graph with E edges. Given the branch currents and a certain orientation of the graph loops 1,\dots,L, we can obtain the so-called loop matrix \boldsymbol{A} of the circuit, of size L\times E, such that \boldsymbol{\Omega}=\boldsymbol{A}^{t}(\boldsymbol{A}\boldsymbol{A}^{t})^{-1}\boldsymbol{A}, where t denotes the transpose. The details of the derivation of \boldsymbol{\Omega} from the circuit topology are beyond the scope of this paper, where \boldsymbol{\Omega} will be kept generic and unrelated to any underlying graph or conservation law.
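As a minimal numerical sketch of this construction (the 2×3 loop matrix below is an illustrative choice, not taken from any specific circuit), one can build \boldsymbol{\Omega}=\boldsymbol{A}^{t}(\boldsymbol{A}\boldsymbol{A}^{t})^{-1}\boldsymbol{A} and verify the projector identity:

```python
import numpy as np

# Illustrative loop matrix A with L = 2 loops and E = 3 edges (full row rank).
A = np.array([[1.0, -1.0, 0.0],
              [0.0, 1.0, -1.0]])

# Projector onto the row space of A: Omega = A^t (A A^t)^{-1} A.
Omega = A.T @ np.linalg.inv(A @ A.T) @ A

assert np.allclose(Omega @ Omega, Omega)   # idempotent: Omega^2 = Omega
assert np.allclose(Omega, Omega.T)         # symmetric: an orthogonal projector
```

Any full-row-rank matrix A yields a projector this way; the circuit interpretation only fixes which A is used.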

We assume a continuous dynamical system, but the technique can in principle be extended to vector maps, and thus also works for numerical implementations of a dynamical system. Let us consider a dynamical system expressed in vector form as a first-order differential system

\frac{dx_{i}}{dt}=f_{i}(\vec{x})\qquad i=1,\dots,m \qquad (2)

where the functions f_{i}(\cdot) are assumed known and analytic. We are in general interested in recovering the stable fixed points of (2), i.e. the values \vec{x}^{*} such that f_{i}(\vec{x}^{*})=0, if they exist.

To this aim we consider another dynamical system, of size mN, written in the form

\frac{d}{dt}\vec{X}_{i}=\boldsymbol{\Omega}\vec{F}_{i}(\vec{X}_{1},\cdots,\vec{X}_{m})+\vec{G}_{i}(\vec{X}_{i})\qquad i=1,\dots,m \qquad (3)

where for each value of i we define an augmented vector \vec{X}_{i} of size N. The question we aim to answer in this contribution is whether functions \vec{F}_{i} and \vec{G}_{i} exist such that the dynamical system (2) is contained, in a sense we will make precise in the next section, in the extended system (3). The answer we provide in this paper is affirmative, as we will explicitly construct such a system along with the technique to recover the original dynamical system.

From a mathematical perspective, these generalizations can be investigated by studying the properties of fixed points in the embedded system in terms of the original ones, which is the strategy we use in this paper. A fixed point \vec{x}^{*} is a particular point of the phase space satisfying \frac{d\vec{x}}{dt}|_{\vec{x}^{*}}=\vec{f}(\vec{x}^{*})=\vec{0}. We dub the method developed in this paper Projective Embedding of Dynamical Systems (PEDS), as the technique involves the embedding of a target dynamical system of dimension m into one of dimension mN; ultimately, we recover the fixed points of the original dynamical system by projecting back onto a chosen set (of size m) of observables. We will prove that the information on the fixed points of the original target system is related to the fixed points of the reduced observables. As we will see, the dynamical system in which the embedding is contained is a nontrivial and nonlinear extension of the original dynamical system, obtained via a map between the original system and an extended one. Although the projection operator may be quite general, we prove most of the results for a specific operator, which we call the uniform mean-field projector, as in this simplified case mostly analytical proofs are available.

The structure of the paper is as follows. In Section 2 we introduce the PEDS procedure formally, and provide various examples to intuitively grasp why these definitions make sense. In Section 3 we study the uniform mean field projector and, both for one-dimensional and m-dimensional dynamical systems, we prove exact results about the properties of the asymptotically stable fixed points and their Jacobians. In Section 4 we provide numerical examples alongside analytical analysis, to further corroborate the bulk of the paper. Finally, conclusions follow.

2 The PEDS procedure: key definitions and examples

In order to clarify the techniques developed in this paper, we now construct the simplest example of the embedding, before introducing the necessary definitions. Notation-wise, we will denote with \boldsymbol{I} the identity matrix, while \vec{1} is a column vector with elements equal to 1.

Example 2.1.

Exponential dynamics.
  Let us consider the following one dimensional dynamical system:

\frac{dx}{dt}=ax\qquad x(0)=x_{0}, \qquad (4)

with a\in\mathbb{R}, whose analytical solution is given by

x(t)=e^{at}x_{0}. \qquad (5)

Considering an N\times N projector matrix \boldsymbol{\Omega} such that \boldsymbol{\Omega}^{2}=\boldsymbol{\Omega} and, thus, \boldsymbol{\Omega}(\boldsymbol{I}-\boldsymbol{\Omega})=0, we define the following enlarged (size N) dynamical system

\frac{d\vec{X}}{dt}=a\boldsymbol{\Omega}\vec{X}-\alpha(\boldsymbol{I}-\boldsymbol{\Omega})\vec{X}\qquad\vec{X}(0)=x_{0}\vec{b}, \qquad (6)

where \alpha>0, and \vec{b} is an arbitrary vector satisfying the only requirement \boldsymbol{\Omega}\vec{b}\neq\vec{0}.

Since the system above is linear, we know the analytical solution, which is given by

\vec{X}(t)=e^{[a\boldsymbol{\Omega}-\alpha(\boldsymbol{I}-\boldsymbol{\Omega})]t}\vec{X}(0)\approx e^{a\boldsymbol{\Omega}t}\vec{X}(0) \qquad (7)

where the approximation holds for t\rightarrow+\infty, i.e. for t\gg 1/\alpha. As for any projector \boldsymbol{\Omega} the following identity holds

e^{a\boldsymbol{\Omega}}=\boldsymbol{I}+(e^{a}-1)\boldsymbol{\Omega} \qquad (8)

the asymptotic solution reads

\vec{X}(t)\approx(\boldsymbol{I}-\boldsymbol{\Omega})\vec{X}(0)+e^{at}x_{0}\boldsymbol{\Omega}\vec{b}=(\boldsymbol{I}-\boldsymbol{\Omega})\vec{X}(0)+x(t)\boldsymbol{\Omega}\vec{b} \qquad (9)

Therefore, projecting (9),

\boldsymbol{\Omega}\vec{X}(t)\approx x(t)\boldsymbol{\Omega}\vec{b}, \qquad (10)

i.e., the asymptotic solution of (7) is contained as a common factor in all the modes of \vec{X}(t), the "replicated" dynamics.

As a last comment, we recover the solution of the original dynamical system by averaging the elements of (10):

\frac{1}{N}\vec{1}^{T}\boldsymbol{\Omega}\vec{X}(t)\approx x(t)\frac{1}{N}\vec{1}^{T}\boldsymbol{\Omega}\vec{b}, \qquad (11)

where T represents the transpose. Therefore, choosing the vector \vec{b} such that

\frac{1}{N}\vec{1}^{T}\boldsymbol{\Omega}\vec{b}=\frac{1}{N}\sum_{i,j=1}^{N}\Omega_{ij}b_{j}=1 \qquad (12)

we find

x(t)=\frac{1}{N}\vec{1}^{T}\boldsymbol{\Omega}\vec{X}(t)=\frac{1}{N}\sum_{i,j=1}^{N}\Omega_{ij}X_{j}(t) \qquad (13)

i.e., the projected dynamics recovers the original one dimensional system.
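The example can be checked numerically. The following sketch (with illustrative values a=-0.5, \alpha=3, N=4, and \vec{b}=\vec{1}, which satisfies condition (12) for the uniform mean-field projector with all entries 1/N) integrates the enlarged system (6) by forward Euler and recovers x(t) via (13):

```python
import numpy as np

N, a, alpha, x0 = 4, -0.5, 3.0, 1.0
Omega = np.full((N, N), 1.0 / N)          # projector: all entries 1/N
I = np.eye(N)
M = a * Omega - alpha * (I - Omega)       # generator of the enlarged linear system (6)

X = x0 * np.ones(N)                       # X(0) = x0 * b with b = 1
dt, steps = 1e-4, 20000                   # integrate up to T = dt*steps = 2
for _ in range(steps):                    # forward Euler
    X = X + dt * (M @ X)

x_rec = np.ones(N) @ Omega @ X / N        # projection (1/N) 1^T Omega X, Eq. (13)
# x_rec agrees closely with the exact solution exp(a*T)*x0
```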
 

The main goal of this paper is to extend the results of Example 2.1 to arbitrary dynamical systems. Let us now identify the key steps of the procedure. First, we begin with a dynamical system in the standard form.

Figure 1: Graphical representation of the PEDS procedure and of the associated maps. The horizontal arrows represent the time evolution maps \Phi_{t} and \Phi_{t}^{\prime}, while the vertical arrows represent the embedding map \mathcal{O} and the projection map \mathcal{P}, respectively.
Definition 2.2.

Embedding procedure: PEDS. We explicitly define here the steps involved in developing the PEDS procedure.

  1.

    We begin with a tuple (\{f_{1}(\vec{x}),\cdots,f_{m}(\vec{x})\},\boldsymbol{\Omega},\{\vec{G}_{1},\cdots,\vec{G}_{m}\},\mathcal{S},\{\vec{b}_{1},\cdots,\vec{b}_{m}\},N), where \boldsymbol{\Omega} is a size N projector operator. We call \{f_{1}(\vec{x}),\cdots,f_{m}(\vec{x})\} the target dynamical system, while the x_{i} represent the target variables. \mathcal{S} represents an ordering, relevant in the case of a multi-dimensional target system if the embedding is non-commutative. The vectors \vec{b}_{i} are constant and such that \boldsymbol{\Omega}\vec{b}_{i}\neq\vec{0}. Finally, N is the dimension of the embedding for each scalar variable; as such, it can also be interpreted as the number of dimensions into which each scalar variable is expanded.

  2.

    Given the target dynamical system of dimension m, we build an extended dynamical system of size mN, represented by a set of canonical equations of the form

    \frac{d}{dt}\vec{X}_{i}=\boldsymbol{\Omega}\boldsymbol{F}_{i}(\vec{X}_{1},\cdots,\vec{X}_{m})\vec{b}_{i}+\vec{G}_{i}(\boldsymbol{\Omega};\vec{X}_{i})\qquad i=1,\dots,m \qquad (14)

    This step is represented by the arrow \mathcal{O} in Fig. 1, as it maps each scalar function f_{i} to the vector function \vec{F}_{i}=\boldsymbol{F}_{i}\vec{b}_{i}. Thus, for each dimension of the original dynamical system, we obtain an extended N dimensional subspace in the mN dynamical system, so that

    (\{f_{1}(\vec{x}),\cdots,f_{m}(\vec{x})\})\underset{\mathcal{O}}{\rightarrow}(\{\vec{F}_{1}(\vec{X}_{i}),\cdots,\vec{F}_{m}(\vec{X}_{i})\},\{\vec{G}_{1}(\boldsymbol{\Omega};\vec{X}_{i}),\cdots,\vec{G}_{m}(\boldsymbol{\Omega};\vec{X}_{i})\})\equiv(F,G) \qquad (15)

    We call the specific map \mathcal{O} the embedding, while (F,G) is the extended system and \vec{X}_{i} the extended variable (i.e., a set of N scalar variables in the extended system). We also dub the set \vec{G}_{i} the decay functions. In each extended subspace, \vec{X}_{i} is a vector of components X_{i,j}, while the diagonal matrix \boldsymbol{X}_{i} is made of elements \boldsymbol{X}_{i,jk}=X_{i,j}\delta_{jk}, where \delta_{jk} represents the Kronecker symbol. The original vector can easily be recovered from the diagonal matrix as \vec{X}_{i}=\boldsymbol{X}_{i}\vec{1}. We stress that in principle \boldsymbol{F}_{i} can be a non-trivial function of \boldsymbol{\Omega}, as we shall discuss later on.

  3.

    We consider the time evolution of both the original and the extended system, represented by the maps \Phi_{t}^{\prime} and \Phi_{t}, respectively, in Fig. 1. In Example 2.1, the two maps were analytically expressed, thanks to the simplicity of the target system.

  4.

    The arrow \mathcal{P} in Fig. 1, finally, projects the extended dynamical system from size mN to an m dimensional system, which is required to coincide with the trajectory of the target dynamical system. For each variable, the projection is derived from the projector operator: given a certain extended variable \vec{X}_{i}, we obtain \bar{x}_{i}=\frac{1}{N}\sum_{j,k=1}^{N}{\Omega}_{jk}X_{i,k}.
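The projection step can be sketched as follows (array values are illustrative; for the uniform mean-field projector with all entries 1/N, the observable reduces to the mean of each extended variable):

```python
import numpy as np

# Collapse each N-dimensional extended variable X_i to the scalar observable
# xbar_i = (1/N) sum_{j,k} Omega_{jk} X_{i,k}. Values below are illustrative.
N, m = 5, 3
Omega = np.full((N, N), 1.0 / N)                  # uniform mean-field projector
X = np.arange(m * N, dtype=float).reshape(m, N)   # rows are the extended variables X_i

xbar = (np.ones(N) @ Omega @ X.T) / N             # shape (m,): one observable per x_i
# for this projector, xbar_i is simply the mean of X_i
```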

2.1 Extended variable ordering

Before delving into the details of the construction of (14), let us clarify what we mean by ordering. During the development of the PEDS procedure, commuting variable products such as x_{1}x_{2} will be mapped onto matrix products of the form (\boldsymbol{\Omega}\boldsymbol{X}_{1})(\boldsymbol{\Omega}\boldsymbol{X}_{2}). As matrix products do not commute, the ordering of the variables plays a role.

Definition 2.3.

Ordering. Within the context of PEDS, an ordering S is a map between commuting monomials of the form x_{1}^{i_{1}}x_{2}^{i_{2}}\cdots x_{m}^{i_{m}} and non-commuting matrix monomials of the form (\boldsymbol{\Omega}\boldsymbol{X}_{1})^{i_{1}}(\boldsymbol{\Omega}\boldsymbol{X}_{2})^{i_{2}}\cdots(\boldsymbol{\Omega}\boldsymbol{X}_{m})^{i_{m}}.

In general, an ordering can be written in terms of a certain set of coefficients. We will use the following notation

\{(\boldsymbol{\Omega}\boldsymbol{X}_{1})^{i_{1}}\cdots(\boldsymbol{\Omega}\boldsymbol{X}_{m})^{i_{m}}\}_{S}=\sum_{\sigma\in\mathcal{S}(m)}o_{\sigma(1)\cdots\sigma(m)}(\boldsymbol{\Omega}\boldsymbol{X}_{\sigma(1)})^{i_{\sigma(1)}}\cdots(\boldsymbol{\Omega}\boldsymbol{X}_{\sigma(m)})^{i_{\sigma(m)}} \qquad (16)

where \sigma is an element of the permutation group \mathcal{S}(m) over m variables, the coefficients o_{\sigma(1)\cdots\sigma(m)} are zero if at least two indices are equal, and

\sum_{\sigma\in\mathcal{S}(m)}o_{\sigma(1)\cdots\sigma(m)}=1. \qquad (17)
Definition 2.4.

Given the monomial x_{1}^{i_{1}}x_{2}^{i_{2}}\cdots x_{m}^{i_{m}}, the standard ordering is given by (\boldsymbol{\Omega}\boldsymbol{X}_{1})^{i_{1}}(\boldsymbol{\Omega}\boldsymbol{X}_{2})^{i_{2}}\cdots(\boldsymbol{\Omega}\boldsymbol{X}_{m})^{i_{m}}, i.e. a matrix monomial where the matrix products strictly follow the same sequence as in the scalar case.

Definition 2.5.

Given the monomial x_{1}^{i_{1}}x_{2}^{i_{2}}\cdots x_{m}^{i_{m}}, the balanced ordering is given by

\frac{1}{m!}\sum_{\sigma\in\mathcal{S}(m)}(\boldsymbol{\Omega}\boldsymbol{X}_{\sigma(1)})^{i_{\sigma(1)}}(\boldsymbol{\Omega}\boldsymbol{X}_{\sigma(2)})^{i_{\sigma(2)}}\cdots(\boldsymbol{\Omega}\boldsymbol{X}_{\sigma(m)})^{i_{\sigma(m)}}

Notice that

\frac{1}{m!}\sum_{\sigma\in\mathcal{S}(m)}1=1.

and that, given an order-independent function M, i.e. a function satisfying

M(a_{1},\cdots,a_{m})=M(a_{\sigma(1)},\cdots,a_{\sigma(m)}), \qquad (18)

for any permutation \sigma\in\mathcal{S}(m), then

\sum_{\sigma\in\mathcal{S}(m)}o_{\sigma(1),\cdots,\sigma(m)}M(a_{\sigma(1)},\cdots,a_{\sigma(m)})=\sum_{\sigma\in\mathcal{S}(m)}o_{\sigma(1),\cdots,\sigma(m)}M(a_{1},\cdots,a_{m})=M(a_{1},\cdots,a_{m}) \qquad (19)
Example 2.6.

  The standard ordering is characterized by

o_{\sigma(1)\cdots\sigma(m)}=\delta_{\sigma(1)1}\cdots\delta_{\sigma(m)m}, \qquad (20)

while for the balanced ordering, o_{\sigma(1)\cdots\sigma(m)}=1/m!.

Considering the case of two scalar variables (i.e., m=2), choosing o_{12}=1 and o_{21}=o_{11}=o_{22}=0, we get \{(\boldsymbol{\Omega}\boldsymbol{X}_{1})^{i_{1}}(\boldsymbol{\Omega}\boldsymbol{X}_{2})^{i_{2}}\}_{S}=(\boldsymbol{\Omega}\boldsymbol{X}_{1})^{i_{1}}(\boldsymbol{\Omega}\boldsymbol{X}_{2})^{i_{2}}. Another possible choice is o_{12}=a, o_{21}=b, o_{11}=o_{22}=0, where 0\leq a,b\leq 1 and a+b=1, so that

\{(\boldsymbol{\Omega}\boldsymbol{X}_{1})^{i_{1}}(\boldsymbol{\Omega}\boldsymbol{X}_{2})^{i_{2}}\}_{S}=a(\boldsymbol{\Omega}\boldsymbol{X}_{1})^{i_{1}}(\boldsymbol{\Omega}\boldsymbol{X}_{2})^{i_{2}}+b(\boldsymbol{\Omega}\boldsymbol{X}_{2})^{i_{2}}(\boldsymbol{\Omega}\boldsymbol{X}_{1})^{i_{1}} \qquad (21)
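A short numerical sketch (with illustrative diagonal matrices and the uniform mean-field projector with entries 1/N) confirms that the two orderings of a degree-two monomial disagree, so the coefficients o_{\sigma(1)\cdots\sigma(m)} matter:

```python
import numpy as np

N = 3
Omega = np.full((N, N), 1.0 / N)     # uniform mean-field projector
X1 = np.diag([1.0, 2.0, 3.0])        # illustrative diagonal extended variables
X2 = np.diag([4.0, 5.0, 6.0])

P12 = Omega @ X1 @ Omega @ X2        # standard ordering of x1*x2
P21 = Omega @ X2 @ Omega @ X1        # reversed ordering

# the two matrix monomials differ, even though x1*x2 = x2*x1 for scalars
assert not np.allclose(P12, P21)

balanced = 0.5 * (P12 + P21)         # balanced ordering for m = 2 (o_12 = o_21 = 1/2)
```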

 

2.2 Decay functions

We now discuss the decay functions \vec{G}_{i}(\boldsymbol{\Omega};\vec{X}_{i}). The choice made in Example 2.1 was

\vec{G}(\boldsymbol{\Omega};\vec{X})=-\alpha(\boldsymbol{I}-\boldsymbol{\Omega})\vec{X} \qquad (22)

where \alpha\geq 0. This particular choice corresponds to a precise definition:

Definition 2.7.

Standard decay function. The decay function in (22) is called the standard decay function.

As seen in Example 2.1, the standard decay function allowed us to recover the target system dynamics, which in turn was reconstructed in Span(\boldsymbol{\Omega}). The role played by the decay functions is to enforce that, in each extended subspace, the modal components associated with Ker(\boldsymbol{\Omega}) vanish asymptotically.

Definition 2.8.

A decay function \vec{G}_{i}(\boldsymbol{\Omega};\vec{X}_{i}) is \boldsymbol{\Omega}-eligible if

\lim_{t\rightarrow+\infty}\boldsymbol{\Omega}\vec{G}_{i}(\boldsymbol{\Omega};\vec{X}_{i}(t))=\vec{0} \qquad (23)

and if the solution of the dynamical system obtained by projecting (14) onto Ker(\boldsymbol{\Omega}) (i.e., projecting the extended equation via (\boldsymbol{I}-\boldsymbol{\Omega}) and defining \vec{X}_{ci}(t)=(\boldsymbol{I}-\boldsymbol{\Omega})\vec{X}_{i}(t))

\frac{d\vec{X}_{ci}}{dt}=(\boldsymbol{I}-\boldsymbol{\Omega})\vec{G}_{i}(\boldsymbol{\Omega};\vec{X}_{i}) \qquad (24)

is decaying, i.e. if \lim_{t\rightarrow\infty}\vec{X}_{ci}(t)=\vec{0}.

Obviously, the standard decay functions are \boldsymbol{\Omega}-eligible.
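As a numerical sketch (with illustrative values for N, \alpha, and the initial condition), integrating the pure decay dynamics for the standard decay function shows the kernel component X_{c} vanishing while the component in Span(\boldsymbol{\Omega}) is untouched:

```python
import numpy as np

N, alpha = 4, 2.0
Omega = np.full((N, N), 1.0 / N)        # uniform mean-field projector
I = np.eye(N)

X = np.array([1.0, -2.0, 0.5, 3.0])     # arbitrary illustrative initial condition
dt = 1e-3
for _ in range(5000):                   # Euler on dX/dt = -alpha (I - Omega) X, T = 5
    X = X + dt * (-alpha * (I - Omega) @ X)

Xc = (I - Omega) @ X                    # kernel component: decays like exp(-alpha t)
# the mean (the Span(Omega) component) is conserved, since Omega (I - Omega) = 0
```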

2.3 Embedding map 𝒪\mathcal{O}

We are now ready to state the exact definition of the \mathcal{O} map. However, this step requires expressing the nonlinear scalar functions defining the target dynamical system as a power series. From this standpoint, it is convenient to formulate the Taylor expansion of an m variable, scalar analytic function f(\vec{x}) as a superposition of monomials exploiting the Kronecker symbol:

f(\vec{x})=\sum_{j=0}^{\infty}\sum_{i_{1},\dots,i_{m}=0}^{j}\delta_{j,\sum_{k=1}^{m}i_{k}}b_{j;i_{1}\dots i_{m}}x_{1}^{i_{1}}\cdots x_{m}^{i_{m}}=\sum_{j=0}^{\infty}\sum_{i_{1},\dots,i_{m}=0}^{j}a_{j;i_{1}\dots i_{m}}x_{1}^{i_{1}}\cdots x_{m}^{i_{m}}, \qquad (25)

where a_{j;i_{1}\cdots i_{m}}=\delta_{j,\sum_{k=1}^{m}i_{k}}b_{j;i_{1}\cdots i_{m}}.

Definition 2.9.

Matrix map.
Given a scalar analytic function f(\vec{x}) with Taylor expansion as in (25), we call a matrix map for f the following construction

\boldsymbol{F}(\vec{X}_{1},\dots,\vec{X}_{m})=\sum_{j=0}^{\infty}\sum_{i_{1},\dots,i_{m}=0}^{j}a_{j;i_{1}\dots i_{m}}\{(\boldsymbol{\Omega}\boldsymbol{X}_{1})^{i_{1}}\cdots(\boldsymbol{\Omega}\boldsymbol{X}_{m})^{i_{m}}\}_{S} \qquad (26)

where S is a properly defined ordering.

Definition 2.10.

Embedding map.
The embedding map \mathcal{O} is defined by the tuple \mathcal{O}=\big(\{f_{i}\},\boldsymbol{\Omega},\{\vec{G}_{i}\},S,\{\vec{b}_{i}\},N\big), where the \vec{G}_{i} represent decay functions, and \{\vec{b}_{i}\} is a set of constant vectors satisfying the condition \boldsymbol{\Omega}\vec{b}_{i}\neq\vec{0}. The embedding map \mathcal{O} of (2) is given by

\frac{dx_{i}}{dt}=f_{i}(\vec{x})\underset{\mathcal{O}}{\longrightarrow}\frac{d\vec{X}_{i}}{dt}=\boldsymbol{\Omega}\boldsymbol{F}_{i}(\vec{X}_{1},\cdots,\vec{X}_{m})\vec{b}_{i}+\vec{G}_{i}(\boldsymbol{\Omega};\vec{X}_{i})\qquad i=1,\dots,m \qquad (27)

where \boldsymbol{F}_{i} is the matrix map associated to f_{i} according to Definition 2.9.

Let us now provide three examples of matrices \boldsymbol{F}_{i} which will be used in the following. Each target function is analytic, with series representation as in (25):

f_{i}(x_{1},\cdots,x_{m})=\sum_{k=0}^{\infty}\sum_{j_{1},\cdots,j_{m}}^{k}a_{i,k;j_{1},\cdots,j_{m}}x_{1}^{j_{1}}\cdots x_{m}^{j_{m}}
Definition 2.11.

We define the following three possible matrix embeddings:

  1.

    the standard commutative map is

    \boldsymbol{F}^{(c)}_{i}(\boldsymbol{X}_{1},\cdots,\boldsymbol{X}_{m})=\sum_{k=0}^{\infty}\sum_{j_{1},\cdots,j_{m}}^{k}a_{i,k;j_{1},\cdots,j_{m}}\boldsymbol{X}_{1}^{j_{1}}\cdots\boldsymbol{X}_{m}^{j_{m}}, \qquad (28)

  2.

    the mixed commutative map is

    \boldsymbol{F}^{(mc)}_{i}(\boldsymbol{\Omega};\boldsymbol{X}_{1},\cdots,\boldsymbol{X}_{m})=a_{i,0}\boldsymbol{I}+\sum_{k=1}^{\infty}\sum_{j_{1},\cdots,j_{m}}^{k}a_{i,k;j_{1},\cdots,j_{m}}(\boldsymbol{\Omega}(\boldsymbol{X}_{1}^{j_{1}}\cdots\boldsymbol{X}_{m}^{j_{m}})^{1/k})^{k}, \qquad (29)

    where a_{i,0} denotes the constant term of the series expansion for the function f_{i};

  3.

    the standard non-commutative map is

    \boldsymbol{F}^{(nc)}_{i}(\boldsymbol{\Omega};\boldsymbol{X}_{1},\cdots,\boldsymbol{X}_{m})=\sum_{k=0}^{\infty}\sum_{j_{1},\cdots,j_{m}}^{k}a_{i,k;j_{1},\cdots,j_{m}}\{(\boldsymbol{\Omega}\boldsymbol{X}_{1})^{j_{1}}\cdots(\boldsymbol{\Omega}\boldsymbol{X}_{m})^{j_{m}}\}_{S} \qquad (30)

    where S is the chosen ordering.

Clearly, since the diagonal matrices \boldsymbol{X}_{i} commute, defining an ordering for the standard and the mixed commutative maps is unnecessary. As we will see, this difference is important for embeddings of vector dynamical systems in the case of the mixed commutative map, but not for a scalar system. Notice also that

  1.

    the standard commutative map is a linear mix of the dynamical system functions f_{i}(\vec{x}), since a direct calculation shows

    \boldsymbol{F}^{(c)}_{i}(\boldsymbol{X}_{1},\cdots,\boldsymbol{X}_{m})=\text{diag}\big(f_{i}(X_{1,1},\cdots,X_{m,1}),\cdots,f_{i}(X_{1,N},\cdots,X_{m,N})\big), \qquad (31)

    which drastically simplifies the evaluation;

  2.

    for scalar dynamical systems, the mixed commutative map and the standard non-commutative map reduce to the same quantity;

  3.

    in the case of vector dynamical systems, the mixed commutative map preserves the commutativity of the target variables, since the diagonal matrices \boldsymbol{X}_{i} commute among themselves.

As a consequence, for scalar dynamical systems we will study only the standard commutative and non-commutative maps, and the result for the mixed commutative map will follow from the non-commutative one. However, we will have to be more careful in the vector case.
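Equation (31) can be checked directly. The sketch below uses an illustrative single-variable polynomial f(x)=1+2x-x^{2} and compares the matrix series of the standard commutative map with the componentwise application of f along the diagonal:

```python
import numpy as np

coeffs = [1.0, 2.0, -1.0]                  # a_0, a_1, a_2 of f(x) = 1 + 2x - x^2
f = lambda x: coeffs[0] + coeffs[1] * x + coeffs[2] * x**2

X = np.diag([0.5, -1.0, 2.0])              # illustrative diagonal extended variable

# standard commutative map: sum_k a_k X^k for the diagonal matrix X
Fc = sum(a * np.linalg.matrix_power(X, k) for k, a in enumerate(coeffs))

# Eq. (31): the result is diag(f(X_1), ..., f(X_N))
assert np.allclose(Fc, np.diag(f(np.diag(X))))
```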

2.4 Projection operator 𝛀\boldsymbol{\Omega}

We provide here a few definitions on the projection operators of size N that we will consider in the following.

Definition 2.12.

A projector \boldsymbol{\Omega} is called trivial if rank(\boldsymbol{\Omega})=N, or, equivalently, if Span(\boldsymbol{\Omega})=\mathbb{R}^{N}.

A simple proof shows that the only trivial projector is the identity matrix \boldsymbol{I}.

Definition 2.13.

The uniform mean-field projector \boldsymbol{\Omega}_{1} is defined as the square matrix with elements

\Omega_{1,ij}=\frac{1}{N}

Let us consider \boldsymbol{X} to be a diagonal matrix, as in the PEDS embedding procedure. Projection using the uniform mean-field operator yields

\boldsymbol{\Omega}_{1}\boldsymbol{X}=\frac{1}{N}\begin{pmatrix}1&\cdots&1\\ \vdots&\ddots&\vdots\\ 1&\cdots&1\end{pmatrix}\begin{pmatrix}X_{1}&&\\ &\ddots&\\ &&X_{N}\end{pmatrix}=\frac{1}{N}\begin{pmatrix}X_{1}&X_{2}&\cdots&X_{N}\\ \vdots&\vdots&&\vdots\\ X_{1}&X_{2}&\cdots&X_{N}\end{pmatrix} \qquad (32)

therefore, the powers of \boldsymbol{\Omega}\boldsymbol{X} appearing in the PEDS procedure are neither trivial expressions nor sparse matrices, and indeed contain nonlinear components in the X_{i} variables.

Example 2.14.

  For N=2 we have

\boldsymbol{\Omega}_{1}\boldsymbol{X}=\frac{1}{2}\begin{pmatrix}X_{1}&X_{2}\\ X_{1}&X_{2}\end{pmatrix},\qquad(\boldsymbol{\Omega}_{1}\boldsymbol{X})^{2}=\frac{1}{4}\begin{pmatrix}X_{1}(X_{1}+X_{2})&X_{2}(X_{1}+X_{2})\\ X_{1}(X_{1}+X_{2})&X_{2}(X_{1}+X_{2})\end{pmatrix}=\langle X\rangle\boldsymbol{\Omega}_{1}\boldsymbol{X} \qquad (33)

where \langle X\rangle=\frac{1}{N}\sum_{j=1}^{N}X_{j}.

The previous example can easily be generalized to size N, showing that (\boldsymbol{\Omega}_{1}\boldsymbol{X})^{k}=\langle X\rangle^{k-1}\boldsymbol{\Omega}_{1}\boldsymbol{X}, thus justifying the definition of \boldsymbol{\Omega}_{1} as the uniform mean-field projector.

A similar property applies to vectors, as \boldsymbol{\Omega}_{1}\vec{X}=\langle X\rangle\vec{1}.
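The identities above are easy to verify numerically (the vector of values below is illustrative):

```python
import numpy as np

N = 5
Omega1 = np.full((N, N), 1.0 / N)            # uniform mean-field projector
x = np.array([1.0, 2.0, -0.5, 4.0, 0.0])     # illustrative values
X = np.diag(x)
mean = x.mean()                              # <X>

# (Omega_1 X)^k = <X>^{k-1} Omega_1 X for k >= 1
for k in range(1, 6):
    lhs = np.linalg.matrix_power(Omega1 @ X, k)
    assert np.allclose(lhs, mean ** (k - 1) * (Omega1 @ X))

# vector version: Omega_1 x = <X> 1
assert np.allclose(Omega1 @ x, mean * np.ones(N))
```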

Example 2.15.

  Consider again Example 2.1. We can write the embedded system as

\frac{d\vec{X}}{dt}=a\boldsymbol{\Omega}\vec{X}-\alpha(\boldsymbol{I}-\boldsymbol{\Omega})\vec{X}=\boldsymbol{\Omega}(a\boldsymbol{\Omega}\boldsymbol{X})\vec{1}-\alpha(\boldsymbol{I}-\boldsymbol{\Omega})\vec{X} \qquad (34)

which is in the form of a PEDS (3), with \boldsymbol{F}=a\boldsymbol{\Omega}\boldsymbol{X}, \vec{b}=\vec{1} and \vec{G}=-\alpha(\boldsymbol{I}-\boldsymbol{\Omega})\vec{X}.
 

We would like to stress that the PEDS mapping is, in general, highly non-trivial, as long as the projection operator is not the trivial one: in this case the matrix powers of the form (\boldsymbol{\Omega}\boldsymbol{X}_{i})^{k} couple all the subspace variables in a nonlinear way.

On the other hand, for the trivial projector the standard decay function is identically zero, and the extended system, as well as any extended monomial, is ordering independent. In fact, as the diagonal matrices \boldsymbol{X}_{j} commute among themselves, we have that

\{(\boldsymbol{\Omega}\boldsymbol{X}_{1})^{j_{1}}\cdots(\boldsymbol{\Omega}\boldsymbol{X}_{m})^{j_{m}}\}_{S}=(\boldsymbol{X}_{1})^{j_{1}}\cdots(\boldsymbol{X}_{m})^{j_{m}} \qquad (35)

if and only if \boldsymbol{\Omega}=\boldsymbol{I}. As a consequence, (27) becomes

\frac{d\vec{X}_{i}}{dt}=\boldsymbol{\Omega}\boldsymbol{F}_{i}(\vec{X}_{1},\cdots,\vec{X}_{m})\vec{b}_{i}=\begin{pmatrix}f_{i}(X_{1,1},\dots,X_{m,1})b_{i,1}\\ \vdots\\ f_{i}(X_{1,N},\dots,X_{m,N})b_{i,N}\end{pmatrix}\qquad i=1,\dots,m \qquad (36)

thus showing that the PEDS procedure for the trivial projector decouples into N identical copies of the original system.
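For a scalar target, this decoupling can be sketched directly (with the illustrative choice f(x)=x-x^{3}, \vec{b}=\vec{1} and \boldsymbol{\Omega}=\boldsymbol{I}, so that by Eq. (36) each of the N components integrates its own copy of the original system):

```python
import numpy as np

N = 4
f = lambda x: x - x**3                  # illustrative scalar system, stable points at +/-1

# trivial projector Omega = I: Eq. (36) reduces to N independent scalar copies,
# so we can evolve all components with a single componentwise Euler loop
X = np.array([0.3, -0.2, 1.7, -2.0])    # N independent initial conditions
dt = 1e-3
for _ in range(20000):                  # integrate up to T = 20
    X = X + dt * f(X)

# each copy converges to whichever stable fixed point (+1 or -1) its own
# initial condition selects; the copies never interact
```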

3 Embedding via the uniform mean field projector

We derive here in a more rigorous way the key results presented above. We focus on the uniform mean field projector \boldsymbol{\Omega}_{1}, as the proofs are easier to carry out. Nevertheless, several results are actually valid for a more general projection operator \boldsymbol{\Omega}: these will be explicitly denoted by using the general projector \boldsymbol{\Omega} in place of the uniform mean field operator.

3.1 Simple case: Scalar target system, embedding without decay function

We start from the case of a one dimensional target dynamical system

\frac{dx}{dt}=f(x) \qquad (37)

where f(x) is analytic, so that

f(x)=\sum_{i=0}^{\infty}a_{i}x^{i}. \qquad (38)

Following the PEDS procedure, we introduce the projector operator \boldsymbol{\Omega}. The extended variable \vec{X} is thus an N-dimensional vector with components X_{i}, and the matrix map associated to f takes either the standard commutative form (28), so that

\vec{F}(\vec{X})=\sum_{i=0}^{\infty}a_{i}\boldsymbol{X}^{i}\vec{1} \qquad (39)

where we have chosen \vec{b}=\vec{1}, or the standard non-commutative form (30) (we remind the reader that for scalar target systems, the mixed commutative and the standard non-commutative forms coincide)

\vec{F}(\vec{X})=\sum_{i=0}^{\infty}a_{i}(\boldsymbol{\Omega}\boldsymbol{X})^{i}\vec{1} \qquad (40)

Before we begin our discussion on the embedding, it is worth giving a definition of what we mean when we say that a dynamical system is contained in another one. Taking Fig. 1 as a reference, let our target system be described by the evolution map (the solution of the dynamical system) ϕt:\phi_{t}^{\prime}:\mathbb{R}\rightarrow\mathbb{R}, while the PEDS evolution is instead a map ϕt:NN\phi_{t}:\mathbb{R}^{N}\rightarrow\mathbb{R}^{N}.

Definition 3.1.

A dynamical system AA of size NAN_{A} is contained in a dynamical system BB of dimensions NB>NAN_{B}>N_{A} if a linear operator 𝒫:NBNA\mathcal{P}:\mathbb{R}^{N_{B}}\rightarrow\mathbb{R}^{N_{A}} exists such that, for XNB\vec{X}\in\mathbb{R}^{N_{B}}

𝒫ϕt(X)=ϕt(x),\mathcal{P}\phi_{t}(\vec{X})=\phi_{t}^{\prime}(\vec{x}), (41)

where x\vec{x} has size NAN_{A}.

Given the definition above, we can now prove the following

Proposition 3.2.

Banality of mean value. Let 𝒪=(f(x),𝛀,0,1,N)\mathcal{O}=(f(x),\boldsymbol{\Omega},0,\vec{1},N) be a PEDS tuple of a target dynamical system as in (37), where the matrix map can take either the standard commuting or standard non-commuting form. Then, the extended dynamical system (39) or (40) contains the dynamics of (37) for a generic projection operator 𝛀\boldsymbol{\Omega} satisfying 𝛀10\boldsymbol{\Omega}\vec{1}\neq\vec{0}.

Proof.

Let us consider an extended variable X\vec{X} subject to the condition X=x1\vec{X}=x\vec{1}. Then, as 𝑿=x𝑰\boldsymbol{X}=x\boldsymbol{I} and 𝛀i=𝛀\boldsymbol{\Omega}^{i}=\boldsymbol{\Omega} for all i>0i>0, for both the standard commuting and non-commuting maps we have:

ddtX=dxdt1={i=0aixi𝛀1commuting mapi=0aixi𝛀1non commuting map=a0𝛀1+i=1aixi𝛀1=f(x)𝛀1\frac{d}{dt}\vec{X}=\frac{dx}{dt}\vec{1}=\begin{cases}\displaystyle\sum_{i=0}^{\infty}a_{i}x^{i}\boldsymbol{\Omega}\vec{1}&\text{commuting map}\\[4.30554pt] \displaystyle\sum_{i=0}^{\infty}a_{i}x^{i}\boldsymbol{\Omega}\vec{1}&\text{non commuting map}\end{cases}=a_{0}\boldsymbol{\Omega}\vec{1}+\sum_{i=1}^{\infty}a_{i}x^{i}\boldsymbol{\Omega}\vec{1}=f(x)\boldsymbol{\Omega}\vec{1} (42)

Therefore, projecting the previous relation onto the span of 𝛀\boldsymbol{\Omega}, i.e. evaluating 𝛀ddtX=𝛀F(𝑿)\boldsymbol{\Omega}\frac{d}{dt}\vec{X}=\boldsymbol{\Omega}\vec{F}(\boldsymbol{X}), we obtain

(dxdtf(x))𝛀1=0.\left(\frac{dx}{dt}-f(x)\right)\boldsymbol{\Omega}\vec{1}=0. (43)

It follows that as 𝛀10\boldsymbol{\Omega}\vec{1}\neq\vec{0}, then dxdtf(x)=0\frac{dx}{dt}-f(x)=0. In order to prove that the dynamical system is contained, we can project on any component ii, obtaining

ωi(dxdtf(x))=0\displaystyle\omega_{i}\left(\frac{dx}{dt}-f(x)\right)=0 (44)

Since ωi=(𝛀1)i0\omega_{i}=(\boldsymbol{\Omega}\vec{1})_{i}\neq 0 for at least one component ii, we have thus proven that an initial condition exists for which (41) applies. ∎

Proposition 3.2 is a warm-up for the type of proofs that will follow. It shows that if the initial conditions for the variables X\vec{X} are chosen homogeneously, then the extended dynamical system follows the one-dimensional dynamics of (37). However, the condition X=x1\vec{X}=x\vec{1} is a strong requirement on the dynamical system. A dynamically obtained convergence towards a state of the form X=x1\vec{X}=x\vec{1} would be a much weaker, and thus preferable, requirement. To this aim, we introduce the decay functions.
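Proposition 3.2 can also be checked numerically. The forward-Euler sketch below (the analytic target f(x) = sin x and all parameters are illustrative assumptions, not taken from the text) verifies that a homogeneous initial condition remains homogeneous and that its mean tracks the scalar dynamics.

```python
import numpy as np

# Hypothetical analytic target with fixed points at multiples of pi.
def f(x):
    return np.sin(x)

N, dt, steps = 4, 1e-3, 5000
Omega1 = np.ones((N, N)) / N       # uniform mean field projector
X = 0.5 * np.ones(N)               # homogeneous initial condition X = x*1
x = 0.5                            # scalar target system
for _ in range(steps):
    X = X + dt * (Omega1 @ f(X))   # standard commuting map, b = 1, no decay term
    x = x + dt * f(x)

homogeneity = float(np.max(np.abs(X - X.mean())))   # stays (numerically) zero
tracking_error = abs(float(X.mean()) - x)           # mean follows the target
```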

3.2 Scalar target system: Enforcing the convergence to the mean via decay functions

We consider now the following form for the extended system (14) based on the uniform mean field projector:

dXdt=𝛀1F(X)α(XX1)=𝛀1F(X)α(𝑰𝛀𝟏)X\frac{d\vec{X}}{dt}=\boldsymbol{\Omega}_{1}\vec{F}(\vec{X})-\alpha(\vec{X}-\langle X\rangle\vec{1})=\boldsymbol{\Omega}_{1}\vec{F}(\vec{X})-\alpha(\boldsymbol{I}-\boldsymbol{\Omega_{1}})\vec{X} (45)

where X=(1/N)i=1NXi\langle X\rangle=({1}/{N})\sum_{i=1}^{N}X_{i} and α>0\alpha>0. Notice that XX1=(𝑰𝛀1)X\vec{X}-\langle X\rangle\vec{1}=(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X} because of the properties of 𝛀1\boldsymbol{\Omega}_{1} discussed in Sec. 2.4. The second term on the right hand side of (45) is an “elastic” force compelling the extended trajectories to remain close to the mean. The relative strength of the two addends determines the behavior of the system.

Proposition 3.3.

Convergence to the mean. The dynamics of (45) is characterized by the same fixed points, if they exist, as for the target system (37) both for the standard commuting and non-commuting matrix maps.

Proof.

Projecting (45) through 𝛀1\boldsymbol{\Omega}_{1} and using (39) we find

𝛀1dXdt={i=0ai𝛀1𝑿i1α𝛀1(𝑰𝛀1)Xstandard commuting mapi=0ai𝛀1(𝛀𝟏𝑿)i1α𝛀1(𝑰𝛀1)Xstandard noncommuting map\boldsymbol{\Omega}_{1}\frac{d\vec{X}}{dt}=\begin{cases}\displaystyle\sum_{i=0}^{\infty}a_{i}\boldsymbol{\Omega}_{1}\boldsymbol{X}^{i}\vec{1}-\alpha\cancel{\boldsymbol{\Omega}_{1}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}}&\text{standard commuting map}\\[4.30554pt] \displaystyle\sum_{i=0}^{\infty}a_{i}\boldsymbol{\Omega}_{1}(\boldsymbol{\Omega_{1}X})^{i}\vec{1}-\alpha\cancel{\boldsymbol{\Omega}_{1}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}}&\text{standard noncommuting map}\\ \end{cases} (46)

which, once X=x1\vec{X}=x\vec{1} is enforced, reduces to the setting of the banality of the mean value (Proposition 3.2).

Considering the complementary projection, we have

(𝑰𝛀1)dXdt=(𝑰𝛀1)𝛀1F(X)α((𝑰𝛀1)XX(𝑰𝛀1)1)(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\frac{d\vec{X}}{dt}=\cancel{(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\boldsymbol{\Omega}_{1}}\vec{F}(\vec{X})-\alpha((\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}-\langle X\rangle\cancel{(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{1}}) (47)

or, defining Xc=(𝑰𝛀1)X\vec{X}_{c}=(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X},

ddtXc=αXc.\frac{d}{dt}\vec{X}_{c}=-\alpha\vec{X}_{c}. (48)

Equation (48) represents the dynamics of the N1N-1 modes that make X\vec{X} non-uniform. The above implies that any non-uniform mode of X\vec{X} decays exponentially, and thus XX10\vec{X}-\langle X\rangle\vec{1}\rightarrow 0 in a time tτ=1/αt\gg\tau={1}/{\alpha}. This concludes the proof. ∎
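A short numerical sketch of this proposition (forward Euler, with the illustrative target f(x) = x − x²; parameters are arbitrary choices) verifies that the non-uniform modes of (45) decay at the rate e^(−αt) predicted by (48):

```python
import numpy as np

# Hypothetical example target; the decay of the non-uniform modes is
# independent of this choice, as shown in the proof.
def f(x):
    return x - x**2

N, alpha, dt, steps = 5, 2.0, 1e-3, 3000
Omega1 = np.ones((N, N)) / N
I = np.eye(N)
rng = np.random.default_rng(0)
X = rng.uniform(0.1, 0.9, N)                    # inhomogeneous initial condition
Xc0_norm = np.linalg.norm((I - Omega1) @ X)     # initial non-uniform modes
for _ in range(steps):
    X = X + dt * (Omega1 @ f(X) - alpha * (I - Omega1) @ X)   # eq. (45)

t = steps * dt                                  # t = 3 >> 1/alpha
ratio = np.linalg.norm((I - Omega1) @ X) / Xc0_norm
expected = np.exp(-alpha * t)                   # decay rate from eq. (48)
```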

In conclusion, using the uniform mean field projector 𝛀1\boldsymbol{\Omega}_{1} and the standard decay functions as in (45), X(t)\vec{X}(t) converges to the correct mean, and thus to the same fixed points as the target system. Let us now provide some technical results supporting the idea that the decay functions project the dynamics back onto the subspace of interest. The result is in fact not strictly limited to the standard decay functions: we now prove the decay of the modes in Ker(𝛀1)(\boldsymbol{\Omega}_{1}) for a generalized set of decay functions. Let us consider

dXdt=𝛀1F{𝑫(𝑰𝛀1)Xgeneralization A(𝑰𝛀1)𝑫(𝑰𝛀1)Xgeneralization B\frac{d\vec{X}}{dt}=\boldsymbol{\Omega}_{1}\vec{F}-\begin{cases}\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}&\text{generalization A}\\ (\boldsymbol{I}-\boldsymbol{\Omega}_{1})\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}&\text{generalization B}\end{cases} (49)

where 𝑫\boldsymbol{D} is a positive diagonal matrix with diagonal elements α1,,αN>0\alpha_{1},\dots,\alpha_{N}>0. If α=α1==αN\alpha=\alpha_{1}=\dots=\alpha_{N}, both generalizations reduce to the standard decay function. In the general case, their difference becomes evident projecting via 𝛀1\boldsymbol{\Omega}_{1}

𝛀1dXdt=𝛀1F{𝛀1𝑫(𝑰𝛀1)Xgeneralization A0generalization B\displaystyle\boldsymbol{\Omega}_{1}\frac{d\vec{X}}{dt}=\boldsymbol{\Omega}_{1}\vec{F}-\begin{cases}\boldsymbol{\Omega}_{1}\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}&\text{generalization A}\\ \vec{0}&\text{generalization B}\end{cases} (50)

i.e., the PEDS embeddings

𝒪A=(f(x),𝛀1,𝑫(𝑰𝛀1)X,1,N)\mathcal{O}_{A}=(f(x),\boldsymbol{\Omega}_{1},-\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X},\vec{1},N)

and

𝒪B=(f(x),𝛀1,(𝑰𝛀1)𝑫(𝑰𝛀1)X,1,N).\mathcal{O}_{B}=(f(x),\boldsymbol{\Omega}_{1},-(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X},\vec{1},N).

Clearly, the first part of the proof of the banality lemma remains valid also in these cases. On the other hand, projecting via (𝑰𝛀1)(\boldsymbol{I}-\boldsymbol{\Omega}_{1}), we obtain for both generalizations the following governing equation for the non-uniform modes

dXcdt=(𝑰𝛀1)𝑫Xc\frac{d\vec{X}_{c}}{dt}=-(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\boldsymbol{D}\vec{X}_{c} (51)

whose solution is

Xc(t)=e(𝑰𝛀)𝑫tXc(0)|𝛀=𝛀1.\displaystyle\vec{X}_{c}(t)=\left.e^{-(\boldsymbol{I}-\boldsymbol{\Omega})\boldsymbol{D}t}\vec{X}_{c}(0)\right|_{\boldsymbol{\Omega}=\boldsymbol{\Omega}_{1}}. (52)

We show now that the two generalizations A and B are 𝛀\boldsymbol{\Omega}-eligible, i.e. that Xc\vec{X}_{c} asymptotically approaches the zero vector. We prove the following proposition for general projectors:

Proposition 3.4.

Given the governing equation (51) written for a general projector 𝛀\boldsymbol{\Omega}, assuming Xc(0)Span(𝐈𝛀)\vec{X}_{c}(0)\in\mathrm{Span}(\boldsymbol{I}-\boldsymbol{\Omega}) then Xc(t)Span(𝐈𝛀)t\vec{X}_{c}(t)\in\mathrm{Span}(\boldsymbol{I}-\boldsymbol{\Omega})\ \forall t.

Proof.

The solution of (51) takes the form

Xc(t)=e𝑨tXc(0)\vec{X}_{c}(t)=e^{\boldsymbol{A}t}\vec{X}_{c}(0) (53)

where 𝑨=−(𝑰−𝛀)𝑫\boldsymbol{A}=-(\boldsymbol{I}-\boldsymbol{\Omega})\boldsymbol{D}. Expanding the exponential, we get

Xc(t)=Xc(0)+k=1+(−t)kk!((𝑰𝛀)𝑫)kXc(0)\vec{X}_{c}(t)=\vec{X}_{c}(0)+\sum_{k=1}^{+\infty}\frac{(-t)^{k}}{k!}((\boldsymbol{I}-\boldsymbol{\Omega})\boldsymbol{D})^{k}\vec{X}_{c}(0)

that, projecting through 𝑰𝛀\boldsymbol{I}-\boldsymbol{\Omega}, becomes

(𝑰𝛀)Xc(t)=(𝑰𝛀)Xc(0)+k=1+(−t)kk!((𝑰𝛀)𝑫)kXc(0).(\boldsymbol{I}-\boldsymbol{\Omega})\vec{X}_{c}(t)=(\boldsymbol{I}-\boldsymbol{\Omega})\vec{X}_{c}(0)+\sum_{k=1}^{+\infty}\frac{(-t)^{k}}{k!}((\boldsymbol{I}-\boldsymbol{\Omega})\boldsymbol{D})^{k}\vec{X}_{c}(0). (54)

We notice that if Xc(0)Span(𝑰𝛀)\vec{X}_{c}(0)\in\mathrm{Span}(\boldsymbol{I}-\boldsymbol{\Omega}), then we can express Xc(0)=jajvj\vec{X}_{c}(0)=\sum_{j}a_{j}\vec{v}_{j} where vj\vec{v}_{j} are eigenvectors associated to the eigenvalue equal to 1 of 𝑰𝛀\boldsymbol{I}-\boldsymbol{\Omega}. Thus, (𝑰𝛀)Xc(0)=jaj(𝑰𝛀)vj=jajvj=Xc(0)(\boldsymbol{I}-\boldsymbol{\Omega})\vec{X}_{c}(0)=\sum_{j}a_{j}(\boldsymbol{I}-\boldsymbol{\Omega})\vec{v}_{j}=\sum_{j}a_{j}\vec{v}_{j}=\vec{X}_{c}(0). This implies that

(𝑰𝛀)Xc(t)=Xc(0)+k=1+(−t)kk!((𝑰𝛀)𝑫)kXc(0)=e−(𝑰𝛀)𝑫tXc(0)=Xc(t)(\boldsymbol{I}-\boldsymbol{\Omega})\vec{X}_{c}(t)=\vec{X}_{c}(0)+\sum_{k=1}^{+\infty}\frac{(-t)^{k}}{k!}((\boldsymbol{I}-\boldsymbol{\Omega})\boldsymbol{D})^{k}\vec{X}_{c}(0)=e^{-(\boldsymbol{I}-\boldsymbol{\Omega})\boldsymbol{D}t}\vec{X}_{c}(0)=\vec{X}_{c}(t) (55)

Thus, we have (𝑰𝛀)Xc(t)=Xc(t)(\boldsymbol{I}-\boldsymbol{\Omega})\vec{X}_{c}(t)=\vec{X}_{c}(t), i.e. Xc(t)Span(𝑰𝛀)\vec{X}_{c}(t)\in\mathrm{Span}(\boldsymbol{I}-\boldsymbol{\Omega}). ∎

As a result of the proposition above, vector Xc(t)\vec{X}_{c}(t) is contained in the subspace spanned by 𝑰𝛀\boldsymbol{I}-\boldsymbol{\Omega} at all times and for any projector, and thus also for 𝛀1\boldsymbol{\Omega}_{1}.

Corollary 3.5.

Equation (51) implies that, if Xc(0)Span(𝑰𝛀1)\vec{X}_{c}(0)\in\mathrm{Span}(\boldsymbol{I}-\boldsymbol{\Omega}_{1}), then the solution of (50) satisfies limt(X(t)X(t)1)=0\lim_{t\rightarrow\infty}(\vec{X}(t)-\langle X(t)\rangle\vec{1})=\vec{0}, i.e. X(t)\vec{X}(t) asymptotically approaches the uniform state X1\langle X\rangle\vec{1}.

Proof.

We consider the dynamics for the modes Xc=(𝑰𝛀1)X(t)\vec{X}_{c}=(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}(t) from (51), and we use a Lyapunov stability argument. Let us consider the following functional: V(Xc)=XcXc0V(\vec{X}_{c})=\vec{X}_{c}\cdot\vec{X}_{c}\geq 0. Then,

ddtV\displaystyle\frac{d}{dt}V =2(ddtXc(t))Xc(t)\displaystyle=2\left(\frac{d}{dt}\vec{X}_{c}(t)\right)\cdot\vec{X}_{c}(t)
=2((𝑰𝛀1)𝑫Xc(t))Xc(t)\displaystyle=-2((\boldsymbol{I}-\boldsymbol{\Omega}_{1})\boldsymbol{D}\vec{X}_{c}(t))\cdot\vec{X}_{c}(t)
=2(𝑫Xc(t))𝑫(𝑰𝛀1)TXc(t)\displaystyle=-2(\sqrt{\boldsymbol{D}}\vec{X}_{c}(t))\cdot\sqrt{\boldsymbol{D}}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})^{T}\vec{X}_{c}(t)
=2(𝑫Xc(t))𝑫(𝑰𝛀1)Xc(t).\displaystyle=-2(\sqrt{\boldsymbol{D}}\vec{X}_{c}(t))\cdot\sqrt{\boldsymbol{D}}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}_{c}(t). (56)

Using the fact that Xc(t)Span(𝑰𝛀1)\vec{X}_{c}(t)\in\text{Span}(\boldsymbol{I}-\boldsymbol{\Omega}_{1}) from the previous Proposition, we obtain that

ddtV=2𝑫Xc(t)20.\frac{d}{dt}V=-2||\sqrt{\boldsymbol{D}}\vec{X}_{c}(t)||^{2}\leq 0. (57)

Since the only minimum of V(Xc)V(\vec{X}_{c}) is Xc=0\vec{X}_{c}=\vec{0}, we have Xc(t)0\vec{X}_{c}(t)\rightarrow\vec{0} for tt\rightarrow\infty, and thus XX1\vec{X}\rightarrow\langle X\rangle\vec{1} for tt\rightarrow\infty. ∎
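The span-invariance (Proposition 3.4) and decay (Corollary 3.5) statements can be verified together by integrating (51) with a non-uniform positive 𝑫; the rates below are illustrative choices.

```python
import numpy as np

N = 4
Omega1 = np.ones((N, N)) / N
I = np.eye(N)
D = np.diag([1.0, 2.0, 3.0, 4.0])            # hypothetical rates alpha_1..alpha_N
rng = np.random.default_rng(1)
Xc = (I - Omega1) @ rng.standard_normal(N)   # start in Span(I - Omega_1)
norm0 = np.linalg.norm(Xc)
dt, steps = 1e-3, 4000
max_span_error = 0.0
for _ in range(steps):
    Xc = Xc - dt * (I - Omega1) @ D @ Xc     # forward Euler for eq. (51)
    # distance of Xc from Span(I - Omega_1): should stay at roundoff level
    max_span_error = max(max_span_error,
                         float(np.linalg.norm((I - Omega1) @ Xc - Xc)))

final_ratio = float(np.linalg.norm(Xc) / norm0)   # Lyapunov bound: <= e^{-t}
```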

Propositions 3.2, 3.3 and 3.4, along with Corollary 3.5 imply that for a one-dimensional dynamical system, the PEDS 𝒪=(f(x),𝛀1,(𝑰𝛀1)X,1,N)\mathcal{O}=(f(x),\boldsymbol{\Omega}_{1},-(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X},\vec{1},N) contains the fixed points of the original dynamical system. In particular, Corollary 3.5 implies that the extended system converges to an asymptotic state of the form X(t)=x(t)1\vec{X}(t)=x(t)\vec{1}. Therefore, through the banality of the mean value Lemma, the PEDS embedding will contain the original dynamical system.

For practical purposes, it is sufficient to consider the observable x~(t)=X=1Ni,j=1NΩ1,ijXj(t)\tilde{x}(t)=\langle X\rangle=\frac{1}{N}\sum_{i,j=1}^{N}{\Omega}_{1,ij}X_{j}(t) in order to recover the location of the fixed points. Clearly, this example applies only to a one-dimensional dynamical system. However, the result can be extended to the vector case following similar considerations.

Example 3.6.

  As an example of a one dimensional dynamical system embedded in N=2N=2 dimensions, let us consider the dynamical system

dxdt=xx2\displaystyle\frac{dx}{dt}=x-x^{2} (58)

whose stable fixed point is given by x=1x^{*}=1. A PEDS embedding 𝒪=(xx2,𝛀1,α(𝑰𝛀1)X,1,2)\mathcal{O}=(x-x^{2},\boldsymbol{\Omega}_{1},-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X},\vec{1},2) is given by the two coupled differential equations:

dXdt=𝛀1(𝑿𝑿2)1α(𝑰𝛀1)X\displaystyle\frac{d\vec{X}}{dt}=\boldsymbol{\Omega}_{1}\left(\boldsymbol{X}-\boldsymbol{X}^{2}\right)\vec{1}-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X} (59)

whose components can be made explicit introducing X=12(X1+X2)\langle X\rangle=\frac{1}{2}(X_{1}+X_{2}) and evaluating the matrix expressions. The result is

dX1dt=XX2α(X1X)\displaystyle\frac{dX_{1}}{dt}=\langle X\rangle-\langle X\rangle^{2}-\alpha(X_{1}-\langle X\rangle) (60)
dX2dt=XX2α(X2X)\displaystyle\frac{dX_{2}}{dt}=\langle X\rangle-\langle X\rangle^{2}-\alpha(X_{2}-\langle X\rangle) (61)

whose stable fixed point is easily seen to be X1=X2=1X^{*}_{1}=X^{*}_{2}=1.
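The example can be reproduced numerically: the sketch below integrates (60)–(61) by forward Euler from generic inhomogeneous initial data (an arbitrary choice) and recovers the stable fixed point X₁* = X₂* = 1.

```python
import numpy as np

alpha, dt, steps = 1.0, 1e-3, 15000
X1, X2 = 0.3, 0.8                      # generic inhomogeneous initial data
for _ in range(steps):
    m = 0.5 * (X1 + X2)                # the mean <X>
    dX1 = m - m**2 - alpha * (X1 - m)  # eq. (60)
    dX2 = m - m**2 - alpha * (X2 - m)  # eq. (61)
    X1, X2 = X1 + dt * dX1, X2 + dt * dX2
```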
 

Example 3.6 highlights an important property of the uniform mean field embedding. We gain some intuition on how the uniform mean field projector works by exploiting a direct evaluation of the matrix powers. For instance, for the PEDS 𝒪=(f(x),𝛀1,α(𝑰𝛀1)X,1,N)\mathcal{O}=(f(x),\boldsymbol{\Omega}_{1},-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X},\vec{1},N), then

dXdt=𝛀1F(X)α(𝑰𝛀1)X.\displaystyle\frac{d\vec{X}}{dt}=\boldsymbol{\Omega}_{1}\vec{F}(\vec{X})-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}. (62)

Using the properties

(𝛀1𝑿)k=Xk1𝛀1𝑿k1,𝛀1𝑿1=𝛀1X=X1,𝛀11=1(\boldsymbol{\Omega}_{1}\boldsymbol{X})^{k}=\langle X\rangle^{k-1}\boldsymbol{\Omega}_{1}\boldsymbol{X}\quad k\geq 1,\qquad\boldsymbol{\Omega}_{1}\boldsymbol{X}\vec{1}=\boldsymbol{\Omega}_{1}\vec{X}=\langle X\rangle\vec{1},\qquad\boldsymbol{\Omega}_{1}\vec{1}=\vec{1}

we obtain the equivalent form for (62)

dXdt=f(X)1α(XX1).\frac{d\vec{X}}{dt}=f(\langle X\rangle)\vec{1}-\alpha(\vec{X}-\langle X\rangle\vec{1}). (63)

Multiplying (63) on the left by 𝛀1\boldsymbol{\Omega}_{1} and by 𝑰𝛀1\boldsymbol{I}-\boldsymbol{\Omega}_{1} yields

ddt(X1)\displaystyle\frac{d}{dt}\left(\langle X\rangle\vec{1}\right) =f(X)1,\displaystyle=f(\langle X\rangle)\vec{1}, (64)
ddt(XX1)\displaystyle\frac{d}{dt}\left(\vec{X}-\langle X\rangle\vec{1}\right) =α(XX1).\displaystyle=-\alpha(\vec{X}-\langle X\rangle\vec{1}). (65)

Therefore, the mean value X\langle X\rangle in (64) follows exactly the target system dynamics, while (65) asymptotically determines XX1\vec{X}\rightarrow\langle X\rangle\vec{1} for t1/αt\gg 1/\alpha. It follows that for scalar target systems, because of the identity (𝛀𝟏𝑿)k=Xk1𝛀𝟏𝑿(\boldsymbol{\Omega_{1}X})^{k}=\langle X\rangle^{k-1}\boldsymbol{\Omega_{1}X}, the standard commutative and non-commutative maps are identical. This is no longer true for vector target systems, as we discuss below.
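The key identity (𝛀₁𝑿)ᵏ = ⟨X⟩^(k−1) 𝛀₁𝑿 used in this derivation is easy to verify numerically for a random diagonal 𝑿 (the size and power below are arbitrary):

```python
import numpy as np

N, k = 5, 4
rng = np.random.default_rng(2)
x = rng.standard_normal(N)
Omega1 = np.ones((N, N)) / N
Xmat = np.diag(x)                                 # extended diagonal matrix X
lhs = np.linalg.matrix_power(Omega1 @ Xmat, k)    # (Omega_1 X)^k
rhs = x.mean()**(k - 1) * (Omega1 @ Xmat)         # <X>^(k-1) Omega_1 X
identity_error = float(np.abs(lhs - rhs).max())
```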

3.3 General case: Vector target system

We now focus on the generalization of the previous results to the case of a vector target dynamical system of arbitrary dimension. Such a generalization is more involved, mainly because the ordering SS becomes important (at least for the standard non-commutative map): matrix terms such as 𝛀𝑿i\boldsymbol{\Omega}\boldsymbol{X}_{i} and 𝛀𝑿j\boldsymbol{\Omega}\boldsymbol{X}_{j} do not commute. Nevertheless, exploiting the properties of the uniform mean field projector 𝛀1\boldsymbol{\Omega}_{1}, certain exact results can be obtained.

We consider an mm-dimensional target system as in (2), where a Taylor expansion of the defining functions fi(x1,,xm)f_{i}(x_{1},\dots,x_{m}) takes the form (25) after choosing the ordering SS. Following the PEDS procedure, we construct as a first instance the embedding map 𝒪=({fi},𝛀1,0,S,1,N)\mathcal{O}=(\{f_{i}\},\boldsymbol{\Omega}_{1},0,S,\vec{1},N) in the absence of decay functions. Given the extended variables Xs\vec{X}_{s} (s=1,,ms=1,\dots,m) of size NN, we write the embedding maps (28)–(30) as

dXsdt\displaystyle\frac{d\vec{X}_{s}}{dt} =𝛀1k=0i1,,imkas,k;i1im𝑿1i1𝑿mim1\displaystyle=\boldsymbol{\Omega}_{1}\sum_{k=0}^{\infty}\sum_{i_{1},\dots,i_{m}}^{k}a_{s,k;i_{1}\dots i_{m}}\boldsymbol{X}_{1}^{i_{1}}\dots\boldsymbol{X}_{m}^{i_{m}}\vec{1}
standard commuting map (66a)
dXsdt\displaystyle\frac{d\vec{X}_{s}}{dt} =as,0𝛀11+𝛀1k=1i1,,imkas,k;i1,,im(𝛀1(𝑿1i1𝑿mim)1/k)k1\displaystyle=a_{s,0}\boldsymbol{\Omega}_{1}\vec{1}+\boldsymbol{\Omega}_{1}\sum_{k=1}^{\infty}\sum_{i_{1},\cdots,i_{m}}^{k}a_{s,k;i_{1},\cdots,i_{m}}(\boldsymbol{\Omega}_{1}(\boldsymbol{X}_{1}^{i_{1}}\cdots\boldsymbol{X}_{m}^{i_{m}})^{1/k})^{k}\vec{1}
mixed commuting map (66b)
dXsdt\displaystyle\frac{d\vec{X}_{s}}{dt} =𝛀1k=0i1,,imkas,k;i1im{(𝛀1𝑿1)i1(𝛀1𝑿m)im}S1\displaystyle=\boldsymbol{\Omega}_{1}\sum_{k=0}^{\infty}\sum_{i_{1},\dots,i_{m}}^{k}a_{s,k;i_{1}\dots i_{m}}\{(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{1})^{i_{1}}\dots(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{m})^{i_{m}}\}_{S}\vec{1}
standard noncommuting map (66c)

where SS is the chosen ordering of the variables. The question is whether the banality of the mean value (Proposition 3.2) still holds in this case as well.

An easy proof shows that assuming a solution Xs=xs1\vec{X}_{s}=x_{s}\vec{1}, the banality of the mean value lemma applies also in the vector case, irrespective of the chosen extended variable ordering.

Corollary 3.7.

Multivariate banality of the mean value. Let 𝛀1\boldsymbol{\Omega}_{1} be the uniform mean field projector, and 𝒪=({fi},𝛀1,{α(𝐈𝛀1)Xi},S,1,N)\mathcal{O}=(\{f_{i}\},\boldsymbol{\Omega}_{1},\{-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}_{i}\},S,\vec{1},N) the embedding with SS an arbitrary ordering. Following the PEDS procedure, the dynamics of the mm variables Xi\vec{X}_{i} is determined by

dXidt=𝛀1Fi(X1,,Xm)α(𝑰𝛀1)Xi\displaystyle\frac{d\vec{X}_{i}}{dt}=\boldsymbol{\Omega}_{1}\vec{F}_{i}(\vec{X}_{1},\cdots,\vec{X}_{m})-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}_{i} (67)

where Fi=𝐅i(X1,,Xm)1\vec{F}_{i}=\boldsymbol{F}_{i}(\vec{X}_{1},\cdots,\vec{X}_{m})\vec{1} is the vector constructed following the PEDS procedure defined in (66). We define the projection operator 𝒫𝛀1=1N1T\mathcal{P}_{\boldsymbol{\Omega}_{1}}=\frac{1}{N}\vec{1}^{T}, and the projected variables Xi(t)=𝒫𝛀1Xi(t)\langle X_{i}(t)\rangle=\mathcal{P}_{\boldsymbol{\Omega}_{1}}\vec{X}_{i}(t). Then, the following two statements hold true

(a)ddtXi=0ddtxi=0,(a)\ \ \ \frac{d}{dt}\langle X_{i}\rangle=0\Longrightarrow\frac{d}{dt}x_{i}=0, (68)

and (b) if the extended system approaches a fixed point Xi\vec{X}_{i}^{*} for times t1/αt\gg{1}/{\alpha}, then the projection 𝒫𝛀1Xi\mathcal{P}_{\boldsymbol{\Omega}_{1}}\vec{X}_{i}^{*} is a fixed point of the target system.

Proof.

Let us first prove statement (a)(a), which is a corollary of the banality of the mean value Proposition 3.2. We set Xi(t)=Xi(t)1\vec{X}_{i}(t)=\langle X_{i}(t)\rangle\vec{1}. In all cases of the standard commutative, mixed commutative and non-commutative maps, the standard decay function is identically zero, and it is not hard to see that, for an arbitrary ordering SS, we have in all cases the same expression for each term of the expansion:

𝛀1(𝑿1)i1(𝑿m)im1(𝛀1(𝑿1)i1kik(𝑿m)imkik)kik1{(𝛀1𝑿1)i1(𝛀1𝑿m)im}S1}\displaystyle\left.\begin{array}[]{c}\displaystyle\boldsymbol{\Omega}_{1}(\boldsymbol{X}_{1})^{i_{1}}\cdots(\boldsymbol{X}_{m})^{i_{m}}\vec{1}\\[4.30554pt] \displaystyle\left(\boldsymbol{\Omega}_{1}(\boldsymbol{X}_{1})^{\frac{i_{1}}{\sum_{k}i_{k}}}\cdots(\boldsymbol{X}_{m})^{\frac{i_{m}}{\sum_{k}i_{k}}}\right)^{\sum_{k}i_{k}}\vec{1}\\[8.61108pt] \displaystyle\{(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{1})^{i_{1}}\cdots(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{m})^{i_{m}}\}_{S}\vec{1}\end{array}\right\} =X1(t)i1Xm(t)im𝛀11\displaystyle=\langle X_{1}(t)\rangle^{i_{1}}\cdots\langle X_{m}(t)\rangle^{i_{m}}\boldsymbol{\Omega}_{1}\vec{1} (72)
=X1(t)i1Xm(t)im1\displaystyle=\langle X_{1}(t)\rangle^{i_{1}}\cdots\langle X_{m}(t)\rangle^{i_{m}}\vec{1} (73)

As a consequence, a relatively simple calculation shows

ddtXi=ddtXi(t)1=fi(X1(t),,Xm(t))𝛀11=fi(X1(t),,Xm(t))1,\displaystyle\frac{d}{dt}\vec{X}_{i}=\frac{d}{dt}\langle X_{i}(t)\rangle\vec{1}=f_{i}(\langle X_{1}(t)\rangle,\dots,\langle X_{m}(t)\rangle)\boldsymbol{\Omega}_{1}\vec{1}=f_{i}(\langle X_{1}(t)\rangle,\dots,\langle X_{m}(t)\rangle)\vec{1}, (74)

or

(ddtXi(t)fi(X1(t),,Xm(t)))1=0.\displaystyle\left(\frac{d}{dt}\langle X_{i}(t)\rangle-f_{i}(\langle X_{1}(t)\rangle,\dots,\langle X_{m}(t)\rangle)\right)\vec{1}=0. (75)

Replacing Xi(t)\langle X_{i}(t)\rangle with xi(t)x_{i}(t), we obtain that the fixed points of the extended system under the assumption Xi(t)=Xi(t)1\vec{X}_{i}(t)=\langle X_{i}(t)\rangle\vec{1} must be the same as for the target dynamical system.

We now turn to statement (b)(b). The initial condition Xi(t=0)\vec{X}_{i}(t=0) is now arbitrary. Multiplying on the left (67) times (𝑰𝛀1)(\boldsymbol{I}-\boldsymbol{\Omega}_{1}), we get

(𝑰𝛀1)dXidt=α(𝑰𝛀1)Xi(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\frac{d\vec{X}_{i}}{dt}=-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}_{i} (76)

where we define Xi,c=(𝑰𝛀1)Xi\vec{X}_{i,c}=(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}_{i}. As Span(𝑰𝛀1)=Ker(𝛀1)(\boldsymbol{I}-\boldsymbol{\Omega}_{1})=\mathrm{Ker}(\boldsymbol{\Omega}_{1}), Xi,c\vec{X}_{i,c} can be interpreted as a deviation from the average, since Xi=Xi1+Xi,c\vec{X}_{i}=\langle X_{i}\rangle\vec{1}+\vec{X}_{i,c}. Following almost the same steps as in the proof of the convergence of the mean for the one dimensional system, we arrive at

dXi,cdt=αXi,cXi,c(t)0 for t1/α, hence\displaystyle\frac{d\vec{X}_{i,c}}{dt}=-\alpha\vec{X}_{i,c}\implies\vec{X}_{i,c}(t)\rightarrow\vec{0}\text{ for }t\gg 1/\alpha,\text{ hence}
(ddtXi(t)fi(X1(t),,Xm(t)))1\displaystyle\left(\frac{d}{dt}\langle X_{i}(t)\rangle-f_{i}(\langle X_{1}(t)\rangle,\dots,\langle X_{m}(t)\rangle)\right)\vec{1}
=(ddt𝒫𝛀1Xi(t)fi(𝒫𝛀1X1(t),,𝒫𝛀1Xm(t)))10, for t1α.\displaystyle=\Big{(}\frac{d}{dt}\mathcal{P}_{\boldsymbol{\Omega}_{1}}\vec{X}_{i}(t)-f_{i}(\mathcal{P}_{\boldsymbol{\Omega}_{1}}\vec{X}_{1}(t),\dots,\mathcal{P}_{\boldsymbol{\Omega}_{1}}\vec{X}_{m}(t))\Big{)}\vec{1}\rightarrow\vec{0},\text{ for }t\gg\frac{1}{\alpha}. (77)

where the second expression follows from the first since α>0\alpha>0 implies Xi,c(t)0\vec{X}_{i,c}(t)\rightarrow\vec{0} for t1/αt\gg 1/\alpha. Essentially, this implies that the extended system fixed points are those of the target dynamical system: for long enough times the system converges exponentially to the mean in each variable, for which the banality lemma applies. Thus, if the projected PEDS given by 𝒪\mathcal{O} approaches a fixed point, it must be a fixed point of the target system; otherwise, the system does not converge. ∎
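As a numerical sketch of Corollary 3.7, the code below embeds a hypothetical two-variable target (f₁ = x₂ − x₁, f₂ = 1 − x₁x₂, an illustrative choice not taken from the text, with a stable fixed point at (1, 1)) via (67) and recovers that fixed point from inhomogeneous initial data.

```python
import numpy as np

# Hypothetical 2-variable target system with a stable spiral at (1, 1).
def f1(x1, x2): return x2 - x1
def f2(x1, x2): return 1.0 - x1 * x2

N, alpha, dt, steps = 4, 2.0, 1e-3, 12000
Omega1 = np.ones((N, N)) / N
I = np.eye(N)
rng = np.random.default_rng(3)
X1 = rng.uniform(0.5, 1.5, N)          # inhomogeneous initial conditions
X2 = rng.uniform(0.5, 1.5, N)
for _ in range(steps):
    d1 = Omega1 @ f1(X1, X2) - alpha * (I - Omega1) @ X1   # eq. (67), i = 1
    d2 = Omega1 @ f2(X1, X2) - alpha * (I - Omega1) @ X2   # eq. (67), i = 2
    X1, X2 = X1 + dt * d1, X2 + dt * d2
```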

The results of this section show that at least one type of PEDS exists which preserves the fixed points of the target system, thus justifying the entire construction of the PEDS embedding. We now focus the attention on how the PEDS procedure modifies the properties of the fixed points, by analyzing their stability. Therefore, we need to look at the spectral properties of the Jacobian at the embedding fixed points.

3.4 Properties of the Jacobian and fixed points

Let us now investigate the properties of the Jacobian.

We begin with the one dimensional case

dXdt=𝛀1𝑭11α(𝑰𝛀1)X2\displaystyle\frac{d\vec{X}}{dt}=\underbrace{\boldsymbol{\Omega}_{1}\boldsymbol{F}\vec{1}}_{1}-\underbrace{\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}}_{2} (78)

For the sake of generality, we consider also the generalized decay functions in (49)

dXdt=𝛀1𝑭11{𝑫(𝑰𝛀1)X2Ageneralization A(𝑰𝛀1)𝑫(𝑰𝛀1)X2Bgeneralization B\displaystyle\frac{d\vec{X}}{dt}=\underbrace{\boldsymbol{\Omega}_{1}\boldsymbol{F}\vec{1}}_{1}-\begin{cases}\underbrace{\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}}_{2A}&\text{generalization A}\\ \underbrace{(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}}_{2B}&\text{generalization B}\end{cases} (79)

As usual, we shall derive the results for a general projector 𝛀\boldsymbol{\Omega} whenever possible.

3.4.1 Simple case: scalar target system

We will focus first on the PEDS of a one dimensional target system, as in this case the ordering is immaterial and proofs are easier to carry out. Initially we consider the PEDS map 𝒪=(f(x),𝛀1,α(𝑰𝛀1)X,b,N)\mathcal{O}=(f(x),\boldsymbol{\Omega}_{1},-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X},\vec{b},N), i.e. the standard decay function. The embedding thus takes the form

ddtXi=(𝛀1𝑭bα(𝑰𝛀1)X)i=Mi.\displaystyle\frac{d}{dt}X_{i}=(\boldsymbol{\Omega}_{1}\boldsymbol{F}\vec{b}-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X})_{i}=M_{i}. (80)

We prove the following:

Proposition 3.8.

One dimensional PEDS Jacobian
For a scalar target system, a PEDS of the form 𝒪=(f(x),𝛀,α(𝐈𝛀)X,b,N)\mathcal{O}=(f(x),\boldsymbol{\Omega},-\alpha(\boldsymbol{I}-\boldsymbol{\Omega})\vec{X},\vec{b},N) is characterized by the following functional form of the Jacobian

1. For the standard non-commuting map

Jir(nc)=j=1Nl=1NΩilz=1azk=0z1s=1N(𝛀𝑿)lskΩsr(𝛀𝑿)rjz1kbjα(𝑰𝛀)ir,{J}^{(nc)}_{ir}=\sum_{j=1}^{N}\sum_{l=1}^{N}{\Omega}_{il}\sum_{z=1}^{\infty}a_{z}\sum_{k=0}^{z-1}\sum_{s=1}^{N}(\boldsymbol{\Omega}\boldsymbol{X})^{k}_{ls}{\Omega}_{sr}(\boldsymbol{\Omega}\boldsymbol{X})^{z-1-k}_{rj}b_{j}-\alpha(\boldsymbol{I}-\boldsymbol{\Omega})_{ir}, (81)

2. For the standard commuting map

Jir(c)=Ωirf(Xr)brα(𝑰𝛀)ir.\displaystyle{J}^{(c)}_{ir}={\Omega}_{ir}f^{\prime}(X_{r})b_{r}-\alpha(\boldsymbol{I}-\boldsymbol{\Omega})_{ir}. (82)
Proof.

The Jacobian elements are defined as

Jir=MiXr=Xrj=1N(𝛀𝑭)ijbjαXr((𝑰𝛀)X)i.\displaystyle{J}_{ir}=\frac{\partial{M}_{i}}{\partial X_{r}}=\frac{\partial}{\partial X_{r}}\sum_{j=1}^{N}(\boldsymbol{\Omega F})_{ij}b_{j}-\alpha\frac{\partial}{\partial X_{r}}((\boldsymbol{I}-\boldsymbol{\Omega})\vec{X})_{i}. (83)

As we need to evaluate the derivatives of the matrix maps, we consider first the derivatives of the matrix quantities depending on 𝑿\boldsymbol{X}. We have

Xr(𝛀𝑿)ijm\displaystyle\frac{\partial}{\partial X_{r}}(\boldsymbol{\Omega}\boldsymbol{X})^{m}_{ij} =\displaystyle= k=0m1s=1N(𝛀𝑿)iskl,t=1N(ΩslXlδlt)Xr(𝛀𝑿)tjm1k\displaystyle\sum_{k=0}^{m-1}\sum_{s=1}^{N}(\boldsymbol{\Omega}\boldsymbol{X})^{k}_{is}\sum_{l,t=1}^{N}\frac{\partial({\Omega}_{sl}X_{l}\delta_{lt})}{\partial X_{r}}(\boldsymbol{\Omega}\boldsymbol{X})^{m-1-k}_{tj} (84)
=\displaystyle= k=0m1s=1N(𝛀𝑿)iskΩsr(𝛀𝑿)rjm1k,\displaystyle\sum_{k=0}^{m-1}\sum_{s=1}^{N}(\boldsymbol{\Omega}\boldsymbol{X})^{k}_{is}{\Omega}_{sr}(\boldsymbol{\Omega}\boldsymbol{X})^{m-1-k}_{rj},

where a matrix to zero power coincides with the identity matrix.

Taking into account definition (30) we find

Fij(nc)Xr\displaystyle\frac{\partial{F}_{ij}^{(nc)}}{\partial X_{r}} =\displaystyle= l=1NΩilXrz=1az(𝛀𝑿)ljz\displaystyle\sum_{l=1}^{N}{\Omega}_{il}\frac{\partial}{\partial{X_{r}}}\sum_{z=1}^{\infty}a_{z}(\boldsymbol{\Omega}\boldsymbol{X})^{z}_{lj} (85)
=\displaystyle= l=1NΩilz=1azk=0z1s=1N(𝛀𝑿)lskΩsr(𝛀𝑿)rjz1k\displaystyle\sum_{l=1}^{N}{\Omega}_{il}\sum_{z=1}^{\infty}a_{z}\sum_{k=0}^{z-1}\sum_{s=1}^{N}(\boldsymbol{\Omega}\boldsymbol{X})^{k}_{ls}{\Omega}_{sr}(\boldsymbol{\Omega}\boldsymbol{X})^{z-1-k}_{rj}

Thus, the first term of the Jacobian is simply given by

Jir(nc,1)=j=1Nl=1NΩilz=1azk=0z1s=1N(𝛀𝑿)lskΩsr(𝛀𝑿)rjz1kbj.\displaystyle{J}^{(nc,1)}_{ir}=\sum_{j=1}^{N}\sum_{l=1}^{N}{\Omega}_{il}\sum_{z=1}^{\infty}a_{z}\sum_{k=0}^{z-1}\sum_{s=1}^{N}(\boldsymbol{\Omega}\boldsymbol{X})^{k}_{ls}{\Omega}_{sr}(\boldsymbol{\Omega}\boldsymbol{X})^{z-1-k}_{rj}b_{j}. (86)

In the case of the standard commutative map, the result can be easily derived exploiting (31) if function f(x)f(x) is known in closed form. On the other hand, making use of the power expansion of the function, we can directly calculate the derivatives noticing that

XrXijk=Xr(diag(X1k,,XNk))ij=(diag(0,,kXrk1,,0))ij=kδijδirXrk1\displaystyle\frac{\partial}{\partial X_{r}}{X}^{k}_{ij}=\frac{\partial}{\partial X_{r}}(\text{diag}(X_{1}^{k},\cdots,X_{N}^{k}))_{ij}=(\text{diag}(0,\dots,kX_{r}^{k-1},\dots,0))_{ij}=k\delta_{ij}\delta_{ir}X_{r}^{k-1} (87)

so that

Jir(c,1)\displaystyle{J}^{(c,1)}_{ir} =\displaystyle= j=1N(𝛀𝑭(c))ijXrbj\displaystyle\sum_{j=1}^{N}\frac{\partial\left(\boldsymbol{\Omega}\boldsymbol{F}^{(c)}\right)_{ij}}{\partial X_{r}}b_{j} (88)
=\displaystyle= j=1Nl=1NΩilz=0azXljzXrbj\displaystyle\sum_{j=1}^{N}\sum_{l=1}^{N}{\Omega}_{il}\sum_{z=0}^{\infty}a_{z}\frac{\partial{X}^{z}_{lj}}{\partial X_{r}}b_{j}
=\displaystyle= j=1Nl=1NΩilz=1zazδljδlrXrz1bj\displaystyle\sum_{j=1}^{N}\sum_{l=1}^{N}{\Omega}_{il}\sum_{z=1}^{\infty}za_{z}\delta_{lj}\delta_{lr}X_{r}^{z-1}b_{j}
=\displaystyle= Ωirf(Xr)br\displaystyle{\Omega}_{ir}f^{\prime}(X_{r})b_{r}

where f(x)f^{\prime}(x) denotes the derivative of f(x)f(x).

Similarly, for the standard decay function 𝑫=α𝑰\boldsymbol{D}=\alpha\boldsymbol{I}, it is not hard to see that the second term of the Jacobian is the same irrespective of the chosen matrix embedding

Jir(2)\displaystyle{J}^{(2)}_{ir} =\displaystyle= αj=1NXr((𝑰𝛀)𝑿)ij\displaystyle-\alpha\sum_{j=1}^{N}\frac{\partial}{\partial X_{r}}\Big{(}(\boldsymbol{I}-\boldsymbol{\Omega})\boldsymbol{X}\Big{)}_{ij} (89)
=\displaystyle= αj=1N(𝑰𝛀)irδrj=α(𝑰𝛀)ir\displaystyle-\alpha\sum_{j=1}^{N}(\boldsymbol{I}-\boldsymbol{\Omega})_{ir}\delta_{rj}=-\alpha(\boldsymbol{I}-\boldsymbol{\Omega})_{ir}

Summing 𝑱(1)\boldsymbol{J}^{(1)} and 𝑱(2)\boldsymbol{J}^{(2)}, we find the expression to be proven. ∎

As a direct Corollary of Proposition 3.8, we obtain that for 𝛀=𝛀1\boldsymbol{\Omega}=\boldsymbol{\Omega}_{1} and b=1\vec{b}=\vec{1}, the Jacobian takes a simpler form.

Corollary 3.9.

Consider the uniform mean field PEDS with standard decay function 𝒪=(f(x),𝛀1,α(𝐈𝛀1)X,1,N)\mathcal{O}=(f(x),\boldsymbol{\Omega}_{1},-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X},\vec{1},N) of a scalar dynamical system characterized by a fixed point xx^{\ast}. The Jacobian of the PEDS in its fixed point X=x1\vec{X}^{\ast}=x^{\ast}\vec{1} is given by

Jir(X)=αδir+1N(f(x)+α)𝑱(X)=α𝑰+(f(x)+α)𝛀1,{J}_{ir}(\vec{X}^{\ast})=-\alpha{\delta}_{ir}+\frac{1}{N}(f^{\prime}(x^{\ast})+\alpha)\Longleftrightarrow\boldsymbol{J}(\vec{X}^{\ast})=-\alpha\boldsymbol{I}+(f^{\prime}(x^{\ast})+\alpha)\boldsymbol{\Omega}_{1}, (90)

both for the standard commutative and non-commutative maps.

Proof.

We exploit Proposition 3.8, considering 𝛀=𝛀1\boldsymbol{\Omega}=\boldsymbol{\Omega}_{1} and b=1\vec{b}=\vec{1}, from which, because of Proposition 3.2, we have 𝑿=x𝑰\boldsymbol{X}^{\ast}=x^{\ast}\boldsymbol{I}. Substituting into (85), we obtain

Jir(nc,1)(X)\displaystyle{J}^{(nc,1)}_{ir}(\vec{X}^{\ast}) =\displaystyle= j=1Nl=1NΩ1,ilz=1az(x)z1k=0z1s=1NΩ1,lskΩ1,srΩ1,rjz1k\displaystyle\sum_{j=1}^{N}\sum_{l=1}^{N}{\Omega}_{1,il}\sum_{z=1}^{\infty}a_{z}(x^{\ast})^{z-1}\sum_{k=0}^{z-1}\sum_{s=1}^{N}\Omega^{k}_{1,ls}{\Omega}_{1,sr}\Omega^{z-1-k}_{1,rj} (91)
=\displaystyle= z=1az(x)z1k=0z1s=1NΩ1,isΩ1,srj=1NΩ1,rjz1k\displaystyle\sum_{z=1}^{\infty}a_{z}(x^{\ast})^{z-1}\sum_{k=0}^{z-1}\sum_{s=1}^{N}{\Omega}_{1,is}{\Omega}_{1,sr}\sum_{j=1}^{N}{\Omega}^{z-1-k}_{1,rj}

Since 𝑰1=𝛀11=1\boldsymbol{I}\vec{1}=\boldsymbol{\Omega}_{1}\vec{1}=\vec{1}, we find

Jir(nc,1)(X)\displaystyle{J}^{(nc,1)}_{ir}(\vec{X}^{\ast}) =\displaystyle= z=1az(x)z1k=0z1Ω1,ir\displaystyle\sum_{z=1}^{\infty}a_{z}(x^{\ast})^{z-1}\sum_{k=0}^{z-1}{\Omega}_{1,ir} (92)
=\displaystyle= z=1zaz(x)z1Ω1,ir=f(x)Ω1,ir\displaystyle\sum_{z=1}^{\infty}za_{z}(x^{\ast})^{z-1}{\Omega}_{1,ir}=f^{\prime}(x^{\ast}){\Omega}_{1,ir}

Therefore, under these conditions the first part of the Jacobian takes the same expression as for the standard commutative map. Adding the second term (89) finally yields, for both maps,

Jir(X)\displaystyle{J}_{ir}(\vec{X}^{\ast}) =\displaystyle= f(x)Ω1,irα(𝑰𝛀1)ir=αδir+(f(x)+α)Ω1,ir\displaystyle f^{\prime}(x^{\ast}){\Omega}_{1,ir}-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})_{ir}=-\alpha{\delta}_{ir}+(f^{\prime}(x^{\ast})+\alpha){\Omega}_{1,ir} (93)
=\displaystyle= αδir+1N(f(x)+α).\displaystyle-\alpha{\delta}_{ir}+\frac{1}{N}(f^{\prime}(x^{\ast})+\alpha).

At this point we can start to draw some partial conclusions. In fact, we can use the properties of the fixed points of the target system to understand how these are transformed by the embedding procedure. The target system equilibrium is unstable if f(x)>0f^{\prime}(x^{\ast})>0, while it is stable for f(x)<0f^{\prime}(x^{\ast})<0. Finally, f(x)=0f^{\prime}(x^{\ast})=0 corresponds to a saddle. Since any scalar dynamical system is a gradient system, it can be expressed as

dxdt=f(x)=V(x)x,\displaystyle\frac{dx}{dt}=f(x)=-\frac{\partial V(x)}{\partial x}, (94)

where VV is a potential function. The extrema xx^{\ast} of V(x)V(x) correspond to minima, maxima and saddles, as we show in Fig. 2.

For the PEDS 𝒪=(f(x),𝛀1,α(𝑰𝛀1)X,1,N)\mathcal{O}=(f(x),\boldsymbol{\Omega}_{1},-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X},\vec{1},N), Corollary 3.9 implies that the Jacobian spectrum follows from the spectral properties of 𝛀1\boldsymbol{\Omega}_{1}. In fact, 𝛀1\boldsymbol{\Omega}_{1} has one eigenvalue equal to 11, and N1N-1 identical, null eigenvalues. Thus, the spectrum Λ\Lambda of the Jacobian at X=x1\vec{X}^{*}=x^{\ast}\vec{1} is given by

Λ(𝑱(x))={{α}N1,{f(x)}1},\displaystyle\Lambda(\boldsymbol{J}(x^{\ast}))=\{\{-\alpha\}_{N-1},\{f^{\prime}(x^{\ast})\}_{1}\}, (95)

i.e. α-\alpha with multiplicity N1N-1, and f(x)f^{\prime}(x^{\ast}) with multiplicity 1.
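This spectral structure is easy to verify numerically. The following sketch (a numerical illustration added for concreteness, not part of the original derivation; the choices f(x) = x − x³ with stable fixed point x* = 1, α = 1, and N = 6 are ours) builds the Jacobian of Corollary 3.9 and checks the spectrum (95):

```python
import numpy as np

N, alpha = 6, 1.0
x_star = 1.0                      # stable fixed point of f(x) = x - x^3
fprime = 1.0 - 3.0 * x_star**2    # f'(x*) = -2

Omega1 = np.ones((N, N)) / N      # uniform mean field projector (rank 1)
J = -alpha * np.eye(N) + (fprime + alpha) * Omega1   # Eq. (90)

eigs = np.sort(np.linalg.eigvalsh(J))
# spectrum (95): -alpha with multiplicity N-1 and f'(x*) with multiplicity 1
print(eigs)
```

With these values f′(x*) = −2, so the sorted spectrum consists of −2 once and −α = −1 with multiplicity N − 1.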

Figure 2: Interpreting the one dimensional dynamical system as the gradient of a potential, via f(x)=V(x)/xf(x)=-\partial V(x)/\partial x.

We can therefore carry out a stability analysis of the PEDS fixed point X\vec{X}^{\ast} as follows:

Xis{stableif x is stable,a saddle pointif x is a saddle point,a saddle pointif x is unstable.\displaystyle\vec{X}^{*}~{}\text{is}~{}\begin{cases}\text{stable}&\text{if }x^{*}\text{ is stable},\\ \text{a saddle point}&\text{if }x^{*}\text{ is a saddle point},\\ \text{a saddle point}&\text{if }x^{*}\text{ is unstable}.\end{cases} (96)

We thus see the benefits of the PEDS from the point of view of the target system: while in the scalar case “barriers” can be present, the embedding makes them disappear. Although this specific feature is peculiar to scalar target systems, this result will later be useful also for vector target systems. A graphical representation is shown in Fig. 3.

We can conclude that the PEDS procedure 𝒪=(f(x),𝛀1,α(𝑰𝛀1)X,1,N)\mathcal{O}=(f(x),\boldsymbol{\Omega}_{1},-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X},\vec{1},N) preserves stable and saddle fixed points of the target dynamics, while it turns unstable fixed points into saddle points.
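As a sanity check of this classification, one can integrate a small uniform mean field PEDS directly. The sketch below is our own illustration, not part of the original text: we take f(x) = x − x³, whose unstable point x = 0 becomes a saddle in the embedding, and use simple explicit Euler integration of the commutative-map PEDS equations.

```python
import numpy as np

def f(x):                 # target system: stable at x = +1 and -1, unstable at 0
    return x - x**3

N, alpha, dt = 8, 1.0, 1e-2
Omega1 = np.ones((N, N)) / N

rng = np.random.default_rng(0)
X = 1e-3 * rng.standard_normal(N)    # start near the unstable point x* = 0

for _ in range(5000):
    # scalar commutative-map PEDS with standard decay function
    X = X + dt * (Omega1 @ f(X) - alpha * (np.eye(N) - Omega1) @ X)

print(X.mean())   # the mean field escapes 0 and settles on a stable equilibrium
```

The transverse directions contract at rate α exactly, so the dynamics rapidly collapses onto the synchronized manifold X = x 1⃗, on which the mean obeys the original scalar equation and flows to one of the stable equilibria.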

Let us now consider the Jacobian properties in presence of the generalized decay functions in (79). A simple generalization of (89) yields

Jir(2)(X)={(𝑫(𝑰𝛀))irgeneralization A((𝑰𝛀)𝑫(𝑰𝛀))irgeneralization B.\displaystyle{J}^{(2)}_{ir}(\vec{X}^{*})=\begin{cases}-\Big{(}\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega})\Big{)}_{ir}&\text{generalization A}\\[4.30554pt] -\Big{(}(\boldsymbol{I}-\boldsymbol{\Omega})\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega})\Big{)}_{ir}&\text{generalization B}.\end{cases} (97)

Therefore, the full Jacobian for 𝛀=𝛀1\boldsymbol{\Omega}=\boldsymbol{\Omega}_{1} and b=1\vec{b}=\vec{1}, becomes

𝑱(X)=f(x)𝛀1{𝑫(𝑰𝛀1)generalization A(𝑰𝛀1)𝑫(𝑰𝛀1)generalization B.\displaystyle\boldsymbol{J}(\vec{X}^{*})=f^{\prime}(x^{*})\boldsymbol{\Omega}_{1}-\begin{cases}\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})&\text{generalization A}\\[4.30554pt] (\boldsymbol{I}-\boldsymbol{\Omega}_{1})\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})&\text{generalization B}.\end{cases} (98)

We wish, now, to determine the spectrum Λ(𝑱(X))\Lambda(\boldsymbol{J}(\vec{X}^{*})). We consider the two generalizations separately:

  • 1.

    Generalization B. As a consequence of the identities $\boldsymbol{\Omega}(\boldsymbol{I}-\boldsymbol{\Omega})=(\boldsymbol{I}-\boldsymbol{\Omega})\boldsymbol{\Omega}=\boldsymbol{0}$, we can deduce $[\boldsymbol{J}^{(1)},\boldsymbol{J}^{(2)}]=\boldsymbol{0}$ for all $\boldsymbol{D}$. Thus, both Jacobian components can be diagonalized in the same basis, assembled in a matrix $\boldsymbol{T}$ such that $\boldsymbol{T}\boldsymbol{\Omega}\boldsymbol{T}^{-1}=\boldsymbol{D}_{1}$ and $\boldsymbol{T}(\boldsymbol{I}-\boldsymbol{\Omega})\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega})\boldsymbol{T}^{-1}=\boldsymbol{D}_{2}$, and the eigenvalues of $\boldsymbol{J}$ are the diagonal elements of $f^{\prime}(x^{*})\boldsymbol{D}_{1}-\boldsymbol{D}_{2}$. Since $\mathrm{Span}(\boldsymbol{\Omega})=\mathrm{Ker}((\boldsymbol{I}-\boldsymbol{\Omega})\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega}))$ and $\mathrm{Span}(\boldsymbol{\Omega})\oplus\mathrm{Span}((\boldsymbol{I}-\boldsymbol{\Omega})\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega}))=\mathbb{R}^{N}$, we can study the eigenvalues of the two addends of $\boldsymbol{J}$ separately. For $\boldsymbol{J}^{(1)}=f^{\prime}(x^{*})\boldsymbol{\Omega}$, there are $N-M$ eigenvalues equal to $0$ and $M$ identical eigenvalues $\lambda=f^{\prime}(x^{*})$. For $(\boldsymbol{I}-\boldsymbol{\Omega})\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega})$, there are $M$ null eigenvalues, while the remaining $N-M$ eigenvalues satisfy $0<\lambda\leq\max\{D_{ii}\}$. Here $M$ is the dimension of $\mathrm{Span}(\boldsymbol{\Omega})$, i.e. the rank of the projector, equal to $1$ for $\boldsymbol{\Omega}_{1}$.

  • 2.

    Generalization A. This case is slightly more complicated, since the Jacobian is not symmetric and, thus, its eigenvalues can be complex. We can provide some results exploiting Gerschgorin’s theorem

    |λJii|ki|Jik|.\displaystyle|\lambda-{J}_{ii}|\leq\sum_{k\neq i}|{J}_{ik}|. (99)

    Since 𝛀=𝛀1\boldsymbol{\Omega}=\boldsymbol{\Omega}_{1} and b=1\vec{b}=\vec{1}, we have

    Jii=1Nf(x)N1NDiiki|Jki|=N1N|f(x)+Dii|=Ri{J}_{ii}=\frac{1}{N}f^{\prime}(x^{*})-\frac{N-1}{N}D_{ii}\qquad\sum_{k\neq i}|{J}_{ki}|=\frac{N-1}{N}|f^{\prime}(x^{*})+D_{ii}|=R_{i}

    Let $\bar{D}=\max_{i}|D_{ii}|$ and $\bar{R}=\frac{N-1}{N}\left(|f^{\prime}(x^{*})|+\bar{D}\right)$, so that $R_{i}\leq\bar{R}$ for every $i$. It follows that the eigenvalues $\lambda\in\Lambda(\boldsymbol{J})$ must be enclosed, in the complex plane, in circles of radius $\bar{R}$ and center $z_{i}=\frac{1}{N}f^{\prime}(x^{*})-\frac{N-1}{N}D_{ii}$.
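Both eigenvalue statements can be probed numerically. The sketch below is our own illustration, under our own choices f′(x*) = −2 and a generic positive diagonal D: it checks the spectrum of the generalization B Jacobian and the row-sum Gershgorin disks for generalization A.

```python
import numpy as np

N = 6
fprime = -2.0                          # f'(x*) at a stable target fixed point
Omega1 = np.ones((N, N)) / N
I = np.eye(N)
D = np.diag(np.linspace(1.0, 3.0, N))  # generic positive diagonal D

# generalization B: J = f' Omega1 - (I - Omega1) D (I - Omega1), symmetric
J_B = fprime * Omega1 - (I - Omega1) @ D @ (I - Omega1)
eigs_B = np.sort(np.linalg.eigvalsh(J_B))
# one eigenvalue equals f'(x*) (eigenvector 1); the other N-1 lie in [-max D_ii, 0)

# generalization A: J = f' Omega1 - D (I - Omega1), not symmetric
J_A = fprime * Omega1 - D @ (I - Omega1)
eigs_A = np.linalg.eigvals(J_A)
# row-sum Gershgorin disks: centers J_ii, radii sum_{k != i} |J_ik|
centers = np.diag(J_A)
radii = np.sum(np.abs(J_A), axis=1) - np.abs(centers)
```

For generalization B, 1⃗ is an exact eigenvector with eigenvalue f′(x*), while the remaining eigenvalues are negative and bounded below by −max D_ii; for generalization A, every (possibly complex) eigenvalue lies in the union of the Gershgorin disks.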

Figure 3: Interpreting the NN dimensional PEDS dynamical system fixed points, from the point of view of the potential in Fig. 2 (blue curve). Notice that while the PEDS procedure preserves the extrema of potential V(x)V(x), the embedded dynamical system cannot, in general, be expressed as the gradient of a generalized potential.

3.4.2 General case: Vector target system

The analysis of the PEDS embedding for scalar target systems carried out in the previous section showed that exploiting the uniform mean field projector and the standard or generalized decay functions, the embedding Jacobian can be fully characterized on the basis of the features of the target system fixed points (stable, unstable, or neutral). We have obtained this result both for the standard commutative map, in which essentially one “mixes” linearly the dynamical systems, and in the case of the standard non-commuting map, in which the mix is non-trivial. The difference between the two is, thus, essentially contained in the embedding intermediate dynamics.

While in higher dimensions the nature of the questions to be answered is quite similar, the derivations are technically more challenging. The reason lies in the ordering, which, as mentioned earlier, plays a role in how the PEDS is defined. However, the case $\boldsymbol{\Omega}=\boldsymbol{\Omega}_{1}$ still makes it possible to carry out an almost entirely analytical derivation, although some results can be proved for a more general projector structure.

Let us focus on the following PEDS for an mm-dimensional target system

𝒪={{f1(x),,fm(x)},𝛀1,{𝑸1(𝛀1)X,,𝑸m(𝛀1)X},1,N}.\mathcal{O}=\{\{f_{1}(\vec{x}),\cdots,f_{m}(\vec{x})\},\boldsymbol{\Omega}_{1},-\{\boldsymbol{Q}_{1}(\boldsymbol{\Omega}_{1})\vec{X},\cdots,\boldsymbol{Q}_{m}(\boldsymbol{\Omega}_{1})\vec{X}\},\vec{1},N\}.

Therefore we consider an extended system as in (14) characterized by the mean field projector and any eligible decay functions.

Similarly to the scalar case, we also consider here the two generalizations 𝑸\boldsymbol{Q} to the standard decay functions defined in (49). The corresponding PEDS take the form 𝒪A=({fi(x)},𝛀1,{𝑫(𝑰𝛀1)Xi},S,1,N)\mathcal{O}_{A}=(\{f_{i}(x)\},\boldsymbol{\Omega}_{1},\{-\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}_{i}\},S,\vec{1},N) and 𝒪B=({fi(x)},𝛀1,{(𝑰𝛀1)𝑫(𝑰𝛀1)Xi},S,1,N)\mathcal{O}_{B}=(\{f_{i}(x)\},\boldsymbol{\Omega}_{1},\{-(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\boldsymbol{D}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}_{i}\},S,\vec{1},N), which in terms of PEDS equations, read

dXidt=𝛀1𝑭i(X1,,Xm)1𝑸i(𝛀1)Xi\frac{d\vec{X}_{i}}{dt}=\boldsymbol{\Omega}_{1}\boldsymbol{F}_{i}(\vec{X}_{1},\dots,\vec{X}_{m})\vec{1}-\boldsymbol{Q}_{i}(\boldsymbol{\Omega}_{1})\vec{X}_{i} (100)

where

𝑸i(𝛀1)={𝑫i(𝑰𝛀1)generalization A(𝑰𝛀1)𝑫i(𝑰𝛀1)generalization B\boldsymbol{Q}_{i}(\boldsymbol{\Omega}_{1})=\begin{cases}\boldsymbol{D}_{i}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})&\text{generalization A}\\[4.30554pt] (\boldsymbol{I}-\boldsymbol{\Omega}_{1})\boldsymbol{D}_{i}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})&\text{generalization B}\end{cases} (101)

where the $\boldsymbol{D}_{i}$ are positive diagonal matrices that make the corresponding decay function $\boldsymbol{\Omega}_{1}$-eligible.

For any eligible decay function, we can obtain the vector target system Jacobian for the PEDS system as follows. Let us consider first the standard commuting map. We use the representation in (31), so that

Fi,k(c)(X1,,Xm)=fi(X1,k,,Xm,k)bk=fi(X1,k,,Xm,k)\displaystyle F^{(c)}_{i,k}(\vec{X}_{1},\dots,\vec{X}_{m})=f_{i}(X_{1,k},\dots,X_{m,k})b_{k}=f_{i}(X_{1,k},\dots,X_{m,k}) (102)

as b=1\vec{b}=\vec{1}. We evaluate the Jacobian in blocks, starting from component 1 as in the scalar case

(𝑱ij(c,1)({Xi}))ks=r=1N𝛀1,krfi(X1,r,,Xm,r)Xj,s=𝛀1,ksfi(X1,s,,Xm,s)Xj,s\displaystyle(\boldsymbol{J}_{ij}^{(c,1)}(\{\vec{X}_{i}\}))_{ks}=\sum_{r=1}^{N}\boldsymbol{\Omega}_{1,kr}\frac{\partial f_{i}(X_{1,r},\dots,X_{m,r})}{\partial X_{j,s}}=\boldsymbol{\Omega}_{1,ks}\frac{\partial f_{i}(X_{1,s},\cdots,X_{m,s})}{\partial X_{j,s}} (103)

thus, at the fixed point Xi=xi1\vec{X}_{i}^{*}=x_{i}^{*}\vec{1} (see the multivariate banality of the mean Corollary 3.7) we find

𝑱ij(c)({Xi})=𝛀1fi,xj(x)δij𝑸i(𝛀1).\displaystyle\boldsymbol{J}_{ij}^{(c)}(\{\vec{X}_{i}^{*}\})=\boldsymbol{\Omega}_{1}f^{\prime}_{i,x_{j}}(\vec{x}^{*})-\delta_{ij}\boldsymbol{Q}_{i}(\boldsymbol{\Omega}_{1}). (104)

where fi,xj=fi(x)/xjf^{\prime}_{i,x_{j}}=\partial f_{i}(\vec{x})/\partial x_{j}. In other words, the Jacobian in the equilibria can be built as

\begin{pmatrix}f^{\prime}_{1,x_{1}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}-\boldsymbol{Q}_{1}(\boldsymbol{\Omega}_{1})&f^{\prime}_{1,x_{2}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}&\cdots&f^{\prime}_{1,x_{m}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}\\ f^{\prime}_{2,x_{1}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}&f^{\prime}_{2,x_{2}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}-\boldsymbol{Q}_{2}(\boldsymbol{\Omega}_{1})&\cdots&\vdots\\ \vdots&\vdots&\vdots&\vdots\\ f^{\prime}_{m,x_{1}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}&\cdots&\cdots&f^{\prime}_{m,x_{m}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}-\boldsymbol{Q}_{m}(\boldsymbol{\Omega}_{1})\end{pmatrix} (105)

where each block is of size N×NN\times N.
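This block structure can be verified against a finite-difference Jacobian of the embedding equations. The sketch below is our own illustration: the two-dimensional target system f₁ = y − x³, f₂ = x − y with fixed point (1, 1), and the standard decay −α(I − Ω₁)X⃗ᵢ, are our choices. The numerical Jacobian of the commutative-map PEDS at the embedded fixed point matches the block structure, with f′_{i,x_j}(x⃗*) Ω₁ in the off-diagonal blocks and f′_{i,x_i}(x⃗*) Ω₁ − α(I − Ω₁) on the diagonal ones.

```python
import numpy as np

N, alpha = 5, 1.5
Omega1 = np.ones((N, N)) / N
I = np.eye(N)

# 2-D target system with fixed point (x*, y*) = (1, 1)  (illustrative choice)
def f1(x, y): return y - x**3
def f2(x, y): return x - y

def peds_field(Z):
    # commutative-map PEDS: dX_i/dt = Omega1 f_i(X_1, X_2) - alpha (I - Omega1) X_i
    X, Y = Z[:N], Z[N:]
    dX = Omega1 @ f1(X, Y) - alpha * (I - Omega1) @ X
    dY = Omega1 @ f2(X, Y) - alpha * (I - Omega1) @ Y
    return np.concatenate([dX, dY])

Z_star = np.concatenate([np.ones(N), np.ones(N)])   # X* = x* 1, Y* = y* 1

# central finite-difference Jacobian at the embedded fixed point
eps = 1e-6
J_num = np.column_stack([
    (peds_field(Z_star + eps * e) - peds_field(Z_star - eps * e)) / (2 * eps)
    for e in np.eye(2 * N)])

# block formula: f'_{i,xj}(x*) Omega1 blocks minus alpha (I - Omega1) on the diagonal
Jm = np.array([[-3.0, 1.0], [1.0, -1.0]])           # target Jacobian at (1, 1)
J_block = np.kron(Jm, Omega1) - alpha * np.kron(np.eye(2), I - Omega1)
print(np.max(np.abs(J_num - J_block)))               # agreement up to O(eps^2)
```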

Surprisingly, the same result holds also for the mixed and standard non-commutative maps. We start by proving this in the mixed commuting case, where ordering is immaterial.

Proposition 3.10.

Let $\mathcal{O}=(\{f_{1}(\vec{x}),\cdots,f_{m}(\vec{x})\},\boldsymbol{\Omega}_{1},\{-\boldsymbol{Q}_{i}(\boldsymbol{\Omega}_{1})\vec{X}_{i}\},S,\vec{1},N)$ be the PEDS built on a mixed commutative map, with the decay functions $-\boldsymbol{Q}_{i}(\boldsymbol{\Omega}_{1})\vec{X}_{i}$ assumed to be $\boldsymbol{\Omega}_{1}$-eligible. Then, for any ordering $S$, the Jacobian matrix, evaluated at the equilibrium $\vec{X}_{i}^{*}=x_{i}^{*}\vec{1}$, where $\vec{x}^{*}$ is an equilibrium of the target system, is given by

𝑱ij(mc)({Xi})=𝛀1fi,xj(x)δij𝑸i(𝛀1).\displaystyle\boldsymbol{J}_{ij}^{(mc)}(\{\vec{X}_{i}^{*}\})=\boldsymbol{\Omega}_{1}f^{\prime}_{i,x_{j}}(\vec{x}^{*})-\delta_{ij}\boldsymbol{Q}_{i}(\boldsymbol{\Omega}_{1}). (106)
Proof.

We aim at evaluating the Jacobian of the following map (as usual, we use a general projection operator wherever possible):

𝑭i(mc)(𝑿1,,𝑿m)=ai,0𝑰+k=1j1,,jmkak,i;j1,,jm(𝛀(𝑿1j1𝑿mjm)1/k)k.\displaystyle\boldsymbol{F}_{i}^{(mc)}(\boldsymbol{X}_{1},\cdots,\boldsymbol{X}_{m})=a_{i,0}\boldsymbol{I}+\sum_{k=1}^{\infty}\sum_{j_{1},\cdots,j_{m}}^{k}a_{k,i;j_{1},\cdots,j_{m}}(\boldsymbol{\Omega}(\boldsymbol{X}_{1}^{j_{1}}\cdots\boldsymbol{X}_{m}^{j_{m}})^{1/k})^{k}. (107)

as it is needed for the estimation of the first Jacobian component

(𝑱ij(mc,1)({Xi}))st\displaystyle(\boldsymbol{J}^{(mc,1)}_{ij}(\{\vec{X}_{i}\}))_{st} =\displaystyle= (𝑭i(mc)(𝑿1,,𝑿m)1)sXj,t\displaystyle\frac{\partial(\boldsymbol{F}_{i}^{(mc)}(\boldsymbol{X}_{1},\cdots,\boldsymbol{X}_{m})\vec{1})_{s}}{\partial X_{j,t}} (108)
=\displaystyle= k=1j1,,jmkak,i;j1,,jm((𝛀(𝑿1j1𝑿mjm)1/k)kXj,t1)s.\displaystyle\sum_{k=1}^{\infty}\sum_{j_{1},\cdots,j_{m}}^{k}a_{k,i;j_{1},\cdots,j_{m}}\left(\frac{\partial(\boldsymbol{\Omega}(\boldsymbol{X}_{1}^{j_{1}}\cdots\boldsymbol{X}_{m}^{j_{m}})^{1/k})^{k}}{\partial X_{j,t}}\vec{1}\right)_{s}.

Therefore, we need to evaluate

(𝛀(𝑿1j1𝑿mjm)1/k)kXj,t\displaystyle\frac{\partial(\boldsymbol{\Omega}(\boldsymbol{X}_{1}^{j_{1}}\cdots\boldsymbol{X}_{m}^{j_{m}})^{1/k})^{k}}{\partial X_{j,t}} (109)

We exploit the identity

𝑩kX=z=0k1𝑩k1z𝑩X𝑩zk1\displaystyle\frac{\partial\boldsymbol{B}^{k}}{\partial X}=\sum_{z=0}^{k-1}\boldsymbol{B}^{k-1-z}\frac{\partial\boldsymbol{B}}{\partial X}\boldsymbol{B}^{z}\qquad k\geq 1 (110)

valid for any matrix 𝑩\boldsymbol{B}, obtaining

(𝛀(𝑿1j1𝑿mjm)1/k)kXj,t\displaystyle\frac{\partial(\boldsymbol{\Omega}(\boldsymbol{X}_{1}^{j_{1}}\cdots\boldsymbol{X}_{m}^{j_{m}})^{1/k})^{k}}{\partial X_{j,t}} =\displaystyle= z=0k1(𝛀(𝑿1j1𝑿mjm)1/k)k1z(𝛀((𝑿1j1𝑿mjm)1/k)Xj,t)\displaystyle\sum_{z=0}^{k-1}\Big{(}\boldsymbol{\Omega}(\boldsymbol{X}_{1}^{j_{1}}\cdots\boldsymbol{X}_{m}^{j_{m}})^{1/k}\Big{)}^{k-1-z}\left(\boldsymbol{\Omega}\frac{\partial\big{(}(\boldsymbol{X}_{1}^{j_{1}}\cdots\boldsymbol{X}_{m}^{j_{m}})^{1/k}\big{)}}{\partial X_{j,t}}\right) (111)
×(𝛀(𝑿1j1𝑿mjm)1/k)z\displaystyle\ \ \ \ \ \times\Big{(}\boldsymbol{\Omega}(\boldsymbol{X}_{1}^{j_{1}}\cdots\boldsymbol{X}_{m}^{j_{m}})^{1/k}\Big{)}^{z}

where, since 𝑿i\boldsymbol{X}_{i} are diagonal, we have

(((𝑿1j1𝑿mjm)1/k)Xj,t)pq=δpqδpt(rjXr,tjr/k)jjkXj,tjj/k1.\left(\frac{\partial\big{(}(\boldsymbol{X}_{1}^{j_{1}}\cdots\boldsymbol{X}_{m}^{j_{m}})^{1/k}\big{)}}{\partial X_{j,t}}\right)_{pq}=\delta_{pq}\delta_{pt}\left(\prod_{r\neq j}X_{r,t}^{j_{r}/k}\right)\frac{j_{j}}{k}X_{j,t}^{j_{j}/k-1}.

Now, taking into account that 𝛀=𝛀1\boldsymbol{\Omega}=\boldsymbol{\Omega}_{1} and that the fixed points are defined by Xi=xi1\vec{X}_{i}^{*}=x_{i}^{*}\vec{1} (because of the multivariate banality of the mean Corollary 3.7), we find

((𝛀𝟏(𝑿1α1𝑿mαm)1/η)β)ij|{Xi}\displaystyle\left.\big{(}(\boldsymbol{\Omega_{1}}(\boldsymbol{X}_{1}^{\alpha_{1}}\cdots\boldsymbol{X}_{m}^{\alpha_{m}})^{1/\eta})^{\beta}\big{)}_{ij}\right|_{\{\vec{X}_{i}^{*}\}} =\displaystyle= Ω1,ijl=1m(xl)βηαl\displaystyle{\Omega}_{1,ij}\prod_{l=1}^{m}(x_{l}^{*})^{\frac{\beta}{\eta}\alpha_{l}} (112)
δpqδpt(rjXr,tjr/k)jjkXj,tjj/k1|{Xi}\displaystyle\left.\delta_{pq}\delta_{pt}\left(\prod_{r\neq j}X_{r,t}^{j_{r}/k}\right)\frac{j_{j}}{k}X_{j,t}^{j_{j}/k-1}\right|_{\{\vec{X}_{i}^{*}\}} =\displaystyle= δpqδpt(rj(xr)jr/k)jjk(xj)jj/k1\displaystyle\delta_{pq}\delta_{pt}\left(\prod_{r\neq j}(x_{r}^{*})^{j_{r}/k}\right)\frac{j_{j}}{k}(x_{j}^{*})^{j_{j}/k-1} (113)

thus, substituting into (111)

((𝛀1(𝑿1j1𝑿mjm)1/k)kXj,t)pq|{Xi}=b,c,d=1Nz=0k1((𝛀𝟏(𝑿1j1𝑿mjm)1/k)k1z)pb|{Xi}\displaystyle\left.\left(\frac{\partial(\boldsymbol{\Omega}_{1}(\boldsymbol{X}_{1}^{j_{1}}\cdots\boldsymbol{X}_{m}^{j_{m}})^{1/k})^{k}}{\partial X_{j,t}}\right)_{pq}\right|_{\{\vec{X}_{i}^{*}\}}=\sum_{b,c,d=1}^{N}\sum_{z=0}^{k-1}\left.\big{(}(\boldsymbol{\Omega_{1}}(\boldsymbol{X}_{1}^{j_{1}}\cdots\boldsymbol{X}_{m}^{j_{m}})^{1/k})^{k-1-z}\big{)}_{pb}\right|_{\{\vec{X}_{i}^{*}\}}
×(Ω1,bc((𝛀1(𝑿1j1𝑿mjm)1/k)cdXj,t|{Xi})((𝛀𝟏(𝑿1j1𝑿mjm)1/k)z)dq|{Xi}\displaystyle\quad\times\left.\left(\Omega_{1,bc}\frac{\partial\left((\boldsymbol{\Omega}_{1}(\boldsymbol{X}_{1}^{j_{1}}\cdots\boldsymbol{X}_{m}^{j_{m}})^{1/k}\right)_{cd}}{\partial X_{j,t}}\right|_{\{\vec{X}_{i}^{*}\}}\right)\left.\big{(}(\boldsymbol{\Omega_{1}}(\boldsymbol{X}_{1}^{j_{1}}\cdots\boldsymbol{X}_{m}^{j_{m}})^{1/k})^{z}\big{)}_{dq}\right|_{\{\vec{X}_{i}^{*}\}}
=jj(xj)jj1rj(xr)jrb,c,d=1NΩ1,pbΩ1,bcδcdδctΩ1,dq\displaystyle\quad=j_{j}(x_{j}^{*})^{j_{j}-1}\prod_{r\neq j}(x_{r}^{*})^{j_{r}}\sum_{b,c,d=1}^{N}{\Omega}_{1,pb}{\Omega}_{1,bc}\delta_{cd}\delta_{ct}{\Omega}_{1,dq}
=jj(xj)jj1rj(xr)jrΩ1,ptΩ1,tq\displaystyle\quad=j_{j}(x_{j}^{*})^{j_{j}-1}\prod_{r\neq j}(x_{r}^{*})^{j_{r}}\Omega_{1,pt}\Omega_{1,tq} (114)

where each element of 𝛀1\boldsymbol{\Omega}_{1} is equal to 1/N1/N. Substituting into (108) we recognize the series expansion of fi,xj(x)f^{\prime}_{i,x_{j}}(\vec{x}^{*}). This leads to the final expression

(𝑱ij(mc,1)({Xi}))st=fi,xj(x)Ω1,st.(\boldsymbol{J}^{(mc,1)}_{ij}(\{\vec{X}_{i}^{*}\}))_{st}=f^{\prime}_{i,x_{j}}(\vec{x}^{*})\Omega_{1,st}. (115)

Taking into account the second Jacobian component, i.e. the Jacobian of the decay functions, finally yields the result to be proved

𝑱ij(mc)({Xi})=fi,xj(x)𝛀1δij𝑸i(𝛀1).\boldsymbol{J}^{(mc)}_{ij}(\{\vec{X}_{i}^{*}\})=f^{\prime}_{i,x_{j}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}-\delta_{ij}\boldsymbol{Q}_{i}(\boldsymbol{\Omega}_{1}). (116)

Finally, let us turn to the non-commutative map Jacobian, for which we have the following result.

Proposition 3.11.

Let $\mathcal{O}=(\{f_{1}(\vec{x}),\cdots,f_{m}(\vec{x})\},\boldsymbol{\Omega}_{1},\{-\boldsymbol{Q}_{i}(\boldsymbol{\Omega}_{1})\vec{X}_{i}\},S,\vec{1},N)$ be the PEDS built on a non-commutative map, with the decay functions $-\boldsymbol{Q}_{i}(\boldsymbol{\Omega}_{1})\vec{X}_{i}$ assumed to be $\boldsymbol{\Omega}_{1}$-eligible. Then, for any ordering $S$, the Jacobian matrix, evaluated at the equilibrium $\vec{X}_{i}^{*}=x_{i}^{*}\vec{1}$, is given by

𝑱ij(nc)({Xi})=𝛀1fi,xj(x)δij𝑸i(𝛀1).\displaystyle\boldsymbol{J}_{ij}^{(nc)}(\{\vec{X}_{i}^{*}\})=\boldsymbol{\Omega}_{1}f^{\prime}_{i,x_{j}}(\vec{x}^{*})-\delta_{ij}\boldsymbol{Q}_{i}(\boldsymbol{\Omega}_{1}). (117)
Proof.

As we aim at demonstrating the independence of the Jacobian from the ordering SS (at the fixed points), we consider the formalism introduced in Section 2.1. Given the matrix map definition (26), we use the general form

𝑭s(X)=k=0i1,,im=0kas,k;i1,,im{(𝛀𝑿1)i1(𝛀𝑿m)im}S\boldsymbol{F}_{s}(\vec{X})=\sum_{k=0}^{\infty}\sum_{i_{1},\cdots,i_{m}=0}^{k}a_{s,k;i_{1},\cdots,i_{m}}\{(\boldsymbol{\Omega}\boldsymbol{X}_{1})^{i_{1}}\cdots(\boldsymbol{\Omega}\boldsymbol{X}_{m})^{i_{m}}\}_{S} (118)

that forms the basis for the evaluation of the first Jacobian component. Writing, to simplify the notation, oσ(s1)σ(sm)=oσo_{\sigma(s_{1})\cdots\sigma(s_{m})}=o_{\vec{\sigma}}, we compute the derivatives

Xa,b{(𝛀𝑿1)i1(𝛀𝑿m)im}S\displaystyle\frac{\partial\phantom{X_{a,b}}}{\partial X_{a,b}}\{(\boldsymbol{\Omega}\boldsymbol{X}_{1})^{i_{1}}\cdots(\boldsymbol{\Omega}\boldsymbol{X}_{m})^{i_{m}}\}_{S}
=σ𝒮(m)oσXa,b((𝛀𝑿σ(s1))i1(𝛀𝑿σ(sm))im)\displaystyle\qquad=\sum_{\sigma\in\mathcal{S}(m)}o_{\vec{\sigma}}\frac{\partial\phantom{X_{a,b}}}{\partial X_{a,b}}\left((\boldsymbol{\Omega}\boldsymbol{X}_{\sigma(s_{1})})^{i_{1}}\cdots(\boldsymbol{\Omega}\boldsymbol{X}_{\sigma(s_{m})})^{i_{m}}\right)
=σ𝒮(m)oσ(Xa,b(𝛀𝑿σ(s1))i1)(𝛀𝑿σ(sm))im+\displaystyle\qquad=\sum_{\sigma\in\mathcal{S}(m)}o_{\vec{\sigma}}\left(\frac{\partial\phantom{X_{a,b}}}{\partial X_{a,b}}(\boldsymbol{\Omega}\boldsymbol{X}_{\sigma(s_{1})})^{i_{1}}\right)\cdots(\boldsymbol{\Omega}\boldsymbol{X}_{\sigma(s_{m})})^{i_{m}}+\cdots
+σ𝒮(m)oσ(𝛀𝑿σ(s1))i1(Xa,b(𝛀𝑿σ(sm))im)\displaystyle\qquad\quad+\sum_{\sigma\in\mathcal{S}(m)}o_{\vec{\sigma}}(\boldsymbol{\Omega}\boldsymbol{X}_{\sigma(s_{1})})^{i_{1}}\cdots\left(\frac{\partial\phantom{X_{a,b}}}{\partial X_{a,b}}(\boldsymbol{\Omega}\boldsymbol{X}_{\sigma(s_{m})})^{i_{m}}\right)
=σ𝒮(m)oσδa,σ(s1)(Xσ(s1),b(𝛀𝑿σ(s1))i1)(𝛀𝑿σ(sm))im+\displaystyle\qquad=\sum_{\sigma\in\mathcal{S}(m)}o_{\vec{\sigma}}\delta_{a,\sigma(s_{1})}\left(\frac{\partial\phantom{X_{\sigma(s_{1}),b}}}{\partial X_{\sigma(s_{1}),b}}(\boldsymbol{\Omega}\boldsymbol{X}_{\sigma(s_{1})})^{i_{1}}\right)\cdots(\boldsymbol{\Omega}\boldsymbol{X}_{\sigma(s_{m})})^{i_{m}}+\cdots
+σ𝒮(m)oσδa,σ(sm)(𝛀𝑿σ(s1))i1(Xσ(sm),b(𝛀𝑿σ(sm))im)\displaystyle\qquad\quad+\sum_{\sigma\in\mathcal{S}(m)}o_{\vec{\sigma}}\delta_{a,\sigma(s_{m})}(\boldsymbol{\Omega}\boldsymbol{X}_{\sigma(s_{1})})^{i_{1}}\cdots\left(\frac{\partial\phantom{X_{\sigma(s_{m}),b}}}{\partial X_{\sigma(s_{m}),b}}(\boldsymbol{\Omega}\boldsymbol{X}_{\sigma(s_{m})})^{i_{m}}\right) (119)

Applying identity (110) to matrix (𝛀𝑿sk)ik(\boldsymbol{\Omega}\boldsymbol{X}_{s_{k}})^{i_{k}} we find

Xsk,b((𝛀𝑿sk)ik)ij=t=0ik1l=1N((𝛀𝑿sk)t)ilΩlb((𝛀𝑿sk)ik1t)bj,\frac{\partial\phantom{X_{s_{k},b}}}{\partial X_{s_{k},b}}\left((\boldsymbol{\Omega}\boldsymbol{X}_{s_{k}})^{i_{k}}\right)_{ij}=\sum_{t=0}^{i_{k}-1}\sum_{l=1}^{N}\left((\boldsymbol{\Omega}\boldsymbol{X}_{s_{k}})^{t}\right)_{il}{\Omega}_{lb}\left((\boldsymbol{\Omega}\boldsymbol{X}_{s_{k}})^{i_{k}-1-t}\right)_{bj}, (120)

At this point, the multivariate banality of the mean Corollary 3.7 proves that the PEDS fixed points are given by 𝑿sk=xsk𝑰\boldsymbol{X}_{s_{k}}^{*}=x_{s_{k}}^{*}\boldsymbol{I}, therefore for the mean field projector 𝛀=𝛀1\boldsymbol{\Omega}=\boldsymbol{\Omega}_{1} we find

Xsk,b((𝛀1𝑿sk)ik)ij|{Xi}\displaystyle\left.\frac{\partial\phantom{X_{s_{k},b}}}{\partial X_{s_{k},b}}\left((\boldsymbol{\Omega}_{1}\boldsymbol{X}_{s_{k}})^{i_{k}}\right)_{ij}\right|_{\{\vec{X}_{i}^{*}\}} =t=0ik1(xsk)ik1s=1NΩ1,istΩ1,sbΩ1,bjik1t\displaystyle=\sum_{t=0}^{i_{k}-1}(x^{*}_{s_{k}})^{i_{k}-1}\sum_{s=1}^{N}\Omega_{1,is}^{t}\Omega_{1,sb}\Omega_{1,bj}^{i_{k}-1-t}
=ik(xsk)ik1Ω1,ibΩ1,bj\displaystyle=i_{k}(x^{*}_{s_{k}})^{i_{k}-1}\Omega_{1,ib}\Omega_{1,bj} (121)

Substituting this expression into the derivative of the ordered product, yields

Xa,b{(𝛀1𝑿1)i1(𝛀1𝑿m)im}S|{Xi}\displaystyle\left.\frac{\partial\phantom{X_{a,b}}}{\partial X_{a,b}}\{(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{1})^{i_{1}}\cdots(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{m})^{i_{m}}\}_{S}\right|_{\{\vec{X}_{i}^{*}\}}
=σ𝒮(m)oσδa,σ(s1)(i1(xσ(s1))i11(𝛀1):b(𝛀1)b:)\displaystyle\qquad=\sum_{\sigma\in\mathcal{S}(m)}o_{\vec{\sigma}}\delta_{a,\sigma(s_{1})}\Big{(}i_{1}(x^{*}_{\sigma(s_{1})})^{i_{1}-1}(\boldsymbol{\Omega}_{1})_{:b}(\boldsymbol{\Omega}_{1})_{b:}\Big{)}
×(xσ(s2))i2𝛀1i2(xσ(sm))im𝛀1im+\displaystyle\qquad\qquad\times(x^{*}_{\sigma(s_{2})})^{i_{2}}\boldsymbol{\Omega}_{1}^{i_{2}}\cdots(x^{*}_{\sigma(s_{m})})^{i_{m}}\boldsymbol{\Omega}_{1}^{i_{m}}+\cdots
+σ𝒮(m)oσδa,σ(sm)(xσ(s1))i1𝛀1i1(xσ(s2))i2𝛀1i2\displaystyle\qquad\quad+\sum_{\sigma\in\mathcal{S}(m)}o_{\vec{\sigma}}\delta_{a,\sigma(s_{m})}(x^{*}_{\sigma(s_{1})})^{i_{1}}\boldsymbol{\Omega}_{1}^{i_{1}}(x^{*}_{\sigma(s_{2})})^{i_{2}}\boldsymbol{\Omega}_{1}^{i_{2}}\cdots
×(xσ(sm1))im1𝛀1im1(im(xσ(sm))im1(𝛀1):b(𝛀1)b:)\displaystyle\qquad\qquad\times(x^{*}_{\sigma(s_{m-1})})^{i_{m-1}}\boldsymbol{\Omega}_{1}^{i_{m-1}}\Big{(}i_{m}(x^{*}_{\sigma(s_{m})})^{i_{m}-1}(\boldsymbol{\Omega}_{1})_{:b}(\boldsymbol{\Omega}_{1})_{b:}\Big{)}
=σ𝒮(m)oσδa,σ(s1)i1(xσ(s1))i11(xσ(s2))i2(xσ(sm))im(𝛀1):b(𝛀1)b:+\displaystyle\qquad=\sum_{\sigma\in\mathcal{S}(m)}o_{\vec{\sigma}}\delta_{a,\sigma(s_{1})}i_{1}(x^{*}_{\sigma(s_{1})})^{i_{1}-1}(x^{*}_{\sigma(s_{2})})^{i_{2}}\cdots(x^{*}_{\sigma(s_{m})})^{i_{m}}(\boldsymbol{\Omega}_{1})_{:b}(\boldsymbol{\Omega}_{1})_{b:}+\cdots
+σ𝒮(m)oσδa,σ(sm)im(xσ(s1))i1(xσ(s2))i2(xσ(sm))im1(𝛀1):b(𝛀1)b:.\displaystyle\qquad\quad+\sum_{\sigma\in\mathcal{S}(m)}o_{\vec{\sigma}}\delta_{a,\sigma(s_{m})}i_{m}(x^{*}_{\sigma(s_{1})})^{i_{1}}(x^{*}_{\sigma(s_{2})})^{i_{2}}\cdots(x^{*}_{\sigma(s_{m})})^{i_{m}-1}(\boldsymbol{\Omega}_{1})_{:b}(\boldsymbol{\Omega}_{1})_{b:}. (122)

where (𝛀1):b(\boldsymbol{\Omega}_{1})_{:b} and (𝛀1)b:(\boldsymbol{\Omega}_{1})_{b:} are vectors made of the bb-th column and row of 𝛀1\boldsymbol{\Omega}_{1}, respectively. As the matrix products all collapse into the same quantity, the previous expression is independent of the ordering SS. Therefore, we can write

Xa,b{(𝛀1𝑿1)i1(𝛀1𝑿m)im}S|{Xi}=(xaj=1m(xj)ij)(𝛀1):b(𝛀1)b:.\left.\frac{\partial\phantom{X_{a,b}}}{\partial X_{a,b}}\{(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{1})^{i_{1}}\cdots(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{m})^{i_{m}}\}_{S}\right|_{\{\vec{X}_{i}^{*}\}}=\left(\frac{\partial\phantom{x_{a}}}{\partial x_{a}}\prod_{j=1}^{m}(x_{j}^{*})^{i_{j}}\right)(\boldsymbol{\Omega}_{1})_{:b}(\boldsymbol{\Omega}_{1})_{b:}. (123)

This means that, for any ordering $S$, summing at the fixed point the terms for the derivative with respect to the elements of each extended variable $\vec{X}_{a}$ leads, once the factor $\vec{1}$ in (100) is taken into account, to a scalar factor $f^{\prime}_{i,x_{a}}(\vec{x}^{*})$, i.e. the corresponding element of the target system Jacobian, multiplied by the matrix $\boldsymbol{\Omega}_{1}$.

Concerning the second part of the Jacobian, i.e. the derivatives of $-\boldsymbol{Q}_{i}(\boldsymbol{\Omega}_{1})\vec{X}_{i}$, the result is a block diagonal matrix of the type $\text{diag}\{-\boldsymbol{Q}_{i}(\boldsymbol{\Omega}_{1})\}$.

In summary, even in this case the full Jacobian at the fixed points follows the block structure claimed in the proposition. ∎

We are now ready to discuss the Jacobian spectral properties irrespective of the chosen map, as the matrix is the same for all of the three maps that we consider. For the sake of simplicity, we limit the discussion to the standard decay functions $\boldsymbol{Q}_{i}(\boldsymbol{\Omega}_{1})=\alpha_{i}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})$, so that

\boldsymbol{J}(\{\vec{X}_{i}^{*}\}) =\begin{pmatrix}f^{\prime}_{1,x_{1}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}&f^{\prime}_{1,x_{2}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}&\cdots&\cdots&f^{\prime}_{1,x_{m}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}\\ f^{\prime}_{2,x_{1}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}&f^{\prime}_{2,x_{2}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}&f^{\prime}_{2,x_{3}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}&\cdots&f^{\prime}_{2,x_{m}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}\\ \vdots&\vdots&\ddots&\vdots&\vdots\\ f^{\prime}_{m,x_{1}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}&\cdots&\cdots&\cdots&f^{\prime}_{m,x_{m}}(\vec{x}^{*})\boldsymbol{\Omega}_{1}\end{pmatrix}
-\begin{pmatrix}\alpha_{1}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})&0&\cdots&\cdots&0\\ 0&\alpha_{2}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})&&\cdots&\vdots\\ \vdots&\vdots&\ddots&\vdots&0\\ 0&\cdots&\cdots&0&\alpha_{m}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\end{pmatrix} (124)

that can be cast in the following form

𝑱({Xi})=(𝑱m(x)+𝑫α1)𝛀1𝑫αN\boldsymbol{J}(\{\vec{X}_{i}^{*}\})=(\boldsymbol{J}_{m}(\vec{x}^{*})+\boldsymbol{D}^{1}_{\alpha})\otimes\boldsymbol{\Omega}_{1}-\boldsymbol{D}^{N}_{\alpha} (125)

where $\boldsymbol{J}_{m}(\vec{x}^{*})$ is the Jacobian of the target system functions $\{f_{i}(\vec{x})\}$ evaluated at the target system equilibrium $\vec{x}^{*}$, $\otimes$ denotes the matrix Kronecker product (according to the definition, the $i,j$ block of the Kronecker product $A\otimes B$ is $a_{ij}B$), and

𝑫αk=diag(α1,,α1k times,α2,,α2k times,,αm,,αmk times).\boldsymbol{D}^{k}_{\alpha}=\text{diag}(\underbrace{\alpha_{1},\cdots,\alpha_{1}}_{\text{$k$ times}},\underbrace{\alpha_{2},\cdots,\alpha_{2}}_{\text{$k$ times}},\cdots,\underbrace{\alpha_{m},\cdots,\alpha_{m}}_{\text{$k$ times}}).

The Jacobian (125) is a generalization of the scalar case (see Proposition 3.8). We are interested in assessing the properties of its eigenvalues.

For the time being, we discuss the simpler case $\alpha_{i}\equiv\alpha$, and since it will be useful later, let us think of this Jacobian for a general $\boldsymbol{\Omega}$, only to then consider $\boldsymbol{\Omega}_{1}$ as a special case. (To motivate this generalized discussion, we briefly anticipate the result of an upcoming paper, in which we show that (126) is in fact the first term of the representation obtained for the Jacobian of a general projector $\boldsymbol{\Omega}$; this general case is, however, beyond the scope of this paper.) Let us therefore discuss the spectrum of

𝑱({Xi})=(𝑱m(x)+α𝑰m)𝛀α𝑰Nm.\displaystyle\boldsymbol{J}(\{\vec{X}_{i}^{*}\})=(\boldsymbol{J}_{m}(\vec{x}^{*})+\alpha\boldsymbol{I}_{m})\otimes\boldsymbol{\Omega}-\alpha\boldsymbol{I}_{Nm}. (126)

where $\boldsymbol{I}_{q}$ is the identity matrix of size $q\times q$. Denoting by $\lambda_{i}$ the eigenvalues of $\boldsymbol{J}_{m}(\vec{x}^{*})$, the eigenvalues of $\boldsymbol{J}_{m}(\vec{x}^{*})+\alpha\boldsymbol{I}_{m}$ are given by $\lambda_{i}+\alpha$, while $-\alpha\boldsymbol{I}_{Nm}$ contributes a uniform shift $-\alpha$. Let us assume that $\boldsymbol{\Omega}$ has $k$ unit eigenvalues ($k=1$ for $\boldsymbol{\Omega}=\boldsymbol{\Omega}_{1}$), while the remaining $N-k$ eigenvalues are equal to $0$. Then the matrix $\boldsymbol{J}(\{\vec{X}_{i}^{*}\})$ has eigenvalues [27]

  1. λi\lambda_{i}, 1im1\leq i\leq m, each with multiplicity kk;

  2. α-\alpha, with multiplicity m(Nk)m(N-k).

As a consequence, if λi<0\lambda_{i}<0 i\forall i, then a stable equilibrium point of the target system remains stable in the PEDS embedding. Similarly, if the equilibrium point is unstable, or more generally if at least some ii exist for which λi>0\lambda_{i}>0, then it becomes a saddle point for the extended system, since its spectrum contains the m(Nk)m(N-k) negative eigenvalues α-\alpha together with the positive eigenvalues λi\lambda_{i}. Thus, the following classification holds

{Xi}={stableif x is stable,saddle pointif x is a saddle point,saddle pointif x is unstable.\displaystyle\{\vec{X}^{*}_{i}\}=\begin{cases}\text{stable}&\text{if $\vec{x}^{*}$ is stable},\\ \text{saddle point}&\text{if $\vec{x}^{*}$ is a saddle point},\\ \text{saddle point}&\text{if $\vec{x}^{*}$ is unstable}.\end{cases} (127)

This analysis suggests that the presence of “barriers” in the target system, characterized by unstable equilibria, can (in principle) be overcome in the PEDS embedding via their transformation into saddle points in the extended system.
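This classification is easy to check numerically. The sketch below (an illustration with randomly generated matrices of our own choosing, not taken from the paper) builds the Jacobian (126) for a stable target Jacobian and the uniform mean field projector, and compares its spectrum with the prediction above:

```python
import numpy as np

# Spectrum of J = (J_m + alpha I_m) (x) Omega - alpha I_{Nm}, Eq. (126),
# with the uniform mean field projector Omega_1 = ones/N (so k = 1).
rng = np.random.default_rng(0)
m, N, alpha = 3, 5, 0.7

A = rng.standard_normal((m, m))
J_m = -(A @ A.T) - np.eye(m)      # a stable (negative definite) target Jacobian
Omega1 = np.ones((N, N)) / N      # rank-1 projector: eigenvalues {1, 0, ..., 0}

J = np.kron(J_m + alpha * np.eye(m), Omega1) - alpha * np.eye(m * N)

# Predicted spectrum: each lambda_i of J_m with multiplicity k = 1,
# plus -alpha with multiplicity m (N - k).
eig_J = np.sort(np.linalg.eigvals(J).real)
predicted = np.sort(np.concatenate([np.linalg.eigvals(J_m).real,
                                    -alpha * np.ones(m * (N - 1))]))
print(np.allclose(eig_J, predicted))  # True
```

The same comparison works for any projector with kk unit eigenvalues, since the Kronecker product multiplies the two spectra pairwise.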

3.5 Dynamical ordering-equivalence for the uniform mean field projector

The uniform mean field projector has various properties that are interesting per se. In particular, we show here that not only the fixed points but also the full embedding dynamics is independent of the variable ordering.

For this purpose, consider a PEDS of the form

𝒪r=({fi(x1,,xm)},𝛀𝟏,{𝑮i(𝛀1)Xi},Sr,1,N),\mathcal{O}_{r}=(\{f_{i}(x_{1},\cdots,x_{m})\},\boldsymbol{\Omega_{1}},\{\boldsymbol{G}_{i}(\boldsymbol{\Omega}_{1})\vec{X}_{i}\},S_{r},\vec{1},N),

with arbitrary 𝛀1\boldsymbol{\Omega}_{1}-eligible decay functions. Given the PEDS above, the standard non-commutative matrix embedding is given by

Fs=k=0i1,,imkas,k;i1im{(𝛀1𝑿1)i1(𝛀1𝑿m)im}Sr1.\displaystyle\vec{F}_{s}=\sum_{k=0}^{\infty}\sum_{i_{1},\cdots,i_{m}}^{k}a_{s,k;i_{1}\cdots i_{m}}\{(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{1})^{i_{1}}\cdots(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{m})^{i_{m}}\}_{S_{r}}\vec{1}. (128)

We prove the following

Proposition 3.12.

The quantity

{(𝛀1𝑿1)i1(𝛀1𝑿m)im}Sr1\{(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{1})^{i_{1}}\cdots(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{m})^{i_{m}}\}_{S_{r}}\vec{1}

is independent of the ordering SrS_{r}, for any i1,,imi_{1},\cdots,i_{m}.

Proof.

The proof relies on the following observation. We have in general that

𝛀1𝑿s𝛀1=Xs𝛀1.\displaystyle\boldsymbol{\Omega}_{1}\boldsymbol{X}_{s}\boldsymbol{\Omega}_{1}=\langle X_{s}\rangle\boldsymbol{\Omega}_{1}. (129)

with Xs=1Nj=1NXsj\langle X_{s}\rangle=\frac{1}{N}\sum_{j=1}^{N}X_{s}^{j}. The previous result can be easily shown as follows

(𝛀1𝑿s𝛀1)ij\displaystyle(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{s}\boldsymbol{\Omega}_{1})_{ij} =\displaystyle= k1,k2=1NΩ1,ik1Xs,k1k2Ω1,k2j\displaystyle\sum_{k_{1},k_{2}=1}^{N}\Omega_{1,ik_{1}}{X}_{s,k_{1}k_{2}}{\Omega}_{1,k_{2}j} (130)
=\displaystyle= 1N2k1,k2=1NXs,k1δk1k2\displaystyle\frac{1}{N^{2}}\sum_{k_{1},k_{2}=1}^{N}X_{s,k_{1}}\delta_{k_{1}k_{2}}
=\displaystyle= 1Nk1=1NXs,k11N=XsΩ1,ij\displaystyle\frac{1}{N}\sum_{k_{1}=1}^{N}X_{s,k_{1}}\frac{1}{N}=\langle X_{s}\rangle{\Omega}_{1,ij}

Because of (129), we can always write the following

(𝛀1𝑿σ(1))σ(i1)(𝛀1𝑿σ(m))σ(im)=fσ(i1),,σ(im)(Xσ(1),,Xσ(m))𝛀1𝑿σ(m).\displaystyle(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{\sigma(1)})^{\sigma(i_{1})}\cdots(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{\sigma(m)})^{\sigma(i_{m})}=f_{\sigma(i_{1}),\cdots,\sigma(i_{m})}(\langle{X}_{\sigma(1)}\rangle,\cdots,\langle{X}_{\sigma(m)}\rangle)\boldsymbol{\Omega}_{1}\boldsymbol{X}_{\sigma(m)}. (131)

where function ff is scalar. To gain an intuition about the scalar ff, consider for instance (𝛀1𝑿1)a(𝛀1𝑿2)b(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{1})^{a}(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{2})^{b}. Using (129), the previous expression can be written as

(𝛀1𝑿1)a(𝛀1𝑿2)b=X1a1X2b1𝛀1𝑿1𝛀1𝑿2=X1aX2b1𝛀1𝑿2,\displaystyle(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{1})^{a}(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{2})^{b}=\langle X_{1}\rangle^{a-1}\langle X_{2}\rangle^{b-1}\boldsymbol{\Omega}_{1}\boldsymbol{X}_{1}\boldsymbol{\Omega}_{1}\boldsymbol{X}_{2}=\langle X_{1}\rangle^{a}\langle X_{2}\rangle^{b-1}\boldsymbol{\Omega}_{1}\boldsymbol{X}_{2}, (132)

so that f12=X1aX2b1f_{12}=\langle X_{1}\rangle^{a}\langle X_{2}\rangle^{b-1}. At this point we have

{(𝛀1𝑿1)i1(𝛀1𝑿m)im}Sr1\displaystyle\{(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{1})^{i_{1}}\cdots(\boldsymbol{\Omega}_{1}\boldsymbol{X}_{m})^{i_{m}}\}_{S_{r}}\vec{1} =\displaystyle= σSmoσfσ(i1),,σ(im)(Xσ(1),,Xσ(m))𝛀1𝑿σ(m)1\displaystyle\sum_{\sigma\in S_{m}}o_{\vec{\sigma}}f_{\sigma(i_{1}),\cdots,\sigma(i_{m})}(\langle{X}_{\sigma(1)}\rangle,\cdots,\langle{X}_{\sigma(m)}\rangle)\boldsymbol{\Omega}_{1}\boldsymbol{X}_{\sigma(m)}\vec{1} (133)
=\displaystyle= σSmoσfσ(i1),,σ(im)(Xσ(1),,Xσ(m))Xσ(m)1\displaystyle\sum_{\sigma\in S_{m}}o_{\vec{\sigma}}f_{\sigma(i_{1}),\cdots,\sigma(i_{m})}(\langle{X}_{\sigma(1)}\rangle,\cdots,\langle{X}_{\sigma(m)}\rangle)\langle X_{\sigma(m)}\rangle\vec{1}
=\displaystyle= fi1,,im(X1,,Xm)Xm1\displaystyle f_{i_{1},\cdots,i_{m}}(\langle{X}_{1}\rangle,\cdots,\langle{X}_{m}\rangle)\langle X_{m}\rangle\vec{1}

which follows from the fact that the scalar variables Xj\langle X_{j}\rangle do commute. ∎
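Both the projection identity (129) and the resulting ordering independence can be verified directly; the sketch below uses illustrative sizes and random diagonal matrices of our own choosing:

```python
import numpy as np

# Check Eq. (129), Omega_1 X_s Omega_1 = <X_s> Omega_1, and the fact that a
# word in the factors (Omega_1 X_s), applied to the vector of ones, depends
# only on the means <X_s> and not on the order of the factors.
rng = np.random.default_rng(1)
N = 6
Omega1 = np.ones((N, N)) / N
ones = np.ones(N)

x1, x2 = rng.random(N), rng.random(N)
X1, X2 = np.diag(x1), np.diag(x2)
A, B = Omega1 @ X1, Omega1 @ X2   # these two matrices do NOT commute in general

# Eq. (129)
assert np.allclose(Omega1 @ X1 @ Omega1, x1.mean() * Omega1)

# Ordering independence on the vector of ones:
# A^2 B 1 = B A^2 1 = <X1>^2 <X2> 1
lhs = A @ A @ B @ ones
rhs = B @ A @ A @ ones
assert np.allclose(lhs, rhs)
assert np.allclose(lhs, x1.mean() ** 2 * x2.mean() * ones)
print("ordering independent:", np.allclose(lhs, rhs))
```

Note that the matrices themselves are order dependent; it is only their action on 1\vec{1} that reduces to commuting scalar means.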

Proposition 3.12 is important because it implies the following

Corollary 3.13.

Dynamical ordering independence for the uniform mean field projector. For any analytic functions fif_{i}, we have

𝒪r\displaystyle\mathcal{O}_{r} =\displaystyle= ({fi(x1,,xm)},𝛀𝟏,{𝑸𝒊(𝛀𝟏)Xi},Sr,1,N)\displaystyle(\{f_{i}(x_{1},\cdots,x_{m})\},\boldsymbol{\Omega_{1}},\{\boldsymbol{Q_{i}}(\boldsymbol{\Omega_{1}})\vec{X}_{i}\},S_{r},\vec{1},N) (134)
=\displaystyle= ({fi(x1,,xm)},𝛀𝟏,{𝑸𝒊(𝛀𝟏)Xi},1,N),\displaystyle(\{f_{i}(x_{1},\cdots,x_{m})\},\boldsymbol{\Omega_{1}},\{\boldsymbol{Q_{i}}(\boldsymbol{\Omega_{1}})\vec{X}_{i}\},\vec{1},N),

or, alternatively, the uniform mean field PEDS are ordering independent.

Proof.

The proof follows directly from the fact that any analytic function fi(x1,,xm)f_{i}(x_{1},\cdots,x_{m}) can be written in the form of a series expansion as in Proposition 3.12. ∎

In essence, this means that for any target dynamical system we can choose the most convenient ordering prescription without affecting the resulting embedding dynamics.

4 Numerical Examples

We present here some numerical examples of application of the PEDS procedure.

4.1 Implementation remarks

For the sake of implementation, it is convenient to define the PEDS transformation without having to evaluate the Taylor expansion used in the theoretical developments of the previous sections. This can easily be carried out for factorized vector target systems fi(x1,,xm)=k=1mfi,k(xk)f_{i}(x_{1},\cdots,x_{m})=\prod_{k=1}^{m}f_{i,k}(x_{k}) (or linear combinations of factorized terms of the same type). For such factorized target systems, the matrix map can be built from the functions 𝑭i,k(𝛀𝑿k)\boldsymbol{F}_{i,k}(\boldsymbol{\Omega}\boldsymbol{X}_{k}) (with 𝑭i(𝑿1,,𝑿m)=k=1m𝑭i,k(𝑿k)\boldsymbol{F}_{i}(\boldsymbol{X}_{1},\cdots,\boldsymbol{X}_{m})=\prod_{k=1}^{m}\boldsymbol{F}_{i,k}(\boldsymbol{X}_{k}), or linear combinations of similar expressions).

The question is therefore how to efficiently evaluate such matrix functions, defined as the Taylor expansion evaluated at the matrix 𝛀𝑿k\boldsymbol{\Omega}\boldsymbol{X}_{k}. We can write

(𝛀𝑿k)s=𝑿k1(𝑿k𝛀𝑿k)s𝑿k(\boldsymbol{\Omega}\boldsymbol{X}_{k})^{s}=\sqrt{\boldsymbol{X}_{k}}^{-1}\big{(}\sqrt{\boldsymbol{X}_{k}}\boldsymbol{\Omega}\sqrt{\boldsymbol{X}_{k}}\big{)}^{s}\sqrt{\boldsymbol{X}_{k}} (135)

where 𝑿k\sqrt{\boldsymbol{X}_{k}} always exists since 𝑿k\boldsymbol{X}_{k} is diagonal. Notice that (135) defines a similarity transformation, i.e. it conserves the spectrum of the similar matrices. An important point is to verify that, for 𝑿k\boldsymbol{X}_{k} diagonal, the spectrum of 𝑿k𝛀𝑿k\sqrt{\boldsymbol{X}_{k}}\boldsymbol{\Omega}\sqrt{\boldsymbol{X}_{k}} (and thus of 𝛀𝑿k\boldsymbol{\Omega}\boldsymbol{X}_{k}) is real. This can be shown using the fact that, based on the characteristic polynomial and on the determinant properties, the eigenvalue problem for 𝛀𝑿k\boldsymbol{\Omega}\boldsymbol{X}_{k} is equivalent to the generalized eigenvalue problem 𝛀vλ𝑿k1v=0\boldsymbol{\Omega}\vec{v}-\lambda\boldsymbol{X}_{k}^{-1}\vec{v}=\vec{0}, assuming 𝑿k\boldsymbol{X}_{k} invertible. Then, a proof similar to that of the spectral theorem shows that if 𝛀\boldsymbol{\Omega} and 𝑿k1\boldsymbol{X}_{k}^{-1} are symmetric and real, then Im(λ)=0\text{Im}(\lambda)=0. This guarantees that an extension to the complex field of the scalar target system functions fi,k(xk)f_{i,k}(x_{k}) is not required. In fact, (135) implies

𝑭i,k(𝛀𝑿k)=𝑿k1𝑭i,k(𝑿k𝛀𝑿k)𝑿k.\displaystyle\boldsymbol{F}_{i,k}(\boldsymbol{\Omega}\boldsymbol{X}_{k})=\sqrt{\boldsymbol{X}_{k}}^{-1}\boldsymbol{F}_{i,k}(\sqrt{\boldsymbol{X}_{k}}\boldsymbol{\Omega}\sqrt{\boldsymbol{X}_{k}})\sqrt{\boldsymbol{X}_{k}}. (136)

Since 𝑿k𝛀𝑿k\sqrt{\boldsymbol{X}_{k}}\boldsymbol{\Omega}\sqrt{\boldsymbol{X}_{k}} is symmetric, 𝑷xk\boldsymbol{P}_{x_{k}} exists such that

𝑿k𝛀𝑿k=𝑷xk𝚺xk𝑷xk1\sqrt{\boldsymbol{X}_{k}}\boldsymbol{\Omega}\sqrt{\boldsymbol{X}_{k}}=\boldsymbol{P}_{x_{k}}\boldsymbol{\Sigma}_{x_{k}}\boldsymbol{P}_{x_{k}}^{-1}

where the real matrix 𝚺xk=diag{σxk,1,,σxk,N}\boldsymbol{\Sigma}_{x_{k}}=\text{diag}\{\sigma_{x_{k},1},\cdots,\sigma_{x_{k},N}\} is made of the elements of the spectrum of 𝑿k𝛀𝑿k\sqrt{\boldsymbol{X}_{k}}\boldsymbol{\Omega}\sqrt{\boldsymbol{X}_{k}}. As a result, we can write

𝑭i,k(𝛀𝑿k)=𝑿k1𝑷xk𝑭i,k(𝚺xk)𝑷xk1𝑿k\displaystyle\boldsymbol{F}_{i,k}(\boldsymbol{\Omega}\boldsymbol{X}_{k})=\sqrt{\boldsymbol{X}_{k}}^{-1}\boldsymbol{P}_{x_{k}}\boldsymbol{F}_{i,k}(\boldsymbol{\Sigma}_{x_{k}})\boldsymbol{P}_{x_{k}}^{-1}\sqrt{\boldsymbol{X}_{k}} (137)

where

𝑭i,k(𝚺xk)=diag(fi,k(σxk,1),,fi,k(σxk,N)).\displaystyle\boldsymbol{F}_{i,k}(\boldsymbol{\Sigma}_{x_{k}})=\text{diag}\big{(}f_{i,k}(\sigma_{x_{k},1}),\cdots,f_{i,k}(\sigma_{x_{k},N})\big{)}. (138)

Thus, evaluating the matrix maps boils down to the knowledge of the eigenvalues and eigenvectors of 𝑿k𝛀𝑿k\sqrt{\boldsymbol{X}_{k}}\boldsymbol{\Omega}\sqrt{\boldsymbol{X}_{k}}. The question is whether these must be evaluated at every time step, since 𝑿k\sqrt{\boldsymbol{X}_{k}} is a dynamical variable: unfortunately, this is the case.
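The route (135)-(137) can be sketched as follows; the polynomial map f(z) = z³ − 2z and the use of the uniform mean field projector are illustrative choices of ours:

```python
import numpy as np

# Evaluate a matrix map F(Omega X) through the symmetric matrix
# sqrt(X) Omega sqrt(X), whose eigendecomposition is real, as in (135)-(137).
rng = np.random.default_rng(2)
N = 5
Omega = np.ones((N, N)) / N                 # uniform mean field projector
x = rng.random(N) + 0.5                     # positive, so sqrt(X) is real
X = np.diag(x)
sqX, isqX = np.diag(np.sqrt(x)), np.diag(1 / np.sqrt(x))

def f(z):                                   # illustrative scalar map
    return z**3 - 2.0 * z

# Direct evaluation: f(Omega X) as a matrix polynomial.
M = Omega @ X
F_direct = M @ M @ M - 2.0 * M

# Route (137): eigendecomposition of the symmetric matrix sqrt(X) Omega sqrt(X).
S = sqX @ Omega @ sqX
w, P = np.linalg.eigh(S)                    # real spectrum, orthogonal P
F_sim = isqX @ (P @ np.diag(f(w)) @ P.T) @ sqX

print(np.allclose(F_direct, F_sim))         # True
```

For a non-polynomial fi,kf_{i,k} the direct route is unavailable, and only the eigendecomposition route applies; as noted above, the decomposition must be recomputed at every time step.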

4.2 Uniform mean field projector

We now provide several examples to show the applications of the theory developed in this paper.

4.2.1 One dimensional potential

Our first example is a one-dimensional nonlinear dynamical system, written as:

dxdt=f(x)=V(x)x.\frac{dx}{dt}=f(x)=-\frac{\partial V(x)}{\partial x}. (139)

Let us thus analyze numerically the PEDS 𝒪={f(x),𝛀1,α(𝑰𝛀1)X,1,N}\mathcal{O}=\{f(x),\boldsymbol{\Omega}_{1},-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X},\vec{1},N\}, with a potential of the form

V(x)=a0+a1x+a22x2+a33x3+a44x4\displaystyle V(x)=a_{0}+a_{1}x+\frac{a_{2}}{2}x^{2}+\frac{a_{3}}{3}x^{3}+\frac{a_{4}}{4}x^{4} (140)

for a set of parameters for which two minima are present, as shown in Fig. 4. First, we compare the standard commutative and non-commutative maps. The difference is shown in Fig. 5 for identical initial conditions.

Figure 4: Potential V(x)V(x) in (140), for a0=0a_{0}=0, a1=9.85a_{1}=9.85, a2=10a_{2}=10, a3=2a_{3}=2 and a4=0.395a_{4}=-0.395. The parameters are chosen so that the potential exhibits two minima.

The associated PEDS is given by the differential system

dXdt\displaystyle\frac{d\vec{X}}{dt} =\displaystyle= 𝑭(X)1α(𝑰𝛀𝟏)X\displaystyle\boldsymbol{F}(\vec{X})\vec{1}-\alpha(\boldsymbol{I}-\boldsymbol{\Omega_{1}})\vec{X} (141)
=\displaystyle= (a1𝑰+a2(𝛀𝟏𝑿)+a3(𝛀𝟏𝑿)2+a4(𝛀𝟏𝑿)3)1α(𝑰𝛀𝟏)X\displaystyle-\Big{(}a_{1}\boldsymbol{I}+a_{2}(\boldsymbol{\Omega_{1}}\boldsymbol{X})+a_{3}(\boldsymbol{\Omega_{1}}\boldsymbol{X})^{2}+a_{4}(\boldsymbol{\Omega_{1}}\boldsymbol{X})^{3}\Big{)}\vec{1}-\alpha(\boldsymbol{I}-\boldsymbol{\Omega_{1}})\vec{X}
Figure 5: Comparison between the standard commutative and non-commutative maps for the 1D potential (140) and the target system.
Figure 6: Evolution of the PEDS {f(x),𝛀1,α(𝑰𝛀𝟏)X,1,N}\{f(x),\boldsymbol{\Omega}_{1},-\alpha(\boldsymbol{I}-\boldsymbol{\Omega_{1}})\vec{X},\vec{1},N\} with the standard non-commutative map, for random initial conditions around the maximum. The blue solid lines are the target system minima. The black dashed curves are the target system trajectories (Ω=I\Omega=I), which asymptotically reach both minima. The red curves refer to the coupled PEDS with the uniform mean field projector, leading to the global minimum, as can be seen from the plot of x~(t)=𝒫𝛀1X(t)\tilde{x}(t)=\mathcal{P}_{\boldsymbol{\Omega}_{1}}\vec{X}(t) (green dashed line).

Figure 6 shows the results of the numerical integration, using a simple Euler scheme with Gaussian-distributed initial conditions around the potential maximum at x=0.51x^{*}=-0.51, for both the target system and the PEDS embedding with the standard non-commutative map. The PEDS trajectories all reach the global minimum of the potential V(x)V(x), while the uncoupled trajectories split between the two stable equilibria.
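A minimal version of this experiment can be sketched as follows. For a self-contained check we assume the standard double-well potential V(x) = x⁴/4 − x²/2 instead of the parameters of Fig. 4 (an illustrative assumption of ours), so that f(x) = −V′(x) = x − x³ has stable equilibria at x = ±1 and an unstable one at x = 0:

```python
import numpy as np

# Euler integration of a 1D PEDS of the form (141), assuming the illustrative
# double well V(x) = x**4/4 - x**2/2, i.e. f(x) = x - x**3.
rng = np.random.default_rng(3)
N, alpha, dt, steps = 50, 1.0, 0.01, 3000
Omega1 = np.ones((N, N)) / N
ones = np.ones(N)

X = 0.3 + 0.1 * rng.standard_normal(N)     # initial cloud with positive mean

for _ in range(steps):
    M = Omega1 @ np.diag(X)                # the matrix Omega_1 X
    drift = (M - M @ M @ M) @ ones         # embedded f: (Omega_1 X - (Omega_1 X)^3) 1
    X = X + dt * (drift - alpha * (X - Omega1 @ X))

print(X.mean())             # close to +1: the cloud collapses onto one equilibrium
print(X.max() - X.min())    # the spread decays at rate alpha
```

Because (𝛀1𝑿)^k 1⃗ = ⟨X⟩^k 1⃗, the mean obeys the target dynamics d⟨X⟩/dt = f(⟨X⟩), while the deviations from the mean decay at rate α, so the whole cloud collapses onto a single equilibrium.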

4.2.2 Vector target system

As an example of vector target system, we consider a two-dimensional dynamical system embedded with the standard non-commutative map. The target system is:

dxdt\displaystyle\frac{dx}{dt} =\displaystyle= V(x,y)x\displaystyle-\frac{\partial V(x,y)}{\partial x} (142)
dydt\displaystyle\frac{dy}{dt} =\displaystyle= V(x,y)y\displaystyle-\frac{\partial V(x,y)}{\partial y} (143)

where

V(x,y)=exp(x22y22+y44)V(x,y)=\exp\left(\frac{x^{2}}{2}-\frac{y^{2}}{2}+\frac{y^{4}}{4}\right) (144)

which is characterized by two local minima, (x=0,y=±1)(x^{*}=0,y^{*}=\pm 1). The equations of motion define the gradient descent dynamics

dxdt\displaystyle\frac{dx}{dt} =\displaystyle= fx(x,y)=xV(x,y)\displaystyle f_{x}(x,y)=-xV(x,y) (145)
dydt\displaystyle\frac{dy}{dt} =\displaystyle= fy(x,y)=(yy3)V(x,y).\displaystyle f_{y}(x,y)=(y-y^{3})V(x,y). (146)

The interest in this example lies in the fact that the PEDS equations of motion depend on the ordering prescription considered. We discuss here the two cases defined below:

S1\displaystyle S_{1} \displaystyle\rightarrow {𝑭x(1)(X,Y)=𝛀1𝑿𝑽1(𝑿,𝒀)𝑭y(1)(X,Y)=(𝛀1𝒀)(𝑰(𝛀1𝒀)2)𝑽1(𝑿,𝒀)\displaystyle\begin{cases}\boldsymbol{F}^{(1)}_{x}(\vec{X},\vec{Y})&=-\boldsymbol{\Omega}_{1}\boldsymbol{X}\ \boldsymbol{V}_{1}(\boldsymbol{X},\boldsymbol{Y})\\ \boldsymbol{F}_{y}^{(1)}(\vec{X},\vec{Y})&=(\boldsymbol{\Omega}_{1}\boldsymbol{Y})\big{(}\boldsymbol{I}-(\boldsymbol{\Omega}_{1}\boldsymbol{Y})^{2}\big{)}\ \boldsymbol{V}_{1}(\boldsymbol{X},\boldsymbol{Y})\\ \end{cases} (147)
S2\displaystyle S_{2} \displaystyle\rightarrow {𝑭x(2)(X,Y)=𝛀1𝑿𝑽2(𝑿,𝒀)𝑭y(2)(X,Y)=(𝛀1𝒀)(𝑰(𝛀1𝒀)2)𝑽2(𝑿,𝒀)\displaystyle\begin{cases}\boldsymbol{F}_{x}^{(2)}(\vec{X},\vec{Y})&=-\boldsymbol{\Omega}_{1}\boldsymbol{X}\ \boldsymbol{V}_{2}(\boldsymbol{X},\boldsymbol{Y})\\ \boldsymbol{F}_{y}^{(2)}(\vec{X},\vec{Y})&=(\boldsymbol{\Omega}_{1}\boldsymbol{Y})\big{(}\boldsymbol{I}-(\boldsymbol{\Omega}_{1}\boldsymbol{Y})^{2}\big{)}\ \boldsymbol{V}_{2}(\boldsymbol{X},\boldsymbol{Y})\\ \end{cases} (148)

where

𝑽1(𝑿,𝒀)\displaystyle\boldsymbol{V}_{1}(\boldsymbol{X},\boldsymbol{Y}) =\displaystyle= exp(12(𝛀𝟏𝑿)2(𝛀𝟏𝒀)2(12𝑰14(𝛀𝟏𝒀)2))\displaystyle\text{exp}\left(\frac{1}{2}(\boldsymbol{\Omega_{1}}\boldsymbol{X})^{2}-(\boldsymbol{\Omega_{1}}\boldsymbol{Y})^{2}\Big{(}\frac{1}{2}\boldsymbol{I}-\frac{1}{4}(\boldsymbol{\Omega_{1}}\boldsymbol{Y})^{2}\Big{)}\right) (149)
𝑽2(𝑿,𝒀)\displaystyle\boldsymbol{V}_{2}(\boldsymbol{X},\boldsymbol{Y}) =\displaystyle= exp(12(𝛀𝟏𝑿)2)exp((𝛀𝟏𝒀)2(12𝑰14(𝛀𝟏𝒀)2))\displaystyle\text{exp}\Big{(}\frac{1}{2}(\boldsymbol{\Omega_{1}}\boldsymbol{X})^{2}\Big{)}\text{exp}\Big{(}-(\boldsymbol{\Omega_{1}}\boldsymbol{Y})^{2}\Big{(}\frac{1}{2}\boldsymbol{I}-\frac{1}{4}(\boldsymbol{\Omega_{1}}\boldsymbol{Y})^{2}\Big{)}\Big{)} (150)

where 𝑽1𝑽2\boldsymbol{V}_{1}\neq\boldsymbol{V}_{2} since in general exp(𝛀1𝑿+𝛀1𝒀)exp(𝛀1𝑿)exp(𝛀1𝒀)\exp(\boldsymbol{\Omega}_{1}\boldsymbol{X}+\boldsymbol{\Omega}_{1}\boldsymbol{Y})\neq\exp(\boldsymbol{\Omega}_{1}\boldsymbol{X})\exp(\boldsymbol{\Omega}_{1}\boldsymbol{Y}) for non-commuting matrices. Thus, choosing one versus the other is equivalent to a different ordering choice.

Figure 7: The potential V(x,y)V(x,y) in (144); the projection shows the two minima.

We embed this system of equations via 𝒪1=({fx,fy},𝛀𝟏,{α(𝑰𝛀1)X,α(𝑰𝛀1)Y},S1,1,N)\mathcal{O}_{1}=(\{f_{x},f_{y}\},\boldsymbol{\Omega_{1}},\{-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X},-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{Y}\},S_{1},\vec{1},N) and 𝒪2=({fx,fy},𝛀𝟏,{α(𝑰𝛀1)X,α(𝑰𝛀1)Y},S2,1,N)\mathcal{O}_{2}=(\{f_{x},f_{y}\},\boldsymbol{\Omega_{1}},\{-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X},-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{Y}\},S_{2},\vec{1},N), obtaining

dXdt\displaystyle\frac{d\vec{X}}{dt} =\displaystyle= 𝛀1𝑭x(i)1α(𝑰𝛀1)X,\displaystyle\boldsymbol{\Omega}_{1}\boldsymbol{F}_{x}^{(i)}\vec{1}-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X}, (151)
dYdt\displaystyle\frac{d\vec{Y}}{dt} =\displaystyle= 𝛀1𝑭y(i)1α(𝑰𝛀1)Y,\displaystyle\boldsymbol{\Omega}_{1}\boldsymbol{F}_{y}^{(i)}\vec{1}-\alpha(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{Y}, (152)

where i=1,2i=1,2 is the label for S1,S2S_{1},S_{2}. The numerical solutions are shown in Fig. 8, using α=0.1\alpha=0.1 and N=50N=50. The two dynamic behaviours are essentially identical, as per Corollary 3.13. Interestingly, even if in general [𝛀𝟏𝑿,𝛀𝟏𝒀]0[\boldsymbol{\Omega_{1}X},\boldsymbol{\Omega_{1}Y}]\neq 0, for this example we can work out the full details leading to the independence of the ordering.

First, let us note that 𝛀𝟏𝑿𝛀𝟏=X𝛀1\boldsymbol{\Omega_{1}X\Omega_{1}}=\langle X\rangle\boldsymbol{\Omega}_{1}. Using this expression, for any analytic ff normalized so that f(0)=1f(0)=1 we can write

f(𝛀1𝑿)=𝑰+f(X)1X𝛀1𝑿f(\boldsymbol{\Omega}_{1}\boldsymbol{X})=\boldsymbol{I}+\frac{f(\langle X\rangle)-1}{\langle X\rangle}\boldsymbol{\Omega}_{1}\boldsymbol{X}

therefore

exp(f(𝛀1𝑿)+g(𝛀1𝒀))\displaystyle\exp\left(f(\boldsymbol{\Omega}_{1}\boldsymbol{X})+g(\boldsymbol{\Omega}_{1}\boldsymbol{Y})\right) =exp(2𝑰+f(X)1X𝛀1𝑿+g(Y)1Y𝛀1𝒀)\displaystyle=\exp\left(2\boldsymbol{I}+\frac{f(\langle X\rangle)-1}{\langle X\rangle}\boldsymbol{\Omega}_{1}\boldsymbol{X}+\frac{g(\langle Y\rangle)-1}{\langle Y\rangle}\boldsymbol{\Omega}_{1}\boldsymbol{Y}\right)
=exp(2)exp[𝛀1(f(X)1X𝑿+g(Y)1Y𝒀)]\displaystyle=\exp(2)\exp\left[\boldsymbol{\Omega}_{1}\left(\frac{f(\langle X\rangle)-1}{\langle X\rangle}\boldsymbol{X}+\frac{g(\langle Y\rangle)-1}{\langle Y\rangle}\boldsymbol{Y}\right)\right] (153)

We can now apply again the same formula

exp[𝛀1(f(X)1X𝑿+g(Y)1Y𝒀)]\displaystyle\exp\left[\boldsymbol{\Omega}_{1}\left(\frac{f(\langle X\rangle)-1}{\langle X\rangle}\boldsymbol{X}+\frac{g(\langle Y\rangle)-1}{\langle Y\rangle}\boldsymbol{Y}\right)\right]
=𝑰+ef(X)+g(Y)21f(X)+g(Y)2𝛀1(f(X)1X𝑿+g(Y)1Y𝒀)\displaystyle\quad=\boldsymbol{I}+\frac{e^{f(\langle X\rangle)+g(\langle Y\rangle)-2}-1}{f(\langle X\rangle)+g(\langle Y\rangle)-2}\boldsymbol{\Omega}_{1}\left(\frac{f(\langle X\rangle)-1}{\langle X\rangle}\boldsymbol{X}+\frac{g(\langle Y\rangle)-1}{\langle Y\rangle}\boldsymbol{Y}\right) (154)

yielding, after taking into account the multiplication times 1\vec{1}

exp(2)[𝑰+ef(X)+g(Y)21f(X)+g(Y)2𝛀1(f(X)1X𝑿+g(Y)1Y𝒀)]1\displaystyle\exp(2)\left[\boldsymbol{I}+\frac{e^{f(\langle X\rangle)+g(\langle Y\rangle)-2}-1}{f(\langle X\rangle)+g(\langle Y\rangle)-2}\boldsymbol{\Omega}_{1}\left(\frac{f(\langle X\rangle)-1}{\langle X\rangle}\boldsymbol{X}+\frac{g(\langle Y\rangle)-1}{\langle Y\rangle}\boldsymbol{Y}\right)\right]\vec{1}
=exp(f(X)+g(Y))1.\displaystyle\quad=\exp\left(f(\langle X\rangle)+g(\langle Y\rangle)\right)\vec{1}. (155)

In other words, we have shown that

exp(f(𝛀1𝑿)+g(𝛀1𝒀))1=exp(f(𝛀1𝑿))exp(g(𝛀1𝒀))1,\exp\left(f(\boldsymbol{\Omega}_{1}\boldsymbol{X})+g(\boldsymbol{\Omega}_{1}\boldsymbol{Y})\right)\vec{1}=\exp\left(f(\boldsymbol{\Omega}_{1}\boldsymbol{X})\right)\exp\left(g(\boldsymbol{\Omega}_{1}\boldsymbol{Y})\right)\vec{1}, (156)

i.e. the equivalence of the dynamics of S1S_{1} and S2S_{2}.
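The factorization (156) can also be checked numerically. In the sketch below the matrix exponential is computed with a plain truncated Taylor series (adequate for the small, well-conditioned matrices involved), and the illustrative choices f(x) = x²/2 and g(y) = −y²/2 + y⁴/4 are read off from the exponent of (144):

```python
import numpy as np

# Truncated Taylor series for the matrix exponential; fine for small norms.
def mexp(M, terms=40):
    out, term = np.eye(M.shape[0]), np.eye(M.shape[0])
    for k in range(1, terms):
        term = term @ M / k
        out = out + term
    return out

rng = np.random.default_rng(4)
N = 5
Omega1 = np.ones((N, N)) / N
ones = np.ones(N)
x, y = rng.random(N), rng.random(N)
A = Omega1 @ np.diag(x)                       # Omega_1 X
B = Omega1 @ np.diag(y)                       # Omega_1 Y

fA = 0.5 * (A @ A)                            # f(Omega_1 X) = (Omega_1 X)^2 / 2
gB = -0.5 * (B @ B) + 0.25 * (B @ B @ B @ B)  # g(Omega_1 Y)

lhs = mexp(fA + gB) @ ones
rhs = mexp(fA) @ mexp(gB) @ ones
scalar = np.exp(0.5 * x.mean()**2 - 0.5 * y.mean()**2 + 0.25 * y.mean()**4)

print(np.allclose(lhs, rhs), np.allclose(lhs, scalar * ones))
```

Both sides reduce to the scalar mean field value exp(f(⟨X⟩)+g(⟨Y⟩)) times 1⃗, even though the matrices themselves differ.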

Figure 8: The two PEDS dynamics 𝒪1\mathcal{O}_{1} and 𝒪2\mathcal{O}_{2} for the xx and yy variables (left and right) for N=50N=50, integrated using an Euler scheme with step size dt=0.1dt=0.1. We consider here the non-commutative map. We can see that the dynamics of S1S_{1} and S2S_{2} are identical. Note that S1S_{1}, S2S_{2}, and the uncoupled systems all have identical initial conditions.

4.2.3 Hamiltonian equations with dissipation

As a third example, let us consider another two-dimensional vector target system: the description of a dissipative Hamiltonian system for a single particle of mass mm. The target system reads

dxdt\displaystyle\frac{dx}{dt} =\displaystyle= pm\displaystyle\frac{p}{m} (157)
dpdt\displaystyle\frac{dp}{dt} =\displaystyle= Vxχpm\displaystyle-\frac{\partial V}{\partial x}-\chi\frac{p}{m} (158)

where χ\chi denotes the dissipation and we define the force f(x,p)=V/xf(x,p)=-\partial V/\partial x. Following the prescription of the previous sections, we write the PEDS

𝒪=({pm,f(x,p)χp/m},𝛀𝟏,{αx(𝑰𝛀𝟏)X,αp(𝑰𝛀𝟏)P},1,N).\mathcal{O}=(\{\frac{p}{m},f(x,p)-\chi{p}/{m}\},\boldsymbol{\Omega_{1}},\{-\alpha_{x}(\boldsymbol{I-\Omega_{1}})\vec{X},-\alpha_{p}(\boldsymbol{I-\Omega_{1}})\vec{P}\},\vec{1},N).

The extended system equations are given by

dXdt\displaystyle\frac{d\vec{X}}{dt} =\displaystyle= 1m𝛀1𝑷1αx(𝑰𝛀1)X\displaystyle\frac{1}{m}\boldsymbol{\Omega}_{1}\boldsymbol{P}\vec{1}-\alpha_{x}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{X} (159)
dPdt\displaystyle\frac{d\vec{P}}{dt} =\displaystyle= 𝛀1(𝑭(𝑿)χ𝑷m)1αp(𝑰𝛀1)P\displaystyle\boldsymbol{\Omega}_{1}\left(\boldsymbol{F}(\boldsymbol{X})-\chi\frac{\boldsymbol{P}}{m}\right)\vec{1}-\alpha_{p}(\boldsymbol{I}-\boldsymbol{\Omega}_{1})\vec{P} (160)

which is thus a set of 2N2N equations. Let us focus on the potential V(x)V(x) defined in (140). The results are shown in Fig. 9.
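A minimal Euler integration of (159)-(160) can be sketched as follows, assuming for simplicity a harmonic potential V(x) = x²/2 instead of (140) (our choice for a self-contained check), unit mass and unit dissipation:

```python
import numpy as np

# Euler sketch of the dissipative Hamiltonian PEDS (159)-(160), with the
# illustrative harmonic potential V(x) = x**2/2, so F(X) 1 = -(Omega_1 X) 1.
rng = np.random.default_rng(5)
N, mass, chi = 50, 1.0, 1.0
ax, ap, dt, steps = 1.0, 1.0, 0.01, 4000
Omega1 = np.ones((N, N)) / N
ones = np.ones(N)
I = np.eye(N)

X = 1.0 + 0.1 * rng.standard_normal(N)
P = 0.1 * rng.standard_normal(N)

for _ in range(steps):
    dX = (Omega1 @ np.diag(P) @ ones) / mass - ax * (I - Omega1) @ X
    dP = Omega1 @ (-np.diag(X) @ ones - chi * np.diag(P) @ ones / mass) \
         - ap * (I - Omega1) @ P
    X, P = X + dt * dX, P + dt * dP

print(abs(X.mean()), abs(P.mean()))   # both near 0: damped relaxation to (0, 0)
```

The means follow the damped-oscillator target dynamics, while the deviations from the means decay at rates αx\alpha_{x} and αp\alpha_{p}.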

Figure 9: PEDS dynamics for the Hamiltonian system with one particle, integrated using an Euler scheme with step size dt=0.001dt=0.001. The extended system has N=50N=50. Also in this case, the dynamics approaches the fixed points of the potential for the xx variable, reaching a local minimum, while the pp variable tends to the asymptotic value p=0p^{*}=0.

4.3 Beyond the uniform mean field projector

This paper is focused mostly on the uniform mean field projection 𝛀1\boldsymbol{\Omega}_{1}. Before concluding, we wish to also numerically simulate the case of a PEDS with a different projection operator. Let us consider a PEDS where 𝛀=𝑩t(𝑩𝑩t)1𝑩\boldsymbol{\Omega}=\boldsymbol{B}^{t}(\boldsymbol{B}\boldsymbol{B}^{t})^{-1}\boldsymbol{B}, where 𝑩\boldsymbol{B} is a random rectangular matrix (with entries uniformly distributed on [0,1][0,1]) of size K×NK\times N. The scalar target dynamical system we are interested in is again

dxdt=f(x)=V(x)x.\frac{dx}{dt}=f(x)=-\frac{\partial V(x)}{\partial x}. (161)

with the potential in (140), choosing parameters a4=a0=0a_{4}=a_{0}=0, a3=2a_{3}=-2, a2=10a_{2}=-10 and a1=9.85a_{1}=9.85 that guarantee a single potential minimum, as shown in Fig. 10 (left). We then consider the PEDS 𝒪=(f(x),𝛀,α(𝑰𝛀)X,𝛀1,N)\mathcal{O}=(f(x),\boldsymbol{\Omega},-\alpha(\boldsymbol{I-\Omega})\vec{X},\boldsymbol{\Omega}\vec{1},N), with N=50N=50, and follow the observable x~=𝒫𝛀X\tilde{x}=\mathcal{P}_{\boldsymbol{\Omega}}\vec{X}. The results are shown in Fig. 10 (right). The PEDS embedding converges also in this case to the absolute minimum of the potential, thus confirming that a generalization of the results of this paper to arbitrary projectors is possible.
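A sketch of this construction (with a K × N convention and an illustrative value of K of our own choosing) verifies that 𝛀 is indeed an orthogonal projector of rank K:

```python
import numpy as np

# Random rank-K orthogonal projector: Omega = B^t (B B^t)^{-1} B.
# The uniform mean field projector Omega_1 is the special case B = (1, ..., 1).
rng = np.random.default_rng(6)
N, K = 50, 10
B = rng.random((K, N))                     # entries uniform on [0, 1]
Omega = B.T @ np.linalg.inv(B @ B.T) @ B   # N x N projector onto the row space of B

assert np.allclose(Omega @ Omega, Omega)   # idempotent
assert np.allclose(Omega, Omega.T)         # symmetric (orthogonal projector)
print(int(round(np.trace(Omega))))         # 10: trace = rank = K
```

For K < N the matrix B B^t is invertible almost surely, so the construction is well defined for generic random draws.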

Figure 10: PEDS dynamics for the target system (161) with a random projection operator. The results established in this paper for the case 𝛀𝟏\boldsymbol{\Omega_{1}} can plausibly be extended to more general projectors.

As a last comment, one of the main motivations for this study is that in circuits, conservation laws can be expressed in terms of projector operators. An example is the volatile but (almost) ideal memristor. A resistor with memory can be described, at the lowest level of approximation for a current controlled device, by an effective dynamical resistance depending on an internal parameter xx. In this sense, memristors are approximately described by the functional form R(x)=Roff(1x)+xRonR(x)=R_{\text{off}}(1-x)+xR_{\text{on}}, where Ron<RoffR_{\text{on}}<R_{\text{off}} are the boundary resistances, and x[0,1]x\in[0,1]. We assume that the internal memory parameter xx evolves according to a simple equation of the form dx/dt=RonI/βαxdx/{dt}={R_{\text{on}}}I/{\beta}-\alpha x. The parameters α\alpha and β\beta are the decay constant and the effective activation voltage per unit time, respectively. For a recent paper which inspired this study, consider [30], where transitions between effective minima of a lower dimensional potential were observed. Using Ohm’s law, we define voltage V=R(x)IV=R(x)I, so as to obtain a normalized equation for x(t)x(t)

dxdt=Vβ11χxαx=αV(x,s)x\displaystyle\frac{dx}{dt}=\frac{V}{\beta}\frac{1}{1-\chi x}-\alpha x=-\alpha\frac{\partial V(x,s)}{\partial x} (162)

where χ=(RoffRon)/Roff\chi={(R_{\text{off}}-R_{\text{on}})}/{R_{\text{off}}} and s=Sαβs=\frac{S}{\alpha\beta}. Here SS is the voltage applied to the circuit, ss is a normalized quantity with units of inverse time, 0χ10\leq\chi\leq 1 in the physically relevant cases, and V(x,s)V(x,s) is an effective potential.

The dynamics of a single memristor (162) is fully characterized by gradient descent on the effective potential

V(x,s)=12x2+sχlog(1χx),V(x,s)=\frac{1}{2}x^{2}+\frac{s}{\chi}\log(1-\chi x), (163)

with s=Sαβs=\frac{S}{\alpha\beta}; the constant α\alpha also acts as the learning rate in (162).

For a network of memristors, the differential equations for xi(t)x_{i}(t) form a set of coupled ODEs [23]:

ddtx=1β(𝑰χ𝛀𝑿)1𝛀Sαx,\displaystyle\frac{d}{dt}\vec{x}=\frac{1}{\beta}(\boldsymbol{I}-\chi\boldsymbol{\Omega X})^{-1}\boldsymbol{\Omega}\vec{S}-\alpha\vec{x}, (164)

where 𝑿ij(t)=xi(t)δij\boldsymbol{X}_{ij}(t)=x_{i}(t)\delta_{ij}. The matrix 𝛀\boldsymbol{\Omega} is the projection operator on the vector space of cycles of 𝒢\mathcal{G}, the graph representing the circuit [23], and is, as discussed in the Introduction, a mathematical consequence of Kirchhoff’s conservation laws. Now, we note that we can write (164) as:

ddtx=𝛀(1β(𝑰χ𝛀𝑿)1𝛀Sαx)α(𝑰𝛀)x,\displaystyle\frac{d}{dt}\vec{x}=\boldsymbol{\Omega}(\frac{1}{\beta}(\boldsymbol{I}-\chi\boldsymbol{\Omega X})^{-1}\boldsymbol{\Omega}\vec{S}-\alpha\vec{x})-\alpha(\boldsymbol{I}-\boldsymbol{\Omega})\vec{x}, (165)

which is exactly in the form of a PEDS, with a standard decay function. Thus, the results of [30] can be interpreted as the relaxation of the system towards the minima defined by the embedding function. If 𝛀=𝛀1\boldsymbol{\Omega}=\boldsymbol{\Omega}_{1}, then using the results of this paper we know that the potential (163) determines the effective minima of the system. However, in order to justify the presence of the rumbling transitions shown in [30], a deeper understanding of the PEDS properties for a general projector 𝛀\boldsymbol{\Omega} is required.
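The rewriting of (164) as (165) can be verified numerically. The sketch below (sizes and parameters are illustrative choices of ours) relies on the identity Ω(I − χΩX)⁻¹Ω = (I − χΩX)⁻¹Ω, which follows from Ω² = Ω:

```python
import numpy as np

# Check that the memristor network equation (164) equals its PEDS form (165)
# for a random orthogonal projector Omega and random states x in (0, 1).
rng = np.random.default_rng(7)
N, K, chi, alpha, beta = 20, 6, 0.9, 1.0, 1.0
B = rng.random((K, N))
Omega = B.T @ np.linalg.inv(B @ B.T) @ B   # random rank-K projector
x = rng.random(N)                          # memristor internal states
X = np.diag(x)
S = rng.random(N)                          # applied voltages
I = np.eye(N)

core = np.linalg.inv(I - chi * Omega @ X) @ Omega @ S / beta
rhs_164 = core - alpha * x
rhs_165 = Omega @ (core - alpha * x) - alpha * (I - Omega) @ x

print(np.allclose(rhs_164, rhs_165))   # True
```

Expanding rhs_165, the cross terms ±αΩx cancel and the leading Ω is absorbed by the projector already present in the core term, recovering rhs_164.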

5 Conclusions and perspective

In the present paper we presented and studied a map between dynamical systems of size mm and dynamical systems in a higher number of variables. This is the first of a series of papers formally investigating the projective embeddings of dynamical systems (PEDS) paradigm that we defined here. The purpose of this work was to formally show the properties of this type of embeddings, within the context of a particular projector matrix. As we have seen, their structure is such that for long times, the asymptotic equilibria of the target dynamical system can be recovered.

We have discussed in particular the case of the uniform mean field projector operator Ω1,ij=1N{\Omega}_{1,ij}=\frac{1}{N}. For this choice, we have been able to prove analytically that the asymptotic equilibria are strictly connected to those of the original system. Aside from establishing the formalism, this paper also established some exact results about how the embedding changes the properties of the critical points of the dynamics, including the cases of unstable equilibria and saddle points.

Specifically, we have studied the embedding of mm dimensional dynamical systems in NmNm-dimensional systems. The purpose of such embedding is to modify the nature of the fixed points of the dynamics, i.e. those satisfying {xs.t.dx/dt|x=0}\{\vec{x}^{*}\ \text{s.t.}\ {d\vec{x}}/{dt}|_{\vec{x}^{*}}=0\}. In particular, we have shown that stable and saddle type fixed points retain their properties, while unstable fixed points become saddles. This observation justifies future work in this direction, in particular exploiting different types of decay functions, matrix embeddings and projectors with respect to this contribution. It is worth mentioning that a follow-up of this work is in preparation, in which we discuss the behavior of PEDS for general projectors; many of the results on the Jacobian obtained in this paper actually apply also in the general case [PEDS2].

An important direction for future work will be how to further modify the spectral properties of the fixed points, i.e. the nature of the Jacobian evaluated at x\vec{x}^{*}. What we have shown in the present paper is that, for the uniform projector, the PEDS Jacobian is always symmetric, and thus characterized by real eigenvalues, which in particular are negative if the corresponding fixed point of the target system is stable. This implies that the dynamics near stable fixed points is always laminar, i.e. slowly decaying towards the fixed point. As we will see in future works, this is not the case for general projectors, for which approximate but special techniques will have to be employed.

As discussed, the spectral signature is in part inherited from the original target dynamical system, but modified through the extended number of dimensions. The idea of generalizing the space of solutions to higher dimensions is not new. In a way, the PEDS technique is in spirit close to both Markov Chain Monte Carlo methods [28] and the notion of lifts in convex optimization [29], but is specifically developed for the fixed points of dynamical systems.

In particular, in [30] it was observed that memristive circuits have an effective lower dimensional representation in terms of an effective potential, and that they can exhibit a “rumbling” transition, i.e. a transient chaotic tunneling between local minima of a properly defined potential. As it turns out, such dynamics is only a particular case of the PEDS introduced here, in which the projector operator was given by random circuit connections.

The rumbling transition in [30] was pinpointed numerically to be due to an effective “Lyapunov force”, present in connection with the rumbling transition phenomenon. Such a force was defined essentially as a deviation from a mean field theory, and we provided evidence of a novel, athermal mechanism by which barrier escapes emerge in the effective description of a multi-particle system. This paper is a continuation of that work, attempting to generalize those findings to general systems, although focusing specifically on a particular type of projector: in this case, these “Lyapunov” forces are not present. Similar yet different types of behavior were also observed previously within the context of memory-based computing (memcomputing) solutions [31, 32, 33, 34, 35, 36].

The main focus of this paper represents a first step towards a clarification of the general reasons why the introduction of hidden variables in a dynamical system can lead to transitions between local and global minima of the effective description via instabilities in the full system. Since maxima can be turned into saddle points, generically there can be no “barriers” when the target system is a gradient descent. However, as we will show formally in future works, in order to obtain the rumbling transitions, one has to go beyond the uniform mean field approximation and study a more general type of projector.

Clearly, the projective embedding studied in this paper can be employed in a variety of dynamical systems, including all sorts of gradient-based dynamics, with applicability to machine learning and neural networks. These applications will also be the subject of future studies. In particular, we hope that the introduction of “hidden variables” in dynamical systems [37] can be further investigated for the purpose of machine learning and optimization applications [38]. In general, the study of transient chaos in dynamical systems and optimization [39, 40] is an interesting area of research with possible applications also in memristor-based algorithms [41].

Acknowledgments. The work of F.C. was carried out under the auspices of the NNSA of the U.S. DoE at LANL under Contract No. DE-AC52-06NA25396, and in particular grant PRD20190195 from the LDRD. F. C. would also like to thank W. Bruinsma for various comments and observations on the paper.

References