This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Vector-Valued Gossip over ww-Holonomic Networks

E. Bayram* M.-A. Belabbas* T. Başar E. Bayram, M.-A. Belabbas and T. Başar are with Coordinated Science Laboratory, University of Illinois, Urbana-Champaign, Email: {ebayram2,belabbas,basar1}@illinois.edu
Abstract

We study the weighted average consensus problem for a gossip network of agents with vector-valued states. For a given matrix-weighted graph, the gossip process is described by a sequence of pairs of adjacent agents communicating and updating their states based on the edge matrix weight. Our key contribution is providing conditions for the convergence of this non-homogeneous Markov process as well as the characterization of its limit set. To this end, we introduce the notion of “ww-holonomy” of a set of stochastic matrices, which enables the characterization of sequences of gossiping pairs resulting in reaching a desired consensus in a decentralized manner. Stated otherwise, our result characterizes the limiting behavior of infinite products of (non-commuting, possibly with absorbing states) stochastic matrices.

Keywords: Consensus; Gossiping; Non-Homogeneous Markov Processes; Holonomy; Convergence of Matrix Products; Permutation Groups

1 Introduction

Consensus entails reaching an agreement between a set of agents [1]. Many applications of distributed control systems require agents to reach a consensus for a given quantity; for example consensus to the average value of their respective initial states. An extension of average value consensus is the weighted average consensus, in which each agent contributes to the agreed-upon consensus value based on its assigned weight; see the literature review below for more details. In this paper, we study the weighted average consensus problem for a gossiping network of agents with vector-valued states. Specifically, given a matrix-weighted communication graph, we study the process whereby at each time step, two adjacent agents in the network communicate and update their states based on the matrix weight of the edge adjoining them. These two agents are called a gossiping pair and the overall process is called a weighted gossip process [2, 3]. It is akin to a non-homogeneous Markov process, and the study of its convergence thus reduces to the study of convergence of an infinite product of row stochastic matrices taken from a finite set. It is well known that this is a hard problem for which no general solution is known.This is due in part to the fact that, save for particular cases such as a set of commuting matrices, the order in which stochastic matrices appear in the infinite product clearly affects the limit set; in fact, this set can be a continuum (see, e.g. [4] for examples) or it can be finite [5].

We adopt here a vantage point on the consensus problem similar to the one of [5], where the notion of holonomy of a set of stochastic matrices was introduced. There, the authors used the term holonomy to indicate a change of a certain left eigenvector (referred to as weight vector below) corresponding to eigenvalue 11 of the product of stochastic matrices along any cycle in the graph. For each cycle in the graph, one can associate a holonomy group (see [6] for the precise definition of a holonomy group). This group characterizes how the eigenvector changes as gossiping occurs along the cycle. In [5], the authors consider gossip processes for agents with scalar states and impose that the entries of a gossip matrix be strictly positive. Together, these restrictions imply that the holonomy group for a cycle, if it exists, can only be the trivial group.

Our work here extends this earlier work in two fundamental ways. First, we allow vector-valued states for the agents. Second, and more importantly, we allow zero entries in the gossip matrices. Said otherwise, we allow for update matrices that have absorbing states (i.e., have a standard unit vector as a row). These extensions together make possible the existence of a non-trivial, finite holonomy group in a gossip process, whose investigation will be one of the main concerns of this paper.

More generally, the hereby adopted set-up raises the following questions: (1)(1) How to understand the appearance of non-trivial holonomy groups? I.e., situations where the weight vector changes after completing one loop in the gossip graph, but then returns to its initial value after completing this loop a finite number of times? (2)(2) Can we still guarantee the convergence of the weighted gossip process to a limit or a finite limit set by following a sequence of gossiping pairs in a decentralized manner? (3)(3) How does the potential presence of absorbing states in gossip updates impact the consensus weight vector? These three questions are fully addressed in this paper.

To understand the phenomenon described in the first question, we introduce a concept which we call ww-holonomy of a set of stochastic matrices. This concept helps us describe the set of stochastic matrices that possess finite orbit sets when acting on some vectors. Such matrices are the ones enabling the appearance of holonomy groups in gossip processes.

For the second question, we introduce the so-called derived graph of the communication graph GG for the weight vector ww, which we denote by DG(w)D_{G}(w). Infinite exhaustive closed walks in the derived graph will correspond to allowable sequences of updates in the gossip process. These updates can be implemented in a decentralized manner, and yield a process which converges to a finite limit set.

For the third question, the presence of zeros and ones in the update matrices can significantly impact the consensus weights in our analysis. In particular, they can lead to some agents not contributing to the consensus value average and to the appearance of permutation matrices as update matrices. In fact, even when none of the update matrices are permutation matrices, their product within the gossip iteration can result in a permutation matrix (as will be illustrated later). This fact greatly complicates the analysis, and is at the root of the existence of finite limit sets.

The paper is organized as follows: We provide a brief review of the relevant literature on distributed control and weighted average consensus in the following paragraph. We then describe the notations and conventions used in the paper at the end of this section. In Section 2, we provide a precise formulation of the problem solved in this paper. The notion of holonomy and the main results of the paper are presented in Section 3. The proof of the main theorem is provided in Section 4 along with some auxiliary results. A summary of the results of the paper and outlook for future research are provided in Section 5.

Literature Review

In recent decades, there has been an increase in the applications of multi-agent systems and distributed control. These applications aim to achieve consensus among agents, as seen in works like [7, 8, 9, 10, 11, 12]. Many of these systems involve agents with multiple states, highlighting the importance of addressing weighted average consensus.

The field of weighted average consensus has seen diverse perspectives and contributions over the years, such as [1, 9, 13, 14, 15, 16, 17] and [18]. Research has tackled challenges like time delays and asynchronous information spread [19, 20, 21], as well as changing network topologies due to link failures or reconfiguration [20, 22, 23]. Moreover, works presented by [24, 25] have focused on continuous-time consensus problems. Various techniques have been used to solve consensus problems, including Lyapunov function-based methods [14, 26], and approaches inspired by ergodicity theory [27, 28, 29]. Furthermore, research efforts have addressed the constant network topology driven by the gossip process [4, 5, 30, 31]. Our work falls within the scope of this latter category of research.

Notations and conventions.

We denote by G=(V,E)G=(V,E) an undirected graph, with V={v1,,v|V|}V=\{v_{1},\ldots,v_{|V|}\} the node set and E={e1,,e|E|}E=\{e_{1},\ldots,e_{|E|}\} the edge set. The edge linking nodes viv_{i} and vjv_{j} is denoted by (vi,vj)(v_{i},v_{j}), a self-arc or self loop is denoted by (vi,vi)(v_{i},v_{i}). We call GG simple if it has no self-loops.

Given a sequence of edges γ=e1ek\gamma=e_{1}\cdots e_{k} in EE, a node vVv\in V is called covered by γ\gamma if it is incident to an edge in γ\gamma. Given a sequence γ=e1e2\gamma=e_{1}e_{2}\cdots, we say that γ\gamma^{\prime} is a string of γ\gamma if it is a contiguous subsequence, i.e., γ=ekek+1el\gamma^{\prime}=e_{k}e_{k+1}\cdots e_{l} for some k1k\geq 1 and lkl\geq k. Let γ=e1ek\gamma=e_{1}\cdots e_{k} be a finite sequence and ek+1e_{k+1} be an edge of GG. The sequence e1ekek+1e_{1}\cdots e_{k}e_{k+1} obtained by adding ek+1e_{k+1} to the end of γ\gamma is denoted by γek+1\gamma\lor e_{k+1} .

For a given simple undirected graph GG as above, we denote by \vvG=(V,\vvE)\vv{G}=(V,\vv{E}) a directed graph on the same node set and with a “bidirectionalized” edge set; precisely, \vvE\vv{E} is defined as follows: we assign to every edge (vi,vj)(v_{i},v_{j}) of GG, iji\neq j, two directed edges vivjv_{i}v_{j} and vjviv_{j}v_{i}.

We denote a walk in \vvG\vv{G} either by the succession of edges or the succession of nodes visited. We say that γ=vi1vik\gamma=v_{i_{1}}\cdots v_{i_{k}} is a walk in the directed graph \vvG\vv{G} if vilvil+1v_{i_{l}}v_{i_{l+1}}, for =1,,k1\ell=1,\cdots,k-1, is an edge of \vvG\vv{G}. We refer to vi1v_{i_{1}} and vikv_{i_{k}} as the starting- and ending-nodes of γ\gamma, respectively. We define γ1:=vikvik1vi1\gamma^{-1}:=v_{i_{k}}v_{i_{k-1}}\cdots v_{i_{1}}. Let γ=vilvil+1vim\gamma^{\prime}=v_{i_{l}}v_{i_{l+1}}\cdots v_{i_{m}} be another walk in G\vec{G}. We denote by γγ=vi1vik,vilvim\gamma\lor\gamma^{\prime}=v_{i_{1}}\cdots v_{i_{k}},v_{i_{l}}\cdots v_{i_{m}} the concatenation of the two walks.

If each edge ee in GG is labeled with some quantity, the graph GG is called a weighted graph. If each edge ee in GG is labeled with some matrix AeA_{e}, then the graph GG is called a matrix-weighted graph.

We say that pnp\in\mathbb{R}^{n} is a probability vector if pi0p_{i}\geq 0 and i=1npi=1\sum_{i=1}^{n}p_{i}=1. The set of probability vectors in n\mathbb{R}^{n} is the (n1)(n-1)-simplex, which is denoted by Δn1\Delta^{n-1}. Its interior with respect to the standard Euclidean topology in n\mathbb{R}^{n} is denoted by intΔn1\operatorname{int}\Delta^{n-1}. Then, if pintΔn1p\in\operatorname{int}\Delta^{n-1}, all entries of pp are positive.

On the space of n×mn\times m real matrices, we define the following semi-norm for a given An×mA\in\mathbb{R}^{n\times m},

AS:=max1jmmax1i1,i2n|ai1jai2j|.\left\|A\right\|_{S}:=\max_{1\leq j\leq m}\max_{1\leq i_{1},i_{2}\leq n}|a_{i_{1}j}-a_{i_{2}j}|.

It should be clear that the semi-norm of AA is zero if and only if all rows of AA are equal.

We let 𝟙\mathds{1} be a vector of all ones, whose dimension will be clear from the context.

The support of a matrix A=[aij]A=[a_{ij}], denoted by supp(A)supp(A), is the set of indices ijij such that aij0a_{ij}\neq 0. We denote by minA\min A the smallest non zero entry of AA: minA=minijsupp(A)aij\min A=\min_{ij\in supp(A)}a_{ij}.

A matrix AA with order nn is reducible if there exist a permutation matrix PP such that,

PAP=[A11A120A22]P^{\top}AP=\left[\begin{array}[]{cc}A_{11}&A_{12}\\ 0&A_{22}\end{array}\right] (1)

where A11A_{11} and A12A_{12} are nonempty square matrices. If AA is not reducible, then AA is called irreducible. For convenience, we denote similarity through the permutation matrix PP by P\sim_{P}.

The spectral radius of a matrix AA is the maximum of the modulus of the elements of its spectrum, denoted by ρ(A)\rho(A). A circle on \mathbb{C} with radius ρ(A)\rho(A) is called spectral circle of the matrix AA. A nonnegative irreducible matrix AA having h>1h>1 eigenvalues on its spectral circle is called imprimitive, and hh is referred to as the index of imprimitivity. If there is only one eigenvalue on the spectral circle of AA, then the matrix AA is primitive.

The period of the ithi^{th} entry of a nonnegative matrix AA is defined as ωA(i):=gcd{m:[Am]ii>0,m}\mathcal{\omega}_{A}(i):=gcd\{m:[A^{m}]_{ii}>0,m\in\mathbb{N}\}. If AA is irreducible, then ωA(i)=ωA(j),i,j\mathcal{\omega}_{A}(i)=\mathcal{\omega}_{A}(j),\forall i,j [32]. This common value is called the period of the matrix AA, denoted by ωA\mathcal{\omega}^{A}.

2 Preliminaries

Let G=(V,E)G=(V,E) be an undirected simple graph on nn nodes. Each node represents an agent, and each agents’ state is a vector in m\mathbb{R}^{m}. We denote the state vector of the agent ii at time tt by xi(t)=[x1i(t)x2i(t)xmi(t)].{x^{i}(t)}=\begin{bmatrix}x_{1}^{i}(t)&x_{2}^{i}(t)&\ldots&x_{m}^{i}(t)\end{bmatrix}^{\top}. The state of the system is the concatenation of the agents’ states

x(t)=[(x1(t))(x2(t))(xn(t))]nm.{x(t)}=[(x^{1}(t))^{\top}(x^{2}(t))^{\top}\ldots(x^{n}(t))^{\top}]^{\top}\in\mathbb{R}^{nm}.

To an edge (vi,vj)E(v_{i},v_{j})\in E, we associate a 2m×2m2m\times 2m row stochastic matrix A~ij={akl}\tilde{A}_{ij}=\{a_{kl}\}. We refer to A~ij\tilde{A}_{ij} as a pre-local stochastic matrix for agents ii and jj. It describes the local information exchange when these two agents interact as part of the gossip process.

The stochastic process we analyze here is described by sequences of edges γ=ei1eit\gamma=e_{i_{1}}\cdots e_{i_{t}}\cdots in GG with the convention that if eit=(vi,vj)e_{i_{t}}=(v_{i},v_{j}), then agents ii and jj update their states according to

[xi(t+1)xj(t+1)]=A~ij[xi(t)xj(t)]\begin{bmatrix}x^{i}(t+1)\\ x^{j}(t+1)\end{bmatrix}=\tilde{A}_{ij}\begin{bmatrix}x^{i}(t)\\ x^{j}(t)\end{bmatrix} (2)

while the other agents’ states remain constant

xk(t+1)=xk(t) for all ki,j.\displaystyle x^{k}(t+1)=x^{k}(t)\mbox{ for all }k\neq i,j. (3)

The update equation for x(t)x(t) is given by the local stochastic matrix Aij{A_{ij}}. It is an nmnm-dimensional stochastic matrix such that the rows/columns corresponding to the states of agent ii and agent jj is the submatrix A~ij\tilde{A}_{ij} (2) and the rows/columns corresponding to the other agents is the identity matrix (3). Then, the gossip process on edge eit=(vi,vj)e_{i_{t}}=(v_{i},v_{j}) at time tt is given by

x(t+1)=Aijx(t).x(t+1)={A}_{ij}x(t). (4)

For example, the local stochastic matrix A12A_{12}, which is associated with the edge (v1,v2)(v_{1},v_{2}), is given by

A12=[A~1202m×(n2)m0(n2)m×2mI(n2)m×(n2)m]A_{12}=\begin{bmatrix}\tilde{A}_{12}&0_{2m\times(n-2)m}\\ 0_{(n-2)m\times 2m}&I_{(n-2)m\times(n-2)m}\end{bmatrix} (5)

We assume here that Aij=AjiA_{ij}=A_{ji}. Hence, when dealing with sequences of edges in G\vec{G}, we associate AijA_{ij} with both vivjv_{i}v_{j} and vjviv_{j}v_{i}. This makes the graph G\vec{G} a directed matrix-weighted graph.

It is important to note that we allow a pre-local stochastic matrix to have zeros in any row. Consequently, it is possible to construct a valid local stochastic matrix by having a single non-zero element in any given row while setting all other elements in that row to zero (e.g. having a standard unit vector as a row). Moreover, this leads to the possibility that a valid stochastic matrix can either be a complete permutation matrix or include a permutation block. We can elaborate on the concept of a permutation block using the following definition. Given an index set π{1,,nm}\pi\subseteq\{1,\ldots,nm\}, a stochastic matrix AA has a permutation block for π\pi if the submatrix of AA with rows/columns indexed by π\pi is a permutation matrix in S|π|S_{|\pi|}. Let πA{\pi_{A}} be the largest index set among the sets indexing permutation submatrices in AA; we refer to it as the maximal permutation index of AA.

To illustrate, consider the following instances of pre-local stochastic matrices, which are associated with the edges e1,e2e_{1},e_{2} and e3e_{3}, respectively, in the graph G\vec{G}:

A~e1:=[0100a21a22a23a241000a41a42a43a44],A~e2:=[00101000b53b54b55b56b63b64b65b66],A~e3:=[c11c12c15c161000c51c52c55c56c61c62c65c66]\tilde{A}_{e_{1}}:=\begin{bmatrix}0&1&0&0\\ a_{21}&a_{22}&a_{23}&a_{24}\\ 1&0&0&0\\ a_{41}&a_{42}&a_{43}&a_{44}\end{bmatrix},\tilde{A}_{e_{2}}:=\begin{bmatrix}0&0&1&0\\ 1&0&0&0\\ b_{53}&b_{54}&b_{55}&b_{56}\\ b_{63}&b_{64}&b_{65}&b_{66}\end{bmatrix},\tilde{A}_{e_{3}}:=\begin{bmatrix}c_{11}&c_{12}&c_{15}&c_{16}\\ 1&0&0&0\\ c_{51}&c_{52}&c_{55}&c_{56}\\ c_{61}&c_{62}&c_{65}&c_{66}\end{bmatrix} (6)

with aij,bija_{ij},b_{ij} and cijc_{ij} real numbers in the open interval (0,1)(0,1). Note that none of the pre-local matrices provided in (6) has a permutation block for any index set; in contrast, they have rows that have a single nonzero entry.

For a finite sequence γ=e1ek\gamma=e_{1}\cdots e_{k} of edges in GG and for a given pair of integers 0stk0\leq s\leq t\leq k, we define the transition matrix Pγ(t:s)P_{\gamma}(t:s) for ts+1t\geq s+1 as follows:

Pγ(t:s):=AetAet1Aes+1P_{\gamma}(t:s):=A_{e_{t}}A_{e_{t-1}}\cdots A_{e_{s+1}} (7)

We set Pγ(t:s)=IP_{\gamma}(t:s)=I for tst\leq s. This allows us to write the following update for the state vector xx at ss :

x(t)=Pγ(t:s)x(s)x(t)=P_{\gamma}(t:s)x(s) (8)

When clear from the context, we will simply write PγP_{\gamma} for Pγ(t:s)P_{\gamma}(t:s).

A pointed cycle in G\vec{G} is a walk vi1vi2vikvi1v_{i_{1}}v_{i_{2}}\cdots v_{i_{k}}v_{i_{1}}, where vi1v_{i_{1}} is called the basepoint of the cycle. Let \vv𝒞\vv{\mathcal{C}} be the set of all pointed cycles in \vvG\vv{G}. We define an equivalence relation on the set \vv𝒞\vv{\mathcal{C}} by saying that Ci,Cj\vv𝒞C_{i},C_{j}\in\vv{\mathcal{C}} are equivalent if they visit the same vertices in the same cyclic order. The set of equivalence classes of pointed cycles are referred to as cycles. By abuse of notation, we also denote by \vv𝒞\vv{\mathcal{C}} the set of cycles.

To each pointed cycle C\vv𝒞C\in\vv{\mathcal{C}}, we assign a transition matrix PCP_{C} as in (7); when we want to emphasize the basepoint, we write PC,iP_{C,i} if the basepoint is viv_{i}.

Given a cycle CC in G\vec{G}, we can reduce the dimension of vectors in Δnm1\Delta^{nm-1} and stochastic matrices Anm×nmA\in\mathbb{R}^{nm\times nm} by removing rows and/or columns corresponding to nodes that are not covered by CC. For example, if C=v1v2v1C=v_{1}v_{2}v_{1}, then we let A¯2m×2m\bar{A}\in\mathbb{R}^{2m\times 2m} be the principal submatrix of AA obtained by keeping the first 2m2m rows and columns; similarly, w¯2m\bar{w}\in\mathbb{R}^{2m} is the subvector of ww obtained by keeping the first 2m2m entries. The cycle CC, and hence the dimension of the operation ¯\bar{\cdot}, will always be clear from the context.

Within the state transition matrix definition (7), we can elaborate on the concept of permutation block. Consider a pointed cycle C1C_{1} to be a walk such that C1:=e1e2e3C_{1}:=e_{1}e_{2}e_{3}. Assume that the associated pre-local stochastic matrices with the edges in C1C_{1} are provided in (6). According to (7), we then have the following:

P¯C,1=A¯e1A¯e2A¯e3=[100000d21d22d230d25d26d31d32d330d35d36d41d42d430d45d46d51d52d53d54d55d56d61d62d63d64d65d66]\bar{P}_{C,1}=\bar{A}_{e_{1}}\bar{A}_{e_{2}}\bar{A}_{e_{3}}=\begin{bmatrix}1&0&0&0&0&0\\ d_{21}&d_{22}&d_{23}&0&d_{25}&d_{26}\\ d_{31}&d_{32}&d_{33}&0&d_{35}&d_{36}\\ d_{41}&d_{42}&d_{43}&0&d_{45}&d_{46}\\ d_{51}&d_{52}&d_{53}&d_{54}&d_{55}&d_{56}\\ d_{61}&d_{62}&d_{63}&d_{64}&d_{65}&d_{66}\end{bmatrix} (9)

with dijd_{ij} real numbers in the open interval (0,1)(0,1). The matrix P¯C,1\bar{P}_{C,1} has a permutation block for the index set π={1}\pi=\{1\}. It is worth noting that, in contrast, none of the pre-local stochastic matrices provided in (6), which are associated with the edges in C1C_{1} have a permutation block for any index set. This observation underlines that even though none of the pre-local stochastic matrices for the edges in C1C_{1} conform to the criteria of a permutation block, the corresponding state transition matrix for the pointed cycle C1C_{1} distinctly features such a permutation block.

3 Main Result

In this section, we introduce the main concepts and present the main result of this paper. As already mentioned, an important concept is the one of holonomy. In differential geometry, holonomy deals with the variation of some quantity (e.g., a vector) along the loops in a given space. If there is no variation in this quantity after completing a loop, the process is defined as holonomic. Otherwise, it is said to be non-holonomic. In our convention, if there exists a finite k>1k>1 such that the quantity does change after completing the loop once but comes back to the initial value after completing the loop kk-times, the process which evolves the said quantity is called finitely non-holonomic. For our purpose, the quantity is a left weight vector, the process evolving the quantity is the gossip process, and the space is the graph G\vec{G}.

Holonomy in the network.

We need the following lemma to introduce the notion of holonomy.

Lemma 1.

Let CiC_{i} and CjC_{j} be two pointed cycles in a cycle CC. If there exists a weight vector ww such that w=w(PC,i)kw=w(P_{C,i})^{k} holds for some positive kk, then there exists a weight vector ww^{\prime} such that w=w(PC,j)kw^{\prime}=w^{\prime}(P_{C,j})^{k} holds.

Proof.

Let C=viv1vjviC=v_{i}v_{1}\cdots v_{j}v_{i} be a cycle in \vvG\vv{G}. Consider the pointed cycles Ci=viv1v2vjviC_{i}=v_{i}v_{1}\cdots v_{2}v_{j}v_{i} and Cj=vjviv1v2vjC_{j}=v_{j}v_{i}v_{1}\cdots v_{2}v_{j} in the cycle CC. According to the statement, it holds that:

w=w(PC,i)k where PC,i=AvjviAv2vjAviv1.w=w({P}_{C,i})^{k}\mbox{ where }{P}_{C,i}={A}_{v_{j}v_{i}}{A}_{v_{2}v_{j}}\cdots{A}_{v_{i}v_{1}}. (10)

By multiplying (10) by the matrix Avjvi{A}_{v_{j}v_{i}} from the right, we get

wAvjvi\displaystyle w{A}_{v_{j}v_{i}} =w(AvjviAv2vjAviv1)kAvjvi\displaystyle=w({A}_{v_{j}v_{i}}{A}_{v_{2}v_{j}}\cdots{A}_{v_{i}v_{1}})^{k}{A}_{v_{j}v_{i}}
=wAvjvi(Av2vjAviv1Avjvi)k=wAvjvi(PC,j)k\displaystyle=w{A}_{v_{j}v_{i}}({A}_{v_{2}v_{j}}\cdots{A}_{v_{i}v_{1}}{A}_{v_{j}v_{i}})^{k}=w{A}_{v_{j}v_{i}}({P}_{C,j})^{k}

It is clear that the product wAvjviw{A}_{v_{j}v_{i}} is a weight vector. This shows that there exists a weight vector ww^{\prime} such that w=w(PC,j)kw^{\prime}=w^{\prime}({P}_{C,j})^{k} and it is equal to wAvjviw{A}_{v_{j}v_{i}}. ∎

Thanks to Lemma 1, we can introduce the following definition:

Definition 3.1 (Holonomic Stochastic Matrices).

Let CC be a cycle in \vvG\vv{G} of length greater than 22 and wintΔnm1w^{\top}\in\operatorname{int}\Delta^{nm-1} be a weight vector. The ww-order of CC is defined as

ordwC:=min{k1:w¯=w¯(P¯C)k},\operatorname{ord}_{w}C:=\min\{k\geq 1:\bar{w}=\bar{w}(\bar{P}_{C})^{k}\},

and ordwC=0\operatorname{ord}_{w}C=0 if the set is empty. The local stochastic matrices AeA_{e}, eCe\in C, are said to be ww-holonomic for CC if there exists a weight vector ww such that ordwC\operatorname{ord}_{w}C is finite and non-zero.

Note that the definition of holonomy is independent from the basepoint of CC in \vvG\vv{G}. We observe that if w¯=w¯(P¯C)k\bar{w}=\bar{w}(\bar{P}_{C})^{k} holds for some positive integer kk, then w¯=w¯(P¯C)nk\bar{w}=\bar{w}(\bar{P}_{C})^{nk} holds for all nn\in\mathbb{N}, thus making the set of integers kk for which w¯=w¯(P¯C)k\bar{w}=\bar{w}(\bar{P}_{C})^{k} holds of infinite cardinality.

The local stochastic matrices AeA_{e} are ww-holonomic for GG if there exists a common weight vector ww so that the AeA_{e}’s are ww-holonomic for all C\vv𝒞C\in\vv{\mathcal{C}} of length greater than 22.

It is easy to see that the ww-order of a given cycle CC can vary as a function of ww. For our purpose, we need to consider the ww’s that yield the largest ww-order and thus define the order of a cycle as

ordC:=supwintΔnm1ordwC.\operatorname{ord}C:=\sup_{w^{\top}\in\operatorname{int}\Delta^{nm-1}}\operatorname{ord}_{w}C. (11)

Now, we can define the holonomy of a gossip process as follows.

  • If ordC=0\operatorname{ord}C=0, then the process on CC is non-holonomic.

  • If ordC=1\operatorname{ord}C=1, then the process on CC is holonomic.

  • If ordC>1\operatorname{ord}C>1, then the process on CC is finitely non-holonomic.

One can assign a (holonomy) group to the process on the cycle CC if ordC1\operatorname{ord}C\geq 1. If ordC=1\operatorname{ord}C=1, the cycle CC is said to have trivial holonomy since the corresponding group has only identity operation (trivial group). If ordC>1\operatorname{ord}C>1, the cycle CC is said to have non-trivial holonomy since the corresponding group is a cyclic group with an order greater than one.

We denote the orbit set of a weight vector ww around a cycle CC as 𝒪wC\mathcal{O}_{w}^{C}, that is,

𝒪wC:={wC(a)nm|wC(a)=w(PC)a for a}.\mathcal{O}_{w}^{C}:=\{w_{C}^{(a)}\in\mathbb{R}^{nm}|w_{C}^{(a)}=w({P}_{C})^{a}\mbox{ for }a\in\mathbb{N}\}. (12)

For the sake of simplicity, if a=0a=0, we denote the weight vector wC(0)w^{(0)}_{C} as wC:=ww_{C}:=w.

Graph topology.

In an undirected graph GG, two nodes viv_{i} and vjv_{j} are called connected if the graph GG contains a path from viv_{i} to vjv_{j}. We need the following notion.

Definition 3.2 (Bridge).

Let G=(V,E)G=(V,E) be an undirected graph. Let 𝒮G\mathcal{S}_{G} be the set of pairs of nodes that are connected in GG. Let G~e\tilde{G}_{e} be the undirected graph obtained by removing the edge ee from GG, that is, G~e=(V,E{e})\tilde{G}_{e}=(V,E\setminus\{e\}). If |𝒮G||\mathcal{S}_{G}| is strictly greater than |𝒮G~e||\mathcal{S}_{\tilde{G}_{e}}|, the edge ee is called a bridge (cut-edge) of GG.

A graph GG without a bridge is called bridgeless. We record the following result characterizing cut-edges.

Proposition 1.

An edge ee in a connected graph GG is a bridge if and only if no cycles of GG contain both vertices adjacent to ee.

See [33, Theorem 3.3] for a proof of Proposition  1. Paraphrasing, the statement says that every node in a connected, simple, bridgeless graph GG is covered by at least one cycle.

Derived graphs.

We now focus on describing the allowed sequences of updates which yields the consensus at the limit for the gossip process. These will be defined via paths in what we call the derived graph of GG by ww, denoted by DG(w)D_{G}(w). We present this graph as a geometric graph, with nodes embedded in nm\mathbb{R}^{nm}.

Definition 3.3 (Derived Graph DG()D_{G}(\cdot)).

Let G=(V,E)G=(V,E) be a matrix-weighted graph on nn nodes, with weights Aenm×nmA_{e}\in\mathbb{R}^{nm\times nm}. For a weight vector wintΔnm1w^{\top}\in\operatorname{int}\Delta^{nm-1}, the derived graph of GG generated by ww, denoted by DG(w)=(Nw,\vvEw)D_{G}(w)=(N_{w},\vv{E}_{w}), is a directed matrix-weighted graph, possibly with multi-edges and self-loops, with Nw=C\vv𝒞𝒪wCN_{w}=\bigcup_{C\in\vv{\mathcal{C}}}\mathcal{O}_{w}^{C}. For wi,wj𝒪wCw_{i},w_{j}\in\mathcal{O}_{w}^{C}, there exists an edge wiwj\vvEww_{i}w_{j}\in\vv{E}_{w} if wi=wjPCw_{i}=w_{j}P_{C} for a (pointed) cycle CC; in this case, the edge weight is PCP_{C}.

Remark 1.

Note that the derived graph being a geometric graph, ensures that elements of distinct orbit sets with the same coordinates correspond to a unique vertex in the derived graph.

Let e\vvEwe\in\vv{E}_{w} have weight PCP_{C} with C=vivi+1vkviC=v_{i}v_{i+1}\cdots v_{k}v_{i}. We set ψ(e)=vivi+1vkvi\psi(e)=v_{i}v_{i+1}\cdots v_{k}v_{i}. We extend the domain of ψ\psi to the set of paths in DG(w){D}_{G}(w) according to

ψ(γe)=ψ(e)ψ(γ)\psi(\gamma\lor e)=\psi(e)\lor\psi(\gamma)

for any walk γ\gamma in DG(w){D}_{G}(w). Note the order reversal in the above equation, a change that is essential for maintaining coherence. The gossip process evolves as the left multiplication of local stochastic matrices while the paths in the derived graph DG(w)D_{G}(w) correspond to the right multiplication of matrices with row vectors.

We provide an example for the derivation of DG(w)D_{G}(w).

Example 1.

Consider a simple, connected, bridgeless graph with matrix weights Aenm×nmA_{e}\in\mathbb{R}^{nm\times nm} as depicted in Figure 1(a). Consider the pointed cycles C1=v1v2v3v1C_{1}=v_{1}v_{2}v_{3}v_{1}, C2=v4v1v5v4C_{2}=v_{4}v_{1}v_{5}v_{4} and C3=v5v7v6v5C_{3}=v_{5}v_{7}v_{6}v_{5} in GG. Assume that the set of local stochastic matrices Ae,eEA_{e},e\in E is ww-holonomic for GG and the corresponding orbit sets of the weight vector ww around each of these cycles are

𝒪wC1\displaystyle\mathcal{O}_{w}^{C_{1}} ={w,wC1(1),,wC1(k12),wC1(k11)}\displaystyle=\{w,w^{(1)}_{C_{1}},\cdots,w^{(k_{1}-2)}_{C_{1}},w^{(k_{1}-1)}_{C_{1}}\}
𝒪wC2\displaystyle\mathcal{O}_{w}^{C_{2}} ={w,wC2(1),,wC2(k22),wC2(k21)}\displaystyle=\{w,w^{(1)}_{C_{2}},\cdots,w^{(k_{2}-2)}_{C_{2}},w^{(k_{2}-1)}_{C_{2}}\}
𝒪wC3\displaystyle\mathcal{O}_{w}^{C_{3}} ={w}\displaystyle=\{w\}

where the ww-order of the cycles C1C_{1}, C2C_{2} and C3C_{3} are k1,k2k_{1},k_{2} and 11, respectively.

v1v_{1}v2v_{2}v3v_{3}v4v_{4}v5v_{5}v6v_{6}v7v_{7}
(a) The graph \vvG\vv{G}
wC2(1){w}_{C_{2}}^{(1)} wC2(k21){w}_{C_{2}}^{(k_{2}-1)} wC2(k22){w}_{C_{2}}^{(k_{2}-2)} w{w}wC1(1){w}_{C_{1}}^{(1)} wC1(k11){w}_{C_{1}}^{(k_{1}-1)} wC1(k12){w}_{C_{1}}^{(k_{1}-2)} eC20e_{C_{2}}^{0}eC2k22e_{C_{2}}^{k_{2}-2}eC2k21e_{C_{2}}^{k_{2}-1}eC10e_{C_{1}}^{0}eC3e_{C_{3}}eC1k12e_{C_{1}}^{k_{1}-2}eC1k11e_{C_{1}}^{k_{1}-1}
(b) The graph DG(w)D_{G}(w)
Figure 1: The graph G\vec{G} and DG(w)D_{G}(w)

We denote an edge ee in the graph DG(w)D_{G}(w) as eCike_{C_{i}}^{k} if its weight is the matrix PCiP_{C_{i}} and its starting node is wCi(k)w_{C_{i}}^{(k)}. By abuse of notation, we denote a self-loop ee in the graph DG(w)D_{G}(w) as eCie_{C_{i}} if its weight is the matrix PCiP_{C_{i}}. Consider the path γ:=eC1k11eC3eC20\gamma:=e_{C_{1}}^{k_{1}-1}\lor e_{C_{3}}\lor e_{C_{2}}^{0} in the graph DG(w)D_{G}(w). “Travelling” over the path γ\gamma in DG(w)D_{G}(w) translates into the matrix product

wC1(k11)PC1PC3PC2(=wC2(1)).{w}_{C_{1}}^{(k_{1}-1)}P_{C_{1}}P_{C_{3}}P_{C_{2}}(={w}_{C_{2}}^{(1)}).

Now, we can state our main theorem.

Theorem 1.

Let G=(V,E)G=(V,E) be a simple, connected, bridgeless graph on nn nodes with matrix-valued edge weights AeA_{e}, eEe\in E. Let wintΔnm1w\in\mathrm{int}\Delta^{nm-1} be such that the set of local stochastic matrices {Aenm×nm,eE}\{A_{e}\in\mathbb{R}^{nm\times nm},e\in E\} is ww-holonomic for GG. Then, for any infinite exhaustive closed walk γ\gamma in DG(w)D_{G}(w), we have that:

  1. (i)

    The limit set of Pψ(γ){P}_{\psi(\gamma)} is a finite set \mathcal{L}.

  2. (ii)

    There exists a relabeling of the nodes such that each element of \mathcal{L} can be expressed as

    Pψ(γ)=[P~ψ(γ)00Mψ(γ)]P_{\psi(\gamma)}=\left[\begin{array}[]{cc}\tilde{P}_{\psi(\gamma)}&0\\ 0&M_{\psi(\gamma)}\end{array}\right]

    where P~ψ(γ)\tilde{P}_{\psi(\gamma)} is a permutation matrix with rows/columns indexed by the set C\vv𝒞πPC\cap_{C\in\vv{\mathcal{C}}}\pi_{P_{C}} and Mψ(γ)M_{\psi(\gamma)} is a block diagonal matrix.

  3. (iii)

    The blocks Mψ(γ)iiM_{\psi(\gamma)}^{ii} of Mψ(γ)M_{\psi(\gamma)} are rank-one matrices.

We will provide an explicit description of the limit set \mathcal{L} and the block Mψ(γ)iiM_{\psi(\gamma)}^{ii} as a function of ww in the proof of Theorem 1. The descriptions of the blocks Mψ(γ)iiM_{\psi(\gamma)}^{ii} guarantee that they have no zero entries. It follows that if the transition matrix PCP_{C} for a cycle CC has a 11 in a row (e.g. having transpose of a standard unit vector in |nm|\mathbb{R}^{|nm|} as a row), then the 11 in the row either shows up in the permutation block P~ψ(γ)\tilde{P}_{\psi(\gamma)} for a matrix in the limit set or disappears by converging a value as a function of ww in the block Mψ(γ)M_{\psi(\gamma)}.

Note that Theorem 1 holds for a broader class of allowable sequences but we only consider what the derived graph has generated.

4 Analysis and Proofs of Theorems

In this section, we analyze the cycles that have nonzero order and then prove Theorem 1. The major contribution of this section is to develop a necessary condition for a cycle to have nonzero order and to provide a proof of Theorem 1. We start with an analysis of the spectrum of the product of local stochastic matrices and its relationship with holonomy.

4.1 Cycles with Nonzero Order

Let S1:={z:|z|=1}S^{1}:=\{z\in\mathbb{C}:|z|=1\} and let S1:=S1\{1}S^{1^{*}}:=S^{1}\backslash\{1\}. The following lemma provides a necessary condition for a cycle to have non-trivial holonomy in GG.

Lemma 2.

If a cycle CC has non-trivial holonomy, then the matrix P¯C\bar{P}_{C} has at least one eigenvalue in the set S1S^{1^{*}}.

Proof.

Let σ(P¯C)\sigma(\bar{P}_{C}) be the spectrum of the matrix P¯C\bar{P}_{C}. If CC has non-trivial holonomy, then there exists w¯\bar{w} such that w¯=w¯(P¯C)k\bar{w}=\bar{w}(\bar{P}_{C})^{k}, for some k>1k>1 and w¯w¯(P¯C)\bar{w}\neq\bar{w}(\bar{P}_{C}). The former condition implies that λσ(P¯C)\exists\lambda\in\sigma(\bar{P}_{C}) such that λk=1\lambda^{k}=1 or, equivalently, λS1\lambda\in S^{1} and the latter implies that λ1\lambda\neq 1, from which the result follows. ∎

Motivated by the previous Lemma, we now seek to describe stochastic matrices whose spectra have non-empty intersections with S1S^{1^{*}}. Note that the spectral radius of a stochastic matrix is 11 and its spectral circle is S1S^{1}.

Let AA be a reducible matrix. We know that APBA\sim_{P}B where BB is of the form:

B=[B11B1rB1,r+1B1m0BrrBr,r+1BrmBr+1,r+1000Bmm]B=\left[\begin{array}[]{ccccccc}B_{11}&\cdots&B_{1r}&B_{1,r+1}&\cdots&B_{1m}\\ &\ddots&\vdots&\vdots&\vdots&\vdots\\ \textbf{0}&&B_{rr}&B_{r,r+1}&\cdots&B_{rm}\\ &&&B_{r+1,r+1}&&\textbf{0}\\ &\textbf{0}&&&\ddots&\\ &&&\textbf{0}&&B_{mm}\\ \end{array}\right] (13)

where each principal block BiiB_{ii} is either irreducible or the zero matrix [34]; we say that the form of BB given in (13) is a canonical form for reducible matrices. In the literature, the subset of states corresponding to BkkB_{kk} for 1kr1\leq k\leq r is called the kthk^{th} transient class of matrix BB and the subset of states corresponding to Br+j,r+jB_{r+j,r+j} for j1j\geq 1 is called the jthj^{th} ergodic class of matrix BB.

Lemma 3.

Let CC be a cycle with nonzero order. Then, there exists a permutation matrix PP such that P¯CPB\bar{P}_{C}\sim_{P}B where BB is as in (13) with r=0r=0 and m1m\geq 1.

Proof.

If the matrix P¯C\bar{P}_{C} is irreducible, then the result trivially holds for P=IP=I. If the matrix P¯C\bar{P}_{C} is reducible, by definition, there exists a permutation matrix PP such that P¯CPB\bar{P}_{C}\sim_{P}B where BB as in (13). The principal submatrices BiiB_{ii} for iri\leq r act solely on the transient states of P¯C\bar{P}_{C}. Therefore, ρ(Bii)<1\rho(B_{ii})<1 for iri\leq r. This then implies that the matrices BiiB_{ii}, iri\leq r are convergent. Since ordC>0\operatorname{ord}C>0, the matrix BB cannot have convergent submatrices on the diagonal, from which the result follows. ∎

Paraphrasing, the statement says that the matrix P¯C\bar{P}_{C} is permutationally similar to a matrix BB such that it is block-diagonal with principal blocks being irreducible matrices. Note that a permutation matrix PP is an irreducible stochatic matrix. Isolating the permutation part of P¯C\bar{P}_{C}, we can further write, up to relabeling, that

P¯CP[BC0000MC]={blockarray}c@rr@clπ0C&π1CπmC
{block}
[c@rr@c]lBC000π0C
BC11π1C

0
BCmmπmC
.
\bar{P}_{C}\sim_{P}\left[\begin{array}[]{cc}B^{00}_{C}&0\\ 0&M_{C}\end{array}\right]=\blockarray{c@{\hskip 10.0pt}rr@{\hskip 10.0pt}cl}\mbox{\scriptsize$\pi^{C}_{0}$}&\mbox{\scriptsize$\pi^{C}_{1}$}\cdots\mbox{\scriptsize$\pi^{C}_{m}$}\\ \block{[c@{\hskip 10.0pt}rr@{\hskip 10.0pt}c]l}B^{00}_{C}\textbf{0}\mbox{\scriptsize$\pi^{C}_{0}$}\\ B^{11}_{C}\mbox{\scriptsize$\pi^{C}_{1}$}\\ \ddots\mbox{\scriptsize\vdots}\\ \textbf{0}B^{mm}_{C}\mbox{\scriptsize$\pi^{C}_{m}$}\\ .
(14)

where the principal submatrix BC00B^{00}_{C} is a permutation matrix. For convenience, we denote the irreducible block Br+j,r+jB_{r+j,r+j} in (13) by BCjjB^{jj}_{C}.

Let πjC\pi^{C}_{j} be the set of indices labeling the rows/columns of the corresponding block matrix BCjj{B^{jj}_{C}} up to relabeling through the matrix PP. It then follows that we have a partition of the index set {1,,nm}\{1,\cdots,nm\} induced by CC, denoted by πC\pi^{C}, precisely πC:={{πiC}i=0m}\pi^{C}:=\{\{\pi^{C}_{i}\}_{i=0}^{m}\}. As a matter of convention, we use the notation π0C\pi^{C}_{0} interchangeably with πPC\pi_{P_{C}} to refer to the largest index set among the sets that index the permutation submatrices in PCP_{C}. We will revisit the concept of the partition of the index set induced by a cycle in the next sub-section. We now aim to better understand irreducible principal blocks of the matrix MCM_{C}. Hence, without loss of generality, we can assume that P¯C\bar{P}_{C} is irreducible (e.g., only the B11B_{11} block is nontrivial).

More is known about irreducible stochastic matrices. Let AA be an irreducible stochastic matrix; then there exists a unique vector pp satisfying

Ap=pp>0 and p1=1Ap=p\mbox{, }p>0\mbox{ and }\|p\|_{1}=1 (15)

which is called the Perron vector. The Perron-Frobenius Theorem says that an irreducible primitive stochastic matrix AA converges to a scrambling matrix whose rows are equal to the Perron vector of AA^{\top} [34], which is called the row Perron vector of the matrix AA. To be more precise, limkAk=𝟙q\lim_{k\to\infty}A^{k}=\mathds{1}q^{\top} where Aq=qA^{\top}q^{\top}=q^{\top}.

We now recall an extension of the Perron-Frobenious Theorem.

Lemma 4 (Frobenius Form).

For each imprimitive matrix AA with index of imprimivity h>1h>1, there exists a permutation matrix PP such that APFA\sim_{P}F where FF is of the form:

F=[0A120000A230000Ah1,hAh1000],F=\left[\begin{array}[]{ccccc}\textbf{0}&A_{12}&0&\cdots&0\\ 0&\textbf{0}&A_{23}&\cdots&0\\ \vdots&\vdots&\ddots&\ddots&\vdots\\ 0&0&\cdots&\textbf{0}&A_{h-1,h}\\ A_{h1}&0&\cdots&0&\textbf{0}\end{array}\right], (16)

where the zero blocks, denoted by 0, on the main diagonal are square.

See [34] for a proof of Lemma 4. This is known as Frobenius Form for an irreducible imprimitive matrix. It is easy to see that the matrix FhF^{h} is a block diagonal matrix with blocks that are primitive.

4.2 Permutation Blocks

We now aim to gain a better understanding of permutation matrices. We demonstrate that the block matrix BC00B^{00}_{C} is similar to a matrix that exhibits a permutation block structure after relabeling in (14). The following proposition establishes that the matrix BC00B^{00}_{C} itself possesses a permutation block structure.

Proposition 2.

If AA is a stochastic matrix which is conjugate to a permutation matrix, then AA is a permutation matrix.

To prove Proposition 2, we use the following lemma.

Lemma 5.

A matrix AA is a stochastic matrix with A1A^{-1} also a stochastic matrix if and only if AA is a permutation matrix.

See [35] for a proof of Lemma 5. As an easy corollary, we have the following lemma:

Lemma 6.

A matrix AA is a stochastic matrix with all its eigenvalues on the unit circle if and only if AA is a permutation matrix.

Using Lemma 5, we now prove Proposition 2:

Proof of Proposition 2..

Let A=PSP1A=PSP^{-1} where SSnS\in S_{n}, and P𝔾𝕃(n)P\in\mathbb{GL}(n). We have that Ak=PSkP1.A^{k}=PS^{k}P^{-1}. Since SS is a permutation matrix, there exists a k>1k>1 so that Sk=IS^{k}=I. Putting the previous two equalities together, we get

Ak=PIP1=PP1=I.A^{k}=PIP^{-1}=PP^{-1}=I.

Hence, we have A1=Ak1A^{-1}=A^{k-1}. Furthermore, by Lemma 5, the matrices A1A^{-1} and AA are permutation matrices. ∎

4.3 Non-Permutation Blocks

Proposition 3.

For an exhaustive closed walk γ\gamma in the derived graph DG(w){D}_{G}(w), there exists a permutation matrix PP such that

Pψ(γ)P{blockarray}c@rr@clπ0ψ(γ)&π1ψ(γ)πlψ(γ)
{block}
[c@rr@c]lBψ(γ)000π0ψ(γ)
Bψ(γ)11π1ψ(γ)

0
Bψ(γ)llπlψ(γ)
P_{\psi(\gamma)}\sim_{P}\blockarray{c@{\hskip 10.0pt}rr@{\hskip 10.0pt}cl}\mbox{\scriptsize$\pi^{\psi(\gamma)}_{0}$}&\mbox{\scriptsize$\pi^{\psi(\gamma)}_{1}$}\cdots\mbox{\scriptsize$\pi^{\psi(\gamma)}_{l}$}\\ \block{[c@{\hskip 10.0pt}rr@{\hskip 10.0pt}c]l}B_{\psi(\gamma)}^{00}\textbf{0}\mbox{\scriptsize$\pi^{\psi(\gamma)}_{0}$}\\ B_{\psi(\gamma)}^{11}\mbox{\scriptsize$\pi^{\psi(\gamma)}_{1}$}\\ \ddots\mbox{\scriptsize\vdots}\\ \textbf{0}B_{\psi(\gamma)}^{ll}\mbox{\scriptsize$\pi^{\psi(\gamma)}_{l}$}\\
(17)

where Bψ(γ)iiB_{\psi(\gamma)}^{ii} are irreducible matrices for i1i\geq 1, the rows/columns of which are indexed by the set πiψ(γ)\pi^{\psi(\gamma)}_{i} for i1i\geq 1 and the rows/columns of the permutation matrix Bψ(γ)00B_{\psi(\gamma)}^{00} are indexed by the set Cψ(γ)π0C\cap_{C\in\psi(\gamma)}\pi^{C}_{0}.

To prove Proposition 3, we need to follow lemmas and definitions. First, we know that BC00B^{00}_{C} is a permutation matrix for a cycle CC with nonzero order due to Lemma 2. We need the following lemma to show that Bγ(ψ)00B^{00}_{\gamma(\psi)} is also a permutation matrix:

Lemma 7.

If C=ABC=AB where CSnC\in S_{n} and AA,BB are stochastic matrices order nn, then BSnB\in S_{n} and ASnA\in S_{n}

Proof.

We have |det(C)|=1|\det(C)|=1 since it is a permutation matrix. Thanks to the homomorphism of the determinant, we have the following:

det(C)\displaystyle\det(C) =det(A)det(B)\displaystyle=\det(A)\det(B)
1\displaystyle 1 =|det(A)||det(B)|\displaystyle=|\det(A)||\det(B)|

This shows that |det(A)||\det(A)| and |det(B)||\det(B)| are 11. It implies that σ(A)\sigma(A) and σ(B)\sigma(B) lie on the unit circle. Lemma 6 then implies that ASnA\in S_{n} and BSnB\in S_{n}. ∎

We need the following lemma to study properties of irreducible block matrices in the submatrix Mψ(γ)M_{\psi(\gamma)}.

Lemma 8.

For a nonzero ordwC\operatorname{ord}_{w}C, there exists a permutation matrix PP such that the matrix (MC)ordwCPB(M_{C})^{\operatorname{ord}_{w}C}\sim_{P}B where BB is a block diagonal matrix with blocks that are primitive.

Proof.

From Lemma 3 and Definition 3.1, we know that the following holds:

[wp,wM]w¯(P¯C)ordwC=[wp,wM][BC0000MC]ordwC=[wp(BC00)ordwC,wM(MC)ordwC]=[wp,wM]w¯\underbrace{[w_{p},w_{M}]}_{\bar{w}}(\bar{P}_{C})^{\operatorname{ord}_{w}C}=[w_{p},w_{M}]\left[\begin{array}[]{cc}B^{00}_{C}&0\\ 0&M_{C}\end{array}\right]^{\operatorname{ord}_{w}C}=[w_{p}(B^{00}_{C})^{\operatorname{ord}_{w}C},\underbrace{w_{M}(M_{C})^{\operatorname{ord}_{w}C}}_{*}]=\underbrace{[w_{p},w_{M}]}_{\bar{w}} (18)

up to relabeling. From the definition of Perron vector and ()(*), we know that ordwC\operatorname{ord}_{w}C is a multiple of the index of imprimivity of all block matrices BCjjB^{jj}_{C} for j1j\geq 1, from which the result follows.∎

For convenience, we denote the matrix (MC)ordwC(M_{C})^{\operatorname{ord}_{w}C} by (MC)w(M_{C})^{w}. To prove Proposition 3, we need to consider the graph of a matrix. Let AA be a matrix. Let 𝔾A\mathbb{G}_{A} be a directed graph such that the transpose of its adjacency matrix is equal to the matrix obtained by replacing non-zero entries of the matrix AA by one. The directed graph 𝔾A\mathbb{G}_{A} is called the graph of AA.

For an irreducible matrix, the period of the matrix is equal to the index of imprimivity of the matrix [36]. Then, one can easily prove the following corollary to Lemma 8:

Lemma 9.

The graph of the matrix (MC)w(M_{C})^{w} is the union of the graphs of the submatrices (BCjj)w(B^{jj}_{C})^{w}, each of which is strongly connected with self-arc at every node for j1j\geq 1.

We need to introduce the composition of the graph of matrices. Let 𝔾A\mathbb{G}_{A} and 𝔾B\mathbb{G}_{B} be two directed graphs with the same node set VV. The composition of 𝔾A\mathbb{G}_{A} with 𝔾B\mathbb{G}_{B}, denoted by 𝔾B𝔾A\mathbb{G}_{B}\circ\mathbb{G}_{A}, is a digraph with the node set VV and the edge set defined as follows: vivjv_{i}v_{j} is an edge of 𝔾B𝔾A\mathbb{G}_{B}\circ\mathbb{G}_{A} whenever there is a node vkv_{k} such that vivkv_{i}v_{k} is an edge of 𝔾A\mathbb{G}_{A} and vkvjv_{k}v_{j} is an edge of 𝔾B\mathbb{G}_{B} [5]. We have the following Lemma based on the composition definition:

Lemma 10.

For any sequence of stochastic matrices A1,A2,,AkA_{1},A_{2},\cdots,A_{k} which are all of the same size, we have that 𝔾AkA2A1=𝔾Ak𝔾A2𝔾A1\mathbb{G}_{A_{k}\cdots A_{2}A_{1}}=\mathbb{G}_{A_{k}}\circ\cdots\circ\mathbb{G}_{A_{2}}\circ\mathbb{G}_{A_{1}}

See [22, Lem. 5] for a proof of Lemma 10. One can easily prove the following lemma:

Lemma 11.

If the graphs 𝔾A\mathbb{G}_{A} and 𝔾B\mathbb{G}_{B} have self-arcs at every node, then the union of the arc sets of 𝔾A\mathbb{G}_{A} and 𝔾B\mathbb{G}_{B} is a subset of the arc set of the graph 𝔾AB\mathbb{G}_{AB}

The condition of the Lemma says that AA and BB have all non-zero diagonal entries. Using the preceding lemmas, we now prove Proposition 3:

Proof of Proposition 3..

By construction, a cycle in DG(w)D_{G}(w) has no chord. Hence, it is guaranteed that any exhaustive walk γ\gamma in DG(w)D_{G}(w) is a concatenation of all the cycles and the self-loops in DG(w)D_{G}(w). Let CaC_{a} and CbC_{b} be cycles in GG with ww-orders kak_{a} and kbk_{b}, respectively. It is sufficient to check the particular case for γ\gamma. Assume that γ:=eCa1eCa2eCaka1eCb1eCbkbeCaka1\gamma:=e_{C_{a}}^{1}e_{C_{a}}^{2}\cdots e_{C_{a}}^{k_{a}-1}e_{C_{b}}^{1}\cdots e_{C_{b}}^{k_{b}}\cdots e_{C_{a}}^{k_{a}-1}. We map the walk γ\gamma to a sequence of edges in GG via ψ(.)\psi(.):

ψ(γ)=CaCa|𝒪wCa|CbCb|𝒪wCb|CaCa|𝒪wCa|.\psi(\gamma)=\underbrace{C_{a}\cdots C_{a}}_{|\mathcal{O}_{w}^{C_{a}}|}\cdots\underbrace{C_{b}\cdots C_{b}}_{|\mathcal{O}_{w}^{C_{b}}|}\underbrace{C_{a}\cdots C_{a}}_{|\mathcal{O}_{w}^{C_{a}}|}.

The matrix PCP_{C} is the product of the local stochastic matrices associated with eCe\in C. For convenience, we denote the matrix PC|𝒪wC|P_{C}^{|\mathcal{O}_{w}^{C}|} by PCwP_{C}^{w}. Then, we have the following:

Pψ(γ)\displaystyle P_{\psi(\gamma)} =PCawPCbwPCaw.\displaystyle=P^{w}_{C_{a}}\cdots P^{w}_{C_{b}}P^{w}_{C_{a}}. (19)

Let πCa\pi^{C_{a}} and πCb\pi^{C_{b}} be the partition of the index sets induced by the cycles CaC_{a} and CbC_{b}, respectively. Due to Lemma 8 and (19), if the partitions πCa\pi^{C_{a}} and πCb\pi^{C_{b}} are the same, the result trivially holds and πψ(γ)=πCa\pi^{\psi(\gamma)}=\pi^{C_{a}}. It remains to prove the result if the partitions πCa\pi^{C_{a}} and πCb\pi^{C_{b}} are different.

First consider the permutation block in each matrix PCawP^{w}_{C_{a}} and PCbwP^{w}_{C_{b}}. Assume that for some index ii, iπ0Cai\in\pi^{C_{a}}_{0}, but iπ0Cbi\notin\pi^{C_{b}}_{0}, the permutation index set for the matrix PCawPCbwP^{w}_{C_{a}}P^{w}_{C_{b}} does not contain the index ii due to Lemma 7. Owing to the above, the corresponding matrix Pψ(γ)P_{\psi(\gamma)} only has permutation matrices corresponding to columns/rows associated with the states indexed by the intersection of the permutation index sets of the visited cycles by ψ(γ)\psi(\gamma), which is π0ψ(γ)=Cψ(γ)π0C\pi^{\psi(\gamma)}_{0}=\cap_{C\in\psi(\gamma)}\pi^{C}_{0}. It proves the first part of the proposition. It remains to show the product of a set of irreducible blocks indexed by some elements in πCa\pi^{C_{a}} and πCb\pi^{C_{b}} produces a new irreducible block indexed by an element in πψ(γ)\pi^{\psi(\gamma)} under some conditions.

We first assume that the entries of the weight vector ww are distinct. Then, it is easy to see that ordwC\operatorname{ord}_{w}C is a multiple of the order of BC00B^{00}_{C}. This implies that (BC00)w(B^{00}_{C})^{w} is the identity, the graph of which consists of self-arcs at every node. Consider the graphs 𝔾PCbw\mathbb{G}_{P_{C_{b}}^{w}} and 𝔾PCaw\mathbb{G}_{P_{C_{a}}^{w}}, Lemma 11 implies that they have self-arcs at every node. Due to Lemma 9 and Lemma 10, the graph 𝔾PCbwPCaw\mathbb{G}_{P_{C_{b}}^{w}P_{C_{a}}^{w}} has a strongly connected component for an index set πkCaCb:=πiCaπjCb\pi^{C_{a}C_{b}}_{k}:=\pi^{C_{a}}_{i}\cup\pi^{C_{b}}_{j} if πiCaπjCb\pi^{C_{a}}_{i}\cap\pi^{C_{b}}_{j}\neq\emptyset for some k,i,jk,i,j. Owing to the above, one can easily find the elements of partition πψ(γ)\pi^{\psi(\gamma)} of the index set induced by ψ(γ)\psi(\gamma) by induction and each element πkψ(γ)\pi^{\psi(\gamma)}_{k} labels a strongly connected component in 𝔾Pψ(γ)\mathbb{G}_{P_{\psi(\gamma)}}. It then follows that each block Bψ(γ)kkB^{kk}_{\psi(\gamma)}, which is indexed by the set πkψ(γ)\pi^{\psi(\gamma)}_{k}, is an irreducible block [22, Thm. 6.2.44] for k>1k>1.

We denote by wπ0Cw_{\pi^{C}_{0}} the entries of the weight vector ww which are indexed by the set π0C\pi^{C}_{0} for a cycle CC. Now, consider the case when some of the entries of the weight vector ww are equal. On the one hand, if the corresponding indexes to these entries are not in the set π0C\pi^{C}_{0}, then one can easily show that ordwC\operatorname{ord}_{w}C is a multiple of the order of permutation matrix BC00B^{00}_{C}. Therefore, (BC00)w(B^{00}_{C})^{w} is identity. The remaining part of the proof will then be similar to the above. On the other hand, consider the case when the corresponding indices to these entries are in the set π0C\pi^{C}_{0}. By definition of the orbit set (12), the matrix (PC)w(P_{C})^{w} is in the stabilizer of weight vector ww. But, it does not imply that ordwC\operatorname{ord}_{w}C is a multiple of the order of the permutation BC00B^{00}_{C}. For example, if all entries of the vector wπ0Cw_{\pi^{C}_{0}} are equal, then any permutation matrix with the proper dimension will be in the stabilizer of the vector wπ0Cw_{\pi^{C}_{0}}. This implies that the matrix (BC00)w(B^{00}_{C})^{w} is not necessarily the identity matrix, but it includes an identity block labeled by the indexes that correspond to the distinct weights in wπ0Cw_{\pi^{C}_{0}}. This observation leads us to the conclusion that the remaining part of the proof follows a similar pattern to the one discussed earlier. With this, we conclude the proof.∎

To elaborate on Proposition 3, consider two different exhaustive closed walks γ1\gamma_{1} and γ2\gamma_{2} in Dw(G)D_{w}(G). Note that the partitions induced by ψ(γ1)\psi(\gamma_{1}) and ψ(γ2)\psi(\gamma_{2}) are the same, but the corresponding block matrices Bψ(γ1)iiB_{\psi(\gamma_{1})}^{ii} and Bψ(γ2)iiB_{\psi(\gamma_{2})}^{ii} are not necessarily the same. Indeed, the partition of the index set induced by an exhaustive walk can be found by intersection and union operation on the index sets, which are associative. This implies that the appearance order of the cycles in the exhaustive walk does not affect the corresponding index sets. We then conclude that the block structures of the matrices Pψ(γ1)P_{\psi(\gamma_{1})} and Pψ(γ2)P_{\psi(\gamma_{2})} are the same up to relabeling. We can thus denote the elements of the corresponding partition by πG\pi^{G} (e.g. πG=πψ(γ)\pi^{G}=\pi^{\psi(\gamma)}). On the other hand, the order in which cycles are traversed affects the order in which local stochastic matrices (which are not necessarily commutative in our work) are multiplied, and thus the block matrices are not necessarily equal.

For this section, we continue with Example 1.

Example 1 (Cont.).

Assume that the number of states for each node mm is 33 in the graph shown in Figure 1(a); then x(t)21x(t)\in\mathbb{R}^{21}. Consider the cycles C1C_{1} and C2C_{2}. Assume that we have the following partition of the index set:

πC1={{2,4,7,10,11,,20,21}π0C1,{1,3,5}π1C1,{6,8,9}π2C1}\pi^{C_{1}}=\{\underbrace{\{2,4,7,10,11,\cdots,20,21\}}_{\pi^{C_{1}}_{0}},\underbrace{\{1,3,5\}}_{\pi^{C_{1}}_{1}},\underbrace{\{6,8,9\}}_{\pi^{C_{1}}_{2}}\} (20)
πC2={{4,5,6,7,8,10,11,16,17,,20,21}π0C2,{12,13}π1C2,{1,2}π2C2,{14,15}π3C2}.\pi^{C_{2}}=\{\underbrace{\{4,5,6,7,8,10,11,16,17,\cdots,20,21\}}_{\pi^{C_{2}}_{0}},\underbrace{\{12,13\}}_{\pi^{C_{2}}_{1}},\underbrace{\{1,2\}}_{\pi^{C_{2}}_{2}},\underbrace{\{14,15\}}_{\pi^{C_{2}}_{3}}\}. (21)

Consider the partition of the index set induced by C2C1C_{2}C_{1}. We observe that the sets π1C1\pi^{C_{1}}_{1} and π2C2\pi^{C_{2}}_{2} have a nonempty intersection. Hence, the set π1C2C1:=π1C1π2C2\pi^{C_{2}C_{1}}_{1}:=\pi^{C_{1}}_{1}\cup\pi^{C_{2}}_{2} is an element of the partition πC2C1\pi^{C_{2}C_{1}}. On the other hand, the index sets π0C1\pi^{C_{1}}_{0} and π0C2\pi^{C_{2}}_{0} labels the maximal permutation block in the corresponding matrix. From Proposition 3, the set π0C2C1:=π0C1π0C2\pi^{C_{2}C_{1}}_{0}:=\pi^{C_{1}}_{0}\cap\pi^{C_{2}}_{0} is an element of the partition of the index set πC2C1\pi^{C_{2}C_{1}}. Then, we obtain the following:

πC2C1:={{4,7,10,11,16,17,,20,21}π0C2C1,{1,2,3,5}π1C2C1,{6,8,9}π2C2C1,{12,13}π3C2C1,{14,15}π4C2C1}\pi^{C_{2}C_{1}}:=\{\underbrace{\{4,7,10,11,16,17,\cdots,20,21\}}_{\pi^{C_{2}C_{1}}_{0}},\underbrace{\{1,2,3,5\}}_{\pi^{C_{2}C_{1}}_{1}},\underbrace{\{6,8,9\}}_{\pi^{C_{2}C_{1}}_{2}},\underbrace{\{12,13\}}_{\pi^{C_{2}C_{1}}_{3}},\underbrace{\{14,15\}}_{\pi^{C_{2}C_{1}}_{4}}\} (22)

4.4 Proof of Theorem 1

For a cycle CC in GG with non-zero order, we define,

ϵC:=min1jm(minBCjj)\epsilon_{C}:=\min_{1\leq j\leq m}(\min B^{jj}_{C}) (23)

where BCjjB^{jj}_{C} is the irreducible block in the matrix MCM_{C} (14). We then set:

ϵ:=minCGϵC.\epsilon:=\min_{C\in G}\epsilon_{C}. (24)

The coefficient of ergodicity of a stochastic matrix An×nA\in\mathbb{R}^{n\times n} is [37]

μ(A):=12maxi,jn|aikajk|.\mu(A):=\frac{1}{2}\max_{i,j}\sum^{n}{|a_{ik}-a_{jk}|}. (25)

It is clear that μ(A)1\mu(A)\leq 1 for any stochastic matrix AA. The following inequality holds for any two stochastic matrices BB and CC [38],

BCSμ(B)CS.\left\|BC\right\|_{S}\leq\mu(B)\left\|C\right\|_{S}. (26)

For any scrambling matrix AA, we have the following inequality:

μ(A)1min(A).\mu(A)\leq 1-\min(A). (27)

We now need the following lemma:

Lemma 12.

The product of any set of ln2l\geq\lfloor\frac{n}{2}\rfloor irreducible n×nn\times n stochastic matrices with positive diagonal entries is a scrambling matrix.

See [5, Lem. 5] for a proof of Lemma 12.

Definition 4.1 (Spanning Sequence).

Let G=(V,E)G=(V,E) be a simple, undirected graph. A finite sequence of edges of GG is spanning if it covers a spanning tree of GG. An infinite sequence of edges is spanning if it has infinitely many disjoint finite strings that are spanning.

Lemma 13.

Let GG be a bridgeless,simple, connected graph. For an exhaustive walk γ\gamma in DG(w)D_{G}(w) where the set of local stochastic matrices {Aenm×nm,eE}\{A_{e}\in\mathbb{R}^{nm\times nm},e\in E\} is ww-holonomic for GG, the sequence of edges ψ(γ)\psi(\gamma) is a spanning sequence of edges in GG.

Proof.

By Definition 3.1, the set 𝒪wC\mathcal{O}_{w}^{C} is non-empty for any cycle CC. Hence, there exists an edge, say ee, in DG(w)D_{G}(w) which has the matrix-weight PCP_{C}. The exhaustive walk γ\gamma visits all edges in DG(w)D_{G}(w). Without loss of generality, assume that γ=γaeγb\gamma=\gamma_{a}\lor e\lor\gamma_{b} where γa\gamma_{a} and γb\gamma_{b} are walks in DG(w)D_{G}(w); we have that ψ(γ)=ψ(γb)Cψ(γa)\psi(\gamma)=\psi(\gamma_{b})\lor C\lor\psi(\gamma_{a}). This shows that the edges in any cycle CC in GG are visited. Because GG is bridgeless, every edge in GG is covered by at least one cycle. This implies that every edge in GG is visited by the sequence of edges ψ(γ)\psi(\gamma), which concludes the proof. ∎

With the auxiliary results above, we are now in a position to prove Theorem 1.

Proof of Theorem 1..

Let wintΔnm1w\in\operatorname{int}\Delta^{nm-1} be a weight vector. Recall that the set of local stochastic matrices {Aenm×nm,eE}\{A_{e}\in\mathbb{R}^{nm\times nm},e\in E\} is assumed to be ww-holonomic for GG. Let γ\gamma be an infinite exhaustive closed walk in DG(w)D_{G}(w).

Proposition 3 shows that the block of Pψ(γ)P_{\psi(\gamma)} indexed by π0ψ(γ):=Cψ(γ)π0C\pi^{\psi(\gamma)}_{0}:=\cap_{C\in\psi(\gamma)}\pi_{0}^{C} is a permutation matrix. Due to Lemma 13, all cycles in GG are covered by ψ(γ)\psi(\gamma), which implies that π0ψ(γ)=C\vv𝒞π0C=π0G\pi^{\psi(\gamma)}_{0}=\cap_{C\in\vv{\mathcal{C}}}\pi_{0}^{C}=\pi^{G}_{0}. Consequently, by relabeling the states so that the submatrix indexed by π0G\pi^{G}_{0} is in the upper-left corner of Pψ(γ)P_{\psi(\gamma)}, we have proven the first part of assertion (ii)(ii) in Theorem 1.

It remains to characterize the limit set \mathcal{L}. Let Γ\Gamma be the set of all finite exhaustive walks in DG(w)D_{G}(w). Let 𝒮Γ\mathcal{S}_{\Gamma} be the set of permutation matrices Bψ(γi)00B^{00}_{\psi(\gamma_{i})}, for all γiΓ\gamma_{i}\in\Gamma. Owing to the above paragraph, each Bψ(γi)00B^{00}_{\psi(\gamma_{i})} is of the same dimension |π0G||\pi^{G}_{0}|. The set 𝒮ΓS|π0G|\mathcal{S}_{\Gamma}\subseteq S_{|\pi^{G}_{0}|} is obviously finite; let 𝒦\mathcal{K} be the subgroup generated by the elements of 𝒮Γ\mathcal{S}_{\Gamma}. We can write an infinite exhaustive closed walk γ\gamma as the concatenation of finite exhaustive closed walks γi\gamma_{i}, i1i\geq 1, in DG(w)D_{G}(w). We then have P~ψ(γ)=Bψ(γi+1)00Bψ(γi)00Bψ(γi1)00\tilde{P}_{\psi(\gamma)}=\cdots B^{00}_{\psi(\gamma_{i+1})}B^{00}_{\psi(\gamma_{i})}B^{00}_{\psi(\gamma_{i-1})}\cdots which shows that P~ψ(γ)𝒦\tilde{P}_{\psi(\gamma)}\in\mathcal{K}.

We now show that, given a weight vector ww, the block Mψ(γ)M_{\psi(\gamma)} in Theorem 1 is uniquely given. This implies that there exists an injection between the limit set Pψ(γ)\mathcal{L}\ni P_{\psi(\gamma)} of the process and the set 𝒦\mathcal{K}, which proves that the limit set is finite. To proceed, recall that Proposition 3 states that the matrix Mψ(γ)M_{\psi(\gamma)} consists of principal block matrices that are irreducible with dimensions |πiG||\pi^{G}_{i}| for i1i\geq 1 and are denoted by Mψ(γ)iiM_{\psi(\gamma)}^{ii}. Let lG:=maxi(|πiG|)l_{G}:=\max_{i}({|\pi^{G}_{i}|}). Let 0:=t0<t1<t20:=t_{0}<t_{1}<t_{2}\cdots be a monotonically increasing sequence such that every string ψ(γ(tk:tk+1))\psi(\gamma(t_{k}:t_{k+1})) for k0k\geq 0, has lG2\lfloor\frac{l_{G}}{2}\rfloor exhaustive walks in DG(w)D_{G}(w). Since Mψ(γ)S\|M_{\psi(\gamma)}\|_{S} is non-increasing by (26), it has a limit for |γ||\gamma|\to\infty. We now show that Mψ(γ)iiS\|M_{\psi(\gamma)}^{ii}\|_{S} is 0 for i1i\geq 1.

Lemma 12 implies that Mψ(γ(tk:tk+1))iiM_{\psi(\gamma(t_{k}:t_{k+1}))}^{ii} is a scrambling matrix. Plugging the lower bound (24) into (27), we have the following inequality μ(Mψ(γ(tk:tk+1))ii)(1ϵ)\mu(M_{\psi(\gamma(t_{k}:t_{k+1}))}^{ii})\leq(1-\epsilon). Then, we can use this inequality in (26) for each block matrix Mψ(γ(tk:tk+1))iiM_{\psi(\gamma(t_{k}:t_{k+1}))}^{ii} to obtain:

limkMψ(γ(tk:0))iiS\displaystyle\lim_{k\to{\infty}}\left\|M_{\psi(\gamma(t_{k}:0))}^{ii}\right\|_{S} limk(1ϵ)Mψ(γ(tk1:0))iiS\displaystyle\leq\lim_{k\to\infty}(1-\epsilon)\left\|M_{\psi(\gamma(t_{k-1}:0))}^{ii}\right\|_{S}
(1ϵ)k=0\displaystyle\leq(1-\epsilon)^{k}=0

which implies that limkMψ(γ(tk:0))iiS=0\lim_{k\to{\infty}}\left\|M_{\psi(\gamma(t_{k}:0))}^{ii}\right\|_{S}=0. We conclude using [37] that Mψ(γ(tk:0))iiM_{\psi(\gamma(t_{k}:0))}^{ii} converges to a rank-one matrix, say Mψ(γ)ii=𝟙piM_{\psi(\gamma)}^{ii}=\mathds{1}p_{i}^{\top} for some vector pi|πiG|p_{i}\in\mathbb{R}^{|\pi^{G}_{i}|}. This establishes asymptotic convergence in the assertion (iii)(iii) in Theorem 1.

It remains to provide an explicit characterization of the vector pip_{i}^{\top} such that Mψ(γ)ii=𝟙piM_{\psi(\gamma)}^{ii}=\mathds{1}p_{i}^{\top} as a function of the weight vector ww. We denote by w~\tilde{w} the entries of the weight vector ww which are indexed by the set i1πiG\cup_{i\geq 1}{\pi^{G}_{i}}. By construction of the derived graph DG(w)D_{G}(w), the closed walk γ\gamma induces the mapping:

w~w~Mψ(γ)(=w~).\tilde{w}\mapsto\tilde{w}M_{\psi(\gamma)}(=\tilde{w}).

Then, we have the following up to relabeling,

w~=w~[Mψ(γ)11Mψ(γ)22]=w~[𝟙p1𝟙p2].\tilde{w}=\tilde{w}\left[\begin{array}[]{ccccc}M_{\psi(\gamma)}^{11}&&\\ &M_{\psi(\gamma)}^{22}&\\ &&\ddots&\end{array}\right]=\tilde{w}\left[\begin{array}[]{ccccc}\mathds{1}p_{1}^{\top}&&\\ &\mathds{1}p_{2}^{\top}&\\ &&\ddots&\end{array}\right]. (28)

From the block structure (28), we have that,

pi:=[w~]jπiGαi where αi:=jπiGw~j.p_{i}:=\frac{[\tilde{w}]_{j\in\pi^{G}_{i}}}{\alpha_{i}}\mbox{ where }\alpha_{i}:=\sum_{j\in\pi^{G}_{i}}\tilde{w}_{j}.

Since ww is a weight vector by definition, we know that αi(0,1]\alpha_{i}\in(0,1]. This then shows that pip_{i} has no zero entry. ∎

5 Summary and outlook

In this paper, we have investigated the weighted average consensus problem for a gossiping network of agents with vector states. We have introduced the concept of ww-holonomy for a set of stochastic matrices, which helped us to investigate the existence of non-trivial, finite holonomy groups in the gossip process. The allowable sequences of updates in the gossip process were obtained as closed walks in the so-called derived graph DG(w)D_{G}(w), in that any infinite exhaustive closed walks in DG(w)D_{G}(w) could be mapped to an allowable sequence of updates for the gossip process. Such sequences could be implemented in a decentralized manner, and we have shown that the corresponding infinite product of stochastic matrices converges to a finite limit set, whose elements we have explicitly characterized.

Our results have established a unified framework that connects the methodologies presented in [5] and [4]. Indeed, on the one hand, we have extended the framework of [5] by allowing gossip processes that display non-trivial holonomy groups, which results in a finite limit set for the process (as stated in Theorem 1). This is in contrast to [5, Thm. 1], where the limit set is a singleton. As a drawback of the existence of non-trivial holonomy groups, our results require following an allowable sequence of updates in the gossip process, whereas the order of the gossiping pairs does not matter in [5, Thm. 1].

On the other hand, in [4], allowable sequences have been found through a derived graph, (see [4, Definition 2.3]) nodes of which correspond to three nodes in the gossip graph that communicate simultaneously. This approach, motivated by the design of secure protocols, results in allowable sequences consisting of an infinite concatenation of triangles within the communication graph. In our work, we have developed a different perspective. Each node in our derived graph corresponds to an element of the orbit sets of the weight vector around a cycle. Consequently, our allowable sequences consist of the concatenation of cycles in the communication graph. It is worth noting that our approach requires a bridgeless communication graph, while the methodology presented by [4] requires a triangulated Laman graph.

The present work can be extended in several directions. Among others, we focus here on algorithm design, which comprises two parts. The first is to characterize the set of local stochastic matrices that yield a predefined consensus weight for agents with vector-valued states. It is relatively straightforward to use the results developed in this paper to develop a method that yields the desired local stochastic matrices given that the communication graph is bridgeless. Removing this topological constraint requires further research. This leads us to the second aspect, which is to remove the requirement of bridgeless graphs. To understand what it entails, we first note that the requirement can be traced back to the definition of ww-holonomy involving cycles in GG. Thus, one approach to remove the requirement is to modify the definition of ww-holonomy to consider paths rather than cycles in the communication graph.

Namely, for a path ζ\zeta, we would redefine the orbit set in (12) as 𝒪wζ:={wζ(a)nm|wζ(a)=w(Pζ)a for a}\mathcal{O}_{w}^{\zeta}:=\{w_{\zeta}^{(a)}\in\mathbb{R}^{nm}|w_{\zeta}^{(a)}=w({P}_{\zeta})^{a}\mbox{ for }a\in\mathbb{N}\} and state that if the set 𝒪wζ\mathcal{O}_{w}^{\zeta} has finite and non-zero cardinality, then set of local stochastic matrices {Aenm×nm,eζ}\{A_{e}\in\mathbb{R}^{nm\times nm},\forall e\in\zeta\} will be ww-holonomic for ζ\zeta. With this modification, we can still employ the derived graph approach to characterize the allowable sequences. However, this introduces a significant challenge: the allowable sequences of updates cannot be followed in a decentralized manner (at least in an obvious manner). The ability of gossip processes to operate without a central authority is however crucial. This highlights the need to further understand the connection between topological constraints on GG, and the development of decentralized update rules.

References

  • [1] M. H. DeGroot, “Reaching a consensus,” Journal of the American Statistical association, vol. 69, no. 345, pp. 118–121, 1974.
  • [2] S. Boyd, A. Ghosh, B. Prabhakar, and D. Shah, “Randomized gossip algorithms,” IEEE transactions on information theory, vol. 52, no. 6, pp. 2508–2530, 2006.
  • [3] F. Bénézit, V. Blondel, P. Thiran, J. Tsitsiklis, and M. Vetterli, “Weighted gossip: Distributed averaging using non-doubly stochastic matrices,” in 2010 ieee international symposium on information theory, pp. 1753–1757, IEEE, 2010.
  • [4] M.-A. Belabbas and X. Chen, “Triangulated laman graphs, local stochastic matrices, and limits of their products,” Linear Algebra and its Applications, vol. 619, pp. 176–209, 2021.
  • [5] X. Chen, M.-A. Belabbas, and J. Liu, “Gossip over holonomic graphs,” Automatica, vol. 136, p. 110088, 2022.
  • [6] W. Kuhnel, Differential geometry : curves - surfaces - manifolds. Providence, R.I: American Mathematical Society, 2002.
  • [7] Z. Zhu, J. Zhu, J. Liu, and Y. Liu, “Federated bandit: A gossiping approach,” Proc. ACM Meas. Anal. Comput. Syst., vol. 5, no. 1, 2021.
  • [8] R. Olfati-Saber, “Distributed kalman filtering for sensor networks,” in 2007 46th IEEE Conference on Decision and Control, pp. 5492–5498, IEEE, 2007.
  • [9] M. Zhu and S. Martinez, “Discrete-time dynamic average consensus,” Automatica, vol. 46, no. 2, pp. 322–329, 2010.
  • [10] X. Chen, M.-A. Belabbas, and T. Başar, “Cluster consensus with point group symmetries,” SIAM Journal on Control and Optimization, vol. 55, no. 6, pp. 3869–3889, 2017.
  • [11] S. Sundaram and B. Gharesifard, “Distributed optimization under adversarial nodes,” IEEE Transactions on Automatic Control, vol. 64, no. 3, pp. 1063–1076, 2018.
  • [12] B. Touri and B. Gharesifard, “Continuous-time distributed convex optimization on time-varying directed networks,” in 2015 54th IEEE Conference on Decision and Control (CDC), pp. 724–729, IEEE, 2015.
  • [13] V. D. Blondel and A. Olshevsky, “How to decide consensus? a combinatorial necessary and sufficient condition and a proof that consensus is decidable but np-hard,” SIAM Journal on Control and Optimization, vol. 52, no. 5, pp. 2707–2726, 2014.
  • [14] A. Nedić and J. Liu, “On convergence rate of weighted-averaging dynamics for consensus problems,” IEEE Transactions on Automatic Control, vol. 62, no. 2, pp. 766–781, 2016.
  • [15] J. N. Tsitsiklis, Problems in decentralized decision making and computation. PhD thesis, Massachusetts Institute of Technology, 1984.
  • [16] A. Kashyap, T. Başar, and R. Srikant, “Quantized consensus,” Automatica, vol. 43, no. 7, pp. 1192–1203, 2007.
  • [17] S. R. Etesami and T. Başar, “Convergence time for unbiased quantized consensus over static and dynamic networks,” IEEE Transactions on Automatic Control, vol. 61, no. 2, pp. 443–455, 2015.
  • [18] X. Chen, M.-A. Belabbas, and T. Başar, “Distributed averaging with linear objective maps,” Automatica, vol. 70, pp. 179–188, 2016.
  • [19] X. Li and J. Zhu, “Event-triggered weighted average consensus in networks of dynamic agents with time-varying delay,” IFAC-PapersOnLine, vol. 54, no. 18, pp. 127–132, 2021.
  • [20] M. Cao, A. S. Morse, and B. Anderson, “Reaching a consensus in a dynamically changing environment: Convergence rates, measurement delays, and asynchronous events,” SIAM Journal on Control and Optimization, vol. 47, no. 2, pp. 601–623, 2008.
  • [21] L. Fang, P. J. Antsaklis, and A. Tzimas, “Asynchronous consensus protocols: Preliminary results, simulations and open questions,” in Proceedings of the 44th IEEE Conference on Decision and Control, pp. 2194–2199, IEEE, 2005.
  • [22] M. Cao, A. S. Morse, and B. Anderson, “Reaching a consensus in a dynamically changing environment: A graphical approach,” SIAM Journal on Control and Optimization, vol. 47, no. 2, pp. 575–600, 2008.
  • [23] W. Ren and R. W. Beard, “Consensus seeking in multiagent systems under dynamically changing interaction topologies,” IEEE Transactions on automatic control, vol. 50, no. 5, pp. 655–661, 2005.
  • [24] J. M. Hendrickx and J. Tsitsiklis, “A new condition for convergence in continuous-time consensus seeking systems,” in 2011 50th IEEE Conference on Decision and Control and European Control Conference, pp. 5070–5075, IEEE, 2011.
  • [25] J. M. Hendrickx and J. Tsitsiklis, “Convergence of type-symmetric and cut-balanced consensus seeking systems,” IEEE Transactions on Automatic Control, vol. 58, no. 1, pp. 214–218, 2012.
  • [26] F. Fagnani and S. Zampieri, “Randomized consensus algorithms over large scale networks,” IEEE Journal on Selected Areas in Communications, vol. 26, no. 4, pp. 634–649, 2008.
  • [27] B. Touri and A. Nedić, “On backward product of stochastic matrices,” Automatica, vol. 48, no. 8, pp. 1477–1488, 2012.
  • [28] B. Touri and A. Nedic, “On ergodicity, infinite flow, and consensus in random models,” IEEE Transactions on Automatic Control, vol. 56, no. 7, pp. 1593–1605, 2010.
  • [29] A. Aghajan and B. Touri, “Ergodicity of continuous-time distributed averaging dynamics: A spanning directed rooted tree approach,” IEEE Transactions on Automatic Control, vol. 67, no. 2, pp. 918–925, 2021.
  • [30] J. Liu, S. Mou, A. S. Morse, B. Anderson, and C. Yu, “Deterministic gossiping,” Proceedings of the IEEE, vol. 99, no. 9, pp. 1505–1524, 2011.
  • [31] F. He, A. Morse, J. Liu, and S. Mou, “Periodic gossiping,” IFAC Proceedings Volumes, vol. 44, no. 1, pp. 8718–8723, 2011.
  • [32] B. P. Kitchens, Symbolic dynamics: one-sided, two-sided and countable state Markov shifts. Springer, 1997.
  • [33] R. Merris, Graph theory. John Wiley & Sons, 2011.
  • [34] C. D. Meyer, Matrix analysis and applied linear algebra. OT ; 71, Philadelphia: Society for Industrial and Applied Mathematics, 2000.
  • [35] J. Ding and N. H. Rhee, “Teaching tip: When a matrix and its inverse are stochastic,” The College Mathematics Journal, vol. 44, no. 2, pp. 108–109, 2013.
  • [36] R. A. Horn and C. R. Johnson, Matrix analysis. Cambridge university press, 2012.
  • [37] E. Seneta, Non-negative matrices and Markov chains. Springer, 2006.
  • [38] J. Hajnal and M. S. Bartlett, “Weak ergodicity in non-homogeneous markov chains,” in Mathematical Proceedings of the Cambridge Philosophical Society, vol. 54, pp. 233–246, Cambridge University Press, 1958.