A Hybrid Observer for Estimating the State of a Distributed Linear System
Abstract
A hybrid observer is described for estimating the state of an $m$-channel, $n$-dimensional, continuous-time linear system of the form $\dot{x} = Ax$, $y_i = C_i x$, $i \in \{1, 2, \ldots, m\}$. The system's state $x$ is simultaneously estimated by $m$ agents assuming each agent $i$ senses $y_i$ and receives appropriately defined data from each of its current neighbors. Neighbor relations are characterized by a time-varying directed graph $\mathbb{N}(t)$ whose vertices correspond to agents and whose arcs depict neighbor relations. Agent $i$ updates its estimate $x_i$ of $x$ at "event times" using a local continuous-time linear observer and a local parameter estimator which iterates $q$ times during each event time interval to obtain an estimate of $x$. Subject to the assumptions that none of the $C_i$'s are zero, the neighbor graph $\mathbb{N}(t)$ is strongly connected for all time, and the system whose state is to be estimated is jointly observable, it is shown that for any number $\lambda > 0$, it is possible to choose $q$ and the local observer gains so that each estimate $x_i$ converges to $x$ at least as fast as $e^{-\lambda t}$ does. This result holds whether or not agents communicate synchronously, although in the asynchronous case it is necessary to assume that $\mathbb{N}(t)$ changes in a suitably defined sense. Exponential convergence is also assured if the event time sequences of the agents are slightly different from each other, although in this case only if the system being observed is exponentially stable; this limitation, however, is primarily a robustness issue shared by all state estimators, centralized or not, which operate in "open loop" in the face of small modeling errors. The result also holds in the face of abrupt changes in the number of vertices and arcs in the inter-agent communication graph upon which the algorithm depends.
keywords:
Hybrid Systems; Distributed Observer; Robustness; Resilience.
1 Introduction
In [2] a distributed observer is described for estimating the state of an $m$-channel, $n$-dimensional, continuous-time, jointly observable linear system of the form $\dot{x} = Ax$, $y_i = C_i x$. The state $x$ is simultaneously estimated by $m$ agents assuming that each agent $i$ senses $y_i$ and receives each of its neighbors' state estimates. An attractive feature of the observer described in [2] is that it is able to generate an asymptotically correct estimate of $x$ exponentially fast at a pre-assigned rate, provided each agent's set of neighbors does not change with time and the neighbor graph characterizing neighbor relations is strongly connected. However, a shortcoming of the observer in [2] is that it is unable to function correctly if the network changes with time. Changing neighbor graphs will typically occur if the agents are mobile. A second shortcoming of the observer described in [2] is that it is "fragile," by which we mean that the observer is not able to cope with an arbitrary abrupt change in the topology of the neighbor graph such as the loss or addition of a vertex or an arc. For example, if because of a component failure, a loss of battery power, or some other reason, an agent drops out of the network, what remains of the overall observer will typically not be able to perform correctly and may become unstable, even if joint observability is not lost and what remains of the neighbor graph is still strongly connected.
This paper breaks new ground by introducing a hybrid distributed observer which overcomes the aforementioned difficulties without making restrictive assumptions. To the best of our knowledge, this observer is the first provably correct distributed algorithm capable of generating an asymptotically correct estimate of a jointly observable linear system’s state in the presence of a neighbor graph which changes with time under reasonably general assumptions. Although the observer is developed for continuous-time systems, it can very easily be modified in the obvious way to deal with discrete-time systems.
Notation: Given a collection of matrices $A_1, A_2, \ldots, A_m$, let $\mathrm{diag}\{A_1, A_2, \ldots, A_m\}$ denote the block diagonal matrix with $A_i$ as its $i$th diagonal block. Given a collection of vectors $x_1, x_2, \ldots, x_m$, let $\mathrm{column}\{x_1, x_2, \ldots, x_m\}$ denote the stacked vector with $x_i$ as its $i$th sub-vector. For a matrix $M$, we let $\mathrm{image}\, M$ denote the linear subspace spanned by the columns of $M$. For two matrices $M_1$ and $M_2$ with the same number of rows, $\mathrm{image}\, M_1 \cap \mathrm{image}\, M_2$ denotes the intersection of the two images.
1.1 The Problem
We are interested in a network of $m$ autonomous agents labeled $1, 2, \ldots, m$ which are able to receive information from their "neighbors," where by a neighbor of agent $i$ is meant any agent in agent $i$'s reception range. We write $\mathcal{N}_i(t)$ for the set of labels of agent $i$'s neighbors at real {continuous} time $t$ and always take agent $i$ to be a neighbor of itself. Neighbor relations at time $t$ are characterized by a directed graph $\mathbb{N}(t)$ with $m$ vertices and a set of arcs defined so that there is an arc from vertex $j$ to vertex $i$ whenever agent $j$ is a neighbor of agent $i$. Since each agent is always a neighbor of itself, $\mathbb{N}(t)$ has a self-arc at each of its vertices. Each agent $i$ can sense a continuous-time signal $y_i$, where
$$\dot{x} = Ax \qquad (1)$$
$$y_i = C_i x, \quad i \in \mathbf{m} = \{1, 2, \ldots, m\} \qquad (2)$$
and $x \in \mathbb{R}^n$. It is assumed throughout that the system defined by (1) and (2) is jointly observable; i.e., with $C = \mathrm{column}\{C_1, C_2, \ldots, C_m\}$, the matrix pair $(C, A)$ is observable. For simplicity, it is further assumed that no $C_i$ is zero; generalization to deal with the case when this assumption does not hold is straightforward. The problem of interest is to develop "private estimators," one for each agent, which, under ideal conditions without modeling or synchronization errors, enable each agent $i$ to obtain an estimate $x_i$ of $x$ which converges to $x$ exponentially fast at a pre-assigned rate $\lambda$.
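Joint observability is easy to check numerically. The following is a minimal NumPy sketch, assuming the stacked-output definition above; the toy matrices at the bottom are illustrative only and are not the system used later in the paper.

```python
import numpy as np

def observability_matrix(C, A):
    # O = [C; CA; ...; C A^(n-1)]
    n = A.shape[0]
    return np.vstack([C @ np.linalg.matrix_power(A, k) for k in range(n)])

def jointly_observable(A, C_list):
    # The stacked pair (C, A), with C = column{C_1, ..., C_m}, must be observable.
    C = np.vstack(C_list)
    return np.linalg.matrix_rank(observability_matrix(C, A)) == A.shape[0]

# Toy example: with A = I, neither channel is observable on its own,
# yet the two channels together are jointly observable.
A = np.eye(2)
C1 = np.array([[1.0, 0.0]])
C2 = np.array([[0.0, 1.0]])
print(jointly_observable(A, [C1]), jointly_observable(A, [C2]))  # False False
print(jointly_observable(A, [C1, C2]))                           # True
```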
1.2 Background
The distributed state estimation problem has been under study in one form or another for years. The problem has been widely studied as a distributed Kalman filter problem [3, 4, 5, 6, 7, 8, 9, 10]. A form of distributed Kalman filtering is introduced in [3] for discrete-time linear systems; the underlying idea is to switch back and forth between conventional state estimation and a data fusion computation. This approach is extended to continuous-time systems in [4]. There are two key limitations of the ideas presented in [3, 4]. First, it is implicitly assumed in each paper that data fusion {i.e., consensus} can be attained in finite time. Second, it is also implicitly assumed that each pair $(C_i, A)$ is observable; this restrictive assumption is needed in order to guarantee that each local error covariance matrix Riccati equation has a solution. Both papers also include assumptions about graph connectivity and information exchange which are more restrictive than they need be.
Discrete-time distributed observers have recently appeared in [11, 12, 13, 14, 15, 16, 17, 18, 19, 20]. None of these estimators admit continuous-time extensions. The algorithm in [11] works for fixed graphs with a relatively complicated topology design obtained by studying the roles of each agent in the network. The distributed observer proposed in [12] can track the system only if the so-called Scalar Tracking Capacity condition is satisfied. Noteworthy among these is the paper [15] which describes a discrete-time linear system which solves the estimation problem for jointly observable, discrete-time systems with fixed neighbor graphs assuming only that the neighbor graph is directed and strongly connected. This is done by recasting the estimation problem as a classical decentralized control problem [21, 22]. Although these observers are limited to discrete-time systems, it has proved possible to make use of the ideas in [15] to obtain a distributed observer for continuous-time systems [2]. In particular, [2] explains how to construct a distributed observer for a continuous-time system with a strongly connected neighbor graph, which is capable of estimating the state exponentially fast at a pre-assigned rate. It is straightforward to modify this observer to deal with discrete-time systems.
An interesting idea, suggested in [23], seeks to simplify the structure of a distributed estimator for a continuous-time system at the expense of some design flexibility. This is done, in essence, by exploiting the $A$-invariance of the unobservable spaces of the pairs $(C_i, A)$; this in turn enables one to "split" the estimators into two parts, one based on conventional spectrum assignment techniques and the other based on consensus [23, 24, 25, 26, 27]. Reference [23] addresses the problem in continuous time for undirected, connected neighbor graphs. The work of [24, 25] extends the result of [23] to the case when the neighbor graph is directed and strongly connected. Establishing correctness requires one to choose gains to ensure that certain LMIs hold. In [27], motivated by the distributed least squares solver problem, a modified algorithm which can deal with measurement noise is proposed. In [26] a simplified version of the ideas in [24] is presented. Because the "high gain" constructions used in [24] and [26] do not apply in discrete time, significant modifications are required to exploit these ideas in a discrete-time context [28].
Despite the preceding advances, until the appearance of [1], which first outlines the idea presented in this paper, there were almost no results for doing state estimation with time-varying neighbor graphs for either discrete-time or continuous-time linear systems. To be sure, there were a few partial results. For example, [17] suggests a distributed observer using a consensus filter for the state estimation of discrete-time linear distributed systems over specially structured, undirected neighbor graphs. As another example, in [18] an $H_\infty$-based observer is described which is intended to function in the face of a time-varying graph with a Markovian randomly varying topology. It is also worth mentioning [29] which tackles the challenging problem of trying to define a distributed observer which can function correctly in the face of intermittent disruptions in available information. Although the problem addressed in [29] is different from the problem addressed in this paper, resilience in the face of intermittent disruptions is to some extent similar to the notion of resilience addressed in this paper.
The first paper to provide a definitive solution to the distributed state estimation problem for time-varying neighbor graphs under reasonably relaxed assumptions was presented, in abbreviated form, at the 2017 IEEE Conference on Decision and Control [1]. The central contribution of [1] and this paper is to describe a distributed observer for a jointly observable, continuous-time linear system with a time-varying neighbor graph which is capable of estimating the system's state exponentially fast at any prescribed rate. Assuming "synchronous operation", the only requirement on the graph is that it be strongly connected for all time.
Since the appearance of [1], several other distributed observers have been suggested which are capable of doing state estimation in the face of changing neighbor graphs. For example, expanding on earlier work in [16], [30] provides a procedure for constructing such an observer which exploits in some detail the structure of the neighbor graph and its relation to the structure of the data matrices defining the system. The resulting algorithm, which is tailored exclusively to discrete-time systems, deals with state estimation under assumptions which are weaker than strong connectivity. Recently we have learned that the split spectrum observer idea first proposed in [23] and later simplified in [24] and [26] can be modified to deal with strongly connected, time-varying neighbor graphs, although only for continuous-time systems. See [31] for an unpublished report on the subject.
1.3 Organization
The remainder of the paper is organized as follows. The hybrid observer itself is described in §2 subject to the assumption that all agents share the same event time sequence. Two cases are considered, one in which the interchanges of information between agents are performed synchronously and the other in which they are not. The synchronous case is the one most comparable to the versions of the distributed observer problem treated in [3] - [18]. The main result for this case is Theorem 1 which asserts that so long as the neighbor graph is strongly connected for all time, exponential convergence to zero at a prescribed convergence rate of all state estimation errors is achieved. This is a new result which has no counterpart in any of the previously cited references. The same result is achieved in the asynchronous case {cf. Theorem 2}, but to reach this conclusion it is necessary to assume that the neighbor graph $\mathbb{N}(t)$ changes in a suitably defined sense. {It is worth noting at this point that many of the subtleties of asynchronous operation are obscured or at least difficult to recognize in a discrete-time setting where there is invariably a single underlying discrete-time clock shared by all agents.} These two theorems are the main contributions of this paper. Their proofs can be found in §3.
The aim of §4 is to explain what happens if the assumption that all agents share the same event time sequence is not made. For simplicity, this is only done for the case when differing event time sequences are the only cause of asynchronism. As will be seen, the consequence of event-time sequence mismatches turns out to be more of a robustness issue than an issue due to unsynchronized operation. In particular, it will become apparent that if different agents use slightly different event time sequences then asymptotically correct state estimates will not be possible unless $A$ is a stability matrix, i.e., all the eigenvalues of $A$ have strictly negative real parts. While at first glance this may appear to be a limitation of the distributed observer under consideration, it is in fact a limitation of virtually all state estimators, distributed or not, which are not used in feedback loops. Since this easily established observation is apparently not widely appreciated, an explanation is given at the end of the section.
By a (passively) resilient algorithm for a distributed process is meant an algorithm which, by exploiting built-in network and data redundancies, is able to continue to function correctly in the face of abrupt changes in the number of vertices and arcs in the inter-agent communication graph upon which the algorithm depends. In §5, it is briefly explained how to configure things so that the proposed estimator can cope with the situation when there is an arbitrary abrupt change in the topology of the neighbor graph such as the loss or addition of an arc or a vertex provided connectivity is not lost in an appropriately defined sense. Dealing with a loss or addition of an arc proves to be easy to accomplish because of the ability of the estimator to deal with time-varying graphs. Dealing with the loss or addition of a vertex is much more challenging and for this reason only preliminary results are presented. Finally in §6 simulation results are provided to illustrate the observer's performance.
2 Hybrid Observer
The overall hybrid observer to be considered consists of $m$ private estimators, one for each agent. Agent $i$'s private estimator, whose function is to generate an estimate $x_i$ of $x$, is a hybrid dynamical system consisting of a "local observer" and a "local parameter estimator." The purpose of the local observer is to generate an asymptotically correct estimate $z_i$ of $L_i x$, where $L_i$ is any pre-specified, full-rank $n_i \times n$ matrix whose kernel equals the kernel of the observability matrix of the pair $(C_i, A)$; roughly speaking, $L_i x$ can be thought of as that "part of $x$" which is observable to agent $i$. Agent $i$'s local observer is then an $n_i$-dimensional continuous-time linear system of the form
$$\dot{z}_i = (\bar{A}_i + K_i \bar{C}_i) z_i - K_i y_i \qquad (3)$$
where $z_i \in \mathbb{R}^{n_i}$, $K_i$ is a gain matrix to be specified, and $\bar{A}_i$ and $\bar{C}_i$ are the unique solutions to the equations $L_i A = \bar{A}_i L_i$ and $C_i = \bar{C}_i L_i$, respectively. As is well known, the pair $(\bar{C}_i, \bar{A}_i)$ is observable and the local observer estimation error $e_i = z_i - L_i x$ satisfies $\dot{e}_i = (\bar{A}_i + K_i \bar{C}_i) e_i$.
Since $(\bar{C}_i, \bar{A}_i)$ is an observable pair, $K_i$ can be selected so that $e_i$ converges to zero exponentially fast at any pre-assigned rate. We assume that each $K_i$ is so chosen. Since
$$z_i = L_i x + e_i \qquad (4)$$
$z_i$ can be viewed as a signal which approximates $L_i x$ in the face of exponentially decaying additive noise, namely $e_i$.
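A minimal NumPy/SciPy sketch of this local observer construction follows. It builds $L_i$ from the observability matrix of $(C_i, A)$, computes $\bar{A}_i$ and $\bar{C}_i$, and selects $K_i$ by pole placement; the SVD-based choice of $L_i$ and the particular pole locations are illustrative design choices made here, not the only ones possible.

```python
import numpy as np
from scipy.signal import place_poles

def local_observer_data(A, Ci, decay_rate):
    """Illustrative construction of (L_i, Abar_i, Cbar_i, K_i) for one agent."""
    n = A.shape[0]
    O = np.vstack([Ci @ np.linalg.matrix_power(A, k) for k in range(n)])
    # L_i: full-rank matrix with ker L_i = ker O, taken here with orthonormal rows.
    _, s, Vt = np.linalg.svd(O)
    r = int(np.sum(s > 1e-9 * s[0]))
    L = Vt[:r]
    # Unique solutions of L A = Abar L and Ci = Cbar L (valid because L has
    # orthonormal rows and ker L is A-invariant and contained in ker Ci).
    Abar = L @ A @ L.T
    Cbar = Ci @ L.T
    # Observer gain: place the eigenvalues of Abar + K Cbar at distinct real
    # locations faster than the requested decay rate (dual pole placement).
    poles = -(decay_rate + 1.0 + np.arange(r))
    K = -place_poles(Abar.T, Cbar.T, poles).gain_matrix.T
    return L, Abar, Cbar, K
```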
The other sub-system comprising agent $i$'s private estimator is a "local parameter estimator" whose function is to generate estimates of $x$ at each of agent $i$'s preselected event times. Here the event times form an ascending sequence with a fixed spacing of $T$ time units between any two successive event times. In this section it is assumed that all agents start their event time sequences at time zero, and consequently that all event time sequences are the same; thus the event times are $t_s = sT$, $s \ge 0$. {It is easy to generalize the results in this section to the case when event times are not evenly spaced provided that the spacings between successive pairs of event times remain positive and bounded.} Between event times, each $x_i$ is generated using the equation
$$\dot{x}_i(t) = A x_i(t), \quad t_{s-1} \le t < t_s \qquad (5)$$
Motivation for the development of the local parameter estimator, whose purpose is to enable agent $i$ to estimate the state at the current event time over the event time interval, stems from the fact that, at any time $t$, the equations $L_i z = z_i(t) - e_i(t)$, $i \in \mathbf{m}$, admit a unique solution, namely $z = x(t)$. Existence follows from (4) whereas uniqueness is a consequence of the assumption of joint observability.
The existence and uniqueness of this solution suggest that an approximate value of it can be obtained after a finite number of iterations - say $q$ - using the linear equation solver discussed in [32]. Having obtained such an approximate value, the desired estimate of $x$ at the event time can be taken as
(6) |
This is the
architecture which will be considered.
The computations needed to update agent $i$'s estimate of $x$ are carried out by agent $i$ during each event time interval. This is done using a local parameter estimator which generates a sequence of auxiliary states, where the number of iterations $q$ is a positive integer to be specified below. The sequence is initialized by setting
(7) |
and is recursively updated by agent $i$ at local iteration times known only to agent $i$. It is assumed that these iteration times, together with the initialization time, are of the form
(8) |
where is a sequence of small deviations which satisfy
(9) |
Here is a small nonnegative number whose constraints will be described below and is a positive number satisfying
The signal produced by the final local iteration is agent $i$'s updated estimate and is used to define the state estimate as in (6).
The transfer of information between agents which is needed to
generate the , is carried out
asynchronously
as follows.
For and , agent broadcasts
at time
where is any prescribed nonnegative number chosen smaller than .
It is assumed
that the bounds , appearing in (9) are
small enough so that there exist and satisfying
(10) |
These inequalities ensure that a broadcast by any agent at its broadcast time will occur within the corresponding reception interval of every other agent. Fig. 1 provides an example of the update and communication times of two different agents.
Accordingly, agent $j$ is a data source, or just a source, for agent $i$ at a given local iteration if agent $j$ is in the reception range of agent $i$ at the corresponding broadcast time. The set of labels of such agents is
(11) |
Note that agent $i$ is always a source for itself, so its set of sources is never empty. Clearly agent $i$ can use the signals received from its sources to compute the averaged state in (13).
Prompted by [32], the update equation used to recursively generate the auxiliary states during agent $i$'s $s$th event time interval is given by
(12) |
where the averaged state is
(13) |
and the divisor in (13) is the number of labels in agent $i$'s current set of sources. The overall private estimator for agent $i$ is thus described by the equations (3), (5) - (8) and (11) - (13). In summary, agent $i$ initializes its local observer state and its state estimate randomly, propagates its estimate between event times according to (5), and during each event time interval carries out $q$ local parameter estimator iterations; the resulting hybrid estimator for agent $i$ is summarized in Algorithm 1, and a sketch of a single local iteration is given below.
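Since the displays for (12) and (13) are not reproduced above, the following NumPy sketch only illustrates the kind of local iteration used here, in the spirit of the linear equation solver of [32]: the iterates most recently received from the current sources are averaged, and the average is then corrected so that it is consistent with agent $i$'s local observer output. The assumption that $L_i$ has orthonormal rows and the exact form of the correction are illustrative assumptions, not the paper's precise update rule.

```python
import numpy as np

def local_iteration(received_states, L, z_i):
    """One local parameter-estimator iteration in the spirit of [32]:
    average the iterates received from the current sources {cf. (13)},
    then project the average onto the affine set {w : L w = z_i} of vectors
    consistent with agent i's local observer output.  With orthonormal-row
    L the projection correction is w - L'(L w - z_i)."""
    w_bar = np.mean(received_states, axis=0)       # averaged state
    return w_bar - L.T @ (L @ w_bar - z_i)
```

Running $q$ such iterations per event time interval, initialized as in (7), produces the value used in (6).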
In order to complete the definition of the hybrid observer, it is necessary to specify values of the $K_i$ and $q$. Towards this end, suppose that as a design goal it is desired to pick the $K_i$ and $q$ so that all state estimation errors
(14) |
converge to zero as fast as $e^{-\lambda t}$ does, where $\lambda$ is some desired convergence rate. The $K_i$ would then have to be chosen using spectrum assignment or some other technique so that the matrix exponentials $e^{(\bar{A}_i + K_i \bar{C}_i)t}$ all converge to zero at least as fast as $e^{-\lambda t}$ does. This of course can be accomplished because each matrix pair $(\bar{C}_i, \bar{A}_i)$ is observable. In the sequel it will be assumed that for some preselected positive number $\bar{\lambda}$, the $K_i$ have been chosen so that for $t \ge 0$ the local observer estimation errors satisfy
(15) |
where the coefficients are nonnegative constants and $\|\cdot\|$ denotes the two-norm.
To describe how to define an appropriate value of $q$ to attain the desired convergence rate for the state estimation errors, it is necessary to take some preliminary steps. First, for each $i \in \mathbf{m}$, let $P_i$ denote the orthogonal projection on the unobservable space of $(C_i, A)$. It is easy to see that $P_i = I - L_i'(L_i L_i')^{-1} L_i$. Moreover, because of the assumption of joint observability,
$$\bigcap_{i=1}^{m} \mathrm{image}\, P_i = 0 \qquad (16)$$
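The projections $P_i$ and condition (16) are easy to compute and check numerically. The following NumPy sketch, using the constructions above, builds each $P_i$ from the observability matrix of $(C_i, A)$ and verifies that the intersection of their images is zero.

```python
import numpy as np

def unobs_projection(A, Ci):
    """Orthogonal projection P_i onto the unobservable space of (C_i, A)."""
    n = A.shape[0]
    O = np.vstack([Ci @ np.linalg.matrix_power(A, k) for k in range(n)])
    _, s, Vt = np.linalg.svd(O)
    r = int(np.sum(s > 1e-9 * s[0]))
    N = Vt[r:].T                      # orthonormal basis of ker O
    return N @ N.T

def images_intersect_trivially(P_list):
    """Numerical check of (16): x lies in every image P_i exactly when
    (I - P_i) x = 0 for all i, so the intersection is zero iff the stacked
    matrix of the (I - P_i) has full column rank."""
    n = P_list[0].shape[0]
    M = np.vstack([np.eye(n) - P for P in P_list])
    return np.linalg.matrix_rank(M) == n
```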
Next, let $\mathcal{P}$ denote the set of all products of the form $P_{i_1} P_{i_2} \cdots P_{i_k}$ in which each of the projection matrices $P_1, P_2, \ldots, P_m$ occurs at least once. Note that $\mathcal{P}$ is a closed subset of $\mathbb{R}^{n \times n}$. Since each projection matrix has a two-norm which is no greater than $1$, each matrix in $\mathcal{P}$ has a two-norm less than or equal to $1$. Thus $\mathcal{P}$ is also bounded and thus a compact subset. In fact, each product in $\mathcal{P}$ actually has two-norm strictly less than $1$. This is a consequence of (16) and the requirement that each of the $P_i$ must occur in each product in $\mathcal{P}$ at least once {Lemma 2, [33]}. These observations imply that the maximum of the two-norms of the matrices in $\mathcal{P}$, namely
$$\rho = \max_{M \in \mathcal{P}} \|M\|_2 \qquad (17)$$
exists and is a real non-negative number strictly less than $1$. {It is worth noting that although the matrices $L_i$ used in defining the local observers are not uniquely determined by the unobservable spaces of the pairs $(C_i, A)$, the orthogonal projection matrices $P_i$ nonetheless are. Thus the set $\mathcal{P}$ used in the definition of $\rho$ in (17) ultimately depends only on the family of unobservable spaces of the pairs $(C_i, A)$ and not on the particular manner in which the $L_i$ are chosen. Just how to explicitly characterize this dependence is a topic for future research.} This in turn implies that the attenuation constant
(18) |
is also a real non-negative number strictly less than $1$. As will become evident below {cf. (40) and Lemma 5}, in the idealized case when all of the deviations and local observer errors are zero, for any integer $q$ and any given attenuation factor satisfying
(19) |
the value of the signal
is attenuated by at least a factor after iterations during each event - time interval ; i.e., for ,
It will soon be apparent, if it is not already from (6), (7) and (14), that over each event- time interval ,
(20) | |||||
Since each event time interval is of length $T$, to achieve an exponential convergence rate of $\lambda$ in the idealized case, it is necessary to pick $q$ so that (19) holds, where $q$ is any integer satisfying the bound given next. In other words, the requirement on $q$ is that (19) hold where
(21) |
with $\lceil \cdot \rceil$ here denoting, for any nonnegative number, the smallest integer greater than or equal to it. The following theorem, which applies to the synchronous case when all of the deviations are zero {but not necessarily their bounds}, summarizes these observations.
Theorem 1.
This theorem will be proved in the next section. Several comments are in order. First, the attenuation of the error by the attenuation constant defined in (18) over an event time interval is not likely to be tight and a larger attenuation constant can almost certainly be expected. This is important because the larger the attenuation constant the smaller the required value of $q$ needed to achieve a given convergence rate. Second, the hypothesis that $\mathbb{N}(t)$ be strongly connected for all time is almost certainly stronger than is necessary, the notion of a repeatedly jointly strongly connected sequence of graphs [33] being a likely less stringent alternative.
To deal with the asynchronous case when at least some of the deviations are nonzero, it is necessary to assume that the neighbor graph $\mathbb{N}(t)$ is constant on each of the time intervals
(23) |
where
(24) |
For this assumption to make sense, these intervals cannot overlap. The following lemma establishes that this is in fact the case.
Lemma 1.
Proof of Lemma 1: Fix . Note that because of (10). This implies that and thus that and are disjoint. Since this holds for all satisfying , all are disjoint.
From (10), and . These inequalities imply that and respectively. From this it follows that , that and thus that .
We are led to the asynchronous version of Theorem 1.
Theorem 2.
Asynchronous case: Suppose the deviation bounds satisfy (10) and that the neighbor graph $\mathbb{N}(t)$ is constant on each interval and strongly connected for all $t$. Let $\rho$ and the attenuation constant be defined by (17) and (18) respectively and suppose that $q$ satisfies (22). Then, as in Theorem 1, each state estimation error of the hybrid observer defined by (3), (5) - (8) and (11) - (13) tends to zero as fast as $e^{-\lambda t}$ does.
The proof of this theorem will be given in the next section. Notice that the asynchronous case considered here cannot be captured in a discrete-time setting with a single discrete-time clock shared by all agents, even when delays are allowed for [34].
2.1 A special case
It is possible to somewhat relax the lower bound (22) on $q$ needed to achieve exponential convergence in the special case when the neighbor graph is symmetric and strongly connected for all time. This can be accomplished by replacing the straight averaging rule defined by (13) with the convex combination rule
(25) |
where is the number of labels in .
To proceed, let $\mathcal{G}_s$ denote the set of all symmetric, strongly connected graphs on $m$ vertices. Each such graph $\mathbb{G}$ uniquely determines a matrix $W_\mathbb{G}$ defined in terms of the Laplacian of the simple, weakly connected {undirected} graph determined by $\mathbb{G}$. It is easy to see that $W_\mathbb{G}$ is a symmetric, doubly stochastic matrix with positive diagonals and that $\mathbb{G}$ is its graph. The connection between these matrices and the update rule defined by (25) will become apparent later when assumptions are made which enable us to identify the source sets appearing in (25) with the neighbor sets of the neighbor graph {cf. Lemmas 2 and 3}. Later in this paper it will also be shown that a certain matrix built from $W_\mathbb{G}$ and the projection matrices $P_i$ is a contraction in the two norm {Lemma 6}. This means that
(26) |
is a nonnegative number less than one.
As will become clear, to achieve a convergence rate of $\lambda$, it is sufficient to pick $q$ sufficiently large. In other words, in the special case when $\mathbb{N}(t)$ is symmetric and strongly connected for all time, instead of choosing $q$ to satisfy (22), to achieve an exponential convergence rate of $\lambda$ it is enough to choose $q$ to satisfy the less demanding constraint
(27) |
Justification for this claim is given in §3. Choosing $q$ in this way is easier than choosing $q$ according to (22) because the computation of the constant in (26) is less demanding than the computation of $\rho$ and consequently of the attenuation constant in (18). On the other hand, this special approach only applies when the neighbor graph is symmetric.
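To illustrate this special case, the sketch below builds a doubly stochastic weight matrix from the Laplacian of a symmetric graph and evaluates a contraction factor of the kind appearing in Lemma 6. The particular formula $W = I - \frac{1}{m}L$ and the use of $P(W \otimes I)P$ as the contracted matrix are assumptions made for illustration; the exact quantity defining the bound in (26) is the one specified in the text.

```python
import numpy as np

def laplacian_weight_matrix(adj):
    """W = I - (1/m) * Laplacian of a symmetric graph on m vertices
    (self-arcs ignored); W is symmetric, doubly stochastic, and has positive
    diagonals whenever every vertex degree is less than m."""
    m = adj.shape[0]
    A = adj.astype(float).copy()
    np.fill_diagonal(A, 0.0)
    Lap = np.diag(A.sum(axis=1)) - A
    return np.eye(m) - Lap / m

def contraction_factor(W, P_list):
    """Two-norm of P (W kron I) P with P = diag{P_1, ..., P_m}; under (16)
    and a connected symmetric graph this is strictly less than one
    {cf. Lemma 6}."""
    n = P_list[0].shape[0]
    P = np.zeros((len(P_list) * n, len(P_list) * n))
    for i, Pi in enumerate(P_list):
        P[i * n:(i + 1) * n, i * n:(i + 1) * n] = Pi
    return np.linalg.norm(P @ np.kron(W, np.eye(n)) @ P, 2)
```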
3 Analysis
The aim of this section is to analyze the behavior of the hybrid observer defined in the last section. To do this, use will be made of the notion of a "mixed matrix norm," which will now be defined. For given positive integers, consider the real vector space of block partitioned matrices in which each block is a matrix of a fixed size. By the mixed matrix norm of such a matrix $M$, written $\|M\|$, is meant the infinity norm of the matrix whose $ij$th entry is the two-norm of the $ij$th block of $M$. For example, with the "stacked" state estimation error formed from the individual state estimation errors, the quantity mentioned in the last section is its mixed matrix norm. It is straightforward to verify that the mixed matrix norm is in fact a norm and that this norm is sub-multiplicative [33].
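As a concrete illustration of the definition, the mixed matrix norm can be computed as follows {a small NumPy helper; the block sizes are whatever the partition dictates}.

```python
import numpy as np

def mixed_matrix_norm(blocks):
    """Mixed matrix norm of a block-partitioned matrix: the infinity norm of
    the matrix whose (i, j) entry is the two-norm of block (i, j).
    `blocks` is a 2-D list of NumPy matrices."""
    G = np.array([[np.linalg.norm(Bij, 2) for Bij in row] for row in blocks])
    return np.linalg.norm(G, np.inf)
```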
Recall that the purpose of agent $i$'s local parameter estimator defined by (7), (12), and (13) is to estimate the state at the relevant event time after executing $q$ iterations during the $s$th event time interval of agent $i$. In view of this, we define the parameter error vectors, for $i \in \mathbf{m}$,
(28) |
for all . This, (7), and the definition of in (14) imply that
(29) |
In addition, from (5), (6) and (14) it is clear that
(30) |
To derive the update equation for the parameter errors as the iteration index ranges from $1$ to $q$, we first note from (13) that
(31) |
Next note that because of (4) and (12)
From this and (31) it follows that for ,
(32) |
where as before, . These are the local parameter error equations for the hybrid observer.
The next step in the analysis of the system is to study the evolution of the all-agent parameter error vector
Note first that because of (29) and (30)
(33) |
and
(34) |
where is the all-agent state estimation error vector
(35) |
and .
In order to develop an update equation for the all-agent parameter error vector as the iteration index ranges from $1$ to $q$, it is necessary to combine the update equations in (32) into a single equation, and to do this requires a succinct description of the graph determined by the sets defined in (11). There are two cases to consider: the synchronous case, which is when all of the deviations are zero, and the asynchronous case, when some or all of them may be non-zero. The following lemmas cover both cases.
Lemma 2.
Synchronous Case: Suppose . Then for any fixed value of satisfying (10), including ,
(36) |
Proof of Lemma 2: By hypothesis all . Clearly (10) can be satisfied with . Moreover from (8) and (9) and the assumption that , it follows that , so . From this and (11) it follows that (36) is true.
The following lemma asserts that (36) still holds in the asynchronous case when some of the deviations are nonzero, provided $\mathbb{N}(t)$ is constant on each interval.
Proof of Lemma 3: Fix , and . In light of (8), (9) and the assumption that , it is clear that for any ,
Moreover, . But by assumption, is constant on which means that is constant on .Therefore . From this and the definition of in (11), it follows that (36) is true.
In summary, Lemmas 2 and 3 assert that (36) holds in the synchronous case when all deviations are zero, or alternatively in the asynchronous case when the neighbor graph is constant on each interval. Because of this, the following steps to obtain an update equation for the all-agent parameter error apply to both cases.
Equation (36) implies that the graphs determined by the sets in (11) are the neighbor graphs. Since $\mathbb{N}(t)$ is assumed to be strongly connected for all $t$, each of these neighbor graphs is strongly connected. These graphs are used as follows.
Let $\mathcal{G}$ denote the set of all directed graphs on $m$ vertices which have self-arcs at all vertices. Note that $\mathcal{G}$ is a finite set and that each neighbor graph belongs to it. Each graph $\mathbb{G} \in \mathcal{G}$ uniquely determines a so-called "flocking matrix" $F_\mathbb{G} = D_\mathbb{G}^{-1} A_\mathbb{G}'$, which is a stochastic matrix, where $A_\mathbb{G}$ and $D_\mathbb{G}$ are respectively the adjacency matrix and diagonal in-degree matrix of $\mathbb{G}$; $D_\mathbb{G}$ is nonsingular because each graph in $\mathcal{G}$ has self-arcs at all vertices.
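For a small example, the flocking matrix of a directed graph with self-arcs can be computed as follows {a NumPy sketch; the convention adj[i, j] = 1 when there is an arc from vertex i to vertex j is an assumption about how the graph is encoded}.

```python
import numpy as np

def flocking_matrix(adj):
    """Flocking matrix F = D^{-1} A' of a directed graph with self-arcs,
    where adj[i, j] = 1 if there is an arc from vertex i to vertex j and D is
    the diagonal in-degree matrix; F is row stochastic."""
    At = adj.T.astype(float)     # row i of A' lists the in-neighbors of vertex i
    return At / At.sum(axis=1, keepdims=True)
```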
For and , let denote the flocking matrix determined by . Then (32) implies for that
(37) |
where
(38) |
, and is the identity. Thus for ,
(39) |
where is the state transition matrix defined by
(40) |
for and by for . From this, (33) and (34) it follows that for , the all-agent state estimation error satisfies
(41) |
where
(42) | |||||
(43) |
To determine the convergence properties of the error as the event time index tends to infinity, use will be made of the following lemma, which gives bounds on the norms of the coefficient matrices appearing in (41).
Lemma 4.
Suppose that $q$ satisfies the inequality given in Theorem 1. Then
(44) | |||||
(45) |
In order to justify the bound on the norm given in (44), use will be made of the following lemma, which is a simple variation on a result in [33].
Lemma 5.
Let denote the set of all flocking matrices determined by those graphs in which are strongly connected. For any set of flocking matrices in
(46) |
where is the attenuation constant
Proof of Lemma 5: Fix , set and let and be flocking matrices in . Then
But for any flocking matrix , where is the infinity norm. From this, the sub-multiplicative property of the mixed matrix norm, and the fact that , it follows that
In view of equation (26) of [33],
Therefore (46) is true.
Proof of Lemma 4: Lemma 5 implies that, for a given integer , if then for any ,
(47) |
Therefore by (40) and (42), if is so chosen, then . Thus by picking so large that
(48) |
and then setting $q$ accordingly one gets (44). The requirement on $q$ determined by (48) is equivalent to the requirement on $q$ determined by (21). It follows that (44) will hold provided $q$ satisfies the inequality given in Theorem 1.
Recall that and that for any stochastic matrix . From this and the sub-multiplicative property of the mixed matrix norm it follows that the matrix defined by (40) satisfies
(49) |
It is obvious at this point that because (36) holds in both the synchronous and asynchronous cases, the same arguments can be used to prove both Theorem 1 and Theorem 2.
Proof of Theorems 1 and 2: In view of (41) and Lemma 4 it is possible to write
where . Therefore
(50) |
To deal with the term involving in (50), we proceed as follows. Note first from (15) that
Thus for . It follows from this and the definition of in (38) that
where . Thus for
Using (50) there follows
(51) |
where
Fix . In view of (51) and the definition of in (35),
But for , ; consequently for the same values of . Therefore
so
Therefore for and
Now for ,
so
Since this holds for all
which proves that the state estimation errors , all converge to zero as fast as does.
3.1 Special case
We now turn to the special case mentioned in §2.1. In this case the definition of the state-transition matrix appearing in (39) changes from (40) to
(52) |
for , and with graph .
Although the formula for the error, namely (41), and the definitions in (42) and (43) are as before, the bounds given by (44) and (45) no longer apply. To proceed, use will be made of the following lemma.
Lemma 6.
Let be an doubly stochastic matrix with positive diagonals and a strongly connected graph. Suppose that , is a set of orthogonal projection matrices such that
(53) |
Then the matrix is a contraction in the -norm where .
Proof: Write for and note that is doubly stochastic with positive
diagonals and a strongly connected graph.
Since , it must be true that
that
. Moreover because
is stochastic; thus . Hence it is enough to prove that or equivalently that
Suppose that or equivalently that
for some nonzero vector $x$. Clearly $PSS'Px = Px$, which implies that $SS'Px = Px$ and thus that
Therefore . From this and Lemma 1 of [35] it follows that
.
Now is stochastic. Moreover its graph is strongly connected because
has a strongly connected graph and positive diagonals, as does . Thus by the Perron Frobenius theorem,
has exactly one eigenvalue at and all the rest must be inside the unit circle; in addition the
eigenspace for the eigenvalue must be spanned by the one-vector . Therefore
for some nonzero scalar . Therefore which implies that
. But this is impossible because of (53).
The following lemma gives the bounds on and for the special case under consideration.
Lemma 7.
Suppose that satisfies (27). Then
(54) | |||||
(55) |
Proof: Lemma 6 implies that for each , . Moreover, where is chosen according to (26). From this and the sub-multiplicative property of the two norm it follows that
Therefore by (42) and (52), if is so chosen to satisfy (27), then
Thus (54) is true. Recall that and for . From this and the sub-multiplicative property of the two norm, the matrix defined by (52) satisfies
Other than the modifications in the bounds on and given in the above lemma, everything else is the same for both the synchronous and asynchronous versions of the problem. So what one gains in this special case is exponential convergence at a prescribed rate with a smaller value of .
4 Event-time Mismatch - A Robustness Issue
In the preceding section it was shown that the hybrid observer under discussion will function correctly if local iterations are performed synchronously across the network no matter how fast the associated neighbor graph changes, just so long as it is always strongly connected. Correct performance is also assured in the face of asynchronously executed local iterations across the network during each event time interval, provided the neighbor graph changes in a suitably defined sense. Implicitly assumed in these two cases is that the event time sequences of all agents are the same. The aim of this section is to explain what happens if this assumption is not made. For simplicity, this will only be done for the case when differing event time sequences are the only cause of asynchronism. As will be seen, the consequence of event-time sequence mismatches turns out to be more of a robustness issue than an issue due to unsynchronized operation. In particular, it will become apparent that if different agents use slightly different event time sequences then asymptotically correct state estimates will not be possible unless $A$ is a stability matrix. While at first glance this may appear to be a limitation of the distributed observer under consideration, it is in fact a limitation of virtually all state estimators, distributed or not, which are not used in feedback loops. Since this easily explained observation is apparently not widely appreciated, an explanation of this simple fact will be given at the end of this section.
There are two differences between the setup to be considered here and the setup considered in the last section. First it will now be assumed that the local deviation times appearing in (8) are all zero. Thus in place of (8) the local iteration times for agent on
(56) |
Second, instead of assuming that the initializations of the agents' event time sequences are all zero, it will be assumed instead that each agent's initialization is a small number known only to that agent which lies in the interval $[0, \epsilon]$ where, as before, $\epsilon$ is a small nonnegative number. This means that even though the event time sequences of all agents are still periodic with period $T$, the sequences are not synchronized with each other. As before it is assumed that within each event time interval, each agent broadcasts its current iterate at its designated broadcast times. To ensure that these times fall within the reception intervals of the other agents, it will continue to be assumed that (10) holds. Apart from these modifications the setup to be considered here is the same as the one considered previously. As a consequence, many of the steps in the analysis of the hybrid observer's performance are the same as they were for the previously considered case.
Our first objective is to develop the relevant equations for the local parameter error vector defined by (28). Although (29) and (30) continue to hold without change, (32) requires modification. To understand what needs to be changed, it is necessary to first derive a relationship between and . Towards this end, note that
because for all . From this and (5) it follows that
Hence (13) can now be used to obtain
(57) |
where
(58) |
Next note that because of (4) and (12)
From this and (57) it follows that
(59) |
which is the modified version of (32) needed to proceed. The difference between (32) and (59) is thus the inclusion in (59) of the term .
The assumption that the event time sequences of the agents may start at different times requires us to make the same assumption as before about the neighbor graph $\mathbb{N}(t)$, namely that it is constant on each interval. The assumption makes sense in the present context for the same reason as before, specifically because the intervals defined by (23) do not overlap. This, in turn, is because the bounds have been assumed to satisfy (10), which guarantees that Lemma 1 continues to hold.
The next step in the analysis of the hybrid observer is to study the evolution of the all-agent parameter error vector
As before, (33) and (34) continue to hold where the all-agent state estimation error is defined by (35). A simple modification of the proof of Lemma 3 can be used to establish the lemma's validity in the present context. Consequently a proof will not be given. The lemma enables us to combine the individual update equations in (59), thereby obtaining the update equation
where
(60) |
The steps involved in doing this are essentially the same as the steps involved in deriving (37). Not surprisingly, the only difference between (37) and (60) is the inclusion in the latter of the term .
From (33), (34), and (60) it follows at once that the all-agent state estimation error vector satisfies
(61) |
where and are as defined in (42) and (43) respectively, and
The following lemma gives a bound on the mixed matrix norm of .
Lemma 8.
Suppose that $q$ satisfies the inequality given in Theorem 1. Then
(62) |
Note that this bound is small when $\epsilon$ is small. This means that small deviations of the agents' event time sequences from the nominal event time sequence produce small effects on the error dynamics in (61), provided of course $x$ is well behaved; i.e., $A$ is a stability matrix! More will be said about this point below.
In general, for any real square matrix , and real numbers and
so
By assumption and . But and . Thus and . Therefore
so
In view of (49) and the definition of , . It follows that (62) holds.
Taking the construction leading to (50) as a guide, it is not difficult to derive from (61) the inequality
(63) |
where is as defined just below (51) and . Comparing (51) to (63), we see that the effect of the change in assumptions leads to the inclusion in (63) of the term involving .
At this point there are two distinct cases to consider - either $x$ converges to zero or it does not. Consider first the case when $x$ converges to zero. Then there must be positive constants $c$ and $\mu$ such that $\|x(t)\| \le c e^{-\mu t}$. By treating the term involving $x$ in (63) in the same manner as the term involving the local observer errors in (51) was treated, one can easily conclude that for a suitably defined constant
if $\mu \ge \lambda$, or
if $\mu < \lambda$. If the former is true, then the same arguments as were used in the last section can be used to show that the state estimation errors converge to zero as fast as $e^{-\lambda t}$ does. On the other hand, if the latter is true, by similar reasoning the errors can easily be shown to converge to zero as fast as $e^{-\mu t}$ does. Note that in this case, if $\mu$ is small, the effect of the resulting slow convergence will to some extent be mitigated by the smallness of $\epsilon$, so even with small $\mu$, the performance of the hybrid observer may be acceptable for sufficiently small perturbations of the start times of the event time sequences from zero.
In the other situation, which is when $A$ is not a stability matrix, the hybrid observer cannot perform acceptably except possibly if finite time state estimation is all that is desired and $\epsilon$ is sufficiently small.
Key Point: This limitation applies not only to the hybrid observer discussed in this paper, but to all state estimators, centralized or not, including Kalman filters, which are not being used in feedback loops. {Some of the adaptive observers developed in the past may be an exception to this, but such observers invariably require persistent excitation to achieve exponential convergence.}
Experience has shown that this limitation is not widely recognized, despite its
simple justification. Here is the justification.
Suppose one is trying to obtain an estimate of the state $x$ of a single-channel, observable linear system $\dot{x} = Ax$, $y = Cx$, using an observer, but only approximately correct values of $A$ and $C$ - say $\hat{A}$ and $\hat{C}$ - upon which to base the observer design are known. The observer would then be a linear system of the form
(64) |
with $K$ chosen to exponentially stabilize $\hat{A} + K\hat{C}$. Then it is easy to see that the state estimation error must satisfy
Therefore if $A$ is not a stability matrix and either $\hat{A}$ is not exactly equal to $A$ or $\hat{C}$ is not exactly equal to $C$, then instead of converging to zero, the state estimation error will grow without bound for almost any initialization. In other words, with robustness in mind, the problem of trying to obtain an estimate of the state of a linear system with an "open-loop" state estimator does not make sense unless $A$ is a stability matrix. Of course, if one is trying to use a state estimator to generate an estimate of the state of the forced linear system
where is a stability matrix, this problem does not arise, but to accomplish this one has to change the estimator dynamics defined in (64) to
While this modification works in the centralized case, it cannot be used in the decentralized case as explained in [36]. In fact, until recently there appeared to be only one type of distributed observer which could be used in a feedback configuration, thereby avoiding the robustness issue just mentioned [36]. However, recent research suggests other approaches may emerge [37].
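The fragility just described is easy to reproduce numerically. The following sketch {an Euler-integration illustration with user-supplied matrices; the step size and horizon are arbitrary} simulates the observer (64) designed with a slightly wrong system matrix and reports the estimation-error norm over time: with an exact model the error decays, while with a small model error and an unstable $A$ it grows without bound.

```python
import numpy as np

def open_loop_error(A, C, K, dA, x0, xhat0, T=20.0, dt=1e-3):
    """Euler simulation of the observer (64) when the design model is
    A_hat = A + dA (a small modeling error) while the true system x' = A x
    runs open loop.  Returns the estimation-error norm over time."""
    A_hat = A + dA
    x, xhat = np.array(x0, float), np.array(xhat0, float)
    errs = []
    for _ in range(int(T / dt)):
        y = C @ x
        x = x + dt * (A @ x)
        # xhat' = (A_hat + K C) xhat - K y, written as A_hat xhat + K (C xhat - y)
        xhat = xhat + dt * (A_hat @ xhat + K @ (C @ xhat - y))
        errs.append(np.linalg.norm(xhat - x))
    return np.array(errs)
```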
5 Resilience
By a (passively) resilient algorithm for a distributed process is meant an algorithm which, by exploiting built-in network and data redundancies, is able to continue to function correctly in the face of abrupt changes in the number of vertices and arcs in the inter-agent communication graph upon which the algorithm depends. In this section, it will be shown that the proposed estimator can cope with the situation when there is an arbitrary abrupt change in the topology of the neighbor graph such as the loss or addition of an arc or a vertex provided connectivity is not lost in an appropriately defined sense.
Consider first the situation when there is a potential loss or addition of arcs in the neighbor graph. Assume the neighbor graph is $k$-arc redundantly strongly connected in that the graph is strongly connected and remains strongly connected after any $k$ arcs are removed. With this assumption, strong connectivity of the neighbor graph and joint observability of the system are ensured when any $k$ arcs are lost. Alternatively, if any number of new arcs are added, strong connectivity and joint observability are clearly still ensured. Thus, in the light of Theorem 1, whenever arcs are lost from or added to the neighbor graph, the hybrid estimator under consideration will still function correctly without the need for any "active" intervention such as redesign of any of the $K_i$ or readjustment of $q$. In fact, Theorem 1 guarantees that correct performance will prevail, even if arcs change over and over, no matter how fast, just so long as strong connectivity is maintained for all time.
Consider next the far more challenging situation when at some time there is a loss of vertices from the neighbor graph. For this situation, only preliminary results currently exist. One possible way to deal with this situation is as follows.
As a first step, pick the $K_i$ as before, so that all local observer state estimation errors converge to zero as fast as $e^{-\bar{\lambda} t}$ does. Next, assume that the neighbor graph is $k$-vertex redundantly strongly connected in that it is strongly connected and remains strongly connected after any $k$ vertices are removed. Assume in addition that the system described by (1), (2) is redundantly jointly observable in that the system which results after any $k$ output measurements have been deleted is still jointly observable. Let $\mathcal{V}$ denote the family of all nonempty subsets of agent labels such that each subset contains at least $m - k$ labels. Thus each loss of at most $k$ vertices results in a strongly connected subgraph of the neighbor graph whose vertex set is some subset $v \in \mathcal{V}$; call this subgraph $\mathbb{N}_v(t)$. Correspondingly, let $\Sigma_v$ denote the multi-channel linear system which results when the outputs with labels outside $v$ are deleted from (1), (2). Thus $\Sigma_v$ is a jointly observable multi-channel linear system whose channel outputs are the $y_i$, $i \in v$. Fix $\lambda$.
Fix $v \in \mathcal{V}$ and let $m_v$ denote the number of vertices in $v$. Since $\Sigma_v$ is jointly observable it is possible to compute a number which satisfies (17) for this subsystem. Using the corresponding pair in place of the pair appearing in (18) and (22), it is possible to calculate a value of $q$ for which (22) holds. In other words, for this value of $q$, henceforth labelled $q_v$, Theorem 1 holds for the multichannel system $\Sigma_v$ and neighbor graph $\mathbb{N}_v(t)$. By then picking
one obtains a value of $q$ for which Theorem 1 holds for all pairs as $v$ ranges over $\mathcal{V}$. Suppose a hybrid observer using this value of $q$ is implemented. Suppose in addition that at some time, for some specific $v \in \mathcal{V}$, the agents with labels outside of $v$ stop functioning. Clearly the remaining agents with labels in $v$ will be able to deliver the desired state estimates with the prescribed convergence rate bounds. In this sense, the observer under consideration is resilient to vertex losses. However, unlike the loss or addition of edges mentioned above, no claim is being made at this point about what might happen if some or all of the lost vertices rejoin the network, especially if this loss-gain process is rapidly reoccurring over and over as time evolves. A schematic of this choice of $q$ as a maximum over configurations is sketched below.
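Schematically, the choice of $q$ just described amounts to taking a maximum over all admissible vertex-loss configurations, as in the following sketch. Here q_required is a hypothetical placeholder for the per-configuration value of $q$ dictated by (22) for the corresponding subsystem and subgraph.

```python
from itertools import combinations

def resilient_q(labels, k, q_required):
    """Choose q to cover every loss of up to k agents: the maximum of the
    per-configuration requirement over all vertex subsets keeping at least
    m - k agents.  q_required(subset) is a hypothetical placeholder for the
    value of q that (22) dictates for that subsystem and subgraph."""
    m = len(labels)
    q = 0
    for r in range(m - k, m + 1):
        for subset in combinations(labels, r):
            q = max(q, q_required(frozenset(subset)))
    return q
```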
A similar approach can be used to deal with the situation when at some time , the network gains some additional agents. In this case one would have to specify all possibilities and make sure that for each one, one has a strongly connected graph and a jointly observable system.
A little thought reveals that what makes it possible to deal with a change in the number of vertices in this way is the fact that there is a single scalar quantity, namely $q$, with the property that for each possible graphical configuration resulting from an anticipated gain or loss of vertices, there is a value of $q$ large enough for the distributed observer to perform correctly and moreover if $q$ is assigned the maximum of these values then the distributed observer will perform correctly no matter which of the anticipated vertex changes is actually encountered. Since the distributed observers described in [23, 24, 26] also require the adjustment of only a single scalar-valued quantity for a given neighbor graph, the same basic idea just described can be used to make the observers in [23, 24, 26] resilient to a one-time gain or loss of vertices on their associated neighbor graphs. On the other hand, some distributed observers such as the ones described in [15, 2, 38] are not really amenable to this kind of generalization because for such observers changes in network topology require completely new designs involving the change of many of the observer's parameters. There are also papers [39, 40] which deal with sensor attacks, where a malicious attacker can manipulate sensor observations arbitrarily and each sensor has only a one-dimensional measurement.
6 Simulation
The following simulations are intended to illustrate (i) the performance of the hybrid observer in the face of system noise, (ii) the robustness of the hybrid observer with respect to variations of event time sequences, and (iii) resilience of the hybrid observer to the loss or gain of an agent. Consider the four-channel, four-dimensional, continuous-time system described by the equations $\dot{x} = Ax$, $y_i = C_i x$, $i \in \{1, 2, 3, 4\}$, where
and $C_i$ is the $i$th unit row vector in $\mathbb{R}^{1 \times 4}$. Note that $A$ is a stable matrix with two real eigenvalues and a pair of complex eigenvalues. While the system is jointly observable, no single pair $(C_i, A)$ is observable. However the system is "redundantly jointly observable" in that what remains after the removal of any one output $y_i$ is still jointly observable. For the first two simulations the neighbor graph switches back and forth between Figure 2(a) and Figure 2(b), and for the third simulation the neighbor graph is as shown in Figure 2(a). Both graphs are strongly connected, and the graph in Figure 2(a) is redundantly strongly connected in that it is strongly connected and remains strongly connected after any one vertex is removed.
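The claimed observability properties of this example are easy to verify numerically; the following sketch {NumPy, with whatever A and C_i one supplies} checks joint observability after the removal of any single output channel.

```python
import numpy as np

def is_observable(C, A):
    n = A.shape[0]
    O = np.vstack([C @ np.linalg.matrix_power(A, k) for k in range(n)])
    return np.linalg.matrix_rank(O) == n

def redundantly_jointly_observable(A, C_list):
    """True if the system remains jointly observable after the removal of any
    single output channel, the property claimed for this example."""
    m = len(C_list)
    return all(
        is_observable(np.vstack([C_list[j] for j in range(m) if j != i]), A)
        for i in range(m))
```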


Suppose for this system. To achieve a convergence rate of
, and are chosen to be and
respectively.
For agent 1: ,
For agent 2: ,
For agent 3: ,
For agent 4: ,
In all four cases the local observer convergence rates are all .
This system was simulated with as the initial state of the process, , , , and as the initial states of the four local observers, , and as the initial estimates of the four local estimators.
Three simulations were performed. The first is intended to demonstrate performance in the face of system noise. For this a modified process dynamic of the form $\dot{x} = Ax + w$ is assumed, where $w$ is system noise. Traces of this simulation are shown in Figure 3, where the plotted signals are the third components of $x$ and of the corresponding agent's estimate. Only this component is plotted because for that agent only the third component is unobservable, and all the other components are observable.
The second simulation, which is without system noise, is intended to demonstrate the hybrid observer's robustness against a small change in the event time sequence of one of the agents. The change considered presumes that the event times of agent 4 occur a fixed number of time units before the event times of the other three agents. Traces of this simulation are shown in Figure 4.
The third simulation, also without system noise, is intended to demonstrate the hybrid observer's resilience against the disappearance of agent 4 at some time and also against agent 4's re-emergence at a later time. Traces of this simulation are shown in Figure 5.
Disruptions appearing at the beginning of the traces for all three simulations are due to initial conditions and are not important. In the third simulation, the loss of agent 4 does not appear to have any impact whereas the trace shows that the re-emergence of agent 4 briefly affects performance. While the claims in this paper do not consider the possibility of agent re-emergence, it is not surprising that this event does not cause misbehavior because the time between the loss and gain of the agent is large compared to the time constants of the observer. Clearly much more work needs to be done here to better understand rapidly occurring and re-occurring losses and gains of agents.
7 Concluding Remarks
One of the nice properties of the hybrid observer discussed in this paper is that it is resilient. By this we mean that under appropriate conditions it is able to continue to provide asymptotically correct estimates of , even if communications between some agents break down or if one or several of the agents joins or leaves the network. The third simulation provides an example of this capability. As pointed out earlier, further research is needed to more fully understand observer resilience, especially the situation when agents join or leave the network.
Generally one would like to choose “small” to avoid unnecessarily large error overshooting between event times. Meanwhile it is obvious from (20) that the larger the number and consequently the number of iterations on each event-time interval, the faster the convergence. Two considerations limit the value of - how fast the parameter estimators can compute and how quickly information can be transmitted across the network. We doubt the former consideration will prove very important in most applications, since digital processors can be quite fast and the computations required are not so taxing. On the other hand, transmission delays will almost certainly limit the choice of . A model which explicitly takes such delays into account will be presented in another paper.
A practical issue is that the development in this paper does not take into account measurement noise. On the other hand, the observer provides exponential convergence and this suggests that if noisy measurements are considered, the observer’s performance will degrade gracefully with increasing noise levels. Of course one would like an “optimal” estimator for such situations in the spirit of a Kalman filter. Just how to formulate and solve such a problem is a significant issue for further research.
This work was supported by NSF grant 1607101.00, AFOSR grant FA9550-16-1-0290, and ARO grant W911NF-17-1-0499.
References
- [1] L. Wang, A. S. Morse, D. Fullmer, and J. Liu. A hybrid observer for a distributed linear system with a changing neighbor graph. In Proceedings of the 2017 IEEE Conference on Decision and Control, pages 1024–1029, 2017.
- [2] L. Wang and A. S. Morse. A distributed observer for a time-invariant linear system. IEEE Transactions on Automatic Control, 63(7):2123–2130, 2018.
- [3] Reza Olfati-Saber. Distributed Kalman filter with embedded consensus filters. Proceedings of the 44th IEEE Conference on Decision and Control, and the European Control Conference, CDC-ECC ’05, pages 8179–8184, 2005.
- [4] Reza Olfati-Saber. Distributed Kalman filtering for sensor networks. In Proceedings of the 46th IEEE Conference on Decision and Control, pages 5492–5498, 2007.
- [5] Reza Olfati-Saber. Kalman-Consensus Filter: Optimality, stability, and performance. Proceedings of the 48th IEEE Conference on Decision and Control, and the 28th Chinese Control Conference, pages 7036–7042, December 2009.
- [6] Usman A Khan, Soummya Kar, Ali Jadbabaie, and José M.F. Moura. On connectivity, observability, and stability in distributed estimation. In Proceedings of the IEEE Conference on Decision and Control, pages 6639–6644, 2010.
- [7] U. A. Khan and Ali Jadbabaie. On the stability and optimality of distributed Kalman filters with finite-time data fusion. In Proceedings of the 2011 American Control Conference, pages 3405–3410, 2011.
- [8] Jaeyong Kim, Hyungbo Shim, and Jingbo Wu. On distributed optimal Kalman-Bucy filtering by averaging dynamics of heterogeneous agents. In Proceedings of the 55th IEEE Conference on Decision and Control, pages 6309–6314, 2016.
- [9] Jingbo Wu, Anja Elser, Shen Zeng, and Frank Allgöwer. Consensus-based Distributed Kalman-Bucy Filter for Continuous-time Systems. IFAC-PapersOnLine, 49(22):321–326, 2016.
- [10] Reza Olfati-Saber and Parisa Jalalkamali. Coupled distributed estimation and control for mobile sensor networks. IEEE Transactions on Automatic Control, 57(10):2609–2614, 2012.
- [11] Mohammadreza Doostmohammadian and Usman A. Khan. On the genericity properties in distributed estimation: Topology design and sensor placement. IEEE Journal of Selected Topics in Signal Processing, 7(2):195–204, 2013.
- [12] Usman A. Khan and Ali Jadbabaie. Collaborative scalar-gain estimators for potentially unstable social dynamics with limited communication. Autom., 50:1909–1914, 2014.
- [13] Shinkyu Park and NC Martins. An augmented observer for the distributed estimation problem for LTI systems. In Proceedings the 2012 American Control Conference, pages 6775–6780, 2012.
- [14] Shinkyu Park and Nuno C. Martins. Necessary and sufficient conditions for the stabilizability of a class of LTI distributed observers. In Proceedings of the 51st IEEE Conference on Decision and Control, pages 7431–7436, 2012.
- [15] Shinkyu Park and Nuno C Martins. Design of Distributed LTI Observers for State Omniscience. IEEE Transactions on Automatic Control, 62(2):561–576, 2017.
- [16] Aritra Mitra and Shreyas Sundaram. Distributed Observers for LTI Systems. IEEE Transactions on Automatic Control, 63(11):3689–3704, 2018.
- [17] Behçet Açikmeşe, Milan Mandić, and Jason L Speyer. Decentralized observers with consensus filters for distributed discrete-time linear systems. Automatica, 50(4):1037–1052, 2014.
- [18] Valery A. Ugrinovskii. Distributed robust estimation over randomly switching networks using consensus. Automatica, 49:160–168, 2013.
- [19] Álvaro Rodríguez del Nozal, Pablo Millán, Luis Orihuela, Alexandre Seuret, and Luca Zaccarian. Distributed estimation based on multi-hop subspace decomposition. Automatica, 99:213–220, 2019.
- [20] Francisco Castro Rego, Ye Pu, Andrea Alessandretti, A. Pedro Aguiar, António M. Pascoal, and Colin N. Jones. A distributed luenberger observer for linear state feedback systems with quantized and rate-limited communications. IEEE Transactions on Automatic Control, 66(9):3922–3937, 2021.
- [21] Shih-Ho Wang Shih-Ho Wang and E Davison. On the stabilization of decentralized control systems. IEEE Transactions on Automatic Control, 18(5):473–478, 1973.
- [22] J. P. Corfmat and A. S. Morse. Decentralized control of linear multivariable systems. Automatica, 12(5):479–497, September 1976.
- [23] Taekyoo Kim, Hyungbo Shim, and Dongil Dan Cho. Distributed Luenberger Observer Design. In Proceedings of the 55th IEEE Conference on Decision and Control, pages 6928–6933, Las Vegas, USA, 2016.
- [24] Weixin Han, Harry L. Trentelman, Zhenhua Wang, and Yi Shen. A simple approach to distributed observer design for linear systems. IEEE Transactions on Automatic Control, 64(1):329–336, 2019.
- [25] Weixin Han, Harry L. Trentelman, Zhenhua Wang, and Yi Shen. Towards a minimal order distributed observer for linear systems. Systems and Control Letters, 114:59–65, 2018.
- [26] L. Wang, J. Liu, and A. S. Morse. A distributed observer for a continuous-time linear system. In Proceedings of 2019 American Control Conference, pages 86–89, Philadelphia, PA, USA, July 2019.
- [27] Jin Gyu Lee and Hyungbo Shim. A distributed algorithm that finds almost best possible estimate under non-vanishing and time-varying measurement noise. IEEE Control Systems Letters, 4(1):229–234, 2020.
- [28] L. Wang, J. Liu, A. S. Morse, and B. D. O. Anderson. A distributed observer for a discrete-time linear system. In Proceedings of the 2019 IEEE Conference on Decision and Control, pages 367–372, 2019.
- [29] Y. Li, S. Phillips, and R.G. Sanfelice. Robust Distributed Estimation for Linear Systems under Intermittent Information. IEEE Transactions on Automatic Control, 63(4):1–16, 2018.
- [30] Aritra Mitra, John A Richards, Saurabh Bagchi, and Shreyas Sundaram. Finite-Time Distributed State Estimation over Time-Varying Graphs : Exploiting the Age-of-Information. 2018. arXiv:1810.06151 [cs.SY].
- [31] L. Wang and A. S. Morse. A distributed observer for a continuous-time, linear system with a time-varying graph. arXiv:2003.02134v1, mar 2020.
- [32] L. Wang, D. Fullmer, and A. S. Morse. A distributed algorithm with an arbitrary initialization for solving a linear algebraic equation. In Proceedings of the 2016 American Control Conference (ACC), pages 1078–1081, July 2016.
- [33] S. Mou, J. Liu, and A. S. Morse. A distributed algorithm for solving a linear algebraic equation. IEEE Transactions on Automatic Control, pages 2863–2878, 2015.
- [34] Mohammadreza Doostmohammadian, Usman A. Khan, Mohammad Pirani, and Themistoklis Charalambous. Consensus-based distributed estimation in the presence of heterogeneous, time-invariant delays. IEEE Control Systems Letters, 6:1598–1603, 2022.
- [35] L. Wang, D. Fullmer, and A. S. Morse. A distributed algorithm with an arbitrary initialization for solving a linear algebraic equation. In Proceedings of the 2016 American Control Conference, pages 1078–1081, 2016.
- [36] F. Liu, L. Wang, D. Fullmer, and A. S. Morse. Distributed feedback control of multi-channel linear systems. arXiv:1912.03890, 2020. under revision.
- [37] T. Kim, D. Lee, and H. Shim. Decentralized design and plug-and-play distributed control for linear multi-channel systems. arXiv:2011.09735, 2020.
- [38] Mohammadreza Doostmohammadian, Hamid R. Rabiee, Houman Zarrabi, and Usman A. Khan. Distributed estimation recovery under sensor failure. IEEE Signal Processing Letters, 24(10):1532–1536, 2017.
- [39] Chanhwa Lee, Hyungbo Shim, and Yongsoon Eun. On redundant observability: From security index to attack detection and resilient state estimation. IEEE Transactions on Automatic Control, 64(2):775–782, 2019.
- [40] Xingkang He, Xiaoqiang Ren, Henrik Sandberg, and Karl Henrik Johansson. How to secure distributed filters under sensor attacks. IEEE Transactions on Automatic Control, 67(6):2843–2856, 2022.