This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Group Steering: Approaches Based on Power Moments

Guangyu Wu chinarustin@sjtu.edu.cn    Anders Lindquist alq@kth.se Department of Automation, Shanghai Jiao Tong University, Shanghai, China Department of Automation and School of Mathematical Sciences, Shanghai Jiao Tong University, Shanghai, China
Abstract

This paper considers the problem of steering a vast group of agents of which the dynamics are governed by a discrete-time first-order linear system. The group of agents are characterized as a probability density function and an occupation measure respectively in the paper and two corresponding treatments are given. We propose to use the power moments to characterize the density function/occupation measure of the agents. A moment system representation of the original system is put forward for control and an empirical control scheme corresponding to it is proposed. By the designed control law, the moment sequence of the control at each time step is positive, which ensures the existence of the control for the moment system. We then realize the control as an analytic form of function by a convex optimization scheme of which the existence and uniqueness of the solution have been proved in our previous paper. The terminal density is proved to converge to the desired terminal one, which distinguishes the proposed distribution steering scheme from other existing ones. An error analysis of the terminal density from the specified one is also provided. For the problem where the group of agents is characterized as an occupation measure, the control for each agent is determined by drawing independent and identically-distributed(i.i.d) samples from the realized analytic function. Finally we simulate both unconstrained and constrained controls of a vast group of agents, which validate our proposed algorithms.

keywords:
Distribution steering; power moments; multiple agents.

plain definition remark

thanks: Portions of this work have been presented at the IFAC World Congress 2023, Yokohama, Japan [28].

,

1 Introduction

We are interested in the problem of steering the states of a vast group of agents, of which the dynamics are governed by a discrete-time stable first-order linear system, between an initial and a final probability densities or occupation measures without stochastic disturbance. We consider the linear dynamics of the ithi_{\text{th}} agent

xi(k+1)=a(k)xi(k)+ui(k),i=1,,Nx_{i}(k+1)=a(k)x_{i}(k)+u_{i}(k),\quad i=1,\cdots,N (1)

Since the system is stable and we assume a(k)a(k) to be positive, we have a(k)(0,1)a(k)\in(0,1). The control input on the ithi_{\text{th}} agent is denoted as uiu_{i}, and xix_{i} is its state. We assume that the agents are non-interactive and the volume of the agents is ignored. It means that the agents are allowed to occupy the same state and the collisions are ignored.

This poses a significant challenge both in theory and practice. From a theoretical standpoint, the challenge involves controlling numerous agents, distinguishing it from conventional control problems where only one object is typically controlled, and feedback control laws are developed based on the state of that single object. However, applying such principles to a vast group of agents becomes highly complex. In this scenario, the goal isn’t to manipulate the state of each individual agent to a predefined state, which is not only unnecessary but also computationally expensive. Instead, the objective is to control the entire group, ensuring it possesses specific global properties. As the number of agents approaches infinity, the problem becomes infinite-dimensional, making it intractable without dimension reduction or approximation. This intrinsic infinite-dimensionality adds depth and interest to the problem, presenting it as an open challenge. As Brockett asserted in [6], important limitations standing in the way of the wider use of optimal control can be circumvented by explicitly acknowledging that in most situations the apparatus implementing the control policy will be judged on its ability to cope with a distribution of initial states, rather than a single state. In practical terms, there are numerous scenarios demanding the control of a group of agents. These applications extend to steering swarms (such as UAVs or large collections of microsatellites), modeling the flow and collective motion of agents, and other ensemble control situations.

Numerous studies have addressed the challenge of steering groups of agents, leading to diverse findings. One line of research formulates the issue as stabilizing a discrete-time Markov process evolving within a compact subset of d\mathbb{R}^{d} towards a target density [3, 4, 13]. These contributions are profound and thought-provoking. However, there still exist several disadvantages. The results primarily focus on stabilizing the system and lack the capability to steer the initial density to an arbitrary distribution in a specified limited number of steps. Additionally, due to the discretization of the domain, it is not quite feasible to propose a quantitative analysis of the difference between the terminal distribution obtained by the proposed algorithms and the desired one.

In prior research, a notable exploration revolves around the assumption that the distribution follows a Gaussian pattern [18, 19, 2, 20, 31, 17, 1, 21]. Specifically, the goal here is to manipulate the first two moments of Gaussian distributions to converge towards predetermined targets. However, these Gaussian methodologies primarily originate from the context of quantifying control uncertainty, where the distributions serve to characterize the uncertainty inherent in the system state. Nevertheless, in our treatment of the group steering problem, assuming the agents to be distributed according to a Gaussian pattern is infeasible. Such an assumption would fail to adequately represent the complexity of non-Gaussian scenarios, significantly constraining the efficacy of the proposed group steering algorithm in practical applications.

Considerable advancements in steering continuous-time systems have been made, notably by Chen, Georgiou, and Pavon, who introduced fundamental results using the Schrödinger Bridge strategy for Gaussian distributions [9, 10] and extended their findings to other types of distributions [11]. These results have further been expanded to nonlinear continuous-time systems with hard state constraints [8]. Additionally, a robust optimal density control strategy for robotic swarms is proposed in [24]. These contributions collectively enrich our understanding of steering problems across different system dynamics and constraints.

When the density of the agents is assumed to be Gaussian, we use the first and second order moments to characterize the density, which turns the problem into a finite-dimensional one. However, by generalizing the mean and covariance to all the power moments, we will have a more conceptual view of this problem. Controlling the system state as a distribution function, if only assumed to be Lebesgue integrable, is an uncountably infinite-dimensional problem. By probability theory, we note that a distribution function can be uniquely determined by its full power moment sequence [32, 12]. By controlling the full power moment sequence instead of the distribution of system state, the problem is reduced to a countably infinite-dimensional one. By properly truncating the first several terms of the power moment sequence for characterizing the density of the system state, the problem is now steering a truncated power moment sequence to another, which is finite-dimensional and tractable. Decomposing the distribution by the power moments provides a completely new perspective to the challenge proposed by Brockett in [6].

In this paper we provide what can be regarded as the first computable and implementable solution to the distribution steering problem of the discrete-time linear system within limited steps by power moments, where the specified initial and terminal distributions, including probability densities and occupation measures of the agents, are arbitrary (only assuming the existence of first several orders of power moments). The paper is organized as follows. In Section 2, we propose a moment system representation as a counterpart of the discrete-time linear system. Subsequently, we introduce a problem definition for group steering based on the moment system. Unlike conventional control problems, the positivity of Hankel matrices for both control inputs and system states’ moments is required, posing challenges for standard approaches such as optimal control. In the context of general distribution steering, as considered in [25], the system trajectory is predetermined. We propose an empirical scheme for determining the system trajectory. Additionally, we employ a density parametrization algorithm from our prior work [29] to realize control inputs as analytic functions using power moments. The results mentioned above were presented at the IFAC World Congress 2023 [28]. This paper delves into more comprehensive quantitative analyses of the proposed density steering problem, and extends its applicability to steering vast groups of agents in real-world scenarios. With the number of moment terms used approaching infinity, we propose that the terminal distribution obtained by the proposed steering scheme converges almost everywhere to the desired one. The density steering problem is intrinsically infinite-dimensional, where densities are not assumed to adhere to specific functions. Since the number of power moments we utilize is finite, the presence of errors in the terminal density is inevitable. Consequently, we propose a tight upper bound for the error of the terminal density relative to the specified one. Building upon the density-steering algorithm outlined in Section 2, we introduce an algorithm for steering a finite number of agents characterized as an occupation measure. Each agent’s control input is determined by drawing independent and identically distributed (i.i.d.) samples from the realized control functions, facilitated by an acceptance-rejection sampling strategy. Finally, we provide six examples to validate the effectiveness of the two proposed steering algorithms. The examples encompass densities such as Gaussian distributions, mixtures of Gaussians, mixtures of generalized logistic distributions, and mixtures of Laplacians.

2 Steer the group of agents as a probability density function

We first consider the problem of steering an initial density function to a terminal one. Then the steering of arbitrary occupation measures will be treated in the following section, of which there has not been a solution in the previous papers.

2.1 A moment system representation

With a vast group of agents, i.e., NN is large, a conventional approach is to approximate x(k)x(k) and u(k)u(k) as random variables for which the density functions are denoted as qkq_{k} and pkp_{k}. It is a mean-field approximation [3] of the state evolution of the agents. The problem of steering the group of agents is then turned to steering an initial density function to a terminal one. The density control problem is formulated as follows.

The dynamics of the system is

x(k+1)=a(k)x(k)+u(k).x(k+1)=a(k)x(k)+u(k). (2)

Given an initial probability density function q0(x)q_{0}(x) of the random variable x(0)x(0), determine the control sequence, i.e., (u(0),,u(K1))\left(u(0),\cdots,u(K-1)\right) such that the terminal density function is τ(x)\tau(x) for x(K)x(K).

However it is not always feasible for us to obtain a closed form of solution to this problem. If the distributions are not assumed to fall within specific classes, the problem is intrinsically infinite-dimensional. We note that the density function of x(k+1)x(k+1) can be written as

qk+1(t)\displaystyle q_{k+1}(t) =fk(ξ,tξ)𝑑ξ\displaystyle=\int_{\mathbb{R}}f_{k}\left(\xi,t-\xi\right)d\xi (3)
xu¯¯qk(ξa(k))pk(tξ)𝑑ξ\displaystyle\underline{\underline{x\perp\!\!\!\perp u}}\int_{\mathbb{R}}q_{k}\left(\frac{\xi}{a(k)}\right)p_{k}\left(t-\xi\right)d\xi
xu¯¯(qk(ta(k))pk(t))(t).\displaystyle\underline{\underline{x\perp\!\!\!\perp u}}\left(q_{k}\left(\frac{t}{a(k)}\right)*p_{k}(t)\right)(t).

where xux\perp\!\!\!\perp u denotes that x(k)x(k) and u(k)u(k) are independent. Addressing the density steering problem requires obtaining an analytical solution for qk+1(t)q_{k+1}(t) in (3). The scenario would be more complicated with x(k),u(k)x(k),u(k) being correlated. However, except for specific function classes like Gaussian distributions or trigonometric functions, obtaining such an analytic solution is impractical for the majority of function classes. This limitation is a key factor, as evident in prior results with a problem setting akin to the density steering problem, where the examples predominantly feature Gaussian or trigonometric densities. Consequently, the applicability of these results in real-world scenarios is severely constrained.

There is a similar problem in non-Gaussian Bayesian filtering. In our previous results [29], we proposed a method of using the power moments to treat this intractable problem, mainly for characterizing the macroscopic property of the distributions. However, even it is theoretically feasible to characterize the distribution of the agents by the full power moment sequence, the problem is infinite dimensional. A common treatment is to truncate the first 2n2n moment terms [7, 14], which turns the problem we treat to a truncated moment problem.

By the system equation (2), the power moments of the states up to order 2n2n are written as

𝐄[xl(k+1)]=j=0l(lj)aj(k)𝐄[xj(k)ulj(k)]\mathbf{E}\left[x^{l}(k+1)\right]=\sum_{j=0}^{l}\binom{l}{j}a^{j}(k)\mathbf{E}\left[x^{j}(k)u^{l-j}(k)\right]

for l0l\in\mathbb{N}_{0} (0\mathbb{N}_{0} denotes the set of all nonnegative integers), l2nl\leq 2n. It is difficult to treat the term 𝐄[xj(k)ulj(k)]\mathbf{E}\left[x^{j}(k)u^{l-j}(k)\right]. However, we note that if x(k)x(k) and u(k)u(k) are independent from each other, i.e., 𝐄[xj(k)ulj(k)]=𝐄[xj(k)]𝐄[uij(k)]\mathbf{E}\left[x^{j}(k)u^{l-j}(k)\right]=\mathbf{E}\left[x^{j}(k)\right]\mathbf{E}\left[u^{i-j}(k)\right], the dynamics of the moments can be written in a linear matrix equation

𝒳(k+1)=𝒜(𝒰(k))𝒳(k)+𝒰(k)\mathscr{X}(k+1)=\mathscr{A}(\mathscr{U}(k))\mathscr{X}(k)+\mathscr{U}(k) (4)

where the new state vector is composed of the power moment terms up to order 2n2n,

𝒳(k)=[𝐄[x(k)]𝐄[x2(k)]𝐄[x2n(k)]].\mathscr{X}(k)=\begin{bmatrix}\mathbf{E}[x(k)]&\mathbf{E}[x^{2}(k)]&\cdots&\mathbf{E}[x^{2n}(k)]\end{bmatrix}^{\intercal}. (5)

The new input vector is written as

𝒰(k)=[𝐄[u(k)]𝐄[u2(k)]𝐄[u2n(k)]].\mathscr{U}(k)=\begin{bmatrix}\mathbf{E}[u(k)]&\mathbf{E}[u^{2}(k)]&\cdots&\mathbf{E}[u^{2n}(k)]\end{bmatrix}^{\intercal}. (6)

where we have denoted

𝐄[xl(k)]=𝒮xlqk(x)𝑑x,\mathbf{E}\left[x^{l}(k)\right]=\int_{\mathcal{S}}x^{l}q_{k}(x)dx, (7)
𝐄[xj(k)ulj(k)]=𝒮xjqk(x)𝑑x𝒞uljpk(u)𝑑u,\mathbf{E}\left[x^{j}(k)u^{l-j}(k)\right]=\int_{\mathcal{S}}x^{j}q_{k}(x)dx\int_{\mathcal{C}}u^{l-j}p_{k}(u)du,

and

𝐄[ul(k)]=𝒞ulpk(u)𝑑u.\mathbf{E}\left[u^{l}(k)\right]=\int_{\mathcal{C}}u^{l}p_{k}(u)du. (8)

The matrix 𝒜(𝒰(k))\mathscr{A}(\mathscr{U}(k)) of the new system can then be written as (9).

𝒜(𝒰(k))=[a(k)0002a(k)𝐄[u(k)]a2(k)003a(k)𝐄[u2(k)]3a2(k)𝐄[u(k)]a3(k)0(2n1)a(k)𝐄[u2n1(k)](2n2)a2(k)𝐄[u2n2(k)](2n3)a3(k)𝐄[u2n3(k)]a2n(k)].\mathscr{A}(\mathscr{U}(k))=\begin{bmatrix}a(k)&0&0&\cdots&0\\ 2a(k)\mathbf{E}[u(k)]&a^{2}(k)&0&\cdots&0\\ 3a(k)\mathbf{E}[u^{2}(k)]&3a^{2}(k)\mathbf{E}[u(k)]&a^{3}(k)&\cdots&0\\ \vdots&\vdots&\vdots&\ddots\\ \binom{2n}{1}a(k)\mathbf{E}[u^{2n-1}(k)]&\binom{2n}{2}a^{2}(k)\mathbf{E}[u^{2n-2}(k)]&\binom{2n}{3}a^{3}(k)\mathbf{E}[u^{2n-3}(k)]&&a^{2n}(k)\end{bmatrix}. (9)

The sets 𝒮\mathcal{S} and 𝒞\mathcal{C} are the support of qk(x)q_{k}(x) and pk(u)p_{k}(u) correspondingly. For control problems without state constraints, 𝒮\mathcal{S} can be chosen as the real line \mathbb{R}. Or it can be chosen as a compact subset of \mathbb{R} according to its bounds. Similarly, 𝒞\mathcal{C} can be chosen as \mathbb{R} if the control input is unconstrained, or a compact subset of \mathbb{R} if constrained.

By employing truncated power moments to describe the dynamics of the system (1), where x(k)x(k) and u(k)u(k) represent probability densities, we aim to reformulate the control problem as the manipulation of the power moments of these probability densities. The system (4) is denoted as the moment system associated with (1).

By the proposed moment system, the original density steering problem can be reduced to distribution steering by power moments, which is formulated as follows. The dynamics of the moment system is

𝒳(k+1)=𝒜(𝒰(k))𝒳(k)+𝒰(k)\mathscr{X}(k+1)=\mathscr{A}(\mathscr{U}(k))\mathscr{X}(k)+\mathscr{U}(k)

where 𝒳(k),𝒰(k)\mathscr{X}(k),\mathscr{U}(k) are obtained by (7),(8). Given an arbitrary initial density q0(x)q_{0}(x), determine the control sequence (u(0),,u(K1))\left(u(0),\cdots,u(K-1)\right) such that the first 2n2n order power moments of the terminal density are identical to those of an arbitrarily specified one, i.e.,

xlτ(x)𝑑x=xlqK(x)𝑑x\int_{\mathbb{R}}x^{l}\tau(x)dx=\int_{\mathbb{R}}x^{l}q_{K}(x)dx (10)

for l=1,,2nl=1,\cdots,2n, where τ\tau is the specified terminal density function.

Remark.

The advantage of the proposed problem is twofold. Firstly, with this formulation, the problem becomes finite-dimensional, allowing for a closed-form solution. Secondly, it eliminates the need for the initial and terminal density functions of the agents to adhere to specific classes, enhancing the algorithm’s applicability across a diverse array of real-world scenarios.

However, when using the moment system for control, there is still a need to design control laws that satisfy

𝐄[xj(k)ulj(k)]=𝐄[xj(k)]𝐄[uij(k)].\mathbf{E}\left[x^{j}(k)u^{l-j}(k)\right]=\mathbf{E}\left[x^{j}(k)\right]\mathbf{E}\left[u^{i-j}(k)\right]. (11)

With a limited number of agents, it might be possible to propose feedback control laws for each agent, i.e., the control input is a function of the states of all the agents. However with a large number of agents, it is no longer tractable. In our density steering problem, NN is infinity (for the problem of steering occupation measures, NN is still a large number). A feasible control law needs to satisfy that the control vector is independent of the current state vector. In the next part of section, we propose an algorithm for steering power moments to desired ones.

2.2 An empirical control scheme for the moment system

In the previous section, controlling the group of agents has been reduced to controlling the moment system corresponding to it. Then the task is now to figure out an algorithm to determine a sequence of (𝒰(0),,𝒰(K1))\left(\mathscr{U}(0),\cdots,\mathscr{U}(K-1)\right). Nonetheless, there exist two primary distinctions from conventional control problems. Firstly, in this problem, the system matrix of the moment system becomes a function of the control vector. Secondly, the sequence of elements in the control vector, denoted as 𝒰(k)\mathscr{U}(k), must adhere to the condition that the corresponding Hankel matrix

𝐇𝒰(k)=[𝐄[u0(k)]𝐄[u1(k)]𝐄[un(k)]𝐄[u1(k)]𝐄[u2(k)]𝐄[un+1(k)]𝐄[un(k)]𝐄[un+1(k)]𝐄[u2n(k)]]\bm{\mathrm{H}}_{\mathscr{U}(k)}=\begin{bmatrix}\mathbf{E}[u^{0}(k)]&\mathbf{E}[u^{1}(k)]&\cdots&\mathbf{E}[u^{n}(k)]\\ \mathbf{E}[u^{1}(k)]&\mathbf{E}[u^{2}(k)]&&\mathbf{E}[u^{n+1}(k)]\\ \vdots&\vdots&\ddots&\\ \mathbf{E}[u^{n}(k)]&\mathbf{E}[u^{n+1}(k)]&&\mathbf{E}[u^{2n}(k)]\end{bmatrix}

is positive definite. Here 𝐇𝒰\bm{\mathrm{H}}_{\mathscr{U}} denotes the Hankel matrix of the vector 𝒰\mathscr{U}. We define such a subspace of 2n\mathbb{R}^{2n} as 𝕍++2n:={𝒰2n𝐇𝒰(k)0}\mathbb{V}^{2n}_{++}:=\{\mathscr{U}\in\mathbb{R}^{2n}\mid\bm{\mathrm{H}}_{\mathscr{U}(k)}\succ 0\}.

In prior studies, the distribution steering problems have consistently employed optimal control strategies. However, such an approach is not entirely feasible in this specific problem. The challenge lies in the necessity to ensure that both 𝒳(k)\mathscr{X}(k) and 𝒰(k)\mathscr{U}(k) belong to 𝕍++2n\mathbb{V}^{2n}_{++}. To the best of our knowledge, there has yet to be a viable result capable of addressing the optimal control problem while imposing the constraint that states and control inputs must reside within 𝕍++2n\mathbb{V}^{2n}_{++} - in other words, ensuring that the corresponding Hankel matrices are positive definite.

Now we formulate the problem we are to treat in this part of section. Let 𝒰\mathcal{U} be the feasible set of control sequences 𝓤:=(𝒰(0),,𝒰(K1))\bm{\mathscr{U}}:=\left(\mathscr{U}(0),\cdots,\mathscr{U}(K-1)\right), which satisfies

k=0K1𝐄[𝒰(k)𝒰(k)]<\sum_{k=0}^{K-1}\mathbf{E}\left[\mathscr{U}^{\intercal}(k)\mathscr{U}(k)\right]<\infty

and effects the terminal system state x(K)x(K) to be distributed satisfying (10). Then the family 𝒰\mathcal{U} represents admissible control inputs which achieve the desired moment transfer. Denote the error of the moments from the specified ones as

e(k)=𝒳T𝒳(k),e(k)=\mathscr{X}_{T}-\mathscr{X}(k), (12)

where the elements of 𝒳T\mathscr{X}_{T} are the power moments corresponding to the specified terminal density function τ(x)\tau(x).

Simultaneously confining both 𝒰(k)\mathscr{U}(k) and 𝒳(k)\mathscr{X}(k) to belong to the set 𝕍++2n\mathbb{V}^{2n}_{++} is not always feasible. However, it’s worth noting that a sub-optimal solution to the control problem can be attained by first determining the trajectory of the state and subsequently obtaining the control inputs corresponding to this trajectory. A similar approach was also utilized in [25] to address the general distribution steering problem. We first determine the trajectory of the states.

Lemma 2.1.

Given

e(k0)=𝒳T𝒳(k0)𝕍++2n,e(k_{0})=\mathscr{X}_{T}-\mathscr{X}(k_{0})\in\mathbb{V}^{2n}_{++},

we have

𝒳(k)=𝒳(k0)+i=k0k1ωie(k0)𝕍++2n\mathscr{X}(k)=\mathscr{X}(k_{0})+\sum_{i=k_{0}}^{k-1}\omega_{i}e(k_{0})\in\mathbb{V}^{2n}_{++} (13)

for k=k0+1,,Kk=k_{0}+1,\cdots,K where ωi+\omega_{i}\in\mathbb{R}_{+} for i=k0,,K1i=k_{0},\cdots,K-1 and i=k0K1ωi=1\sum_{i=k_{0}}^{K-1}\omega_{i}=1.

Proof.

The proof is straightforward. Since 𝒳(k0),e(k0)𝕍++2n\mathscr{X}(k_{0}),e(k_{0})\in\mathbb{V}^{2n}_{++}, we have 𝐇𝒰(k)0\bm{\mathrm{H}}_{\mathscr{U}(k)}\succ 0, 𝐇e(k0)0\bm{\mathrm{H}}_{e(k_{0})}\succ 0. We note that the sum of positive definite matrices is still positive definite. Since ωi>0\omega_{i}>0, we have ωie(k0)𝕍++2n\omega_{i}e(k_{0})\in\mathbb{V}^{2n}_{++}. Then 𝒳(k)𝕍++2n\mathscr{X}(k)\in\mathbb{V}^{2n}_{++}. ∎

Now it remains to prove that there exists a time step k0k_{0} at which 𝒳T𝒳(k0)𝕍++2n\mathscr{X}_{T}-\mathscr{X}(k_{0})\in\mathbb{V}^{2n}_{++}.

Proposition 2.2.

There exists a time step k0k_{0} which satisfies 𝒳T𝒳(k0)𝕍++2n\mathscr{X}_{T}-\mathscr{X}(k_{0})\in\mathbb{V}^{2n}_{++}, assuming that 𝒳(k),0kk0\mathscr{X}(k),0\leq k\leq k_{0} are uncontrolled moment states, i.e., u(k)=0,0kk0u(k)=0,0\leq k\leq k_{0}.

Proof.

We write the Hankel matrix form of 𝒳T𝒳(k)\mathscr{X}_{T}-\mathscr{X}(k) as (14). Since u(k)=0,0kk0u(k)=0,0\leq k\leq k_{0}, we obtain (15).

𝐇𝒳T𝒳(k)\displaystyle\bm{\mathrm{H}}_{\mathscr{X}_{T}-\mathscr{X}(k)} (14)
=\displaystyle= [1𝐄[xT]𝐄[x(k)]𝐄[xTn]𝐄[xn(k)]𝐄[xT]𝐄[x(k)]𝐄[xT2]𝐄[x2(k)]𝐄[xTn+1]𝐄[xn+1(k)]𝐄[xTn]𝐄[xnk]𝐄[xTn+1]𝐄[xn+1(k)]𝐄[xT2n]𝐄[x2n(k)]]\displaystyle\begin{bmatrix}1&\mathbf{E}[x_{T}]-\mathbf{E}[x(k)]&\cdots&\mathbf{E}[x^{n}_{T}]-\mathbf{E}[x^{n}(k)]\\ \mathbf{E}[x_{T}]-\mathbf{E}[x(k)]&\mathbf{E}[x^{2}_{T}]-\mathbf{E}[x^{2}(k)]&\cdots&\mathbf{E}[x^{n+1}_{T}]-\mathbf{E}[x^{n+1}(k)]\\ \vdots&\vdots&\ddots&\\ \mathbf{E}[x^{n}_{T}]-\mathbf{E}[x^{n}{k}]&\mathbf{E}[x^{n+1}_{T}]-\mathbf{E}[x^{n+1}(k)]&&\mathbf{E}[x^{2n}_{T}]-\mathbf{E}[x^{2n}(k)]\end{bmatrix}
𝐇𝒳T𝒳(k0)=\displaystyle\bm{\mathrm{H}}_{\mathscr{X}_{T}-\mathscr{X}(k_{0})}= (15)
[1𝐄[xT]i=0k1a(i)𝐄[x(0)]𝐄[xTn](i=0k1a(i))n𝐄[xn(0)]𝐄[xT]i=0k1a(i)𝐄[x(0)]𝐄[xT2](i=0k1a(i))2𝐄[x2(0)]𝐄[xTn+1](i=0k1a(i))n+1𝐄[xn+1(0)]𝐄[xTn](i=0k1a(i))n𝐄[xn(0)]𝐄[xTn+1](i=0k1a(i))n+1𝐄[xn+1(0)]𝐄[xT2n](i=0k1a(i))2n𝐄[x2n(0)]]\displaystyle\begin{bmatrix}1&\mathbf{E}[x_{T}]-\prod_{i=0}^{k-1}a(i)\mathbf{E}[x(0)]&\cdots&\mathbf{E}[x^{n}_{T}]-\left(\prod_{i=0}^{k-1}a(i)\right)^{n}\mathbf{E}[x^{n}(0)]\\ \mathbf{E}[x_{T}]-\prod_{i=0}^{k-1}a(i)\mathbf{E}[x(0)]&\mathbf{E}[x^{2}_{T}]-\left(\prod_{i=0}^{k-1}a(i)\right)^{2}\mathbf{E}[x^{2}(0)]&\cdots&\mathbf{E}[x^{n+1}_{T}]-\left(\prod_{i=0}^{k-1}a(i)\right)^{n+1}\mathbf{E}[x^{n+1}(0)]\\ \vdots&\vdots&\ddots&\\ \mathbf{E}[x^{n}_{T}]-\left(\prod_{i=0}^{k-1}a(i)\right)^{n}\mathbf{E}[x^{n}(0)]&\mathbf{E}[x^{n+1}_{T}]-\left(\prod_{i=0}^{k-1}a(i)\right)^{n+1}\mathbf{E}[x^{n+1}(0)]&&\mathbf{E}[x^{2n}_{T}]-\left(\prod_{i=0}^{k-1}a(i)\right)^{2n}\mathbf{E}[x^{2n}(0)]\end{bmatrix}

Now it remains to prove that k0\exists k_{0}, 𝐇𝒳T𝒳(k0)0\bm{\mathrm{H}}_{\mathscr{X}_{T}-\mathscr{X}(k_{0})}\succ 0 with u(k)=0,0kk0u(k)=0,0\leq k\leq k_{0}. By definition of the positive definiteness, it is equivalent to prove that each leading principal minor, the determinant of leading principal submatrix, is positive. Denote the ithi_{\text{th}}-order leading principal submatrix of 𝐇𝒳T𝒳(k)\bm{\mathrm{H}}_{\mathscr{X}_{T}-\mathscr{X}(k)} as Hi(k)H_{i}(k), and the corresponding minor as det(Hi(k))\operatorname{det}(H_{i}(k)). We note that each det(Hi(k))\operatorname{det}(H_{i}(k)) is a polynomial of i=0k1a(i)\prod_{i=0}^{k-1}a(i), of which the degree is even. Therefore, if there exists no real zero for all the det(Hi(k))\operatorname{det}(H_{i}(k)), all k00k_{0}\in\mathbb{N}_{0} satisfies 𝐇𝒳T𝒳(k0)0\bm{\mathrm{H}}_{\mathscr{X}_{T}-\mathscr{X}(k_{0})}\succ 0. Now we consider the case that det(Hi(k))\operatorname{det}(H_{i}(k)) has at least a real zero in (0,1)(0,1). We note that det(Hi(k0))>0\operatorname{det}(H_{i}(k_{0}))>0 with k0+k_{0}\rightarrow+\infty. Let k˘i\breve{k}_{i} be the smallest integer greater than the largest zero of the polynomial det(Hi(k))\operatorname{det}(H_{i}(k)). By the continuity of det(Hi(k))\operatorname{det}(H_{i}(k)), we have that det(Hi(k))>0,k(k˘i,+)\operatorname{det}(H_{i}(k))>0,k\in(\breve{k}_{i},+\infty). Let k˘=maxi(k˘i)\breve{k}=\max_{i}(\breve{k}_{i}). With k0>k˘k_{0}>\breve{k}, we have Hi(k0)0H_{i}(k_{0})\succ 0, for i=1,,ni=1,\cdots,n. Therefore we have 𝐇𝒳T𝒳(k0)0\bm{\mathrm{H}}_{\mathscr{X}_{T}-\mathscr{X}(k_{0})}\succ 0, which ensures the positiveness of all Hi(k0)H_{i}(k_{0}) and completes the proof. ∎

By Proposition 2.2, it is possible to choose a time step k0k_{0} which satisfies 𝒳T𝒳(k0)𝕍++2n\mathscr{X}_{T}-\mathscr{X}(k_{0})\in\mathbb{V}^{2n}_{++}. We assume that the system is uncontrolled before k0k_{0}, i.e. u(k)=0,kk0u(k)=0,k\leq k_{0}. From step k0k_{0}, we impose controls on the system. Lemma 2.1 has proved the positiveness of 𝒳(k),k=k0,,K\mathscr{X}(k),k=k_{0},\cdots,K. Therefore it remains to determine the parameters ωk,k=k0,,K1\omega_{k},k=k_{0},\cdots,K-1 and the corresponding control inputs 𝒰(k)\mathscr{U}(k).

It is a non-trivial problem. We give an empirical scheme to treat it. To obtain a relatively smooth transition of states, it is desired that the ωi\omega_{i}’s are close to each other. It is usually feasible for us to choose

ωk0==ωK1=1Kk0\omega_{k_{0}}=\cdots=\omega_{K-1}=\frac{1}{K-k_{0}}

We note that to determine the trajectory of the system state a priori has also been adopted in the literature. In [25], prespecified system states, which was called to form a reference trajectory, were used to determine the control inputs. After that the parameters ωis\omega_{i}^{\prime}s are determined, the control inputs of the moment system 𝒰(i)\mathscr{U}(i) for i=k0,,K1i=k_{0},\cdots,K-1 can then be calculated by solving the equation (4), provided with 𝒳(k),k=k0+1,,K\mathscr{X}(k),k=k_{0}+1,\cdots,K calculated by (13).

However sometimes the control inputs 𝒰(k)𝕍++2n\mathscr{U}(k)\notin\mathbb{V}^{2n}_{++} by choosing the ωi\omega_{i}’s to be all equal. It usually happens when the specified initial/terminal density has several modes (peaks). If so, we can choose a larger ω0/ωK1\omega_{0}/\omega_{K-1}.

In conclusion, we have proposed to use the moment system to control the vast group of agents characterized as a probability density function. And an empirical control law has been proposed which ensures the existence of u(k)u(k). However, it is only feasible for us to obtain the power moments of the control inputs u(k)u(k). We need to obtain the u(k)u(k) given its power moments, which we call realization of the control inputs.

2.3 Realization of the control inputs

In this part of the section, we are to realize the probability density of u(k)u(k) given the power moments of the designed controls 𝒰(k)\mathscr{U}(k) for the moment system.

For the sake of simplicity, we omit kk if there is no ambiguity in the following part of this section. The problem now comes to proposing an estimation algorithm which is feasible of estimating the probability density of which the power moments are as specified.

We typically examine two scenarios for controller design. The first scenario involves unconstrained control, i.e., uu is supported across the entire real line. The second scenario is constrained control, where we consider uu to be supported on a compact interval within \mathbb{R}. These situations correspond to two distinct types of moment problems we aim to address: the Hamburger moment problem for the first scenario and the Hausdorff moment problem for the second one.

Let 𝒟\mathcal{D} denote the space of probability density functions on the real line with support there. Let 𝒟2n\mathcal{D}_{2n} be the subset of all p𝒟p\in\mathcal{D} that have at least 2n2n finite moments (in addition to 𝐄[u0(k)]\mathbf{E}[u^{0}(k)], which is 1 by definition). The Kullback-Leibler distance is then defined as

𝕂𝕃(rp)=r(u)logr(u)p(u)du\mathbb{K}\mathbb{L}(r\|p)=\int_{\mathbb{R}}r(u)\log\frac{r(u)}{p(u)}du (16)

where rr is an arbitrary reference probability density in 𝒟\mathcal{D}. According to the type of the control problem, 𝒞\mathcal{C} is either \mathbb{R} (unconstrained control), or a compact interval on \mathbb{R} (constrained control). We define

B(u)=[1uun1un]B(u)=\begin{bmatrix}1&u&\cdots&u^{n-1}&u^{n}\end{bmatrix}^{\intercal}

and the linear integral operator

Θ:p(u)Σ=B(u)p(u)B(u)𝑑u,\Theta:p(u)\mapsto\Sigma=\int_{\mathbb{R}}B(u)p(u)B^{\intercal}(u)du,

where p(u)p(u) is defined on the space 𝒟2n\mathcal{D}_{2n}. Moreover, let

Σ=[1𝐄[u]𝐄[un]𝐄[u]𝐄[u2]𝐄[un+1]𝐄[un]𝐄[un+1]𝐄[u2n]]\Sigma=\begin{bmatrix}1&\mathbf{E}[u]&\cdots&\mathbf{E}[u^{n}]\\ \mathbf{E}[u]&\mathbf{E}[u^{2}]&\cdots&\mathbf{E}[u^{n+1}]\\ \vdots&\vdots&\ddots&\\ \mathbf{E}[u^{n}]&\mathbf{E}[u^{n+1}]&&\mathbf{E}[u^{2n}]\end{bmatrix}

where 𝐄[ui],i=1,,2n\mathbf{E}[u^{i}],i=1,\cdots,2n, are the entries of the designed control 𝒰\mathscr{U}. Moreover, since 𝒟2n\mathcal{D}_{2n} is convex, then so is range(Θ)=Θ𝒟2n\operatorname{range}(\Theta)=\Theta\mathcal{D}_{2n}.

Let

+:={Λrange(Θ)B(u)ΛB(u)>0,u}.\mathcal{L}_{+}:=\left\{\Lambda\in\operatorname{range}(\Theta)\mid B(u)^{\intercal}\Lambda B(u)>0,u\in\mathbb{R}\right\}.

For any r𝒟r\in\mathcal{D} and any positive definite Σ\Sigma, there exists a unique p^𝒟2n\hat{p}\in\mathcal{D}_{2n} that minimizes (16) subject to Θ(p^)=Σ\Theta(\hat{p})=\Sigma. Specifically,

p^=rBΛ^B\hat{p}=\frac{r}{B^{\intercal}\hat{\Lambda}B} (17)

where Λ^\hat{\Lambda} is the unique solution to the minimization problem

𝕁r(Λ):=tr(ΛΣ)r(u)log[B(u)ΛB(u)]𝑑u\mathbb{J}_{r}(\Lambda):=\operatorname{tr}(\Lambda\Sigma)-\int_{\mathbb{R}}r(u)\log\left[B(u)^{\intercal}\Lambda B(u)\right]du (18)

over all Λ+\Lambda\in\mathcal{L}_{+} [29]. Here, tr()\operatorname{tr}(\cdot) denotes the trace of the matrix.

Then the density estimation is formulated as a convex optimization problem. Unlike other methods of moments, the power moments of our proposed density estimate are exactly identical to those specified, which makes it a satisfactory approach for realization of the control inputs. Since the prior density r(u)r(u) and the density estimate p^(u)\hat{p}(u) are both supported on \mathbb{R}, r(u)r(u) can be chosen as a Gaussian distribution (or a Cauchy distribution if p^(u)\hat{p}(u) is assumed to be heavy-tailed).

We note that for the Hausdorff moment problem, where the control uu is supported on a compact subset of \mathbb{R}, the proofs and results for the Hamburger moment problem are also valid by substituting the domain \mathbb{R} by a compact interval on the real line. By doing this, we will obtain a density estimate p^\hat{p} in the form of (17). The prior r(u)r(u) can then be chosen as a truncated Gaussian or a truncated Laplacian.

We conclude the algorithm for density steering of the vast group of agents in this section as in Algorithm 1.

Algorithm 1 Density Steering of a Vast Group of Agents.
1:The maximal time step KK; the parameter of the system a(k)a(k) for k=0,,K1k=0,\cdots,K-1; the initial system density q0(x)q_{0}(x); the specified terminal density τ(x)\tau(x).
2:The controls u(k)u(k), k=0,,K1k=0,\cdots,K-1.
3:k0k\Leftarrow 0
4:while k<Kk<K and e(k)𝕍++2ne(k)\notin\mathbb{V}^{2n}_{++} do
5:     Calculate 𝒳(k)\mathscr{X}(k) by (4) if k>0k>0 or by (5) if k=0k=0
6:     Calculate e(k)e(k) by (12)
7:     if e(k)𝕍++2ne(k)\in\mathbb{V}^{2n}_{++} then
8:         Calculate the states of the moment system 𝒳(i)\mathscr{X}(i) for i=k+1,,K1i=k+1,\cdots,K-1 by (13) with ωk==ωK1\omega_{k}=\cdots=\omega_{K-1}
9:         Calculate the controls of the moment system 𝒰(i)\mathscr{U}(i) for i=k,,K1i=k,\cdots,K-1 by (4)
10:         if i,𝒰(i)𝕍++2n\exists i,\mathscr{U}(i)\notin\mathbb{V}^{2n}_{++} then
11:              Back to Step 8, adjust ωk,,ωK1\omega_{k},\cdots,\omega_{K-1}
12:         end if
13:         Optimize the cost function (18) and obtain the analytic estimates of the densities p^i(u)\hat{p}_{i}(u) for i=k,,K1i=k,\cdots,K-1
14:     else
15:         u(k)=0u(k)=0
16:     end if
17:     Calculate the power moments of the system state x(k+1)x(k+1), i.e., 𝒳(k+1)\mathscr{X}(k+1)
18:     kk+1k\Leftarrow k+1
19:end while

2.4 Convergence and error analyses of the terminal density function

Through the proposed density steering scheme, the first 2n2n-order power moments of the terminal density function match those of the desired one. As nn approaches infinity, the full moment sequence of the terminal density converges to that of the desired density. By Theorem 4.5.5 in [12], we can straightforwardly demonstrate that the terminal distribution obtained from our distribution steering algorithm converges almost everywhere to the true desired distribution as nn tends to infinity. Assuming both the initial and desired terminal distributions are continuous, we can further conclude that the terminal density generated by the proposed algorithm converges to the desired distribution. This convergence to the desired density distinguishes our proposed algorithm from other existing methods.

Since we used the truncated power moments of the initial and terminal density functions for steering, there may exist an error between the terminal density and the desired one. We will propose an upper bound of error of the terminal density, as to characterize the maximal difference between the terminal density by our proposed algorithm and the specified one.

In [29], we proposed an error upper bound for the Hamburger moment problem in the sense of total variation distance, which is a measure widely used in the moment problem [27, 26].

The total variation distance between the terminal density qK(x)q_{K}(x) and the desired terminal density τ(x)\tau(x) is defined as follows:

V(qK,τ)=supx|(,x](qKτ)𝑑x|=supx|FqKFτ|\displaystyle V(q_{K},\tau)=\sup_{x}\left|\int_{\left(-\infty,x\right]}(q_{K}-\tau)dx\right|=\sup_{x}\left|F_{q_{K}}-F_{\tau}\right|

where FqKF_{q_{K}} and FτF_{\tau} are the two distribution functions of qKq_{K} and τ\tau.

Shannon-entropy is used to calculate the upper bound of the total variation distance in [27]. The Shannon-entropy [23] is defined as

H[q]=𝒞q(x)logq(x)𝑑x.H[q]=-\int_{\mathcal{C}}q(x)\log q(x)dx.

We first introduce the Shannon-entropy maximizing distribution Fq˘KF_{\breve{q}_{K}}, of which the moments are the power moments of qKq_{K}. It has the following density function [15],

q˘K(x)=exp(i=02nλixi)\breve{q}_{K}(x)=\exp\left(-\sum_{i=0}^{2n}\lambda_{i}x^{i}\right)

where λ0,,λ2n\lambda_{0},\cdots,\lambda_{2n} are determined by the following constraints,

𝒞xkexp(i=02nλixi)𝑑x=𝒞xkτ(x)𝑑x\int_{\mathcal{C}}x^{k}\exp\left(-\sum_{i=0}^{2n}\lambda_{i}x^{i}\right)dx=\int_{\mathcal{C}}x^{k}\tau(x)dx

for k=0,1,,2nk=0,1,\cdots,2n. By referring to [27], the KL distance between the true density and the Shannon-entropy maximizing density can be written as

KL(τq˘K)=\displaystyle KL\left(\tau\|\breve{q}_{K}\right)= 𝒞τ(x)logτ(x)q˘K(x)dx\displaystyle\int_{\mathcal{\mathcal{C}}}\tau(x)\log\frac{\tau(x)}{\breve{q}_{K}(x)}dx
=\displaystyle= H[τ]+i=02nλi𝒞xkτ(x)𝑑x\displaystyle-H\left[\tau\right]+\sum_{i=0}^{2n}\lambda_{i}\int_{\mathcal{C}}x^{k}\tau(x)dx
=\displaystyle= H[q˘K]H[τ].\displaystyle H\left[\breve{q}_{K}\right]-H\left[\tau\right].

Similarly, we can obtain KL(τqK)=H[qK]H[τ]KL\left(\tau\|q_{K}\right)=H\left[q_{K}\right]-H\left[\tau\right].

By [16, 27], we obtain

V(q˘K,qK)\displaystyle V\left(\breve{q}_{K},q_{K}\right) 3[1+{1+49KL(qKq˘K)}1/2]1/2\displaystyle\leq 3\left[-1+\left\{1+\frac{4}{9}KL\left(q_{K}\|\breve{q}_{K}\right)\right\}^{1/2}\right]^{1/2}
=3[1+{1+49(H[q˘K]H[qK])}1/2]1/2\displaystyle=3\left[-1+\left\{1+\frac{4}{9}\left(H\left[\breve{q}_{K}\right]-H\left[q_{K}\right]\right)\right\}^{1/2}\right]^{1/2}

and

V(q˘K,τ)3[1+{1+49(H[q˘K]H[τ])}1/2]1/2V\left(\breve{q}_{K},\tau\right)\leq 3\left[-1+\left\{1+\frac{4}{9}\left(H\left[\breve{q}_{K}\right]-H\left[\tau\right]\right)\right\}^{1/2}\right]^{1/2}

Then the error upper bound of the terminal density can be written as

V(qK,τ)\displaystyle V\left(q_{K},\tau\right)
=\displaystyle= supx|FqK(x)Fτ(x)|\displaystyle\sup_{x}|F_{q_{K}}\left(x\right)-F_{\tau}\left(x\right)|
\displaystyle\leq supx(|FqK(x)Fq˘K(x)|+|Fq˘K(x)Fτ(x)|)\displaystyle\sup_{x}\left(\left|F_{q_{K}}\left(x\right)-F_{\breve{q}_{K}}\left(x\right)\right|+\left|F_{\breve{q}_{K}}(x)-F_{\tau}(x)\right|\right)
\displaystyle\leq supx|FqK(x)Fq˘K(x)|+supx|Fq˘K(x)Fτ(x)|\displaystyle\sup_{x}\left|F_{q_{K}}\left(x\right)-F_{\breve{q}_{K}}\left(x\right)\right|+\sup_{x}\left|F_{\breve{q}_{K}}(x)-F_{\tau}(x)\right|
\displaystyle\leq 3[1+{1+49(H[q˘K]H[qK])}1/2]1/2\displaystyle 3\left[-1+\left\{1+\frac{4}{9}\left(H\left[\breve{q}_{K}\right]-H\left[q_{K}\right]\right)\right\}^{1/2}\right]^{1/2}
+\displaystyle+ 3[1+{1+49(H[q˘K]H[τ])}1/2]1/2\displaystyle 3\left[-1+\left\{1+\frac{4}{9}\left(H\left[\breve{q}_{K}\right]-H\left[\tau\right]\right)\right\}^{1/2}\right]^{1/2}

3 Steer the group as an occupation measure

In the preceding section, we introduced an algorithm designed to guide a vast group of agents, represented as a probability density function, to a specified terminal state. However, the characterization of agents as a density function is an approximation when the number of agents tends to infinity. To apply our proposed algorithm to real-world scenarios, where the number of agents is finite, we must address individual agents and provide specific control inputs for each. In this section, we employ the occupation measure to describe the agent group and present a control scheme aimed at steering an initial occupation measure towards a terminal one.

We first define the occupation measure [30] of the agents at time step kk by

dμk(x,u)=1Ni=1Nδ(xxi(k))δ(uui(k))dxdu.d\mu_{k}(x,u)=\frac{1}{N}\sum_{i=1}^{N}\delta\left(x-x_{i}(k)\right)\delta\left(u-u_{i}(k)\right)dxdu.

Then the occupation measure of the group state can be written as

dqk(x)=1Ni=1Nδ(xxi(k))dxdq_{k}(x)=\frac{1}{N}\sum_{i=1}^{N}\delta\left(x-x_{i}(k)\right)dx (19)

which is a marginal measure of dμk(x,u)d\mu_{k}(x,u).

Another marginal measure, the occupation measure of the controls on the group of agents reads

dpk(u)=1Ni=1Nδ(uui(k))dudp_{k}(u)=\frac{1}{N}\sum_{i=1}^{N}\delta\left(u-u_{i}(k)\right)du (20)

The power moments of the occupation measures can then be written as

𝐄[xl(k)]=x𝑑qk(x)=1Ni=1Nxil(k),\mathbf{E}\left[x^{l}(k)\right]=\int_{\mathbb{R}}xdq_{k}(x)=\frac{1}{N}\sum_{i=1}^{N}x^{l}_{i}(k), (21)

and

𝐄[ul(k)]=u𝑑pk(u)=1Ni=1Nuil(k).\mathbf{E}\left[u^{l}(k)\right]=\int_{\mathbb{R}}udp_{k}(u)=\frac{1}{N}\sum_{i=1}^{N}u^{l}_{i}(k). (22)

Now we define the occupation measure steering by power moments, which we will treat in this section.

The dynamics of the moment system is

𝒳(k+1)=𝒜(𝒰(k))𝒳(k)+𝒰(k).\mathscr{X}(k+1)=\mathscr{A}(\mathscr{U}(k))\mathscr{X}(k)+\mathscr{U}(k).

where 𝒳(k),𝒰(k)\mathscr{X}(k),\mathscr{U}(k) are defined as (21),(22). Given an arbitrary initial state x(0)x(0), determine the control sequence (ui(0),,ui(K1))\left(u_{i}(0),\cdots,u_{i}(K-1)\right) for each agent ii such that the power moments of the terminal occupation measure are identical to those of an arbitrarily specified state, i.e.,

𝐄[xTl]=xlτ(x)𝑑x=1Ni=1Nxil(K)\mathbf{E}\left[x_{T}^{l}\right]=\int_{\mathbb{R}}x^{l}\tau(x)dx=\frac{1}{N}\sum_{i=1}^{N}x_{i}^{l}(K) (23)

for l=1,,2nl=1,\cdots,2n, where τ(x)\tau(x) is the specified terminal density.

The main difference of the occupation measure steering problem from the density steering one lies in determining the control input for each agent. We naturally consider designing feedback control laws for the agents. It might be feasible with limited number of agents. However it is quite expensive and problematic with quite a large number of agents as in our problem. Moreover, as to implement the feedback control, we need to obtain the states of each agent and calculate the control inputs based on all of them, which requires us to install numerous sensors on the agents and we have to treat the issues of the communications between the agents. However, by our proposed algorithm using moments, it is not necessary to collect all states of the agents and to transmit them back to the center to calculate the control inputs at all time steps.

Since NN is large, we consider first estimating the occupation measure of u(k)u(k) as a continuous function p^(u)\hat{p}(u). Then we draw NN i.i.d samples from it and assign them to uiu_{i}, i.e., uip^(u),i0,iNu_{i}\sim\hat{p}(u),i\in\mathbb{N}_{0},i\leq N. By the strong law of large numbers, we note that

1Ni=1Nuil(k)a.s.ulp^k(u)𝑑u,withN+,\frac{1}{N}\sum_{i=1}^{N}u^{l}_{i}(k)\stackrel{{\scriptstyle a.s.}}{{\longrightarrow}}\int_{\mathbb{R}}u^{l}\hat{p}_{k}(u)du,\ \text{with}\ N\rightarrow+\infty, (24)

which means that the power moments of u(k)u(k) converge almost surely to the power moments 𝒰(k)\mathscr{U}(k) of the designed controls. Moreover, the sampling strategy ensures that the system state x(k)x(k) and the control input u(k)u(k) are independent from each other, hence (11) is satisfied. Then the problem comes to putting forward a sampling strategy. We consider using the acceptance-rejection sampling [5] strategy for this task.

The idea of acceptance-rejection sampling is that, even if it is not possible to directly sample from ρ^\hat{\rho}, there exists another density ρ~\tilde{\rho}, from which it is easy to sample. The task can be reduced to sampling from ρ~\tilde{\rho} directly and then rejecting the samples in a strategic way to make the remaining samples seemingly drawn from ρ^\hat{\rho}. We call the density ρ~\tilde{\rho} the ”candidate density” and ρ^\hat{\rho} the ”target density”.

We assume that

c=supxsupp(ρ^)ρ^ρ~<c=\sup_{x\in\operatorname{supp}(\hat{\rho})}\frac{\hat{\rho}}{\tilde{\rho}}<\infty (25)

and that we can calculate cc. In our paper, the support of both ρ^\hat{\rho} and ρ~\tilde{\rho} is 𝒞\mathcal{C}. As to satisfy (25), the candidate density needs to have heavier tails than the target density. Then we give the sampling algorithm in Algorithm 2.

Algorithm 2 Sample ui(k)u_{i}(k) from the Realized Control.
1:The number of agents N0N\in\mathbb{N}_{0}; the realized control ρ^k(u)\hat{\rho}_{k}(u); the candidate density ρ~k(u)\tilde{\rho}_{k}(u)
2:The controls ui(k),i=1,,Nu_{i}(k),i=1,\cdots,N
3:i1i\Leftarrow 1
4:while iNi\leq N do
5:     Sample an rir_{i} from a uniform distribution U[0,1]U[0,1]
6:     Sample a usupp(ρ^)u\in\operatorname{supp}(\hat{\rho}) from the candidate density ρ~\tilde{\rho}
7:     if uρ^cρ~u\leq\frac{\hat{\rho}}{c\tilde{\rho}} then
8:         ui(k)uu_{i}(k)\Leftarrow u
9:     else
10:         back to step 3
11:     end if
12:     ii+1i\Leftarrow i+1
13:end while

We note that in real applications, the controls ui(k),i=1,,Nu_{i}(k),i=1,\cdots,N are sometimes bounded. In previous results, the problem has not been treated by the feedback control laws given the domain of the system state being the whole \mathbb{R}. By the proposed Algorithm 2, we simply need to choose ρ^\hat{\rho} and the corresponding ρ~\tilde{\rho} to be truncated densities supported on specified bounded intervals.

We now adopt the proposed acceptance-rejection sampling strategy to update Algorithm 1 to treat the occupation measure steering problem, leading to Algorithm 3.

Algorithm 3 Steering Occupation Measure for a Large Group of Agents
1:The number of agents N0N\in\mathbb{N}_{0}; the maximal time step KK; the system parameter a(k)a(k) for k=0,,K1k=0,\cdots,K-1; the initial occupation measure dq0(x)dq_{0}(x); the specified terminal occupation measure dτ(x)d\tau(x).
2:Control inputs for the ithi_{\text{th}} target ui(k)u_{i}(k), k=0,,K1k=0,\cdots,K-1, i=1,,Ni=1,\cdots,N.
3:k0k\Leftarrow 0
4:while k<Kk<K and e(k)𝕍++2ne(k)\notin\mathbb{V}^{2n}_{++} do
5:     Calculate 𝒳(k)\mathscr{X}(k) using (4) if k>0k>0 or (5) if k=0k=0
6:     Calculate e(k)e(k) using (12)
7:     if e(k)𝕍++2ne(k)\in\mathbb{V}^{2n}_{++} then
8:         Calculate the states of the moment system 𝒳(i)\mathscr{X}(i) for i=k+1,,K1i=k+1,\cdots,K-1 using (13) with ωk==ωK1\omega_{k}=\cdots=\omega_{K-1}
9:         Calculate the controls of the moment system 𝒰(i)\mathscr{U}(i) for i=k,,K1i=k,\cdots,K-1 using (4)
10:         if i,𝒰(i)𝕍++2n\exists i,\mathscr{U}(i)\notin\mathbb{V}^{2n}_{++} then
11:              Go back to Step 8, adjust ωk,,ωK1\omega_{k},\cdots,\omega_{K-1}
12:         end if
13:         Optimize the cost function (18) and obtain the analytic estimates of the densities p^i(u)\hat{p}_{i}(u) for i=k,,K1i=k,\cdots,K-1
14:         Execute Algorithm 2 and obtain the control inputs ui(j)u_{i}(j) for all agents at time step j=k,,K1j=k,\cdots,K-1
15:     else
16:         ui(k)=0,i=1,,Nu_{i}(k)=0,i=1,\cdots,N
17:     end if
18:     Calculate the power moments of the system state x(k+1)x(k+1), i.e., 𝒳(k+1)\mathscr{X}(k+1)
19:     kk+1k\Leftarrow k+1
20:end while

4 Numerical examples

In the previous sections, we proposed algorithms for steering a vast group of agents, either characterized as a probability density function or an occupation measure. In this section, we perform numerical simulations on different types of distributions, supported on \mathbb{R} or a compact subset of \mathbb{R}, with multiple modes or a single mode, to validate our proposed algorithms.

We first simulate the steering of a vast group of agents as a probability density function. We begin with unconstrained control, i.e., the control inputs u(k)u(k) are not constrained.

4.1 Unconstrained density steering of a vast group of agents

In Example 1, we simulate a typical problem which is to steer a Gaussian density to another in four steps. The initial Gaussian density is chosen as

q0(x)=12πex22,q_{0}(x)=\frac{1}{\sqrt{2\pi}}e^{\frac{x^{2}}{2}}, (26)

and the terminal density is specified as

τ(x)=12π2e(x1)2222.\tau(x)=\frac{1}{\sqrt{2\pi}\cdot 2}e^{\frac{(x-1)^{2}}{2\cdot 2^{2}}}. (27)

The system parameters a(k),k=0,,3a(k),k=0,\cdots,3 are i.i.d. samples drawn from the uniform distribution U[0.5,0.7]U[0.5,0.7]. The states of the moment system, i.e., 𝒳(k)\mathscr{X}(k) for k=0,1,2,3k=0,1,2,3 are given in Figure 1. The controls of the moment system, i.e., 𝒰(k)\mathscr{U}(k) for k=0,1,2,3k=0,1,2,3 are given in Figure 2. We note that by our proposed algorithm, 𝒳(k),𝒰(k)𝕍++2n\mathscr{X}(k),\mathscr{U}(k)\in\mathbb{V}^{2n}_{++}, which makes it feasible for us to realize the controls. The realized control inputs are given in Figure 3. The transition of the control inputs is smooth, which is satisfactory.

Refer to caption
Figure 1: 𝒳(k)\mathscr{X}(k) at time steps k=0,1,2,3k=0,1,2,3. The upper left figure shows 𝐄[x(k)]\mathbf{E}\left[x(k)\right]. The upper right one shows 𝐄[x2(k)]\mathbf{E}\left[x^{2}(k)\right]. The lower left one shows 𝐄[x3(k)]\mathbf{E}\left[x^{3}(k)\right] and the lower right one shows 𝐄[x4(k)]\mathbf{E}\left[x^{4}(k)\right].
Refer to caption
Figure 2: 𝒰(k)\mathscr{U}(k) at time steps k=0,1,2,3k=0,1,2,3. The upper left figure shows 𝐄[u(k)]\mathbf{E}\left[u(k)\right]. The upper right one shows 𝐄[u2(k)]\mathbf{E}\left[u^{2}(k)\right]. The lower left one shows 𝐄[u3(k)]\mathbf{E}\left[u^{3}(k)\right] and the lower right one shows 𝐄[u4(k)]\mathbf{E}\left[u^{4}(k)\right].
Refer to caption
Figure 3: Realized control inputs u(k)u(k) by 𝒰(k)\mathscr{U}(k) for k=0,1,2,3k=0,1,2,3, which are obtained by our proposed control scheme.

In Example 2, we simulate a steering problem in three steps where the initial density function is a Gaussian and the terminal density function is a mixture of generalized logistic distributions. The initial one is chosen as

q0(x)=12πex22.q_{0}(x)=\frac{1}{\sqrt{2\pi}}e^{\frac{x^{2}}{2}}.

and the terminal one is specified as

τ(x)=0.42ex+1(1+ex+1)3+0.63ex2(1+ex2)4.\tau(x)=\frac{0.4\cdot 2e^{-x+1}}{(1+e^{-x+1})^{3}}+\frac{0.6\cdot 3e^{-x-2}}{(1+e^{-x-2})^{4}}. (28)

The system parameters a(k),k=0,,3a(k),k=0,\cdots,3 are i.i.d. samples drawn from the uniform distribution U[0.5,0.7]U[0.5,0.7]. The states of the moment system, i.e., 𝒳(k)\mathscr{X}(k) for k=0,1,2,3k=0,1,2,3 are given in Figure 4. The controls of the moment system, i.e., 𝒰(k)\mathscr{U}(k) for k=0,1,2,3k=0,1,2,3 are given in Figure 5. The realized controls in Figure 6 also show that the transition of the control inputs is smooth, even the specified terminal density is complicated.

Refer to caption
Figure 4: 𝒳(k)\mathscr{X}(k) at time steps k=0,1,2,3k=0,1,2,3.
Refer to caption
Figure 5: 𝒰(k)\mathscr{U}(k) at time steps k=0,1,2,3k=0,1,2,3.
Refer to caption
Figure 6: Realized control inputs u(k)u(k).

In Example 3, we will simulate a steering problem in four steps which has not been treated in the previous papers. The initial density is chosen as a Gaussian distribution

q0(x)=12πex22.q_{0}(x)=\frac{1}{\sqrt{2\pi}}e^{\frac{x^{2}}{2}}.

However the terminal density function is specified as a multi-modal density which is a mixture of two Laplacians

τ(x)=0.72e|x1|+0.32e|x+3|.\tau(x)=\frac{0.7}{2}e^{\left|x-1\right|}+\frac{0.3}{2}e^{-\left|x+3\right|}.

The system parameters a(k),k=0,,3a(k),k=0,\cdots,3 are i.i.d. samples drawn from the uniform distribution U[0.5,0.7]U[0.5,0.7]. The results are given in Figure 7, 8 and 9. In this example, the two modes are not close to each other as in Example 2. However the realized control inputs are still smooth, which validates the performance of our algorithm in steering a group to separate groups relatively far from each other.

Refer to caption
Figure 7: 𝒳(k)\mathscr{X}(k) at time steps k=0,1,2,3k=0,1,2,3.
Refer to caption
Figure 8: 𝒰(k)\mathscr{U}(k) at time steps k=0,1,2,3k=0,1,2,3.
Refer to caption
Figure 9: Realized control inputs u(k)u(k).

4.2 Constrained density steering of a vast group of agents

In some scenarios, there are boundary conditions on the control inputs. A common constraint is that they are bounded by a compact interval [a,b][a,b] on \mathbb{R}. With our proposed algorithm, we are able to treat the density steering problem with bounded control inputs.

In Example 4, we will simulate a problem which is sort of problematic however important in real practice. The initial and the terminal densities are both chosen as a mixture of two Gaussian densities, which are

q0(x)=0.52π2ex2222+0.52π2e(x+1)2222,q_{0}(x)=\frac{0.5}{\sqrt{2\pi}\cdot 2}e^{\frac{x^{2}}{2\cdot 2^{2}}}+\frac{0.5}{\sqrt{2\pi}\cdot 2}e^{\frac{(x+1)^{2}}{2\cdot 2^{2}}},

and

τ(x)=0.42πe(x1)22+0.62πe(x+1)22.\tau(x)=\frac{0.4}{\sqrt{2\pi}}e^{\frac{(x-1)^{2}}{2}}+\frac{0.6}{\sqrt{2\pi}}e^{\frac{(x+1)^{2}}{2}}.

The density steering is performed in five steps. The control inputs are bounded on the interval 𝒞=[2,2]\mathcal{C}=[-2,2]. The system parameters a(k),k=0,,4a(k),k=0,\cdots,4 are i.i.d. samples drawn from the uniform distribution U[0.5,0.7]U[0.5,0.7].

The results are given in Figure 10, 11 and 12. We note that at time step k=0k=0, 𝒳T𝒳(0)𝕍++2n\mathscr{X}_{T}-\mathscr{X}(0)\notin\mathbb{V}^{2n}_{++}. Then by the proposed algorithm, we don’t apply control inputs to the agents, which is shown in Figure 11. At time step k=1k=1, we have 𝒳T𝒳(1)𝕍++2n\mathscr{X}_{T}-\mathscr{X}(1)\in\mathbb{V}^{2n}_{++}. Hence we start applying controls starting from this step. It takes three steps for us to steer the density to the specified one.

The goal of the steering in this problem is to steer two distinct groups of agents to two specified terminal groups. The boundary of control inputs and the multiple modality of both initial and terminal densities make the problem a challenging one. By our proposed algorithm, we give a solution to this problem and the realized control inputs are still smooth, which is a satisfactory performance.

Refer to caption
Figure 10: 𝒳(k)\mathscr{X}(k) at time steps k=0,1,2,3,4k=0,1,2,3,4.
Refer to caption
Figure 11: 𝒰(k)\mathscr{U}(k) at time steps k=0,1,2,3,4k=0,1,2,3,4.
Refer to caption
Figure 12: Realized control inputs u(k)u(k) at time steps k=1,2,3,4k=1,2,3,4. The agents are uncontrolled at k=0k=0, i.e., u(0)=0u(0)=0.

Because of (3), we are not able to obtain an analytic qK(x)q_{K}(x) by the proposed Algorithm 1, which makes it infeasible to directly compare between qK(x)q_{K}(x) and τ(x)\tau(x). However fortunately, our proposed algorithm by using the moments is able to treat the occupation measure steering problem. The analytic function u(k)u(k) at each time step is realized as control inputs ui(k)u_{i}(k) of each agent ii. We can therefore obtain a terminal occupation measure to compare to the specified one. In the following part of this section, we simulate the occupation measure steering examples.

4.3 Occupation measure steering of a vast group of agents

We simulate examples on occupation measure steering in this part of section. In Example 5, we steer 20002000 agents to a specified occupation measure. The initial states of each agent xix_{i} is drawn i.i.d. from the Gaussian distribution (26). The specified terminal occupation measure consists of 20002000 i.i.d. samples drawn from the terminal distribution (27). The system parameter a(k),k=0,,3a(k),k=0,\cdots,3 are i.i.d. samples drawn from the uniform distribution U[0.5,0.7]U[0.5,0.7].

Refer to caption
Figure 13: The histograms of ui(k)u_{i}(k) at time step kk for each agent ii. The upper left and right figures are ui(0)u_{i}(0) and ui(1),i=1,,2000u_{i}(1),i=1,\cdots,2000 respectively. The lower left and right figures are ui(2)u_{i}(2) and ui(3)u_{i}(3) respectively.
Refer to caption
Figure 14: The histograms of the system states x(k)x(k) of 2000 Monte-Carlo simulations at time step k=0,1,2,3k=0,1,2,3, together with the desired terminal distribution (27). We note that the histogram of system state at step k=3k=3 is very close to (27). The transition of the states of the agents is also smooth.

Figure 13 displays histograms of ui(k)u_{i}(k) at time step kk for each agent ii. These histograms consist of 20002000 i.i.d samples drawn from the realizations of 𝒰(k)\mathscr{U}(k) shown in Figure 3. Additionally, Figure 14 illustrates the histograms of the occupation measure for the states of the 20002000 agents at time steps k=0,1,2,3k=0,1,2,3. The power moments of order 11 to 44 for the terminal occupation measure, obtained using our proposed algorithm, are 1.21,5.41,14.44,78.771.21,5.41,14.44,78.77 respectively. Notably, these values are close to the desired terminal distribution, with power moments of order 11 to 44 being 1,5,13,731,5,13,73 respectively. This outcome validates the effectiveness of our algorithm in steering the occupation measure.

In Example 6, we steer 20002000 agents to a specified occupation measure. The initial states of each agent xix_{i} is drawn i.i.d. from the Gaussian distribution (26). The specified terminal occupation measure consists of 20002000 i.i.d. samples drawn from the terminal distribution (28), of which the density is a mixture of two Gaussian densities. The system parameter a(k),k=0,,3a(k),k=0,\cdots,3 are i.i.d. samples drawn from the uniform distribution U[0.5,0.7]U[0.5,0.7].

The histograms of ui(k)u_{i}(k) at time step kk for each agent ii are given in Figure 15. They are 20002000 i.i.d samples drawn from the realizations of 𝒰(k)\mathscr{U}(k) in Figure 6. Figure 16 gives the histograms of the occupation measure of the states of the 20002000 agents at k=0,1,2,3k=0,1,2,3. We note that the power moments of order 11 to 44 of (28) are 0.5,3.88,8.8,52.80.5,3.88,8.8,52.8 respectively. The power moments of order 11 to 44 of the terminal occupation measure by our proposed algorithm are 0.54,3.97,9.0,52.790.54,3.97,9.0,52.79 respectively, which are quite close to the specified ones. This example validates the ability of the proposed scheme in treating distribution steering problems with complicated distributions.

Refer to caption
Figure 15: The histograms of ui(k)u_{i}(k) at time step kk for each agent ii. The upper left and right figures are ui(0)u_{i}(0) and ui(1),i=1,,2000u_{i}(1),i=1,\cdots,2000 respectively. The lower left and right figures are ui(2)u_{i}(2) and ui(3)u_{i}(3) respectively.
Refer to caption
Figure 16: The histograms of the system states x(k)x(k) of 2000 Monte-Carlo simulations at time step k=0,1,2,3k=0,1,2,3, together with the desired terminal distribution 28.

5 Concluding remarks

In this paper, we propose to use moments for the problem of steering a vast group of agents. The vast group of agents are characterized as probability density functions and occupation measures. We first treat the density steering problem. Without assuming the initial and terminal density to fall within specific function classes, the original problem is infinite-dimensional and intractable. As to treat this problem, we propose a moment system representation of the original system, and reduce the original problem to the control of the moment system. Different from the conventional control problems, the elements of the control inputs are in the system matrix, and we have to ensure the Hankel matrix of the control inputs at each time step to be positive definite. Since it is not treatable by the existing control schemes including the commonly used optimal control, we propose an empirical control scheme to treat this problem. By doing this, it is feasible for us to realize the control inputs for the original system. The previous results were presented at IFAC World Congress 2023 [28]. In this paper, we consider more fundamental issues of the group steering problems and propose algorithms for real-world group steering problems. We propose that with the number of moment terms used approaching infinity, the terminal distribution obtained by the proposed scheme is almost everywhere equal to the desired one. We also propose an error upper bound of the terminal density by using our proposed algorithm, in the sense of the total variation distance. Our algorithm of steering an arbitrary initial distribution to another arbitrary one in limited steps for the discrete-time first-order ODE systems, without assuming their function classes, is the first one in the literature. Based on the proposed density-steering algorithm, we put forward an algorithm steering an arbitrary occupation measure representing the vast group of agents to another arbitrary one. We use acceptance-rejection sampling to draw i.i.d. samples from the realized control inputs u(k)u(k) and assign them to each agents. By doing this, the control inputs are independent of the current states of each agent.

This paper treats the group steering problem of the first-order time-variant linear system. However, the problem is much more complicated for multi-order and multivariate systems. For first-order systems, the control inputs u(k)u(k)’s are one-dimensional density functions. Hence the positive definiteness of the Hankel matrix of each 𝒰(k)\mathscr{U}(k) is the necessary and sufficient condition of the existence of u(k)u(k) [22]. However for multi-dimensional densities, it is no longer valid. To derive control inputs 𝒰(k)\mathscr{U}(k)’s for the moment system then to realize them will be a challenging problem. Results in subjects e.g. mathematical moment problem and real algebraic geometry shall be used to treat the group steering problem for those systems.

References

  • [1] Efstathios Bakolas and Alexandros Tsolovikos. Greedy finite-horizon covariance steering for discrete-time stochastic nonlinear systems based on the unscented transform. In 2020 American Control Conference (ACC), pages 3595–3600. IEEE, 2020.
  • [2] Isin M Balci and Efstathios Bakolas. Covariance steering of discrete-time stochastic linear systems based on wasserstein distance terminal cost. IEEE Control Systems Letters, 5(6):2000–2005, 2020.
  • [3] Shiba Biswal, Karthik Elamvazhuthi, and Spring Berman. Stabilization of nonlinear discrete-time systems to target measures using stochastic feedback laws. IEEE Transactions on Automatic Control, 66(5):1957–1972, 2020.
  • [4] Shiba Biswal, Karthik Elamvazhuthi, and Spring Berman. Decentralized control of multi-agent systems using local density feedback. IEEE Transactions on Automatic Control, 2021.
  • [5] Paolo Brandimarte. Handbook in Monte Carlo simulation: applications in financial engineering, risk management, and economics. John Wiley & Sons, 2014.
  • [6] Roger W Brockett. Optimal control of the liouville equation. AMS IP Studies in Advanced Mathematics, 39:23, 2007.
  • [7] Christopher I Byrnes and Anders Lindquist. A convex optimization approach to generalized moment problems. In Control and modeling of complex systems, pages 3–21. Springer, 2003.
  • [8] Kenneth F Caluya and Abhishek Halder. Reflected schrödinger bridge: Density control with path constraints. In 2021 American Control Conference (ACC), pages 1137–1142. IEEE, 2021.
  • [9] Yongxin Chen, Tryphon T Georgiou, and Michele Pavon. Optimal steering of a linear stochastic system to a final probability distribution, part i. IEEE Transactions on Automatic Control, 61(5):1158–1169, 2015.
  • [10] Yongxin Chen, Tryphon T Georgiou, and Michele Pavon. Optimal steering of a linear stochastic system to a final probability distribution, part ii. IEEE Transactions on Automatic Control, 61(5):1170–1180, 2015.
  • [11] Yongxin Chen, Tryphon T Georgiou, and Michele Pavon. Optimal steering of a linear stochastic system to a final probability distribution—part iii. IEEE Transactions on Automatic Control, 63(9):3112–3118, 2018.
  • [12] Kai Lai Chung. A course in probability theory. Academic press, 2001.
  • [13] Karthik Elamvazhuthi and Spring Berman. Density stabilization strategies for nonholonomic agents on compact manifolds. IEEE Transactions on Automatic Control, 2023.
  • [14] Tryphon T Georgiou and Anders Lindquist. Kullback-leibler approximation of spectral density functions. IEEE Transactions on Information Theory, 49(11):2910–2917, 2003.
  • [15] Jagat Narain Kapur and Hiremaglur K Kesavan. Entropy optimization principles and their applications. In Entropy and energy dissipation in water resources, pages 3–20. Springer, 1992.
  • [16] S. Kullback. Correction to a lower bound for discrimination information in terms of variation. IEEE Transactions on Information Theory, 16(5):652–652, 1970.
  • [17] Fengjiao Liu and Panagiotis Tsiotras. Optimal covariance steering for continuous-time linear stochastic systems with martingale additive noise. IEEE Transactions on Automatic Control, 2023.
  • [18] Kazuhide Okamoto, Maxim Goldshtein, and Panagiotis Tsiotras. Optimal covariance control for stochastic systems under chance constraints. IEEE Control Systems Letters, 2(2):266–271, 2018.
  • [19] Kazuhide Okamoto and Panagiotis Tsiotras. Optimal stochastic vehicle path planning using covariance steering. IEEE Robotics and Automation Letters, 4(3):2276–2281, 2019.
  • [20] Joshua Pilipovsky and Panagiotis Tsiotras. Covariance steering with optimal risk allocation. IEEE Transactions on Aerospace and Electronic Systems, 57(6):3719–3733, 2021.
  • [21] Augustinos D Saravanos, Isin M Balci, Efstathios Bakolas, and Evangelos A Theodorou. Distributed model predictive covariance steering. arXiv preprint arXiv:2212.00398, 2022.
  • [22] Konrad Schmüdgen. The moment problem, volume 14. Springer, 2017.
  • [23] Claude Elwood Shannon. A mathematical theory of communication. The Bell system technical journal, 27(3):379–423, 1948.
  • [24] Carlo Sinigaglia, Andrea Manzoni, Francesco Braghin, and Spring Berman. Robust optimal density control of robotic swarms. arXiv preprint arXiv:2205.12592, 2022.
  • [25] Vignesh Sivaramakrishnan, Joshua Pilipovsky, Meeko Oishi, and Panagiotis Tsiotras. Distribution steering for discrete-time linear systems with general disturbances using characteristic functions. In 2022 American Control Conference (ACC), pages 4183–4190. IEEE, 2022.
  • [26] Aldo Tagliani. Maximum entropy solutions and moment problem in unbounded domains. Applied mathematics letters, 16(4):519–524, 2003.
  • [27] Aldo Tagliani. A note on proximity of distributions in terms of coinciding moments. Applied Mathematics and Computation, 145(2-3):195–203, 2003.
  • [28] Guangyu Wu and Anders Lindquist. Density steering by power moments. IFAC-PapersOnLine, 56(2):3423–3428, 2023.
  • [29] Guangyu Wu and Anders Lindquist. Non-Gaussian Bayesian filtering by density parametrization using power moments. Automatica, 153:111061, 2023.
  • [30] Silun Zhang, Axel Ringh, Xiaoming Hu, and Johan Karlsson. Modeling collective behaviors: A moment-based approach. IEEE Transactions on Automatic Control, 66(1):33–48, 2020.
  • [31] Dongliang Zheng, Jack Ridderhof, Panagiotis Tsiotras, and Ali-akbar Agha-mohammadi. Belief space planning: A covariance steering approach. In 2022 International Conference on Robotics and Automation (ICRA), pages 11051–11057. IEEE, 2022.
  • [32] Albert Nikolaevič Širâev, Ralph Philip Boas, and Dmitrij Mihajlovič Čibisov. Probability-1. Springer, 2016.
[Uncaptioned image]

Guangyu Wu received the B.E. degree from Northwestern Polytechnical University, Xi’an, China, in 2013, and two M.S. degrees, one in control science and engineering from Shanghai Jiao Tong University, Shanghai, China, in 2016, and the other in electrical engineering from the University of Notre Dame, South Bend, USA, in 2018.

He is currently pursuing the Ph.D. degree at Shanghai Jiao Tong University. His research interests are the moment problem and its applications to stochastic filtering, stochastic control and statistics.

[Uncaptioned image]

Anders Lindquist received the Ph.D. degree in optimization and systems theory from the Royal Institute of Technology, Stockholm, Sweden, in 1972, and an honorary doctorate (Doctor Scientiarum Honoris Causa) from Technion (Israel Institute of Technology) in 2010.

He is currently a Zhiyuan Chair Professor at Shanghai Jiao Tong University, China, and Professor Emeritus at the Royal Institute of Technology (KTH), Stockholm, Sweden. Before that he had a full academic career in the United States, after which he was appointed to the Chair of Optimization and Systems at KTH. Dr. Lindquist is a Member of the Royal Swedish Academy of Engineering Sciences, a Foreign Member of the Chinese Academy of Sciences, a Foreign Member of the Russian Academy of Natural Sciences, a Member of Academia Europaea (Academy of Europe), an Honorary Member the Hungarian Operations Research Society, a Life Fellow of IEEE, a Fellow of SIAM, and a Fellow of IFAC. He received the 2003 George S. Axelby Outstanding Paper Award, the 2009 Reid Prize in Mathematics from SIAM, and the 2020 IEEE Control Systems Award, the IEEE field award in Systems and Control.