This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Collision-Free Shepherding Control of a Single Target within a Swarmthanks: This work was supported in part by JSPS KAKENHI Grant Number JP21H01352.

Yaosheng Deng, Aiyi Li, Masaki Ogura, and Naoki Wakamiya Graduate School of Information Science and Technology, Osaka University,
Suita, Osaka 565-0871, Japan
{ys-deng, li-aiyi, m-ogura, wakamiya}@ist.osaka-u.ac.jp
Abstract

The shepherding problem refers to guiding a group of agents (called sheep) to a specific destination using an external agent with repulsive forces (called shepherd). Although various movement algorithms for the shepherd have been explored in the literature, there is a scarcity of methodologies for selective guidance, which is a key technology for precise swarm control. Therefore, this study investigates the problem of guiding a single target sheep within a swarm to a given destination using a shepherd. We first present our model of the dynamics of sheep agents and the interaction between sheep and shepherd agents. The model is shown to be well-defined with no collision if the interaction magnitude between sheep and shepherd is reasonably limited. Based on the analysis with Lyapunov stability principles, we design a shepherd control law to guide the target sheep to the origin while avoiding collisions among sheep agents. Experimental results demonstrate the effectiveness of the proposed method in guiding the target sheep in both small and large scale swarms.

Index Terms:
Shepherding control, Nonlinear control, Lyapunov method.

I Introduction

It has long been observed that various biological individuals tend to exhibit swarming behavior. Examples of swarms include flocks of birds, schools of fish, herds of animals, and colonies of bacteria [1, 2], and engineering applications include the formation control of multi-robot teams and autonomous aircraft. Many studies [3, 4, 5] have investigated the relevant properties of swarm motion based on the attraction-repulsion swarm model and its applications in areas such as UAV swarms and multi-robot systems.

As research on swarm intelligence advances, there has been increasing attention to the use of an external special agent (called shepherd) to control the movement of swarm agents (called sheep). This process is referred to as shepherding [6]. The shepherding model specifically refers to the situation in which “sheep” agents avoid a “shepherd” agent while also interacting with other sheep, according to a swarm model. In this context, we can find various methods [7, 8, 9] to guide a whole swarm of sheep agents.

There is emerging interest in guiding a part of the swarm (e.g., a target sheep) to the goal because of its potential engineering applications in the context of micro-/nano-robotic swarms [10, 11]. For example, Deptula et al. [12] proposed a control algorithm for a shepherd to guide a targeted sheep agent towards its destination. Licitra et al. [13] designed a controller for herding one sheep within a swarm to its destination without considering swarm interaction forces in the dynamics model. Using the framework of reinforcement learning, Sebastián and Montijano [14] developed a method for guiding a few agents within a swarm with heterogeneous agent dynamics. Le et al. [15] proposed a method to use a shepherd to regulate a sheep within a small swarm. Zhang et al. [16] designed an algorithm for capturing and regulating a portion of a swarm using multiple shepherds.

However, the shepherd control methods mentioned above [12, 13, 14, 15, 16] do not consider collisions within the swarm; their algorithms assume that no pairs of agents in the swarm collide. Because the occurrence of a collision can disrupt an entire system and lead to failed tasks [17, 18, 19], it is practically important to develop a shepherding control method with guaranteed collision-free properties. Although several studies have proposed distributed [20, 21] and leader-based [22] control methods for swarm systems with collision-free guarantees, these studies have not been directly extended to the context of shepherding control. Therefore, to design a stable shepherd controller to achieve certain control objectives, the first consideration must be to carefully analyze the collision properties of shepherding swarms.

In this study, the control objective is to use a shepherd agent to guide a sheep agent (target) in a swarm to the origin (goal). First, the repulsion-bounded swarm model is described. We then carefully analyze and rigorously prove that under relatively mild conditions, sheep in the swarm do not collide, even under the influence of a shepherd. Based on this analysis, we use the Lyapunov stability principle to design a motion controller for the shepherd to guide the target to the endpoint (i.e., the origin). The effectiveness of the proposed method is illustrated by numerical simulations, in which we compare it with a baseline strategy based on the farthest agent-targeting strategy presented in [23].

This paper is organized as follows: Section II describes the shepherding swarm model and the problem for controlling one target to the origin; in Section III we analyze the collision-free property of the shepherding swarm model; in Section IV we design a shepherd motion controller to guide a specific target to the destination. Section V provides several numerical simulations that highlight the shepherding behavior.

II Problem statement

We consider a swarm of N>2N>2 individuals (sheep) and one herder (shepherd) in 2\mathbb{R}^{2}. The positions of sheep ii and shepherd at time t0t\geq 0 are denoted by xi(t)2x_{i}(t)\in\mathbb{R}^{2} and y(t)2y(t)\in\mathbb{R}^{2}, respectively. Let 𝒩={1,2,,N}\mathcal{N}=\{1,2,...,N\}.

We assume that, for all i𝒩i\in\mathcal{N}, the velocity x˙i\dot{x}_{i} of the iith sheep is specified by a linear combination of three components:

x˙i=fai+fbi+fyi,\dot{x}_{i}=f_{ai}+f_{bi}+f_{yi}, (1)

where faif_{ai} and fbif_{bi} represent the attraction and repulsion functions from the other sheep to the iith sheep, respectively, and fyif_{yi} represents the function from the shepherd to the iith sheep.

Below, we describe how faif_{ai}, fbif_{bi}, and fyif_{yi} are constructed. First, the attraction function of the iith sheep is given by:

fai=j𝒩\{i}ϕa(xixj),f_{ai}=\sum_{j\in\mathcal{N}{\backslash\{i\}}}\phi_{a}(x_{i}-x_{j}), (2)

where the function ϕa:22\phi_{a}\colon\mathbb{R}^{2}\to\mathbb{R}^{2} is defined as

ϕa(x)=max\phi_{a}({x})=-m_{a}x (3)

for a positive constant mam_{a}. Then, the bounded repulsion function of the iith sheep is given by

fbi=j𝒩\{i}ϕb(xixj),f_{bi}=\sum_{j\in\mathcal{N}{\backslash\{i\}}}\phi_{b}(x_{i}-x_{j}), (4)

where the function ϕb:22\phi_{b}\colon\mathbb{R}^{2}\to\mathbb{R}^{2} is defined as

ϕb(x)={mbxx2,if x>b,Mbxx,if xb,\phi_{b}({x})=\begin{cases}m_{b}\dfrac{{x}}{\lVert{x}\rVert^{2}},&\mbox{if }\lVert{x}\rVert>\ell_{b},\vspace{2mm}\\ M_{b}\dfrac{{x}}{\lVert{x}\rVert},&\mbox{if }\lVert{x}\rVert\leq\ell_{b},\end{cases} (5)

for positive constants b\ell_{b}, mbm_{b}, and MbM_{b} satisfying

Mb=mbb,b<R.M_{b}=\frac{m_{b}}{\ell_{b}},\quad\ell_{b}<R. (6)

Finally, we define fyif_{yi} as

fyi={γ1xiyxiy2,if xiy>y,γ2xiyxiy2g(xiy),if xiyy,f_{yi}=\begin{cases}\gamma_{1}\dfrac{x_{i}-y}{\lVert x_{i}-y\rVert^{2}},\vspace{2mm}&\mbox{if $\lVert x_{i}-y\rVert>\ell_{y}$,}\\ \displaystyle\gamma_{2}\dfrac{x_{i}-y}{\lVert x_{i}-y\rVert^{2}}g(\lVert x_{i}-y\rVert),&\mbox{if $\lVert x_{i}-y\rVert\leq\ell_{y}$,}\end{cases} (7)

where the function g:[0,)[0,)g\colon[0,\infty)\to[0,\infty) is defined by:

g(d)={0,if d=0,exp(1/d),otherwiseg({d})=\begin{cases}0,&\mbox{if $d=0$,}\\ \exp\left({-{1}/{d}}\right),&\mbox{otherwise}\end{cases} (8)

and the parameters γ1\gamma_{1}, γ2\gamma_{2}, and y\ell_{y} are positive constants that satisfy

γ2=γ1g1(y).\gamma_{2}=\gamma_{1}g^{-1}(\ell_{y}). (9)

We can now state the problem studied in this paper.

Problem 1.

Assume that the shepherd knows all sheep positions and can calculate the interaction force faT+fbTf_{aT}+f_{bT} of the target sheep T𝒩T\in\mathcal{N}. Design a controller for the velocity of the shepherd such that the target sheep is asymptotically guided to the origin.

III Collision-Free Condition

In this section, we present a sufficient condition that guarantees that no pairs of sheep will collide within a swarm. We begin by presenting the main results of this section. Within the theorem, the maximum shepherd force for one sheep is defined as follows:

fymax=γ1y,f_{y}^{\max}=\frac{\gamma_{1}}{\ell_{y}}, (10)

which plays an important role in this process.

Theorem 1.

Suppose that

xi(0)xj(0)x_{i}(0)\neq x_{j}(0) (11)

for all distinct pairs (i,j)𝒩×𝒩(i,j)\in\mathcal{N}\times\mathcal{N}. If

fymax<Mb,f_{y}^{\max}<M_{b}, (12)

then

xi(t)xj(t)x_{i}(t)\neq x_{j}(t) (13)

for all distinct pairs (i,j)𝒩×𝒩(i,j)\in\mathcal{N}\times\mathcal{N} and t>0t>0.

In the remainder of this section, we present a proof of Theorem 1 using the following notation. For all i,j𝒩i,j\in\mathcal{N}, let

δij=xixj.\delta_{ij}=\lVert x_{i}-x_{j}\rVert. (14)

The first collision time of the system is defined as follows:

t1=min{t0mini,j𝒩,ijδij(t)=0}.t_{1}=\min\{t\geq 0\mid\min_{i,j\in\mathcal{N},\,i\neq j}\delta_{ij}(t)=0\}. (15)

Set 𝒞\mathcal{C} is defined as

𝒞={,if t1=,{i𝒩δij(t1)=0 for some ji},otherwise.\displaystyle\mathcal{C}=\begin{cases}\emptyset,&\mbox{if $t_{1}=\infty$,}\\ \{i\in\mathcal{N}\mid\delta_{ij}(t_{1})=0\mbox{ for some }j\neq i\},&\mbox{otherwise.}\end{cases} (16)

We then define the complement set as

=𝒩𝒞.\mathcal{R}=\mathcal{N}\setminus\mathcal{C}. (17)

We prove Theorem 1 by contradiction. First, the proof is outlined. Let us introduce the nonnegative function

𝒳2=i,j𝒞,ijδij2.\mathcal{X}_{2}=\sum_{i,j\in\mathcal{C},i\neq j}\delta_{ij}^{2}. (18)

If a finite-time collision occurs, that is, if the conclusion of Theorem 1 is violated, then t1t_{1} is finite; therefore, we have 𝒳2(t1)=0\mathcal{X}_{2}(t_{1})=0. Conversely, as will be shown later, a careful analysis of the derivative of 𝒳2(t)\mathcal{X}_{2}(t) allows us to show that function 𝒳2(t)\mathcal{X}_{2}(t) increases on the interval [t1τ,t1)[t_{1}-\tau,t_{1}) for a sufficiently small τ>0\tau>0 under inequality (12). This observation leads to 𝒳2(t)\mathcal{X}_{2}(t) being negative on the interval, which contradicts the intrinsic nonnegativity of function 𝒳2\mathcal{X}_{2}.

To proceed further, we first evaluate the derivative of 𝒳2\mathcal{X}_{2}. A straightforward calculation shows that the derivative can be decomposed as follows:

𝒳˙2=1+2+3+4+5,\dot{\mathcal{X}}_{2}=\mathcal{I}_{1}+\mathcal{I}_{2}+\mathcal{I}_{3}+\mathcal{I}_{4}+\mathcal{I}_{5}, (19)

where

1\displaystyle\mathcal{I}_{1} =\displaystyle= 2i,j𝒞ij(xixj)(fyifyj),\displaystyle 2\sum_{\begin{smallmatrix}i,j\in\mathcal{C}\\ i\neq j\end{smallmatrix}}(x_{i}-x_{j})^{\top}(f_{yi}-f_{yj}), (20)
2\displaystyle\mathcal{I}_{2} =\displaystyle= 2i,j,k𝒞ij,jk(xixj)(ϕa(xixk)+ϕb(xixk)),\displaystyle 2\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j,j\neq k\end{smallmatrix}}(x_{i}-x_{j})^{\top}\left(\phi_{a}(x_{i}-x_{k})+\phi_{b}(x_{i}-x_{k})\right),
3\displaystyle\mathcal{I}_{3} =\displaystyle= 2i,j,k𝒞ij,jk(xjxi)(ϕa(xjxk)+ϕb(xjxk)),\displaystyle 2\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j,j\neq k\end{smallmatrix}}(x_{j}-x_{i})^{\top}\left(\phi_{a}(x_{j}-x_{k})+\phi_{b}(x_{j}-x_{k})\right),
4\displaystyle\mathcal{I}_{4} =\displaystyle= 2i,j𝒞k,ij(xixj)(ϕa(xixk)+ϕb(xixk)),\displaystyle 2\sum_{\begin{smallmatrix}i,j\in\mathcal{C}\\ k\in\mathcal{R},i\neq j\end{smallmatrix}}(x_{i}-x_{j})^{\top}\left(\phi_{a}(x_{i}-x_{k})+\phi_{b}(x_{i}-x_{k})\right),
5\displaystyle\mathcal{I}_{5} =\displaystyle= 2i,j𝒞k,ij(xjxi)(ϕa(xjxk)+ϕb(xjxk)).\displaystyle 2\sum_{\begin{smallmatrix}i,j\in\mathcal{C}\\ k\in\mathcal{R},i\neq j\end{smallmatrix}}(x_{j}-x_{i})^{\top}\left(\phi_{a}(x_{j}-x_{k})+\phi_{b}(x_{j}-x_{k})\right).

Because the symmetry [24] of these expressions with respect to indices ii and jj allows us to show 2=3\mathcal{I}_{2}=\mathcal{I}_{3} and 4=5\mathcal{I}_{4}=\mathcal{I}_{5}, we obtain

𝒳˙2=1+22+24.\dot{\mathcal{X}}_{2}=\mathcal{I}_{1}+2\mathcal{I}_{2}+2\mathcal{I}_{4}. (21)

To further evaluate the derivative 𝒳˙2\dot{\mathcal{X}}_{2}, we provide lower bounds for 2\mathcal{I}_{2} and 4\mathcal{I}_{4}. A lower bound for 2\mathcal{I}_{2} is presented in the following proposition:

Proposition 2.

Let t0t\geq 0 be arbitrary. If

δ𝒞,max(t)=maxi,j𝒞,ijδij(t)\delta_{\mathcal{C},\max}(t)=\max_{i,j\in\mathcal{C},i\neq j}\delta_{ij}(t) (22)

satisfies

δ𝒞,max(t)b,\delta_{\mathcal{C},\max}(t)\leq\ell_{b}, (23)

then

2(t)Mb|𝒞|𝒳1(t)ma|𝒞|𝒳12(t),\mathcal{I}_{2}(t)\geq{M_{b}\left|\mathcal{C}\right|}\mathcal{X}_{1}(t)-{m_{a}\left|\mathcal{C}\right|}\mathcal{X}_{1}^{2}(t), (24)

where 𝒳1\mathcal{X}_{1} is defined as

𝒳1=i,j𝒞,ijδij.\mathcal{X}_{1}=\sum_{i,j\in\mathcal{C},i\neq j}\delta_{ij}. (25)
Proof.

We omit the time variable tt for the simplicity of notations within the proof. If 𝒞=\mathcal{C}=\emptyset, the inequality holds vacuously. We assume that 𝒞\mathcal{C} is nonempty. By using (3), (5), and (24), we can show that

2=2i,j,k𝒞ijk(xixj)(ma(xkxi)Mbxkxixkxi).\begin{multlined}\mathcal{I}_{2}=\!2\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j\neq k\end{smallmatrix}}(x_{i}-x_{j})^{\top}\Bigg{(}m_{a}(x_{k}-x_{i})-M_{b}\frac{x_{k}-x_{i}}{\lVert x_{k}-x_{i}\rVert}\Bigg{)}.\end{multlined}\mathcal{I}_{2}=\!2\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j\neq k\end{smallmatrix}}(x_{i}-x_{j})^{\top}\Bigg{(}m_{a}(x_{k}-x_{i})-M_{b}\frac{x_{k}-x_{i}}{\lVert x_{k}-x_{i}\rVert}\Bigg{)}. (26)

By exchanging indices ii and kk, we obtain

2=2i,j,k𝒞ijk(xkxj)(ma(xixk)Mbxixkxkxi).\begin{multlined}\mathcal{I}_{2}=2\!\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j\neq k\end{smallmatrix}}(x_{k}-x_{j})^{\top}\Bigg{(}m_{a}(x_{i}-x_{k})-M_{b}\frac{x_{i}-x_{k}}{\lVert x_{k}-x_{i}\rVert}\Bigg{)}.\end{multlined}\mathcal{I}_{2}=2\!\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j\neq k\end{smallmatrix}}(x_{k}-x_{j})^{\top}\Bigg{(}m_{a}(x_{i}-x_{k})-M_{b}\frac{x_{i}-x_{k}}{\lVert x_{k}-x_{i}\rVert}\Bigg{)}. (27)

Therefore,

2=i,j,k𝒞ijk(xixj)(ma(xkxi)Mbxkxixkxi)+i,j,k𝒞ijk(xkxj)(ma(xixk)Mbxixkxkxi).\begin{multlined}\mathcal{I}_{2}=\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j\neq k\end{smallmatrix}}(x_{i}-x_{j})^{\top}\Bigg{(}m_{a}(x_{k}-x_{i})-M_{b}\frac{x_{k}-x_{i}}{\lVert x_{k}-x_{i}\rVert}\Bigg{)}\\ +\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j\neq k\end{smallmatrix}}(x_{k}-x_{j})^{\top}\Bigg{(}m_{a}(x_{i}-x_{k})-M_{b}\frac{x_{i}-x_{k}}{\lVert x_{k}-x_{i}\rVert}\Bigg{)}.\end{multlined}\mathcal{I}_{2}=\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j\neq k\end{smallmatrix}}(x_{i}-x_{j})^{\top}\Bigg{(}m_{a}(x_{k}-x_{i})-M_{b}\frac{x_{k}-x_{i}}{\lVert x_{k}-x_{i}\rVert}\Bigg{)}\\ +\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j\neq k\end{smallmatrix}}(x_{k}-x_{j})^{\top}\Bigg{(}m_{a}(x_{i}-x_{k})-M_{b}\frac{x_{i}-x_{k}}{\lVert x_{k}-x_{i}\rVert}\Bigg{)}. (28)

Then, we can further simplify 2\mathcal{I}_{2} as

2\displaystyle\mathcal{I}_{2} =i,j,k𝒞ijk(maxkxi2+Mbxkxi)\displaystyle=\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j\neq k\end{smallmatrix}}\bigl{(}-m_{a}\lVert x_{k}-x_{i}\rVert^{2}+M_{b}\lVert x_{k}-x_{i}\rVert\bigr{)} (29)
=Mb|𝒞|𝒳1(t)ma|𝒞|𝒳2(t),\displaystyle={{M_{b}\left|\mathcal{C}\right|}\mathcal{X}_{1}(t)-{m_{a}\left|\mathcal{C}\right|}\mathcal{X}_{2}(t)},

which proves the desired inequality (24) using the trivial inequality 𝒳2𝒳12\mathcal{X}_{2}\leq\mathcal{X}_{1}^{2}. ∎

Then, we derive a lower bound for 4\mathcal{I}_{4}. We begin by presenting the following technical lemma. The proof of the lemma 3 is omitted due to space limitations.

Lemma 3.

Let t0t\geq 0 and i,j𝒞i,j\in\mathcal{C} be arbitrary. Assume iji\neq j. Define

a(t)=(xi(t)xj(t))ϕa(xi(t)xk(t))+(xj(t)xi(t))ϕa(xj(t)xk(t))\begin{multlined}\mathcal{I}_{a}(t)=(x_{i}(t)-x_{j}(t))^{\top}\phi_{a}(x_{i}(t)-x_{k}(t))\\ +(x_{j}(t)-x_{i}(t))^{\top}\phi_{a}(x_{j}(t)-x_{k}(t))\end{multlined}\mathcal{I}_{a}(t)=(x_{i}(t)-x_{j}(t))^{\top}\phi_{a}(x_{i}(t)-x_{k}(t))\\ +(x_{j}(t)-x_{i}(t))^{\top}\phi_{a}(x_{j}(t)-x_{k}(t)) (30)

and

b(t)=12((xi(t)xj(t))ϕb(xk(t)xi(t))+(xj(t)xi(t))ϕb(xk(t)xj(t))).\begin{multlined}\mathcal{I}_{b}(t)=-\frac{1}{2}\Big{(}(x_{i}(t)-x_{j}(t))^{\top}\phi_{b}(x_{k}(t)-x_{i}(t))\\ +(x_{j}(t)-x_{i}(t))^{\top}\phi_{b}(x_{k}(t)-x_{j}(t))\Big{)}.\end{multlined}\mathcal{I}_{b}(t)=-\frac{1}{2}\Big{(}(x_{i}(t)-x_{j}(t))^{\top}\phi_{b}(x_{k}(t)-x_{i}(t))\\ +(x_{j}(t)-x_{i}(t))^{\top}\phi_{b}(x_{k}(t)-x_{j}(t))\Big{)}. (31)

Then, we have

a(t)=ma2δij2(t)\mathcal{I}_{a}(t)=-\frac{m_{a}}{2}\delta_{ij}^{2}(t) (32)

and

b(t){0, if xk(t)xi(t)<b and xk(t)xj(t)<b,mb2(1b1b+δ𝒞,max(t))δij(t),if xk(t)xi(t)b and xk(t)xj(t)b,mb2(1b1b+δ𝒞,max(t))δij(t),if xk(t)xi(t)b and xk(t)xj(t)b.\begin{multlined}\mathcal{I}_{b}(t)\geq\\ \begin{cases}0,\mbox{ if $\lVert x_{k}(t)-x_{i}(t)\rVert<\ell_{b}$ and~{}$\lVert x_{k}(t)-x_{j}(t)\rVert<\ell_{b}$,}\!\!\\ \displaystyle-\frac{m_{b}}{2}\left(\frac{1}{\ell_{b}}-\frac{1}{\ell_{b}+\delta_{\mathcal{C},\max}(t)}\right)\delta_{ij}(t),\\ \quad\mbox{if $\lVert x_{k}(t)-x_{i}(t)\rVert\geq\ell_{b}$ and $\lVert x_{k}(t)-x_{j}(t)\rVert\geq\ell_{b}$,}\\ \displaystyle-\frac{m_{b}}{2}\left(\frac{1}{\ell_{b}}-\frac{1}{\ell_{b}+\delta_{\mathcal{C},\max}(t)}\right)\delta_{ij}(t),\\ \quad\mbox{if $\lVert x_{k}(t)-x_{i}(t)\rVert\geq\ell_{b}$ and~{}$\lVert x_{k}(t)-x_{j}(t)\rVert\leq\ell_{b}$.}\end{cases}\end{multlined}\mathcal{I}_{b}(t)\geq\\ \begin{cases}0,\mbox{ if $\lVert x_{k}(t)-x_{i}(t)\rVert<\ell_{b}$ and~{}$\lVert x_{k}(t)-x_{j}(t)\rVert<\ell_{b}$,}\!\!\\ \displaystyle-\frac{m_{b}}{2}\left(\frac{1}{\ell_{b}}-\frac{1}{\ell_{b}+\delta_{\mathcal{C},\max}(t)}\right)\delta_{ij}(t),\\ \quad\mbox{if $\lVert x_{k}(t)-x_{i}(t)\rVert\geq\ell_{b}$ and $\lVert x_{k}(t)-x_{j}(t)\rVert\geq\ell_{b}$,}\\ \displaystyle-\frac{m_{b}}{2}\left(\frac{1}{\ell_{b}}-\frac{1}{\ell_{b}+\delta_{\mathcal{C},\max}(t)}\right)\delta_{ij}(t),\\ \quad\mbox{if $\lVert x_{k}(t)-x_{i}(t)\rVert\geq\ell_{b}$ and~{}$\lVert x_{k}(t)-x_{j}(t)\rVert\leq\ell_{b}$.}\end{cases} (33)

Using Lemma 3, we can prove the following lower bound for 4\mathcal{I}_{4}. The proof of the proposition 4 is omitted due to space limitations.

Proposition 4.

Let t0t\geq 0 be arbitrary. If inequality (23) holds, then

4(t)||(MbMbbb+δ𝒞,max(t)+ma𝒳1(t))𝒳1(t).\mathcal{I}_{4}(t)\geq-\lvert\mathcal{R}\rvert\left({M_{b}}-\frac{M_{b}\ell_{b}}{\ell_{b}+\delta_{\mathcal{C},\max}(t)}+m_{a}\mathcal{X}_{1}(t)\right)\mathcal{X}_{1}(t). (34)

We are now ready to prove Theorem 1.

Proof of Theorem 1.

Assume that inequality (12) holds; i.e., fymax<Mbf_{y}^{\max}<M_{b}. Under this assumption, we need to show that set 𝒞\mathcal{C} is empty. Let us assume the contrary to derive a contradiction. Let us assume that 𝒞\mathcal{C} is nonempty. Then, we have

|𝒞|2.\lvert\mathcal{C}\rvert\geq 2. (35)

Further, t1t_{1} is finite and we have limtt1𝒳1(t)=0\lim_{t\xrightarrow{}t_{1}^{-}}\mathcal{X}_{1}(t)=0 and

limtt1(1bb+δ𝒞,max(t))=0.\lim_{t\xrightarrow{}t_{1}^{-}}\left(1-\frac{\ell_{b}}{\ell_{b}+\delta_{\mathcal{C},\max}(t)}\right)=0. (36)

Hence, for ϵ=(Mbfymax)/4>0\epsilon=({M_{b}-f_{y}^{\max}})/4>0, there exists τ>0\tau>0 such that, if t(t1τ,t1)t\in(t_{1}-\tau,t_{1}), then

ma||𝒳1<ϵ,ma|𝒞|𝒳1<ϵ,Mb||(1bb+δ𝒞,max(t))<ϵ.\begin{gathered}m_{a}\left|\mathcal{R}\right|\mathcal{X}_{1}<\epsilon,\quad m_{a}\left|\mathcal{C}\right|\mathcal{X}_{1}<\epsilon,\\ M_{b}\left|\mathcal{R}\right|\left(1-\frac{\ell_{b}}{\ell_{b}+\delta_{\mathcal{C},\max}(t)}\right)<\epsilon.\end{gathered} (37)

Now, from (20), (21), (24), (34), (35), and (37), we can evaluate the derivative 𝒳˙2(t)\dot{\mathcal{X}}_{2}(t) for t(t1τ,t1)t\in(t_{1}-\tau,t_{1}) as

𝒳˙2(t)\displaystyle\dot{\mathcal{X}}_{2}(t) 2|𝒞|(Mb𝒳1(t)ma𝒳12(t))2𝒳1(t)fymax(MbMbbb+δ𝒞,max(t)+ma𝒳1(t))2||𝒳1(t)\displaystyle\geq\begin{multlined}{2\left|\mathcal{C}\right|}\left(M_{b}\mathcal{X}_{1}(t)-m_{a}\mathcal{X}_{1}^{2}(t)\right)-2\mathcal{X}_{1}(t)f_{y}^{\max}\\ \!\!\!\!-\Bigg{(}M_{b}-\frac{M_{b}\ell_{b}}{\ell_{b}+\delta_{\mathcal{C},\max}(t)}+m_{a}\mathcal{X}_{1}(t)\Bigg{)}{2\left|\mathcal{R}\right|}\mathcal{X}_{1}(t)\hskip 39.83385pt\end{multlined}{2\left|\mathcal{C}\right|}\left(M_{b}\mathcal{X}_{1}(t)-m_{a}\mathcal{X}_{1}^{2}(t)\right)-2\mathcal{X}_{1}(t)f_{y}^{\max}\\ \!\!\!\!-\Bigg{(}M_{b}-\frac{M_{b}\ell_{b}}{\ell_{b}+\delta_{\mathcal{C},\max}(t)}+m_{a}\mathcal{X}_{1}(t)\Bigg{)}{2\left|\mathcal{R}\right|}\mathcal{X}_{1}(t)\hskip 39.83385pt (38)
2𝒳1(2Mb2fymax3ϵ)>0.\displaystyle\geq 2\mathcal{X}_{1}\left(2M_{b}-2f_{y}^{\max}-3\epsilon\right)>0.

This inequality implies 𝒳2(t1τ)<𝒳2(t1)=0\mathcal{X}_{2}(t_{1}-\tau^{*})<\mathcal{X}_{2}(t_{1})=0, which contradicts the intrinsic nonnegativity of 𝒳2\mathcal{X}_{2}, as desired. ∎

Remark 1.

Our analysis in Theorem 1 is based on the implicit assumption that when the first collision occurs, another collision by other sheep does not occur simultaneously at another location. Although it is possible to relax this assumption, we chose not to present the analysis due to space limitations.

IV Control Design

From Theorem 1, we conclude that no collision occurs, the dynamics of the system are continuous, and we do not need to consider collisions when using the model (1) to design a controller for regulating the target to the origin. In this section, we describe the design of the controller for the shepherd to ensure that the target sheep is controlled to the origin. We specifically use the Lyapunov stability theorem to ensure that the shepherd has ability of locally regulating the target to the origin.

Let us denote the sum of the interaction forces of the target as

wT=faT+fbT.w_{T}=f_{aT}+f_{bT}. (39)

We remark that this quantity is supposed to be available by the shepherd. Then, using (7), we can rewrite the velocity of the target as

x˙T={γ1xTy2(xTy)+wT,if xTy>y,γ2xTy2g(xTy)(xTy)+wT,otherwise.\begin{multlined}\dot{x}_{T}=\\ \begin{cases}\gamma_{1}\lVert x_{T}-y\rVert^{-2}(x_{T}-y)+w_{T},\ \ \ \ \ \ \ \!\mbox{if }\lVert x_{T}-y\rVert>\ell_{y},\\ \gamma_{2}\lVert x_{T}-y\rVert^{-2}g(\lVert x_{T}-y\rVert)(x_{T}-y)+w_{T},\ \mbox{otherwise.}\end{cases}\end{multlined}\dot{x}_{T}=\\ \begin{cases}\gamma_{1}\lVert x_{T}-y\rVert^{-2}(x_{T}-y)+w_{T},\ \ \ \ \ \ \ \!\mbox{if }\lVert x_{T}-y\rVert>\ell_{y},\\ \gamma_{2}\lVert x_{T}-y\rVert^{-2}g(\lVert x_{T}-y\rVert)(x_{T}-y)+w_{T},\ \mbox{otherwise.}\end{cases} (40)

The system (40) is controllable when fymaxwTf_{y}^{\max}\geq w_{T} for t>0t>0. The controller design should base on the controllability of the (40). Thus we select the gains that satisfy

fymax<MbfymaxN(Mbma)f_{y}^{\max}<M_{b}\quad f_{y}^{\max}\geq N(M_{b}-m_{a}) (41)

To realize our control objective, we set the desired position of the shepherd, denoted by yd(t)2y_{d}(t)\in\mathbb{R}^{2}, as

yd=K1xT,y_{d}=K_{1}x_{T}, (42)

where K1K_{1} is a positive constant. This choice is intuitive because if the shepherd is always at the desired position specified above, the target sheep will be regulated to the origin by the repulsive force of the shepherd. Then, we quantify the mismatch between the actual position yy and desired position of the shepherd as

ey=ydy.e_{y}=y_{d}-y. (43)

In this study, we consider the following first-order dynamics for the shepherd:

y˙=uy.\dot{y}=u_{y}. (44)

The following theorem shows that we can construct a feedback controller to locally achieve the control objective stated in Problem 1.

Theorem 5.

Assume that inequality (12) is satisfied. Define γ:2×2{γ1,γ2}\gamma\colon\mathbb{R}^{2}\times\mathbb{R}^{2}\to\{\gamma_{1},\gamma_{2}\} by

γ(xT,y)={γ1,if xTy>y,γ2,if xTyy.\begin{multlined}\gamma(x_{T},y)=\begin{cases}\gamma_{1},&\mbox{{if} }\lVert x_{T}-y\rVert>\ell_{y},\\ \gamma_{2},&\mbox{{if} }\lVert x_{T}-y\rVert\leq\ell_{y}.\end{cases}\end{multlined}\gamma(x_{T},y)=\begin{cases}\gamma_{1},&\mbox{{if} }\lVert x_{T}-y\rVert>\ell_{y},\\ \gamma_{2},&\mbox{{if} }\lVert x_{T}-y\rVert\leq\ell_{y}.\end{cases} (45)

and construct a feedback controller as

uy\displaystyle u_{y} =γeyey(K1+xT)wT\displaystyle=\gamma\frac{e_{y}}{\lVert e_{y}\rVert}(K_{1}+\lVert x_{T}\rVert)\lVert w_{T}\rVert (46)
+γeyey(xT2+xTy+K1xTyxTy2).\displaystyle+\gamma\frac{e_{y}}{\lVert e_{y}\rVert}\left(\frac{\lVert x_{T}\rVert^{2}+\lVert x_{T}\rVert\lVert y\rVert+K_{1}\lVert x_{T}-y\rVert}{\lVert x_{T}-y\rVert^{2}}\right).

Define

V=12xT2+ey.V=\frac{1}{2}\lVert x_{T}\rVert^{2}+\lVert e_{y}\rVert. (47)

Then, for any constant c>0c>0, set VcV\leq c is forward invariant under feedback controller (46).

Proof.

By inequality (12) and Theorem 1, the existence of the solution of the system is guaranteed. Therefore, it is sufficient to demonstrate that the time derivative

V˙\displaystyle\dot{V} =xTx˙T+(ydy)ydy(y˙dy˙)\displaystyle=x_{T}^{\top}\dot{x}_{T}+\frac{(y_{d}-y)^{\top}}{\lVert y_{d}-y\rVert}(\dot{y}_{d}-\dot{y}) (48)

is negative. First, if xTy>y\lVert x_{T}-y\rVert>\ell_{y}, then

V˙=γ1xT(xTyxTy2+wT)+K1γ1eyey(xTyxTy2+wT)eyeyy˙.\begin{multlined}\dot{V}=\gamma_{1}x_{T}^{\top}\left(\frac{x_{T}-y}{\lVert x_{T}-y\rVert^{2}}+w_{T}\right)\\ +K_{1}\gamma_{1}\frac{e_{y}^{\top}}{\lVert e_{y}\rVert}\left(\frac{x_{T}-y}{\lVert x_{T}-y\rVert^{2}}+w_{T}\right)-\frac{e_{y}^{\top}}{\lVert e_{y}\rVert}\dot{y}.\end{multlined}\dot{V}=\gamma_{1}x_{T}^{\top}\left(\frac{x_{T}-y}{\lVert x_{T}-y\rVert^{2}}+w_{T}\right)\\ +K_{1}\gamma_{1}\frac{e_{y}^{\top}}{\lVert e_{y}\rVert}\left(\frac{x_{T}-y}{\lVert x_{T}-y\rVert^{2}}+w_{T}\right)-\frac{e_{y}^{\top}}{\lVert e_{y}\rVert}\dot{y}. (49)

Therefore, using (46) and (49), we can easily show V˙<0\dot{V}<0. Let us consider the case where xTyy\lVert x_{T}-y\rVert\leq\ell_{y}. In this case, we show that

V˙=γ2xT(xTyxTy2g(xTy)+wT)\displaystyle\dot{V}=\gamma_{2}x_{T}^{\top}\left(\frac{x_{T}-y}{\lVert x_{T}-y\rVert^{2}}g(\lVert x_{T}-y\rVert)+w_{T}\right) (50)
+K1γ2eyey(xTyxTy2g(xTy)+wT)eyeyy˙.\displaystyle+K_{1}\gamma_{2}\frac{e_{y}^{\top}}{\lVert e_{y}\rVert}\left(\frac{x_{T}-y}{\lVert x_{T}-y\rVert^{2}}g(\lVert x_{T}-y\rVert)+w_{T}\right)-\frac{e_{y}^{\top}}{\lVert e_{y}\rVert}\dot{y}.

Because g(xTy)=exp(1/xTy)<1g(\lVert x_{T}-y\rVert)=\exp(-{1}/{\lVert x_{T}-y\rVert})<1, we can further evaluate the derivative as

V˙\displaystyle\dot{V} <γ2(xT2+xTy+K1xTyxTy2)+γ2(xT+K1)wTeyeyy˙=0\displaystyle<\begin{multlined}\gamma_{2}\left(\frac{\lVert x_{T}\rVert^{2}+\lVert x_{T}\rVert\lVert y\rVert+K_{1}\lVert x_{T}-y\rVert}{\lVert x_{T}-y\rVert^{2}}\right)\\ +\gamma_{2}(\lVert x_{T}\rVert+K_{1})w_{T}-\frac{e_{y}^{\top}}{\lVert e_{y}\rVert}\dot{y}=0\end{multlined}\gamma_{2}\left(\frac{\lVert x_{T}\rVert^{2}+\lVert x_{T}\rVert\lVert y\rVert+K_{1}\lVert x_{T}-y\rVert}{\lVert x_{T}-y\rVert^{2}}\right)\\ +\gamma_{2}(\lVert x_{T}\rVert+K_{1})w_{T}-\frac{e_{y}^{\top}}{\lVert e_{y}\rVert}\dot{y}=0 (51)

as desired. This completes the proof of the theorem. ∎

Refer to caption
(a) The proposed method with N=10N=10 sheep
Figure 1: The comparison between the proposed method and baseline. These four figures show the overall trajectory of one shepherd regulating the target to the origin in different swarm sizes, where the red line, blue line, and dashed lines represent the trajectories of the target, shepherd, and the other sheep in the swarm.

​​​​​​​​​​​​​ Refer to caption (b) The baseline method with N=10N=10 sheep
Refer to caption (c) The proposed method with N=50N=50 sheep ​​​​​​​​​​​​​ Refer to caption (d) The baseline method with N=50N=50 sheep

Remark 2.

Although Theorem 5 proves only a local effectiveness of the controller (46), numerical simulations presented in the next section suggests its global effectiveness. We leave the problem of theoretically establishing global efficacy of the controller (46) an open problem.

V Numerical Simulations

In this section, we demonstrate the effectiveness of the proposed feedback controller in regulating one target sheep to the origin in a large-scale swarm. The initial conditions for the first experiment were set to xT(0)=[13,14]x_{T}(0)=[13,14]^{\top} and y(0)=[17,17]y(0)=[17,17]^{\top}. We randomly generated N=200N=200 non-overlapping sheep in the square region of [0,17]2[0,17]^{2}. We selected the constants as ma=10m_{a}=10, mb=5m_{b}=5, Mb=10M_{b}=10, y=0.5\ell_{y}=0.5, γ1=1\gamma_{1}=1, γ2=e2\gamma_{2}=e^{2}, l=0.5l=0.5, and K1=1.5K_{1}=1.5. These constants were selected to satisfy inequality (12) in Theorem 1; therefore, the swarm guarantees the collision-free property and the controller can safely regulate the target to the origin.

Refer to caption
Figure 2: The norm of the target position xT\lVert x_{T}\rVert varying from t=0t=0 to t=200t=200 by baseline and proposed method, respectively. The dashed line represents the baseline method, while the solid line represents the proposed method.

To further illustrate the proposed method’s effectiveness, we compare its performance with a baseline strategy from the farthest-agent algorithm [23]. The farthest-agent algorithm utilizes the sum of the following two vectors as the velocity input for the shepherd, where one vector drives the shepherd to approach the target, and the other vector avoids the shepherd exceeding the target.

Figure 1(a) depicts two sets of comparative experiments, illustrating the change of the swarm trajectories by the proposed and baseline methods. As shown in Figure 1(a), we conducted comparative experiments to show the swarm trajectories change from t=0t=0 until either xT<0.1\lVert x_{T}\rVert<0.1 or a designated time limit of t=200t=200. Figures 1(a) and 1(b) show the overall trajectories of the proposed and baseline methods on a N=10N=10 swarm with the same initial placements, respectively, while Figures 1(c) and 1(d) show those on a N=50N=50 swarm with the same initial placements. We observe that the baseline method guides the target around the origin but fails in making the target’s position converge to the origin. On the other hand, the proposed method is guaranteed to make the tracking error converge to zero and, therefore, succeeds in guiding the target to the origin asymptotically.

We further illustrate the effectiveness of the proposed method in different swarm sizes (from N=10N=10 to N=200N=200) as shown in Figure 2. The dashed line represents the baseline method, and the solid line represents the proposed method. Each plot is obtained by averaging 100 different simulations with different initial placements. From Figure 2, we observe the proposed method regulates the target to the origin before t=200t=200 for all different swarm sizes and performs better than the baseline. This further corroborates the conclusion of the comparative analysis of the two methods mentioned above.

VI Conclusion and Future works

In this study, we investigated the problem of using a shepherd to accurately guide a target in a large swarm to its destination. We first analyzed the properties of the swarm and rigorously proved through mathematical analysis that under certain inequality constraints, no sheep in the swarm will collide. Subsequently, we used the Lyapunov stability principle to design a shepherd control law that precisely guides the target to the origin. We also compared our model with a baseline shepherding control algorithm. Our experimental results demonstrate that the shepherd can precisely guide the target to the goal in both small and large swarms. Our future work includes finding necessary and sufficient conditions of the collision-free property of shepherding swarm.

References

  • [1] E. O. Budrene and H. C. Berg, “Dynamics of formation of symmetrical patterns by chemotactic bacteria,” Nature, vol. 376, pp. 49–53, 1995.
  • [2] T. Vicsek, A. Czirók, E. Ben-Jacob, I. Cohen, and O. Shochet, “Novel type of phase transition in a system of self-driven particles,” Physical Review Letters, vol. 75, no. 6, p. 1226, 1995.
  • [3] V. Gazi and K. M. Passino, “A class of attractions/repulsion functions for stable swarm aggregations,” International Journal of Control, vol. 77, no. 18, pp. 1567–1579, 2004.
  • [4] R. Goel, J. Lewis, M. A. Goodrich, and P. Sujit, “Leader and predator based swarm steering for multiple tasks,” in 2019 IEEE International Conference on Systems, Man and Cybernetics, 2019, pp. 3791–3798.
  • [5] V. S. Chipade and D. Panagou, “Multi-swarm herding: Protecting against adversarial swarms,” in 59th IEEE Conference on Decision and Control, 2020, pp. 5374–5379.
  • [6] N. K. Long, K. Sammut, D. Sgarioto, M. Garratt, and H. A. Abbass, “A comprehensive review of shepherding as a bio-inspired swarm-robotics guidance approach,” IEEE Transactions on Emerging Topics in Computational Intelligence, vol. 4, no. 4, pp. 523–537, 2020.
  • [7] J. Hu, A. E. Turgut, T. Krajník, B. Lennox, and F. Arvin, “Occlusion-based coordination protocol design for autonomous robotic shepherding tasks,” IEEE Transactions on Cognitive and Developmental Systems, vol. 14, no. 1, pp. 126–135, 2020.
  • [8] Y. Tsunoda, Y. Sueoka, and K. Osuka, “On statistical analysis for shepherd guidance system,” in 2017 IEEE International Conference on Robotics and Biomimetics, 2017, pp. 1246–1251.
  • [9] A. Garrell and A. Sanfeliu, “Local optimization of cooperative robot movements for guiding and regrouping people in a guiding mission,” in 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010, pp. 3294–3299.
  • [10] H. Xie, M. Sun, X. Fan, Z. Lin, W. Chen, L. Wang, L. Dong, and Q. He, “Reconfigurable magnetic microrobot swarm: Multimode transformation, locomotion, and manipulation,” Science Robotics, vol. 4, no. 28, 2019.
  • [11] J. Yu, B. Wang, X. Du, Q. Wang, and L. Zhang, “Ultra-extensible ribbon-like magnetic microswarm,” Nature Communications, vol. 9, no. 1, p. 3260, 2018.
  • [12] P. Deptula, Z. I. Bell, F. M. Zegers, R. A. Licitra, and W. E. Dixon, “Approximate optimal influence over an agent through an uncertain interaction dynamic,” Automatica, vol. 134, p. 109913, 2021.
  • [13] R. A. Licitra, Z. I. Bell, and W. E. Dixon, “Single-agent indirect herding of multiple targets with uncertain dynamics,” IEEE Transactions on Robotics, vol. 35, no. 4, pp. 847–860, 2019.
  • [14] E. Sebastián and E. Montijano, “Multi-robot implicit control of herds,” in 2021 IEEE International Conference on Robotics and Automation, 2021, pp. 1601–1607.
  • [15] D. M. Le, X. Luo, L. J. Bridgeman, M. M. Zavlanos, and W. E. Dixon, “Single-agent indirect herding of multiple targets using metric temporal logic switching,” in 59th IEEE Conference on Decision and Control, 2020, pp. 1398–1403.
  • [16] W. Zhang, V. S. Chipade, and D. Panagou, “Herding an adversarial swarm in three-dimensional spaces,” in 2021 American Control Conference.   IEEE, 2021, pp. 4722–4728.
  • [17] S. Mayya, G. Notomista, D. Shell, S. Hutchinson, and M. Egerstedt, “Non-uniform robot densities in vibration driven swarms using phase separation theory,” in 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019, pp. 4106–4112.
  • [18] X. Yin, D. Yue, and Z. Chen, “Asymptotic behavior and collision avoidance in the cucker–smale model,” IEEE Transactions on Automatic Control, vol. 65, no. 7, pp. 3112–3119, 2019.
  • [19] S. Bandyopadhyay, V. Gehlot, M. Balas, D. S. Bayard, and M. B. Quadrelli, “Detection of transient instabilities in multi-agent systems and swarms,” in 2021 American Control Conference.   IEEE, 2021, pp. 1216–1223.
  • [20] H.-T. Zhang, C. Zhai, and Z. Chen, “A general alignment repulsion algorithm for flocking of multi-agent systems,” IEEE Transactions on Automatic Control, vol. 56, no. 2, pp. 430–435, 2010.
  • [21] B. Zhang and H. P. Gavin, “Natural deadlock resolution for multi-agent multi-swarm navigation,” in 60th IEEE Conference on Decision and Control, 2021, pp. 5958–5963.
  • [22] H. Su, X. Wang, and Z. Lin, “Flocking of multi-agents with a virtual leader,” IEEE Transactions on Automatic Control, vol. 54, no. 2, pp. 293–307, 2009.
  • [23] Y. Tsunoda, Y. Sueoka, Y. Sato, and K. Osuka, “Analysis of local-camera-based shepherding navigation,” Advanced Robotics, vol. 32, no. 23, pp. 1217–1228, 2018.
  • [24] S.-Y. Ha, J. Jung, J. Kim, J. Park, and X. Zhang, “Emergent behaviors of the swarmalator model for position-phase aggregation,” Mathematical Models and Methods in Applied Sciences, vol. 29, no. 12, pp. 2225–2269, 2019.