Collision-Free Shepherding Control of a Single Target within a Swarm

Yaosheng Deng, Aiyi Li, Masaki Ogura, and Naoki Wakamiya
Graduate School of Information Science and Technology, Osaka University, Suita, Osaka 565-0871, Japan
{ys-deng, li-aiyi, m-ogura, wakamiya}@ist.osaka-u.ac.jp

This work was supported in part by JSPS KAKENHI Grant Number JP21H01352.

Abstract

The shepherding problem refers to guiding a group of agents (called sheep) to a specific destination using an external agent that exerts repulsive forces (called a shepherd). Although various movement algorithms for the shepherd have been explored in the literature, methodologies for selective guidance, a key technology for precise swarm control, remain scarce. Therefore, this study investigates the problem of guiding a single target sheep within a swarm to a given destination using a shepherd. We first present our model of the dynamics of sheep agents and the interaction between sheep and shepherd agents. The model is shown to be well-defined, with no collisions, if the interaction magnitude between sheep and shepherd is suitably bounded. Based on an analysis using Lyapunov stability principles, we design a shepherd control law that guides the target sheep to the origin while avoiding collisions among sheep agents. Numerical simulations demonstrate the effectiveness of the proposed method in guiding the target sheep in both small- and large-scale swarms.

Index Terms:

Shepherding control, Nonlinear control, Lyapunov method.

I Introduction

It has long been observed that various biological individuals tend to exhibit swarming behavior. Examples of swarms include flocks of birds, schools of fish, herds of animals, and colonies of bacteria [1, 2], and engineering applications include the formation control of multi-robot teams and autonomous aircraft. Many studies [3, 4, 5] have investigated the relevant properties of swarm motion based on the attraction-repulsion swarm model and its applications in areas such as UAV swarms and multi-robot systems.

As research on swarm intelligence advances, there has been increasing attention to the use of an external special agent (called shepherd) to control the movement of swarm agents (called sheep). This process is referred to as shepherding [6]. The shepherding model specifically refers to the situation in which “sheep” agents avoid a “shepherd” agent while also interacting with other sheep, according to a swarm model. In this context, we can find various methods [7, 8, 9] to guide a whole swarm of sheep agents.

There is emerging interest in guiding a part of the swarm (e.g., a target sheep) to the goal because of its potential engineering applications in the context of micro-/nano-robotic swarms [10, 11]. For example, Deptula et al. [12] proposed a control algorithm for a shepherd to guide a targeted sheep agent towards its destination. Licitra et al. [13] designed a controller for herding one sheep within a swarm to its destination without considering swarm interaction forces in the dynamics model. Using the framework of reinforcement learning, Sebastián and Montijano [14] developed a method for guiding a few agents within a swarm with heterogeneous agent dynamics. Le et al. [15] proposed a method to use a shepherd to regulate a sheep within a small swarm. Zhang et al. [16] designed an algorithm for capturing and regulating a portion of a swarm using multiple shepherds.

However, the shepherd control methods mentioned above [12, 13, 14, 15, 16] do not consider collisions within the swarm; their algorithms assume that no pairs of agents in the swarm collide. Because the occurrence of a collision can disrupt an entire system and lead to failed tasks [17, 18, 19], it is practically important to develop a shepherding control method with guaranteed collision-free properties. Although several studies have proposed distributed [20, 21] and leader-based [22] control methods for swarm systems with collision-free guarantees, these studies have not been directly extended to the context of shepherding control. Therefore, to design a stable shepherd controller to achieve certain control objectives, the first consideration must be to carefully analyze the collision properties of shepherding swarms.

In this study, the control objective is to use a shepherd agent to guide a sheep agent (target) in a swarm to the origin (goal). First, the repulsion-bounded swarm model is described. We then carefully analyze and rigorously prove that under relatively mild conditions, sheep in the swarm do not collide, even under the influence of a shepherd. Based on this analysis, we use the Lyapunov stability principle to design a motion controller for the shepherd to guide the target to the endpoint (i.e., the origin). The effectiveness of the proposed method is illustrated by numerical simulations, in which we compare it with a baseline strategy based on the farthest agent-targeting strategy presented in [23].

This paper is organized as follows: Section II describes the shepherding swarm model and the problem for controlling one target to the origin; in Section III we analyze the collision-free property of the shepherding swarm model; in Section IV we design a shepherd motion controller to guide a specific target to the destination. Section V provides several numerical simulations that highlight the shepherding behavior.

II Problem statement

We consider a swarm of $N>2$ individuals (sheep) and one herder (shepherd) in $\mathbb{R}^{2}$ . The positions of sheep $i$ and shepherd at time $t\geq 0$ are denoted by $x_{i}(t)\in\mathbb{R}^{2}$ and $y(t)\in\mathbb{R}^{2}$ , respectively. Let $\mathcal{N}=\{1,2,...,N\}$ .

We assume that, for all $i\in\mathcal{N}$, the velocity $\dot{x}_{i}$ of the $i$th sheep is the sum of three components:

\dot{x}_{i}=f_{ai}+f_{bi}+f_{yi},

(1)

where $f_{ai}$ and $f_{bi}$ represent the attraction and repulsion exerted on the $i$th sheep by the other sheep, respectively, and $f_{yi}$ represents the repulsion exerted on the $i$th sheep by the shepherd.

Below, we describe how $f_{ai}$, $f_{bi}$, and $f_{yi}$ are constructed. First, the attraction function of the $i$th sheep is given by

f_{ai}=\sum_{j\in\mathcal{N}{\backslash\{i\}}}\phi_{a}(x_{i}-x_{j}),

(2)

where the function $\phi_{a}\colon\mathbb{R}^{2}\to\mathbb{R}^{2}$ is defined as

\phi_{a}({x})=-m_{a}x

(3)

for a positive constant $m_{a}$. Then, the bounded repulsion function of the $i$th sheep is given by

f_{bi}=\sum_{j\in\mathcal{N}{\backslash\{i\}}}\phi_{b}(x_{i}-x_{j}),

(4)

where the function $\phi_{b}\colon\mathbb{R}^{2}\to\mathbb{R}^{2}$ is defined as

\phi_{b}({x})=\begin{cases}m_{b}\dfrac{{x}}{\lVert{x}\rVert^{2}},&\mbox{if }\lVert{x}\rVert>\ell_{b},\vspace{2mm}\\ M_{b}\dfrac{{x}}{\lVert{x}\rVert},&\mbox{if }\lVert{x}\rVert\leq\ell_{b},\end{cases}

(5)

for positive constants $\ell_{b}$ , $m_{b}$ , and $M_{b}$ satisfying

M_{b}=\frac{m_{b}}{\ell_{b}},\quad\ell_{b}<R.

(6)
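As a minimal sketch, the attraction kernel (3) and the bounded repulsion kernel (5) can be written as follows. The numerical values are borrowed from the simulation section purely for illustration, and returning zero at $x=0$ is an assumption made here, since (5) is undefined at the origin.

```python
import numpy as np

# Illustrative values (taken from the simulation section); assumptions here.
M_A = 10.0               # attraction gain m_a
M_B = 5.0                # repulsion gain m_b
ELL_B = 0.5              # switching distance ell_b
CAP_M_B = M_B / ELL_B    # M_b = m_b / ell_b, as required by (6)


def phi_a(x):
    """Linear attraction (3): phi_a(x) = -m_a * x."""
    return -M_A * np.asarray(x, dtype=float)


def phi_b(x):
    """Bounded repulsion (5).

    Outside ell_b the magnitude decays as m_b / ||x||; inside ell_b it is
    capped at M_b. The value at x = 0 is set to zero (assumption).
    """
    x = np.asarray(x, dtype=float)
    d = np.linalg.norm(x)
    if d == 0.0:
        return np.zeros_like(x)
    if d > ELL_B:
        return M_B * x / d**2
    return CAP_M_B * x / d
```

Because $M_{b}=m_{b}/\ell_{b}$, the two branches agree at $\lVert x\rVert=\ell_{b}$, so the kernel is continuous away from the origin and its magnitude never exceeds $M_{b}$.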

Finally, we define $f_{yi}$ as

f_{yi}=\begin{cases}\gamma_{1}\dfrac{x_{i}-y}{\lVert x_{i}-y\rVert^{2}},\vspace{2mm}&\mbox{if $\lVert x_{i}-y\rVert>\ell_{y}$,}\\ \displaystyle\gamma_{2}\dfrac{x_{i}-y}{\lVert x_{i}-y\rVert^{2}}g(\lVert x_{i}-y\rVert),&\mbox{if $\lVert x_{i}-y\rVert\leq\ell_{y}$,}\end{cases}

(7)

where the function $g\colon[0,\infty)\to[0,\infty)$ is defined by:

g({d})=\begin{cases}0,&\mbox{if $d=0$,}\\ \exp\left({-{1}/{d}}\right),&\mbox{otherwise}\end{cases}

(8)

and the parameters $\gamma_{1}$ , $\gamma_{2}$ , and $\ell_{y}$ are positive constants that satisfy

\gamma_{2}=\frac{\gamma_{1}}{g(\ell_{y})}.

(9)
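The shepherd repulsion (7), with $g$ from (8) and the matching condition (9), can be sketched as below. The sketch reads (9) as $\gamma_{2}=\gamma_{1}/g(\ell_{y})$, which is consistent with the values $\gamma_{1}=1$, $\ell_{y}=0.5$, and $\gamma_{2}=e^{2}$ reported in the simulation section. The function names are hypothetical, and the zero value at $x_{i}=y$ is an assumption motivated by $g(0)=0$.

```python
import math

import numpy as np

GAMMA_1 = 1.0    # gamma_1 (illustrative value from the simulation section)
ELL_Y = 0.5      # ell_y  (illustrative value from the simulation section)


def g(d):
    """Smooth cut-off (8): g(0) = 0 and g(d) = exp(-1/d) otherwise."""
    return 0.0 if d == 0 else math.exp(-1.0 / d)


# Matching condition (9), read as a reciprocal rather than an inverse function.
GAMMA_2 = GAMMA_1 / g(ELL_Y)


def shepherd_force(x_i, y):
    """Repulsion (7) exerted by a shepherd at y on a sheep at x_i."""
    diff = np.asarray(x_i, dtype=float) - np.asarray(y, dtype=float)
    d = float(np.linalg.norm(diff))
    if d == 0.0:
        # Assumption: the force is taken to vanish when the positions coincide.
        return np.zeros_like(diff)
    if d > ELL_Y:
        return GAMMA_1 * diff / d**2
    return GAMMA_2 * g(d) * diff / d**2
```

With these values, the force magnitude at distance $\ell_{y}$ equals $\gamma_{1}/\ell_{y}=2$ on both branches, so the force is continuous across the switching distance.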

We can now state the problem studied in this paper.

Problem 1.

Assume that the shepherd knows all sheep positions and can calculate the interaction force $f_{aT}+f_{bT}$ of the target sheep $T\in\mathcal{N}$ . Design a controller for the velocity of the shepherd such that the target sheep is asymptotically guided to the origin.
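Problem 1 assumes that the shepherd can evaluate $f_{aT}+f_{bT}$ from the sheep positions. A vectorized sketch of the model (1), under the assumptions that all sheep positions are distinct and that the shepherd does not coincide with any sheep, is given below. The function names and the parameter values (borrowed from the simulation section) are illustrative only.

```python
import numpy as np

M_A, M_B, ELL_B = 10.0, 5.0, 0.5          # m_a, m_b, ell_b (illustrative)
GAMMA_1, ELL_Y = 1.0, 0.5                 # gamma_1, ell_y (illustrative)
GAMMA_2 = GAMMA_1 / np.exp(-1.0 / ELL_Y)  # matching condition (9)


def swarm_interaction(x):
    """Return f_ai + f_bi for every sheep; x has shape (N, 2)."""
    diff = x[:, None, :] - x[None, :, :]   # diff[i, j] = x_i - x_j
    dist = np.linalg.norm(diff, axis=-1)   # pairwise distances, shape (N, N)
    np.fill_diagonal(dist, np.inf)         # exclude the term j = i
    # Scalar factor of phi_b in (5): m_b / d^2 outside ell_b, M_b / d inside.
    rep = np.where(dist > ELL_B, M_B / dist**2, (M_B / ELL_B) / dist)
    # phi_a contributes the constant factor -m_a, see (3).
    return ((rep - M_A)[:, :, None] * diff).sum(axis=1)


def shepherd_force(x, y):
    """Return f_yi from (7) for every sheep; y has shape (2,)."""
    diff = x - y
    d = np.linalg.norm(diff, axis=-1, keepdims=True)
    scale = np.where(d > ELL_Y, GAMMA_1, GAMMA_2 * np.exp(-1.0 / d)) / d**2
    return scale * diff


def sheep_velocity(x, y):
    """Right-hand side of (1) for all sheep at once."""
    return swarm_interaction(x) + shepherd_force(x, y)
```

The row `swarm_interaction(x)[T]` is the quantity $f_{aT}+f_{bT}$ that the shepherd is assumed to know for the target index `T`. Since both kernels are odd functions, the rows of `swarm_interaction(x)` sum to zero.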

III Collision-Free Condition

In this section, we present a sufficient condition guaranteeing that no pair of sheep within the swarm collides. The condition is stated in terms of the maximum magnitude of the force that the shepherd can exert on a single sheep, which is defined as

f_{y}^{\max}=\frac{\gamma_{1}}{\ell_{y}}.

(10)
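The value in (10) can be recovered from (7)–(9) by a short calculation. The following sketch writes $d=\lVert x_{i}-y\rVert$ and assumes $\ell_{y}\leq 1$, an assumption that is not stated in the text but holds for the value $\ell_{y}=0.5$ used in the simulations.

```latex
\begin{gather*}
\lVert f_{yi}\rVert=
\begin{cases}
\dfrac{\gamma_{1}}{d}, & d>\ell_{y},\\[2mm]
\dfrac{\gamma_{2}\,e^{-1/d}}{d}, & 0<d\leq\ell_{y},
\end{cases}
\\
\frac{\mathrm{d}}{\mathrm{d}d}\left(\frac{e^{-1/d}}{d}\right)
=\frac{(1-d)\,e^{-1/d}}{d^{3}}\geq 0
\quad\text{for } 0<d\leq 1,
\\
\sup_{d>0}\lVert f_{yi}\rVert
=\frac{\gamma_{2}\,g(\ell_{y})}{\ell_{y}}
=\frac{\gamma_{1}}{\ell_{y}}
=f_{y}^{\max}.
\end{gather*}
```

The first branch is decreasing in $d$ and the second is nondecreasing on $(0,\ell_{y}]$ when $\ell_{y}\leq 1$, so the magnitude peaks at $d=\ell_{y}$, where (9) makes the two branches coincide.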

Theorem 1.

Suppose that

x_{i}(0)\neq x_{j}(0)

(11)

for all distinct pairs $(i,j)\in\mathcal{N}\times\mathcal{N}$ . If

f_{y}^{\max}<M_{b},

(12)

then

x_{i}(t)\neq x_{j}(t)

(13)

for all distinct pairs $(i,j)\in\mathcal{N}\times\mathcal{N}$ and $t>0$ .
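In a numerical experiment, the conclusion (13) can be monitored on a sampled trajectory by tracking the minimum pairwise distance. The helper names and the tolerance `tol` below are hypothetical; a sampled check of this kind can only flag a collision and does not replace the proof.

```python
import numpy as np


def min_pairwise_distance(x):
    """Smallest distance between two distinct sheep; x has shape (N, 2)."""
    diff = x[:, None, :] - x[None, :, :]
    dist = np.linalg.norm(diff, axis=-1)
    upper = np.triu_indices(len(x), k=1)   # each unordered pair once
    return dist[upper].min()


def first_collision_index(trajectory, tol=1e-9):
    """Index of the first sample whose minimum distance is at most tol.

    Returns None when no sample of the trajectory contains a collision.
    """
    for k, x in enumerate(trajectory):
        if min_pairwise_distance(x) <= tol:
            return k
    return None
```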

In the remainder of this section, we present a proof of Theorem 1 using the following notation. For all $i,j\in\mathcal{N}$ , let

\delta_{ij}=\lVert x_{i}-x_{j}\rVert.

(14)

The first collision time of the system is defined as follows:

t_{1}=\min\{t\geq 0\mid\min_{i,j\in\mathcal{N},\,i\neq j}\delta_{ij}(t)=0\}.

(15)

Set $\mathcal{C}$ is defined as

\displaystyle\mathcal{C}=\begin{cases}\emptyset,&\mbox{if $t_{1}=\infty$,}\\ \{i\in\mathcal{N}\mid\delta_{ij}(t_{1})=0\mbox{ for some }j\neq i\},&\mbox{otherwise.}\end{cases}

(16)

We then define the complement set as

\mathcal{R}=\mathcal{N}\setminus\mathcal{C}.

(17)

We prove Theorem 1 by contradiction. We first outline the proof. Let us introduce the nonnegative function

\mathcal{X}_{2}=\sum_{i,j\in\mathcal{C},i\neq j}\delta_{ij}^{2}.

(18)

If a finite-time collision occurs, that is, if the conclusion of Theorem 1 is violated, then $t_{1}$ is finite and therefore $\mathcal{X}_{2}(t_{1})=0$. On the other hand, a careful analysis of the derivative of $\mathcal{X}_{2}(t)$ shows that, under inequality (12), $\mathcal{X}_{2}(t)$ is strictly increasing on the interval $[t_{1}-\tau,t_{1})$ for a sufficiently small $\tau>0$. Because $\mathcal{X}_{2}(t_{1})=0$, this forces $\mathcal{X}_{2}(t)$ to be negative on that interval, which contradicts the nonnegativity of $\mathcal{X}_{2}$.

To proceed further, we first evaluate the derivative of $\mathcal{X}_{2}$ . A straightforward calculation shows that the derivative can be decomposed as follows:

\dot{\mathcal{X}}_{2}=\mathcal{I}_{1}+\mathcal{I}_{2}+\mathcal{I}_{3}+\mathcal{I}_{4}+\mathcal{I}_{5},

(19)

where

\begin{aligned}
\mathcal{I}_{1}&=2\sum_{\begin{smallmatrix}i,j\in\mathcal{C}\\ i\neq j\end{smallmatrix}}(x_{i}-x_{j})^{\top}(f_{yi}-f_{yj}),\\
\mathcal{I}_{2}&=2\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j,j\neq k\end{smallmatrix}}(x_{i}-x_{j})^{\top}\left(\phi_{a}(x_{i}-x_{k})+\phi_{b}(x_{i}-x_{k})\right),\\
\mathcal{I}_{3}&=2\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j,j\neq k\end{smallmatrix}}(x_{j}-x_{i})^{\top}\left(\phi_{a}(x_{j}-x_{k})+\phi_{b}(x_{j}-x_{k})\right),\\
\mathcal{I}_{4}&=2\sum_{\begin{smallmatrix}i,j\in\mathcal{C}\\ k\in\mathcal{R},i\neq j\end{smallmatrix}}(x_{i}-x_{j})^{\top}\left(\phi_{a}(x_{i}-x_{k})+\phi_{b}(x_{i}-x_{k})\right),\\
\mathcal{I}_{5}&=2\sum_{\begin{smallmatrix}i,j\in\mathcal{C}\\ k\in\mathcal{R},i\neq j\end{smallmatrix}}(x_{j}-x_{i})^{\top}\left(\phi_{a}(x_{j}-x_{k})+\phi_{b}(x_{j}-x_{k})\right).
\end{aligned}

(20)

By the symmetry [24] of these expressions with respect to the indices $i$ and $j$, we have $\mathcal{I}_{2}=\mathcal{I}_{3}$ and $\mathcal{I}_{4}=\mathcal{I}_{5}$, and therefore

\dot{\mathcal{X}}_{2}=\mathcal{I}_{1}+2\mathcal{I}_{2}+2\mathcal{I}_{4}.

(21)

To further evaluate the derivative $\dot{\mathcal{X}}_{2}$ , we provide lower bounds for $\mathcal{I}_{2}$ and $\mathcal{I}_{4}$ . A lower bound for $\mathcal{I}_{2}$ is presented in the following proposition:

Proposition 2.

Let $t\geq 0$ be arbitrary. If

\delta_{\mathcal{C},\max}(t)=\max_{i,j\in\mathcal{C},i\neq j}\delta_{ij}(t)

(22)

satisfies

\delta_{\mathcal{C},\max}(t)\leq\ell_{b},

(23)

then

\mathcal{I}_{2}(t)\geq{M_{b}\left|\mathcal{C}\right|}\mathcal{X}_{1}(t)-{m_{a}\left|\mathcal{C}\right|}\mathcal{X}_{1}^{2}(t),

(24)

where $\mathcal{X}_{1}$ is defined as

\mathcal{X}_{1}=\sum_{i,j\in\mathcal{C},i\neq j}\delta_{ij}.

(25)

Proof.

We omit the time variable $t$ within the proof for notational simplicity. If $\mathcal{C}=\emptyset$, the inequality holds vacuously. We therefore assume that $\mathcal{C}$ is nonempty. Under (23), every pair of sheep in $\mathcal{C}$ is within distance $\ell_{b}$, so the second branch of (5) applies. By using (3), (5), and (23), we can show that

\mathcal{I}_{2}=2\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j\neq k\end{smallmatrix}}(x_{i}-x_{j})^{\top}\Bigg(m_{a}(x_{k}-x_{i})-M_{b}\frac{x_{k}-x_{i}}{\lVert x_{k}-x_{i}\rVert}\Bigg).

(26)

By exchanging indices $i$ and $k$ , we obtain

\mathcal{I}_{2}=2\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j\neq k\end{smallmatrix}}(x_{k}-x_{j})^{\top}\Bigg(m_{a}(x_{i}-x_{k})-M_{b}\frac{x_{i}-x_{k}}{\lVert x_{k}-x_{i}\rVert}\Bigg).

(27)

Therefore,

\begin{multlined}\mathcal{I}_{2}=\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j\neq k\end{smallmatrix}}(x_{i}-x_{j})^{\top}\Bigg(m_{a}(x_{k}-x_{i})-M_{b}\frac{x_{k}-x_{i}}{\lVert x_{k}-x_{i}\rVert}\Bigg)\\ +\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j\neq k\end{smallmatrix}}(x_{k}-x_{j})^{\top}\Bigg(m_{a}(x_{i}-x_{k})-M_{b}\frac{x_{i}-x_{k}}{\lVert x_{k}-x_{i}\rVert}\Bigg).\end{multlined}

(28)

Then, we can further simplify $\mathcal{I}_{2}$ as

\begin{aligned}
\mathcal{I}_{2}&=\sum_{\begin{smallmatrix}i,j,k\in\mathcal{C}\\ i\neq j\neq k\end{smallmatrix}}\bigl(-m_{a}\lVert x_{k}-x_{i}\rVert^{2}+M_{b}\lVert x_{k}-x_{i}\rVert\bigr)\\
&=M_{b}\left|\mathcal{C}\right|\mathcal{X}_{1}(t)-m_{a}\left|\mathcal{C}\right|\mathcal{X}_{2}(t),
\end{aligned}

(29)

which proves the desired inequality (24) using the trivial inequality $\mathcal{X}_{2}\leq\mathcal{X}_{1}^{2}$ . ∎
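For completeness, the inequality $\mathcal{X}_{2}\leq\mathcal{X}_{1}^{2}$ used in the last step follows by expanding the square and discarding the nonnegative cross terms. In the sketch below, $(i,j)$ and $(p,q)$ range over ordered pairs of distinct indices in $\mathcal{C}$.

```latex
\mathcal{X}_{1}^{2}
=\Bigl(\sum_{(i,j)}\delta_{ij}\Bigr)^{2}
=\sum_{(i,j)}\delta_{ij}^{2}
+\sum_{(i,j)\neq(p,q)}\delta_{ij}\,\delta_{pq}
\;\geq\;\sum_{(i,j)}\delta_{ij}^{2}
=\mathcal{X}_{2},
\qquad\text{since }\delta_{ij}\geq 0 .
```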

Next, we derive a lower bound for $\mathcal{I}_{4}$. We begin with the following technical lemma, whose proof is omitted due to space limitations.

Lemma 3.

Let $t\geq 0$ and $i,j\in\mathcal{C}$ be arbitrary. Assume $i\neq j$ . Define

\begin{multlined}\mathcal{I}_{a}(t)=(x_{i}(t)-x_{j}(t))^{\top}\phi_{a}(x_{i}(t)-x_{k}(t))\\ +(x_{j}(t)-x_{i}(t))^{\top}\phi_{a}(x_{j}(t)-x_{k}(t))\end{multlined}

(30)

and

\begin{multlined}\mathcal{I}_{b}(t)=-\frac{1}{2}\Big((x_{i}(t)-x_{j}(t))^{\top}\phi_{b}(x_{k}(t)-x_{i}(t))\\ +(x_{j}(t)-x_{i}(t))^{\top}\phi_{b}(x_{k}(t)-x_{j}(t))\Big).\end{multlined}

(31)

Then, we have

\mathcal{I}_{a}(t)=-\frac{m_{a}}{2}\delta_{ij}^{2}(t)

(32)

and

\mathcal{I}_{b}(t)\geq\begin{cases}
0,&\mbox{if $\lVert x_{k}(t)-x_{i}(t)\rVert<\ell_{b}$ and $\lVert x_{k}(t)-x_{j}(t)\rVert<\ell_{b}$,}\\
\displaystyle-\frac{m_{b}}{2}\left(\frac{1}{\ell_{b}}-\frac{1}{\ell_{b}+\delta_{\mathcal{C},\max}(t)}\right)\delta_{ij}(t),&\mbox{if $\lVert x_{k}(t)-x_{i}(t)\rVert\geq\ell_{b}$ and $\lVert x_{k}(t)-x_{j}(t)\rVert\geq\ell_{b}$,}\\
\displaystyle-\frac{m_{b}}{2}\left(\frac{1}{\ell_{b}}-\frac{1}{\ell_{b}+\delta_{\mathcal{C},\max}(t)}\right)\delta_{ij}(t),&\mbox{if $\lVert x_{k}(t)-x_{i}(t)\rVert\geq\ell_{b}$ and $\lVert x_{k}(t)-x_{j}(t)\rVert\leq\ell_{b}$.}
\end{cases}

(33)

Using Lemma 3, we can prove the following lower bound for $\mathcal{I}_{4}$. The proof of Proposition 4 is omitted due to space limitations.

Proposition 4.

Let $t\geq 0$ be arbitrary. If inequality (23) holds, then

\mathcal{I}_{4}(t)\geq-\lvert\mathcal{R}\rvert\left({M_{b}}-\frac{M_{b}\ell_{b}}{\ell_{b}+\delta_{\mathcal{C},\max}(t)}+m_{a}\mathcal{X}_{1}(t)\right)\mathcal{X}_{1}(t).

(34)

We are now ready to prove Theorem 1.

Proof of Theorem 1.

Assume that inequality (12) holds; i.e., $f_{y}^{\max}<M_{b}$. Under this assumption, we need to show that the set $\mathcal{C}$ is empty. To derive a contradiction, assume that $\mathcal{C}$ is nonempty. Then, we have

\lvert\mathcal{C}\rvert\geq 2.

(35)

Further, $t_{1}$ is finite and we have $\lim_{t\to t_{1}^{-}}\mathcal{X}_{1}(t)=0$ and

\lim_{t\to t_{1}^{-}}\left(1-\frac{\ell_{b}}{\ell_{b}+\delta_{\mathcal{C},\max}(t)}\right)=0.

(36)

Hence, for $\epsilon=({M_{b}-f_{y}^{\max}})/4>0$ , there exists $\tau>0$ such that, if $t\in(t_{1}-\tau,t_{1})$ , then

\begin{gathered}m_{a}\left|\mathcal{R}\right|\mathcal{X}_{1}<\epsilon,\quad m_{a}\left|\mathcal{C}\right|\mathcal{X}_{1}<\epsilon,\\ M_{b}\left|\mathcal{R}\right|\left(1-\frac{\ell_{b}}{\ell_{b}+\delta_{\mathcal{C},\max}(t)}\right)<\epsilon.\end{gathered}

(37)

Now, from (20), (21), (24), (34), (35), and (37), we can evaluate the derivative $\dot{\mathcal{X}}_{2}(t)$ for $t\in(t_{1}-\tau,t_{1})$ as

\begin{aligned}
\dot{\mathcal{X}}_{2}(t)&\geq 2\left|\mathcal{C}\right|\left(M_{b}\mathcal{X}_{1}(t)-m_{a}\mathcal{X}_{1}^{2}(t)\right)-2\mathcal{X}_{1}(t)f_{y}^{\max}\\
&\quad-\Bigg(M_{b}-\frac{M_{b}\ell_{b}}{\ell_{b}+\delta_{\mathcal{C},\max}(t)}+m_{a}\mathcal{X}_{1}(t)\Bigg)2\left|\mathcal{R}\right|\mathcal{X}_{1}(t)\\
&\geq 2\mathcal{X}_{1}\left(2M_{b}-2f_{y}^{\max}-3\epsilon\right)>0.
\end{aligned}

(38)

This inequality implies $\mathcal{X}_{2}(t_{1}-\tau^{*})<\mathcal{X}_{2}(t_{1})=0$ for any $\tau^{*}\in(0,\tau)$, which contradicts the nonnegativity of $\mathcal{X}_{2}$, as desired. ∎

Remark 1.

Our analysis in Theorem 1 is based on the implicit assumption that when the first collision occurs, another collision by other sheep does not occur simultaneously at another location. Although it is possible to relax this assumption, we chose not to present the analysis due to space limitations.

IV Control Design

Theorem 1 implies that no collision occurs and that the dynamics of the system are continuous; hence, we need not consider collisions when using the model (1) to design a controller for regulating the target to the origin. In this section, we design a controller for the shepherd that steers the target sheep to the origin. Specifically, we use Lyapunov stability theory to ensure that the shepherd is able to locally regulate the target to the origin.

Let us denote the sum of the interaction forces of the target as

w_{T}=f_{aT}+f_{bT}.

(39)

We remark that this quantity is assumed to be available to the shepherd, as stated in Problem 1. Then, using (7), we can rewrite the velocity of the target as

\dot{x}_{T}=\begin{cases}\gamma_{1}\lVert x_{T}-y\rVert^{-2}(x_{T}-y)+w_{T},&\mbox{if }\lVert x_{T}-y\rVert>\ell_{y},\\ \gamma_{2}\lVert x_{T}-y\rVert^{-2}g(\lVert x_{T}-y\rVert)(x_{T}-y)+w_{T},&\mbox{otherwise.}\end{cases}

(40)

The system (40) is controllable when $f_{y}^{\max}\geq\lVert w_{T}\rVert$ for all $t>0$. The controller design should be based on this controllability property of (40). We therefore select gains that satisfy

f_{y}^{\max}<M_{b},\quad f_{y}^{\max}\geq N(M_{b}-m_{a}).

(41)
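A quick consistency check of a parameter set against (6), (9), (12), and (41) might look as follows. The dictionary holds the values reported in the simulation section, reading the value 0.5 listed there for $l$ as the repulsion threshold $\ell_{b}$; the function and key names are hypothetical.

```python
import math

# Values reported in the simulation section (ell_b = 0.5 is an interpretation).
SECTION_V = {
    "m_a": 10.0, "m_b": 5.0, "M_b": 10.0, "ell_b": 0.5,
    "ell_y": 0.5, "gamma_1": 1.0, "gamma_2": math.exp(2.0),
}


def check_gains(p, n_sheep):
    """Return a dict mapping each condition to True or False."""
    g_ell_y = math.exp(-1.0 / p["ell_y"])    # g(ell_y) from (8)
    f_y_max = p["gamma_1"] / p["ell_y"]      # maximum shepherd force (10)
    return {
        # (6): M_b = m_b / ell_b
        "eq6": math.isclose(p["M_b"], p["m_b"] / p["ell_b"], rel_tol=1e-9),
        # (9): gamma_2 = gamma_1 / g(ell_y)
        "eq9": math.isclose(p["gamma_2"], p["gamma_1"] / g_ell_y, rel_tol=1e-9),
        # (12): f_y_max < M_b, the collision-free condition of Theorem 1
        "eq12": f_y_max < p["M_b"],
        # (41), second part: f_y_max >= N (M_b - m_a)
        "eq41": f_y_max >= n_sheep * (p["M_b"] - p["m_a"]),
    }
```

For the reported values, $f_{y}^{\max}=2$, $M_{b}=10$, and $N(M_{b}-m_{a})=0$, so all four conditions hold for any swarm size.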

To realize our control objective, we set the desired position of the shepherd, denoted by $y_{d}(t)\in\mathbb{R}^{2}$ , as

y_{d}=K_{1}x_{T},

(42)

where $K_{1}$ is a positive constant. This choice is intuitive because if the shepherd is always at the desired position specified above, the target sheep will be regulated to the origin by the repulsive force of the shepherd. Then, we quantify the mismatch between the actual position $y$ and desired position of the shepherd as

e_{y}=y_{d}-y.

(43)

In this study, we consider the following first-order dynamics for the shepherd:

\dot{y}=u_{y}.

(44)

The following theorem shows that we can construct a feedback controller to locally achieve the control objective stated in Problem 1.

Theorem 5.

Assume that inequality (12) is satisfied. Define $\gamma\colon\mathbb{R}^{2}\times\mathbb{R}^{2}\to\{\gamma_{1},\gamma_{2}\}$ by

\gamma(x_{T},y)=\begin{cases}\gamma_{1},&\mbox{if }\lVert x_{T}-y\rVert>\ell_{y},\\ \gamma_{2},&\mbox{if }\lVert x_{T}-y\rVert\leq\ell_{y},\end{cases}

(45)

and construct a feedback controller as

\begin{aligned}
u_{y}&=\gamma\frac{e_{y}}{\lVert e_{y}\rVert}(K_{1}+\lVert x_{T}\rVert)\lVert w_{T}\rVert\\
&\quad+\gamma\frac{e_{y}}{\lVert e_{y}\rVert}\left(\frac{\lVert x_{T}\rVert^{2}+\lVert x_{T}\rVert\lVert y\rVert+K_{1}\lVert x_{T}-y\rVert}{\lVert x_{T}-y\rVert^{2}}\right).
\end{aligned}

(46)

Define

V=\frac{1}{2}\lVert x_{T}\rVert^{2}+\lVert e_{y}\rVert.

(47)

Then, for any constant $c>0$, the sublevel set $\{V\leq c\}$ is forward invariant under the feedback controller (46).

Proof.

By inequality (12) and Theorem 1, the existence of the solution of the system is guaranteed. Therefore, it is sufficient to demonstrate that the time derivative

\dot{V}=x_{T}^{\top}\dot{x}_{T}+\frac{(y_{d}-y)^{\top}}{\lVert y_{d}-y\rVert}(\dot{y}_{d}-\dot{y})

(48)

is negative. First, if $\lVert x_{T}-y\rVert>\ell_{y}$ , then

\begin{multlined}\dot{V}=\gamma_{1}x_{T}^{\top}\left(\frac{x_{T}-y}{\lVert x_{T}-y\rVert^{2}}+w_{T}\right)\\ +K_{1}\gamma_{1}\frac{e_{y}^{\top}}{\lVert e_{y}\rVert}\left(\frac{x_{T}-y}{\lVert x_{T}-y\rVert^{2}}+w_{T}\right)-\frac{e_{y}^{\top}}{\lVert e_{y}\rVert}\dot{y}.\end{multlined}

(49)

Therefore, using (46) and (49), we can easily show $\dot{V}<0$ . Let us consider the case where $\lVert x_{T}-y\rVert\leq\ell_{y}$ . In this case, we show that

\begin{aligned}
\dot{V}&=\gamma_{2}x_{T}^{\top}\left(\frac{x_{T}-y}{\lVert x_{T}-y\rVert^{2}}g(\lVert x_{T}-y\rVert)+w_{T}\right)\\
&\quad+K_{1}\gamma_{2}\frac{e_{y}^{\top}}{\lVert e_{y}\rVert}\left(\frac{x_{T}-y}{\lVert x_{T}-y\rVert^{2}}g(\lVert x_{T}-y\rVert)+w_{T}\right)-\frac{e_{y}^{\top}}{\lVert e_{y}\rVert}\dot{y}.
\end{aligned}

(50)

Because $g(\lVert x_{T}-y\rVert)=\exp(-{1}/{\lVert x_{T}-y\rVert})<1$ , we can further evaluate the derivative as

\begin{aligned}
\dot{V}&<\gamma_{2}\left(\frac{\lVert x_{T}\rVert^{2}+\lVert x_{T}\rVert\lVert y\rVert+K_{1}\lVert x_{T}-y\rVert}{\lVert x_{T}-y\rVert^{2}}\right)\\
&\quad+\gamma_{2}(\lVert x_{T}\rVert+K_{1})\lVert w_{T}\rVert-\frac{e_{y}^{\top}}{\lVert e_{y}\rVert}\dot{y}=0
\end{aligned}

(51)

as desired. This completes the proof of the theorem. ∎
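The feedback law (46), together with the definitions (42), (43), and (45), can be sketched as a single function. The parameter values are those of the simulation section, the function name is hypothetical, and the guard `eps` is an assumption added here because (46) is undefined when $e_{y}=0$ or $x_{T}=y$.

```python
import numpy as np

GAMMA_1, ELL_Y, K_1 = 1.0, 0.5, 1.5        # illustrative values
GAMMA_2 = GAMMA_1 * np.exp(1.0 / ELL_Y)    # gamma_1 / g(ell_y), see (9)


def shepherd_control(x_t, y, w_t, eps=1e-9):
    """Velocity command u_y from (46).

    x_t: target position, y: shepherd position, w_t: f_aT + f_bT from (39).
    """
    x_t, y, w_t = (np.asarray(v, dtype=float) for v in (x_t, y, w_t))
    e_y = K_1 * x_t - y                    # (42) and (43): e_y = y_d - y
    e_norm = np.linalg.norm(e_y)
    if e_norm < eps:
        # Assumption: command zero velocity once the shepherd sits at y_d.
        return np.zeros_like(e_y)
    r = max(np.linalg.norm(x_t - y), eps)  # guarded distance ||x_T - y||
    gamma = GAMMA_1 if r > ELL_Y else GAMMA_2          # (45)
    nx, ny = np.linalg.norm(x_t), np.linalg.norm(y)
    gain = (K_1 + nx) * np.linalg.norm(w_t) \
        + (nx**2 + nx * ny + K_1 * r) / r**2
    return gamma * gain * e_y / e_norm
```

For example, with $x_{T}=(2,0)$, $y=(4,0)$, and $w_{T}=0$, the error is $e_{y}=(-1,0)$ and the gain is $(4+8+3)/4=3.75$, so the command is $u_{y}=(-3.75,0)$. The unit vector $e_{y}/\lVert e_{y}\rVert$ makes the law discontinuous at $e_{y}=0$, which a discrete-time implementation may need to handle with care.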

Figure 1: (a) The proposed method with $N=10$ sheep.

Remark 2.

Although Theorem 5 establishes only the local effectiveness of the controller (46), the numerical simulations presented in the next section suggest its global effectiveness. We leave the theoretical analysis of the global efficacy of the controller (46) as an open problem.

V Numerical Simulations

In this section, we demonstrate the effectiveness of the proposed feedback controller in regulating one target sheep to the origin in a large-scale swarm. The initial conditions for the first experiment were set to $x_{T}(0)=[13,14]^{\top}$ and $y(0)=[17,17]^{\top}$. We randomly generated $N=200$ non-overlapping sheep in the square region $[0,17]^{2}$. We selected the constants as $m_{a}=10$, $m_{b}=5$, $M_{b}=10$, $\ell_{b}=0.5$, $\ell_{y}=0.5$, $\gamma_{1}=1$, $\gamma_{2}=e^{2}$, and $K_{1}=1.5$. These constants satisfy inequality (12) in Theorem 1, since $f_{y}^{\max}=\gamma_{1}/\ell_{y}=2<M_{b}=10$; therefore, the swarm is guaranteed to be collision-free and the controller can safely regulate the target to the origin.
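The text does not specify how the non-overlapping initial placement was generated. One simple possibility is rejection sampling, sketched below; the minimum gap `min_gap`, the seed, and the function name are hypothetical choices and are not taken from the experiments.

```python
import numpy as np


def sample_non_overlapping(n, side=17.0, min_gap=0.1, seed=0, max_tries=100_000):
    """Draw n points uniformly in [0, side]^2 with pairwise distance >= min_gap.

    Candidates that fall too close to an accepted point are rejected.
    """
    rng = np.random.default_rng(seed)
    points = []
    tries = 0
    while len(points) < n:
        if tries >= max_tries:
            raise RuntimeError("could not place all points; reduce min_gap")
        tries += 1
        candidate = rng.uniform(0.0, side, size=2)
        if all(np.linalg.norm(candidate - p) >= min_gap for p in points):
            points.append(candidate)
    return np.array(points)
```

Any placement produced this way satisfies the distinctness assumption (11) of Theorem 1.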

To further illustrate the effectiveness of the proposed method, we compare its performance with a baseline strategy based on the farthest-agent algorithm [23]. This algorithm uses the sum of two vectors as the velocity input for the shepherd: one vector drives the shepherd toward the target, and the other prevents the shepherd from overshooting the target.

Figure 1 depicts two sets of comparative experiments, illustrating how the swarm trajectories evolve under the proposed and baseline methods. Each run lasted from $t=0$ until either $\lVert x_{T}\rVert<0.1$ or the designated time limit of $t=200$ was reached. Figures 1(a) and 1(b) show the overall trajectories of the proposed and baseline methods, respectively, on an $N=10$ swarm with the same initial placements, while Figures 1(c) and 1(d) show those on an $N=50$ swarm with the same initial placements. We observe that the baseline method guides the target to the vicinity of the origin but fails to make the target's position converge to the origin. In contrast, the proposed method makes the tracking error converge to zero and, therefore, succeeds in guiding the target to the origin asymptotically.
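The stopping rule described above can be combined with the model (1) and the controller (46) in a simple closed-loop simulation. The sketch below uses explicit Euler integration, which is an assumption: the integration scheme and step size of the reported experiments are not stated. The clamp `EPS` is likewise an added safeguard. Because (46) contains the discontinuous unit vector $e_{y}/\lVert e_{y}\rVert$, a discrete-time loop of this kind may chatter, and it is not claimed to reproduce the reported figures.

```python
import numpy as np

M_A, M_B, ELL_B = 10.0, 5.0, 0.5           # values from this section
GAMMA_1, ELL_Y, K_1 = 1.0, 0.5, 1.5
GAMMA_2 = GAMMA_1 * np.exp(1.0 / ELL_Y)
EPS = 1e-9                                 # numerical guard (assumption)


def interaction(x):
    """f_ai + f_bi for all sheep, from (2)-(5)."""
    diff = x[:, None, :] - x[None, :, :]
    dist = np.linalg.norm(diff, axis=-1)
    np.fill_diagonal(dist, np.inf)
    dist = np.maximum(dist, EPS)
    rep = np.where(dist > ELL_B, M_B / dist**2, (M_B / ELL_B) / dist)
    return ((rep - M_A)[:, :, None] * diff).sum(axis=1)


def shepherd_force(x, y):
    """f_yi for all sheep, from (7)-(9)."""
    diff = x - y
    d = np.maximum(np.linalg.norm(diff, axis=-1, keepdims=True), EPS)
    scale = np.where(d > ELL_Y, GAMMA_1, GAMMA_2 * np.exp(-1.0 / d)) / d**2
    return scale * diff


def control(x_t, y, w_t):
    """Shepherd velocity command (46)."""
    e_y = K_1 * x_t - y
    e_norm = np.linalg.norm(e_y)
    if e_norm < EPS:
        return np.zeros(2)
    r = max(np.linalg.norm(x_t - y), EPS)
    gamma = GAMMA_1 if r > ELL_Y else GAMMA_2
    nx, ny = np.linalg.norm(x_t), np.linalg.norm(y)
    gain = (K_1 + nx) * np.linalg.norm(w_t) + (nx**2 + nx * ny + K_1 * r) / r**2
    return gamma * gain * e_y / e_norm


def simulate(x0, y0, target, dt=1e-3, t_max=200.0, goal_radius=0.1):
    """Euler loop; stops when ||x_T|| < goal_radius or t reaches t_max."""
    x, y = np.array(x0, dtype=float), np.array(y0, dtype=float)
    steps, max_steps = 0, int(round(t_max / dt))
    while steps < max_steps and np.linalg.norm(x[target]) >= goal_radius:
        w = interaction(x)
        u = control(x[target], y, w[target])
        x = x + dt * (w + shepherd_force(x, y))   # sheep dynamics (1)
        y = y + dt * u                            # shepherd dynamics (44)
        steps += 1
    return x, y, steps * dt
```

The linear attraction makes the sheep dynamics stiff for large $N$, so the step size `dt` would need to shrink roughly in proportion to $1/(m_{a}N)$ for the explicit scheme to remain stable.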

We further illustrate the effectiveness of the proposed method for different swarm sizes (from $N=10$ to $N=200$), as shown in Figure 2. The dashed line represents the baseline method, and the solid line represents the proposed method. Each plot is obtained by averaging 100 simulations with different initial placements. From Figure 2, we observe that the proposed method regulates the target to the origin before $t=200$ for all swarm sizes and performs better than the baseline. This further corroborates the conclusion of the comparative analysis of the two methods mentioned above.

VI Conclusion and Future Work

In this study, we investigated the problem of using a shepherd to accurately guide a target in a large swarm to its destination. We first analyzed the properties of the swarm and rigorously proved that, under certain inequality constraints, no sheep in the swarm collide. Subsequently, we used the Lyapunov stability principle to design a shepherd control law that guides the target to the origin. We also compared our method with a baseline shepherding control algorithm. Our numerical results demonstrate that the shepherd can precisely guide the target to the goal in both small and large swarms. Our future work includes finding necessary and sufficient conditions for the collision-free property of shepherding swarms.

References

[1] E. O. Budrene and H. C. Berg, “Dynamics of formation of symmetrical patterns by chemotactic bacteria,” Nature, vol. 376, pp. 49–53, 1995.
[2] T. Vicsek, A. Czirók, E. Ben-Jacob, I. Cohen, and O. Shochet, “Novel type of phase transition in a system of self-driven particles,” Physical Review Letters, vol. 75, no. 6, p. 1226, 1995.
[3] V. Gazi and K. M. Passino, “A class of attractions/repulsion functions for stable swarm aggregations,” International Journal of Control, vol. 77, no. 18, pp. 1567–1579, 2004.
[4] R. Goel, J. Lewis, M. A. Goodrich, and P. Sujit, “Leader and predator based swarm steering for multiple tasks,” in 2019 IEEE International Conference on Systems, Man and Cybernetics, 2019, pp. 3791–3798.
[5] V. S. Chipade and D. Panagou, “Multi-swarm herding: Protecting against adversarial swarms,” in 59th IEEE Conference on Decision and Control, 2020, pp. 5374–5379.
[6] N. K. Long, K. Sammut, D. Sgarioto, M. Garratt, and H. A. Abbass, “A comprehensive review of shepherding as a bio-inspired swarm-robotics guidance approach,” IEEE Transactions on Emerging Topics in Computational Intelligence, vol. 4, no. 4, pp. 523–537, 2020.
[7] J. Hu, A. E. Turgut, T. Krajník, B. Lennox, and F. Arvin, “Occlusion-based coordination protocol design for autonomous robotic shepherding tasks,” IEEE Transactions on Cognitive and Developmental Systems, vol. 14, no. 1, pp. 126–135, 2020.
[8] Y. Tsunoda, Y. Sueoka, and K. Osuka, “On statistical analysis for shepherd guidance system,” in 2017 IEEE International Conference on Robotics and Biomimetics, 2017, pp. 1246–1251.
[9] A. Garrell and A. Sanfeliu, “Local optimization of cooperative robot movements for guiding and regrouping people in a guiding mission,” in 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010, pp. 3294–3299.
[10] H. Xie, M. Sun, X. Fan, Z. Lin, W. Chen, L. Wang, L. Dong, and Q. He, “Reconfigurable magnetic microrobot swarm: Multimode transformation, locomotion, and manipulation,” Science Robotics, vol. 4, no. 28, 2019.
[11] J. Yu, B. Wang, X. Du, Q. Wang, and L. Zhang, “Ultra-extensible ribbon-like magnetic microswarm,” Nature Communications, vol. 9, no. 1, p. 3260, 2018.
[12] P. Deptula, Z. I. Bell, F. M. Zegers, R. A. Licitra, and W. E. Dixon, “Approximate optimal influence over an agent through an uncertain interaction dynamic,” Automatica, vol. 134, p. 109913, 2021.
[13] R. A. Licitra, Z. I. Bell, and W. E. Dixon, “Single-agent indirect herding of multiple targets with uncertain dynamics,” IEEE Transactions on Robotics, vol. 35, no. 4, pp. 847–860, 2019.
[14] E. Sebastián and E. Montijano, “Multi-robot implicit control of herds,” in 2021 IEEE International Conference on Robotics and Automation, 2021, pp. 1601–1607.
[15] D. M. Le, X. Luo, L. J. Bridgeman, M. M. Zavlanos, and W. E. Dixon, “Single-agent indirect herding of multiple targets using metric temporal logic switching,” in 59th IEEE Conference on Decision and Control, 2020, pp. 1398–1403.
[16] W. Zhang, V. S. Chipade, and D. Panagou, “Herding an adversarial swarm in three-dimensional spaces,” in 2021 American Control Conference. IEEE, 2021, pp. 4722–4728.
[17] S. Mayya, G. Notomista, D. Shell, S. Hutchinson, and M. Egerstedt, “Non-uniform robot densities in vibration driven swarms using phase separation theory,” in 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019, pp. 4106–4112.
[18] X. Yin, D. Yue, and Z. Chen, “Asymptotic behavior and collision avoidance in the Cucker–Smale model,” IEEE Transactions on Automatic Control, vol. 65, no. 7, pp. 3112–3119, 2019.
[19] S. Bandyopadhyay, V. Gehlot, M. Balas, D. S. Bayard, and M. B. Quadrelli, “Detection of transient instabilities in multi-agent systems and swarms,” in 2021 American Control Conference. IEEE, 2021, pp. 1216–1223.
[20] H.-T. Zhang, C. Zhai, and Z. Chen, “A general alignment repulsion algorithm for flocking of multi-agent systems,” IEEE Transactions on Automatic Control, vol. 56, no. 2, pp. 430–435, 2010.
[21] B. Zhang and H. P. Gavin, “Natural deadlock resolution for multi-agent multi-swarm navigation,” in 60th IEEE Conference on Decision and Control, 2021, pp. 5958–5963.
[22] H. Su, X. Wang, and Z. Lin, “Flocking of multi-agents with a virtual leader,” IEEE Transactions on Automatic Control, vol. 54, no. 2, pp. 293–307, 2009.
[23] Y. Tsunoda, Y. Sueoka, Y. Sato, and K. Osuka, “Analysis of local-camera-based shepherding navigation,” Advanced Robotics, vol. 32, no. 23, pp. 1217–1228, 2018.
[24] S.-Y. Ha, J. Jung, J. Kim, J. Park, and X. Zhang, “Emergent behaviors of the swarmalator model for position-phase aggregation,” Mathematical Models and Methods in Applied Sciences, vol. 29, no. 12, pp. 2225–2269, 2019.
