MUI-TARE: Multi-Agent Cooperative Exploration with Unknown Initial Position

Jingtian Yan¹, Xingqiao Lin¹, Zhongqiang Ren¹, Shiqi Zhao², Jieqiong Yu¹, Chao Cao¹,
Peng Yin^1,∗, Ji Zhang¹ and Sebastian Scherer¹ This research was supported by grants from NVIDIA and utilized NVIDIA SDKs (CUDA Toolkit, TensorRT, and Omniverse). ¹J. Yan, X. Lin, P. Yin, and S. Scherer are with Robotics Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA (pyin2, basti@andrew.cmu.edu).

Abstract

Multi-agent exploration of a bounded 3D environment with unknown initial positions of agents is a challenging problem. It requires quickly exploring the environments as well as robustly merging the sub-maps built by the agents. We take the view that the existing approaches are either aggressive or conservative: Aggressive strategies merge two sub-maps built by different agents together when overlap is detected, which can lead to incorrect merging due to the false-positive detection of the overlap and is thus not robust. Conservative strategies direct one agent to revisit an excessive amount of the historical trajectory of another agent for verification before merging, which can lower the exploration efficiency due to the repeated exploration of the same space. To intelligently balance the robustness of sub-map merging and exploration efficiency, we develop a new approach for lidar-based multi-agent exploration, which can direct one agent to repeat another agent’s trajectory in an adaptive manner based on the quality indicator of the sub-map merging process. Additionally, our approach extends the recent single-agent hierarchical exploration strategy to multiple agents in a cooperative manner by planning for agents with merged sub-maps together to further improve exploration efficiency. Our experiments show that our approach is up to 50% more efficient than the baselines on average while merging sub-maps robustly.

Index Terms:

Multi-agent Exploration, Real-time Map Merging, Unknown Initial Pose

I Introduction

Refer to caption — Figure 1: Multi-agent exploration using Adaptive Merge. (a) represents the point clouds merged using estimate transform, and (b) represents the data association result between the features of two agents (i.e. each axis represents the trajectory of one agent). Once a sequence of features gets matched, the adaptive merge module will verify the association by measuring the verification gain. If the data association is incorrect, the association will be eliminated. Otherwise, it will adaptively increase the overlap and merge it into a sub-map. Using the relative pose and merged map given by the merged sub-map, the multi-agent sub-map exploration module is then used for exploration path planning.

Multi-Agent exploration of an unknown environment is an important problem in robotics and has been investigated for decades [1, 2, 3]. In this work, we consider the exploration of a bounded 3D environment with unknown initial positions of agents, which arises in applications such as planetary exploration [4, 5], underground mining [6] and search and rescue [7, 8]. This problem is challenging as it requires simultaneously handling two goals: (i) quickly exploring the environment, and (ii) robustly detecting the overlapped sub-maps built by different agents and merging the sub-maps. A common strategy to address the problem is to directly merge the sub-maps when the areas explored by different agents are detected to overlap. However, the sub-map merging procedure requires extracting features from each sub-map and associating the features across the sub-maps before merging them, which is often sensitive to false-positive feature matching [9]. In other words, a few mismatched features can lead to incorrect sub-map merging. To improve the robustness against false-positive feature matching, the idea of active verification has been applied [10]: when two agents are detected to have visited the same place, one agent is directed to repeat a certain amount of the other agent’s historical trajectory in order to increase the overlapped region and improve the robustness of feature association. However, this strategy achieves robustness at the cost of exploration efficiency, as it often leads to excessive duplicated exploration among agents than necessary.

In this paper, we propose MUI-TARE (Multi-agent TARE with Unknown Initial position), a multi-agent lidar-based exploration system that aims to intelligently balance between sub-map merging robustness and exploration efficiency. As shown in Fig. 1, the system has two features: adaptive sub-map merging and cooperative multi-agent exploration. First, MUI-TARE leverages the idea of active verification but in an adaptive manner: By leveraging our prior work AutoMerge [11], a sub-map merging method, MUI-TARE is able to learn from AutoMerge the quality of the current feature association and then increases the overlapped region only when needed so that a certain quality threshold on feature matching is reached in AutoMerge. This allows an agent to repeat another agent’s trajectory in an adaptive manner (based on the quality score) and thus avoid a large amount of duplicated exploration which leads to improvement in the overall exploration efficiency. Second, MUI-TARE employs a hierarchical cooperative multi-agent planning strategy for exploration, which extends a recent state-of-the-art single-agent exploration planner TARE [12] to multiple agents. The hierarchical strategy first plans a coarse global path that visits the areas to be explored based on the global information of the agent’s map and then plans a detailed local path to explore the current local area around the robot before moving to the next local area. Instead of naively applying this strategy to multiple agents, our approach plans for agents cooperatively when possible: it begins by planning each agent independently (before any sub-map is merged) and plans for agents together when their sub-maps are merged. Our approach improves the exploration efficiency due to this cooperative planning strategy.

To verify the idea of adaptive verification and cooperative multi-agent planning, we run simulations in several environments. Our numerical results show that (i) our adaptive verification strategy leads to up to 29% higher exploration efficiency than the existing active verification strategy while maintaining the robustness in the sub-map merging process; and (ii) the cooperative multi-agent planning reduces the overall exploration time for up to 52% than naively applying TARE to multiple agents without cooperation.

The rest of this work reviews the related work in Sec. II and describes the problem in Sec. III We then describe our method in Sec. IV and show the simulation results in Sec. V. Finally, we conclude and outline our future work in Sec. VI.

II Related Work

II-A Autonomous Exploration

Autonomous exploration using either single or multiple agents has been investigated for decades from different perspectives. Single-agent exploration has been extensively studied in both 2D and 3D, and various approaches have been developed ranging from frontier-based [13, 14], receding horizon sampling-based [15], information theoretic approaches [16, 17, 18], traveling salesman-based [12], etc. Our approach is an extension of the recent hierarchical traveling salesman-based approach TARE [12] to multiple agents.

Multi-agent exploration is more challenging than single-agent and existing work has focused on communication limitation [19], sub-map merging when initial positions of agents are unknown [10], allocating the unexplored areas to agents in a cooperative manner [1], etc. In this work, we limit our focus to the problem where agents’ initial locations are unknown. Many existing multi-agent exploration works are limited to 2D environment [1, 20, 10, 21], and extending them to 3D is challenging due to the increased amount of sensory data (e.g. 3D lidar scans), a large number of frontiers (i.e., the boundary between the explored and unexplored areas), the increased risk of feature mismatch when merging the sub-maps, etc. Inspired by the literature, we compare our approach against several baselines that take similar strategies to merge sub-maps as in [10, 21] to verify the effectiveness of our approach.

II-B Sub-map merging and data association

Map merging problems in 3D require integrating different sub-maps together, and the map is usually represented as 3D point clouds, occupancy grids [22, 23], or 3D meshes [24]. In this work, we limit our focus to lidar-based multi-agent exploration and consider maps represented as point clouds. To merge two sub-maps accurately and robustly, a key step is to identify unique features from each of the maps and associate the detected features. The SLAM community have proposed numerous methods by leveraging distinguishable semantic objects [25, 11], bag-of-words vectors [26], learned full-image descriptors [27], and sequence-based place recognition [28, 29], to name a few. Our prior work AutoMerge [11] provides a large-scale map alignment approach for multi-agent systems by leveraging a novel spherical harmonic feature extraction method for viewpoint-invariant place recognition and an adaptive sequence alignment method for accurate loop-closure detection. This work leverages AutoMerge to merge sub-maps and estimate the relative poses between agents.

In most SLAM systems, the agents are passive, i.e., the motion of the agents cannot be controlled. In a multi-agent exploration problem, the motion of agents needs to be planned in order to achieve both high exploration efficiency and high quality in sub-map merging, which is the focus of this work.

III Problem Statement

Let ${Q\subset\mathbb{R}^{3}}$ denote a bounded space to be explored and let ${Q_{trav}}\subseteq{Q}$ denote the traversable space in ${Q}$ . Additionally, let ${S}\subset{Q}$ denote all the surfaces in the ${Q}$ , which represents the generalized boundary between obstacle and obstacle-free space. Let ${I}=\{1,2,\dots,N\}$ represent a set of $N$ agents with unknown initial poses in ${Q_{trav}}$ . Agents are allowed to exchange information with a central station at any time. Let $u_{i}^{t}=\{p_{u_{i}^{t}},q_{u_{i}^{t}}\},p_{u_{i}^{t}}\in Q_{trav}$ denote a viewpoint of the sensor onboard agent ${i}\in{I}$ at time ${t}$ with ${p_{u_{i}^{t}}}$ and ${q_{u_{i}^{t}}}$ denoting the position and orientation of the sensor respectively. For each viewpoint, the surface perceived is represented as ${S_{v_{i}^{t}}^{cov}}$ . Let ${\tau_{i}}$ denote a path of agent ${i\in I}$ , and let $dist({\tau_{i}})$ represent the distance traveled by the agent along the path. A sequence of viewpoints ${U^{i}}=\{{{u_{i}^{0}},{u_{i}^{1}},\dots,{u_{1}^{T_{0}}}}\}\subset{Q_{trav}}$ is visited along the path, and the path must satisfy the kinodynamic constraints of the agent. The goal of the problem is to plan a set of kinodynamically feasible paths $\tau=\{\tau_{1},\tau_{2},\dots,\tau_{N}\}$ such that the perceived surface of all agents’ viewpoints along the paths covers ${S}$ (i.e., $\bigcup_{1}^{N}\bigcup_{0}^{T_{i}}{S_{v_{i}^{t}}^{cov}}={S}$ ) and the longest travel distance of all the paths (i.e., $\max_{i\in I}dist(\tau_{i})$ ) is minimized.

IV Method

This section begins with a system overview of MUI-TARE in Sec. IV-A. Instead of merging the point cloud from the agents passively, the framework used an adaptive sub-map merge algorithm to merge the sub-maps robustly, which is discussed in Sec. IV-B. Additionally, MUI-TARE utilizes a hierarchical multi-agent exploration method to efficiently explore each sub-map as presented in Sec. IV-C.

IV-A System Overview

As shown in Fig. 2, the entire system consists of multiple agents and an MUI-TARE server. During the exploration, the point clouds collected by the agents are sent to the server, and the server plans and sends the paths to the agents for execution. MUI-TARE (Fig. 2) consists of two major components: an adaptive and active map merging module, and a multi-agent sub-map exploration module. Here, the sub-map means the map that is formed by a group of agents whose relative transform is known. The map merging module merges the segments (i.e., the accumulated point cloud) of agents into sub-maps and estimates the relative pose of agents based on the merged sub-maps. Given the point clouds from different agents, our approach detects the potential overlap between the sub-maps via data association: if two series of frames (i.e. the extracted feature at a certain pose) are matched, we say a new inner connection is established between agents. Then, the server plans a path for the nearby agent to verify if this inner connection is a real overlapped area. Once the connection is confirmed, the nearest agent is further navigated to increase the overlap until this new inner connection satisfies the sub-map merge condition. Finally, the overlapped sub-maps are merged into a single sub-map, and the relative position of the agents in this sub-map can be computed.

Based on the sub-maps computed by the map merging module, multi-agent exploration is performed, which uses a hierarchical path planning approach for each sub-map. Specifically, a global coarse path is first planned for agents to visit all the sub-space to be explored. In each sub-space of an agent, a detailed local path is planned to cover all the surfaces in it. MUI-TARE runs the map merging module and the multi-agent exploration module parallel to cover the whole working space.

IV-B Adaptive Map Merging

During the exploration process, we maintain a factor graph $G=\{V,E\}$ to represent the inner connections between the segments of agents. Let $V=\{v_{1},v_{2},...,v_{n}\}$ denote the nodes of the graph which represent the segment obtained by each robot. Then, we define edge $E=\{\omega_{i,j}\}$ as the connections between segments, where $\omega_{i,j}$ is the inner connection of node $v_{i}$ and node $v_{j}$ . For the purpose of place recognition, the inner connection of two segments is determined by the overlap length (i.e., the number of consecutive matched frames) and place recognition descriptors. Thus, the inner-connection $\omega_{i,j}$ is defined as,

{\omega_{i,j}}=\left\{\begin{aligned} exp^{-\frac{\|{F_{i}-F_{j}}\|_{2}^{2}+C_{w}}{2L_{i,j}^{2}+\epsilon}},\ i\neq j,\\ 0,\ i=j,\end{aligned}\right.

(1)

where $F_{i},F_{j}$ indicates the feature descriptors extracted from the overlapped area of $v_{i},v_{j}$ , $L_{i,j}$ represents the overlap length between $v_{i},v_{j}$ , and $C_{w},\epsilon$ are hyper-parameters introduced in AutoMerge [11].

Algorithm 1 Pseudo-code Adaptive overlap estimation

1:while ExplorationNotFinished() do

2: MatchedSequence

\leftarrow

AutoMerge()

3: for sequence in MatchedSequence do

4: if !IsMatchValid(sequence) then

5: Remove(sequence)

6: continue

7: inner-connection

\leftarrow

ConnectionEval(sequence)

8: T

\leftarrow

TransformEval(sequence)

9: if inner-connection < threshold then

10: dist

\leftarrow

EstimateDist(inner-connection)

11: ActiveMerge(T, dist)

12: else

13: SubmapMerge(T)

Algorithm 2 Pseudo-code ActiveMerge

Input: T, dist

1:AgentStats

\leftarrow

GetAgentStatus()

2:if AgentStatus is BackToOverlap then

3:Path

\leftarrow

GetClosestPathToOverlap()

4:else if AgentStatus is ActiveMerge then

G_{2}

\leftarrow

PosegraphTransform(

G_{2}

, T)

6:Path

\leftarrow

GetLookAheadPath(

G_{2}

, dist)

7:VerificationGain

\leftarrow

EvaluateExploreGain()

8:if VerificationGain < threshold then

9:PublishInvalidOverlap()

In the merging procedure, only the segments with inner connections larger than the threshold will be clustered and merged into the same sub-map. The relative pose between agents, which is represented by a transform matrix, can be determined by the transform estimated in the map merge. However, such stable inner connections usually required long overlap distances which is hard to satisfy when agents are exploring without knowing others’ poses. Thus, using a shorter sequence of matched frames for potential overlap detection will be helpful to increase the chance an overlap can be detected. However, using a shorter sequence of matched frames may cause incorrect data association and large transform matrix errors.

Thus, the adaptive merge will plan a new path for the agent to verify the data association and enhance the inner connection(Fig. 3). As shown in Algorithm. 1, when a potential overlap was detected between agent $i$ in the sub-map $m_{1}$ and agent $j$ in the sub-map $m_{2}$ , the inner connection is evaluated and added to the graph $G$ . Then, a rough transform matrix $T$ can be estimated by doing the Iterative closest point (ICP) between associated point clouds. Based on eq. 1, the value of the inner connection is positively related to the overlap length and negatively related to the feature difference of the overlap. Thus, if the inner connection does not satisfy the merge condition, the planner will choose one agent $i$ , whose distance to overlap is shorter, and execute the active merge to increase the inner connection. To help the path planning of active merge, an estimation of the distance that the agent needs to travel is computed:

\displaystyle dist=C_{w}(\frac{1}{\ln{\omega^{t}_{i,j}}}-\frac{1}{\ln{\omega_{thresh}}})

(2)

where the $\omega^{t}_{i,j}$ is the current inner connection, and $\omega_{thresh}$ is the threshold of the inner connection to guide map merging.

As shown in Algorithm. 2, the planner will first plan a path for agent $i$ to move back to the overlapped region by doing $A^{*}$ search on the past trajectory of agent $i$ . Once the agent $i$ reaches the overlap, it will be navigated to visit the key pose of agent $j$ outside of the overlap. Denote the key pose position of agent $i$ in the sub-map $m_{1}$ as $\{^{m_{1}}x_{i}^{0},...,^{m_{1}}x_{i}^{t}\}$ , and denote the key pose position of agent $j$ in the sub-map $m_{2}$ as $\{^{m_{2}}x_{j}^{0},...,^{m_{2}}x_{j}^{t}\}$ . The trajectory of agent $j$ can be transformed to $\{^{m_{1}}\widehat{x}_{j}^{0},...,^{m_{1}}\widehat{x}_{j}^{t}\}$ using $T$ . Here, we use the greedy strategy to navigate the agent to the closest transformed key pose.

In order to measure the newly obtained overlap during the active merge, we define verification gain as:

\displaystyle G(i,j,t)=\frac{\omega_{i,j}^{t}-\omega_{i,j}^{t_{0}}+C_{\epsilon}}{\|t-{t_{0}}\|+\epsilon}

(3)

where, $\omega^{t}_{i,j}$ is the current value of inner connection, $\omega^{t_{0}}_{i,j}$ is the value of inner connection when agent start to increase overlap, $\epsilon$ and $C_{\epsilon}$ are constant value.

During the merge process, if the verification gain is smaller than the threshold, this new connection is regarded as an incorrect connection and gets eliminated from graph $G$ . Otherwise, the agent keeps active merge until it meets the sub-map merge requirement. Once the reliable inner connection between sub-maps is established, the sub-maps are merged into a single sub-map as described in [11].

Remark. This module uses the same feature extraction and data association function as the existing AutoMerge paper [11]. However, in order to actively increase the chance that an overlap gets detected, we design the adaptive merge method as shown in lines 3-13 in the Algorithm. 1 and Algorithm. 2.

IV-C Multi-agent exploration in sub-maps

Algorithm 3 Pseudocode for hierarchical planning

1:for m := 1 to M do

2: D’

\leftarrow

GetDistanceMatrix()

\{\tau_{1}^{global},...\tau_{k}^{global}\}\leftarrow GlobalPlanningSolver(\textit{D'})

4: for k := 1 to K do

\tau_{j}^{local}\leftarrow

LocalPlanner()

\tau_{j}\leftarrow

PathConcatenate(

\tau_{j}^{global},\tau_{j}^{local})

return

\{\tau_{1},\tau_{2},\dots,\tau_{N}\}

In this work, we extend a state-of-the-art single-agent exploration method TARE to multiple agents. As discussed in the previous part, since the transform of agents in each sub-map is given by the adaptive sub-map merge module, we plan the paths for agents based on these sub-maps. The total number of the sub-maps is denoted as $M$ , and let $m$ be the $m-th$ sub-map, where $m=1,2,\dots,M$ . Initially, as each sub-map only contains one agent, $M$ equals $N$ . As sub-maps get merged, $M$ decreases. Let $K_{m}$ denote the number of agents in sub-map $m$ . In each sub-map, a similar sub-spaces-based representation as mentioned in TARE [12] is used. It divided the workspace ${Q}$ into even subspaces. Based on the coverage of the subspace, the status of the subspace will be set to “exploring”, “explored”, or “unexplored”. If all the surfaces in the subspace have been covered, the subspace will be labeled as “explored”. When the subspace contains both uncovered surface and covered surface, the subspace will be assigned an “exploring” label. At the same time, for the subspace that is fully uncovered, the label of the subspace will be “unexplored”.

The path is planned by combining the detailed local path from the subspace planner and the coarse global path from the global planner. For the subspace planners, this paper used the same method as the local planner used in TARE. the local planner samples a set of viewpoints that can cover the surface in the subspace. Then, a local detailed path is planned to travel through all the viewpoints with the shortest distance while satisfying smoothness constraints. For the global planner, instead of using a greedy algorithm, this planner optimized the overall path of all agents as shown in Fig. 4.

The goal of the global planner is planning a set of global path $\tau_{global}=\{\tau_{1}^{global},...\tau_{K_{m}}^{global}\}$ for $K_{m}$ agents in the sub-map $m$ to travel through all the “exploring” subspace. Since the agents travel in parallel, in order to minimize the overall travel time, the planner tries to minimize the maximum travel distance among all the agents:

		$\displaystyle\min(dist(\tau^{global}))=$		(4)
		$\displaystyle\min(\max(dist(\tau_{1}^{global}),...,dist(\tau_{K_{m}}^{global})))$		(4)

Thus, this problem can be formulated as a variant of the standard multi-depot vehicle routing problem (MDVRP) as “min-max MDVRP”. Since this problem is NP-hard, heuristic methods are used to solve it [30].

As shown in the Algorithm 3, for $M$ sub-maps in the current working space, the distance between all the exploring cells and agents in each sub-map will be calculated to form the distance matrix $D^{\prime}$ . This distance is obtained by doing the $A^{*}$ search on the roadmap acquired during the exploration process. Then, we use the min-max MDVRP solver to solve the minimum longest path using the distance matrix $D^{\prime}$ . Given this solution, $K_{m}$ global coarse paths $\{\tau_{1}^{global},...\tau_{K_{m}}^{global}\}$ can be formed for $K_{m}$ agents in the sub-map $m$ . With the local path planned by the local planner, each agent replaces the part of the global path that falls in the subspace with its own local path to create the full exploration path. Finally, these full exploration paths $\{\tau_{1},\tau_{2},\dots,\tau_{N}\}$ are sent to all the $N$ agents for execution.

Remark. This multi-agent exploration uses a similar global representation and local planning method as in TARE [12]. Additionally, we design a new cooperative global planner for multi-agent global planning as shown in the Algorithm 3.

V Experiments

This section presents experimental results and discussions of MUI-TARE. We compare it with several exploration methods [21, 10, 12] and show the advantage of our MUI-TARE.

V-A Experiment Setup

We test MUI-TARE in several different simulation environments. The simulation experiment consists of multiple robots and a central server, each on a different computer. The robots are simulated in Gazebo, and each robot is equipped with a simulated 360-degree Velodyne Lidar. The collected point clouds are used for map merging and path planning for exploration. In addition, an IMU is fused with the Lidar data for state estimation. During the experiment, the maximum speed of the robot is 3 m/s. In our implementation, the simulation environment is divided evenly into sub-spaces, where each sub-space is a 10m×10m×5m cell. We use the ROS multi-master communication package [31] for data exchange between the agents and the server. As we only transfer the point clouds collected by agents, and the waypoints computed by the server. The average bandwidth needed for a single agent is 786 KB/s in our experiments, which is similar to the recent work [21]. As shown in Fig. 6, the experiments are done in two simulation environments from [32].

•

Campus (340m $\times$ 340m): A part of the Carnegie Mellon University campus, containing undulating terrains and convoluted environment layout.
•

Multi-storage Garage (140m $\times$ 130m, 5 floors): A complex 3D environment with multiple floors and sloped terrains.

V-B Adaptive Merge Evaluation

V-B1 Baselines

To show the performance of our Adaptive Merge algorithm, we compare MUI-TARE against the following baseline methods. These baselines are designed by adapting the ideas in the respective paper to our test scenarios.

•

TRUST-Explorer: leverages the idea in [10]. It verifies the overlap between two agents every time when the potential overlap is detected. The verification is done by navigating one robot to repeat an excessive amount of the other agent’s trajectory. However, this method is designed mainly for a 2D environment. We adapt this idea in our framework by letting one agent repeat a fixed amount of the other agent’s trajectory.
•

SMMR-Explorer: leverages the idea in [21]. It uses an aggressive merge strategy where the sub-maps are merged once the overlap is detected without verification.

TABLE I: Exploration efficiency for Adaptive Merge compare with TRUST-Explorer.

Metrics	TRUST-Explorer		ours
Metrics	N=2	N=3	N=2	N=3
Exploration Time (s)	1226.89	930.33	951.45	818.36

V-B2 Results and Discussion

In comparison with TRUST-Explorer, Fig. 7 shows the explored volume against time while Table I shows the exploration efficiency. We observed that our method is 29% faster for 2 agents, and 14% faster for 3 agents.

We then compare our method against SMMR-Explorer. As shown in Fig. 5 (a), the aggressive strategy in SMMR-Explorer can lead to incorrect data association, which results in an incorrect map after merging. As shown in Fig. 5 (b), SMMR-Explorer can also lead to large errors in the computed transform matrix. In comparison, Fig. 5 (c) shows the merged map using our approach which results in robust map merging and small errors in the transform matrix. To summarize, due to the adaptive exploration strategy, our MUI-TARE achieves higher exploration efficiency than TRUST-Explorer and more robust map merging than SMMR-Explorer.

V-C Multi-Agent Exploration Evaluation

TABLE II: Exploration efficiency & runtime for multi-agent exploration at campus and garage compare with mTARE.

Env	Metrics	mTARE		ours
Env	Metrics	N=2	N=3	N=2	N=3
Campus	Exploration Time(s)	1450.35	1234.36	951.45	818.36
	Local Plan Runtime (ms)	16.40	22.23	28.72	29.60
	Global Plan Runtime (ms)	8.18	8.34	11.48	17.30
Garage	Exploration Time (s)	2502.13	1989.09	1665.11	1022.92
	Local Plan Runtime (ms)	63.68	38.78	33.20	45.77
	Global Plan Runtime (ms)	9.99	9.99	18.93	23.19

This section compares our MUI-TARE with a naive extension of TARE to multiple agents. Specifically, we introduce multi-agent TARE (mTARE) as a baseline, which plans for each agent independently without planning multiple agents together after merging.

V-C1 Results

We compare our method against mTARE in the garage environment with 2 and 3 agents. As shown in Fig. 8, the upper plots show the explored volume versus exploration duration while the lower plots show the runtime of the planning during the exploration. We present the exploration time and planning time in Table II. It shows that the exploration time of our method is 52% faster than mTARE for 2 agents. For 3 agents, our MUI-TARE is 51% faster than mTARE. This result verifies the benefit of the multi-agent global planning strategy used in MUI-TARE. Using the global path planned from it, our agents will be able to reduce the cost.

Additionally, Table II shows the average local planning and global planning runtime per call in MUI-TARE and mTARE. We observed that, the planning time has no obvious increment as the number of agents increases. Finally, as shown in Fig. 8 (a), by increasing the number of agents from one to two, the time required to finish the exploration (i.e., the time when the explored volume stop increasing) is reduced by roughly 64%. However, by increasing the number of agents from two to three, this reduction in exploration time is only about 16%. This indicates a diminishing return when increasing the number of agents in a exploration task.

VI Conclusions and Future Work

This paper presents MUI-TARE, a multi-agent cooperative method for exploration with the unknown initial position. MUI-TARE intelligently balances the robustness of sub-map merging and exploration efficiency, by using an adaptive approach for map merging. Additionally, MUI-TARE extends the recent single-agent hierarchical exploration strategy to multiple agents in a cooperative manner by planning path of multiple agents together after their sub-maps are merged to further improve exploration efficiency. Our numerical results verifies the benefits of our approach.

For future work, one can consider extending MUI-TARE to a decentralized or distributed version where no global communication is required. In addition, this work mainly focuses on the problem where the initial positions of agents are completely unknown. One can extend this method to address the case where partial prior knowledge about the initial agent position is available.

References

[1] W. Burgard, M. Moors, C. Stachniss, and F. Schneider, “Coordinated multi-robot exploration,” IEEE Transactions on Robotics, vol. 21, no. 3, pp. 376–386, 2005.
[2] J. K. Verma and V. Ranga, “Multi-robot coordination analysis, taxonomy, challenges and future scope,” Journal of intelligent & robotic systems, vol. 102, no. 1, pp. 1–36, 2021.
[3] P. Petráček, V. Krátký, M. Petrlík, T. Báča, R. Kratochvíl, and M. Saska, “Large-scale exploration of cave environments by unmanned aerial vehicles,” IEEE Robotics and Automation Letters, vol. 6, no. 4, pp. 7596–7603, 2021.
[4] M. J. Schuster, S. G. Brunner, K. Bussmann, S. Büttner, A. Dömel, M. Hellerer, H. Lehner, P. Lehner, O. Porges, J. Reill et al., “Towards autonomous planetary exploration,” Journal of Intelligent & Robotic Systems, vol. 93, no. 3, pp. 461–494, 2019.
[5] M. Hassanalian, D. Rice, and A. Abdelkefi, “Evolution of space drones for planetary exploration: A review,” Progress in Aerospace Sciences, vol. 97, pp. 61–105, 2018.
[6] C. Papachristos, S. Khattak, F. Mascarich, and K. Alexis, “Autonomous navigation and mapping in underground mines using aerial robots,” in 2019 IEEE Aerospace Conference, 2019, pp. 1–8.
[7] A. Davids, “Urban search and rescue robots: from tragedy to technology,” IEEE Intelligent Systems, vol. 17, no. 2, pp. 81–83, 2002.
[8] B. Shah and H. Choset, “Survey on urban search and rescue robots,” Journal of the Robotics Society of Japan, vol. 22, no. 5, pp. 582–586, 2004.
[9] S. Yu, C. Fu, A. K. Gostar, and M. Hu, “A review on map-merging methods for typical map types in multiple-ground-robot slam solutions,” Sensors, vol. 20, no. 23, p. 6988, 2020.
[10] A. Caccavale and M. Schwager, “Trust but verify: A distributed algorithm for multi-robot wireframe exploration and mapping,” in 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2019, pp. 3294–3301.
[11] P. Yin, H. Lai, S. Zhao, R. Fu, I. Cisneros, R. Ge, J. Zhang, H. Choset, and S. Scherer, “Automerge: A framework for map assembling and smoothing in city-scale environments,” 2022.
[12] C. Cao, H. Zhu, H. Choset, and J. Zhang, “Tare: A hierarchical framework for efficiently exploring complex 3d environments.” in Robotics: Science and Systems, 2021.
[13] B. Yamauchi, “A frontier-based approach for autonomous exploration,” in Proceedings 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation CIRA’97. ’Towards New Computational Principles for Robotics and Automation’, 1997, pp. 146–151.
[14] A. Batinovic, T. Petrovic, A. Ivanovic, F. Petric, and S. Bogdan, “A multi-resolution frontier-based planner for autonomous 3d exploration,” IEEE Robotics and Automation Letters, vol. 6, no. 3, pp. 4528–4535, 2021.
[15] A. Bircher, M. Kamel, K. Alexis, H. Oleynikova, and R. Siegwart, “Receding horizon ”next-best-view” planner for 3d exploration,” in 2016 IEEE International Conference on Robotics and Automation (ICRA), 2016, pp. 1462–1468.
[16] F. Bourgault, A. Makarenko, S. Williams, B. Grocholsky, and H. Durrant-Whyte, “Information based adaptive robotic exploration,” in IEEE/RSJ International Conference on Intelligent Robots and Systems, vol. 1, 2002, pp. 540–545 vol.1.
[17] C. Stachniss, G. Grisetti, and W. Burgard, “Information gain-based exploration using rao-blackwellized particle filters.” in Robotics: Science and systems, vol. 2, 2005, pp. 65–72.
[18] H. Carrillo, P. Dames, V. Kumar, and J. A. Castellanos, “Autonomous robotic exploration using a utility function based on rényi’s general theory of entropy,” Autonomous Robots, vol. 42, no. 2, pp. 235–256, 2018.
[19] F. Amigoni, J. Banfi, and N. Basilico, “Multirobot exploration of communication-restricted environments: A survey,” IEEE Intelligent Systems, vol. 32, no. 6, pp. 48–57, 2017.
[20] D. Fox, J. Ko, K. Konolige, B. Limketkai, D. Schulz, and B. Stewart, “Distributed multirobot exploration and mapping,” Proceedings of the IEEE, vol. 94, no. 7, pp. 1325–1339, 2006.
[21] J. Yu, J. Tong, Y. Xu, Z. Xu, H. Dong, T. Yang, and Y. Wang, “Smmr-explore: Submap-based multi-robot exploration system with multi-robot multi-target potential field exploration method,” in 2021 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2021, pp. 8779–8785.
[22] S. Carpin, “Fast and accurate map merging for multi-robot systems,” Autonomous robots, vol. 25, no. 3, pp. 305–316, 2008.
[23] A. Hornung, K. M. Wurm, M. Bennewitz, C. Stachniss, and W. Burgard, “Octomap: An efficient probabilistic 3d mapping framework based on octrees,” Autonomous robots, vol. 34, no. 3, pp. 189–206, 2013.
[24] A. Rosinol, A. Violette, M. Abate, N. Hughes, Y. Chang, J. Shi, A. Gupta, and L. Carlone, “Kimera: From slam to spatial perception with 3d dynamic scene graphs,” The International Journal of Robotics Research, vol. 40, no. 12-14, pp. 1510–1546, 2021.
[25] R. Dube, A. Cramariuc, D. Dugas, H. Sommer, M. Dymczyk, J. Nieto, R. Siegwart, and C. Cadena, “Segmap: Segment-based mapping and localization using data-driven descriptors,” The International Journal of Robotics Research, vol. 39, no. 2-3, pp. 339–355, 2020.
[26] D. Galvez-López and J. D. Tardos, “Bags of binary words for fast place recognition in image sequences,” IEEE Transactions on Robotics, vol. 28, no. 5, pp. 1188–1197, 2012.
[27] R. Arandjelovic and A. Zisserman, “All about vlad,” in Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, 2013, pp. 1578–1585.
[28] M. J. Milford and G. F. Wyeth, “Seqslam: Visual route-based navigation for sunny summer days and stormy winter nights,” in 2012 IEEE International Conference on Robotics and Automation, 2012, pp. 1643–1649.
[29] P. Yin, F. Wang, A. Egorov, J. Hou, J. Zhang, and H. Choset, “Seqspherevlad: Sequence matching enhanced orientation-invariant place recognition,” in 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2020, pp. 5024–5029.
[30] J. Carlsson, D. Ge, A. Subramaniam, A. Wu, and Y. Ye, “Solving min-max multi-depot vehicle routing problem,” Lectures on global optimization, vol. 55, pp. 31–46, 2009.
[31] A. Tiderko, F. Hoeller, and T. Röhling, “The ros multimaster extension for simplified deployment of multi-robot systems,” in Robot operating system (ROS). Springer, 2016, pp. 629–650.
[32] C. Cao, H. Zhu, F. Yang, Y. Xia, H. Choset, J. Oh, and J. Zhang, “Autonomous exploration development environment and the planning algorithms,” in 2022 International Conference on Robotics and Automation (ICRA). IEEE, 2022, pp. 8921–8928.