4.6 Article

EM-Patroller: Entropy Maximized Multi-Robot Patrolling With Steady State Distribution Approximation

期刊

IEEE ROBOTICS AND AUTOMATION LETTERS
卷 8, 期 9, 页码 5712-5719

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/LRA.2023.3300245

关键词

Un-normalized joint steady state distribution; multi-robot patrolling; multi-agent model-based policy gradient

类别

向作者/读者索取更多资源

This letter proposes an efficient iterative algorithm, named Entropy Maximized Patroller (EM-Patroller), to solve the multi-robot patrolling (MuRP) problem in a discrete environment. EM-Patroller achieves uniform node coverage probability distribution by reformulating the MuRP problem as an unnormalized joint steady state distribution entropy maximization problem. It uses the multi-layer perceptron (MLP) to model the relationship between each robot's patrolling strategy and the individual steady state distribution.
This letter investigates the multi-robot patrolling (MuRP) problem in a discrete environment with the objective of achieving uniform node coverage probability distribution by the robot team. Existing MuRP solutions for uniform node coverage either involve high computational complexity for the global optimal solution or rely on heuristics for approximate solutions without performance guarantees. To bridge the gap, we propose an efficient iterative algorithm, namely Entropy Maximized Patroller (EM-Patroller), with the per-iteration performance improvement guarantee and polynomial computational complexity. We reformulate the MuRP problem as an unnormalized joint steady state distribution entropy maximization problem and use multi-layer perceptron (MLP) to model the relationship between each robot's patrolling strategy and the individual steady state distribution. We derive a multi-agent model-based policy gradient method to update the robots' patrolling strategies towards the optimum. Complexity analysis indicates the polynomial computational complexity of EM-Patroller, and we show that EM-Patroller has additional benefits of accommodating user-defined joint steady state distributions and incorporating other objectives such as entropy maximization of individual steady state distribution. We compare EM-Patroller with state-of-the-art MuRP algorithms in various canonical MuRP environments and deploy it to a real multi-robot system for patrolling in a self-constructed indoor environment.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据