4.7 Article

Exploring communication protocols and centralized critics in multi-agent deep learning

Journal

INTEGRATED COMPUTER-AIDED ENGINEERING
Volume 27, Issue 4, Pages 333-351

Publisher

IOS PRESS
DOI: 10.3233/ICA-200631

Keywords

Multi-agent systems; neural networks; emergent communication; deep reinforcement learning

Funding

  1. Portuguese Foundation for Science and Technology (FCT) [PD/BD/113963/2015]
  2. Institute of Electronics and Informatics Engineering of Aveiro - IEETA [UIDB/00127/2020]
  3. Artificial Intelligence and Computer Science Laboratory - LIACC [UIDB/00027/2020]
  4. Fundação para a Ciência e a Tecnologia [PD/BD/113963/2015, UIDB/00027/2020] Funding Source: FCT

Ask authors/readers for more resources

Tackling multi-agent environments where each agent has a local limited observation of the global state is a non-trivial task that often requires hand-tuned solutions. A team of agents coordinating in such scenarios must handle the complex underlying environment, while each agent only has partial knowledge about the environment. Deep reinforcement learning has been shown to achieve super-human performance in single-agent environments, and has since been adapted to the multi-agent paradigm. This paper proposes A3C3, a multi-agent deep learning algorithm, where agents are evaluated by a centralized referee during the learning phase, but remain independent from each other in actual execution. This referee's neural network is augmented with a permutation invariance architecture to increase its scalability to large teams. A3C3 also allows agents to learn communication protocols with which agents share relevant information to their team members, allowing them to overcome their limited knowledge, and achieve coordination. A3C3 and its permutation invariant augmentation is evaluated in multiple multi-agent test-beds, which include partially-observable scenarios, swarm environments, and complex 3D soccer simulations.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available