4.7 Article

Effective Communications: A Joint Learning and Communication Framework for Multi-Agent Reinforcement Learning Over Noisy Channels

期刊

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JSAC.2021.3087248

关键词

Noise measurement; Protocols; Channel coding; Semantics; Reinforcement learning; Modulation; Wireless communication; Learning to communicate; reinforcement learning (RL); multi-agent systems; joint source-channel coding; error correction coding

资金

  1. European Research Council (ERC) Starting Grant BEACON [677854]
  2. U.K. Engineering and Physical Sciences Research Council (EPSRC) [EP/T023600/1]
  3. EPSRC [EP/T023600/1] Funding Source: UKRI

向作者/读者索取更多资源

The novel formulation of the effectiveness problem in communications focuses on multiple agents communicating over a noisy channel to achieve better coordination in multi-agent reinforcement learning. Agents learn to collaborate and effectively communicate over a noisy channel in a MA-POMDP framework, improving joint policies compared to separate communication consideration. This framework has wide real-world applications and utilizes deep reinforcement learning for designing multi-user communication systems.
We propose a novel formulation of the effectiveness problem in communications, put forth by Shannon and Weaver in their seminal work The Mathematical Theory of Communication, by considering multiple agents communicating over a noisy channel in order to achieve better coordination and cooperation in a multi-agent reinforcement learning (MARL) framework. Specifically, we consider a multi-agent partially observable Markov decision process (MA-POMDP), in which the agents, in addition to interacting with the environment, can also communicate with each other over a noisy communication channel. The noisy communication channel is considered explicitly as part of the dynamics of the environment, and the message each agent sends is part of the action that the agent can take. As a result, the agents learn not only to collaborate with each other but also to communicate effectively over a noisy channel. This framework generalizes both the traditional communication problem, where the main goal is to convey a message reliably over a noisy channel, and the learning to communicate framework that has received recent attention in the MARL literature, where the underlying communication channels are assumed to be error-free. We show via examples that the joint policy learned using the proposed framework is superior to that where the communication is considered separately from the underlying MA-POMDP. This is a very powerful framework, which has many real world applications, from autonomous vehicle planning to drone swarm control, and opens up the rich toolbox of deep reinforcement learning for the design of multi-user communication systems.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据