4.7 Article

Multi-Agent Deep Reinforcement Learning-Based Flexible Satellite Payload for Mobile Terminals

Journal

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY
Volume 69, Issue 9, Pages 9849-9865

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TVT.2020.3002983

Keywords

Channel allocation; Resource management; Games; Reinforcement learning; Dynamic scheduling; Complexity theory; Satellite broadcasting; Multi-beam satellite system; multi-agent; deep reinforcement learning; bandwidth allocation

Funding

  1. National Natural Science Foundation of China [91438114, 61701033]
  2. Marine S&T Fund of Shandong Province for Pilot National Laboratory for Marine Science and Technology (Qingdao) [2018SDKJ0102-6]

Ask authors/readers for more resources

Information dissemination in mobile networks turns out to be a problem when the network is sparse. Mobile networks begin to establish a separate cluster attributable to the limited communication range of terminals. The multi-beam satellite communication systems can play a significant role in providing direct-to-user satellite mobile services and connecting the separated clusters. This paper focuses on how to efficiently schedule limited satellite-based radio resources to enhance transmission efficiency and meet the requested traffic with low complexity. Taking the inter-beam interference and resource utilization variance into consideration, we build a game-theoretic based model for bandwidth allocation in the forward link. As the size of satellite beams increases, the size of the action space for deep reinforcement learning based on a single agent becomes large, resulting in high time complexity. Thus, we extend the single-agent deep reinforcement learning to the multi-agent context and then propose a cooperative multi-agent deep reinforcement learning method to achieve the optimal bandwidth allocation strategy. Each beam works as a player who is willing to satisfy the request traffic with flexible payloads. We built a multi-beam satellite platform using real historical data. The experimental results show that this approach is capable of enhancing transmission efficiency and can be flexible to achieve the desired goal with low complexity.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available