4.7 Article Proceedings Paper

Resource Allocation in Uplink NOMA-IoT Networks: A Reinforcement-Learning Approach

期刊

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS
卷 20, 期 8, 页码 5083-5098

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TWC.2021.3065523

关键词

NOMA; Resource management; Dynamic scheduling; Heuristic algorithms; Wireless networks; Uplink; Vehicle dynamics; Deep reinforcement learning; Internet of Things; non-orthogonal multiple access; power allocation; SARSA learning; user clustering

资金

  1. U.K. Engineering and Physical Science Research Council (EPSRC) [EP/R006466/1]
  2. EPSRC [EP/R006466/1] Funding Source: UKRI

向作者/读者索取更多资源

This article develops an intelligent resource allocation scheme for uplink NOMA-IoT communications, utilizing reinforcement learning algorithms to optimize network performance. Different algorithms are used based on the traffic load to achieve the optimal resource allocation strategy.
Non-orthogonal multiple access (NOMA) exploits the potential of the power domain to enhance the connectivity for the Internet of Things (IoT). Due to time-varying communication channels, dynamic user clustering is a promising method to increase the throughput of NOMA-IoT networks. This article develops an intelligent resource allocation scheme for uplink NOMA-IoT communications. To maximise the average performance of sum rates, this work designs an efficient optimization approach based on two reinforcement learning algorithms, namely deep reinforcement learning (DRL) and SARSA-learning. For light traffic, SARSA-learning is used to explore the safest resource allocation policy with low cost. For heavy traffic, DRL is used to handle traffic-introduced huge variables. With the aid of the considered approach, this work addresses two main problems of fair resource allocation in NOMA techniques: 1) allocating users dynamically and 2) balancing resource blocks and network traffic. We analytically demonstrate that the rate of convergence is inversely proportional to network sizes. Numerical results show that: 1) Compared with the optimal benchmark scheme, the proposed DRL and SARSA-learning algorithms have lower complexity with acceptable accuracy and 2) NOMA-enabled IoT networks outperform the conventional orthogonal multiple access based IoT networks in terms of system throughput.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据