☆ 4.8 Article

Reinforcement-Learning-Based Dynamic Spectrum Access for Software-Defined Cognitive Industrial Internet of Things

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS (2022)

Journal

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS

Volume 18, Issue 6, Pages 4244-4253

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TII.2021.3113949

Keywords

Cognitive industrial Internet of Things (CIIoT); dynamic spectrum access; Q-learning; reinforcement learning (RL); reward

Funding

National Natural Science Foundation of China [U1833102]
Civil Aviation of China [U1833102]
Natural Science Foundation of Liaoning Province [2020-HYLH-13, 2019-ZD-0014]
Fundamental Research Funds for the Central Universities [DUT21JC20]
Engineering Research Center of Mobile Communications of the Ministry of Education

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

In this article, a Q-learning-based dynamic spectrum access scheme is proposed for the cognitive industrial Internet of Things (CIIoT) to intelligently utilize spectrum resources in three access scenarios. Simulation results show the advantages of the Q-learning-based NOMA scheme in terms of guaranteeing CIIoT throughput and reducing interference.

The cognitive industrial Internet of Things (CIIoT) can improve transmission performance by utilizing the spectrum licensed to a primary user (PU), providing that the normal communication of the PU is not disturbed. However, the traditional spectrum access schemes for the CIIoT are difficult to adapt to the various communication environments. In this article, Q-learning-based dynamic spectrum access is proposed for the CIIoT to intelligently utilize the spectrum resources in three access scenarios: orthogonal multiple access (OMA), underlay spectrum access, and nonorthogonal multiple access (NOMA). In the OMA scheme, the CIIoT learns to access the idle channels to avoid distributing the PUs, but its communication continuity cannot be guaranteed when most of the channels are occupied by the PUs. In the underlay scheme, the CIIoT learns to utilize the busy channels to ensure the communication continuity by limiting its transmit power within the tolerance of the PU. However, the interference to the PU cannot be eliminated, which will decrease the PU's throughput. In the NOMA scheme, however, the CIIoT can utilize the busy channels by canceling the interference to the PU with successive interference cancellation, which will guarantee the transmission performance of both the CIIoT and the PU. A Q-learning-based spectrum access algorithm is proposed to improve the transmission performance of the CIIoT in the three schemes. The simulation results have shown the advantages of the Q-learning-based NOMA scheme in terms of guaranteeing the throughput of the CIIoT nodes and decreasing the interference to the PUs.

Reinforcement-Learning-Based Dynamic Spectrum Access for Software-Defined Cognitive Industrial Internet of Things

Journal

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Reinforcement-Learning-Based Dynamic Spectrum Access for Software-Defined Cognitive Industrial Internet of Things

Journal

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper