☆ 4.7 Article

Semantic Communication Systems for Speech Transmission

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS (2021)

期刊

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS

卷 39, 期 8, 页码 2434-2444

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/JSAC.2021.3087240

关键词

Training; Adaptation models; Communication systems; Simulation; Multimedia systems; Semantics; Telephone sets; Deep learning; semantic communication; speech transmission; squeeze-and-excitation networks

类别

Engineering, Electrical & Electronic Telecommunications

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The paper introduces a deep learning-enabled semantic communication system for speech signals named DeepSC-S, which utilizes an attention mechanism to improve recovery accuracy and robustness. The simulation results show that DeepSC-S outperforms traditional communications in terms of speech signal metrics and is more resilient to channel variations, especially in low signal-to-noise environments.

Semantic communications could improve the transmission efficiency significantly by exploring the semantic information. In this paper, we make an effort to recover the transmitted speech signals in the semantic communication systems, which minimizes the error at the semantic level rather than the bit or symbol level. Particularly, we design a deep learning (DL)-enabled semantic communication system for speech signals, named DeepSC-S. In order to improve the recovery accuracy of speech signals, especially for the essential information, DeepSC-S is developed based on an attention mechanism by utilizing a squeeze-and-excitation (SE) network. The motivation behind the attention mechanism is to identify the essential speech information by providing higher weights to them when training the neural network. Moreover, in order to facilitate the proposed DeepSC-S for dynamic channel environments, we find a general model to cope with various channel conditions without retraining. Furthermore, we investigate DeepSC-S in telephone systems as well as multimedia transmission systems to verify the model adaptation in practice. The simulation results demonstrate that our proposed DeepSC-S outperforms the traditional communications in both cases in terms of the speech signals metrics, such as signal-to-distortion ration and perceptual evaluation of speech distortion. Besides, DeepSC-S is more robust to channel variations, especially in the low signal-to-noise (SNR) regime.

Semantic Communication Systems for Speech Transmission

期刊

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Semantic Communication Systems for Speech Transmission

期刊

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文