Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis
Published 2023 View Full Article
- Home
- Publications
- Publication Search
- Publication Details
Title
Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis
Authors
Keywords
-
Journal
ARTIFICIAL INTELLIGENCE REVIEW
Volume -, Issue -, Pages -
Publisher
Springer Science and Business Media LLC
Online
2023-10-25
DOI
10.1007/s10462-023-10612-2
References
Ask authors/readers for more resources
Related references
Note: Only part of the references are listed.- Sound Source Separation Mechanisms of Different Deep Networks Explained from the Perspective of Auditory Perception
- (2022) Han Li et al. Applied Sciences-Basel
- RemixIT: Continual Self-Training of Speech Enhancement Models via Bootstrapped Remixing
- (2022) Efthymios Tzinis et al. IEEE Journal of Selected Topics in Signal Processing
- Neural speech enhancement with unsupervised pre-training and mixture training
- (2022) Xiang Hao et al. NEURAL NETWORKS
- An Electroglottograph Auxiliary Neural Network for Target Speaker Extraction
- (2022) Lijiang Chen et al. Applied Sciences-Basel
- Knowledge Distillation: A Survey
- (2021) Jianping Gou et al. INTERNATIONAL JOURNAL OF COMPUTER VISION
- Knowledge Distillation and Student-Teacher Learning for Visual Intelligence: A Review and New Outlooks
- (2021) Lin Wang et al. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
- Phasebook and Friends: Leveraging Discrete Representations for Source Separation
- (2019) Jonathan Le Roux et al. IEEE Journal of Selected Topics in Signal Processing
- Speaker-independent auditory attention decoding without access to clean speech sources
- (2019) Cong Han et al. Science Advances
- Conv-TasNet: Surpassing Ideal Time–Frequency Magnitude Masking for Speech Separation
- (2019) Yi Luo et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- Increasing Compactness of Deep Learning Based Speech Enhancement Models With Parameter Pruning and Quantization Techniques
- (2019) Jyun-Yi Wu et al. IEEE SIGNAL PROCESSING LETTERS
- Divide and Conquer: A Deep CASA Approach to Talker-Independent Monaural Speaker Separation
- (2019) Yuzhou Liu et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- A Conditional Generative Model for Speech Enhancement
- (2018) Zeng-Xi Li et al. CIRCUITS SYSTEMS AND SIGNAL PROCESSING
- Speaker-Independent Speech Separation With Deep Attractor Network
- (2018) Yi Luo et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks
- (2018) Szu-Wei Fu et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- Supervised Speech Separation Based on Deep Learning: An Overview
- (2018) DeLiang Wang et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- A Deep Learning Loss Function Based on the Perceptual Evaluation of the Speech Quality
- (2018) Juan Manuel Martin-Donas et al. IEEE SIGNAL PROCESSING LETTERS
- Two-stage Deep Learning for Noisy-reverberant Speech Enhancement
- (2018) Yan Zhao et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- Phase-Aware Speech Enhancement Based on Deep Neural Networks
- (2018) Naijun Zheng et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- On the Relationship Between Short-Time Objective Intelligibility and Short-Time Spectral-Amplitude Mean-Square Error for Speech Enhancement
- (2018) Morten Kolbaek et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- Time-Frequency Masking in the Complex Domain for Speech Dereverberation and Denoising
- (2017) Donald S. Williamson et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- Multitalker Speech Separation With Utterance-Level Permutation Invariant Training of Deep Recurrent Neural Networks
- (2017) Morten Kolbaek et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- A Gender Mixture Detection Approach to Unsupervised Single-Channel Speech Separation Based on Deep Neural Networks
- (2017) Yannan Wang et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- A Consolidated Perspective on Multimicrophone Speech Enhancement and Source Separation
- (2017) Sharon Gannot et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- Complex Ratio Masking for Monaural Speech Separation
- (2016) Donald S. Williamson et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- A Deep Ensemble Learning Method for Monaural Speech Separation
- (2016) Xiao-Lei Zhang et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization
- (2015) Yuma Ueda et al. Journal of Signal Processing Systems for Signal Image and Video Technology
- Speaker Adaptive Training of Deep Neural Network Acoustic Models Using I-Vectors
- (2015) Yajie Miao et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- Deep Neural Networks for Single-Channel Multi-Talker Speech Recognition
- (2015) Chao Weng et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- Learning Spectral Mapping for Speech Dereverberation and Denoising
- (2015) Kun Han et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation
- (2015) Po-Sen Huang et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- A Regression Approach to Speech Enhancement Based on Deep Neural Networks
- (2015) Yong Xu et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- Wiener filtering based speech enhancement with Weighted Denoising Auto-encoder and noise classification
- (2014) Bingyin Xia et al. SPEECH COMMUNICATION
- Binaural Classification for Reverberant Speech Segregation Using Deep Neural Networks
- (2014) Yi Jiang et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- On Training Targets for Supervised Speech Separation
- (2014) Yuxuan Wang et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- An Experimental Study on Speech Enhancement Based on Deep Neural Networks
- (2013) Yong Xu et al. IEEE SIGNAL PROCESSING LETTERS
- Towards Scaling Up Classification-Based Speech Separation
- (2013) Yuxuan Wang et al. IEEE Transactions on Audio Speech and Language Processing
- Exploring Monaural Features for Classification-Based Speech Segregation
- (2012) Yuxuan Wang et al. IEEE Transactions on Audio Speech and Language Processing
- Unknown
- (2011) N. Haba et al. ACTA PHYSICA POLONICA B
- An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech
- (2011) Cees H. Taal et al. IEEE Transactions on Audio Speech and Language Processing
- The influence of spectral characteristics of early reflections on speech intelligibility
- (2011) Iris Arweiler et al. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA
- Iterative Phase Estimation for the Synthesis of Separated Sources From Single-Channel Mixtures
- (2010) D. Gunawan et al. IEEE SIGNAL PROCESSING LETTERS
- Reasons why Current Speech-Enhancement Algorithms do not Improve Speech Intelligibility and Suggested Solutions
- (2010) Philipos C. Loizou et al. IEEE Transactions on Audio Speech and Language Processing
- Domain Adaptation via Transfer Component Analysis
- (2010) Sinno Jialin Pan et al. IEEE TRANSACTIONS ON NEURAL NETWORKS
- The importance of phase in speech enhancement
- (2010) Kuldip Paliwal et al. SPEECH COMMUNICATION
- A Supervised Learning Approach to Monaural Segregation of Reverberant Speech
- (2009) Zhaozhang Jin et al. IEEE Transactions on Audio Speech and Language Processing
- Role of mask pattern in intelligibility of ideal binary-masked noisy speech
- (2009) Ulrik Kjems et al. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA
- Combining Spectral Representations for Large-Vocabulary Continuous Speech Recognition
- (2008) G. Garau et al. IEEE Transactions on Audio Speech and Language Processing
- Time-Frequency Masking for Speech Separation and Its Potential for Hearing Aid Design
- (2008) DeLiang Wang Trends in Amplification
Publish scientific posters with Peeref
Peeref publishes scientific posters from all research disciplines. Our Diamond Open Access policy means free access to content and no publication fees for authors.
Learn MoreAdd your recorded webinar
Do you already have a recorded webinar? Grow your audience and get more views by easily listing your recording on Peeref.
Upload Now