Mask-Based Neural Beamforming for Moving Speakers With Self-Attention-Based Tracking
Published 2023 View Full Article
- Home
- Publications
- Publication Search
- Publication Details
Title
Mask-Based Neural Beamforming for Moving Speakers With Self-Attention-Based Tracking
Authors
Keywords
-
Journal
IEEE-ACM Transactions on Audio Speech and Language Processing
Volume 31, Issue -, Pages 835-848
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Online
2023-01-17
DOI
10.1109/taslp.2023.3237172
References
Ask authors/readers for more resources
Related references
Note: Only part of the references are listed.- Auxiliary function-based algorithm for blind extraction of a moving speaker
- (2022) Jakub Janský et al. EURASIP Journal on Audio Speech and Music Processing
- Deep neural network-based generalized sidelobe canceller for dual-channel far-field speech recognition
- (2021) Guanjun Li et al. NEURAL NETWORKS
- gpuRIR: A python library for room impulse response simulation with GPU acceleration
- (2020) David Diaz-Guerra et al. MULTIMEDIA TOOLS AND APPLICATIONS
- Far-Field Automatic Speech Recognition
- (2020) Reinhold Haeb-Umbach et al. PROCEEDINGS OF THE IEEE
- Multi-Speaker DOA Estimation Using Deep Convolutional Networks Trained With Noise Signals
- (2019) Soumitro Chakrabarty et al. IEEE Journal of Selected Topics in Signal Processing
- A Unified Convolutional Beamformer for Simultaneous Denoising and Dereverberation
- (2019) Tomohiro Nakatani et al. IEEE SIGNAL PROCESSING LETTERS
- Conv-TasNet: Surpassing Ideal Time–Frequency Magnitude Masking for Speech Separation
- (2019) Yi Luo et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- Block-online multi-channel speech enhancement using deep neural network-supported relative transfer function estimates
- (2019) Jiri Malek et al. IET Signal Processing
- Adaptive Generalized Sidelobe Canceler Beamforming With Time-Varying Direction-of-Arrival Estimation for Arrayed Sensors
- (2019) Dah-Chung Chang et al. IEEE SENSORS JOURNAL
- Online MVDR Beamformer Based on Complex Gaussian Mixture Model With Spatial Prior for Noise Robust ASR
- (2017) Takuya Higuchi et al. IEEE-ACM Transactions on Audio Speech and Language Processing
- A Multichannel MMSE-Based Framework for Speech Source Separation and Noise Reduction
- (2013) Mehrez Souden et al. IEEE Transactions on Audio Speech and Language Processing
- Microphone Array Processing for Distant Speech Recognition: From Close-Talking Microphones to Far-Field Sensors
- (2012) Kenichi Kumatani et al. IEEE SIGNAL PROCESSING MAGAZINE
- Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups
- (2012) Geoffrey Hinton et al. IEEE SIGNAL PROCESSING MAGAZINE
- An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech
- (2011) Cees H. Taal et al. IEEE Transactions on Audio Speech and Language Processing
- On Optimal Frequency-Domain Multichannel Linear Filtering for Noise Reduction
- (2009) M. Souden et al. IEEE Transactions on Audio Speech and Language Processing
- DOA Estimation for Multiple Sparse Sources with Arbitrarily Arranged Multiple Sensors
- (2009) Shoko Araki et al. Journal of Signal Processing Systems for Signal Image and Video Technology
- Time-Frequency Masking for Speech Separation and Its Potential for Hearing Aid Design
- (2008) DeLiang Wang Trends in Amplification
Find Funding. Review Successful Grants.
Explore over 25,000 new funding opportunities and over 6,000,000 successful grants.
ExploreBecome a Peeref-certified reviewer
The Peeref Institute provides free reviewer training that teaches the core competencies of the academic peer review process.
Get Started