31 December 2019 Multiple modules speech enhancement in mixed noise and low SNR environments
Proceedings Volume 11384, Eleventh International Conference on Signal Processing Systems; 1138406 (2019) https://doi.org/10.1117/12.2559657
Event: Eleventh International Conference on Signal Processing Systems, 2019, Chengdu, China
Abstract
Achieving stationary speech enhancement in low signal-to-noise ratio (SNR) environments is a challenging problem. Because noise energy dominates noisy speech at low SNR, the many pronounced random noises may cause a neural network to forget useful information learned early in training. Moreover, it is difficult for a single neural network to learn both effective speech features and effective noise features. This paper therefore uses multiple neural networks in two stages to learn the features of particular noise types separately and to reduce the interference introduced. Experimental results demonstrate that the proposed method yields consistently better source-to-distortion ratio (SDR) and perceptual evaluation of speech quality (PESQ) scores than the baseline models under low-SNR conditions. The results also indicate that the method can suppress a neural network's forgetting of early information.
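To make the quantities named in the abstract concrete, the following is a minimal sketch of how a noisy test signal at a chosen SNR might be constructed and how a simplified SDR might be computed. It is not the authors' evaluation code: the function names `mix_at_snr` and `sdr_db`, the synthetic sine-wave "speech", the 16 kHz sampling rate, and the -5 dB target are all assumptions made for illustration. Published SDR figures are often computed with a BSS Eval style decomposition rather than the plain energy ratio used here, and PESQ (ITU-T P.862) is not reproduced.

```python
import numpy as np


def mix_at_snr(speech, noise, snr_db):
    """Scale `noise` so that speech + noise has the requested SNR in dB.

    Assumes `noise` is at least as long as `speech` (hypothetical helper).
    """
    speech = np.asarray(speech, dtype=np.float64)
    noise = np.asarray(noise, dtype=np.float64)[: len(speech)]
    speech_power = np.mean(speech ** 2)
    noise_power = np.mean(noise ** 2)
    # SNR_dB = 10 * log10(P_speech / P_noise)  =>  solve for the noise power.
    target_noise_power = speech_power / (10.0 ** (snr_db / 10.0))
    scale = np.sqrt(target_noise_power / noise_power)
    return speech + scale * noise


def sdr_db(reference, estimate):
    """Simplified SDR: reference energy over the energy of the residual.

    This is a plain energy ratio, not the full BSS Eval definition.
    """
    reference = np.asarray(reference, dtype=np.float64)
    estimate = np.asarray(estimate, dtype=np.float64)
    distortion = reference - estimate
    return 10.0 * np.log10(np.sum(reference ** 2) / np.sum(distortion ** 2))


# Illustrative data only: a sine wave stands in for clean speech.
rng = np.random.default_rng(0)
t = np.arange(16000) / 16000.0
speech = np.sin(2 * np.pi * 220.0 * t)
noise = rng.standard_normal(16000)

noisy = mix_at_snr(speech, noise, snr_db=-5.0)
print("SDR of the unprocessed mixture (dB):", round(sdr_db(speech, noisy), 2))
```

With this simplified definition, the SDR of the unprocessed mixture equals the mixing SNR by construction, so any enhancement system would be judged by how far it raises the SDR above that starting point.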
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Tian Lan, Wenzheng Ye, Guoqiang Hui, Sen Li, and Qiao Liu "Multiple modules speech enhancement in mixed noise and low SNR environments", Proc. SPIE 11384, Eleventh International Conference on Signal Processing Systems, 1138406 (31 December 2019); https://doi.org/10.1117/12.2559657
KEYWORDS
Signal to noise ratio

Neural networks

Denoising

Detection and tracking algorithms

Performance modeling

Data modeling

Interference (communication)
