Technical Program

Paper Detail

Paper ID	F-3-3.2
Paper Title	Robust Speech Dereverberation Based on WPE and Deep Learning
Authors	Hao Li, Inner Mongolia University, China; Xueliang Zhang, Guanglai Gao, Professor, China
Session	F-3-3: Signal Processing Systems for AI
Time	Thursday, 10 December, 17:30 - 19:30
Presentation Time:	Thursday, 10 December, 17:45 - 18:00 Check your Time Zone
	All times are in New Zealand Time (UTC +13)
Topic	Signal Processing Systems: Design and Implementation (SPS):
Abstract	Reverberation has a considerable impacts on speech quality and intelligibility. Weighted prediction error (WPE) employs a linear regression model to estimate late reverberation and then cancel it. The key point of the WPE is to estimate the power spectrum of the early speech. However, its estimation relies on an iterative procedure with high computational complexity. Another problem is that the WPE has a noise-free assumption. So, the performance degrades in noisy conditions. To address these problems, we propose an algorithm for speech dereverberation in the presence of background noise, in which deep learning is integrated into the WPE method. Specifically, we employ a neural network to predict the power spectral density (PSD) of early speech and a binary mask which distinguishes target speech from background noise. To alleviate the noise impact on estimation of echo path, a dual-filter strategy is adopted to model the echo paths of target speech and background noise individually. Experimental results show that the proposed method significantly improves speech quality in noisy environments.