Technical Program

Paper Detail

Paper ID	E-3-1.4
Paper Title	INDEPENDENT VECTOR ANALYSIS FOR BLIND SPEECH SEPARATION USING COMPLEX GENERALIZED GAUSSIAN MIXTURE MODEL WITH WEIGHTED VARIANCE
Authors	Xinyu Tang, Chongqing University of Posts and Telecommunications, China; Rilin Chen, Tencent, China; Xiyuan Wang, Beijing Information Science and Technology University, China; Yi Zhou, Chongqing University of Posts and Telecommunications, China; Dan Su, Tencent, China
Session	E-3-1: Speech Separation 1
Time	Thursday, 10 December, 12:30 - 14:00
Presentation Time:	Thursday, 10 December, 13:15 - 13:30 Check your Time Zone
	All times are in New Zealand Time (UTC +13)
Topic	Speech, Language, and Audio (SLA):
Abstract	In this paper, we propose using complex generalized Gaussian mixture distribution with weighted variance for speech modelling and devise an improved independent vector analysis (IVA) algorithm for blind speech separation (BSS). Capable of capturing both non-Gaussianity and non-stationarity, the proposed complex generalized Gaussian mixture model (CGGMM) allows for a much flexible characterization of practical speech signals. The majorization minimization (MM) framework is adopted for the IVA algorithm design. Each iteration of the algorithm is comprised of the updates of demixing matrices and mixture model parameters. For demixing matrices, the update operates in a manner similar to that of the auxiliary function based IVA (AuxIVA) method, and for mixture parameters, the expectation maximization (EM) update is performed. As both updates are in closed form and pre-whitening is not a prerequisite, the IVA algorithm under CGGMM is of low complexity and can be carried out efficiently. Experimental results show that the proposed algorithm outperforms existing ones in terms of separation accuracy and also enjoys a fast convergence rate in both simulated and real environments.