Technical Program

Paper Detail

Paper IDE-3-2.6
Paper Title ADAPTIVE NOISE SUPPRESSION FOR WAKE-WORD DETECTION BY TEMPORAL-DIFFERENCE GENERALIZED EIGENVALUE BEAMFORMER
Authors Takehiko Kagoshima, Ning Ding, Hiroshi Fujimura, Toshiba Corporation, Japan
Session E-3-2: Speech Separation 2, Sound source separation
TimeThursday, 10 December, 15:30 - 17:15
Presentation Time:Thursday, 10 December, 16:45 - 17:00 Check your Time Zone
All times are in New Zealand Time (UTC +13)
Topic Speech, Language, and Audio (SLA):
Abstract This paper proposes an adaptive noise suppression method for wake-word detection by a temporal-difference generalized eigenvalue (TDGEV) beamformer. To emphasize wakeword utterances, which are leading phrases with short duration, the proposed method is based on a generalized eigenvalue beamformer regarding current and past spatial covariance matrixes for speech and noise, respectively. It can emphasize wake-words with small distortion and suppress any noises regardless of directions of arrival (DoAs) and noise sources. We perform experiments of wake-word detection with and without beamformers using test data including wake-word utterances from various DoAs. The results show that the proposed TDGEV method reduces false rejects with 32.9% relative error rate reduction.