Technical Program

Paper Detail

Paper IDF-1-3.3
Paper Title SPEECH ENHANCEMENT FOR DEMODULATED SIGNALS UNDER MULTIPATH FADING COMMUNICATION CHANNELS
Authors Akio Kobayashi, Tsukuba University of Technology, Japan
Session F-1-3: Speech Enhancement 1
TimeTuesday, 08 December, 17:15 - 19:15
Presentation Time:Tuesday, 08 December, 17:45 - 18:00 Check your Time Zone
All times are in New Zealand Time (UTC +13)
Topic Speech, Language, and Audio (SLA):
Abstract In analog communication channels, such as radio broadcasting, the superposition of multiple reflected signals often causes multipath fading. Multipath fading often results in the fluctuation of the received electric intensity levels of these signals; thus, it causes severe quality degradation in audible sounds. In this paper, we focus on speech enhancement under a fading communication channel with additive Gaussian noise. We attempt to reconstruct the original speech based on the use of denoising autoencoders that employ mean-squared-error and additive perceptual evaluation of speech quality (PESQ)-based loss functions in multi-task learning (MTL). The experimental results indicate that the MTL-based autoencoder improves PESQ scores from 2.00 to 2.75 for signals under fading communication channels with additive Gaussian noise.