SLP-L1: Speech enhancement and separation - Diffusion and other probabilistic models
Tue, 16 Apr, 13:10 - 15:10 (UTC +9)
Location: Room 104
Session Type: Lecture
Session Co-Chairs: Timo Gerkmann, Universität Hamburg and Tomohiro Nakatani, NTT Corporation
Track: Speech and Language Processing
Click the to view the manuscript on IEEE Xplore Open Preview
Tue, 16 Apr, 13:10 - 13:30 (UTC +9)
SLP-L1.1: DIFFUSION-BASED SPEECH ENHANCEMENT IN MATCHED AND MISMATCHED CONDITIONS USING A HEUN-BASED SAMPLER
Tue, 16 Apr, 13:30 - 13:50 (UTC +9)
SLP-L1.2: Unsupervised Speech Enhancement with Diffusion-based Generative Models
Tue, 16 Apr, 13:50 - 14:10 (UTC +9)
SLP-L1.3: BOOSTING SPEECH ENHANCEMENT WITH CLEAN SELF-SUPERVISED FEATURES VIA CONDITIONAL VARIATIONAL AUTOENCODERS
Tue, 16 Apr, 14:10 - 14:30 (UTC +9)
SLP-L1.4: Diffusion-based Speech Enhancement with a Weighted Generative-Supervised Learning Loss
Tue, 16 Apr, 14:30 - 14:50 (UTC +9)
SLP-L1.5: AV2WAV: DIFFUSION-BASED RE-SYNTHESIS FROM CONTINUOUS SELF-SUPERVISED FEATURES FOR AUDIO-VISUAL SPEECH ENHANCEMENT
Tue, 16 Apr, 14:50 - 15:10 (UTC +9)