SLP-L18: Text to Speech Generation -O2
Thu, 18 Apr, 08:20 - 10:20 (UTC +9)
Location: Room 103
Session Type: Lecture
Session Co-Chairs: Helen Meng, CUHK and Zhenhua Ling, USTC
Track: Speech and Language Processing
Click the to view the manuscript on IEEE Xplore Open Preview
Thu, 18 Apr, 08:20 - 08:40 (UTC +9)
SLP-L18.1: ULTRA-LIGHTWEIGHT NEURAL DIFFERENTIAL DSP VOCODER FOR HIGH QUALITY SPEECH SYNTHESIS
Thu, 18 Apr, 08:40 - 09:00 (UTC +9)
SLP-L18.2: FREGRAD: LIGHTWEIGHT AND FAST FREQUENCY-AWARE DIFFUSION VOCODER
Thu, 18 Apr, 09:00 - 09:20 (UTC +9)
SLP-L18.3: BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network
Thu, 18 Apr, 09:20 - 09:40 (UTC +9)
SLP-L18.4: NOISE-ROBUST ZERO-SHOT TEXT-TO-SPEECH SYNTHESIS CONDITIONED ON SELF-SUPERVISED SPEECH-REPRESENTATION MODEL WITH ADAPTERS
Thu, 18 Apr, 09:40 - 10:00 (UTC +9)
SLP-L18.5: SPEAK WHILE YOU THINK: STREAMING SPEECH SYNTHESIS DURING TEXT GENERATION
Thu, 18 Apr, 10:00 - 10:20 (UTC +9)