Paper ID | SPE-P3.4 |
Paper Title |
Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and autoregressive prosody prior |
Authors |
Guangzhi Sun, Cambridge University, United Kingdom; Yu Zhang, Ron Weiss, Yuan Cao, Heiga Zen, Andrew Rosenberg, Bhuvana Ramabhadran, Yonghui Wu, Google, United States |
Session | SPE-P3: Machine Learning for Speech Synthesis I |
Location | On-Demand |
Session Time: | Tuesday, 05 May, 16:30 - 18:30 |
Presentation Time: | Tuesday, 05 May, 16:30 - 18:30 |
Presentation |
Poster
|
Topic |
Speech Processing: [SPE-SYNT] Speech Synthesis and Generation |
IEEE Xplore Open Preview |
Click here to view in IEEE Xplore |
Virtual Presentation |
Click here to watch in the Virtual Conference |