| Paper ID | SPE-P3.4 |
| Paper Title |
Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and autoregressive prosody prior |
| Authors |
Guangzhi Sun, Cambridge University, United Kingdom; Yu Zhang, Ron Weiss, Yuan Cao, Heiga Zen, Andrew Rosenberg, Bhuvana Ramabhadran, Yonghui Wu, Google, United States |
| Session | SPE-P3: Machine Learning for Speech Synthesis I |
| Location | On-Demand |
| Session Time: | Tuesday, 05 May, 16:30 - 18:30 |
| Presentation Time: | Tuesday, 05 May, 16:30 - 18:30 |
| Presentation |
Poster
|
| Topic |
Speech Processing: [SPE-SYNT] Speech Synthesis and Generation |
| IEEE Xplore Open Preview |
Click here to view in IEEE Xplore |
| Virtual Presentation |
Click here to watch in the Virtual Conference |