| Paper ID | SPE-L5.2 |
| Paper Title | ZERO-SHOT MULTI-SPEAKER TEXT-TO-SPEECH WITH STATE-OF-THE-ART NEURAL SPEAKER EMBEDDINGS |
| Authors | Erica Cooper, National Institute of Informatics, Japan; Cheng-I Lai, Massachusetts Institute of Technology, United States; Yusuke Yasuda, Fuming Fang, Xin Wang, National Institute of Informatics, Japan; Nanxin Chen, Johns Hopkins University, United States; Junichi Yamagishi, National Institute of Informatics, Japan |
| Session | SPE-L5: Speech Synthesis and Voice Conversion I |
| Location | On-Demand |
| Session Time | Wednesday, 06 May, 09:00 - 11:00 |
| Presentation Time | Wednesday, 06 May, 09:20 - 09:40 |
| Presentation | Lecture |
| Topic | Speech Processing: [SPE-SYNT] Speech Synthesis and Generation |