Technical Program

SLP-P22: Speech Synthesis III

Session Type: Poster
Time: Friday, May 17, 13:30 - 15:30
Location: Poster Area B, Ground Floor
Session Chair: Dong Yu, Tencent AI Lab
 
SLP-P22.1: DNN-BASED SPECTRAL ENHANCEMENT FOR NEURAL WAVEFORM GENERATORS WITH LOW-BIT QUANTIZATION
         Yang Ai; University of Science and Technology of China
         Jing-Xuan Zhang; University of Science and Technology of China
         Liang Chen; Anhui Science and Technology Research Institute
         Zhen-Hua Ling; University of Science and Technology of China
 
SLP-P22.2: DNN-BASED SPEAKER-ADAPTIVE POSTFILTERING WITH LIMITED ADAPTATION DATA FOR STATISTICAL SPEECH SYNTHESIS SYSTEMS
         Miraç Göksu Öztürk; Boğaziçi University
         Okan Ulusoy; Boğaziçi University
         Cenk Demiroglu; Özyeğin University
 
SLP-P22.3: SELF-ATTENTION BASED PROSODIC BOUNDARY PREDICTION FOR CHINESE SPEECH SYNTHESIS
         Chunhui Lu; Institute of Acoustics, Chinese Academy of Sciences
         Pengyuan Zhang; Institute of Acoustics, Chinese Academy of Sciences
         Yonghong Yan; Institute of Acoustics, Chinese Academy of Sciences
 
SLP-P22.4: AN END-TO-END NETWORK TO SYNTHESIZE INTONATION USING A GENERALIZED COMMAND RESPONSE MODEL
         François Marelli; Idiap Research Institute
         Bastian Schnell; Idiap Research Institute
         Hervé Bourlard; Idiap Research Institute
         Thierry Dutoit; Université de Mons
         Philip N. Garner; Idiap Research Institute
 
SLP-P22.5: SPEECH WAVEFORM RECONSTRUCTION USING CONVOLUTIONAL NEURAL NETWORKS WITH NOISE AND PERIODIC INPUTS
         Oliver Watts; Edinburgh University
         Cassia Valentini-Botinhao; Edinburgh University
         Simon King; Edinburgh University
 
SLP-P22.6: IMPLEMENTING PROSODIC PHRASING IN CHINESE END-TO-END SPEECH SYNTHESIS
         Yanfeng Lu; Institute for Infocomm Research
         Minghui Dong; Institute for Infocomm Research
         Ying Chen; Nanjing University of Science and Technology
 
SLP-P22.7: QUASI-FULLY CONVOLUTIONAL NEURAL NETWORK WITH VARIATIONAL INFERENCE FOR SPEECH SYNTHESIS
         Mu Wang; Tsinghua University
         Xixin Wu; The Chinese University of Hong Kong
         Zhiyong Wu; Tsinghua University
         Shiyin Kang; Tencent
         Deyi Tuo; Tencent
         Guangzhi Li; Tencent
         Dan Su; Tencent
         Dong Yu; Tencent
         Helen Meng; The Chinese University of Hong Kong
 
SLP-P22.8: STFT SPECTRAL LOSS FOR TRAINING A NEURAL SPEECH WAVEFORM MODEL
         Shinji Takaki; National Institute of Informatics
         Toru Nakashika; The University of Electro-Communications
         Xin Wang; National Institute of Informatics
         Junichi Yamagishi; National Institute of Informatics
 
SLP-P22.9: GENERATIVE MOMENT MATCHING NETWORK-BASED RANDOM MODULATION POST-FILTER FOR DNN-BASED SINGING VOICE SYNTHESIS AND NEURAL DOUBLE-TRACKING
         Hiroki Tamaru; The University of Tokyo
         Yuki Saito; The University of Tokyo
         Shinnosuke Takamichi; The University of Tokyo
         Tomoki Koriyama; Tokyo Institute of Technology
         Hiroshi Saruwatari; The University of Tokyo
 
SLP-P22.10: EFFECT OF DATA REDUCTION ON SEQUENCE-TO-SEQUENCE NEURAL TTS
         Javier Latorre; Amazon
         Jakub Lachowicz; Amazon
         Jaime Lorenzo-Trueba; Amazon
         Thomas Merritt; Amazon
         Thomas Drugman; Amazon
         Srikanth Ronanki; Amazon
         Viacheslav Klimkov; Amazon