SLP-P20: Speech Synthesis II |
Session Type: Poster |
Time: Friday, May 17, 08:30 - 10:30 |
Location: Poster Area B, Ground Floor |
Session Chair: Zhenhua Ling, University of Science and Technology of China |
SLP-P20.1: INVESTIGATION OF ENHANCED TACOTRON TEXT-TO-SPEECH SYNTHESIS SYSTEMS WITH SELF-ATTENTION FOR PITCH ACCENT LANGUAGE |
Yusuke Yasuda; National Institute of Informatics |
Xin Wang; National Institute of Informatics |
Shinji Takaki; National Institute of Informatics |
Junichi Yamagishi; National Institute of Informatics |
SLP-P20.2: ENHANCING HYBRID SELF-ATTENTION STRUCTURE WITH RELATIVE-POSITION-AWARE BIAS FOR SPEECH SYNTHESIS |
Shan Yang; Northwestern Polytechnical University |
Heng Lu; Tencent AI Lab |
Shiyin Kang; Tencent AI Lab |
Lei Xie; Northwestern Polytechnical University |
Dong Yu; Tencent AI Lab |
SLP-P20.3: WAVEFORM GENERATION FOR TEXT-TO-SPEECH SYNTHESIS USING PITCH-SYNCHRONOUS MULTI-SCALE GENERATIVE ADVERSARIAL NETWORKS |
Lauri Juvela; Aalto University |
Bajibabu Bollepalli; Aalto University |
Junichi Yamagishi; National Institute of Informatics |
Paavo Alku; Aalto University |
SLP-P20.4: INVESTIGATING CONTEXT FEATURES HIDDEN IN END-TO-END TTS |
Kohki Mametani; Doshisha University |
Tsuneo Kato; Doshisha University |
Seiichi Yamamoto; Doshisha University |
SLP-P20.5: CASTING TO CORPUS: SEGMENTING AND SELECTING SPONTANEOUS DIALOGUE FOR TTS WITH A CNN-LSTM SPEAKER-DEPENDENT BREATH DETECTOR |
Éva Székely; KTH Royal Institute of Technology |
Gustav Eje Henter; KTH Royal Institute of Technology |
Joakim Gustafson; KTH Royal Institute of Technology |
SLP-P20.6: PHONEME DEPENDENT SPEAKER EMBEDDING AND MODEL FACTORIZATION FOR MULTI-SPEAKER SPEECH SYNTHESIS AND ADAPTATION |
Ruibo Fu; CASIA |
Jianhua Tao; CASIA |
Zhengqi Wen; CASIA |
Yibin Zheng; CASIA |
SLP-P20.7: END-TO-END CODE-SWITCHED TTS WITH MIX OF MONOLINGUAL RECORDINGS |
Yuewen Cao; The Chinese University of Hong Kong |
Xixin Wu; The Chinese University of Hong Kong |
Songxiang Liu; The Chinese University of Hong Kong |
Jianwei Yu; The Chinese University of Hong Kong |
Xu Li; The Chinese University of Hong Kong |
Zhiyong Wu; Tsinghua-CUHK Joint Research Center for Media Sciences, Technologies and Systems |
Xunying Liu; The Chinese University of Hong Kong |
Helen Meng; The Chinese University of Hong Kong |
SLP-P20.8: SEMI-SUPERVISED TRAINING FOR IMPROVING DATA EFFICIENCY IN END-TO-END SPEECH SYNTHESIS |
Yu-An Chung; Massachusetts Institute of Technology |
Yuxuan Wang; Google, Inc. |
Wei-Ning Hsu; Massachusetts Institute of Technology |
Yu Zhang; Google, Inc. |
RJ Skerry-Ryan; Google, Inc. |
SLP-P20.9: LEARNING LATENT REPRESENTATIONS FOR STYLE CONTROL AND TRANSFER IN END-TO-END SPEECH SYNTHESIS |
Ya-Jie Zhang; University of Science and Technology of China |
Shifeng Pan; Microsoft |
Lei He; Microsoft |
Zhen-Hua Ling; University of Science and Technology of China |
SLP-P20.10: MULTI-SPEAKER EMOTIONAL ACOUSTIC MODELING FOR CNN-BASED SPEECH SYNTHESIS |
Heejin Choi; Korea Advanced Institute of Science and Technology |
Sangjun Park; Korea Advanced Institute of Science and Technology |
Jinuk Park; Korea Advanced Institute of Science and Technology |
Minsoo Hahn; Korea Advanced Institute of Science and Technology |
SLP-P20.11: SINGING VOICE SYNTHESIS BASED ON GENERATIVE ADVERSARIAL NETWORKS |
Yukiya Hono; Nagoya Institute of Technology |
Kei Hashimoto; Nagoya Institute of Technology |
Keiichiro Oura; Nagoya Institute of Technology |
Yoshihiko Nankaku; Nagoya Institute of Technology |
Keiichi Tokuda; Nagoya Institute of Technology |
SLP-P20.12: ENHANCED VIRTUAL SINGERS GENERATION BY INCORPORATING SINGING DYNAMICS TO PERSONALIZED TEXT-TO-SPEECH-TO-SINGING |
Kantapon Kaewtip; Oben Inc |
Fernando Villavicencio; Oben Inc |
Fang-yu Kuo; Oben Inc |
Mark Harvilla; Oben Inc |
Iris Ouyang; Oben Inc |
Pierre Lanchantin; Oben Inc |
SLP-P20.13: INVESTIGATIONS OF REAL-TIME GAUSSIAN FFTNET AND PARALLEL WAVENET NEURAL VOCODERS WITH SIMPLE ACOUSTIC FEATURES |
Takuma Okamoto; National Institute of Information and Communications Technology |
Tomoki Toda; Nagoya University |
Yoshinori Shiga; National Institute of Information and Communications Technology |
Hisashi Kawai; National Institute of Information and Communications Technology |