Technical Program

Click on the icon to view the manuscript on IEEE XPlore in the IEEE ICASSP 2020 Open Preview.

SPE-P21: Voice Conversion

Session Type: Poster
Time: Friday, 8 May, 15:15 - 17:15
Location: On-Demand
Virtual Session: View on Virtual Platform
Session Chairs: Xunying Liu, Chinese University of Hong Kong and Greg Sell, Johns Hopkins University
 
 SPE-P21.1: ONE-SHOT VOICE CONVERSION USING STAR-GAN
         Ruobai Wang; Netease Inc.
         Yu Ding; Netease Inc.
         Lincheng Li; Netease Inc.
         Changjie Fan; Netease Inc.
 
 SPE-P21.2: ONE-SHOT VOICE CONVERSION BY VECTOR QUANTIZATION
         Da-Yi Wu; National Taiwan University
         Hung-yi Lee; National Taiwan University
 
 SPE-P21.3: NEUTRAL TO LOMBARD SPEECH CONVERSION WITH DEEP LEARNING
         Enguerrand Gentet; Groupe PSA
         Bertrand David; LTCI, Télécom Paris, Institut Polytechnique de Paris
         Sébastien Denjean; Groupe PSA
         Gaël Richard; LTCI, Télécom Paris, Institut Polytechnique de Paris
         Vincent Roussarie; Groupe PSA
 
 SPE-P21.4: END-TO-END VOICE CONVERSION VIA CROSS-MODAL KNOWLEDGE DISTILLATION FOR DYSARTHRIC SPEECH RECONSTRUCTION
         Disong Wang; Chinese University of Hong Kong
         Jianwei Yu; Chinese University of Hong Kong
         Xixin Wu; Chinese University of Hong Kong
         Songxiang Liu; Chinese University of Hong Kong
         Lifa Sun; SpeechX Limited
         Xunying Liu; Chinese University of Hong Kong
         Helen Meng; Chinese University of Hong Kong
 
 SPE-P21.5: PITCHNET: UNSUPERVISED SINGING VOICE CONVERSION WITH PITCH ADVERSARIAL NETWORK
         Chengqi Deng; Zhejiang University
         Chengzhu Yu; Tencent
         Heng Lu; Tencent
         Chao Weng; Tencent
         Dong Yu; Tencent
 
 SPE-P21.6: AN IMPROVED FRAME-UNIT-SELECTION BASED VOICE CONVERSION SYSTEM WITHOUT PARALLEL TRAINING DATA
         Feng-Long Xie; Tencent
         Xin-Hui Li; Tencent
         Bo Liu; Tencent
         Yi-Bin Zheng; Tencent
         Li Meng; Tencent
         Li Lu; Tencent
         Frank K. Soong; Microsoft Research Asia
 
 SPE-P21.7: VOICE CONVERSION WITH TRANSFORMER NETWORK
         Ruolan Liu; Samsung Research China-Beijing
         Xiao Chen; Samsung Research China-Beijing
         Xue Wen; Samsung Research China-Beijing
 
 SPE-P21.8: MSPEC-NET : MULTI-DOMAIN SPEECH CONVERSION NETWORK
         Harshit Malaviya; Dhirubhai Ambani Institute of Information and Communication Technology
         Jui Shah; Dhirubhai Ambani Institute of Information and Communication Technology
         Maitreya Patel; Dhirubhai Ambani Institute of Information and Communication Technology
         Jalansh Munshi; Dhirubhai Ambani Institute of Information and Communication Technology
         Hemant Patil; Dhirubhai Ambani Institute of Information and Communication Technology
 
 SPE-P21.9: MULTI-SPEAKER AND MULTI-DOMAIN EMOTIONAL VOICE CONVERSION USING FACTORIZED HIERARCHICAL VARIATIONAL AUTOENCODER
         Mohamed Elgaar; Humelo Inc. and Korea Advanced Institute of Science and Technology
         Jung Bae Park; Humelo Inc. and Korea Advanced Institute of Science and Technology
         Sang Wan Lee; Humelo Inc. and Korea Advanced Institute of Science and Technology, KAIST Institute for Artificial Intelligence, KAIST Center for Neuroscience-inspired Artificial Intelligence
 
 SPE-P21.10: EMOTIONAL VOICE CONVERSION USING MULTITASK LEARNING WITH TEXT-TO-SPEECH
         Tae-Ho Kim; Korea Advanced Institute of Science and Technology (KAIST)
         Sungjae Cho; Korea Advanced Institute of Science and Technology (KAIST)
         Shinkook Choi; Korea Advanced Institute of Science and Technology (KAIST)
         Sejik Park; Korea Advanced Institute of Science and Technology (KAIST)
         Soo-Young Lee; Korea Advanced Institute of Science and Technology (KAIST)
 
 SPE-P21.11: EFFECTIVE WAVENET ADAPTATION FOR VOICE CONVERSION WITH LIMITED DATA
         Hongqiang Du; Northwestern Polytechnical University
         Xiaohai Tian; National University of Singapore
         Lei Xie; Northwestern Polytechnical University
         Haizhou Li; National University of Singapore
 
 SPE-P21.12: LIFTER TRAINING AND SUB-BAND MODELING FOR COMPUTATIONALLY EFFICIENT AND HIGH-QUALITY VOICE CONVERSION USING SPECTRAL DIFFERENTIALS
         Takaaki Saeki; University of Tokyo
         Yuki Saito; University of Tokyo
         Shinnosuke Takamichi; University of Tokyo
         Hiroshi Saruwatari; University of Tokyo