GC-L10.2

GENERAL SPEECH RESTORATION USING TWO-STAGE GENERATIVE ADVERSARIAL NETWORKS

Qinwen Hu, Tianyi Tan, Nanjing University, China; Ming Tang, State Grid Jiangsu Electric Power Company Limited, China; Yuxiang Hu, Changbao Zhu, Horizon Robotics, China; Jing Lu, Nanjing University, China

Session:
GC-L10: ICASSP 2024 SPEECH SIGNAL IMPROVEMENT CHALLENGE Lecture

Track:
Grand Challenges

Location:
Room 209B

Presentation Time:
Fri, 19 Apr, 13:30 - 13:50 (UTC +9)

Session Co-Chairs:
Nicolae Ristea, Microsoft and Ross Cutler, Microsoft
Presentation
Discussion
Resources
No resources available.
Session GC-L10
GC-L10.1: ICASSP 2024 SPEECH SIGNAL IMPROVEMENT CHALLENGE
Nicolae Catalin Ristea, Ando Saabas, Ross Cutler, Babak Naderi, Sebastian Braun, Solomiya Branets, Microsoft, Romania
GC-L10.2: GENERAL SPEECH RESTORATION USING TWO-STAGE GENERATIVE ADVERSARIAL NETWORKS
Qinwen Hu, Tianyi Tan, Nanjing University, China; Ming Tang, State Grid Jiangsu Electric Power Company Limited, China; Yuxiang Hu, Changbao Zhu, Horizon Robotics, China; Jing Lu, Nanjing University, China
GC-L10.3: RENET: A TIME-FREQUENCY DOMAIN GENERAL SPEECH RESTORATION NETWORK FOR ICASSP 2024 SPEECH SIGNAL IMPROVEMENT CHALLENGE
Fengyuan Hao, Huiyong Zhang, Lingling Dai, Xiaoxue Luo, Xiaodong Li, Chengshi Zheng, Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences; University of Chinese Academy of Sciences, China
GC-L10.4: RAD-NET: A REPAIRING AND DENOISING NETWORK FOR SPEECH SIGNAL IMPROVEMENT
Mingshuai Liu, NWPU, China; Zhuangqi Chen, ByteDance, China; Xiaopeng Yan, Yuanju Lv, NWPU, China; Xianjun Xia, ByteDance, China; Chuanzeng Huang, Speech, Audio and Music Intelligence (SAMI) group, ByteDance, China; Yijian Xiao, ByteDance, China; Lei Xie, NWPU, China
GC-L10.5: KS-NET: MULTI-BAND JOINT SPEECH RESTORATION AND ENHANCEMENT NETWORK FOR 2024 ICASSP SSI CHALLENGE
Guochen Yu, Runqiang Han, Chenglin Xu, Haoran Zhao, Nan Li, Chen Zhang, Xiguang Zheng, Chao Zhou, Qi Huang, Bing Yu, Kuaishou Technology, Beijing, China, China
GC-L10.6: REBUILD, REGENERATE: A GATED TEMPORAL CONVOLUTION BASED GAN FOR SPEECH SIGNAL IMPROVEMENT
Nikhil Das, Rakesh Pogula, Mohammad Imtiaz Ali, Sasank Kottapalli, Sanniboyina Venkata Kiran, Independent Researcher, India
Contacts