SLP-P32.3
Curricular Contrastive Regularization for Speech Enhancement with Self-supervised Representations
Xinmeng Xu, Chang Han, Yiqun Zhang, Weiping Tu, Yuhong Yang, Wuhan University, China
Session:
SLP-P32: Speech enhancement and separation II Poster
Track:
Speech and Language Processing
Location:
Poster Zone 6A
Poster Board PZ-6A.3
Presentation Time:
Thu, 18 Apr, 16:30 - 18:30 (UTC +9)
Session Co-Chairs:
Emanuël Habets, International Audio Laboratories Erlangen; Robin Scheibler, LY Corporation
Session SLP-P32
SLP-P32.1: HIERARCHICAL SPEAKER REPRESENTATION FOR TARGET SPEAKER EXTRACTION
Shulin He, Huaiwen Zhang, College of Computer Science, Inner Mongolia University, China; Wei Rao, Tencent, China; Kanghao Zhang, College of Computer Science, Inner Mongolia University, China; Yukai Ju, Tencent, China; Yang Yang, Xueliang Zhang, College of Computer Science, Inner Mongolia University, China
SLP-P32.2: TARGET SPEAKER EXTRACTION BY DIRECTLY EXPLOITING CONTEXTUAL INFORMATION IN THE TIME-FREQUENCY DOMAIN
Xue Yang, Changchun Bao, Jing Zhou, Xianhong Chen, Beijing University of Technology, China
SLP-P32.3: Curricular Contrastive Regularization for Speech Enhancement with Self-supervised Representations
Xinmeng Xu, Chang Han, Yiqun Zhang, Weiping Tu, Yuhong Yang, Wuhan University, China
SLP-P32.4: EMPLOYING REAL TRAINING DATA FOR DEEP NOISE SUPPRESSION
Ziyi Xu, Marvin Sach, Jan Pirklbauer, Tim Fingscheidt, Technische Universität Braunschweig, Germany
SLP-P32.5: LEVERAGING SELF-SUPERVISED SPEECH REPRESENTATIONS FOR DOMAIN ADAPTATION IN SPEECH ENHANCEMENT
Ching-Hua Lee, Chouchang Yang, Rakshith Sharma Srinivasa, Yashas Malur Saidutta, Jaejin Cho, Yilin Shen, Hongxia Jin, Samsung Research America, United States of America
SLP-P32.6: NEURAL NETWORK-BASED VIRTUAL MICROPHONE ESTIMATION WITH VIRTUAL MICROPHONE AND BEAMFORMER-LEVEL MULTI-TASK LOSS
Hanako Segawa, Tsukuba University, Japan; Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Rintaro Ikeshita, Shoko Araki, NTT Corporation, Japan; Takeshi Yamada, Shoji Makino, Tsukuba University, Japan
SLP-P32.7: OPINE: Leveraging A Optimization-Inspired Deep Unfolding Method for Multi-channel Speech Enhancement
Andong Li, Rilin Chen, Yu Gu, Chao Weng, Dan Su, Tencent, China
SLP-P32.8: SELM: Speech Enhancement Using Discrete Tokens and Language Models
Ziqian Wang, Xinfa Zhu, Zihan Zhang, YuanJun Lv, Northwestern Polytechnical University, China; Ning Jiang, Guoqing Zhao, Mashang Consumer Finance Co., Ltd., China; Lei Xie, Northwestern Polytechnical University, China
SLP-P32.9: SECP: A SPEECH ENHANCEMENT-BASED CURATION PIPELINE FOR SCALABLE ACQUISITION OF CLEAN SPEECH
Adam Sabra, Cyprian Wronka, Michelle Mao, Samer Hijazi, Cisco Systems, Inc, United States of America
SLP-P32.10: ATTENTION-DRIVEN MULTICHANNEL SPEECH ENHANCEMENT IN MOVING SOUND SOURCE SCENARIOS
Yuzhu Wang, Archontis Politis, Tuomas Virtanen, Tampere University, Finland