SLP-P32: Speech enhancement and separation II
Thu, 18 Apr, 16:30 - 18:30 (UTC +9)
Location: Poster Zone 6A
Session Type: Poster
Session Co-Chairs: Emanuël Habets, International Audio Laboratories Erlangen and Robin Scheibler, LY Corporation
Track: Speech and Language Processing
Click the to view the manuscript on IEEE Xplore Open Preview
 

SLP-P32.1: HIERARCHICAL SPEAKER REPRESENTATION FOR TARGET SPEAKER EXTRACTION

Shulin He, Huaiwen Zhang, College of Computer Science, Inner Mongolia University, China; Wei Rao, Tencent, China; Kanghao Zhang, College of Computer Science, Inner Mongolia University, China; Yukai Ju, Tencent, China; Yang Yang, Xueliang Zhang, College of Computer Science, Inner Mongolia University, China
 

SLP-P32.2: TARGET SPEAKER EXTRACTION BY DIRECTLY EXPLOITING CONTEXTUAL INFORMATION IN THE TIME-FREQUENCY DOMAIN

Xue Yang, Changchun Bao, Jing Zhou, Xianhong Chen, Beijing University of Technology, China
 

SLP-P32.3: Curricular Contrastive Regularization for Speech Enhancement with Self-supervised Representations

Xinmeng Xu, Chang Han, Yiqun Zhang, Weiping Tu, Yuhong Yang, Wuhan University, China
 

SLP-P32.4: EMPLOYING REAL TRAINING DATA FOR DEEP NOISE SUPPRESSION

Ziyi Xu, Marvin Sach, Jan Pirklbauer, Tim Fingscheidt, Technische Universität Braunschweig, Germany
 

SLP-P32.5: LEVERAGING SELF-SUPERVISED SPEECH REPRESENTATIONS FOR DOMAIN ADAPTATION IN SPEECH ENHANCEMENT

Ching-Hua Lee, Chouchang Yang, Rakshith Sharma Srinivasa, Yashas Malur Saidutta, Jaejin Cho, Yilin Shen, Hongxia Jin, Samsung Research America, United States of America
 

SLP-P32.6: NEURAL NETWORK-BASED VIRTUAL MICROPHONE ESTIMATION WITH VIRTUAL MICROPHONE AND BEAMFORMER-LEVEL MULTI-TASK LOSS

Hanako Segawa, Tsukuba University, Japan; Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Rintaro Ikeshita, Shoko Araki, NTT Corporation, Japan; Takeshi Yamada, Shoji Makino, Tsukuba University, Japan
 

SLP-P32.8: SELM: Speech Enhancement Using Discrete Tokens and Language Models

Ziqian Wang, Xinfa Zhu, Zihan Zhang, YuanJun Lv, Northwestern Polytechnical University, China; Ning Jiang, Guoqing Zhao, Mashang Consumer Finance Co., Ltd., China; Lei Xie, Northwestern Polytechnical University, China
 

SLP-P32.9: SECP: A SPEECH ENHANCEMENT-BASED CURATION PIPELINE FOR SCALABLE ACQUISITION OF CLEAN SPEECH

Adam Sabra, Cyprian Wronka, Michelle Mao, Samer Hijazi, Cisco Systems, Inc, United States of America
 

SLP-P32.10: ATTENTION-DRIVEN MULTICHANNEL SPEECH ENHANCEMENT IN MOVING SOUND SOURCE SCENARIOS

Yuzhu Wang, Archontis Politis, Tuomas Virtanen, Tampere University, Finland