AASP-P18: Audio and Speech Source Separation and Signal Enhancement I
Poster
Thu, 7 May, 14:00 - 16:00
Location: Poster Area 24
Session Type: Poster
Track: Audio and Acoustic Signal Processing [AA]
Click the to view the manuscript on IEEE Xplore Open Preview

AASP-P18.1: MAGE: A COARSE-TO-FINE SPEECH ENHANCER WITH MASKED GENERATIVE MODEL

The Hieu Pham, Ho Chi Minh City University of Technology, Viet Nam; Tan Dat Nguyen, Korea Advanced Institute of Science and Technology, Korea, Republic of; Phuong Thanh Tran, Ho Chi Minh City University of Technology, Viet Nam; Joon Son Chung, Korea Advanced Institute of Science and Technology, Korea, Republic of; Duc Dung Nguyen, Ho Chi Minh City University of Technology, Viet Nam

AASP-P18.2: ADAPTIVE DETERMINISTIC FLOW MATCHING FOR TARGET SPEAKER EXTRACTION

Tsun-An Hsieh, Minje Kim, University of Illinois Urbana-Champaign, United States of America

AASP-P18.3: CODESEP: LOW-BITRATE CODEC-DRIVEN SPEECH SEPARATION WITH BASE-TOKEN DISENTANGLEMENT AND AUXILIARY-TOKEN SERIAL PREDICTION

Hui-Peng Du, Yang Ai, Xiao-Hang Jiang, Rui-Chen Zheng, Zhen-Hua Ling, University of Science and Technology of China, China

AASP-P18.4: AR-BSNET: TOWARDS ULTRA-LOW COMPLEXITY AUTOREGRESSIVE TARGET SPEAKER EXTRACTION WITH BAND-SPLIT MODELING

Fengyuan Hao, Andong Li, Xiaodong Li, Chengshi Zheng, Laboratory of Noise and Audio Research, Institute of Acoustics, Chinese Academy of Sciences, China

AASP-P18.5: From Diet to Free Lunch: Estimating Auxiliary Signal Properties using Dynamic Pruning Masks in Speech Enhancement Networks

Riccardo Miccini, Technical University of Denmark, Denmark; Clément Laroche, Tobias Piechowiak, GN Hearing, Denmark; Xenofon Fafoutis, Luca Pezzarossa, Technical University of Denmark, Denmark

AASP-P18.6: SLM-SS: Speech Language Model for Generative Speech Separation

Tianhua Li, Chenda Li, Wei Wang, Xin Zhou, Xihui Chen, Shanghai Jiao Tong University, China; Jianqing Gao, iFLYTEK Company Limited, China; Yanmin Qian, Shanghai Jiao Tong University, China

AASP-P18.7: BLEED NO MORE: GENERATIVE INTERFERENCE REDUCTION FOR MUSICAL RECORDINGS

Rajesh R, Rashen Fernando, University of Illinois Chicago, United States of America; Padmanabhan Rajan, Indian Institute of Technology Mandi, India; Ryan Corey, University of Illinois Chicago, United States of America

AASP-P18.8: SAMPLING-RATE-AGNOSTIC SPEECH SUPER-RESOLUTION BASED ON GAUSSIAN PROCESS DYNAMICAL SYSTEMS WITH DEEP KERNEL LEARNING

Aditya Arie Nugraha, Diego Di Carlo, Yoshiaki Bando, RIKEN, Japan; Mathieu Fontaine, Télécom Paris, Institut Polytechnique de Paris, France; Kazuyoshi Yoshii, Kyoto University, Japan

AASP-P18.9: GDIFFUSE: DIFFUSION-BASED SPEECH ENHANCEMENT WITH NOISE MODEL GUIDANCE

Efrayim Yanir, David Burshtein, Sharon Gannot, Tel-Aviv University, Israel

AASP-P18.10: TOWARDS DISTANCE-AWARE SYNTHETIC AUDIO MIXTURES FOR UNIVERSAL SOUND SEPARATION

Wonjun Park, Tuan Dang, Kenny Zhu, University of Texas at Arlington, United States of America