AASP-P16: Music separation; Audio for multimedia and audio processing systems
Thu, 18 Apr, 16:30 - 18:30 (UTC +9)
Location: Poster Zone 4C
Session Type: Poster
Session Chair: Jordi Pons, Stability AI
Track: Audio and Acoustic Signal Processing
Click the to view the manuscript on IEEE Xplore Open Preview

AASP-P16.1: PoP-IDLMA: Product-of-Prior Independent Deeply Learned Matrix Analysis for Multichannel Music Source Separation

Takuya Hasumi, Tomohiko Nakamura, Norihiro Takamune, Hiroshi Saruwatari, The University of Tokyo, Japan; Daichi Kitamura, National Institute of Technology, Kagawa College, Japan; Yu Takahashi, Kazunobu Kondo, Yamaha Corporation, Japan

AASP-P16.2: VIRTUAL BASS ENHANCEMENT VIA MUSIC DEMIXING

Riccardo Giampiccolo, Alessandro Ilic Mezza, Alberto Bernardini, Augusto Sarti, Politecnico di Milano, Italy
 

AASP-P16.3: MUSIC SOURCE SEPARATION BASED ON A LIGHTWEIGHT DEEP LEARNING FRAMEWORK (DTTNET: DUAL-PATH TFC-TDF UNET)

Junyu Chen, Imperial College London, United Kingdom of Great Britain and Northern Ireland; Susmitha Vekkot, Amrita Vishwa Vidyapeetham, India; Pancham Shukla, Imperial College London, United Kingdom of Great Britain and Northern Ireland
 

AASP-P16.4: ON THE EFFECT OF DATA-AUGMENTATION ON LOCAL EMBEDDING PROPERTIES IN THE CONTRASTIVE LEARNING OF MUSIC AUDIO REPRESENTATIONS

Matthew McCallum, Matthew Davies, Florian Henkel, Jaehun Kim, Samuel Sandberg, Sirius XM, United States of America
 

AASP-P16.5: SCNet: Sparse Compression Network for Music Source Separation

Weinan Tong, Jiaxu Zhu, Jun Chen, Tsinghua University, China; Shiyin Kang, Tao Jiang, Yang Li, Skywork AI PTE. LTD., China; Zhiyong Wu, Tsinghua University, China; Helen Meng, The Chinese University of Hong Kong, China
 

AASP-P16.6: MDX-GAN: ENHANCING PERCEPTUAL QUALITY IN MULTI-CLASS SOURCE SEPARATION VIA ADVERSARIAL TRAINING

Ke Chen, University of California San Diego, United States of America; Jiaqi Su, Zeyu Jin, Adobe Research, United States of America
 

AASP-P16.7: STEREOPHONIC MUSIC SOURCE SEPARATION WITH SPATIALLY-INFORMED BRIDGING BAND-SPLIT NETWORK

Yichen Yang, Haowen Li, Xianrui Wang, Wen Zhang, Northwestern Polytechnical University, China; Shoji Makino, Waseda University, Japan; Jingdong Chen, Northwestern Polytechnical University, China
 

AASP-P16.8: VOICE TOXICITY DETECTION USING MULTI-TASK LEARNING

Mahesh Kumar Nandwana, Roblox, United States of America; Yifan He, Carnegie Mellon University, United States of America; Joseph Liu, Xiao Yu, Charles Shang, Eloi Du Bois, Morgan McGuire, Kiran Bhat, Roblox, United States of America
 

AASP-P16.9: AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data

Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Tencent, China; Shuai Wang, Shenzhen Research Institute of Big Data, China
 

AASP-P16.10: AAT: ADAPTING AUDIO TRANSFORMER FOR VARIOUS ACOUSTICS RECOGNITION TASKS

Yun Liang, Hai Lin, Shaojian Qiu, Yihang Zhang, South China Agricultural University, China

AASP-P16.11: Hybrid Packet Loss Concealment for Real-Time Networked Music Applications

Alessandro Mezza, Matteo Amerena, Alberto Bernardini, Augusto Sarti, Politecnico di Milano - Dipartimento di Elettronica, Informazione e Bioingegneria Piazza L. Da Vinci 32 , Milano 20133 Italy