WS-3.4

VOCODEC: AN EFFICIENT LIGHTWEIGHT LOW-BITRATE SPEECH CODEC

Leyan Yang, Ronghui Hu, Yang Xu, Jing Lu, Nanjing University, China

Session:
WS-3: Low-Resource Audio Codec (LRAC) Workshop Oral

Track:
Satellite Workshops

Location:
Room 113

Presentation Time:
Mon, 4 May, 09:00 - 18:00

Presentation
Discussion
Resources
No resources available.
Session WS-3
WS-3.1: IRIS: Low-Complexity High-Efficiency Neural Network Codec for Real-Time Audio Transmission
Ziqian Wu, Jiawei Jiang, Kunpeng Lin, He Wang, Qingbo Huang, Dejun Zhang, ByteDance, China; Andong Li, Chinese Academy of Sciences, China
WS-3.2: Progressive Refinement Training for Low-Resource Neural Speech Coding and Enhancement
Ronghui Hu, Leyan Yang, Yang Xu, Qinwen Hu, Jing Lu, NanJing University, China
WS-3.3: KD-Vocodec: A Low-Complexity Model for Joint Speech Coding and Enhancement Using Knowledge Distillation
Yang Xu, Ronghui Hu, Leyan Yang, Jing Lu, Nanjing University, China
WS-3.4: VOCODEC: AN EFFICIENT LIGHTWEIGHT LOW-BITRATE SPEECH CODEC
Leyan Yang, Ronghui Hu, Yang Xu, Jing Lu, Nanjing University, China
WS-3.5: Low-Resource Audio Codec (LRAC): 2025 Challenge Description
Kamil Wojcicki, Yusuf Isik, Laura Lechler, Mansur Yesilbursa, Ivana Balić, Wolfgang Mack, Rafał Łaganowski, Guoqing Zhang, Cisco Systems, Australia; Yossi Adi, Hebrew University of Jerusalem, Israel; Minje Kim, University of Illinois Urbana-Champaign, United States of America
WS-3.6: Exploring Disentangled Neural Speech Codecs from Self-Supervised Representations
Ryo Aihara, MERL / Mitsubishi Electric, Japan; Yoshiki Masuyama, François Germain, Gordon Wichern, Jonathan Le Roux, MERL, United States of America
WS-3.7: Nanocodec: Towards Low Bitrate and Low Complexity Real-Time Neural Audio Codec
Andong Li, Institute of Acoustics, Chinese Academy of Sciences; University of Chinese Academy of Sciences, China; Linping Xu, Zhe Han, ByteDance, China; Lingling Dai, Institute of Acoustics, Chinese Academy of Sciences; University of Chinese Academy of Sciences, China; Yiqing Guo, Hua Gao, ByteDance, China; Xiaodong Li, Chengshi Zheng, Institute of Acoustics, Chinese Academy of Sciences; University of Chinese Academy of Sciences, China
WS-3.8: PHOENIXCODEC: TAMING NEURAL SPEECH CODING FOR EXTREME LOW-RESOURCE SCENARIOS
Zixiang Wan, Peking University, China; Haoran Zhao, Guochang Zhang, Runqiang Han, Jianqiang Wei, Anker Innovations, China; Yuexian Zou, Peking University, China
WS-3.9: Attention-Guided Audio Compression for Multimodal LLMs
Prerana Rane, IEEE Senior Member, United States of America; Amitesh Vatsa, Indian Institute of Technology (BHU), India; Yash Pethe, Independent Researcher, India; Ogan Batu Aktolun, University of Texas at Austin, United States of America; Kevin Li, Ishan Singh, Independent Researcher, United States of America
Contacts