AASP-P6.3
MULTI-TASK PSEUDO-LABEL LEARNING FOR NON-INTRUSIVE SPEECH QUALITY ASSESSMENT MODEL
Ryandhimas Zezario, Academia Sinica, Taiwan; Bo-Ren Brian Bai, Fortemedia, Taiwan, Taiwan; Chiou-Shann Fuh, National Taiwan University, Taiwan; Hsin-Min Wang, Yu Tsao, Academia Sinica, Taiwan
Session:
AASP-P6: Audio and speech quality and intelligibility measures; Music analysis 1 Poster
Track:
Audio and Acoustic Signal Processing
Location:
Poster Zone 6A
Poster Board PZ-6A.3
Poster Board PZ-6A.3
Presentation Time:
Wed, 17 Apr, 08:20 - 10:20 (UTC +9)
Session Co-Chairs:
Michael Mandel, The Graduate School and University Center of The City University of New York and Juhan Nam, Korea Advanced Institute of Science and Technology
Session AASP-P6
AASP-P6.1: NON-INTRUSIVE SPEECH INTELLIGIBILITY PREDICTION FOR HEARING-IMPAIRED USERS USING INTERMEDIATE ASR FEATURES AND HUMAN MEMORY MODELS
Rhiannon Mogridge, George Close, Robert Sutherland, Thomas Hain, Jon Barker, Stefan Goetze, Anton Ragni, University of Sheffield, United Kingdom of Great Britain and Northern Ireland
AASP-P6.2: CORN: CO-TRAINED FULL- AND NO-REFERENCE SPEECH QUALITY ASSESSMENT
Pranay Manocha, Princeton University, United States of America; Donald Williamson, The Ohio State University, United States of America; Adam Finkelstein, Princeton University, United States of America
AASP-P6.3: MULTI-TASK PSEUDO-LABEL LEARNING FOR NON-INTRUSIVE SPEECH QUALITY ASSESSMENT MODEL
Ryandhimas Zezario, Academia Sinica, Taiwan; Bo-Ren Brian Bai, Fortemedia, Taiwan, Taiwan; Chiou-Shann Fuh, National Taiwan University, Taiwan; Hsin-Min Wang, Yu Tsao, Academia Sinica, Taiwan
AASP-P6.4: NON-INTRUSIVE SPEECH QUALITY ASSESSMENT WITH MULTI-TASK LEARNING BASED ON TENSOR NETWORK
Hanyue Liu, Miao Liu, Jing Wang, Xiang Xie, Beijing Institute of Technology, China; Lidong Yang, Inner Mongolia University of Science and Technology, China
AASP-P6.5: NOMAD: UNSUPERVISED LEARNING OF PERCEPTUAL EMBEDDINGS FOR SPEECH ENHANCEMENT AND NON-MATCHING REFERENCE AUDIO QUALITY ASSESSMENT
Alessandro Ragano, University College Dublin, Ireland; Jan Skoglund, Google LLC, United States of America; Andrew Hines, University College Dublin, Ireland
AASP-P6.6: SPEECH FOUNDATION MODELS ON INTELLIGIBILITY PREDICTION FOR HEARING-IMPAIRED LISTENERS
Santiago Cuervo, Ricard Marxer, Université de Toulon, Aix Marseille Université, CNRS, LIS, France
AASP-P6.7: MUSICLDM: ENHANCING NOVELTY IN TEXT-TO-MUSIC GENERATION USING BEAT-SYNCHRONOUS MIXUP STRATEGIES
Ke Chen, University of California San Diego, United States of America; Yusong Wu, Université de Montréal, Canada; Haohe Liu, University of Surrey, United Kingdom of Great Britain and Northern Ireland; Marianna Nezhurina, LAION, Canada; Taylor Berg-Kirkpatrick, Shlomo Dubnov, University of California San Diego, United States of America
AASP-P6.8: A COMPARATIVE ANALYSIS OF POETRY READING AUDIO: SINGING, NARRATING, OR SOMEWHERE IN BETWEEN?
Kahyun Choi, Indiana University, United States of America; Minje Kim, University of Illinois at Urbana-Champaign, United States of America
AASP-P6.9: MIR-MLPOP: A MULTILINGUAL POP MUSIC DATASET WITH TIME-ALIGNED LYRICS AND AUDIO
Jun-You Wang, Chung-Che Wang, Chon-In Leong, Jyh-Shing Roger Jang, National Taiwan University, Taiwan
AASP-P6.10: GENERATING STEREOPHONIC MUSIC WITH SINGLE-STAGE LANGUAGE MODELS
Xingda Li, Fan Zhuo, Dan Luo, Jun Chen, Shiyin Kang, Skywork AI PTE. LTD, China; Zhiyong Wu, Shenzhen International Graduate School, Tsinghua University, China; Tao Jiang, Yang Li, Han Fang, Yahui Zhou, Skywork AI PTE. LTD, China
AASP-P6.11: STRING SOUND SYNTHESIZER ON GPU-ACCELERATED FINITE DIFFERENCE SCHEME
Jin Woo Lee, Min Jun Choi, Kyogu Lee, Seoul National University, Korea, Republic of
AASP-P6.12: MUSIC UNDERSTANDING LLAMA: ADVANCING TEXT-TO-MUSIC GENERATION WITH QUESTION ANSWERING AND CAPTIONING
Shansong Liu, Tencent, China; Atin Sakkeer Hussain, Chenshuo Sun, National University of Singapore, United Arab Emirates; Ying Shan, Tencent, United States of America
Contacts