AASP-P8.7
SPAM: Style Prompt Adherence Metric for Prompt-based TTS
Chanhee Cho, Nayeon Kim, Bugeun Kim, Chung-Ang University, Korea, Republic of
Session:
AASP-P8: Audio and Speech Quality and Intelligibility Measures II Poster
Track:
Audio and Acoustic Signal Processing [AA]
Location:
Poster Area 26
Presentation Time:
Wed, 6 May, 09:00 - 11:00
Presentation
Discussion
Resources
No resources available.
Session AASP-P8
AASP-P8.1: CONFIDENCE-BASED FILTERING FOR SPEECH DATASET CURATION WITH GENERATIVE SPEECH ENHANCEMENT USING DISCRETE TOKENS
Kazuki Yamauchi, CyberAgent / The University of Tokyo, Japan; Masato Murata, Shogo Seki, CyberAgent, Japan
AASP-P8.2: REFERENCE-AWARE SFM LAYERS FOR INTRUSIVE INTELLIGIBILITY PREDICTION
Hanlin Yu, The University of British Columbia, Canada; Haoshuai Zhou, Boxuan Cao, Changgeng Mo, Linkai Li, Orka Labs Inc., China, China; Shan Xiang Wang, Stanford University,, United States of America
AASP-P8.3: LEVERAGING MULTIPLE SPEECH ENHANCERS FOR NON-INTRUSIVE INTELLIGIBILITY PREDICTION FOR HEARING-IMPAIRED LISTENERS
Boxuan Cao, Linkai Li, Orka Labs Inc., China; Hanlin Yu, The University of British Columbia, Canada; Changgeng Mo, Haoshuai Zhou, Orka Labs Inc., China; Shan Xiang Wang, Stanford University, United States of America
AASP-P8.4: SPEECH QUALITY-BASED LOCALIZATION OF LOW-QUALITY SPEECH AND TEXT-TO-SPEECH SYNTHESIS ARTEFACTS
Michael Kuhlmann, Alexander Werning, Thilo von Neumann, Reinhold Haeb-Umbach, Paderborn University, Germany
AASP-P8.5: SP-MCQA: EVALUATING INTELLIGIBILITY OF TTS BEYOND THE WORD LEVEL
Hitomi Jin Ling Tee, Chaoren Wang, Zijie Zhang, Zhizheng Wu, The Chinese University of Hong Kong, Shenzhen, Malaysia
AASP-P8.6: ENHANCING SPEECH INTELLIGIBILITY PREDICTION FOR HEARING AIDS WITH COMPLEMENTARY SPEECH FOUNDATION MODEL REPRESENTATIONS
Guojian Lin, Xuefei Wang, Southern University of Science and Technology, China; Ryandhimas Zezario, Academia Sinica, Taiwan; Fei Chen, Southern University of Science and Technology, China
AASP-P8.7: SPAM: Style Prompt Adherence Metric for Prompt-based TTS
Chanhee Cho, Nayeon Kim, Bugeun Kim, Chung-Ang University, Korea, Republic of
AASP-P8.8: WAV2LEV: PREDICTING LEVENSHTEIN EDIT OPERATION SEQUENCES FOR FINE-GRAINED ESTIMATION OF AUTOMATIC SPEECH RECOGNITION ERROR
Harvey Donnelly, Ken Shi, Gerald Penn, University of Toronto, Canada
AASP-P8.9: SingMOS-Pro: An Comprehensive Benchmark for Singing Quality Assessment
Yuxun Tang, Renmin University of China, China; Lan Liu, Sun Yat-sen University, China; Wenhao Feng, Renmin University of China, China; Yiwen Zhao, Jionghao Han, Carnegie Mellon University, China; Yifeng Yu, Georgia Institute of Technology, China; Jiatong Shi, Carnegie Mellon University, China; Qin Jin, Renmin University of China, China
AASP-P8.10: Better Naturalness Evaluation of TTS Systems
Sajad Shirali-Shahreza, Gerald B. Penn,
Contacts