SLP-L6: Speech LLM: Alignment & Decoding
Wed, 6 May, 09:00 - 11:00
Location: Room 115
Session Type: Oral
Track: Speech and Language Processing [SL]
Wed, 6 May, 09:00 - 09:20

SLP-L6.1: AQUA-Bench: Beyond Finding Answers to Knowing When There Are None in Audio Question Answering

Chun-Yi Kuan, Hung-yi Lee, National Taiwan University, Taiwan
Wed, 6 May, 09:20 - 09:40

SLP-L6.2: KEEPING MODELS LISTENING: SEGMENT- AND TIME-AWARE ATTENTION RESCALING AT DECODING TIME

Hangyu Du, National University of Singapore, China; Jingxing Zhong, Fuzhou University, China
Wed, 6 May, 09:40 - 10:00

SLP-L6.3: Still Thinking or Stopped Talking? Dialogue Silence Intention Classification Using Multimodal Large Language Model

Muyun Wu, Zi Haur Pang, Koji Inoue, Tatsuya Kawahara, Kyoto University, Japan
Wed, 6 May, 10:00 - 10:20

SLP-L6.4: SPEECHMAPPER: SPEECH-TO-TEXT EMBEDDING PROJECTOR FOR LLMS

Biswesh Mohapatra, Inria Paris, France; Marcely Zanon Boito, Ioan Calapodescu, NAVER LABS Europe, France
Wed, 6 May, 10:20 - 10:40

SLP-L6.5: UNDERSTANDING TEXTUAL CAPABILITY DEGRADATION IN SPEECH LLMS VIA PARAMETER IMPORTANCE ANALYSIS

Chao Wang, Rui-Chen Zheng, Yang Ai, Zhen-Hua Ling, University of Science and Technology of China, China
Wed, 6 May, 10:40 - 11:00

SLP-L6.6: TASU: TEXT-ONLY ALIGNMENT FOR SPEECH UNDERSTANDING

Jing Peng, Yi Yang, Shanghai Jiao Tong University, China; Xu Li, AIspeech, China; Yu Xi, Shanghai Jiao Tong University, China; Quanwei Tang, Suzhou University, China; Yangui Fang, Huazhong University of Science and Technology, China; Junjie Li, Kai Yu, Shanghai Jiao Tong University, China