SS-L11: In-Context Learning Methods for Speech and Spoken Language Processing
Thu, 18 Apr, 16:30 - 18:30 (UTC +9)
Location: Room 103
Session Type: Lecture
Session Co-Chairs: Chao Zhang, Cambridge University; Chao-Han Huck Yang, NVIDIA; Marco Siniscalchi, University of Palermo
Track: Special Sessions
Thu, 18 Apr, 16:30 - 16:50 (UTC +9)
SS-L11.1: SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation
Thu, 18 Apr, 16:50 - 17:10 (UTC +9)
SS-L11.2: Hierarchical cross-modality knowledge transfer with Sinkhorn attention for CTC-based ASR
Thu, 18 Apr, 17:10 - 17:30 (UTC +9)
SS-L11.3: VoxtLM: Unified Decoder-only Models for Consolidating Speech Recognition, Synthesis and Speech, Text Continuation Tasks
Thu, 18 Apr, 17:30 - 17:50 (UTC +9)
SS-L11.4: Prompting Large Language Models with Speech Recognition Abilities
Thu, 18 Apr, 18:10 - 18:30 (UTC +9)