SLP-L25: Audio-visual speech/intent recognition
Fri, 19 Apr, 08:20 - 10:20 (UTC +9)
Location: Room 102
Session Type: Lecture
Session Co-Chairs: Albert Zeyer, AppTek GmbH and Dmitriy Serdyuk, Google
Track: Speech and Language Processing
Click the to view the manuscript on IEEE Xplore Open Preview
Fri, 19 Apr, 08:20 - 08:40 (UTC +9)
SLP-L25.1: Conformer is all you need for visual speech recognition
Fri, 19 Apr, 08:40 - 09:00 (UTC +9)
SLP-L25.2: LITEVSR: EFFICIENT VISUAL SPEECH RECOGNITION BY LEARNING FROM SPEECH REPRESENTATIONS OF UNLABELED DATA
Fri, 19 Apr, 09:00 - 09:20 (UTC +9)
SLP-L25.3: MULTILINGUAL AUDIO-VISUAL SPEECH RECOGNITION WITH HYBRID CTC/RNN-T FAST CONFORMER
Fri, 19 Apr, 09:20 - 09:40 (UTC +9)
SLP-L25.4: LCB-NET: LONG-CONTEXT BIASING FOR AUDIO-VISUAL SPEECH RECOGNITION
Fri, 19 Apr, 09:40 - 10:00 (UTC +9)
SLP-L25.5: VILAS: EXPLORING THE EFFECTS OF VISION AND LANGUAGE CONTEXT IN AUTOMATIC SPEECH RECOGNITION
Fri, 19 Apr, 10:00 - 10:20 (UTC +9)