| Paper: | H-6.2 | ||
| Paper Title: | Cross-utterance Context for Multimodal Video Transcription | ||
| Authors: | Roshan Sharma, Bhiksha Raj, Carnegie Mellon University, United States | ||
| Session: | Computer Vision (invited) | ||
| Location: | Virtual H | ||
| Presentation Time: | Wednesday, November 2, 09:30 - 10:30 | ||
| Presentation: | Virtual | ||
| Topic: | Speech, Image and Video Processing: Invited Session: Computer Vision | ||