Paper: | H-6.2 | ||
Paper Title: | Cross-utterance Context for Multimodal Video Transcription | ||
Authors: | Roshan Sharma, Bhiksha Raj, Carnegie Mellon University, United States | ||
Session: | Computer Vision (invited) | ||
Location: | Virtual H | ||
Presentation Time: | Wednesday, November 2, 09:30 - 10:30 | ||
Virtual Presentation: | Attend on Virtual Platform | ||
Presentation: | Virtual | ||
Topic: | Speech, Image and Video Processing: Invited Session: Computer Vision |