Technical Program

Paper Detail

Paper:SS-P5.7
Session:Multimodal Representation Learning for Language Generation and Understanding
Location:Poster Area E, Meeting Room 1A
Session Time:Thursday, May 16, 15:30 - 17:30
Presentation Time:Thursday, May 16, 15:30 - 17:30
Presentation: Poster
Topic: Special Sessions: Multimodal Representation Learning for Language Generation and Understanding
Paper Title: MULTIMODAL GROUNDING FOR SEQUENCE-TO-SEQUENCE SPEECH RECOGNITION
Manuscript Link:  Click here to view manuscript on IEEE Xplore
Authors: Ozan Caglayan, Le Mans University, France; Ramon Sanabria, Shruti Palaskar, Carnegie Mellon University, United States; Loïc Barrault, Le Mans University, France; Florian Metze, Carnegie Mellon University, United States