Paper: | SS-P5.7 |
Session: | Multimodal Representation Learning for Language Generation and Understanding |
Session Time: | Thursday, May 16, 15:30 - 17:30 |
Presentation Time: | Thursday, May 16, 15:30 - 17:30 |
Presentation: | Poster |
Topic: | Special Sessions: Multimodal Representation Learning for Language Generation and Understanding |
Paper Title: | MULTIMODAL GROUNDING FOR SEQUENCE-TO-SEQUENCE SPEECH RECOGNITION |
Authors: | Ozan Caglayan; Le Mans University |
| Ramon Sanabria; Carnegie Mellon University |
| Shruti Palaskar; Carnegie Mellon University |
| Loïc Barrault; Le Mans University |
| Florian Metze; Carnegie Mellon University |