| Paper: | SS-P5.7 |
| Session: | Multimodal Representation Learning for Language Generation and Understanding |
| Session Time: | Thursday, May 16, 15:30 - 17:30 |
| Presentation Time: | Thursday, May 16, 15:30 - 17:30 |
| Presentation: | Poster |
| Topic: | Special Sessions: Multimodal Representation Learning for Language Generation and Understanding |
| Paper Title: | MULTIMODAL GROUNDING FOR SEQUENCE-TO-SEQUENCE SPEECH RECOGNITION |
| Authors: | Ozan Caglayan; Le Mans University |
| | Ramon Sanabria; Carnegie Mellon University |
| | Shruti Palaskar; Carnegie Mellon University |
| | Loïc Barrault; Le Mans University |
| | Florian Metze; Carnegie Mellon University |