Wed AM1.L5.4

Multitask learning in Audio Captioning: a sentence embedding regression loss acts as a regularizer

Etienne Labbé, Julien Pinquier, Thomas Pellegrini, IRIT, France

Session:
Wed AM1.L5: Multimodal Learning for Audio and Language Lecture

Track:
Special Sessions

Location:
Press room

Presentation Time:
Wed, 6 Sep, 11:30 - 11:50 Finland Time (UTC +3)

Session Chair:
Xubo Liu, University of Surrey