ASMSP-L3.1

Representation Learning for Semantic Alignment of Language, Audio, and Visual Modalities.

Parthasaarathy Sudarsanam, Irene Martín-Morató, Tuomas Virtanen, Tampere University, Finland

Session:
ASMSP-L3: Multimodal and Cross-Domain Audio Learning Lecture

Track:
ASMSP - Acoustic, Speech and Music Signal Processing

Location:
Teatro del Sole

Presentation Time:
Wed, 10 Sep, 09:00 - 09:20 Italy Time (UTC +2)

Session Co-Chairs:
Tuomas Virtanen, Tampere University and Mark Sandler,
Presentation
Discussion
Resources
No resources available.