ASMSP-L3.5

Big model only for hard audios: Sample dependent Whisper model selection for efficient inferences

Hugo Malard, Salah Zaiem, Telecom Paris, France; Robin Algayres, Mohamed Bin Zayed University of Artificial Intelligence, France

Session:
ASMSP-L3: Multimodal and Cross-Domain Audio Learning Lecture

Track:
ASMSP - Acoustic, Speech and Music Signal Processing

Location:
Teatro del Sole

Presentation Time:
Wed, 10 Sep, 10:20 - 10:40 Italy Time (UTC +2)

Session Co-Chairs:
Tuomas Virtanen, Tampere University and Mark Sandler,
Presentation
Discussion
Resources
No resources available.