WS-15c.4

A LIGHTWEIGHT DUAL-STAGE FRAMEWORK FOR PERSONALIZED SPEECH ENHANCEMENT BASED ON DEEPFILTERNET2

Thomas Serre, Orosound, France; Mathieu Fontaine, Télécom Paris, Institut Polytechnique de Paris, France; Éric Benhaim, Geoffroy Dutour, Orosound, France; Slim Essid, Télécom Paris, Institut Polytechnique de Paris, France

Session:
WS-15c: Hands-free Speech Communication and Microphone Arrays (HSCMA 2024): Efficient and Personalized Speech Processing through Data Science III Poster

Track:
Satellite Workshops

Location:
Workshop Poster
Poster Board WSP.4

Presentation Time:
Tue, 16 Apr, 13:10 - 15:10 (UTC +9)

Presentation
Discussion
Resources
No resources available.
Session WS-15c
WS-15c.1: Gated Low-rank Adaptation for personalized Code-Switching Automatic Speech Recognition on the low-spec devices
Gwantae Kim, Bokyeung Lee, Donghyeon Kim, Hanseok Ko, Korea University, Korea, Republic of
WS-15c.2: Light Gated Multi Mini-patch Extractor for Audio Classification
Bo He, Shiqi Zhang, Xianrui Wang, Zheng Qiu, Waseda University, Japan; Daiki Takeuchi, Daisuke Niizumi, Noboru Harada, NTT Corporation, Japan; Shoji Makino, Waseda University, Japan
WS-15c.3: Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization
Thilo von Neumann, Christoph Boeddeker, Tobias Cord-Landwehr, Paderborn University, Germany; Marc Delcroix, NTT Corporation, Japan; Reinhold Haeb-Umbach, Paderborn University, Germany
WS-15c.4: A LIGHTWEIGHT DUAL-STAGE FRAMEWORK FOR PERSONALIZED SPEECH ENHANCEMENT BASED ON DEEPFILTERNET2
Thomas Serre, Orosound, France; Mathieu Fontaine, Télécom Paris, Institut Polytechnique de Paris, France; Éric Benhaim, Geoffroy Dutour, Orosound, France; Slim Essid, Télécom Paris, Institut Polytechnique de Paris, France
WS-15c.5: VoiceVector: Multimodal Enrolment Vectors for Speaker Separation
Akam Rahimi, Andrew Zisserman, Triantafyllos Afouras, University of Oxford, United Kingdom of Great Britain and Northern Ireland
WS-15c.6: HOMULA-RIR: A Room Impulse Response Dataset for Teleconferencing and Spatial Audio Applications Acquired Through Higher-Order Microphones and Uniform Linear Microphone Arrays
Federico Miotello, Paolo Ostan, Mirco Pezzoli, Luca Comanducci, Alberto Bernardini, Fabio Antonacci, Augusto Sarti, Politecnico di Milano, Italy
Contacts