EUSIPCO 2024 || Lyon, France || 26

TH3.SC2: Speech Recognition

Thu, 29 Aug, 16:10 - 17:50 France Time (UTC +2)

Location: Saint Clair 2

Session Type: Lecture

Session Co-Chairs: Stefan Goetze, University of Sheffield and Andreas Triantafyllopoulos, Technical University of Munich

Track: ASMSP - Acoustic, Speech and Music Signal Processing

Thu, 29 Aug, 16:10 - 16:30 France Time (UTC +2)

TH3.SC2.1: Character Error Rate Estimation for Automatic Speech Recognition of Short Utterances

Chanho Park, Hyunsik Kang, Thomas Hain, University of Sheffield, United Kingdom

Thu, 29 Aug, 16:30 - 16:50 France Time (UTC +2)

TH3.SC2.2: Improving Accented Speech Recognition Using Data Augmentation Based on Unsupervised Text-to-Speech Synthesis

Cong-Thanh Do, Toshiba Europe Limited, United Kingdom; Shuhei Imai, Tohoku University, Japan; Rama Doddipatla, Toshiba Europe Limited, United Kingdom; Thomas Hain, University of Sheffield, United Kingdom

Thu, 29 Aug, 16:50 - 17:10 France Time (UTC +2)

TH3.SC2.3: A Comprehensive Analysis of Tokenization and Self-Supervised Learning in End-to-End Automatic Speech Recognition applied on French Language

Thibault Bañeras-Roux, Nantes University, France; Mickael Rouvier, Avignon University, France; Jane Wottawa, Le Mans University, France; Richard Dufour, Nantes University, France

Thu, 29 Aug, 17:10 - 17:30 France Time (UTC +2)

TH3.SC2.4: Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model

Jiawen Huang, Emmanouil Benetos, Queen Mary University of London, United Kingdom

Thu, 29 Aug, 17:30 - 17:50 France Time (UTC +2)

TH3.SC2.5: LDASR: An Experimental Study on Layer Drop using Conformer-based Architecture

Abdul Hannan, University of Trento, Italy; Alessio Brutti, Daniele Falavigna, Fondazione Bruno Kessler, Italy