SLP-L21: End-to-end modeling for automatic speech recognition
Thu, 18 Apr, 13:10 - 15:10 (UTC +9)
Location: Room 103
Session Type: Lecture
Session Co-Chairs: Bhuvana Ramabhadran, Google and Jinyu Li, Microsoft
Track: Speech and Language Processing
Click the to view the manuscript on IEEE Xplore Open Preview
Thu, 18 Apr, 13:10 - 13:30 (UTC +9)
SLP-L21.1: IMPROVING ATTENTION-BASED END-TO-END SPEECH RECOGNITION BY MONOTONIC ALIGNMENT ATTENTION MATRIX RECONSTRUCTION
Thu, 18 Apr, 13:30 - 13:50 (UTC +9)
SLP-L21.2: USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models
Thu, 18 Apr, 13:50 - 14:10 (UTC +9)
SLP-L21.3: KEEP DECODING PARALLEL WITH EFFECTIVE KNOWLEDGE DISTILLATION FROM LANGUAGE MODELS TO END-TO-END SPEECH RECOGNISERS
Thu, 18 Apr, 14:10 - 14:30 (UTC +9)
SLP-L21.4: EXTREME ENCODER OUTPUT FRAME RATE REDUCTION: IMPROVING COMPUTATIONAL LATENCIES OF LARGE END-TO-END MODELS
Thu, 18 Apr, 14:30 - 14:50 (UTC +9)
SLP-L21.5: IMPROVING MULTI-SPEAKER ASR WITH OVERLAP-AWARE ENCODING AND MONOTONIC ATTENTION
Thu, 18 Apr, 14:50 - 15:10 (UTC +9)