Technical Program

Click on the icon to view the manuscript on IEEE XPlore in the IEEE ICASSP 2020 Open Preview.

SPE-P22: Large Vocabulary Continuous Speech Recognition and Search

Session Type: Poster
Time: Friday, 8 May, 15:15 - 17:15
Location: On-Demand
Virtual Session: View on Virtual Platform
Session Chairs: Gakuto Kurata, IBM Research - Tokyo and Steve Renals, University of Edinburgh
 
 SPE-P22.1: IMPROVING PROPER NOUN RECOGNITION IN END-TO-END ASR BY CUSTOMIZATION OF THE MWER LOSS CRITERION
         Cal Peyser; Google, Inc.
         Tara Sainath; Google, Inc.
         Golan Pundak; Google, Inc.
 
 SPE-P22.2: NEURAL LATTICE SEARCH FOR SPEECH RECOGNITION
         Rao Ma; Shanghai Jiao Tong University
         Hao Li; Shanghai Jiao Tong University
         Qi Liu; Shanghai Jiao Tong University
         Lu Chen; Shanghai Jiao Tong University
         Kai Yu; Shanghai Jiao Tong University
 
 SPE-P22.3: DELIBERATION MODEL BASED TWO-PASS END-TO-END SPEECH RECOGNITION
         Ke Hu; Google, Inc.
         Tara Sainath; Google, Inc.
         Ruoming Pang; Google, Inc.
         Rohit Prabhavalkar; Google, Inc.
 
 SPE-P22.4: ALIGNMENT-LENGTH SYNCHRONOUS DECODING FOR RNN TRANSDUCER
         George Saon; IBM Research AI
         Zoltan Tuske; IBM Research AI
         Kartik Audhkhasi; IBM Research AI
 
 SPE-P22.5: INCORPORATING WRITTEN DOMAIN NUMERIC GRAMMARS INTO END-TO-END CONTEXTUAL SPEECH RECOGNITION SYSTEMS FOR IMPROVED RECOGNITION OF NUMERIC SEQUENCES
         Ben Haynor; Google, Inc.
         Petar Aleksic; Google, Inc.
 
 SPE-P22.6: LSTM-BASED ONE-PASS DECODER FOR LOW-LATENCY STREAMING
         Javier Jorge; Universitat Politècnica de València
         Adrià Giménez; Universitat Politècnica de València
         Javier Iranzo-Sánchez; Universitat Politècnica de València
         Joan Albert Silvestre-Cerdà; Universitat Politècnica de València
         Jorge Civera; Universitat Politècnica de València
         Albert Sanchis; Universitat Politècnica de València
         Alfons Juan; Universitat Politècnica de València
 
 SPE-P22.7: MULTISTATE ENCODING WITH END-TO-END SPEECH RNN TRANSDUCER NETWORK
         Zelin Wu; Google LLC
         Bo Li; Google LLC
         Yu Zhang; Google LLC
         Petar Aleksic; Google LLC
         Tara Sainath; Google LLC
 
 SPE-P22.8: NEURAL ORACLE SEARCH ON N-BEST HYPOTHESES
         Ehsan Variani; Google
         Tongzhou Chen; Google
         James Apfel; Google
         Bhuvana Ramabhadran; Google
         Seungji Lee; Google
         Pedro Moreno; Google
 
 SPE-P22.10: TRANSFORMER TRANSDUCER: A STREAMABLE SPEECH RECOGNITION MODEL WITH TRANSFORMER ENCODERS AND RNN-T LOSS
         Qian Zhang; Google
         Han Lu; Google
         Hasim Sak; Google
         Anshuman Tripathi; Google
         Erik McDermott; Google
         Stephen Koo; Google
         Shankar Kumar; Google
 
 SPE-P22.11: FULL-SUM DECODING FOR HYBRID HMM BASED SPEECH RECOGNITION USING LSTM LANGUAGE MODEL
         Wei Zhou; RWTH Aachen University
         Ralf Schlüter; RWTH Aachen University
         Hermann Ney; RWTH Aachen University
 
 SPE-P22.12: THE RWTH ASR SYSTEM FOR TED-LIUM RELEASE 2: IMPROVING HYBRID HMM WITH SPECAUGMENT
         Wei Zhou; RWTH Aachen University
         Wilfried Michel; RWTH Aachen University
         Kazuki Irie; RWTH Aachen University
         Markus Kitza; RWTH Aachen University
         Ralf Schlüter; RWTH Aachen University
         Hermann Ney; RWTH Aachen University