Technical Program

Manuscripts are available on IEEE Xplore in the IEEE ICASSP 2020 Open Preview.

SPE-P23: Speech Recognition: General Topics

Session Type: Poster
Time: Friday, 8 May, 15:15 - 17:15
Location: On-Demand
Session Chair: Eric Fosler-Lussier, The Ohio State University
 
 SPE-P23.1: META LEARNING FOR END-TO-END LOW-RESOURCE SPEECH RECOGNITION
         Jui-Yang Hsu; National Taiwan University
         Yuan-Jui Chen; National Taiwan University
         Hung-yi Lee; National Taiwan University
 
 SPE-P23.2: CROSS-SPEAKER SILENT-SPEECH COMMAND WORD RECOGNITION USING ELECTRO-OPTICAL STOMATOGRAPHY
         Simon Stone; Technische Universität Dresden
         Peter Birkholz; Technische Universität Dresden
 
 SPE-P23.3: EXPLORING A ZERO-ORDER DIRECT HMM BASED ON LATENT ATTENTION FOR AUTOMATIC SPEECH RECOGNITION
         Parnia Bahar; RWTH Aachen University
         Nikita Makarov; RWTH Aachen University
         Albert Zeyer; RWTH Aachen University
         Ralf Schlüter; RWTH Aachen University
         Hermann Ney; RWTH Aachen University
 
 SPE-P23.4: IMPROVING DEVICE DIRECTEDNESS CLASSIFICATION OF UTTERANCES WITH SEMANTIC LEXICAL FEATURES
         Kellen Gillespie; Amazon, Inc.
         Ioannis Konstantakopoulos; Amazon, Inc.
         Xingzhi Guo; Stony Brook University
         Vishal Thanvantri Vasudevan; Amazon, Inc.
         Abhinav Sethy; Amazon, Inc.
 
 SPE-P23.5: TRAINING ASR MODELS BY GENERATION OF CONTEXTUAL INFORMATION
         Kritika Singh; Facebook AI
         Dmytro Okhonko; Facebook AI
         Jun Liu; Facebook AI
         Yongqiang Wang; Facebook AI
         Frank Zhang; Facebook AI
         Ross Girshick; Facebook AI
         Sergey Edunov; Facebook AI
         Fuchun Peng; Facebook AI
         Yatharth Saraf; Facebook AI
         Geoffrey Zweig; Facebook AI
         Abdelrahman Mohamed; Facebook AI
 
 SPE-P23.6: SPEECH RECOGNITION MODEL COMPRESSION
         Madhumitha Sakthi; University of Texas at Austin
         Ahmed Tewfik; University of Texas at Austin
         Raj Pawate; Cadence Design Systems Inc.
 
 SPE-P23.7: GPU-ACCELERATED VITERBI EXACT LATTICE DECODER FOR BATCHED ONLINE AND OFFLINE SPEECH RECOGNITION
         Hugo Braun; NVIDIA
         Justin Luitjens; NVIDIA
         Ryan Leary; NVIDIA
         Tim Kaldewey; NVIDIA
         Daniel Povey; Self
 
 SPE-P23.8: SEQUENCE-TO-SEQUENCE AUTOMATIC SPEECH RECOGNITION WITH WORD EMBEDDING REGULARIZATION AND FUSED DECODING
         Alexander H. Liu; National Taiwan University
         Tzu-Wei Sung; University of California, San Diego
         Shun-Po Chuang; National Taiwan University
         Hung-yi Lee; National Taiwan University
         Lin-shan Lee; National Taiwan University
 
 SPE-P23.9: SYNCHRONOUS TRANSFORMERS FOR END-TO-END SPEECH RECOGNITION
         Zhengkun Tian; Institute of Automation, Chinese Academy of Sciences
         Jiangyan Yi; Institute of Automation, Chinese Academy of Sciences
         Ye Bai; Institute of Automation, Chinese Academy of Sciences
         Jianhua Tao; Institute of Automation, Chinese Academy of Sciences
         Shuai Zhang; Institute of Automation, Chinese Academy of Sciences
         Zhengqi Wen; Institute of Automation, Chinese Academy of Sciences
 
 SPE-P23.10: INVESTIGATION OF METHODS TO IMPROVE THE RECOGNITION PERFORMANCE OF TAMIL-ENGLISH CODE-SWITCHED DATA IN TRANSFORMER FRAMEWORK
         Metilda Sagaya Mary N J; Indian Institute of Technology Madras
         Vishwas M. Shetty; Indian Institute of Technology Madras
         Srinivasan Umesh; Indian Institute of Technology Madras
 
 SPE-P23.11: BANGLA VOICE COMMAND RECOGNITION IN END-TO-END SYSTEM USING TOPIC MODELING BASED CONTEXTUAL RESCORING
         Nafis Sadeq; Bangladesh University of Engineering and Technology
         Shafayat Ahmed; Bangladesh University of Engineering and Technology
         Sudipta Saha Shubha; Bangladesh University of Engineering and Technology
         Md. Nahidul Islam; Bangladesh University of Engineering and Technology
         Muhammad Abdullah Adnan; Bangladesh University of Engineering and Technology
 
 SPE-P23.12: LEARNING TO DETECT KEYWORD PARTS AND WHOLE BY SMOOTHED MAX POOLING
         Hyun-Jin Park; Google
         Patrick Violette; Google
         Niranjan Subrahmanya; Google