SPE-P23: Speech Recognition: General Topics |
Session Type: Poster |
Time: Friday, 8 May, 15:15 - 17:15 |
Location: On-Demand |
Virtual Session: View on Virtual Platform |
Session Chair: Eric Fosler-Lussier, The Ohio State University
|
|
SPE-P23.1: META LEARNING FOR END-TO-END LOW-RESOURCE SPEECH RECOGNITION |
Jui-Yang Hsu; National Taiwan University |
Yuan-Jui Chen; National Taiwan University |
Hung-yi Lee; National Taiwan University |
|
SPE-P23.2: CROSS-SPEAKER SILENT-SPEECH COMMAND WORD RECOGNITION USING ELECTRO-OPTICAL STOMATOGRAPHY |
Simon Stone; Technische Universität Dresden |
Peter Birkholz; Technische Universität Dresden |
|
SPE-P23.3: EXPLORING A ZERO-ORDER DIRECT HMM BASED ON LATENT ATTENTION FOR AUTOMATIC SPEECH RECOGNITION |
Parnia Bahar; RWTH Aachen University |
Nikita Makarov; RWTH Aachen University |
Albert Zeyer; RWTH Aachen University |
Ralf Schlüter; RWTH Aachen University |
Hermann Ney; RWTH Aachen University |
|
SPE-P23.4: IMPROVING DEVICE DIRECTEDNESS CLASSIFICATION OF UTTERANCES WITH SEMANTIC LEXICAL FEATURES |
Kellen Gillespie; Amazon, Inc. |
Ioannis Konstantakopoulos; Amazon, Inc. |
Xingzhi Guo; Stony Brook University |
Vishal Thanvantri Vasudevan; Amazon, Inc. |
Abhinav Sethy; Amazon, Inc. |
|
SPE-P23.5: TRAINING ASR MODELS BY GENERATION OF CONTEXTUAL INFORMATION |
Kritika Singh; Facebook AI |
Dmytro Okhonko; Facebook AI |
Jun Liu; Facebook AI |
Yongqiang Wang; Facebook AI |
Frank Zhang; Facebook AI |
Ross Girshick; Facebook AI |
Sergey Edunov; Facebook AI |
Fuchun Peng; Facebook AI |
Yatharth Saraf; Facebook AI |
Geoffrey Zweig; Facebook AI |
Abdelrahman Mohamed; Facebook AI |
|
SPE-P23.6: SPEECH RECOGNITION MODEL COMPRESSION |
Madhumitha Sakthi; University of Texas at Austin |
Ahmed Tewfik; University of Texas at Austin |
Raj Pawate; Cadence Design Systems Inc. |
|
SPE-P23.7: GPU-ACCELERATED VITERBI EXACT LATTICE DECODER FOR BATCHED ONLINE AND OFFLINE SPEECH RECOGNITION |
Hugo Braun; NVIDIA |
Justin Luitjens; NVIDIA |
Ryan Leary; NVIDIA |
Tim Kaldewey; NVIDIA |
Daniel Povey; Self |
|
SPE-P23.8: SEQUENCE-TO-SEQUENCE AUTOMATIC SPEECH RECOGNITION WITH WORD EMBEDDING REGULARIZATION AND FUSED DECODING |
Alexander H. Liu; National Taiwan University |
Tzu-Wei Sung; University of California, San Diego |
Shun-Po Chuang; National Taiwan University |
Hung-yi Lee; National Taiwan University |
Lin-shan Lee; National Taiwan University |
|
SPE-P23.9: SYNCHRONOUS TRANSFORMERS FOR END-TO-END SPEECH RECOGNITION |
Zhengkun Tian; Institute of Automation, Chinese Academy of Sciences |
Jiangyan Yi; Institute of Automation, Chinese Academy of Sciences |
Ye Bai; Institute of Automation, Chinese Academy of Sciences |
Jianhua Tao; Institute of Automation, Chinese Academy of Sciences |
Shuai Zhang; Institute of Automation, Chinese Academy of Sciences |
Zhengqi Wen; Institute of Automation, Chinese Academy of Sciences |
|
SPE-P23.10: INVESTIGATION OF METHODS TO IMPROVE THE RECOGNITION PERFORMANCE OF TAMIL-ENGLISH CODE-SWITCHED DATA IN TRANSFORMER FRAMEWORK |
Metilda Sagaya Mary N J; Indian Institute of Technology Madras |
Vishwas M. Shetty; Indian Institute of Technology Madras |
Srinivasan Umesh; Indian Institute of Technology Madras |
|
SPE-P23.11: BANGLA VOICE COMMAND RECOGNITION IN END-TO-END SYSTEM USING TOPIC MODELING BASED CONTEXTUAL RESCORING |
Nafis Sadeq; Bangladesh University of Engineering and Technology |
Shafayat Ahmed; Bangladesh University of Engineering and Technology |
Sudipta Saha Shubha; Bangladesh University of Engineering and Technology |
Md. Nahidul Islam; Bangladesh University of Engineering and Technology |
Muhammad Abdullah Adnan; Bangladesh University of Engineering and Technology |
|
SPE-P23.12: LEARNING TO DETECT KEYWORD PARTS AND WHOLE BY SMOOTHED MAX POOLING |
Hyun-Jin Park; Google |
Patrick Violette; Google |
Niranjan Subrahmanya; Google |
|