SPE-P23: Speech Recognition: General Topics |
| Session Type: Poster |
| Time: Friday, 8 May, 15:15 - 17:15 |
| Location: On-Demand |
| Virtual Session: View on Virtual Platform |
| Session Chair: Eric Fosler-Lussier, The Ohio State University |
| SPE-P23.1: META LEARNING FOR END-TO-END LOW-RESOURCE SPEECH RECOGNITION |
| Jui-Yang Hsu; National Taiwan University |
| Yuan-Jui Chen; National Taiwan University |
| Hung-yi Lee; National Taiwan University |
| SPE-P23.2: CROSS-SPEAKER SILENT-SPEECH COMMAND WORD RECOGNITION USING ELECTRO-OPTICAL STOMATOGRAPHY |
| Simon Stone; Technische Universität Dresden |
| Peter Birkholz; Technische Universität Dresden |
| SPE-P23.3: EXPLORING A ZERO-ORDER DIRECT HMM BASED ON LATENT ATTENTION FOR AUTOMATIC SPEECH RECOGNITION |
| Parnia Bahar; RWTH Aachen University |
| Nikita Makarov; RWTH Aachen University |
| Albert Zeyer; RWTH Aachen University |
| Ralf Schlüter; RWTH Aachen University |
| Hermann Ney; RWTH Aachen University |
| SPE-P23.4: IMPROVING DEVICE DIRECTEDNESS CLASSIFICATION OF UTTERANCES WITH SEMANTIC LEXICAL FEATURES |
| Kellen Gillespie; Amazon, Inc. |
| Ioannis Konstantakopoulos; Amazon, Inc. |
| Xingzhi Guo; Stony Brook University |
| Vishal Thanvantri Vasudevan; Amazon, Inc. |
| Abhinav Sethy; Amazon, Inc. |
| SPE-P23.5: TRAINING ASR MODELS BY GENERATION OF CONTEXTUAL INFORMATION |
| Kritika Singh; Facebook AI |
| Dmytro Okhonko; Facebook AI |
| Jun Liu; Facebook AI |
| Yongqiang Wang; Facebook AI |
| Frank Zhang; Facebook AI |
| Ross Girshick; Facebook AI |
| Sergey Edunov; Facebook AI |
| Fuchun Peng; Facebook AI |
| Yatharth Saraf; Facebook AI |
| Geoffrey Zweig; Facebook AI |
| Abdelrahman Mohamed; Facebook AI |
| SPE-P23.6: SPEECH RECOGNITION MODEL COMPRESSION |
| Madhumitha Sakthi; University of Texas at Austin |
| Ahmed Tewfik; University of Texas at Austin |
| Raj Pawate; Cadence Design Systems Inc. |
| SPE-P23.7: GPU-ACCELERATED VITERBI EXACT LATTICE DECODER FOR BATCHED ONLINE AND OFFLINE SPEECH RECOGNITION |
| Hugo Braun; NVIDIA |
| Justin Luitjens; NVIDIA |
| Ryan Leary; NVIDIA |
| Tim Kaldewey; NVIDIA |
| Daniel Povey; Self |
| SPE-P23.8: SEQUENCE-TO-SEQUENCE AUTOMATIC SPEECH RECOGNITION WITH WORD EMBEDDING REGULARIZATION AND FUSED DECODING |
| Alexander H. Liu; National Taiwan University |
| Tzu-Wei Sung; University of California, San Diego |
| Shun-Po Chuang; National Taiwan University |
| Hung-yi Lee; National Taiwan University |
| Lin-shan Lee; National Taiwan University |
| SPE-P23.9: SYNCHRONOUS TRANSFORMERS FOR END-TO-END SPEECH RECOGNITION |
| Zhengkun Tian; Institute of Automation, Chinese Academy of Sciences |
| Jiangyan Yi; Institute of Automation, Chinese Academy of Sciences |
| Ye Bai; Institute of Automation, Chinese Academy of Sciences |
| Jianhua Tao; Institute of Automation, Chinese Academy of Sciences |
| Shuai Zhang; Institute of Automation, Chinese Academy of Sciences |
| Zhengqi Wen; Institute of Automation, Chinese Academy of Sciences |
| SPE-P23.10: INVESTIGATION OF METHODS TO IMPROVE THE RECOGNITION PERFORMANCE OF TAMIL-ENGLISH CODE-SWITCHED DATA IN TRANSFORMER FRAMEWORK |
| Metilda Sagaya Mary N J; Indian Institute of Technology Madras |
| Vishwas M. Shetty; Indian Institute of Technology Madras |
| Srinivasan Umesh; Indian Institute of Technology Madras |
| SPE-P23.11: BANGLA VOICE COMMAND RECOGNITION IN END-TO-END SYSTEM USING TOPIC MODELING BASED CONTEXTUAL RESCORING |
| Nafis Sadeq; Bangladesh University of Engineering and Technology |
| Shafayat Ahmed; Bangladesh University of Engineering and Technology |
| Sudipta Saha Shubha; Bangladesh University of Engineering and Technology |
| Md. Nahidul Islam; Bangladesh University of Engineering and Technology |
| Muhammad Abdullah Adnan; Bangladesh University of Engineering and Technology |
| SPE-P23.12: LEARNING TO DETECT KEYWORD PARTS AND WHOLE BY SMOOTHED MAX POOLING |
| Hyun-Jin Park; Google |
| Patrick Violette; Google |
| Niranjan Subrahmanya; Google |