SPE-P16: Word Spotting |
Session Type: Poster |
Time: Friday, 8 May, 08:00 - 10:00 |
Location: On-Demand |
Virtual Session: View on Virtual Platform |
Session Chairs: Lei Xie, Northwestern Polytechnical University and Lin-Shan Lee, National Taiwan University
|
|
SPE-P16.1: MINING EFFECTIVE NEGATIVE TRAINING SAMPLES FOR KEYWORD SPOTTING |
Jingyong Hou; Northwestern Polytechnical University |
Yangyang Shi; Mobvoi AI Lab |
Mari Ostendorf; University of Washington |
Mei-Yuh Hwang; Mobvoi AI Lab |
Lei Xie; Northwestern Polytechnical University |
|
SPE-P16.2: MULTI-TASK LEARNING FOR VOICE TRIGGER DETECTION |
Siddharth Sigtia; Apple |
Pascal Clark; Apple |
Rob Haynes; Apple |
Hywel Richards; Apple |
John Bridle; Apple |
|
SPE-P16.3: SMALL-FOOTPRINT KEYWORD SPOTTING ON RAW AUDIO DATA WITH SINC-CONVOLUTIONS |
Simon Mittermaier; Technische Universität München |
Ludwig Kürzinger; Technische Universität München |
Bernd Waschneck; Infineon Technologies AG |
Gerhard Rigoll; Technische Universität München |
|
SPE-P16.4: LATTICE-BASED IMPROVEMENTS FOR VOICE TRIGGERING USING GRAPH NEURAL NETWORKS |
Pranay Dighe; Apple |
Saurabh Adya; Apple |
Nuoyu Li; Apple |
Srikanth Vishnubhotla; Apple |
Devang Naik; Apple |
Adithya Sagar; Apple |
Ying Ma; Apple |
Stephen Pulman; Apple |
Jason Williams; Apple |
|
SPE-P16.5: INTEGRATION OF MULTI-LOOK BEAMFORMERS FOR MULTI-CHANNEL KEYWORD SPOTTING |
Xuan Ji; Tencent |
Meng Yu; Tencent |
Jie Chen; Tencent |
Jimeng Zheng; Tencent |
Dan Su; Tencent |
Dong Yu; Tencent |
|
SPE-P16.6: FAST LATTICE-FREE KEYWORD FILTERING FOR ACCELERATED SPOKEN TERM DETECTION |
Jonathan Wintrode; Raytheon Applied Signal Technology |
Jenny Wilkes; Raytheon Applied Signal Technology |
|
SPE-P16.7: TRAINING KEYWORD SPOTTERS WITH LIMITED AND SYNTHESIZED SPEECH DATA |
James Lin; Google Research |
Kevin Kilgour; Google Research |
Dominik Roblek; Google Research |
Matt Sharifi; Google Research |
|
SPE-P16.8: TOWARDS DATA-EFFICIENT MODELING FOR WAKE WORD SPOTTING |
Yixin Gao; Amazon, Inc. |
Yuriy Mishchenko; Amazon, Inc. |
Anish Shah; Amazon, Inc. |
Spyros Matsoukas; Amazon, Inc. |
Shiv Vitaladevuni; Amazon, Inc. |
|
SPE-P16.9: ADAPTATION OF RNN TRANSDUCER WITH TEXT-TO-SPEECH TECHNOLOGY FOR KEYWORD SPOTTING |
Eva Sharma; Khoury College of Computer Sciences, Northeastern University |
Guoli Ye; Speech and Language Group, Microsoft |
Wenning Wei; Microsoft China |
Rui Zhao; Speech and Language Group, Microsoft |
Yao Tian; Microsoft China |
Jian Wu; Speech and Language Group, Microsoft |
Lei He; Microsoft China |
Ed Lin; Microsoft China |
Yifan Gong; Speech and Language Group, Microsoft |
|
SPE-P16.11: CRNN-CTC BASED MANDARIN KEYWORDS SPOTTING |
Haikang Yan; South China University of Technology |
Qianhua He; South China University of Technology |
Wei Xie; South China University of Technology |
|