Technical Program

Manuscripts are available on IEEE Xplore in the IEEE ICASSP 2020 Open Preview.

SPE-P16: Word Spotting

Session Type: Poster
Time: Friday, 8 May, 08:00 - 10:00
Location: On-Demand
Session Chairs: Lei Xie, Northwestern Polytechnical University and Lin-Shan Lee, National Taiwan University
 
 SPE-P16.1: MINING EFFECTIVE NEGATIVE TRAINING SAMPLES FOR KEYWORD SPOTTING
         Jingyong Hou; Northwestern Polytechnical University
         Yangyang Shi; Mobvoi AI Lab
         Mari Ostendorf; University of Washington
         Mei-Yuh Hwang; Mobvoi AI Lab
         Lei Xie; Northwestern Polytechnical University
 
 SPE-P16.2: MULTI-TASK LEARNING FOR VOICE TRIGGER DETECTION
         Siddharth Sigtia; Apple
         Pascal Clark; Apple
         Rob Haynes; Apple
         Hywel Richards; Apple
         John Bridle; Apple
 
 SPE-P16.3: SMALL-FOOTPRINT KEYWORD SPOTTING ON RAW AUDIO DATA WITH SINC-CONVOLUTIONS
         Simon Mittermaier; Technische Universität München
         Ludwig Kürzinger; Technische Universität München
         Bernd Waschneck; Infineon Technologies AG
         Gerhard Rigoll; Technische Universität München
 
 SPE-P16.4: LATTICE-BASED IMPROVEMENTS FOR VOICE TRIGGERING USING GRAPH NEURAL NETWORKS
         Pranay Dighe; Apple
         Saurabh Adya; Apple
         Nuoyu Li; Apple
         Srikanth Vishnubhotla; Apple
         Devang Naik; Apple
         Adithya Sagar; Apple
         Ying Ma; Apple
         Stephen Pulman; Apple
         Jason Williams; Apple
 
 SPE-P16.5: INTEGRATION OF MULTI-LOOK BEAMFORMERS FOR MULTI-CHANNEL KEYWORD SPOTTING
         Xuan Ji; Tencent
         Meng Yu; Tencent
         Jie Chen; Tencent
         Jimeng Zheng; Tencent
         Dan Su; Tencent
         Dong Yu; Tencent
 
 SPE-P16.6: FAST LATTICE-FREE KEYWORD FILTERING FOR ACCELERATED SPOKEN TERM DETECTION
         Jonathan Wintrode; Raytheon Applied Signal Technology
         Jenny Wilkes; Raytheon Applied Signal Technology
 
 SPE-P16.7: TRAINING KEYWORD SPOTTERS WITH LIMITED AND SYNTHESIZED SPEECH DATA
         James Lin; Google Research
         Kevin Kilgour; Google Research
         Dominik Roblek; Google Research
         Matt Sharifi; Google Research
 
 SPE-P16.8: TOWARDS DATA-EFFICIENT MODELING FOR WAKE WORD SPOTTING
         Yixin Gao; Amazon, Inc.
         Yuriy Mishchenko; Amazon, Inc.
         Anish Shah; Amazon, Inc.
         Spyros Matsoukas; Amazon, Inc.
         Shiv Vitaladevuni; Amazon, Inc.
 
 SPE-P16.9: ADAPTATION OF RNN TRANSDUCER WITH TEXT-TO-SPEECH TECHNOLOGY FOR KEYWORD SPOTTING
         Eva Sharma; Khoury College of Computer Sciences, Northeastern University
         Guoli Ye; Speech and Language Group, Microsoft
         Wenning Wei; Microsoft China
         Rui Zhao; Speech and Language Group, Microsoft
         Yao Tian; Microsoft China
         Jian Wu; Speech and Language Group, Microsoft
         Lei He; Microsoft China
         Ed Lin; Microsoft China
         Yifan Gong; Speech and Language Group, Microsoft
 
 SPE-P16.11: CRNN-CTC BASED MANDARIN KEYWORDS SPOTTING
         Haikang Yan; South China University of Technology
         Qianhua He; South China University of Technology
         Wei Xie; South China University of Technology