Technical Program

SLR-1: Speaker/Language Recognition I

Session Type: Poster
Time: Sunday, December 15, 16:00 - 17:30
Location: VHS Event Centre, Level 1
Session Chair: Tomi Kinnunen, University of Eastern Finland
 
SLR-1.1: JOINT OPTIMIZATION OF CLASSIFICATION AND CLUSTERING FOR DEEP SPEAKER EMBEDDING
Zhiming Wang, Kaisheng Yao, Shuo Fang, Xiaolong Li, Ant Financial, Inc., China
 
SLR-1.2: EXPLORING EFFECTIVE DATA AUGMENTATION WITH TDNN-LSTM NEURAL NETWORK EMBEDDING FOR SPEAKER RECOGNITION
Chien-Lin Huang, PingAn AI Lab, United States
 
SLR-1.3: END-TO-END NEURAL SPEAKER DIARIZATION WITH SELF-ATTENTION
Yusuke Fujita, Naoyuki Kanda, Shota Horiguchi, Yawen Xue, Kenji Nagamatsu, Hitachi, Ltd., Japan; Shinji Watanabe, Johns Hopkins University, United States
 
SLR-1.4: A CROSS-CORPUS STUDY ON SPEECH EMOTION RECOGNITION
Rosanna Milner, Md Asif Jalal, University of Sheffield, United Kingdom; Raymond W. M. Ng, Emotech Labs, United Kingdom; Thomas Hain, University of Sheffield, United Kingdom
 
SLR-1.5: ADVERSARIAL ATTACKS ON SPOOFING COUNTERMEASURES OF AUTOMATIC SPEAKER VERIFICATION
Songxiang Liu, Chinese University of Hong Kong, China; Haibin Wu, Hung-yi Lee, National Taiwan University, Taiwan; Helen Meng, Chinese University of Hong Kong, China
 
SLR-1.6: SPOKEN LANGUAGE IDENTIFICATION USING BIDIRECTIONAL LSTM BASED LID SEQUENTIAL SENONES.
Muralikrishna H, Pulkit Sapra, Anuksha Jain, Dileep Aroor Dinesh, Indian Institute of Technology, Mandi, India
 
SLR-1.7: TIME-DOMAIN SPEAKER EXTRACTION NETWORK
Chenglin Xu, Nanyang Technological University, Singapore; Wei Rao, National University of Singapore, Singapore; Eng Siong Chng, Nanyang Technological University, Singapore; Haizhou Li, National University of Singapore, Singapore
 
SLR-1.8: SHORT UTTERANCE COMPENSATION IN SPEAKER VERIFICATION VIA COSINE-BASED TEACHER-STUDENT LEARNING OF SPEAKER EMBEDDINGS
Jee-weon Jung, Hee-Soo Heo, Hye-jin Shim, Ha-Jin Yu, University of Seoul, Korea (South)
 
SLR-1.9: NOVEL ENHANCED TEAGER ENERGY BASED CEPSTRAL COEFFICIENTS FOR REPLAY SPOOF DETECTION
Rajul Acharya, Hemant A. Patil, Harsh Kotta, Dhirubhai Ambani Institute of Information and Communication Technology, India
 
SLR-1.10: SYLLABLE-DEPENDENT DISCRIMINATIVE LEARNING FOR SMALL FOOTPRINT TEXT-DEPENDENT SPEAKER VERIFICATION
Junyi Peng, Yuexian Zou, Peking University, China; Na Li, Deyi Tuo, Dan Su, Meng Yu, Chunlei Zhang, Dong Yu, Tencent, China
 
SLR-1.11: LATENT SPACE REPRESENTATION FOR MULTI-TARGET SPEAKER DETECTION AND IDENTIFICATION WITH A SPARSE DATASET USING TRIPLET NEURAL NETWORKS
Kin Wai Cheuk, Balamurali B T, Gemma Roig, Dorien Herremans, Singapore University of Technology and Design, Singapore
 
SLR-1.12: SELF-ADAPTIVE SOFT VOICE ACTIVITY DETECTION USING DEEP NEURAL NETWORKS FOR ROBUST SPEAKER VERIFICATION
Youngmoon Jung, Yeunju Choi, Hoirin Kim, Korea Advanced Institute of Science and Technology (KAIST), Korea (South)
 
SLR-1.13: SPHEREDIAR: AN EFFECTIVE SPEAKER DIARIZATION SYSTEM FOR MEETING DATA
Tuomas Kaseva, Aku Rouhe, Mikko Kurimo, Aalto University, Finland
 
SLR-1.14: BAYESIAN ADVERSARIAL LEARNING FOR SPEAKER RECOGNITION
Jen-Tzung Chien, Chun-Lin Kuo, National Chiao Tung University, Taiwan
 
SLR-1.15: AN INVESTIGATION OF LSTM-CTC BASED JOINT ACOUSTIC MODEL FOR INDIAN LANGUAGE IDENTIFICATION
Tirusha Mandava, Ravi Kumar Vuddagiri, Hari Krishna Vydana, Anil Kumar Vuppala, IIIT Hyderabad, India
 
SLR-1.16: A MULTI PURPOSE AND LARGE SCALE SPEECH CORPUS IN PERSIAN AND ENGLISH FOR SPEAKER AND SPEECH RECOGNITION: THE DEEPMINE DATABASE
Hossein Zeinali, Lukáš Burget, Jan Cernocky, Brno University of Technology, Czech Republic
 
SLR-1.17: NATIVE LANGUAGE IDENTIFICATION FROM RAW WAVEFORMS USING DEEP CONVOLUTIONAL NEURAL NETWORKS WITH ATTENTIVE POOLING
Rutuja Ubale, Vikram Ramanarayanan, Yao Qian, Keelan Evanini, Chee Wee Leong, Chong Min Lee, Educational Testing Service, United States
 
SLR-1.18: SPEAKER VERIFICATION WITH APPLICATION-AWARE BEAMFORMING
Ladislav Mošner, Oldřich Plchot, Johan Rohdin, Lukáš Burget, Jan Černocký, Brno University of Technology, Czech Republic