SLR-1: Speaker/Language Recognition I |
| Session Type: Poster |
| Time: Sunday, December 15, 16:00 - 17:30 |
| Location: VHS Event Centre, Level 1 |
| Session Chair: Tomi Kinnunen, University of Eastern Finland |
| SLR-1.1: JOINT OPTIMIZATION OF CLASSIFICATION AND CLUSTERING FOR DEEP SPEAKER EMBEDDING |
| Zhiming Wang, Kaisheng Yao, Shuo Fang, Xiaolong Li, Ant Financial, Inc., China |
| SLR-1.2: EXPLORING EFFECTIVE DATA AUGMENTATION WITH TDNN-LSTM NEURAL NETWORK EMBEDDING FOR SPEAKER RECOGNITION |
| Chien-Lin Huang, PingAn AI Lab, United States |
| SLR-1.3: END-TO-END NEURAL SPEAKER DIARIZATION WITH SELF-ATTENTION |
| Yusuke Fujita, Naoyuki Kanda, Shota Horiguchi, Yawen Xue, Kenji Nagamatsu, Hitachi, Ltd., Japan; Shinji Watanabe, Johns Hopkins University, United States |
| SLR-1.4: A CROSS-CORPUS STUDY ON SPEECH EMOTION RECOGNITION |
| Rosanna Milner, Md Asif Jalal, University of Sheffield, United Kingdom; Raymond W. M. Ng, Emotech Labs, United Kingdom; Thomas Hain, University of Sheffield, United Kingdom |
| SLR-1.5: ADVERSARIAL ATTACKS ON SPOOFING COUNTERMEASURES OF AUTOMATIC SPEAKER VERIFICATION |
| Songxiang Liu, Chinese University of Hong Kong, China; Haibin Wu, Hung-yi Lee, National Taiwan University, Taiwan; Helen Meng, Chinese University of Hong Kong, China |
| SLR-1.6: SPOKEN LANGUAGE IDENTIFICATION USING BIDIRECTIONAL LSTM BASED LID SEQUENTIAL SENONES |
| Muralikrishna H, Pulkit Sapra, Anuksha Jain, Dileep Aroor Dinesh, Indian Institute of Technology, Mandi, India |
| SLR-1.7: TIME-DOMAIN SPEAKER EXTRACTION NETWORK |
| Chenglin Xu, Nanyang Technological University, Singapore; Wei Rao, National University of Singapore, Singapore; Eng Siong Chng, Nanyang Technological University, Singapore; Haizhou Li, National University of Singapore, Singapore |
| SLR-1.8: SHORT UTTERANCE COMPENSATION IN SPEAKER VERIFICATION VIA COSINE-BASED TEACHER-STUDENT LEARNING OF SPEAKER EMBEDDINGS |
| Jee-weon Jung, Hee-Soo Heo, Hye-jin Shim, Ha-Jin Yu, University of Seoul, Korea (South) |
| SLR-1.9: NOVEL ENHANCED TEAGER ENERGY BASED CEPSTRAL COEFFICIENTS FOR REPLAY SPOOF DETECTION |
| Rajul Acharya, Hemant A. Patil, Harsh Kotta, Dhirubhai Ambani Institute of Information and Communication Technology, India |
| SLR-1.10: SYLLABLE-DEPENDENT DISCRIMINATIVE LEARNING FOR SMALL FOOTPRINT TEXT-DEPENDENT SPEAKER VERIFICATION |
| Junyi Peng, Yuexian Zou, Peking University, China; Na Li, Deyi Tuo, Dan Su, Meng Yu, Chunlei Zhang, Dong Yu, Tencent, China |
| SLR-1.11: LATENT SPACE REPRESENTATION FOR MULTI-TARGET SPEAKER DETECTION AND IDENTIFICATION WITH A SPARSE DATASET USING TRIPLET NEURAL NETWORKS |
| Kin Wai Cheuk, Balamurali B T, Gemma Roig, Dorien Herremans, Singapore University of Technology and Design, Singapore |
| SLR-1.12: SELF-ADAPTIVE SOFT VOICE ACTIVITY DETECTION USING DEEP NEURAL NETWORKS FOR ROBUST SPEAKER VERIFICATION |
| Youngmoon Jung, Yeunju Choi, Hoirin Kim, Korea Advanced Institute of Science and Technology (KAIST), Korea (South) |
| SLR-1.13: SPHEREDIAR: AN EFFECTIVE SPEAKER DIARIZATION SYSTEM FOR MEETING DATA |
| Tuomas Kaseva, Aku Rouhe, Mikko Kurimo, Aalto University, Finland |
| SLR-1.14: BAYESIAN ADVERSARIAL LEARNING FOR SPEAKER RECOGNITION |
| Jen-Tzung Chien, Chun-Lin Kuo, National Chiao Tung University, Taiwan |
| SLR-1.15: AN INVESTIGATION OF LSTM-CTC BASED JOINT ACOUSTIC MODEL FOR INDIAN LANGUAGE IDENTIFICATION |
| Tirusha Mandava, Ravi Kumar Vuddagiri, Hari Krishna Vydana, Anil Kumar Vuppala, IIIT Hyderabad, India |
| SLR-1.16: A MULTI PURPOSE AND LARGE SCALE SPEECH CORPUS IN PERSIAN AND ENGLISH FOR SPEAKER AND SPEECH RECOGNITION: THE DEEPMINE DATABASE |
| Hossein Zeinali, Lukáš Burget, Jan Černocký, Brno University of Technology, Czech Republic |
| SLR-1.17: NATIVE LANGUAGE IDENTIFICATION FROM RAW WAVEFORMS USING DEEP CONVOLUTIONAL NEURAL NETWORKS WITH ATTENTIVE POOLING |
| Rutuja Ubale, Vikram Ramanarayanan, Yao Qian, Keelan Evanini, Chee Wee Leong, Chong Min Lee, Educational Testing Service, United States |
| SLR-1.18: SPEAKER VERIFICATION WITH APPLICATION-AWARE BEAMFORMING |
| Ladislav Mošner, Oldřich Plchot, Johan Rohdin, Lukáš Burget, Jan Černocký, Brno University of Technology, Czech Republic |