OD-SLA-4: Emotion, Paralinguistic, and Speaker Analysis
Thu, 16 Dec, 14:00 - 16:00 Japan Standard Time (UTC +9)
Thu, 16 Dec, 05:00 - 07:00 Coordinated Universal Time
Thu, 16 Dec, 00:00 - 02:00 Eastern Standard Time (UTC -5)
Wed, 15 Dec, 21:00 - 23:00 Pacific Standard Time (UTC -8)
Session Chair: Sayaka Shiota, Tokyo Metropolitan University
Track: Speech, Language, and Audio (SLA)

OD-SLA-4.2: Hierarchical Prosody Analysis Improves Categorical and Dimensional Emotion Recognition

Xingfeng Li, Xinhui Hu, Xinkang Xu, Hithink RoyalFlush AI Research Institute, China; Taiyang Guo, Masato Akagi, Japan Advanced Institute of Science and Technology, Japan; Jianwu Dang, Japan Advanced Institute of Science and Technology, Japan, and Tianjin University, China

OD-SLA-4.3: A Study of Salient Modulation Domain Features for Speaker Identification

Simon McKnight, Aidan Hogg, Vincent Neo, Patrick Naylor, Imperial College London, United Kingdom of Great Britain and Northern Ireland

OD-SLA-4.4: A Study on Decoupled Probabilistic Linear Discriminant Analysis

Di Wang, Hongzhi Yu, Key Laboratory of China’s Ethnic Languages and Information Technology of Ministry of Education, Northwest Minzu University, China; Lantian Li, Dong Wang, Center for Speech and Language Technologies, BNRist, Tsinghua University, China

OD-SLA-4.5: Generation of Speaker Representations Using Heterogeneous Training Batch Assembly

Yu-Huai Peng, Hung-Shin Lee, Pin-Tuan Huang, Hsin-Min Wang, Academia Sinica, Taiwan

OD-SLA-4.6: Speech Emotion Recognition with Fusion of Acoustic- and Linguistic-Feature-Based Decisions

Ryotaro Nagase, Takahiro Fukumori, Yoichi Yamashita, Ritsumeikan University, Japan

OD-SLA-4.7: Automatic Naturalness Recognition from Acted Speech Using Neural Networks

Bagus Tris Atmaja, Akira Sasou, National Institute of Advanced Industrial Science and Technology, Japan; Masato Akagi, Japan Advanced Institute of Science and Technology, Japan

OD-SLA-4.8: Comparative Study of Filter Banks to Improve the Performance of Voice Disorder Assessment Systems Using LTAS Features

Purva Barche, Krishna Gurugubelli, Anil Kumar Vuppala, International Institute of Information Technology, India

OD-SLA-4.9: Dual Dropout Ranking of Linguistic Features for Alzheimer’s Disease Recognition

Xiaoquan Ke, Man-Wai Mak, The Hong Kong Polytechnic University, Hong Kong; Jinchao Li, Helen M. Meng, The Chinese University of Hong Kong, Hong Kong

OD-SLA-4.10: A Multilingual Framework Based on Pretraining Model for Speech Emotion Recognition

Zhaohang Zhang, Beihang University, Beijing, China; Xiaohui Zhang, Beijing Jiaotong University, Beijing, China; Min Guo, Wei-Qiang Zhang, Tsinghua University, Beijing, China; Ke Li, Yukai Huang, Beijing Haitian Ruisheng Science Technology Ltd., Beijing 100083, China

OD-SLA-4.12: Detecting Multiple Disfluencies from Speech Using Pre-Linguistic Automatic Syllabification with Acoustic and Prosody Features

Utkarsh Mehrotra, Sparsh Garg, Krishna Gurugubelli, Anil Kumar Vuppala, IIIT Hyderabad, India

OD-SLA-4.13: Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification

Xugang Lu, Peng Shen, Hisashi Kawai, National Institute of Information and Communications Technology, Japan; Yu Tsao, Research Center for Information Technology Innovation, Taiwan

OD-SLA-4.14: Deep Convolutional Neural Network for Voice Liveness Detection

Siddhant Gupta, Kuldeep Khoria, Ankur T. Patil, Hemant A. Patil, Dhirubhai Ambani Institute of Information and Communication Technology, India

OD-SLA-4.15: How Speech Is Recognized to Be Emotional - A Study Based on Information Decomposition

Haoran Sun, Lantian Li, Thomas Fang Zheng, Dong Wang, Tsinghua University, China

OD-SLA-4.16: End-to-End Speaker Age and Height Estimation Using Attention Mechanism and Triplet Loss

Manav Kaushik, Birla Institute of Technology and Science Pilani, India; Van Tung Pham, Tran The Anh, Eng Siong Chng, Nanyang Technological University, Singapore