SLP-P15.5

FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION

Qifei Li, Yingming Gao, Cong Wang, Yayue Deng, Jinlong Xue, Yichen Han, Ya Li, Beijing University of Posts and Telecommunications, China

Session:
SLP-P15: Speech Emotion Recognition and Analysis III Poster

Track:
Speech and Language Processing

Location:
Poster Zone 5A
Poster Board PZ-5A.5

Presentation Time:
Wed, 17 Apr, 16:30 - 18:30 (UTC +9)

Session Co-Chairs:
Douglas O'Shaughnessy, INRS, University of Quebec and Carlos Busso, University of Texas at Dallas
View Manuscript
Presentation
Discussion
Resources
Session SLP-P15
SLP-P15.1: RL-EMO: A REINFORCEMENT LEARNING FRAMEWORK FOR MULTIMODAL EMOTION RECOGNITION
Chengwen Zhang, Yuhao Zhang, Bo Cheng, Beijing University of Posts & Telecommunications, China
SLP-P15.2: ZERO SHOT AUDIO TO AUDIO EMOTION TRANSFER WITH SPEAKER DISENTANGLEMENT
Soumya Dutta, Sriram Ganapathy, Indian Institute of Science Bangalore, India
SLP-P15.3: TRUST-SER: ON THE TRUSTWORTHINESS OF FINE-TUNING PRE-TRAINED SPEECH EMBEDDINGS FOR SPEECH EMOTION RECOGNITION
Tiantian Feng, Rajat Hebbar, Shrikanth Narayanan, University of Southern California, United States of America
SLP-P15.4: STYLECAP: AUTOMATIC SPEAKING-STYLE CAPTIONING FROM SPEECH BASED ON SPEECH AND LANGUAGE SELF-SUPERVISED LEARNING MODELS
Kazuki Yamauchi, The University of Tokyo, Japan; Yusuke Ijima, NTT Corporation, Japan; Yuki Saito, The University of Tokyo, Japan
SLP-P15.5: FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION
Qifei Li, Yingming Gao, Cong Wang, Yayue Deng, Jinlong Xue, Yichen Han, Ya Li, Beijing University of Posts and Telecommunications, China
SLP-P15.6: GRADIENT-BASED DIMENSIONALITY REDUCTION FOR SPEECH EMOTION RECOGNITION USING DEEP NETWORKS
Hongxuan Wang, Prahlad Vadakkepat, National University of Singapore, Singapore
SLP-P15.7: DISENTANGLEMENT NETWORK: DISENTANGLE THE EMOTIONAL FEATURES FROM ACOUSTIC FEATURES FOR SPEECH EMOTION RECOGNITION
Zhichen Yuan, C. L. Philip Chen, Shuzhen Li, Tong Zhang, South China University of Technology, China
SLP-P15.8: Balancing Speaker-Rater Fairness for Gender-Neutral Speech Emotion Recognition
Woan-Shiuan Chien, Shreya G. Upadhyay, Chi-Chun Lee, National Tsing Hua University, Taiwan
SLP-P15.9: PROMPTING AUDIOS USING ACOUSTIC PROPERTIES FOR EMOTION REPRESENTATION
Hira Dhamyal, Carnegie Mellon University, United States of America; Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Microsoft, United States of America; Bhiksha Raj, Rita Singh, Carnegie Mellon University, United States of America
SLP-P15.10: LEARNING AROUSAL-VALENCE REPRESENTATION FROM CATEGORICAL EMOTION LABELS OF SPEECH
Enting Zhou, You Zhang, Zhiyao Duan, University of Rochester, United States of America
SLP-P15.11: A ROBUST PITCH-FUSION MODEL FOR SPEECH EMOTION RECOGNITION IN TONAL LANGUAGES
Viet Thanh Pham, Thi Thu Huyen Ngo, Ngoc Quan Pham, Thi Thu Trang Nguyen, Hanoi University of Science and Technology, Viet Nam
SLP-P15.12: MODELING INTRAPERSONAL AND INTERPERSONAL INFLUENCES FOR AUTOMATIC ESTIMATION OF THERAPIST EMPATHY IN COUNSELING CONVERSATION
Dehua Tao, Tan Lee, Harold Chui, Sarah Luk, The Chinese University of Hong Kong, Hong Kong
Contacts