SLP-L12.3

EMOCONV-DIFF: DIFFUSION-BASED SPEECH EMOTION CONVERSION FOR NON-PARALLEL AND IN-THE-WILD DATA

Navin Raj Prabhu, Bunlong Lay, Simon Welker, Nale Lehmann-Willenbrock, Timo Gerkmann, Universität Hamburg, Germany

Session:
SLP-L12: Speech Emotion Recognition and Analysis II Lecture

Track:
Speech and Language Processing

Location:
Room 103

Presentation Time:
Wed, 17 Apr, 13:50 - 14:10 (UTC +9)

Session Co-Chairs:
Shrikanth Narayanan, University of Southern California, US and Björn Schuller, Technische Universität München (TUM)
Session SLP-L12
SLP-L12.1: FOUNDATION MODEL ASSISTED AUTOMATIC SPEECH EMOTION RECOGNITION: TRANSCRIBING, ANNOTATING, AND AUGMENTING
Tiantian Feng, Shrikanth Narayanan, University of Southern California, United States of America
SLP-L12.2: CLAP4EMO: CHATGPT-ASSISTED SPEECH EMOTION RETRIEVAL WITH NATURAL LANGUAGE SUPERVISION
Wei-Cheng Lin, Shabnam Ghaffarzadegan, Luca Bondi, Abinaya Kumar, Samarjit Das, Ho-Hsiang Wu, Bosch Research, United States of America
SLP-L12.3: EMOCONV-DIFF: DIFFUSION-BASED SPEECH EMOTION CONVERSION FOR NON-PARALLEL AND IN-THE-WILD DATA
Navin Raj Prabhu, Bunlong Lay, Simon Welker, Nale Lehmann-Willenbrock, Timo Gerkmann, Universität Hamburg, Germany
SLP-L12.4: LARGE LANGUAGE MODEL-BASED EMOTIONAL SPEECH ANNOTATION USING CONTEXT AND ACOUSTIC FEATURE FOR SPEECH EMOTION RECOGNITION
Jennifer Santoso, Kenkichi Ishizuka, Taiichi Hashimoto, RevComm, Inc., Japan
SLP-L12.5: LEVERAGING SPEECH PTM, TEXT LLM, AND EMOTIONAL TTS FOR SPEECH EMOTION RECOGNITION
Ziyang Ma, Shanghai Jiao Tong University, China; Wen Wu, University of Cambridge, United Kingdom of Great Britain and Northern Ireland; Zhisheng Zheng, Yiwei Guo, Shanghai Jiao Tong University, China; Qian Chen, Shiliang Zhang, Alibaba Group, China; Xie Chen, Shanghai Jiao Tong University, China
SLP-L12.6: CUSTOMISING GENERAL LARGE LANGUAGE MODELS FOR SPECIALISED EMOTION RECOGNITION TASKS
Liyizhe Peng, Zixing Zhang, Tao Pang, Hunan University, China; Jing Han, University of Cambridge, United Kingdom of Great Britain and Northern Ireland; Huan Zhao, Hao Chen, Hunan University, China; Björn W. Schuller, Imperial College London, United Kingdom of Great Britain and Northern Ireland