SLP-L18.4

NOISE-ROBUST ZERO-SHOT TEXT-TO-SPEECH SYNTHESIS CONDITIONED ON SELF-SUPERVISED SPEECH-REPRESENTATION MODEL WITH ADAPTERS

Kenichi Fujita, Hiroshi Sato, Takanori Ashihara, Hiroki Kanagawa, Marc Delcroix, Takafumi Moriya, Yusuke Ijima, Nippon Telegraph and Telephone Corporation, Japan

Session:
SLP-L18: Text to Speech Generation -O2 Lecture

Track:
Speech and Language Processing

Location:
Room 103

Presentation Time:
Thu, 18 Apr, 09:20 - 09:40 (UTC +9)

Session Co-Chairs:
Helen Meng (CUHK) and Zhenhua Ling (USTC)