GC-L4: LIMMITS'24: Multi-speaker, Multi-lingual Indic TTS with voice cloning
Wed, 17 Apr, 13:10 - 15:10 (UTC +9)
Location: Room 209B
Session Type: Lecture
Session Co-Chairs: Sathvik Udupa, Indian Institute of Science (IISc) Bangalore, India and Saurabh Kumar, Indian Institute of Science (IISc) Bangalore, India
Track: Grand Challenges
Click the to view the manuscript on IEEE Xplore Open Preview
Wed, 17 Apr, 13:10 - 13:30 (UTC +9)

GC-L4.1: LIMMITS’24: MULTI-SPEAKER, MULTI-LINGUAL INDIC TTS WITH VOICE CLONING

Abhayjeet Singh, Amala Nagireddi, Deekshitha G, Jesuraja Bandekar, Roopa R, Sandhya Badiger, Sathvik Udupa, Prasanta Kumar Ghosh, Indian Institute of Science, India; Hema A Murthy, Indian Institute of Technology, Madras, India; Pranaw Kumar, Centre for Development of Advanced Computing, India; Keiichi Tokuda, Nagoya Institute of Technology, Japan, India; Mark Hasegawa-Johnson, University of Illinois, India; Philipp Olbrich, Deutsche Gesellschaft f ̈ur Internationale Zusammenarbeit (GIZ), India
Wed, 17 Apr, 13:30 - 13:50 (UTC +9)

GC-L4.2: LEVERAGING EFFECTIVE LANGUAGE AND SPEAKER CONDITIONING IN INDIC TTS FOR LIMMITS 2024 CHALLENGE

Yejin Jeon, Youngjae Kim, Gary Geunbae Lee, POSTECH, Korea, Republic of
Wed, 17 Apr, 13:50 - 14:10 (UTC +9)

GC-L4.3: SINGLE-STAGE TTS WITH ADAPTED VOCODER AND CROSS-ATTENTION: TALTECH SYSTEMS FOR THE LIMMITS’24 CHALLENGE

Daniil Rõbnikov, Tanel Alumäe, Tallinn University of Technology, Estonia
Wed, 17 Apr, 14:10 - 14:30 (UTC +9)

GC-L4.4: SCALING NVIDIA's MULTI-SPEAKER MULTI-LINGUAL TTS SYSTEMS WITH ZERO-SHOT TTS TO INDIC LANGUAGES

Akshit Arora, Rohan Badlani, Sungwon Kim, Rafael Valle, Bryan Catanzaro, NVIDIA, United States of America
Wed, 17 Apr, 14:30 - 14:50 (UTC +9)

GC-L4.5: THE THU-HCSI MULTI-SPEAKER MULTI-LINGUAL FEW-SHOT VOICE CLONING SYSTEM FOR LIMMITS’24 CHALLENGE

Yixuan Zhou, Shuoyi Zhou, Shun Lei, Zhiyong Wu, Tsinghua University, China; Menglin Wu, ByteDance, China
Wed, 17 Apr, 14:50 - 15:10 (UTC +9)

GC-L4.6: Cross-lingual Text-to-Speech via Hierarchical Style Transfer

Sang-Hoon Lee, Ha-Yeong Choi, Seong-Whan Lee, Korea University, Korea, Republic of