SLP-P7: Natural Language Processing for speech-to-text
Wed, 17 Apr, 13:10 - 15:10 (UTC +9)
Location: Poster Zone 1A
Session Type: Poster
Session Co-Chairs: Hsin-Min Wang, Institute of Information Science and Shixiong Zhang, Tencent
Track: Speech and Language Processing
Click the to view the manuscript on IEEE Xplore Open Preview
 

SLP-P7.1: Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks

Yichao Du, University of Science and Technology of China, China; Zhirui Zhang, Tencent, China; Linan Yue, University of Science and Technology of China, China; Xu Huang, Nanjing University, China; Yuqing Zhang, Anhui Xinhua University, China; Tong Xu, Linli Xu, Enhong Chen, University of Science and Technology of China, China
 

SLP-P7.2: MULTIPLE REPRESENTATION TRANSFER FROM LARGE LANGUAGE MODELS TO END-TO-END ASR SYSTEMS

Takuma Udagawa, Masayuki Suzuki, Gakuto Kurata, Masayasu Muraoka, George Saon, IBM Research AI, Japan
 

SLP-P7.3: A Chat About Boring Problems: Studying GPT-based text normalization

Yang Zhang, NVIDIA, United States of America; Travis Bartley, City University of New York, Graduate Center, United States of America; Mariana Graterol-Fuenmayor, Vitaly Lavrukhin, Evelina Bakhturina, Boris Ginsburg, NVIDIA, United States of America
 

SLP-P7.5: PROMPTFORMER: PROMPTED CONFORMER TRANSDUCER FOR ASR

Sergio Duarte-Torres, Arunasish Sen, Aman Rana, Lukas Drude, Alejandro Gomez-Alanis, Andreas Schwarz, Leif Rädel, Volker Leutnant, Amazon, Netherlands
 

SLP-P7.6: Leveraging Large Pretrained Models for Line-by-Line Spoken Program Recognition

Sadia Nowrin, Keith Vertanen, Michigan Technological University, United States of America
 

SLP-P7.7: LEVERAGING LARGE LANGUAGE MODELS FOR EXPLOITING ASR UNCERTAINTY

Pranay Dighe, Yi Su, Shangshang Zheng, Yunshu Liu, Vineet Garg, Xiaochuan Niu, Ahmed Tewfik, Apple, United States of America
 

SLP-P7.8: CSNET: CONTRASTIVE SIAMESE NETWORK FOR ROBUST SLU

Hao Yang, min zhang, daimeng wei, jiaxin guo, huawei, China