SLP-P51: Speech Representations & Enhancement
Fri, 8 May, 09:00 - 11:00
Location: Poster Area 31
Session Type: Poster
Track: Speech and Language Processing [SL]

SLP-P51.1: MATE: MATRYOSHKA AUDIO–TEXT EMBEDDINGS FOR OPEN-VOCABULARY KEYWORD SPOTTING

Youngmoon Jung, Myunghun Jung, Joon-Young Yang, Yong-Hyeok Lee, Jaeyoung Roh, Hoon-Young Cho, Samsung Research, Korea, Republic of

SLP-P51.2: DIFFUSION-LINK: DIFFUSION PROBABILISTIC MODEL FOR BRIDGING THE AUDIO-TEXT MODALITY GAP

KiHyun Nam, Jongmin Choi, Hyeongkeun Lee, Korea Advanced Institute of Science and Technology, Korea, Republic of; Jungwoo Heo, University of Seoul, Korea, Republic of; Joon Son Chung, Korea Advanced Institute of Science and Technology, Korea, Republic of

SLP-P51.3: ARTI-6: TOWARDS SIX-DIMENSIONAL ARTICULATORY SPEECH ENCODING

Jihwan Lee, Sean Foley, Thanathai Lertpetchpun, Kevin Huang, Yoonjeong Lee, Tiantian Feng, Louis Goldstein, Dani Byrd, Shrikanth Narayanan, University of Southern California, United States of America

SLP-P51.4: DYNAMIC KALMAN FUSION FOR ROBUST CONTINUOUS SIGN LANGUAGE RECOGNITION

Bofan Liu, Shuxuan Gao, Wuhan University of Technology, China; Yilin Wang, Shanghai Jiao Tong University, China; Yingchao Wei, Wuhan University of Technology, China

SLP-P51.5: DUAL-BRANCH SPIRAL INTERSECT NETWORK FOR MULTIMODAL SENTIMENT ANALYSIS

Hongfei Gao, Xinhua Zhu, Kunhao Ma, School of Computer Science and Engineering, Guangxi Normal University, China

SLP-P51.6: S-PHiNe: PHYSICS-INFORMED MULTICHANNEL SPEECH ENHANCEMENT USING SPECTRO-SPATIAL FUSION FOR LOW-SNR CONDITIONS

Stephen Afrifa, Zhang Tao, Tianjin University, China; Peter Appiahene, University of Energy and Natural Resources, Ghana; Vijayakumar Varadarajan, University of Technology Sydney, Australia; Yanzhang Geng, Tianjin University, China

SLP-P51.7: INTERPRETABLE ALZHEIMER’S DISEASE DETECTION VIA MULTI-SCALE FUSION OF DISENTANGLED SPEECH FEATURES

Ye Chen, Southern University of Science and Technology, China; Xiaokang Liu, Rongfeng Su, Lan Wang, Nan Yan, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China

SLP-P51.8: Layer-Aware Early Fusion of Acoustic and Linguistic Embeddings for Cognitive Status Classification

Krystof Novotny, Brno University of Technology, Czechia; Laureano Moro-Velázquez, Johns Hopkins University, United States of America; Jiri Mekyska, Brno University of Technology, Czechia

SLP-P51.9: BREAKING DATA EFFICIENCY DILEMMA: A FEDERATED AND AUGMENTED LEARNING FRAMEWORK FOR ALZHEIMER’S DISEASE DETECTION VIA SPEECH

Xiao Wei, Bin Wen, Tianjin University, China; Yuqin Lin, Fuzhou University, China; Kai Li, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China; Mingyang Gu, Xiaobao Wang, Longbiao Wang, Tianjin University, China; Jianwu Dang, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China

SLP-P51.10: AGRI-MIX: MUTUAL INFORMATION-GUIDED HIERARCHICAL FUSION FOR AGRICULTURAL DISEASE MULTIMODAL RELATION EXTRACTION

Zihua Song, Bo Kong, Wenkang Zhang, Liruizhi Jia, Jing Huang, Xinjiang University, China; Huiqing Wang, Plant Protection and Quarantine Station of Xinjiang Uygur Autonomous Region, China; Shaochen Jiang, Yuan Liu, Shengquan Liu, Xinjiang University, China