MLSP-P26.11

VISUAL-LINGUISTIC REPRESENTATION LEARNING WITH DEEP CROSS-MODALITY FUSION FOR REFERRING MULTI-OBJECT TRACKING

Wenyan He, Yajun Jian, Yang Lu, Hanzi Wang, Xiamen University, China

Session:
MLSP-P26: Learning from Multimodal Data III Poster

Track:
Machine Learning for Signal Processing

Location:
Poster Zone 3B
Poster Board PZ-3B.11

Presentation Time:
Thu, 18 Apr, 13:10 - 15:10 (UTC +9)

Session Chair:
Yuanbo Hou, University of Ghent
View Manuscript
Presentation
Discussion
Resources
Session MLSP-P26
MLSP-P26.1: Higher Order Multiple Graph Filtering for Structured Graph Learning
Liang Du, Xiaodong Li, Shanxi university, China; Yan Chen, Sichuan university, China; Gui Yang, Shanxi university, China; Mian Ilyas Ahmad, National University of Sciences and Technology, Pakistan; Peng Zhou, Anhui university, China
MLSP-P26.2: Multi-grained Multimodal Interaction Network for Sentiment Analysis
Lingyong Fang, Gongshen Liu, Shanghai Jiao Tong University, China; Ru Zhang, Beijing University of Posts and Telecommunications, China
MLSP-P26.3: Multimodal Transformer Distillation for Audio-Visual Synchronization
Xuanjun Chen, Haibin Wu, Chung-Che Wang, Hung-yi Lee, Jyh-Shing Roger Jang, National Taiwan University, Taiwan
MLSP-P26.4: One-Step Late Fusion Multi-view Clustering with Compressed Subspace
Qiyuan Ou, Pei Zhang, Sihang Zhou, En Zhu, National University of Defense Technology, China
MLSP-P26.5: SELF-MOTION AS SUPERVISION FOR EGOCENTRIC AUDIOVISUAL LOCALIZATION
Calvin Murdock, Ishwarya Ananthabhotla, Hao Lu, Vamsi Krishna Ithapu, Reality Labs Research at Meta, United States of America
MLSP-P26.6: ADAPTIVE IMAGE-ENHANCED KNOWLEDGE GRAPH COMPLETION
Meng Gao, Wei Chen, Tengjiao Wang, Peking University, China; Dawei Lu, State Grid Information & Telecommunication Group Co., Ltd., China; Jiabin Zheng, Peking University, China
MLSP-P26.7: DUAL-MIX FOR CROSS-MODAL RETRIEVAL WITH NOISY LABELS
Feng Ding, Xiu Liu, Xinyi Wang, Fangming Zhong, Dalian University of Technology, China
MLSP-P26.8: FUSING MULTI-LEVEL FEATURES FROM AUDIO AND CONTEXTUAL SENTENCE EMBEDDING FROM TEXT FOR INTERVIEW-BASED DEPRESSION DETECTION
Junqi Xue, Ruihan Qin, Xinxu Zhou, Honghai Liu, Min Zhang, Zhiguo Zhang, Harbin Institute of Technology, Shenzhen, China, China
MLSP-P26.9: SYNONYM REPLACEMENT AND GENERATION ENHANCEMENT FOR DOCUMENT AUGMENTATION
Jianwei Sun, The 15th Research Institute of China Electronics Technology Group, China, China; Yang An, School of Software, Shandong University, China, China; Xinyu Jiang, Qian Li, Yulong Liu, The 15th Research Institute of China Electronics Technology Group, China, China; Yongshun Gong, School of Software, Shandong University, China, China
MLSP-P26.10: MACCN:MULTI-MODAL ADAPTIVE CO-ATTENTION FUSION CONTRASTIVE LEARNING NETWORKS FOR FAKE NEWS DETECTION
Zepu Yi, Songfeng Lu, Xueming Tang, Junjun Wu, Jianxin Zhu, Huazhong University of Science and Technology, China
MLSP-P26.11: VISUAL-LINGUISTIC REPRESENTATION LEARNING WITH DEEP CROSS-MODALITY FUSION FOR REFERRING MULTI-OBJECT TRACKING
Wenyan He, Yajun Jian, Yang Lu, Hanzi Wang, Xiamen University, China
MLSP-P26.12: Revisiting Deep Generalized Canonical Correlation Analysis
Paris Karakasis, Nicholas Sidiropoulos, University of Virginia, United States of America
Contacts