MMSP-P8.1

VISUAL PROMPT TUNING FOR WEAKLY SUPERVISED PHRASE GROUNDING

Pengyue Lin, Zhihan Yu, Mingcong Lu, Fangxiang Feng, Ruifan Li, Xiaojie Wang, Beijing University of Posts and Telecommunications, China

Session:
MMSP-P8: Multimodal Processing Poster

Track:
Multimedia Signal Processing

Location:
Poster Zone 5B
Poster Board PZ-5B.1

Presentation Time:
Fri, 19 Apr, 13:10 - 15:10 (UTC +9)

Session Chair:
Lu Zhang, INSA Rennes, France
Session MMSP-P8
MMSP-P8.1: VISUAL PROMPT TUNING FOR WEAKLY SUPERVISED PHRASE GROUNDING
Pengyue Lin, Zhihan Yu, Mingcong Lu, Fangxiang Feng, Ruifan Li, Xiaojie Wang, Beijing University of Posts and Telecommunications, China
MMSP-P8.2: VK-G2T: VISION AND CONTEXT KNOWLEDGE ENHANCED GLOSS2TEXT
Liqiang Jing, Xuemeng Song, Shandong University, United States of America; Xinxing Zu, Alibaba Group, China; Na Zheng, National University of Singapore, Singapore; Zhongzhou Zhao, Alibaba Group, Singapore; Liqiang Nie, Harbin Institute of Technology (Shenzhen), Singapore
MMSP-P8.3: BEVLOC: END-TO-END 6-DOF LOCALIZATION VIA CROSS-MODALITY CORRELATION UNDER BIRD’S EYE VIEW
Nanjie Chen, Jinping Wang, Sun Yat-sen University, China; Hao Chen, University of Birmingham, United Kingdom of Great Britain and Northern Ireland; Ying Shen, Shuai Wang, Xiaojun Tan, Sun Yat-sen University, China
MMSP-P8.4: SPEECH GUIDED MASKED IMAGE MODELING FOR VISUALLY GROUNDED SPEECH
Jongbhin Woo, Hyeonggon Ryu, Arda Senocak, Joon Son Chung, KAIST, Korea, Republic of
MMSP-P8.5: TOWARDS ROBUST MULTIMODAL PROMPTING WITH MISSING MODALITIES
Jaehyuk Jang, Yooseung Wang, Changick Kim, Korea Advanced Institute of Science and Technology, Korea, Republic of
MMSP-P8.6: MULTIMODAL GRAPH-BASED AUDIO-VISUAL EVENT LOCALIZATION
Zhen Wang, Dongyuan Li, Manabu Okumura, Tokyo Institute of Technology, Japan
MMSP-P8.7: DUAL-COLOR GRANULARITY ALIGNMENT FOR TEXT-BASED PERSON SEARCH
Weichen Zhao, Hengyang Normal University, China; Yuxing Lu, Peking University, China; Ge Jiao, Yuan Yang, Hengyang Normal University, China
MMSP-P8.8: ELECTROENCEPHALOGRAM HELPS FEW-SHOT LEARNING
Xiaoya FAN, Yuntao LIU, Zhong WANG, Dalian University of Technology, China
MMSP-P8.9: FCC-MF: DETECTING VIOLENCE IN AUDIO-VISUAL CONTEXT WITH FRAME-WISE CLUSTER CONTRAST AND MODALITY-STAGE FLOODING
Jiaqing He, Yanzhen Ren, Wuhan University, China; Liming Zhai, Central China Normal University, China; Wuyang Liu, Wuhan University, China
MMSP-P8.10: ETP: LEARNING TRANSFERABLE ECG REPRESENTATIONS VIA ECG-TEXT PRE-TRAINING
Che Liu, Imperial College London, United Kingdom of Great Britain and Northern Ireland; Zhongwei Wan, The Ohio State University, United Kingdom of Great Britain and Northern Ireland; Sibo Cheng, Imperial College London, United Kingdom of Great Britain and Northern Ireland; Mi Zhang, The Ohio State University, United Kingdom of Great Britain and Northern Ireland; Rossella Arcucci, Imperial College London, United Kingdom of Great Britain and Northern Ireland
MMSP-P8.11: CLIP-BASED SYNERGISTIC KNOWLEDGE TRANSFER FOR TEXT-BASED PERSON RETRIEVAL
Yating Liu, Tsinghua University and Peng Cheng Laboratory, China; Yaowei Li, Peking University and Peng Cheng Laboratory, China; Zimo Liu, Peng Cheng Laboratory, China; Wenming Yang, Tsinghua University, China; Yaowei Wang, Peng Cheng Laboratory, China; Qingmin Liao, Tsinghua University, China
MMSP-P8.12: ON UNIQUE LOCALIZATION OF UNCORRELATED CONSTANT-MODULUS SOURCES USING SPARSE LINEAR ARRAYS
Wenlong Wang, Zai Yang, Xunmeng Wu, Xi'an Jiaotong University, China