IVMSP-P5.1
Human Guided Cross-Modal Reasoning with Semantic Attention Learning for Visual Question Answering
Lei Liao, Mao Feng, Meng Yang, Sun Yat-sen University, China
Session:
IVMSP-P5: Vision and language I Poster
Track:
Image, Video, and Multidimensional Signal Processing
Location:
Poster Zone 5C
Poster Board PZ-5C.1
Poster Board PZ-5C.1
Presentation Time:
Tue, 16 Apr, 16:30 - 18:30 (UTC +9)
Session Chair:
Huy Le, FPT Software company Ltd
Session IVMSP-P5
IVMSP-P5.1: Human Guided Cross-Modal Reasoning with Semantic Attention Learning for Visual Question Answering
Lei Liao, Mao Feng, Meng Yang, Sun Yat-sen University, China
IVMSP-P5.2: SEMANTICMAPPER: REGION-SPECIFIC DOMAIN ADAPTATION FOR 3D SHAPES THROUGH LEXICAL DELINEATION
Tianci Xie, University of Edinburgh, China; Siyang Luo, Shanghai Infortech Software Development Co., Ltd, China; Zhenghan Chen, Tencent, China; Xiaoxuan Liang, University of Massachusetts Amherst, United States of America
IVMSP-P5.3: SELF-DISTILLED DYNAMIC FUSION NETWORK FOR LANGUAGE-BASED FASHION RETRIEVAL
Yiming Wu, Hangfei Li, Zhejiang University of Technology, China; Fangfang Wang, Zhejiang Laboratory, China; Yilong Zhang, Ronghua Liang, Zhejiang University of Technology, China
IVMSP-P5.4: Implicit-Knowledge-Guided Align before Understanding for KB-VQA
Mao Feng, Lei Liao, Meng Yang, Sun Yat-sen University, China
IVMSP-P5.5: IMITATING THE HUMAN VISUAL SYSTEM FOR SCANPATH PREDICTING
Mengtang Li, Jie Zhu, Zhixin Huang, Chao Gou, Shenzhen Campus of Sun Yat-sen University, China
IVMSP-P5.6: READ, SPELL AND REPEAT: SCENE TEXT RECOGNITION WITH VISION-LANGUAGE CIRCULAR REFINEMENT
Taiwei Zhang, Zhenghui Hu, Weixin Li, Qingjie Liu, Yunhong Wang, Beihang University, China
IVMSP-P5.7: end-to-end spatially-constrained multi-perspective fine-grained image captioning
Yifan Zhang, Chunzhen Lin, Donglin Cao, Dazhen Lin, Xiamen University, China
IVMSP-P5.8: IMPROVED IMAGE CAPTIONING VIA KNOWLEDGE GRAPH-AUGMENTED MODELS
Sergio Sánchez Santiesteban, Sara Atito, Muhammad Awais, Yi-Zhe Song, Josef Kittler, University of Surrey, United Kingdom of Great Britain and Northern Ireland
IVMSP-P5.9: Think as People: Context-driven Multi-image News Captioning with Adaptive Dual Attention
Qiang Yang, Xiaodong Wu, Xiuying Chen, Xin Gao, Xiangliang Zhang, KAUST, Saudi Arabia
IVMSP-P5.10: MGRL: MUTUAL-GUIDANCE REPRESENTATION LEARNING FOR TEXT-TO-IMAGE PERSON RETRIEVAL
Tianle Lv, Shuang Li, Jiaxu Leng, Xinbo Gao, Chongqing Post and Communications University, China
IVMSP-P5.11: FINE-GRAINED FEATURES ALIGNMENT AND FUSION FOR TEXT-VIDEO CROSS-MODAL RETRIEVAL
Shuili Zhang, Hongzhang Mu, Quangang Li, Institute of Information Engineering, Chinese Academy of Sciences, China; Chenglong Xiao, Department of Computer Science, Shantou University, China; Tingwen Liu, Institute of Information Engineering, Chinese Academy of Sciences, China
IVMSP-P5.12: LABEL CORRECTION FOR SKETCH-BASED 3D SHAPE RETRIEVAL
Shuang Liang, Jiaming Lu, Yiyang Cai, Tongji University, China
Contacts