MMSP-P8.9

SYNERGISTIC HYBRID ATTENTION NETWORK: AN ENHANCED MULTI-MODAL INTERACTION ARCHITECTURE FOR EFFICIENT VISUAL QUESTION ANSWERING

He Wang, Feng Yan, XinJiang University, China

Session:
MMSP-P8: Visual Question Answering and Multimodal Reasoning Poster

Track:
Multimedia Signal Processing [MM]

Location:
Poster Area 21

Presentation Time:
Wed, 6 May, 09:00 - 11:00

Presentation
Discussion
Resources
No resources available.
Session MMSP-P8
MMSP-P8.1: M3GQA: A MULTIMODAL MULTI-HOP AND KNOWLEDGE GRAPH-BASED FRAMEWORK FOR QUESTION ANSWERING
Shuhao Hu, Xin Wang, Ji Xiang, Xiaobo Guo, Lei Wang, Miaobo Hu, Institute of Information Engineering, Chinese Academy of Sciences, China
MMSP-P8.2: ENHANCING REFERRING EXPRESSION COMPREHENSION WITH PIXEL-WORD CORRELATION AND CROSS-LAYER REGULARIZATION
Shanshan Yang, Ruilin Yao, Wuhan University of Technology, China; Shengwu Xiong, Wuhan College, China; Xiaoxin Mi, Xueping Zhang, Yi Rong, Wuhan University of Technology, China
MMSP-P8.3: RSHR: HIERARCHICAL VISUAL REPRESENTATION AND STATE-SPACE REASONING FOR REMOTE SENSING VISUAL QUESTION ANSWERING
Pengfei Xu, Feng Yan, Yuqing Zhou, Yuli Tao, Yuancheng Liu, Xinjiang University, China
MMSP-P8.4: VISUAL SALIENCY STEERING DISTILLATION FOR MULTIMODAL CHAIN-OF-THOUGHT REASONING
Hao Yang, Jin Wang, XueJie zhang, Yun nan university, China
MMSP-P8.5: FOCUS BEFORE REASONING: A BIDIRECTIONAL SELECTION FRAMEWORK FOR NOISE-MITIGATION IN KNOWLEDGE-BASED VISION QUESTION ANSWERING
Huan Zhao, Zexin Zhou, Guanghui Ye, Hunan University, China
MMSP-P8.6: MIRG-RL: Multi-Image Reasoning and Grounding with Reinforcement Learning
Lihao Zheng, Jiawei Chen, Xintian Shen, Hao Ma, Tao Wei, Li Auto, China
MMSP-P8.7: STRUCTURE-GUIDED GRAPH REFINEMENT NETWORK FOR FACIAL AESTHETIC ASSESSMENT
Kejing Wu, Yihua Chen, Chunyu Wu, Guangxi Normal University, China; Pengsheng Huang, Harbin Institute of Technology, China; Zhenjun Tang, Guangxi Normal University, China
MMSP-P8.8: SEE WHAT YOU NEED: QUERY-AWARE VISUAL INTELLIGENCE THROUGH REASONING-PERCEPTION LOOPS
Zixuan Dong, National University of Defense Technology, China; Baoyun Peng, Academy of Military Sciences, China; Yufei Wang, Lin Liu, Xinxin Dong, Yunlong Cao, Xiaodong Wang, National University of Defense Technology, China
MMSP-P8.9: SYNERGISTIC HYBRID ATTENTION NETWORK: AN ENHANCED MULTI-MODAL INTERACTION ARCHITECTURE FOR EFFICIENT VISUAL QUESTION ANSWERING
He Wang, Feng Yan, XinJiang University, China
MMSP-P8.10: AUTOVQA-G: SELF-IMPROVING AGENTIC FRAMEWORK FOR AUTOMATED VISUAL QUESTION ANSWERING AND GROUNDING ANNOTATION
Rongsheng Hu, Jiangnan University, China; Runwei Guan, Hong Kong University of Science and Technology (Guangzhou), China; Yicheng Di, Jiayu Bao, Yuan Liu, Jiangnan University, China
Contacts