FP-V1.V6.3

IMAGE FUSION TRANSFORMER

Vibashan VS, Jeya Maria Jose Valanarasu, Poojan Oza, Vishal M. Patel, Johns Hopkins University, United States of America

Session:
Transformers
Virtual Poster

Track:
Applications of Machine Learning

Location:
Gather.Town 6

Presentation Time:
Fri, 7 Oct, 21:00 - 22:00 China Standard Time (UTC +8)
Fri, 7 Oct, 15:00 - 16:00 Central European Time (UTC +2)
Fri, 7 Oct, 13:00 - 14:00 UTC
Fri, 7 Oct, 09:00 - 10:00 Eastern Time (UTC -4)

Session Co-Chairs:
Andrea Cavallaro, Queen Mary University of London; Jean-Christophe Pesquet, CentraleSupélec; Rebecca Willett, University of Chicago
Session FP-V1.V6
FP-V1.V6.1: TRANSFORMER VISUAL TRACKER BASED ON TEMPLATE FEATURES CORRESPONDING TO FOREGROUND REGION
Jianglei Yu, Xin Ma, Shandong University, China
FP-V1.V6.2: ConMW Transformer: A General Vision Transformer Backbone with Merged-Window Attention
Ang Li, Jichao Jiao, Ning Li, Wangjing Qi, Beijing University of Posts and Telecommunications (BUPT), China; Wei Xu, Min Pang, The 22nd Research Institute of CETC, China
FP-V1.V6.3: IMAGE FUSION TRANSFORMER
Vibashan VS, Jeya Maria Jose Valanarasu, Poojan Oza, Vishal M. Patel, Johns Hopkins University, United States of America
FP-V1.V6.4: AVT: AU-ASSISTED VISUAL TRANSFORMER FOR FACIAL EXPRESSION RECOGNITION
Rijin Jin, Sirui Zhao, Yifan Xu, Tong Xu, Enhong Chen, University of Science and Technology of China, China; Zhongkai Hao, Tsinghua University, China
FP-V1.V6.5: NCTR: NEIGHBORHOOD CONSENSUS TRANSFORMER FOR FEATURE MATCHING
Xiaoyong Lu, Songlin Du, Southeast University, China
FP-V1.V6.6: MULTI-VIEW 3D RECONSTRUCTION FROM VIDEO WITH TRANSFORMER
Yijie Zhong, Zhengxing Sun, Yunhan Sun, Shoutong Luo, Yi Wang, Wei Zhang, Nanjing University, China
FP-V1.V6.7: MASK-VIT: AN OBJECT MASK EMBEDDING IN VISION TRANSFORMER FOR FINE-GRAINED VISUAL CLASSIFICATION
Tong Su, Chengqun Song, Jun Cheng, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, China; Shuo Ye, Huazhong University of Science and Technology, China
FP-V1.V6.8: DEPTHFORMER: MULTISCALE VISION TRANSFORMER FOR MONOCULAR DEPTH ESTIMATION WITH GLOBAL LOCAL INFORMATION FUSION
Ashutosh Agarwal, Chetan Arora, Indian Institute of Technology, Delhi, India
FP-V1.V6.9: PASTS: TOWARD EFFECTIVE DISTILLING TRANSFORMER FOR PANORAMIC SEMANTIC SEGMENTATION
Jihyun Kim, Somi Jeong, Kwanghoon Sohn, Yonsei University, Republic of Korea
FP-V1.V6.10: CBPT: A New Backbone for Enhancing Information Transmission of Vision Transformers
Wenxin Yu, Hongru Zhang, Tianxiang Lan, Yucheng Hu, Dong Yin, University of Science and Technology of China, China
FP-V1.V6.11: TRANSFORMER-BASED APPROACH FOR DOCUMENT LAYOUT UNDERSTANDING
Huichen Yang, William Hsu, Kansas State University, United States of America