TP-V1.V5: Image & Video Interpretation and Understanding
Tue, 4 Oct, 21:00 - 22:00 China Standard Time (UTC +8)
Tue, 4 Oct, 15:00 - 16:00 Central European Time (UTC +1)
Tue, 4 Oct, 13:00 - 14:00 UTC
Tue, 4 Oct, 09:00 - 10:00 Eastern Time (UTC -5)
Virtual Poster
Location: Gather.Town 5
Session Co-Chairs: Jean-Christophe Pesquet, CentraleSupélec and Andrea Cavallaro, Queen Mary University of London and Rebecca Willett, University of Chicago
Track: Image and Video Analysis, Synthesis, and Retrieval

TP-V1.V5.1: SPATIAL-SEMANTIC ATTENTION FOR GROUNDED IMAGE CAPTIONING

Wenzhe Hu, Lanxiao Wang, Linfeng Xu, University of Electronic Science and Technology of China, China

TP-V1.V5.3: Back To Old Constraints to Jointly Supervise Learning Depth, Camera Motion and Optical Flow in a Monocular Video

Hicham Sekkati, Jean-Francois Lapointe, National Research Council Canada (NRC Canada), Canada

TP-V1.V5.4: STRUCTURED DROPCONNECT FOR UNCERTAINTY INFERENCE IN IMAGE CLASSIFICATION

Wenqing Zheng, Jiyang Xie, Zhanyu Ma, Beijing University of Posts and Telecommunications, China; Xian Sun, Aerospace Information Research Institute, Chinese Academy of Sciences, China

TP-V1.V5.5: QUES-TO-VISUAL GUIDED VISUAL QUESTION ANSWERING

Xiangyu Wu, Jianfeng Lu, Zhuanfeng Li, Fengchao Xiong, Nanjing University of Science and Technology, China

TP-V1.V5.6: OPEN-WORLD OBJECT DETECTION VIA DISCRIMINATIVE CLASS PROTOTYPE LEARNING

Jinan Yu, Liyan Ma, Zhenglin Li, Yan Peng, Shaorong Xie, Shanghai University, China

TP-V1.V5.7: VISUAL SENTIMENT PREDICTION USING CROSS-WAY FEW-SHOT LEARNING BASED ON KNOWLEDGE DISTILLATION

Yingrui Ye, Yuya Moroto, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama, Hokkaido University, Japan

TP-V1.V5.8: DPNET: DUAL-PATH NETWORK FOR EFFICIENT OBJECT DETECTION WITH LIGHTWEIGHT SELF-ATTENTION

Huimin Shi, Quan Zhou, Yinghao Ni, Xiaofu Wu, Nanjing University of Posts and Telecommunications, China; Longin Jan Latecki, Temple University, United States of America

TP-V1.V5.9: SUPERPIXEL GROUP-CORRELATION NETWORK FOR CO-SALIENCY DETECTION

Minhyeok Lee, Chaewon Park, Suhwan Cho, Sangyoun Lee, Yonsei University, Korea, Republic of

TP-V1.V5.10: SALIENCY DETECTION VIA GLOBAL CONTEXT ENHANCED FEATURE FUSION AND EDGE WEIGHTED LOSS

Chaewon Park, Minhyeok Lee, MyeongAh Cho, Sangyoun Lee, Yonsei University, Korea, Republic of

TP-V1.V5.11: REAL-WORLD VIDEO ANOMALY DETECTION BY EXTRACTING SALIENT FEATURES

Yudai Watanabe, Makoto Okabe, Shizuoka University, Japan; Yasunori Harada, Naoji Kashima, Chubu Electric Power Co., Inc., Japan

TP-V1.V5.12: EMCENET: EFFICIENT MULTI-SCALE CONTEXT EXPLORATION NETWORK FOR SALIENT OBJECT DETECTION

Yanguang Sun, College of Computer Science and Engineering, Anhui University of Science and Technology, China; Chenxing Xia, College of Computer Science and Engineering, Anhui University of Science and Technology; Institute of Energy, Hefei Comprehensive National Science Center., China; Xiuju Gao, Bin Ge, College of Computer Science and Engineering, Anhui University of Science and Technology., China; Hanling Zhang, School of Design, Hunan University, China; Kuan-Ching Li, Department of Computer Science and Information Engineering, Providence University, Taiwan

TP-V1.V5.13: MULTI-MODALITY DIVERSITY FUSION NETWORK WITH SWINTRANSFORMER FOR RGB-D SALIENT OBJECT DETECTION

Songsong Duan, Chenxing Xia, Xiuju Gao, Bin Ge, Anhui University of Science and Technology, China; Hanling Zhang, Hunan University, China; Kuan-Ching Li, Providence University, Taiwan

TP-V1.V5.14: MDNET: MOTION DISTINCTION NETWORK FOR EFFECTIVE ACTION RECOGNITION

Rongrong Jin, Weirong Ye, Xiao Wang, Yan Yan, Hanzi Wang, Xiamen University, China

TP-V1.V5.15: HIGHER-ORDER RECURRENT NETWORK WITH SPACE-TIME ATTENTION FOR VIDEO EARLY ACTION RECOGNITION

Tsung-Ming Tai, Oswald Lanz, Free University of Bozen-Bolzano, Italy; Giuseppe Fiameni, Cheng-Kuang Lee, NVIDIA AI Technology Center, Italy