ARS-04: Image & Video Interpretation and Understanding |
| Interactive Q&A Time: Tuesday, 27 October, 09:00 - 09:25 |
| Virtual Session: View on Virtual Platform |
| Session Chairs: Alexandros Iosifidis, Aarhus University and Lei Wang, University of Wollongong |
| ARS-04.1: ACED: ACCURATE AND EDGE-CONSISTENT MONOCULAR DEPTH ESTIMATION |
| Kunal Swami; Samsung Research Institute Bangalore |
| Prasanna Vishnu Bondada; Samsung Research Institute Bangalore |
| Pankaj Kumar Bajpai; Samsung Research Institute Bangalore |
| ARS-04.2: ROBUST AUDIO-VISUAL MANDARIN SPEECH RECOGNITION BASED ON ADAPTIVE DECISION FUSION AND TONE FEATURES |
| Hong Liu; Peking University |
| Zhengyan Chen; Peking University |
| Wei Shi; Peking University |
| ARS-04.3: ACTIVITY NORMALIZATION FOR ACTIVITY DETECTION IN SURVEILLANCE VIDEOS |
| Takashi Hosono; NTT |
| Kiyohito Sawada; National Police Academy |
| Yongqing Sun; NTT |
| Kazuya Hayase; NTT |
| Jun Shimamura; NTT |
| ARS-04.4: TEMPORAL ACTION PROPOSAL GENERATION VIA DEEP FEATURE ENHANCEMENT |
| He-Yen Hsieh; Academia Sinica |
| Ding-Jie Chen; Academia Sinica |
| Tyng-Luh Liu; Academia Sinica |
| ARS-04.5: VIDEO LOGO RETRIEVAL BASED ON LOCAL FEATURES |
| Bochen Guan; University of Wisconsin-Madison |
| Hanrong Ye; Peking University |
| Hong Liu; Peking University |
| William A. Sethares; University of Wisconsin-Madison |
| ARS-04.6: RETRIEVING AND HIGHLIGHTING ACTION WITH SPATIOTEMPORAL REFERENCE |
| Seito Kasai; Keio University |
| Yuchi Ishikawa; Keio University |
| Masaki Hayashi; Keio University |
| Yoshimitsu Aoki; Keio University |
| Kensho Hara; National Institute of Advanced Industrial Science and Technology (AIST) |
| Hirokatsu Kataoka; National Institute of Advanced Industrial Science and Technology (AIST) |
| ARS-04.7: GPU ACCELERATED POLAR FOURIER ANALYSIS FOR FEATURE EXTRACTION |
| Mingkai Tang; Guangdong University of Technology |
| Zhuozhang Li; Guangdong University of Technology |
| Zhuo Yang; Guangdong University of Technology |
| Yinwei Zhan; Guangdong University of Technology |
| Jia Su; Capital Normal University |
| Wenxin Yu; Southwest University of Science and Technology |
| ARS-04.8: PRIOR VISUAL RELATIONSHIP REASONING FOR VISUAL QUESTION ANSWERING |
| Zhuoqian Yang; Carnegie Mellon University |
| Zengchang Qin; Beihang University |
| Jing Yu; Chinese Academy of Sciences |
| Tao Wan; Beihang University |
| ARS-04.9: FPHA-AFFORD: A DOMAIN-SPECIFIC BENCHMARK DATASET FOR OCCLUDED OBJECT AFFORDANCE ESTIMATION IN HUMAN-OBJECT-ROBOT INTERACTION |
| S. Muzamil Hussain .S; Shanghai Jiao Tong University |
| Liu Liu; University of Science and Technology of China |
| Wenqiang Xu; Shanghai Jiao Tong University |
| Cewu Lu; Shanghai Jiao Tong University |
| ARS-04.10: ESTIMATION OF IMPRESSION ASSOCIATED WITH PORTRAITS USING FACIAL LANDMARKS AND VISUAL FEATURES |
| Mari Miyata; University of Tokyo |
| Kiyoharu Aizawa; University of Tokyo |
| ARS-04.11: TASK-ORIENTED MULTI-MODAL QUESTION ANSWERING FOR COLLABORATIVE APPLICATIONS |
| Hui Li Tan; Agency for Science, Technology and Research |
| Mei Chee Leong; Agency for Science, Technology and Research |
| Qianli Xu; Agency for Science, Technology and Research |
| Liyuan Li; Agency for Science, Technology and Research |
| Fen Fang; Agency for Science, Technology and Research |
| Yi Cheng; Agency for Science, Technology and Research |
| Nicolas Gauthier; Agency for Science, Technology and Research |
| Ying Sun; Agency for Science, Technology and Research |
| Joo Hwee Lim; Agency for Science, Technology and Research |
| ARS-04.12: DCM: A DENSE-ATTENTION CONTEXT MODULE FOR SEMANTIC SEGMENTATION |
| Shenghua Li; Nanjing University of Posts and Telecommunications |
| Quan Zhou; Nanjing University of Posts and Telecommunications |
| Jia Liu; Nanjing University of Posts and Telecommunications |
| Jie Wang; Nanjing University of Posts and Telecommunications |
| Yawen Fan; Nanjing University of Posts and Telecommunications |
| Xiaofu Wu; Nanjing University of Posts and Telecommunications |
| Longin Jan Latecki; Temple University |
| ARS-04.13: A CONTEXT-BASED NETWORK FOR REFERRING IMAGE SEGMENTATION |
| Xinyu Li; Dalian University of Technology |
| Yu Liu; Dalian University of Technology |
| Kaiping Xu; Dalian University of Technology |
| Zhehuan Zhao; Dalian University of Technology |
| Sipei Liu; North Information Control Research Academy Group Co., Ltd. |
| ARS-04.14: DEPTH ESTIMATION FROM SINGLE IMAGE AND SEMANTIC PRIOR |
| Praful Hambarde; Computer Vision and Pattern Recognition Lab, IIT Ropar |
| Akshay Dudhane; Computer Vision and Pattern Recognition Lab, IIT Ropar |
| Prashant Patil; Computer Vision and Pattern Recognition Lab, IIT Ropar |
| Subrahmanyam Murala; Computer Vision and Pattern Recognition Lab, IIT Ropar |
| Abhinav Dhall; Monash University |
| ARS-04.15: AN END-TO-END FRAMEWORK FOR POSE ESTIMATION OF OCCLUDED PEDESTRIANS |
| Sudip Das; Indian Statistical Institute |
| Perla Sai Raj Kishore; Institute of Engineering and Management |
| Ujjwal Bhattacharya; Indian Statistical Institute |
| ARS-04.16: GRAPH MATCHING APPLIED FOR TEXTURED PATTERN RECOGNITION |
| Raphaël Abelé; STMicroelectronics |
| Jean-Luc Damoiseaux; Laboratoire d’Informatique et Systèmes |
| Jean-Marc Boï; Laboratoire d’Informatique et Systèmes |
| Daniele Fronte; STMicroeletronics |
| Pierre-Yvan Liardet; STMicroeletronics |
| Djamal Merad; Laboratoire d’Informatique et Systèmes |
| ARS-04.17: DIGGING HIERARCHICAL INFORMATION FOR VISUAL PLACE RECOGNITION WITH WEIGHTING SIMILARITY METRIC |
| Hong Liu; Peking University |
| Qian Zhang; Peking University |
| Guoliang Hua; Peking University |
| Chenyang Zhao; Peking University |
| ARS-04.18: VISUAL RELATIONSHIP DETECTION WITH A DEEP CONVOLUTIONAL RELATIONSHIP NETWORK |
| Yaopeng Peng; University of Notre Dame |
| Danny Z. Chen; University of Notre Dame |
| Lanfen Lin; Zhejiang University |
| ARS-04.19: SPATIAL KEYFRAME EXTRACTION OF MOBILE VIDEOS FOR EFFICIENT OBJECT DETECTION AT THE EDGE |
| George Constantinou; University of Southern California |
| Cyrus Shahabi; University of Southern California |
| Seon Ho Kim; University of Southern California |
| ARS-04.20: GSANET: SEMANTIC SEGMENTATION WITH GLOBAL AND SELECTIVE ATTENTION |
| Qingfeng Liu; Samsung SSI |
| Mostafa El-Khamy; Samsung SSI |
| Dongwoon Bai; Samsung SSI |
| Jungwon Lee; Samsung SSI |
| ARS-04.21: UNSUPERVISED VISUAL RELATIONSHIP INFERENCE |
| Taiga Kashima; University of Tokyo |
| Kento Masui; University of Tokyo |
| Hideki Nakayama; University of Tokyo |
| ARS-04.22: VC-VQA: VISUAL CALIBRATION MECHANISM FOR VISUAL QUESTION ANSWERING |
| Yanyuan Qiao; Institute of Automation, Chinese Academy of Sciences |
| Zheng Yu; Peking University |
| Jing Liu; Institute of Automation, Chinese Academy of Sciences |