ARS-04: Image & Video Interpretation and Understanding |
Interactive Q&A Time: Tuesday, 27 October, 09:00 - 09:25 |
Virtual Session: View on Virtual Platform |
Session Chairs: Alexandros Iosifidis, Aarhus University and Lei Wang, University of Wollongong
|
|
ARS-04.1: ACED: ACCURATE AND EDGE-CONSISTENT MONOCULAR DEPTH ESTIMATION |
Kunal Swami; Samsung Research Institute Bangalore |
Prasanna Vishnu Bondada; Samsung Research Institute Bangalore |
Pankaj Kumar Bajpai; Samsung Research Institute Bangalore |
|
ARS-04.2: ROBUST AUDIO-VISUAL MANDARIN SPEECH RECOGNITION BASED ON ADAPTIVE DECISION FUSION AND TONE FEATURES |
Hong Liu; Peking University |
Zhengyan Chen; Peking University |
Wei Shi; Peking University |
|
ARS-04.3: ACTIVITY NORMALIZATION FOR ACTIVITY DETECTION IN SURVEILLANCE VIDEOS |
Takashi Hosono; NTT |
Kiyohito Sawada; National Police Academy |
Yongqing Sun; NTT |
Kazuya Hayase; NTT |
Jun Shimamura; NTT |
|
ARS-04.4: TEMPORAL ACTION PROPOSAL GENERATION VIA DEEP FEATURE ENHANCEMENT |
He-Yen Hsieh; Academia Sinica |
Ding-Jie Chen; Academia Sinica |
Tyng-Luh Liu; Academia Sinica |
|
ARS-04.5: VIDEO LOGO RETRIEVAL BASED ON LOCAL FEATURES |
Bochen Guan; University of Wisconsin-Madison |
Hanrong Ye; Peking University |
Hong Liu; Peking University |
William A. Sethares; University of Wisconsin-Madison |
|
ARS-04.6: RETRIEVING AND HIGHLIGHTING ACTION WITH SPATIOTEMPORAL REFERENCE |
Seito Kasai; Keio University |
Yuchi Ishikawa; Keio University |
Masaki Hayashi; Keio University |
Yoshimitsu Aoki; Keio University |
Kensho Hara; National Institute of Advanced Industrial Science and Technology (AIST) |
Hirokatsu Kataoka; National Institute of Advanced Industrial Science and Technology (AIST) |
|
ARS-04.7: GPU ACCELERATED POLAR FOURIER ANALYSIS FOR FEATURE EXTRACTION |
Mingkai Tang; Guangdong University of Technology |
Zhuozhang Li; Guangdong University of Technology |
Zhuo Yang; Guangdong University of Technology |
Yinwei Zhan; Guangdong University of Technology |
Jia Su; Capital Normal University |
Wenxin Yu; Southwest University of Science and Technology |
|
ARS-04.8: PRIOR VISUAL RELATIONSHIP REASONING FOR VISUAL QUESTION ANSWERING |
Zhuoqian Yang; Carnegie Mellon University |
Zengchang Qin; Beihang University |
Jing Yu; Chinese Academy of Sciences |
Tao Wan; Beihang University |
|
ARS-04.9: FPHA-AFFORD: A DOMAIN-SPECIFIC BENCHMARK DATASET FOR OCCLUDED OBJECT AFFORDANCE ESTIMATION IN HUMAN-OBJECT-ROBOT INTERACTION |
S. Muzamil Hussain .S; Shanghai Jiao Tong University |
Liu Liu; University of Science and Technology of China |
Wenqiang Xu; Shanghai Jiao Tong University |
Cewu Lu; Shanghai Jiao Tong University |
|
ARS-04.10: ESTIMATION OF IMPRESSION ASSOCIATED WITH PORTRAITS USING FACIAL LANDMARKS AND VISUAL FEATURES |
Mari Miyata; University of Tokyo |
Kiyoharu Aizawa; University of Tokyo |
|
ARS-04.11: TASK-ORIENTED MULTI-MODAL QUESTION ANSWERING FOR COLLABORATIVE APPLICATIONS |
Hui Li Tan; Agency for Science, Technology and Research |
Mei Chee Leong; Agency for Science, Technology and Research |
Qianli Xu; Agency for Science, Technology and Research |
Liyuan Li; Agency for Science, Technology and Research |
Fen Fang; Agency for Science, Technology and Research |
Yi Cheng; Agency for Science, Technology and Research |
Nicolas Gauthier; Agency for Science, Technology and Research |
Ying Sun; Agency for Science, Technology and Research |
Joo Hwee Lim; Agency for Science, Technology and Research |
|
ARS-04.12: DCM: A DENSE-ATTENTION CONTEXT MODULE FOR SEMANTIC SEGMENTATION |
Shenghua Li; Nanjing University of Posts and Telecommunications |
Quan Zhou; Nanjing University of Posts and Telecommunications |
Jia Liu; Nanjing University of Posts and Telecommunications |
Jie Wang; Nanjing University of Posts and Telecommunications |
Yawen Fan; Nanjing University of Posts and Telecommunications |
Xiaofu Wu; Nanjing University of Posts and Telecommunications |
Longin Jan Latecki; Temple University |
|
ARS-04.13: A CONTEXT-BASED NETWORK FOR REFERRING IMAGE SEGMENTATION |
Xinyu Li; Dalian University of Technology |
Yu Liu; Dalian University of Technology |
Kaiping Xu; Dalian University of Technology |
Zhehuan Zhao; Dalian University of Technology |
Sipei Liu; North Information Control Research Academy Group Co., Ltd. |
|
ARS-04.14: DEPTH ESTIMATION FROM SINGLE IMAGE AND SEMANTIC PRIOR |
Praful Hambarde; Computer Vision and Pattern Recognition Lab, IIT Ropar |
Akshay Dudhane; Computer Vision and Pattern Recognition Lab, IIT Ropar |
Prashant Patil; Computer Vision and Pattern Recognition Lab, IIT Ropar |
Subrahmanyam Murala; Computer Vision and Pattern Recognition Lab, IIT Ropar |
Abhinav Dhall; Monash University |
|
ARS-04.15: AN END-TO-END FRAMEWORK FOR POSE ESTIMATION OF OCCLUDED PEDESTRIANS |
Sudip Das; Indian Statistical Institute |
Perla Sai Raj Kishore; Institute of Engineering and Management |
Ujjwal Bhattacharya; Indian Statistical Institute |
|
ARS-04.16: GRAPH MATCHING APPLIED FOR TEXTURED PATTERN RECOGNITION |
Raphaël Abelé; STMicroelectronics |
Jean-Luc Damoiseaux; Laboratoire d’Informatique et Systèmes |
Jean-Marc Boï; Laboratoire d’Informatique et Systèmes |
Daniele Fronte; STMicroeletronics |
Pierre-Yvan Liardet; STMicroeletronics |
Djamal Merad; Laboratoire d’Informatique et Systèmes |
|
ARS-04.17: DIGGING HIERARCHICAL INFORMATION FOR VISUAL PLACE RECOGNITION WITH WEIGHTING SIMILARITY METRIC |
Hong Liu; Peking University |
Qian Zhang; Peking University |
Guoliang Hua; Peking University |
Chenyang Zhao; Peking University |
|
ARS-04.18: VISUAL RELATIONSHIP DETECTION WITH A DEEP CONVOLUTIONAL RELATIONSHIP NETWORK |
Yaopeng Peng; University of Notre Dame |
Danny Z. Chen; University of Notre Dame |
Lanfen Lin; Zhejiang University |
|
ARS-04.19: SPATIAL KEYFRAME EXTRACTION OF MOBILE VIDEOS FOR EFFICIENT OBJECT DETECTION AT THE EDGE |
George Constantinou; University of Southern California |
Cyrus Shahabi; University of Southern California |
Seon Ho Kim; University of Southern California |
|
ARS-04.20: GSANET: SEMANTIC SEGMENTATION WITH GLOBAL AND SELECTIVE ATTENTION |
Qingfeng Liu; Samsung SSI |
Mostafa El-Khamy; Samsung SSI |
Dongwoon Bai; Samsung SSI |
Jungwon Lee; Samsung SSI |
|
ARS-04.21: UNSUPERVISED VISUAL RELATIONSHIP INFERENCE |
Taiga Kashima; University of Tokyo |
Kento Masui; University of Tokyo |
Hideki Nakayama; University of Tokyo |
|
ARS-04.22: VC-VQA: VISUAL CALIBRATION MECHANISM FOR VISUAL QUESTION ANSWERING |
Yanyuan Qiao; Institute of Automation, Chinese Academy of Sciences |
Zheng Yu; Peking University |
Jing Liu; Institute of Automation, Chinese Academy of Sciences |
|