MMSP-L2: Deep Learning for Multimedia Processing and Analysis II |
Session Type: Lecture |
Time: Friday, 8 May, 11:45 - 13:45 |
Location: On-Demand |
Virtual Session: View on Virtual Platform |
Session Chair: Giuseppe Valenzise, CNRS - CentraleSupelec - Université Paris-Sud |
MMSP-L2.1: A SIAMESE CONTENT-ATTENTIVE GRAPH CONVOLUTIONAL NETWORK FOR PERSONALITY RECOGNITION USING PHYSIOLOGY |
Hao-Chun Yang; National Tsing Hua University |
Chi-Chun Lee; National Tsing Hua University |
MMSP-L2.2: SELF-SUPERVISED LEARNING FOR AUDIO-VISUAL SPEAKER DIARIZATION |
Yifan Ding; University of Central Florida |
Yong Xu; Tencent AI Lab |
Shi-Xiong Zhang; Tencent AI Lab |
Yahuan Cong; Beijing University of Posts and Telecommunications |
Liqiang Wang; University of Central Florida |
MMSP-L2.3: WHAT MAKES THE SOUND?: A DUAL-MODALITY INTERACTING NETWORK FOR AUDIO-VISUAL EVENT LOCALIZATION |
Janani Ramaswamy; Indian Institute of Technology Madras |
MMSP-L2.4: ATTENTIONAL FUSED TEMPORAL TRANSFORMATION NETWORK FOR VIDEO ACTION RECOGNITION |
Ke Yang; National University of Defense Technology |
Zhiyuan Wang; National Innovation Institute of Defense Technology |
Huadong Dai; National Innovation Institute of Defense Technology |
Tianlong Shen; National Innovation Institute of Defense Technology |
Peng Qiao; National University of Defense Technology |
Xin Niu; National University of Defense Technology |
Jie Jiang; National University of Defense Technology |
Dongsheng Li; National University of Defense Technology |
Yong Dou; National University of Defense Technology |
MMSP-L2.5: DEEP PRODUCT QUANTIZATION MODULE FOR EFFICIENT IMAGE RETRIEVAL |
Meihan Liu; Peking University |
Yongxing Dai; Peking University |
Yan Bai; Peking University |
Ling-Yu Duan; Peking University |
MMSP-L2.6: THE OPEN BRANDS DATASET: UNIFIED BRAND DETECTION AND RECOGNITION AT SCALE |
Xuan Jin; Alibaba Group |
Wei Su; Alibaba Group |
Rong Zhang; Alibaba Group |
Yuan He; Alibaba Group |
Hui Xue; Alibaba Group |