MMSP-P1: Multimodal Signal Processing II |
Session Type: Poster |
Time: Friday, May 17, 08:30 - 10:30 |
Location: Poster Area C, Ground Floor |
Session Chair: Shmuel Peleg, The Hebrew University of Jerusalem
|
MMSP-P1.1: SEEING THROUGH SOUNDS: PREDICTING VISUAL SEMANTIC SEGMENTATION RESULTS FROM MULTICHANNEL AUDIO SIGNALS |
Go Irie; NTT Corporation |
Mirela Ostrek; The University of Zagreb |
Haochen Wang; The University of British Columbia |
Hirokazu Kameoka; NTT Corporation |
Akisato Kimura; NTT Corporation |
Takahito Kawanishi; NTT Corporation |
Kunio Kashino; NTT Corporation |
|
MMSP-P1.2: PERFECT MATCH: IMPROVED CROSS-MODAL EMBEDDINGS FOR AUDIO-VISUAL SYNCHRONISATION |
Soo-Whan Chung; Yonsei University |
Joon Son Chung; Naver Corp. |
Hong-Goo Kang; Yonsei University |
|
MMSP-P1.3: A NEIGHBOR-AWARE APPROACH FOR IMAGE-TEXT MATCHING |
Chunxiao Liu; Institute of Information Engineering, Chinese Academy of Sciences |
Zhendong Mao; Institute of Information Engineering, Chinese Academy of Sciences |
Wenyu Zang; Institute of Information Engineering, Chinese Academy of Sciences |
Bin Wang; Institute of Information Engineering, Chinese Academy of Sciences; Xiaomi AI Lab |
|
MMSP-P1.4: LEARNING AFFECTIVE CORRESPONDENCE BETWEEN MUSIC AND IMAGE |
Gaurav Verma; Adobe Research |
Eeshan Gunesh Dhekane; Mila, Université de Montréal |
Tanaya Guha; University of Warwick |
|
MMSP-P1.5: DYNAMIC TEMPORAL ALIGNMENT OF SPEECH TO LIPS |
Tavi Halperin; The Hebrew University of Jerusalem |
Ariel Ephrat; Google, Inc. |
Shmuel Peleg; The Hebrew University of Jerusalem |
|
MMSP-P1.6: GRAYSCALE-THERMAL TRACKING VIA CANONICAL CORRELATION ANALYSIS BASED INVERSE SPARSE REPRESENTATION |
Wan Ding; College of Communication and Information Engineering, Nanjing University of Posts and Telecommunications |
Bin Kang; College of Internet of Things, Nanjing University of Posts and Telecommunications |
Quan Zhou; College of Communication and Information Engineering, Nanjing University of Posts and Telecommunications |
Min Lin; College of Communication and Information Engineering, Nanjing University of Posts and Telecommunications |
Suofei Zhang; College of Internet of Things, Nanjing University of Posts and Telecommunications |
|
MMSP-P1.7: LEARNING SEMANTIC-PRESERVING SPACE USING USER PROFILE AND MULTIMODAL MEDIA CONTENT FROM POLITICAL SOCIAL NETWORK |
Wei-Hao Chang; National Tsing Hua University |
Jeng-Lin Li; National Tsing Hua University |
Chi-Chun Lee; National Tsing Hua University |
|
MMSP-P1.8: NOISE-TOLERANT AUDIO-VISUAL ONLINE PERSON VERIFICATION USING AN ATTENTION-BASED NEURAL NETWORK FUSION |
Suwon Shon; Massachusetts Institute of Technology |
Tae-Hyun Oh; Massachusetts Institute of Technology |
James Glass; Massachusetts Institute of Technology |
|
MMSP-P1.9: CROSS-CULTURE MULTIMODAL EMOTION RECOGNITION WITH ADVERSARIAL LEARNING |
Jingjun Liang; Renmin University of China |
Shizhe Chen; Renmin University of China |
Jinming Zhao; Renmin University of China |
Qin Jin; Renmin University of China |
Haibo Liu; Tencent |
Li Lu; Tencent |
|
MMSP-P1.10: A DEEP-NARMA FILTER FOR UNUSUAL BEHAVIOR DETECTION FROM VISUAL, THERMAL AND WIRELESS SIGNALS |
Nikolaos Bakalos; National Technical University of Athens |
Athanasios Voulodimos; University of West Attica |
Anastasios Doulamis; National Technical University of Athens |
Nikolaos Doulamis; National Technical University of Athens |
|
MMSP-P1.11: LEARNING DISENTANGLED REPRESENTATION IN LATENT STOCHASTIC MODELS: A CASE STUDY WITH IMAGE CAPTIONING |
Nidhi Vyas; Carnegie Mellon University |
SaiKrishna Rallabandi; Carnegie Mellon University |
Lalitesh Morishetti; Carnegie Mellon University |
Eduard Hovy; Carnegie Mellon University |
Alan W Black; Carnegie Mellon University |
|