MMSP-P1: Multimodal Signal Processing II |
| Session Type: Poster |
| Time: Friday, May 17, 08:30 - 10:30 |
| Location: Poster Area C, Ground Floor |
| Session Chair: Shmuel Peleg, The Hebrew University of Jerusalem
|
| |
| MMSP-P1.1: SEEING THROUGH SOUNDS: PREDICTING VISUAL SEMANTIC SEGMENTATION RESULTS FROM MULTICHANNEL AUDIO SIGNALS |
| Go Irie; NTT Corporation |
| Mirela Ostrek; The University of Zagreb |
| Haochen Wang; The University of British Columbia |
| Hirokazu Kameoka; NTT Corporation |
| Akisato Kimura; NTT Corporation |
| Takahito Kawanishi; NTT Corporation |
| Kunio Kashino; NTT Corporation |
| |
| MMSP-P1.2: PERFECT MATCH: IMPROVED CROSS-MODAL EMBEDDINGS FOR AUDIO-VISUAL SYNCHRONISATION |
| Soo-Whan Chung; Yonsei University |
| Joon Son Chung; Naver Corp. |
| Hong-Goo Kang; Yonsei University |
| |
| MMSP-P1.3: A NEIGHBOR-AWARE APPROACH FOR IMAGE-TEXT MATCHING |
| Chunxiao Liu; Institute of Information Engineering, Chinese Academy of Sciences |
| Zhendong Mao; Institute of Information Engineering, Chinese Academy of Sciences |
| Wenyu Zang; Institute of Information Engineering, Chinese Academy of Sciences |
| Bin Wang; Institute of Information Engineering, Chinese Academy of Sciences; Xiaomi AI Lab |
| |
| MMSP-P1.4: LEARNING AFFECTIVE CORRESPONDENCE BETWEEN MUSIC AND IMAGE |
| Gaurav Verma; Adobe Research |
| Eeshan Gunesh Dhekane; Mila, Université de Montréal |
| Tanaya Guha; University of Warwick |
| |
| MMSP-P1.5: DYNAMIC TEMPORAL ALIGNMENT OF SPEECH TO LIPS |
| Tavi Halperin; The Hebrew University of Jerusalem |
| Ariel Ephrat; Google, Inc. |
| Shmuel Peleg; The Hebrew University of Jerusalem |
| |
| MMSP-P1.6: GRAYSCALE-THERMAL TRACKING VIA CANONICAL CORRELATION ANALYSIS BASED INVERSE SPARSE REPRESENTATION |
| Wan Ding; College of Communication and Information Engineering, Nanjing University of Posts and Telecommunications |
| Bin Kang; College of Internet of Things, Nanjing University of Posts and Telecommunications |
| Quan Zhou; College of Communication and Information Engineering, Nanjing University of Posts and Telecommunications |
| Min Lin; College of Communication and Information Engineering, Nanjing University of Posts and Telecommunications |
| Suofei Zhang; College of Interner of Things, Nanjing University of Posts and Telecommunications |
| |
| MMSP-P1.7: LEARNING SEMANTIC-PRESERVING SPACE USING USER PROFILE AND MULTIMODAL MEDIA CONTENT FROM POLITICAL SOCIAL NETWORK |
| Wei-Hao Chang; National Tsing Hua University |
| Jeng-Lin Li; National Tsing Hua University |
| Chi-Chun Lee; National Tsing Hua University |
| |
| MMSP-P1.8: NOISE-TOLERANT AUDIO-VISUAL ONLINE PERSON VERIFICATION USING AN ATTENTION-BASED NEURAL NETWORK FUSION |
| Suwon Shon; Massachusetts Institute of Technology |
| Tae-Hyun Oh; Massachusetts Institute of Technology |
| James Glass; Massachusetts Institute of Technology |
| |
| MMSP-P1.9: CROSS-CULTURE MULTIMODAL EMOTION RECOGNITION WITH ADVERSARIAL LEARNING |
| Jingjun Liang; Renmin University of China |
| Shizhe Chen; Renmin University of China |
| Jinming Zhao; Renmin University of China |
| Qin Jin; Renmin University of China |
| Haibo Liu; Tencent |
| Li Lu; Tencent |
| |
| MMSP-P1.10: A DEEP-NARMA FILTER FOR UNUSUAL BEHAVIOR DETECTION FROM VISUAL, THERMAL AND WIRELESS SIGNALS |
| Nikolaos Bakalos; National Technical University of Athens |
| Athanasios Voulodimos; University of West Attica |
| Anastasios Doulamis; National Technical University of Athens |
| Nikolaos Doulamis; National Technical University of Athens |
| |
| MMSP-P1.11: LEARNING DISENTANGLED REPRESENTATION IN LATENT STOCHASTIC MODELS: A CASE STUDY WITH IMAGE CAPTIONING |
| Nidhi Vyas; Carnegie Mellon University |
| SaiKrishna Rallabandi; Carnegie Mellon University |
| Lalitesh Morishetti; Carnegie Mellon University |
| Eduard Hovy; Carnegie Mellon University |
| Alan W Black; Carnegie Mellon University |
| |