MMSP-L1.5

CAPTION UNIFICATION FOR MULTI-VIEW LIFELOGGING IMAGES BASED ON IN-CONTEXT LEARNING WITH HETEROGENEOUS SEMANTIC CONTENTS

Masaya Sato, Keisuke Maeda, Ren Togo, Takahiro Ogawa, Miki Haseyama, Hokkaido University, Japan

Session:
MMSP-L1: Multimodal Processing: Vision + Language 1 Lecture

Track:
Multimedia Signal Processing

Location:
Room 201

Presentation Time:
Tue, 16 Apr, 17:50 - 18:10 (UTC +9)

Session Co-Chairs:
Jin Zeng, Tongji University, Shanghai, China; Fernando Pereira, IST, Portugal