TA2.PC.2

Distinctive Image Captioning: Leveraging Ground Truth Captions in CLIP Guided Reinforcement Learning

Antoine Chaffin, IMATAG, IRISA, France; Ewa Kijak, Univ. Rennes, IRISA, Inria, France; Vincent Claveau, CNRS, IRISA, France

Session:
TA2.PC: Image & Video Interpretation and Understanding - VI Poster

Track:
Image and Video Analysis, Synthesis, and Retrieval

Location:
Poster Area C

Presentation Time:
Tue, 29 Oct, 10:30 - 12:00 Gulf Standard Time (UTC +4)

Session Chair:
Salman Khan, Mohamed bin Zayed University of Artificial Intelligence
View Manuscript
Presentation
Discussion
Resources
Session TA2.PC
TA2.PC.1: REFERRING IMAGE SEGMENTATION WITH TWO-STAGE MULTI-MODAL INTERACTION
Zhenhua Wang, Linwei Ye, Wenzhou University, China
TA2.PC.2: Distinctive Image Captioning: Leveraging Ground Truth Captions in CLIP Guided Reinforcement Learning
Antoine Chaffin, IMATAG, IRISA, France; Ewa Kijak, Univ. Rennes, IRISA, Inria, France; Vincent Claveau, CNRS, IRISA, France
TA2.PC.3: STATISTICS-AWARE AUDIO-VISUAL DEEPFAKE DETECTOR
Marcella Astrid, University of Luxembourg, Luxembourg; Enjie Ghorbel, Manouba University, Tunisia; Djamila Aouada, University of Luxembourg, Luxembourg
TA2.PC.4: CROSS-DOMAIN FEW-SHOT IN-CONTEXT LEARNING FOR ENHANCING TRAFFIC SIGN RECOGNITION
Yaozong Gan, Guang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama, Hokkaido University, Japan
TA2.PC.5: Edge-Reserved Knowledge Distillation for Image Matting
Jiasheng Wang, Zhenhua Wang, Jifeng Ning, Northwest A&F University, China
TA2.PC.6: LEARNING A RAIN-INVARIANT NETWORK FOR INSTANCE SEGMENTATION IN THE RAIN
Zhiwen Chen, Wei Wu, Zhengfeng Chen, Xidian University, China
TA2.PC.7: Rethinking Domain Adaptation and Generalization in the Era of CLIP
Ruoyu Feng, Tao Yu, University of Science and Technology of China, China; Xin Jin, Eastern Institute of Technology, Ningbo, China; Xiaoyuan Yu, Lei Xiao, Huawei Cloud, China; Zhibo Chen, University of Science and Technology of China, China
TA2.PC.8: FANET: FEATURE AMPLIFICATION NETWORK FOR SEMANTIC SEGMENTATION IN CLUTTERED BACKGROUND
Muhammad Ali, Mohamed bin Zayed University of Artificial Intelligence, United Arab Emirates; Mamoona Javaid, Institute of Space Technology, Pakistan; Mubashir Noman, Mohamed bin Zayed University of Artificial Intelligence, United Arab Emirates; Mustansar Fiaz, IBM Research, United Arab Emirates; Salman Khan, Mohamed bin Zayed University of Artificial Intelligence, United Arab Emirates
TA2.PC.9: TOWARDS GENERALIZABLE REFERRING IMAGE SEGMENTATION VIA TARGET PROMPT AND VISUAL COHERENCE
Yajie Liu, Beihang University, China; Pu Ge, Hangzhou Innovation Institute, China; Haoxiang Ma, Shichao Fan, Qingjie Liu, Di Huang, Yunhong Wang, Beihang University, China
TA2.PC.10: EXPLORING THE POTENTIAL OF RECURRENCE QUANTIFICATION ANALYSIS FOR VIDEO ANALYSIS AND MOTION DETECTION
Theodora Kyprianidi, Effrosyni Doutsi, George Tzagkarakis, Panagiotis Tsakalides, Foundation for Research and Technology - Hellas, Greece
Contacts