MO2.PB.8
Multi-scale Spatial-Frequency Features Representation and Learnable Cross Modal Feature Fusion in Deepfake Detection
Yuzhi Lu, Wenyi Wang, Xiaowen Chen, Fengyu Wang, Shuai Wang, Jianwen Chen, uestc, China
Session:
MO2.PB: Image and Video Analysis, Synthesis, and Retrieval 5 Poster
Track:
[IV-ANA] Image and video analysis, synthesis, and retrieval
Location:
Poster Area B
Presentation Time:
Mon, 15 Sep, 10:00 - 11:30 Anchorage Time (UTC -8)
Session Chair:
Maggie Zhu, Purdue University
Presentation
Discussion
Resources
No resources available.
Session MO2.PB
MO2.PB.1: ORIENTED OBJECT DETECTION BASED ON COMPOSITE TRIGONOMETRIC FUNCTION CODER
Jing Hu, Jiawei Liang, Minghua Zhao, Shuangli Du, Peng Li, Shaanxi Key Laboratory for Network Computing and Security Technology, School of Computer Science and Engineering, Xi’an University of Technology, China
MO2.PB.2: SEMANTIC CONTEXT RE-MINING FOR MULTIMODAL GUIDED HUMAN-OBJECT INTERACTION DETECTION
Jihao Dong, Hua Yang, Shanghai Jiao Tong University, China
MO2.PB.3: ROBUST CHARACTER STROKE SEGMENTATION FOR DIVERSE FONTS VIA CONTOUR MATCHING AND CHAIN PROPAGATION
Hao Xia, Shenzhen University, China; Xueting Liu, Chengze Li, Saint Francis University, China; Zhenkun Wen, Huisi Wu, Shenzhen University, China
MO2.PB.4: TASK-SPECIFIC SPATIOTEMPORAL CONTEXT-AWARE DECOUPLING FOR OCCLUDED VIDEO OBJECT DETECTION
Kaihong Li, Fei Chen, College of Computer and Data Science, Fuzhou University, Fuzhou, China, China; Xunxun Zeng, College of Mathematics and Statistics, Fuzhou University, Fuzhou, China, China; Wanling Liu, College of Computer and Data Science, Fuzhou University, Fuzhou, China, China; Huayi Chen, Faculty of Science, University of Manitoba, Winnipeg, Canada, China
MO2.PB.5: CMP: COMPOSABLE META PROMPT FOR SAM-BASED CROSS-DOMAIN FEW-SHOT SEGMENTATIO
Shuai Chen, Fanman Meng, Chunjin Yang, Haoran Wei, Chenhao Wu, Qingbo Wu, Hongliang Li, University of Electronic Science and Technology of China, China
MO2.PB.6: GRID-LOGAT: GRID BASED LOCAL AND GLOBAL AREA TRANSCRIPTION FOR VIDEO QUESTION ANSWERING
Md Intisar Chowdhury, Kittinun Aukkapinyo, Hiroshi Fujimura, Joo Ann Woo, Wasu Wasusatein, Fadoua Ghourabi, AWL, Inc., Japan
MO2.PB.7: GEE-UOD: AN UNDERWATER OBJECT DETECTION NETWORK BASED ON GLOBAL AND EDGE INFORMATION ENHANCEMENT
Weirui Na, Yong Wang, Chendong Xu, Zijun Huang, Qisong Wu, Southeast University, China
MO2.PB.8: Multi-scale Spatial-Frequency Features Representation and Learnable Cross Modal Feature Fusion in Deepfake Detection
Yuzhi Lu, Wenyi Wang, Xiaowen Chen, Fengyu Wang, Shuai Wang, Jianwen Chen, uestc, China
MO2.PB.9: RE-PURPOSING SEGMENT ANYTHING FOR SKELETON ACTION LOCALIZATION
Christopher Bunn, Wanqing Li, Jack Yang, University of Wollongong, Australia
MO2.PB.10: VISUAL PROMPT AIDED SINGLE SHOT OBJECT PART SEGMENTATION
Anant Mohan, International Institute of Information Technology Bangalore, India; Yiyong Tan, Gridraster, India; Sarthak Harne, Viswanath Gopalakrishnan, International Institute of Information Technology Bangalore, India; Bhaskar Banerjee, Rishi Ranjan, Pradeep Rangdhol, Gridraster, India
MO2.PB.11: DETECTION OF SCREEN USAGE DURING EATING EVENTS AMONG PRESCHOOL-AGED CHILDREN
Tonmoy Ghosh, Md Billal Hossain, Steven Holiday, Matthew Cribbet, Susan White, University of Alabama, United States; Yu Gan, Stevens Institute of Technology, United States; Edward Sazonov, University of Alabama, United States
MO2.PB.12: CONFIDENCE-AWARE AGGLOMERATION CLASSIFICATION AND SEGMENTATION OF 2D MICROSCOPIC FOOD CRYSTAL IMAGES
Xiaoyu Ji, Ali Shakouri, Fengqing Zhu, Purdue University, United States
MO2.PB.13: DEEP OBJECT RECOGNITION-BASED ANALYSIS OF DIVERSE CULINARY LANDSCAPES
Aibota Sanatbyek, Aknur Karabay, Huseyin Atakan Varol, Mei-Yen Chan, Nazarbayev University, Kazakhstan
Contacts