SLP-P28.2
WaveSP-Net: Learnable Wavelet-Domain Sparse Prompt Tuning for Speech Deepfake Detection
Xi Xuan, University of Eastern Finland, School of Computing, Finland; Xuechen Liu, National Institute of Informatics, Japan; Wenxin Zhang, University of Chinese Academy of Sciences/University of Toronto, Canada; Yi-Cheng Lin, National Taiwan University, Taiwan; Xiaojian Lin, Tsinghua University, China; Tomi Kinnunen, University of Eastern Finland, Finland
Session:
SLP-P28: Modeling & Architectures for Audio Deepfake and Spoofing Detection Poster
Track:
Speech and Language Processing [SL]
Location:
Poster Area 30
Presentation Time:
Wed, 6 May, 16:30 - 18:30
Session Chair:
Chanwoo Kim, Professor of Artificial Intelligence, Korea University
Presentation
Discussion
Resources
No resources available.
Session SLP-P28
SLP-P28.1: DISCRETE-CONTINUOUS FUSION WITH ADAPTIVE HIERARCHICAL FEATURES FOR AUDIO DEEPFAKE DETECTION
Jianqiao Cui, Bingyao Yu, Tsinghua University, China; Shun Qin, Yangtze Delta Region Institute Of Tsinghua University, Zhejiang, China
SLP-P28.2: WaveSP-Net: Learnable Wavelet-Domain Sparse Prompt Tuning for Speech Deepfake Detection
Xi Xuan, University of Eastern Finland, School of Computing, Finland; Xuechen Liu, National Institute of Informatics, Japan; Wenxin Zhang, University of Chinese Academy of Sciences/University of Toronto, Canada; Yi-Cheng Lin, National Taiwan University, Taiwan; Xiaojian Lin, Tsinghua University, China; Tomi Kinnunen, University of Eastern Finland, Finland
SLP-P28.3: TOWARDS DATA DRIFT MONITORING FOR SPEECH DEEPFAKE DETECTION IN THE CONTEXT OF MLOPS
Xin Wang, Wanying Ge, Junichi Yamagishi, National Institute of Informatics, Japan
SLP-P28.4: KAN WE MAKE MODELS SIMPLER FOR AUDIO DEEPFAKE DETECTION WITH KOLMOGOROV–ARNOLD NETWORKS?
Hoan My Tran, Univ Rennes/IRISA/CNRS, France; Aghilas Sini, Le Mans University/LIUM, France; David Guennec, Arnaud Delhay, Univ Rennes/IRISA/CNRS, France; Damien Lolive, Pierre-François Marteau, Univ Bretagne Sud/IRISA/CNRS, France
SLP-P28.5: ROBUST DEEPFAKE AUDIO DETECTION VIA MULTI-LEVEL INTERMEDIATE FEATURE FUSION
Jinpeng Zhao, Jian Zhao, Yufei Zhou, Peijia Zheng, Yusong Du, Sun Yat-sen University, China
SLP-P28.6: CompSpoof: A Dataset and Joint Learning Framework for Component-Level Audio Anti-spoofing Countermeasures
Xueping Zhang, Duke Kunshan University, China; Yechen Wang, Linxi Li, Liwei Jin, OfSpectrum, Inc., United States of America; Ming Li, Duke Kunshan University, China
SLP-P28.7: A PARAMETER-EFFICIENT MULTI-SCALE CONVOLUTIONAL ADAPTER FOR SYNTHETIC SPEECH DETECTION
Yassine El Kheir, DFKI, Germany; Fabian Ritter-Guttierez, NTU, Singapore; Arnab Das, Tim Polzehl, Sebastian Möller, DFKI, Germany
SLP-P28.8: TRI-ATTENTION FUSION: JOINT TEMPORAL-SPECTRAL AND BIDIRECTIONAL MODELING FOR SPEECH SPOOFING DETECTION
Minjiao Yang, Kangfeng Zheng, Jujie Wang, Xiaoyu Zhang, School of Cyberspace Security, Beijing University of Posts and Telecommunications, China; Yaru Zhao, University of International Relations, China
SLP-P28.9: FINE-GRAINED FRAME MODELING IN MULTI-HEAD SELF-ATTENTION FOR SPEECH DEEPFAKE DETECTION
Tuan Dat Phuong, Hanoi University of Science and Technology, Viet Nam; Duc Tuan Truong, Nanyang Technological University, Singapore; Long Vu Hoang, Thi Thu Trang Nguyen, Hanoi University of Science and Technology, Viet Nam
SLP-P28.10: XLSR-MAMBA: A DUAL-COLUMN BIDIRECTIONAL STATE SPACE MODEL FOR SPOOFING ATTACK DETECTION
Yang Xiao, The University of Melbourne, Australia; Rohan Kumar Das, Fortemedia, Singapore
Contacts