SLP-L20.1
HM-CONFORMER: A CONFORMER-BASED AUDIO DEEPFAKE DETECTION SYSTEM WITH HIERARCHICAL POOLING AND MULTI-LEVEL CLASSIFICATION TOKEN AGGREGATION METHODS
Hyun-seo Shin, Jungwoo Heo, Ju-ho Kim, Chan-yeong Lim, Wonbin Kim, Ha-jin Yu, University of Seoul, Korea, Republic of
Session:
SLP-L20: Anti-spoofing I Lecture
Track:
Speech and Language Processing
Location:
Room E2
Presentation Time:
Thu, 18 Apr, 08:20 - 08:40 (UTC +9)
Session Co-Chairs:
Kong Aik Lee, The Hong Hong Polytechnic University and Massimiliano Todisco, EURECOM Graduate School and Research Center
Session SLP-L20
SLP-L20.1: HM-CONFORMER: A CONFORMER-BASED AUDIO DEEPFAKE DETECTION SYSTEM WITH HIERARCHICAL POOLING AND MULTI-LEVEL CLASSIFICATION TOKEN AGGREGATION METHODS
Hyun-seo Shin, Jungwoo Heo, Ju-ho Kim, Chan-yeong Lim, Wonbin Kim, Ha-jin Yu, University of Seoul, Korea, Republic of
SLP-L20.2: CAN LARGE-SCALE VOCODED SPOOFED DATA IMPROVE SPEECH SPOOFING COUNTERMEASURE WITH A SELF-SUPERVISED FRONT END?
Xin Wang, Junichi Yamagishi, National Institute of Informatics, Japan
SLP-L20.3: AUDIO DEEPFAKE DETECTION WITH SELF-SUPERVISED WAVLM AND MULTI-FUSION ATTENTIVE CLASSIFIER
Yinlin Guo, Haofan Huang, Xi Chen, He Zhao, Yuehai Wang, Zhejiang University, China
SLP-L20.4: Improving Short Utterance Anti-Spoofing with AASIST2
Yuxiang Zhang, Jingze Lu, Zengqiang Shang, Wenchao Wang, Pengyuan Zhang, Institute of Acoustics, Chinese Academy of Sciences, China
SLP-L20.5: GMM-RESNET2: ENSEMBLE OF GROUP RESNET NETWORKS FOR SYNTHETIC SPEECH DETECTION
Zhenchun Lei, Hui Yan, Changhong Liu, Yong Zhou, Minglei Ma, Jiangxi Normal University, China
SLP-L20.6: ROBUST SPOOF SPEECH DETECTION BASED ON MULTI-SCALE FEATURE AGGREGATION AND DYNAMIC CONVOLUTION
Haochen Wu, Jie Zhang, University of Science and Technology of China, China; Zhentao Zhang, Wenting Zhao, China Merchants Bank, China; Bin Gu, Wu Guo, University of Science and Technology of China, China
Contacts