ASMSP-P11.6
MIMII-Gen: Generative Modeling Approach for Simulated Evaluation of Anomalous Sound Detection System
Harsh Purohit, Tomoya Nishida, Kota Dohi, Takashi Endo, Yohei Kawaguchi, Hitachi, Japan
Session:
ASMSP-P11: Audio Deepfake and Anomaly Detection Poster
Track:
ASMSP - Acoustic, Speech and Music Signal Processing
Location:
Poster Area C
Presentation Time:
Thu, 11 Sep, 15:30 - 17:10 Italy Time (UTC +2)
Session Chair:
Tomoya Nishida, Hitachi, Ltd.
Presentation
Discussion
Resources
No resources available.
Session ASMSP-P11
ASMSP-P11.1: TIMBRE-BASED ANOMALY EXPLANATION WITHOUT ANOMALOUS TRAINING DATA
Tomoya Nishida, Harsh Purohit, Kota Dohi, Takashi Endo, Yohei Kawaguchi, Hitachi, Ltd., Japan
ASMSP-P11.2: PITCH-SHIFT ROBUSTNESS IN VOICE-BASED FRAUD DETECTION
David Looney, Nikolay Gaubitch, Pindrop, United Kingdom
ASMSP-P11.3: DIN-CTS: Low-Complexity Depthwise-Inception Neural Network with Contrastive Training Strategy for Deepfake Speech Detection
Lam Pham, Austrian Institute of Technology, Austria; Dat Tran, FPT University, Viet Nam; Phat Lam, HCM University of Technology, Viet Nam; Florian Skopik, Alexander Schindler, Silvia Poletti, David Fischinger, Martin Boyer, Austrian Institute of Technology, Austria
ASMSP-P11.4: KAN You Hear the Truth? Audio Deepfake Detection with Kolmogorov–Arnold Networks
Hoan My Tran, Univ Rennes/IRISA/CNRS, France; Damien Lolive, Univ of South Brittany/IRISA/CNRS, France; David Guennec, Univ Rennes/IRISA/CNRS, France; Aghilas Sini, Le Mans University/LIUM, France; Arnaud Delhay, Univ Rennes/IRISA/CNRS, France; Pierre-François Marteau, Univ of South Brittany/IRISA/CNRS, France
ASMSP-P11.5: BENCHMARKING AUDIO DEEPFAKE DETECTION ROBUSTNESS IN REAL-WORLD COMMUNICATION SCENARIOS
Haohan Shi, Xiyu Shi, Safak Dogan, Loughborough University London, United Kingdom; Saif Alzubi, Tianjin Huang, Yunxiao Zhang, University of Exeter, United Kingdom
ASMSP-P11.6: MIMII-Gen: Generative Modeling Approach for Simulated Evaluation of Anomalous Sound Detection System
Harsh Purohit, Tomoya Nishida, Kota Dohi, Takashi Endo, Yohei Kawaguchi, Hitachi, Japan
ASMSP-P11.7: MFCC VS. LFCC FOR AUDIO DEEPFAKE DETECTION: THE ROLE OF DELTA FEATURES AND INPUT LENGTH
Karla Schäfer, Martin Steinebach, Fraunhofer SIT, ATHENE, Germany
ASMSP-P11.8: Unsupervised Anomalous Sound Detection Focused on Timbral-Related Features
Ryoya Ogura, Masashi Unoki, Japan Advanced Institute of Science and Technology, Japan
ASMSP-P11.9: AUDIO DEEPFAKE DETECTION UNDER POST-PROCESSING ATTACK
Karla Schäfer, Jeong-Eun Choi, Martin Steinebach, Fraunhofer SIT, ATHENE, Germany