EUSIPCO 2025 || Palermo, Italy || 8 - 12 September 2025

ASMSP-P11.6

MIMII-Gen: Generative Modeling Approach for Simulated Evaluation of Anomalous Sound Detection System

Harsh Purohit, Tomoya Nishida, Kota Dohi, Takashi Endo, Yohei Kawaguchi, Hitachi, Japan

Session:

ASMSP-P11: Audio Deepfake and Anomaly Detection Poster

Location:

Poster Area C

Presentation Time:

Thu, 11 Sep, 15:30 - 17:10 Italy Time (UTC +2)

Session Chair:

Tomoya Nishida, Hitachi, Ltd.

Session ASMSP-P11

ASMSP-P11.1: TIMBRE-BASED ANOMALY EXPLANATION WITHOUT ANOMALOUS TRAINING DATA

Tomoya Nishida, Harsh Purohit, Kota Dohi, Takashi Endo, Yohei Kawaguchi, Hitachi, Ltd., Japan

ASMSP-P11.2: PITCH-SHIFT ROBUSTNESS IN VOICE-BASED FRAUD DETECTION

David Looney, Nikolay Gaubitch, Pindrop, United Kingdom

ASMSP-P11.3: DIN-CTS: Low-Complexity Depthwise-Inception Neural Network with Contrastive Training Strategy for Deepfake Speech Detection

Lam Pham, Austrian Institute of Technology, Austria; Dat Tran, FPT University, Viet Nam; Phat Lam, HCM University of Technology, Viet Nam; Florian Skopik, Alexander Schindler, Silvia Poletti, David Fischinger, Martin Boyer, Austrian Institute of Technology, Austria

ASMSP-P11.4: KAN You Hear the Truth? Audio Deepfake Detection with Kolmogorov–Arnold Networks

Hoan My Tran, Univ Rennes/IRISA/CNRS, France; Damien Lolive, Univ of South Brittany/IRISA/CNRS, France; David Guennec, Univ Rennes/IRISA/CNRS, France; Aghilas Sini, Le Mans University/LIUM, France; Arnaud Delhay, Univ Rennes/IRISA/CNRS, France; Pierre-François Marteau, Univ of South Brittany/IRISA/CNRS, France

ASMSP-P11.5: BENCHMARKING AUDIO DEEPFAKE DETECTION ROBUSTNESS IN REAL-WORLD COMMUNICATION SCENARIOS

Haohan Shi, Xiyu Shi, Safak Dogan, Loughborough University London, United Kingdom; Saif Alzubi, Tianjin Huang, Yunxiao Zhang, University of Exeter, United Kingdom

ASMSP-P11.6: MIMII-Gen: Generative Modeling Approach for Simulated Evaluation of Anomalous Sound Detection System

Harsh Purohit, Tomoya Nishida, Kota Dohi, Takashi Endo, Yohei Kawaguchi, Hitachi, Japan

ASMSP-P11.7: MFCC VS. LFCC FOR AUDIO DEEPFAKE DETECTION: THE ROLE OF DELTA FEATURES AND INPUT LENGTH

Karla Schäfer, Martin Steinebach, Fraunhofer SIT, ATHENE, Germany

ASMSP-P11.8: Unsupervised Anomalous Sound Detection Focused on Timbral-Related Features

Ryoya Ogura, Masashi Unoki, Japan Advanced Institute of Science and Technology, Japan

ASMSP-P11.9: AUDIO DEEPFAKE DETECTION UNDER POST-PROCESSING ATTACK

Karla Schäfer, Jeong-Eun Choi, Martin Steinebach, Fraunhofer SIT, ATHENE, Germany