AASP-P27.2
TIMBRE-AWARE AUDIO DIFFERENCE CAPTIONING FOR ANOMALOUS MACHINE SOUNDS WITHOUT PAIRED TRAINING DATA VIA SYNTHETIC PERTURBATIONS
TOMOYA NISHIDA, Harsh Purohit, Kota Dohi, Takashi Endo, Yohei Kawaguchi, Hitachi, Ltd., Japan
Session:
AASP-P27: Anomalous Sound Detection and Machine Sound Analysis Poster
Track:
Audio and Acoustic Signal Processing [AA]
Location:
Poster Area 25
Presentation Time:
Fri, 8 May, 09:00 - 11:00
Presentation
Discussion
Resources
No resources available.
Session AASP-P27
AASP-P27.1: REFGEN: REFERENCE-GUIDED SYNTHETIC DATA GENERATION FOR ANOMALOUS SOUND DETECTION
Wenrui Liang, Tsinghua University, China; Yihong Qiu, North China Electric Power University, China; Anbai Jiang, Tsinghua University, China; Bing Han, Shanghai Jiao Tong University, China; Tianyu Liu, Tsinghua University, China; Xinhu Zheng, Shanghai Jiao Tong University, China; Pingyi Fan, Tsinghua University, China; Cheng Lu, North China Electric Power University, China; Jia Liu, Wei-Qiang Zhang, Tsinghua University, China
AASP-P27.2: TIMBRE-AWARE AUDIO DIFFERENCE CAPTIONING FOR ANOMALOUS MACHINE SOUNDS WITHOUT PAIRED TRAINING DATA VIA SYNTHETIC PERTURBATIONS
TOMOYA NISHIDA, Harsh Purohit, Kota Dohi, Takashi Endo, Yohei Kawaguchi, Hitachi, Ltd., Japan
AASP-P27.3: From Human Speech to Ocean Signals: Transferring Speech Large Models for Underwater Acoustic Target Recognition
Mengcheng Huang, Xue Zhou, Chen Xu, Dapeng Man, Harbin Engineering University, China
AASP-P27.4: Improving Anomalous Sound Detection with Attribute-aware Representation from Domain-adaptive Pre-training
Xin Fang, iFLYTEK Research, China; Guirui Zhong, Qing Wang, University of Science and Technology of China, China; Fan Chu, National Intelligent Voice Innovation Center, China; Lei Wang, iFLYTEK Research, China; Mengui Qian, National Intelligent Voice Innovation Center, China; Mingqi Cai, iFLYTEK Research, China; Jiangzhao Wu, National Intelligent Voice Innovation Center, China; Jianqing Gao, iFLYTEK Research, China; Jun Du, University of Science and Technology of China, China
AASP-P27.5: INFLUENCE-AWARE CURATION AND ACTIVE SELECTION FOR INDUSTRIAL AND SURVEILLANCE SOUND EVENTS
Myeonghoon Ryu, Deeply Inc., Korea, Republic of; Seongkyu Mun, Korea Univ, Korea, Republic of; Daewoong Kim, Han Park, Suji Lee, Deeply Inc., Korea, Republic of
AASP-P27.6: TLDIFFGAN: A LATENT DIFFUSION-GAN FRAMEWORK WITH TEMPORAL INFORMATION FUSION FOR ANOMALOUS SOUND DETECTION
Chengyuan Ma, Tsinghua university, China; Peng Jia, Hongyue Guo, Dalian Maritime University, China; Wenming Yang, Tsinghua university, China
AASP-P27.7: RASD-SR: A ROBUST ANOMALOUS SOUND DETECTION FRAMEWORK WITH SCORE RECALIBRATION
Ting Wu, Lu Han, Institute of Acoustics, Chinese Academy of Sciences, China; Zhaoli Yan, Beijing University of Chemical Technology, China; Xiaobin Cheng, Jun Yang, Institute of Acoustics, Chinese Academy of Sciences, China
AASP-P27.8: A LLM-Driven Acoustic Semantic Enriched Framework For Underwater Acoustic Target Recognition
Jingkai Cao, Donghua University, China; Shicheng Ding, Tabor Academy, Massachusetts, USA, China; Shuai Yu, Dalian University of Technology, China; Wei Li, Fudan University, China
AASP-P27.9: ADAPTIVE TASK-INCREMENTAL LEARNING FOR UNDERWATER ACOUSTIC RECOGNITION BASED ON MIXTURE-OF-EXPERTS ADAPTER
Yang Zhang, Changjian Wang, Weiguo Chen, Yuan Yuan, Yingzhi Chen, College of Computer Science and Technology, National University of Defense Technology, China
AASP-P27.10: Phase-Space Signal Processing of Acoustic Data for Advanced Manufacturing in-situ Monitoring
Pouria Meshki Zadeh, Shams Torabnia, Nathan Fonseca, Keng Hsu, Ehsan Dehghan Niri, Arizona State University, United States of America
Contacts