AASP-P14: Acoustic Event Detection and Speech Enhancement |
Session Type: Poster |
Time: Friday, May 17, 08:30 - 10:30 |
Location: Poster Area D, Ground Floor |
Session Chair: Sven Nordholm, Curtin University
|
|
AASP-P14.1: SCENE-DEPENDENT ANOMALOUS ACOUSTIC-EVENT DETECTION BASED ON CONDITIONAL WAVENET AND I-VECTOR |
Tatsuya Komatsu; NEC Corporation |
Tomoki Hayashi; Nagoya University |
Reishi Kondo; NEC Corporation |
Tomoki Toda; Nagoya University |
Kazuya Takeda; Nagoya University |
|
AASP-P14.2: TEACHER-STUDENT TRAINING FOR ACOUSTIC EVENT DETECTION USING AUDIOSET |
Ruibo Shi; Emotech Labs |
Raymond W. M. Ng; Emotech Labs |
Pawel Swietojanski; The University of New South Wales |
|
AASP-P14.3: ACTIVE LEARNING FOR EFFICIENT AUDIO ANNOTATION AND CLASSIFICATION WITH A LARGE AMOUNT OF UNLABELED DATA |
Yu Wang; New York University |
Ana Elisa Mendez Mendez; New York University |
Mark Cartwright; New York University |
Juan Pablo Bello; New York University |
|
AASP-P14.4: POLYPHONIC SOUND EVENT DETECTION USING CONVOLUTIONAL BIDIRECTIONAL LSTM AND SYNTHETIC DATA-BASED TRANSFER LEARNING |
Seokwon Jung; Humelo Inc. / Korea Advanced Institute of Science and Technology (KAIST) |
Jungbae Park; Humelo Inc. / Korea Advanced Institute of Science and Technology (KAIST) |
Sangwan Lee; Korea Advanced Institute of Science and Technology / Humelo Inc. |
|
AASP-P14.5: A MULTI-SPIKE APPROACH FOR ROBUST SOUND RECOGNITION |
Qiang Yu; Tianjin University |
Yanli Yao; Tianjin University |
Longbiao Wang; Tianjin University |
Huajin Tang; Sichuan University |
Jianwu Dang; Tianjin University |
|
AASP-P14.6: CROSS EVALUATION OF SPEECH ENHANCEMENT METHODS UNDER DIFFERENT NOISE CONDITIONS |
Lara Nahma; Curtin University |
Pei Chee Yong; Nuheara |
Hai Huyen Dam; Curtin University |
Sven Nordholm; Curtin University |
|
AASP-P14.7: DIFFERENTIABLE CONSISTENCY CONSTRAINTS FOR IMPROVED DEEP SPEECH ENHANCEMENT |
Scott Wisdom; Google, Inc. |
John R. Hershey; Google, Inc. |
Kevin Wilson; Google, Inc. |
Jeremy Thorpe; Google, Inc. |
Michael Chinen; Google, Inc. |
Brian Patton; Google, Inc. |
Rif Saurous; Google, Inc. |
|
AASP-P14.8: A DEEP GENERATIVE MODEL OF SPEECH COMPLEX SPECTROGRAMS |
Aditya Arie Nugraha; RIKEN Center for Advanced Intelligence Project |
Kouhei Sekiguchi; RIKEN Center for Advanced Intelligence Project |
Kazuyoshi Yoshii; RIKEN Center for Advanced Intelligence Project |
|
AASP-P14.9: DNN TRAINING BASED ON CLASSIC GAIN FUNCTION FOR SINGLE-CHANNEL SPEECH ENHANCEMENT AND RECOGNITION |
Yanhui Tu; University of Science and Technology of China |
Jun Du; University of Science and Technology of China |
Chin-Hui Lee; Georgia Institute of Technology |
|
AASP-P14.10: SNIPER: FEW-SHOT LEARNING FOR ANOMALY DETECTION TO MINIMIZE FALSE-NEGATIVE RATE WITH ENSURED TRUE-POSITIVE RATE |
Yuma Koizumi; NTT Corporation |
Shin Murata; NTT Corporation |
Noboru Harada; NTT Corporation |
Shoichiro Saito; NTT Corporation |
Hisashi Uematsu; NTT Corporation |
|