AUD-P12: Audio, Speech and Music Analysis |
Session Type: Poster |
Time: Friday, 8 May, 15:15 - 17:15 |
Location: On-Demand |
Virtual Session: View on Virtual Platform |
Session Chair: Justin Salomon, Adobe Research |
AUD-P12.1: SNORER DIARISATION BASED ON DEEP NEURAL NETWORK EMBEDDINGS |
Hector E. Romero; University of Sheffield |
Ning Ma; University of Sheffield |
Guy J. Brown; University of Sheffield |
AUD-P12.2: PLAYING TECHNIQUE RECOGNITION BY JOINT TIME–FREQUENCY SCATTERING |
Changhong Wang; Queen Mary University of London |
Vincent Lostanlen; New York University |
Emmanouil Benetos; Queen Mary University of London |
Elaine Chew; IRCAM |
AUD-P12.3: PRIVACY AWARE ACOUSTIC SCENE SYNTHESIS USING DEEP SPECTRAL FEATURE INVERSION |
Félix Gontier; LS2N |
Mathieu Lagrange; LS2N |
Catherine Lavandier; Etis, Université de Cergy-Pontoise |
Jean-François Petiot; LS2N |
AUD-P12.4: ROBUSTNESS ASSESSMENT OF AUTOMATIC REINKE’S EDEMA DIAGNOSIS SYSTEMS |
Mario Madruga; Universidad de Extremadura |
Yolanda Campos-Roca; Universidad de Extremadura |
Carlos J. Pérez; Universidad de Extremadura |
AUD-P12.5: WHOSECOUGH: IN-THE-WILD COUGHER VERIFICATION USING MULTITASK LEARNING |
Matt Whitehill; University of Washington |
Jake Garrison; Google, Inc. |
Shwetak Patel; University of Washington |
AUD-P12.6: CHIRPING UP THE RIGHT TREE: INCORPORATING BIOLOGICAL TAXONOMIES INTO DEEP BIOACOUSTIC CLASSIFIERS |
Jason Cramer; New York University |
Vincent Lostanlen; Cornell Lab of Ornithology |
Andrew Farnsworth; Cornell University |
Justin Salamon; Adobe |
Juan Pablo Bello; New York University |
AUD-P12.7: BEAMFORMING DESIGN FOR HIGH-RESOLUTION LOW-INTENSITY FOCUSED ULTRASOUND NEUROMODULATION |
Boqiang Fan; Rice University |
Wayne Goodman; Baylor College of Medicine |
Raymond Cho; Baylor College of Medicine |
Sameer Sheth; Baylor College of Medicine |
Richard Bouchard; University of Texas MD Anderson Cancer Center |
Behnaam Aazhang; Rice University |
AUD-P12.8: AN ATTENTION ENHANCED MULTI-TASK MODEL FOR OBJECTIVE SPEECH ASSESSMENT IN REAL-WORLD ENVIRONMENTS |
Xuan Dong; Indiana University |
Donald S. Williamson; Indiana University |
AUD-P12.9: HUMBUG ZOONIVERSE: A CROWD-SOURCED ACOUSTIC MOSQUITO DATASET |
Ivan Kiskin; University of Oxford |
Adam Cobb; University of Oxford |
Lawrence Wang; University of Oxford |
Stephen Roberts; University of Oxford |
AUD-P12.10: SUBJECTIVE QUALITY ESTIMATION USING PESQ FOR HANDS-FREE TERMINALS |
Sachiko Kurihara; NTT Corporation |
Masahiro Fukui; NTT Corporation |
Suehiro Shimauchi; Kanazawa Institute of Technology |
Noboru Harada; NTT Corporation |
AUD-P12.11: VOICE ACTIVITY DETECTION FOR TRANSIENT NOISY ENVIRONMENT BASED ON DIFFUSION NETS |
Amir Ivry; Technion - Israel Institute of Technology |
Baruch Berdugo; Technion - Israel Institute of Technology |
Israel Cohen; Technion - Israel Institute of Technology |
AUD-P12.12: GRIFFIN–LIM LIKE PHASE RECOVERY VIA ALTERNATING DIRECTION METHOD OF MULTIPLIERS |
Yoshiki Masuyama; Waseda University |
Kohei Yatabe; Waseda University |
Yasuhiro Oikawa; Waseda University |