AUD-P12: Audio, Speech and Music Analysis |
| Session Type: Poster |
| Time: Friday, 8 May, 15:15 - 17:15 |
| Location: On-Demand |
| Session Chair: Justin Salamon, Adobe Research |
| AUD-P12.1: SNORER DIARISATION BASED ON DEEP NEURAL NETWORK EMBEDDINGS |
| Hector E. Romero; University of Sheffield |
| Ning Ma; University of Sheffield |
| Guy J. Brown; University of Sheffield |
| AUD-P12.2: PLAYING TECHNIQUE RECOGNITION BY JOINT TIME–FREQUENCY SCATTERING |
| Changhong Wang; Queen Mary University of London |
| Vincent Lostanlen; New York University |
| Emmanouil Benetos; Queen Mary University of London |
| Elaine Chew; IRCAM |
| AUD-P12.3: PRIVACY AWARE ACOUSTIC SCENE SYNTHESIS USING DEEP SPECTRAL FEATURE INVERSION |
| Félix Gontier; LS2N |
| Mathieu Lagrange; LS2N |
| Catherine Lavandier; ETIS, Université de Cergy-Pontoise |
| Jean-François Petiot; LS2N |
| AUD-P12.4: ROBUSTNESS ASSESSMENT OF AUTOMATIC REINKE’S EDEMA DIAGNOSIS SYSTEMS |
| Mario Madruga; Universidad de Extremadura |
| Yolanda Campos-Roca; Universidad de Extremadura |
| Carlos J. Pérez; Universidad de Extremadura |
| AUD-P12.5: WHOSECOUGH: IN-THE-WILD COUGHER VERIFICATION USING MULTITASK LEARNING |
| Matt Whitehill; University of Washington |
| Jake Garrison; Google, Inc. |
| Shwetak Patel; University of Washington |
| AUD-P12.6: CHIRPING UP THE RIGHT TREE: INCORPORATING BIOLOGICAL TAXONOMIES INTO DEEP BIOACOUSTIC CLASSIFIERS |
| Jason Cramer; New York University |
| Vincent Lostanlen; Cornell Lab of Ornithology |
| Andrew Farnsworth; Cornell University |
| Justin Salamon; Adobe |
| Juan Pablo Bello; New York University |
| AUD-P12.7: BEAMFORMING DESIGN FOR HIGH-RESOLUTION LOW-INTENSITY FOCUSED ULTRASOUND NEUROMODULATION |
| Boqiang Fan; Rice University |
| Wayne Goodman; Baylor College of Medicine |
| Raymond Cho; Baylor College of Medicine |
| Sameer Sheth; Baylor College of Medicine |
| Richard Bouchard; University of Texas MD Anderson Cancer Center |
| Behnaam Aazhang; Rice University |
| AUD-P12.8: AN ATTENTION ENHANCED MULTI-TASK MODEL FOR OBJECTIVE SPEECH ASSESSMENT IN REAL-WORLD ENVIRONMENTS |
| Xuan Dong; Indiana University |
| Donald S. Williamson; Indiana University |
| AUD-P12.9: HUMBUG ZOONIVERSE: A CROWD-SOURCED ACOUSTIC MOSQUITO DATASET |
| Ivan Kiskin; University of Oxford |
| Adam Cobb; University of Oxford |
| Lawrence Wang; University of Oxford |
| Stephen Roberts; University of Oxford |
| AUD-P12.10: SUBJECTIVE QUALITY ESTIMATION USING PESQ FOR HANDS-FREE TERMINALS |
| Sachiko Kurihara; NTT Corporation |
| Masahiro Fukui; NTT Corporation |
| Suehiro Shimauchi; Kanazawa Institute of Technology |
| Noboru Harada; NTT Corporation |
| AUD-P12.11: VOICE ACTIVITY DETECTION FOR TRANSIENT NOISY ENVIRONMENT BASED ON DIFFUSION NETS |
| Amir Ivry; Technion - Israel Institute of Technology |
| Baruch Berdugo; Technion - Israel Institute of Technology |
| Israel Cohen; Technion - Israel Institute of Technology |
| AUD-P12.12: GRIFFIN–LIM LIKE PHASE RECOVERY VIA ALTERNATING DIRECTION METHOD OF MULTIPLIERS |
| Yoshiki Masuyama; Waseda University |
| Kohei Yatabe; Waseda University |
| Yasuhiro Oikawa; Waseda University |