AASP-P5.7
SPATIALCODEC: NEURAL SPATIAL SPEECH CODING
Zhongweiyang Xu, University of Illinois Urbana-Champaign, United States of America; Yong Xu, Vinay Kothapally, Tencent AI Lab, United States of America; Heming Wang, The Ohio State University, United States of America; Muqiao Yang, Carnegie Mellon University, United States of America; Dong Yu, Tencent AI Lab, United States of America
Session:
AASP-P5: Localization, DOA estimation, Spatial audio recording and reproduction Poster
Track:
Audio and Acoustic Signal Processing
Location:
Poster Zone 5A
Poster Board PZ-5A.7
Presentation Time:
Wed, 17 Apr, 08:20 - 10:20 (UTC +9)
Session Chair:
Jesper Rindom Jensen, Aalborg University
Session AASP-P5
AASP-P5.1: COMPARISON OF FREQUENCY-FUSION MECHANISMS FOR BINAURAL DIRECTION-OF-ARRIVAL ESTIMATION FOR MULTIPLE SPEAKERS
Daniel Fejgin, University of Oldenburg, Germany; Elior Hadad, Sharon Gannot, Bar-Ilan University, Israel; Zbynek Koldovsky, Technical University of Liberec, Czechia; Simon Doclo, University of Oldenburg, Germany
AASP-P5.2: A STEERED RESPONSE POWER APPROACH WITH BILINEAR PREDICTION-BASED TRADE-OFF PREWHITENING FOR SPEAKER LOCALIZATION
Zhiheng Wang, Hongsen He, Southwest University of Science and Technology, China; Jingdong Chen, Northwestern Polytechnical University, China; Jacob Benesty, University of Quebec, Canada; Yi Yu, Southwest University of Science and Technology, China
AASP-P5.3: Robust DOA estimation from deep acoustic imaging
Adrian Roman, University of Southern California, United States of America; Iran Roman, Juan Bello, New York University, United States of America
AASP-P5.4: STOFNET: SUPER-RESOLUTION TIME OF FLIGHT NETWORK
Christopher Hahne, Michel Hayoz, Raphael Sznitman, University of Bern, Switzerland
AASP-P5.5: SOUND FIELD INTERPOLATION FOR ROTATION-INVARIANT MULTICHANNEL ARRAY SIGNAL PROCESSING
Yukoh Wakabayashi, Toyohashi University of Technology, Japan; Kouei Yamaoka, Nobutaka Ono, Tokyo Metropolitan University, Japan
AASP-P5.6: SIMULTANEOUS INTERIOR AND EXTERIOR SOUND FIELD SYNTHESIS USING CYLINDRICAL AND SPHERICAL LOUDSPEAKER ARRAYS
Yo SASAKI, Yasushige Nakayama, NHK, Japan
AASP-P5.7: SPATIALCODEC: NEURAL SPATIAL SPEECH CODING
Zhongweiyang Xu, University of Illinois Urbana-Champaign, United States of America; Yong Xu, Vinay Kothapally, Tencent AI Lab, United States of America; Heming Wang, The Ohio State University, United States of America; Muqiao Yang, Carnegie Mellon University, United States of America; Dong Yu, Tencent AI Lab, United States of America
AASP-P5.8: ROTOR NOISE-AWARE NOISE COVARIANCE MATRIX ESTIMATION FOR UNMANNED AERIAL VEHICLE AUDITION
Benjamin Yen, Tokyo Institute of Technology, Japan; Yameizhen Li, Yusuke Hioka, University of Auckland, New Zealand
AASP-P5.9: SPARSE SOUND FIELD REPRESENTATION USING COMPLEX ORTHOGONAL MATCHING PURSUIT
Shaoheng Xu, The Australian National University, Australia; Jihui (Aimee) Zhang, University of Southampton, United Kingdom of Great Britain and Northern Ireland; Thushara Abhayapala, Amy Bastine, Wei-Ting Lai, Prasanga Samarasinghe, The Australian National University, Australia
AASP-P5.10: Broadband Personal Sound Zone Control in the Presence of Nonlinearities
Sankha Subhra Bhattacharjee, Srikanth Burra, Jesper Rindom Jensen, Aalborg University, Denmark; Liming Shi, Chongqing University of Posts and Telecommunications, China; Guoli Ping, Jingkai Weng, Huawei Technologies Co., Ltd, China; Mads Græsbøll Christensen, Aalborg University, Denmark
AASP-P5.11: 3D PERCEPTUAL SOUNDFIELD RECONSTRUCTION VIA VIRTUAL MICROPHONE SYNTHESIS
Ege Erdem, Middle East Technical University, Türkiye; Zoran Cvetkovic, King's College London, United Kingdom of Great Britain and Northern Ireland; Huseyin Hacihabiboglu, Middle East Technical University, Türkiye
AASP-P5.12: QUANTIFYING THE EFFECT OF SIMULATOR-BASED DATA AUGMENTATION FOR SPEECH RECOGNITION ON AUGMENTED REALITY GLASSES
Riku Arakawa, Carnegie Mellon University, United States of America; Mathieu Parvaix, Chiong Lai, Hakan Erdogan, Alex Olwal, Google Research, United States of America