WE1.PA3.4

Text-Queried Target Sound Event Localization

Jinzheng Zhao, University of Surrey, United Kingdom; Xinyuan Qian, University of Science and Technology Beijing, China; Yong Xu, Tencent AI Lab, United States; Haohe Liu, University of Surrey, United Kingdom; Yin Cao, Xi’an Jiaotong Liverpool University, China; Davide Berghi, Wenwu Wang, University of Surrey, United Kingdom

Session:
WE1.PA3: Microphone Array Processing and Spatial Audio Poster

Track:
ASMSP - Acoustic, Speech and Music Signal Processing

Location:
Poster Area 3

Presentation Time:
Wed, 28 Aug, 10:30 - 12:30 France Time (UTC +1)

Session Co-Chairs:
Alexander Bertrand, KU Leuven and Karim ABED-MERAIM, Université d'Orléans
Presentation
Discussion
Resources
No resources available.
Session WE1.PA3
WE1.PA3.1: CONSTANT DIRECTIVITY LOUDSPEAKER BEAMFORMING
Yuancheng Luo, Amazon Inc., United States
WE1.PA3.2: MICROPHONE PAIR SELECTION FOR SOUND SOURCE LOCALIZATION IN MASSIVE ARRAYS OF SPATIALLY DISTRIBUTED MICROPHONES
Bilgesu Çakmak, Thomas Dietzen, Katholieke Universiteit Leuven (KU Leuven), Belgium; Randall Ali, University of Surrey, United Kingdom; Patrick Naylor, Imperial College London, United Kingdom; Toon van Waterschoot, Katholieke Universiteit Leuven (KU Leuven), Belgium
WE1.PA3.3: Unsupervised Training of Neural Network-based Virtual Microphone Estimator
Jiachen Wang, Tomoki Toda, Nagoya University, Japan
WE1.PA3.4: Text-Queried Target Sound Event Localization
Jinzheng Zhao, University of Surrey, United Kingdom; Xinyuan Qian, University of Science and Technology Beijing, China; Yong Xu, Tencent AI Lab, United States; Haohe Liu, University of Surrey, United Kingdom; Yin Cao, Xi’an Jiaotong Liverpool University, China; Davide Berghi, Wenwu Wang, University of Surrey, United Kingdom
WE1.PA3.5: SOUND-INTENSITY-BASED DIRECTION OF ARRIVAL ESTIMATION USING CENTRO-SYMMETRIC SENSOR ARRAYS
Tommaso Gambini, Davide Albertini, Alberto Bernardini, Politecnico Di Milano, Italy
WE1.PA3.6: A PROCESS FOR CALIBRATING HRTFS BASED ON DIFFERENTIABLE IMPLICIT REPRESENTATIONS AND DOMAIN ADVERSARIAL LEARNING
Thiago Lobato, Roland Sottek, HEAD acoustics GmbH, Germany
WE1.PA3.7: Room Impulse Response Estimation using Optimal Transport: Simulation-Informed Inference
David Sundström, Lund University, Sweden; Anton Björkman, Aalto University, Finland; Andreas Jakobsson, Lund University, Sweden; Filip Elvander, Aalto University, Finland
WE1.PA3.8: SPEECH DENOISING IN MULTI-NOISE SOURCE ENVIRONMENTS USING MULTIPLE MICROPHONE DEVICES VIA RELATIVE TRANSFER MATRIX
Manish Kumar, Lachlan Birnie, Thushara Abhayapala, Sandra Arcos Holzinger, Amy Bastine, Daniel Grixti-Cheng, Prasanga Samarasinghe, The Australian National University, Australia
WE1.PA3.9: SOUND EVENT DETECTION AND LOCALIZATION WITH DISTANCE ESTIMATION
Daniel Krause, Archontis Politis, Annamaria Mesaros, Tampere University, Finland
WE1.PA3.10: ROBUST SIGNAL AND NOISE COVARIANCE MATRIX ESTIMATION USING RIEMANNIAN OPTIMIZATION
Jesper Brunnström, Marc Moonen, KU Leuven, Belgium; Filip Elvander, Aalto University, Finland