TH2.SC4.1

A VIDEO VISION TRANSFORMER FOR SOUND SOURCE LOCALIZATION

Haruto Yokota, Tokyo Institute of Technology, Japan; Mert Bozkurtlar, Tokyo Institute of Technology/Istanbul Technical University, Japan; Benjamin Yen, Tokyo Institute of Technology, Japan; Katsutoshi Itoyama, Tokyo Institute of Technology/Honda Research Institute Japan Co., Ltd., Japan; Kenji Nishida, Kazuhiro Nakadai, Tokyo Institute of Technology, Japan

Session:
TH2.SC4: Microphone Array Processing Lecture

Track:
ASMSP - Acoustic, Speech and Music Signal Processing

Location:
Saint Clair 4

Presentation Time:
Thu, 29 Aug, 14:00 - 14:20 France Time (UTC +1)

Session Co-Chairs:
Simon Doclo, Oldenburg University and Stefan Goetze, University of Sheffield
Presentation
Discussion
Resources
No resources available.