GC-L7.2

SIR-PROGRESSIVE AUDIO-VISUAL TF-GRIDNET WITH ASR-AWARE SELECTOR FOR TARGET SPEAKER EXTRACTION IN MISP 2023 CHALLENGE

Zhongshu Hou, Tianchi Sun, Nanjing University, China; Yuxiang Hu, Changbao Zhu, Horizon Robotics, China; Kai Chen, Jing Lu, Nanjing University, China

Session:
GC-L7: Multimodal Information Based Speech Processing (MISP) 2023 Challenge Lecture

Track:
Grand Challenges

Location:
Room E8

Presentation Time:
Fri, 19 Apr, 13:30 - 13:50 (UTC +9)

Session Co-Chairs:
Shinji Watanabe, Carnegie Mellon University and Jun Du, University of Science and Technology of China
Presentation
Discussion
Resources
No resources available.
Contacts