SLP-P15: Distant Speech Recognition |
Session Type: Poster |
Time: Thursday, May 16, 15:30 - 17:30 |
Location: Poster Area A, Ground Floor |
Session Chair: Tomohiro Nakatani, NTT Corporation |
SLP-P15.1: SPATIAL AND CHANNEL ATTENTION BASED CONVOLUTIONAL NEURAL NETWORKS FOR MODELING NOISY SPEECH |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Sirui Xu; The Ohio State University |
Eric Fosler-Lussier; The Ohio State University |
SLP-P15.2: ACOUSTIC MODELING FOR DISTANT MULTI-TALKER SPEECH RECOGNITION WITH SINGLE- AND MULTI-CHANNEL BRANCHES |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Naoyuki Kanda; Hitachi Ltd. |
Yusuke Fujita; Hitachi Ltd. |
Shota Horiguchi; Hitachi Ltd. |
Rintaro Ikeshita; Hitachi Ltd. |
Kenji Nagamatsu; Hitachi Ltd. |
Shinji Watanabe; Johns Hopkins University |
SLP-P15.3: MULTI-GEOMETRY SPATIAL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Kenichi Kumatani; Amazon |
Wu Minhua; Amazon |
Shiva Sundaram; Amazon |
Nikko Ström; Amazon |
Björn Hoffmeister; Amazon |
SLP-P15.4: FREQUENCY DOMAIN MULTI-CHANNEL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Wu Minhua; Amazon |
Kenichi Kumatani; Amazon |
Shiva Sundaram; Amazon |
Nikko Ström; Amazon |
Björn Hoffmeister; Amazon |
SLP-P15.5: ON REDUCING THE EFFECT OF SPEAKER OVERLAP FOR CHIME-5 |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Catalin Zorila; Toshiba Cambridge Research Laboratory |
Rama Doddipatla; Toshiba Cambridge Research Laboratory |
SLP-P15.6: A TWO-STAGE SINGLE-CHANNEL SPEAKER-DEPENDENT SPEECH SEPARATION APPROACH FOR CHIME-5 CHALLENGE |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Lei Sun; University of Science and Technology of China |
Jun Du; University of Science and Technology of China |
Tian Gao; University of Science and Technology of China |
Yi Fang; iFlytek Company |
Feng Ma; iFlytek Company |
Jia Pan; iFlytek Company |
Chin-Hui Lee; Georgia Institute of Technology |
SLP-P15.7: JOINT OPTIMIZATION OF NEURAL NETWORK-BASED WPE DEREVERBERATION AND ACOUSTIC MODEL FOR ROBUST ONLINE ASR |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Jahn Heymann; Paderborn University |
Lukas Drude; Paderborn University |
Reinhold Häb-Umbach; Paderborn University |
Keisuke Kinoshita; NTT Communication Science Laboratories |
Tomohiro Nakatani; NTT Communication Science Laboratories |
SLP-P15.8: INVESTIGATION INTO JOINT OPTIMIZATION OF SINGLE CHANNEL SPEECH ENHANCEMENT AND ACOUSTIC MODELING FOR ROBUST ASR |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Tobias Menne; RWTH Aachen University |
Ralf Schlüter; RWTH Aachen University |
Hermann Ney; RWTH Aachen University |
SLP-P15.9: ACOUSTIC MODELING FOR OVERLAPPING SPEECH RECOGNITION: JHU CHIME-5 CHALLENGE SYSTEM |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Vimal Manohar; Johns Hopkins University |
Szu-Jui Chen; Johns Hopkins University |
Zhiqi Wang; Johns Hopkins University |
Yusuke Fujita; Hitachi Ltd. |
Shinji Watanabe; Johns Hopkins University |
Sanjeev Khudanpur; Johns Hopkins University |
SLP-P15.10: LESSONS FROM BUILDING ACOUSTIC MODELS WITH A MILLION HOURS OF SPEECH |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Sree Hari Krishnan Parthasarathi; Amazon |
Nikko Ström; Amazon |