SLP-P15: Distant Speech Recognition |
| Session Type: Poster |
| Time: Thursday, May 16, 15:30 - 17:30 |
| Location: Poster Area A, Ground Floor |
| Session Chair: Tomohiro Nakatani, NTT Corporation |
| SLP-P15.1: SPATIAL AND CHANNEL ATTENTION BASED CONVOLUTIONAL NEURAL NETWORKS FOR MODELING NOISY SPEECH |
| Manuscript Link: Click here to view manuscript on IEEE Xplore |
| Sirui Xu; The Ohio State University |
| Eric Fosler-Lussier; The Ohio State University |
| SLP-P15.2: ACOUSTIC MODELING FOR DISTANT MULTI-TALKER SPEECH RECOGNITION WITH SINGLE- AND MULTI-CHANNEL BRANCHES |
| Manuscript Link: Click here to view manuscript on IEEE Xplore |
| Naoyuki Kanda; Hitachi Ltd. |
| Yusuke Fujita; Hitachi Ltd. |
| Shota Horiguchi; Hitachi Ltd. |
| Rintaro Ikeshita; Hitachi Ltd. |
| Kenji Nagamatsu; Hitachi Ltd. |
| Shinji Watanabe; Johns Hopkins University |
| SLP-P15.3: MULTI-GEOMETRY SPATIAL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION |
| Manuscript Link: Click here to view manuscript on IEEE Xplore |
| Kenichi Kumatani; Amazon |
| Wu Minhua; Amazon |
| Shiva Sundaram; Amazon |
| Nikko Ström; Amazon |
| Björn Hoffmeister; Amazon |
| SLP-P15.4: FREQUENCY DOMAIN MULTI-CHANNEL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION |
| Manuscript Link: Click here to view manuscript on IEEE Xplore |
| Wu Minhua; Amazon |
| Kenichi Kumatani; Amazon |
| Shiva Sundaram; Amazon |
| Nikko Ström; Amazon |
| Björn Hoffmeister; Amazon |
| SLP-P15.5: ON REDUCING THE EFFECT OF SPEAKER OVERLAP FOR CHIME-5 |
| Manuscript Link: Click here to view manuscript on IEEE Xplore |
| Catalin Zorila; Toshiba Cambridge Research Laboratory |
| Rama Doddipatla; Toshiba Cambridge Research Laboratory |
| SLP-P15.6: A TWO-STAGE SINGLE-CHANNEL SPEAKER-DEPENDENT SPEECH SEPARATION APPROACH FOR CHIME-5 CHALLENGE |
| Manuscript Link: Click here to view manuscript on IEEE Xplore |
| Lei Sun; University of Science and Technology of China |
| Jun Du; University of Science and Technology of China |
| Tian Gao; University of Science and Technology of China |
| Yi Fang; iFlytek Company |
| Feng Ma; iFlytek Company |
| Jia Pan; iFlytek Company |
| Chin-Hui Lee; Georgia Institute of Technology |
| SLP-P15.7: JOINT OPTIMIZATION OF NEURAL NETWORK-BASED WPE DEREVERBERATION AND ACOUSTIC MODEL FOR ROBUST ONLINE ASR |
| Manuscript Link: Click here to view manuscript on IEEE Xplore |
| Jahn Heymann; Paderborn University |
| Lukas Drude; Paderborn University |
| Reinhold Häb-Umbach; Paderborn University |
| Keisuke Kinoshita; NTT Communication Science Laboratories |
| Tomohiro Nakatani; NTT Communication Science Laboratories |
| SLP-P15.8: INVESTIGATION INTO JOINT OPTIMIZATION OF SINGLE CHANNEL SPEECH ENHANCEMENT AND ACOUSTIC MODELING FOR ROBUST ASR |
| Manuscript Link: Click here to view manuscript on IEEE Xplore |
| Tobias Menne; RWTH Aachen University |
| Ralf Schlüter; RWTH Aachen University |
| Hermann Ney; RWTH Aachen University |
| SLP-P15.9: ACOUSTIC MODELING FOR OVERLAPPING SPEECH RECOGNITION: JHU CHIME-5 CHALLENGE SYSTEM |
| Manuscript Link: Click here to view manuscript on IEEE Xplore |
| Vimal Manohar; Johns Hopkins University |
| Szu-Jui Chen; Johns Hopkins University |
| Zhiqi Wang; Johns Hopkins University |
| Yusuke Fujita; Hitachi Ltd. |
| Shinji Watanabe; Johns Hopkins University |
| Sanjeev Khudanpur; Johns Hopkins University |
| SLP-P15.10: LESSONS FROM BUILDING ACOUSTIC MODELS WITH A MILLION HOURS OF SPEECH |
| Manuscript Link: Click here to view manuscript on IEEE Xplore |
| Sree Hari Krishnan Parthasarathi; Amazon |
| Nikko Ström; Amazon |