AUD-P11: Signal Enhancement and Restoration II |
| Session Type: Poster |
| Time: Friday, 8 May, 11:45 - 13:45 |
| Location: On-Demand |
| Virtual Session: View on Virtual Platform |
| Session Chair: Jesper Rindom Jensen, Aalborg University |
| AUD-P11.1: CONSISTENCY-AWARE MULTI-CHANNEL SPEECH ENHANCEMENT USING DEEP NEURAL NETWORKS |
| Yoshiki Masuyama; Waseda University |
| Masahito Togami; LINE Corporation |
| Tatsuya Komatsu; LINE Corporation |
| AUD-P11.2: PHASE RECONSTRUCTION BASED ON RECURRENT PHASE UNWRAPPING WITH DEEP NEURAL NETWORKS |
| Yoshiki Masuyama; Waseda University |
| Kohei Yatabe; Waseda University |
| Yuma Koizumi; NTT Corporation |
| Yasuhiro Oikawa; Waseda University |
| Noboru Harada; NTT Corporation |
| AUD-P11.3: PERFORMANCE STUDY OF A CONVOLUTIONAL TIME-DOMAIN AUDIO SEPARATION NETWORK FOR REAL-TIME SPEECH DENOISING |
| Samuel Sonning; Google |
| Christian Schüldt; Google |
| Hakan Erdogan; Google |
| Scott Wisdom; Google |
| AUD-P11.4: CHANNEL-ATTENTION DENSE U-NET FOR MULTICHANNEL SPEECH ENHANCEMENT |
| Bahareh Tolooshams; Harvard University |
| Ritwik Giri; Amazon Web Services |
| Andrew Song; Massachusetts Institute of Technology |
| Umut Isik; Amazon Web Services |
| Arvindh Krishnaswamy; Amazon Web Services |
| AUD-P11.5: A COMPOSITE DNN ARCHITECTURE FOR SPEECH ENHANCEMENT |
| Yochai Yemini; Bar-Ilan University |
| Shlomo E. Chazan; Bar-Ilan University |
| Jacob Goldberger; Bar-Ilan University |
| Sharon Gannot; Bar-Ilan University |
| AUD-P11.6: GEOMETRICALLY CONSTRAINED INDEPENDENT VECTOR ANALYSIS FOR DIRECTIONAL SPEECH ENHANCEMENT |
| Li Li; University of Tsukuba |
| Kazuhito Koishida; Microsoft Corporation |
| AUD-P11.7: REAL-TIME SPEECH ENHANCEMENT USING EQUILIBRIATED RNN |
| Daiki Takeuchi; Waseda University |
| Kohei Yatabe; Waseda University |
| Yuma Koizumi; NTT Corporation |
| Yasuhiro Oikawa; Waseda University |
| Noboru Harada; NTT Corporation |
| AUD-P11.8: SUBSPACE-BASED SPEECH CORRELATION VECTOR ESTIMATION FOR SINGLE-MICROPHONE MULTI-FRAME MVDR FILTERING |
| Dörte Fischer; University of Oldenburg |
| Simon Doclo; University of Oldenburg |
| AUD-P11.9: SPEECH ENHANCEMENT USING A TWO-STAGE NETWORK FOR AN EFFICIENT BOOSTING STRATEGY |
| Juntae Kim; KaKao |
| AUD-P11.10: TIME-FREQUENCY LOSS FOR CNN BASED SPEECH SUPER-RESOLUTION |
| Heming Wang; Ohio State University |
| Deliang Wang; Ohio State University |
| AUD-P11.11: TIME-DOMAIN NEURAL NETWORK APPROACH FOR SPEECH BANDWIDTH EXTENSION |
| Xiang Hao; Northwestern Polytechnical University |
| Chenglin Xu; Nanyang Technological University |
| Nana Hou; Nanyang Technological University |
| Lei Xie; Northwestern Polytechnical University |
| Eng Siong Chng; Nanyang Technological University |
| Haizhou Li; National University of Singapore |
| AUD-P11.12: WEIGHTED SPEECH DISTORTION LOSSES FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT |
| Yangyang Xia; Carnegie Mellon University |
| Sebastian Braun; Microsoft Research |
| Chandan Reddy; Microsoft Corporation |
| Harishchandra Dubey; Microsoft Corporation |
| Ross Cutler; Microsoft Corporation |
| Ivan Tashev; Microsoft Research |