AUD-P11: Signal Enhancement and Restoration II |
Session Type: Poster |
Time: Friday, 8 May, 11:45 - 13:45 |
Location: On-Demand |
Session Chair: Jesper Rindom Jensen, Aalborg University |
AUD-P11.1: CONSISTENCY-AWARE MULTI-CHANNEL SPEECH ENHANCEMENT USING DEEP NEURAL NETWORKS |
Yoshiki Masuyama; Waseda University |
Masahito Togami; LINE Corporation |
Tatsuya Komatsu; LINE Corporation |
AUD-P11.2: PHASE RECONSTRUCTION BASED ON RECURRENT PHASE UNWRAPPING WITH DEEP NEURAL NETWORKS |
Yoshiki Masuyama; Waseda University |
Kohei Yatabe; Waseda University |
Yuma Koizumi; NTT Corporation |
Yasuhiro Oikawa; Waseda University |
Noboru Harada; NTT Corporation |
AUD-P11.3: PERFORMANCE STUDY OF A CONVOLUTIONAL TIME-DOMAIN AUDIO SEPARATION NETWORK FOR REAL-TIME SPEECH DENOISING |
Samuel Sonning; Google |
Christian Schüldt; Google |
Hakan Erdogan; Google |
Scott Wisdom; Google |
AUD-P11.4: CHANNEL-ATTENTION DENSE U-NET FOR MULTICHANNEL SPEECH ENHANCEMENT |
Bahareh Tolooshams; Harvard University |
Ritwik Giri; Amazon Web Services |
Andrew Song; Massachusetts Institute of Technology |
Umut Isik; Amazon Web Services |
Arvindh Krishnaswamy; Amazon Web Services |
AUD-P11.5: A COMPOSITE DNN ARCHITECTURE FOR SPEECH ENHANCEMENT |
Yochai Yemini; Bar-Ilan University |
Shlomo E. Chazan; Bar-Ilan University |
Jacob Goldberger; Bar-Ilan University |
Sharon Gannot; Bar-Ilan University |
AUD-P11.6: GEOMETRICALLY CONSTRAINED INDEPENDENT VECTOR ANALYSIS FOR DIRECTIONAL SPEECH ENHANCEMENT |
Li Li; University of Tsukuba |
Kazuhito Koishida; Microsoft Corporation |
AUD-P11.7: REAL-TIME SPEECH ENHANCEMENT USING EQUILIBRIATED RNN |
Daiki Takeuchi; Waseda University |
Kohei Yatabe; Waseda University |
Yuma Koizumi; NTT Corporation |
Yasuhiro Oikawa; Waseda University |
Noboru Harada; NTT Corporation |
AUD-P11.8: SUBSPACE-BASED SPEECH CORRELATION VECTOR ESTIMATION FOR SINGLE-MICROPHONE MULTI-FRAME MVDR FILTERING |
Dörte Fischer; University of Oldenburg |
Simon Doclo; University of Oldenburg |
AUD-P11.9: SPEECH ENHANCEMENT USING A TWO-STAGE NETWORK FOR AN EFFICIENT BOOSTING STRATEGY |
Juntae Kim; Kakao |
AUD-P11.10: TIME-FREQUENCY LOSS FOR CNN BASED SPEECH SUPER-RESOLUTION |
Heming Wang; Ohio State University |
DeLiang Wang; Ohio State University |
AUD-P11.11: TIME-DOMAIN NEURAL NETWORK APPROACH FOR SPEECH BANDWIDTH EXTENSION |
Xiang Hao; Northwestern Polytechnical University |
Chenglin Xu; Nanyang Technological University |
Nana Hou; Nanyang Technological University |
Lei Xie; Northwestern Polytechnical University |
Eng Siong Chng; Nanyang Technological University |
Haizhou Li; National University of Singapore |
AUD-P11.12: WEIGHTED SPEECH DISTORTION LOSSES FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT |
Yangyang Xia; Carnegie Mellon University |
Sebastian Braun; Microsoft Research |
Chandan Reddy; Microsoft Corporation |
Harishchandra Dubey; Microsoft Corporation |
Ross Cutler; Microsoft Corporation |
Ivan Tashev; Microsoft Research |