AUD-L4: Audio and Speech Source Separation |
| Session Type: Lecture |
| Time: Wednesday, 6 May, 11:30 - 13:30 |
| Location: On-Demand |
| Virtual Session: View on Virtual Platform |
| Session Chair: Scott Wisdom, Google |
| AUD-L4.1: LEARNING TO SEPARATE SOUNDS FROM WEAKLY LABELED SCENES |
| Fatemeh Pishdadian; Northwestern University |
| Gordon Wichern; Mitsubishi Electric Research Laboratories (MERL) |
| Jonathan Le Roux; Mitsubishi Electric Research Laboratories (MERL) |
| AUD-L4.2: IMPROVING UNIVERSAL SOUND SEPARATION USING SOUND CLASSIFICATION |
| Efthymios Tzinis; University of Illinois at Urbana-Champaign |
| Scott Wisdom; Google |
| John R. Hershey; Google |
| Aren Jansen; Google |
| Daniel P. W. Ellis; Google Research |
| AUD-L4.3: SOURCE SEPARATION WITH WEAKLY LABELLED DATA: AN APPROACH TO COMPUTATIONAL AUDITORY SCENE ANALYSIS |
| Qiuqiang Kong; ByteDance |
| Yuxuan Wang; ByteDance |
| Xuchen Song; ByteDance |
| Yin Cao; University of Surrey |
| Wenwu Wang; University of Surrey |
| Mark D. Plumbley; University of Surrey |
| AUD-L4.4: BOOSTED LOCALITY SENSITIVE HASHING: DISCRIMINATIVE BINARY CODES FOR SOURCE SEPARATION |
| Sunwoo Kim; Indiana University Bloomington |
| Haici Yang; Indiana University Bloomington |
| Minje Kim; Indiana University Bloomington |
| AUD-L4.5: A FREQUENCY-DOMAIN BSS METHOD BASED ON L1 NORM, UNITARY CONSTRAINT, AND CAYLEY TRANSFORM |
| Satoru Emura; NTT |
| Hiroshi Sawada; NTT |
| Shoko Araki; NTT |
| Noboru Harada; NTT |
| AUD-L4.6: END-TO-END NON-NEGATIVE AUTOENCODERS FOR SOUND SOURCE SEPARATION |
| Shrikant Venkataramani; University of Illinois at Urbana-Champaign |
| Efthymios Tzinis; University of Illinois at Urbana-Champaign |
| Paris Smaragdis; University of Illinois at Urbana-Champaign, Adobe Research |