SPE-P10: Speaker Diarization and Characterization |
| Session Type: Poster |
| Time: Thursday, 7 May, 09:00 - 11:00 |
| Location: On-Demand |
| Virtual Session: View on Virtual Platform |
| Session Chairs: Paola Garcia Perera, Johns Hopkins University and Takafumi Koshinaka, NEC Corporation |
| SPE-P10.1: TOWARD BETTER SPEAKER EMBEDDINGS: AUTOMATED COLLECTION OF SPEECH SAMPLES FROM UNKNOWN DISTINCT SPEAKERS |
| Minh Pham; Worcester Polytechnic Institute |
| Zeqian Li; Worcester Polytechnic Institute |
| Jacob Whitehill; Worcester Polytechnic Institute |
| SPE-P10.2: CHANNEL ADVERSARIAL TRAINING FOR SPEAKER VERIFICATION AND DIARIZATION |
| Chau Luu; University of Edinburgh |
| Peter Bell; University of Edinburgh |
| Steve Renals; University of Edinburgh |
| SPE-P10.3: PROGRESSIVE MULTI-TARGET NETWORK BASED SPEECH ENHANCEMENT WITH SNR-PRESELECTION FOR ROBUST SPEAKER DIARIZATION |
| Lei Sun; University of Science and Technology of China |
| Jun Du; University of Science and Technology of China |
| Xueyang Zhang; IFLYTEK Research |
| Tian Gao; IFLYTEK Research |
| Xin Fang; IFLYTEK Research |
| Chin-Hui Lee; Georgia Institute of Technology |
| SPE-P10.4: IMPROVED LARGE-MARGIN SOFTMAX LOSS FOR SPEAKER DIARISATION |
| Yassir Fathullah; University of Cambridge |
| Chao Zhang; University of Cambridge |
| Philip Woodland; University of Cambridge |
| SPE-P10.5: SPEAKER DIARIZATION WITH SESSION-LEVEL SPEAKER EMBEDDING REFINEMENT USING GRAPH NEURAL NETWORKS |
| Jixuan Wang; University of Toronto |
| Xiong Xiao; Microsoft |
| Jian Wu; Microsoft |
| Ranjani Ramamurthy; Microsoft |
| Frank Rudzicz; University of Toronto |
| Michael Brudno; University of Toronto |
| SPE-P10.6: OVERLAP-AWARE DIARIZATION: RESEGMENTATION USING NEURAL END-TO-END OVERLAPPED SPEECH DETECTION |
| Latané Bullock; Rice University |
| Hervé Bredin; LIMSI, CNRS, Univ. Paris-Sud, Universite Paris-Saclay |
| Leibny Paola Garcia Perera; Johns Hopkins University |
| SPE-P10.7: ON THE IMPORTANCE OF VOCAL TRACT CONSTRICTION FOR SPEAKER CHARACTERIZATION: THE WHISPERED SPEECH STUDY |
| Rohan Kumar Das; National University of Singapore |
| Haizhou Li; National University of Singapore |
| SPE-P10.8: PYANNOTE.AUDIO: NEURAL BUILDING BLOCKS FOR SPEAKER DIARIZATION |
| Hervé Bredin; LIMSI, CNRS, Université Paris-Saclay |
| Ruiqing Yin; LIMSI, CNRS, Université Paris-Saclay |
| Juan Manuel Coria; LIMSI, CNRS, Univ. Paris-Sud, Université Paris-Saclay |
| Gregory Gelly; LIMSI, CNRS |
| Pavel Korshunov; Idiap Research Institute |
| Marvin Lavechin; Ecole Normale Supérieure/INRIA |
| Diego Fustes; Toptal LLC |
| Hadrien Titeux; Université PSL |
| Wassim Bouaziz; Ecole Normale Supérieure/INRIA |
| Marie-Philippe Gill; Ecole de Technologie Supérieure, Université du Québec |
| SPE-P10.9: SPEAKER EMBEDDINGS INCORPORATING ACOUSTIC CONDITIONS FOR DIARIZATION |
| Yosuke Higuchi; Waseda University |
| Masayuki Suzuki; IBM |
| Gakuto Kurata; IBM |
| SPE-P10.10: SUPERVISED ONLINE DIARIZATION WITH SAMPLE MEAN LOSS FOR MULTI-DOMAIN DATA |
| Enrico Fini; PerVoice |
| Alessio Brutti; Fondazione Bruno Kessler |
| SPE-P10.11: INVESTIGATION OF SPECAUGMENT FOR DEEP SPEAKER EMBEDDING LEARNING |
| Shuai Wang; Shanghai Jiao Tong University |
| Johan Rohdin; Brno University of Technology |
| Oldřich Plchot; Brno University of Technology |
| Lukáš Burget; Brno University of Technology |
| Kai Yu; Shanghai Jiao Tong University |
| Jan Cernocky; Brno University of Technology |