Technical Program

Click on the icon to view the manuscript on IEEE XPlore in the IEEE ICASSP 2020 Open Preview.

SPE-P10: Speaker Diarization and Characterization

Session Type: Poster
Time: Thursday, 7 May, 09:00 - 11:00
Location: On-Demand
Virtual Session: View on Virtual Platform
Session Chairs: Paola Garcia Perera, Johns Hopkins University and Takafumi Koshinaka, NEC Corporation
 
 SPE-P10.1: TOWARD BETTER SPEAKER EMBEDDINGS: AUTOMATED COLLECTION OF SPEECH SAMPLES FROM UNKNOWN DISTINCT SPEAKERS
         Minh Pham; Worcester Polytechnic Institute
         Zeqian Li; Worcester Polytechnic Institute
         Jacob Whitehill; Worcester Polytechnic Institute
 
 SPE-P10.2: CHANNEL ADVERSARIAL TRAINING FOR SPEAKER VERIFICATION AND DIARIZATION
         Chau Luu; University of Edinburgh
         Peter Bell; University of Edinburgh
         Steve Renals; University of Edinburgh
 
 SPE-P10.3: PROGRESSIVE MULTI-TARGET NETWORK BASED SPEECH ENHANCEMENT WITH SNR-PRESELECTION FOR ROBUST SPEAKER DIARIZATION
         Lei Sun; University of Science and Technology of China
         Jun Du; University of Science and Technology of China
         Xueyang Zhang; IFLYTEK Research
         Tian Gao; IFLYTEK Research
         Xin Fang; IFLYTEK Research
         Chin-Hui Lee; Georgia Institute of Technology
 
 SPE-P10.4: IMPROVED LARGE-MARGIN SOFTMAX LOSS FOR SPEAKER DIARISATION
         Yassir Fathullah; University of Cambridge
         Chao Zhang; University of Cambridge
         Philip Woodland; University of Cambridge
 
 SPE-P10.5: SPEAKER DIARIZATION WITH SESSION-LEVEL SPEAKER EMBEDDING REFINEMENT USING GRAPH NEURAL NETWORKS
         Jixuan Wang; University of Toronto
         Xiong Xiao; Microsoft
         Jian Wu; Microsoft
         Ranjani Ramamurthy; Microsoft
         Frank Rudzicz; University of Toronto
         Michael Brudno; University of Toronto
 
 SPE-P10.6: OVERLAP-AWARE DIARIZATION: RESEGMENTATION USING NEURAL END-TO-END OVERLAPPED SPEECH DETECTION
         Latané Bullock; Rice University
         Hervé Bredin; LIMSI, CNRS, Univ. Paris-Sud, Universite Paris-Saclay
         Leibny Paola Garcia Perera; Johns Hopkins University
 
 SPE-P10.7: ON THE IMPORTANCE OF VOCAL TRACT CONSTRICTION FOR SPEAKER CHARACTERIZATION: THE WHISPERED SPEECH STUDY
         Rohan Kumar Das; National University of Singapore
         Haizhou Li; National University of Singapore
 
 SPE-P10.8: PYANNOTE.AUDIO: NEURAL BUILDING BLOCKS FOR SPEAKER DIARIZATION
         Hervé Bredin; LIMSI, CNRS, Université Paris-Saclay
         Ruiqing Yin; LIMSI, CNRS, Université Paris-Saclay
         Juan Manuel Coria; LIMSI, CNRS, Univ. Paris-Sud, Université Paris-Saclay
         Gregory Gelly; LIMSI, CNRS
         Pavel Korshunov; Idiap Research Institute
         Marvin Lavechin; Ecole Normale Supérieure/INRIA
         Diego Fustes; Toptal LLC
         Hadrien Titeux; Université PSL
         Wassim Bouaziz; Ecole Normale Supérieure/INRIA
         Marie-Philippe Gill; Ecole de Technologie Supérieure, Université du Québec
 
 SPE-P10.9: SPEAKER EMBEDDINGS INCORPORATING ACOUSTIC CONDITIONS FOR DIARIZATION
         Yosuke Higuchi; Waseda University
         Masayuki Suzuki; IBM
         Gakuto Kurata; IBM
 
 SPE-P10.10: SUPERVISED ONLINE DIARIZATION WITH SAMPLE MEAN LOSS FOR MULTI-DOMAIN DATA
         Enrico Fini; PerVoice
         Alessio Brutti; Fondazione Bruno Kessler
 
 SPE-P10.11: INVESTIGATION OF SPECAUGMENT FOR DEEP SPEAKER EMBEDDING LEARNING
         Shuai Wang; Shanghai Jiao Tong University
         Johan Rohdin; Brno University of Technology
         Oldřich Plchot; Brno University of Technology
         Lukáš Burget; Brno University of Technology
         Kai Yu; Shanghai Jiao Tong University
         Jan Cernocky; Brno University of Technology