SPE-P4: Speech Analysis and Coding |
Session Type: Poster |
Time: Tuesday, 5 May, 16:30 - 18:30 |
Location: On-Demand |
Session Chairs: Anil Kumar Vuppala, International Institute of Information Technology, Hyderabad and Juan Rafael Orozco-Arroyave, Universidad de Antioquia, Medellín-Colombia
|
SPE-P4.1: GCI DETECTION FROM RAW SPEECH USING A FULLY-CONVOLUTIONAL NETWORK |
Luc Ardaillon; IRCAM |
Axel Roebel; IRCAM |
|
SPE-P4.2: FRAME-BASED OVERLAPPING SPEECH DETECTION USING CONVOLUTIONAL NEURAL NETWORKS |
Midia Yousefi; University of Texas at Dallas |
John H.L. Hansen; University of Texas at Dallas |
|
SPE-P4.3: LEARNING DOMAIN INVARIANT REPRESENTATIONS FOR CHILD-ADULT CLASSIFICATION FROM SPEECH |
Rimita Lahiri; University of Southern California |
Manoj Kumar; University of Southern California |
Somer Bishop; University of California, San Francisco |
Shrikanth Narayanan; University of Southern California |
|
SPE-P4.4: SYLNET: AN ADAPTABLE END-TO-END SYLLABLE COUNT ESTIMATOR FOR SPEECH |
Shreyas Seshadri; Aalto University |
Okko Räsänen; Tampere University of Technology |
|
SPE-P4.5: SINGLE FREQUENCY FILTER BANK BASED LONG-TERM AVERAGE SPECTRA FOR HYPERNASALITY DETECTION AND ASSESSMENT IN CLEFT LIP AND PALATE SPEECH |
Hashim Javid Mohammad; International Institute of Information Technology, Hyderabad |
Krishna Gurugubelli; International Institute of Information Technology, Hyderabad |
Anil Kumar Vuppala; International Institute of Information Technology, Hyderabad |
|
SPE-P4.6: AUTOREGRESSIVE PARAMETER ESTIMATION WITH DNN-BASED PRE-PROCESSING |
Zihao Cui; Beijing University of Technology |
Changchun Bao; Beijing University of Technology |
Jesper Kjær Nielsen; Aalborg University |
Mads Græsbøll Christensen; Aalborg University |
|
SPE-P4.7: ENHANCEMENT OF CODED SPEECH USING A MASK-BASED POST-FILTER |
Srikanth Korse; Fraunhofer Institute for Integrated Circuits IIS |
Kishan Gupta; AudioLabs-IIS |
Guillaume Fuchs; AudioLabs-IIS |
|
SPE-P4.8: ROBUST LOW RATE SPEECH CODING BASED ON CLONED NETWORKS AND WAVENET |
Felicia Lim; Google |
W. Bastiaan Kleijn; Victoria University of Wellington |
Michael Chinen; Google |
Jan Skoglund; Google |
|
SPE-P4.9: MIXTURE FACTORIZED AUTO-ENCODER FOR UNSUPERVISED HIERARCHICAL DEEP FACTORIZATION OF SPEECH SIGNAL |
Zhiyuan Peng; Chinese University of Hong Kong |
Siyuan Feng; Chinese University of Hong Kong |
Tan Lee; Chinese University of Hong Kong |
|
SPE-P4.10: A NOVEL APPROACH FOR INTELLIGIBILITY ASSESSMENT IN DYSARTHRIC SUBJECTS |
Ayush Tripathi; Tata Consultancy Services |
Swapnil Bhosale; Tata Consultancy Services |
Sunil Kumar Kopparapu; Tata Consultancy Services |
|
SPE-P4.11: VOICE BASED CLASSIFICATION OF PATIENTS WITH AMYOTROPHIC LATERAL SCLEROSIS, PARKINSON'S DISEASE AND HEALTHY CONTROLS WITH CNN-LSTM USING TRANSFER LEARNING |
Jhansi Mallela; Indian Institute of Science |
Aravind Illa; Indian Institute of Science |
Suhas B N; Indian Institute of Science |
Sathvik Udupa; Indian Institute of Science |
Yamini Belur; National Institute of Mental Health and Neuro Sciences |
Nalini Atchayaram; National Institute of Mental Health and Neuro Sciences |
Ravi Yadav; National Institute of Mental Health and Neuro Sciences |
Pradeep Reddy; National Institute of Mental Health and Neuro Sciences |
Dipanjan Gope; Indian Institute of Science |
Prasanta Kumar Ghosh; Indian Institute of Science |
|
SPE-P4.12: ANALYSIS OF ACOUSTIC FEATURES FOR SPEECH SOUND BASED CLASSIFICATION OF ASTHMATIC AND HEALTHY SUBJECTS |
Shivani Yadav; Indian Institute of Science |
Merugu Keerthana; Rajiv Gandhi University of Knowledge Technologies, Kadapa |
Dipanjan Gope; Indian Institute of Science |
Uma Maheswari Krishnaswamy; St. John's National Academy of Health Sciences |
Prasanta Kumar Ghosh; Indian Institute of Science |
|