Thu AM1.P.3
ON FREQUENCY-WISE NORMALIZATIONS FOR BETTER RECORDING DEVICE GENERALIZATION IN AUDIO SPECTROGRAM TRANSFORMERS
Paul Primus, Gerhard Widmer, Johannes Kepler University, Austria
Session:
Thu AM1.P: Audio Classification and Recognition Poster
Track:
ASMSP - Acoustic, Speech and Music Signal Processing
Location:
Fennia Foyer
Presentation Time:
Thu, 7 Sep, 10:30 - 12:30 Finland Time (UTC +3)
Session Chair:
Nilesh Madhu, Ghent University - imec
Presentation
Discussion
Resources
No resources available.
Session Thu AM1.P
Thu AM1.P.1: IMPROVING SPEECH EMOTION RECOGNITION WITH DATA EXPRESSION AWARE MULTI-TASK LEARNING
Pooja Kumawat, Aurobinda Routray, Indian Institute of Technology Kharagpur, India
Thu AM1.P.3: ON FREQUENCY-WISE NORMALIZATIONS FOR BETTER RECORDING DEVICE GENERALIZATION IN AUDIO SPECTROGRAM TRANSFORMERS
Paul Primus, Gerhard Widmer, Johannes Kepler University, Austria
Thu AM1.P.4: TOPIC IDENTIFICATION FOR SPONTANEOUS SPEECH: ENRICHING AUDIO FEATURES WITH EMBEDDED LINGUISTIC INFORMATION
Dejan Porjazovski, Tamas Grosz, Mikko Kurimo, Aalto University, Finland
Thu AM1.P.5: PROBING STATISTICAL REPRESENTATIONS FOR END-TO-END ASR
Anna Ollerenshaw, Md Asif Jalal, Thomas Hain, University of Sheffield, United Kingdom
Thu AM1.P.6: PrimaDNN':A Characteristics-aware DNN Customization for Singing Technique Detection
Yuya Yamamoto, University of Tsukuba, Japan; Juhan Nam, KAIST, Japan; Hiroko Terasawa, University of Tsukuba, Japan
Thu AM1.P.7: USING DEEP NEURAL NETWORKS FOR DETECTING DEPRESSION FROM SPEECH
Mirela Gheorghe, Serban Mihalache, Dragos Burileanu, University “Politehnica” of Bucharest, Romania
Thu AM1.P.8: STUDY OF SPEECH EMOTION RECOGNITION USING BLSTM WITH ATTENTION
Dalia Sherman, Gershon Hazan, Sharon Gannot, Bar-Ilan University, Israel
Thu AM1.P.9: AUDIO-BASED SEQUENTIAL MUSIC RECOMMENDATION
Rodrigo Borges, Marcelo Queiroz, Universidade de São Paulo, Brazil