ASMSP-P4.2
Baseline Systems and Evaluation Metrics for Spatial Semantic Segmentation of Sound Scenes
Binh Thien Nguyen, Masahiro Yasuda, Daiki Takeuchi, Daisuke Niizumi, Yasunori Ohishi, Noboru Harada, NTT Corporation, Japan
Session:
ASMSP-P4: Audio Quality Assessment and Classification Poster
Track:
ASMSP - Acoustic, Speech and Music Signal Processing
Location:
Poster Area E
Presentation Time:
Wed, 10 Sep, 09:00 - 10:40 Italy Time (UTC +2)
Session Co-Chairs:
Dimitra Emmanouilidou, Microsoft Research and Tetsuji Ogawa, Waseda University
Presentation
Discussion
Resources
No resources available.
Session ASMSP-P4
ASMSP-P4.1: Necessity of Voice Sample Selection in Qualification Tests for Crowdsourced Subjective Audio Quality Evaluation
Takuma Yabe, Moe Yaegashi, Teppei Nakano, Tetsuji Ogawa, Waseda University, Japan
ASMSP-P4.2: Baseline Systems and Evaluation Metrics for Spatial Semantic Segmentation of Sound Scenes
Binh Thien Nguyen, Masahiro Yasuda, Daiki Takeuchi, Daisuke Niizumi, Yasunori Ohishi, Noboru Harada, NTT Corporation, Japan
ASMSP-P4.3: PRUNING STATE SPACE MODELS WITH MODEL ORDER REDUCTION FOR EFFICIENT RAW AUDIO CLASSIFICATION
Matthias Bittner, Daniel Schnöll, Dominik Dallinger, Matthias Wess, Axel Jantsch, TU Wien, Austria
ASMSP-P4.4: LOCAL EQUIVARIANCE ERROR-BASED METRICS FOR EVALUATING SAMPLING-FREQUENCY-INDEPENDENT PROPERTY OF NEURAL NETWORK
Kanami Imamura, The University of Tokyo, Japan; Tomohiko Nakamura, National Institute of Advanced Industrial Science and Technology (AIST), Japan; Norihiro Takamune, The University of Tokyo, Japan; Kohei Yatabe, Tokyo University of Agriculture and Technology, Japan; Hiroshi Saruwatari, The University of Tokyo, Japan
ASMSP-P4.5: TRUSTWORTHY MAJORITY VOTING FOR LABELING AND ANALYZING MULTI-ANNOTATOR TEXT SENTIMENT DATASETS
Fotis Avgoustidis, Paraskevi Bassia, Ioannis Pitas, Department of Informatics, Aristotle University of Thessaloniki, Greece
ASMSP-P4.6: Quality Over Quantity? LLM-Based Curation for a Data-Efficient Audio–Video Foundation Model
Ali Vosoughi, University of Rochester, United States; Dimitra Emmanouilidou, Hannes Gamper, Microsoft Research, United States