List of Accepted Papers
Following is the list of accepted IWAENC 2024 papers, sorted by paper title. You can use the search feature of your web browser to find your paper number. Notifications to all authors have also been sent by email. If you have not received your notification of the results by email, please contact us at iwaenc2024@cmsworkshops.com.
Paper Number | Paper Title |
---|---|
1063 | A CASCADED SEMI-BLIND SOURCE SEPARATION METHOD FOR JOINT ACOUSTIC ECHO CANCELLATION, INTERFERENCE SUPPRESSION, AND NOISE REDUCTION |
1075 | A CROSS-DOMAIN APPROACH TO TEMPORAL ENVELOPE SHAPING IN PARAMETRIC STEREO CODING USING DEEP LEARNING |
1077 | A DATA-REUSE SEMI-BLIND SOURCE SEPARATION APPROACH FOR NONLINEAR ACOUSTIC ECHO CANCELLATION |
1053 | A HYBRID APPROACH FOR LOW-COMPLEXITY JOINT ACOUSTIC ECHO AND NOISE REDUCTION |
1087 | A MULTI-NOISE MULTI-CHANNEL ANC SYSTEM USING RELATIVE TRANSFER MATRIX-BASED APPROACH |
1135 | A Multi-Room Transition Dataset for Blind Estimation of Energy Decay |
1159 | A PHYSICS-INFORMED NEURAL NETWORK-BASED APPROACH FOR THE SPATIAL UPSAMPLING OF SPHERICAL MICROPHONE ARRAYS |
1141 | A THIRD-ORDER TENSOR DECOMPOSITION BASED LINEAR-IN-THE-PARAMETERS NONLINEAR ADAPTIVE FILTER |
1081 | A tunable binaural audio telepresence system capable of balancing immersive and enhanced modes |
1057 | A Unified Approach to Speaker Separation and Target Speaker Extraction Using Encoder-Decoder Based Attractors |
1037 | Accurate delayed source model for multi-frame Full-rank Spatial Covariance Analysis |
1103 | ACTIVE ROAD NOISE CONTROL BASED ON DATA-DRIVEN PREDICTIONS OF PASSENGER EAR NOISE SIGNAL |
1013 | AN EFFECTIVE MVDR POST-PROCESSING METHOD FOR LOW-LATENCY CONVOLUTIVE BLIND SOURCE SEPARATION |
1041 | ANALYSIS OF EARBUD-MOUNTED BONE-CONDUCTION MICROPHONES |
1038 | Bayesian sound field estimation using uncertain data |
1019 | Binaural Direction-of-Arrival estimation incorporating head movement information |
1132 | Blind Acoustic Parameter Estimation Through Task-Agnostic Embeddings Using Latent Approximations |
1071 | BUDDY: SINGLE-CHANNEL BLIND UNSUPERVISED DEREVERBERATION WITH DIFFUSION MODELS |
1054 | COMPARATIVE ANALYSIS OF DISCRIMINATIVE DEEP LEARNING-BASED NOISE REDUCTION METHODS IN LOW SNR SCENARIOS |
1115 | Complexity Reduction for Classification of Musical Instruments Using Element Selection |
1072 | CONCATENET: DIALOGUE SEPARATION USING LOCAL AND GLOBAL FEATURE CONCATENATION |
1034 | CONVOLUTIONAL NEURAL NETWORK-BASED PREDICTION OF A FRENCH MODIFIED RHYME TEST RECORDED WITH A BODY-CONDUCTION MICROPHONE |
1088 | DERIVATIVE FEATURES OF SHORT-TIME HOLOMORPHIC FOURIER TRANSFORM |
1120 | DIMINISHING DOMAIN MISMATCH FOR DNN-BASED ACOUSTIC DISTANCE ESTIMATION VIA STOCHASTIC ROOM REVERBERATION MODELS |
1028 | DIRECTION OF ARRIVAL ESTIMATION ON A SPHERE |
1060 | DIRECTIVITY ANALYSIS OF A VIBRATING SPHERICAL CAP ON A RIGID SPHERE |
1009 | DSP-INFORMED BANDWIDTH EXTENSION USING LOCALLY-CONDITIONED EXCITATION AND LINEAR TIME-VARYING FILTER SUBNETWORKS |
1018 | DYNAMIC AUDIO-VISUAL SPEECH ENHANCEMENT USING RECURRENT VARIATIONAL AUTOENCODERS |
1047 | EFFICIENT AREA-BASED AND SPEAKER-AGNOSTIC SOURCE SEPARATION |
1064 | EFFICIENT, CLUSTER-INFORMED, DEEP SPEECH SEPARATION WITH CROSS-CLUSTER INFORMATION IN AD-HOC WIRELESS ACOUSTIC SENSOR NETWORKS |
1030 | ESTIMATION OF OUTPUT SI-SDR OF SPEECH SIGNALS SEPARATED FROM NOISY INPUT BY CONV-TASNET |
1055 | E-URES: EFFICIENT USER-CENTRIC RESIDUAL-ECHO SUPPRESSION FRAMEWORK WITH A DATA-DRIVEN APPROACH TO REDUCING COMPUTATIONAL COSTS |
1112 | EVALUATING SPEECH ENHANCEMENT SYSTEMS THROUGH LISTENING EFFORT |
1099 | EVALUATION OF DATA-DRIVEN ROOM GEOMETRY INFERENCE METHODS USING A SMART SPEAKER PROTOTYPE |
1048 | EVALUATION OF OBJECTIVE QUALITY MODELS ON NEURAL AUDIO CODECS |
1022 | FEASIBILITY OF IMAGLS-BSM - ILD INFORMED BINAURAL SIGNAL MATCHING WITH ARBITRARY MICROPHONE ARRAYS |
1083 | Gaussian Flow Bridges for Audio Domain Transfer with Unpaired Data |
1131 | GREEDY DESIGN OF CIRCULAR CONCENTRIC ARRAYS FOR BROADBAND MVDR |
1067 | HARMONICS TO THE RESCUE: WHY VOICED SPEECH IS NOT A WSS PROCESS |
1061 | HIGH-FIDELITY DIFFUSION-BASED AUDIO CODEC |
1073 | Informed FastICA: Semi-Blind Minimum Variance Distortionless Beamformer |
1105 | INTERAURAL TIME DIFFERENCE LOSS FOR BINAURAL TARGET SOUND EXTRACTION |
1070 | INVESTIGATION ON SYSTEM BANDWIDTH FOR DNN-BASED BINAURAL SOUND LOCALISATION FOR HEARING AIDS |
1145 | ITERATIVE AND COMPLEX ORTHOGONAL MATCHING PURSUIT FOR BROADBAND SPARSE SOUND FIELD RECONSTRUCTION |
1076 | Joint Audio Source Localization and Separation With Distributed Microphone Arrays Based on Spatially-Regularized Multichannel NMF |
1049 | JOINT OPTIMIZATION OF MICROPHONE ARRAY GEOMETRY AND REGION-OF-INTEREST BEAMFORMING WITH SPARSE CIRCULAR SECTOR ARRAYS |
1140 | LASER: LANGUAGE-QUERIED SPEECH ENHANCER |
1136 | Latency-agnostic speech enhancement for wireless acoustic sensor networks using polynomial eigenvalue decomposition |
1116 | Learning-based Multi-Channel Speech Presence Probability Estimation Using a Low-Parameter Model and Integration With MVDR Beamforming for Multi-Channel Speech Enhancement |
1138 | LONG-TERM CONVERSATION ANALYSIS: PRIVACY-UTILITY TRADE-OFF UNDER NOISE AND REVERBERATION |
1113 | LOW COMPLEXITY SIGNAL ADAPTIVE SOUND ZONE CONTROL USING SUBSPACE TRACKING |
1137 | LOW LATENCY TWO STAGE BEAMFORMING WITH DISTRIBUTED MICROPHONE ARRAYS USING A PLANE WAVE DECOMPOSITION |
1016 | LOW-LATENCY SINGLE-MICROPHONE SPEAKER SEPARATION WITH TEMPORAL CONVOLUTIONAL NETWORKS USING SPEAKER REPRESENTATIONS |
1044 | LOW-ORDER CONTROLLERS FOR ACTIVE NOISE CANCELLATION BASED ON HANKEL MATRIX RANK MINIMIZATION |
1151 | Magnitude Least-Squares based Ambisonics Estimation of Head-Worn Device Microphone Measurements for Binaural Reproduction |
1133 | MAGNITUDE OR PHASE? A TWO-STAGE ALGORITHM FOR SINGLE-MICROPHONE SPEECH DEREVERBERATION |
1035 | MATRIX STUDY OF FEATURE COMPRESSION TYPES AND INSTRUMENTAL SPEECH QUALITY METRICS IN ULTRA-LIGHT DNN-BASED SPECTRAL SPEECH ENHANCEMENT |
1024 | Maximum Likelihood Estimation of the Direction of Sound In A Reverberant Noisy Environment |
1152 | META-LEARNING FOR VARIABLE ARRAY CONFIGURATIONS IN END-TO-END FEW-SHOT MULTICHANNEL SPEECH ENHANCEMENT |
1130 | MONAURAL SPEECH ENHANCEMENT ON DRONE VIA ADAPTER BASED TRANSFER LEARNING |
1085 | Multi-label audio classification with a noisy zero-shot teacher |
1134 | MULTI-LABEL ZERO-SHOT AUDIO CLASSIFICATION WITH TEMPORAL ATTENTION |
1017 | MULTI-SPEAKER DOA TRACKING ALGORITHM UTILIZING PROBABILITY HYPOTHESIS DENSITY FILTER AND WEIGHTED HISTOGRAM OF SRP-PHAT |
1108 | MULTI-STREAM DIFFUSION MODEL FOR PROBABILISTIC INTEGRATION OF MODEL-BASED AND DATA-DRIVEN SPEECH ENHANCEMENT |
1122 | NEAR-END LISTENING ENHANCEMENT USING A NOISE-ROBUST LINEAR TIME-INVARIANT FILTER |
1097 | NEURAL DIRECTIONAL FILTERING: FAR-FIELD DIRECTIVITY CONTROL WITH A SMALL MICROPHONE ARRAY |
1045 | Non-Causal to Causal SSL-Supported Transfer Learning: Towards a High-Performance Low-Latency Speech Vocoder |
1160 | ON LIMITATIONS AND IMPROVEMENT OF DIFFERENTIAL BEAMFORMING VIA QUADRATIC EIGENVALUE OPTIMIZATION |
1124 | ON THE IMPACT OF FREQUENCY RESOLUTION ON FEMALE AND MALE SPEECH IN DNN-BASED NOISE REDUCTION SYSTEMS |
1126 | ONE-SHOT DISTRIBUTED NODE-SPECIFIC SIGNAL ESTIMATION WITH NON-OVERLAPPING LATENT SUBSPACES IN ACOUSTIC SENSOR NETWORKS |
1021 | ONLINE SYSTEM IDENTIFICATION ON LEARNED ACOUSTIC MANIFOLDS USING AN EXTENDED KALMAN FILTER |
1118 | PAD-VC: A PROSODY-AWARE DECODER FOR ANY-TO-FEW VOICE CONVERSION |
1162 | PLUG-AND-PLAY AUDIO RESTORATION WITH DIFFUSION DENOISER |
1148 | PREDICTING SUBJECTIVE SATISFACTION WITH SPEECH PREDICTION-BASED ANC USING PERCEPTUALLY RELEVANT METRICS CORRELATED WITH SOUND ATTRIBUTES |
1026 | REAL-TIME JOINT NOISE SUPPRESSION AND BANDWIDTH EXTENSION OF NOISY REVERBERANT WIDEBAND SPEECH |
1153 | REFERENCE MICROPHONE SELECTION FOR THE WEIGHTED PREDICTION ERROR ALGORITHM USING THE NORMALIZED L-P NORM |
1036 | RGI-NET: 3D ROOM GEOMETRY INFERENCE FROM ROOM IMPULSE RESPONSES WITH HIDDEN FIRST-ORDER REFLECTIONS |
1107 | Robustness of Speech Separation Models for Similar-pitch Speakers |
1117 | ROOM IMPULSE RESPONSE PROTOTYPING USING RECEIVER DISTANCE ESTIMATIONS FOR HIGH QUALITY ROOM EQUALISATION ALGORITHMS |
1121 | SAMPLE RATE OFFSET COMPENSATED ACOUSTIC ECHO CANCELLATION FOR MULTI-DEVICE SCENARIOS |
1129 | SIMULATING SOUND FIELDS IN ROOMS WITH ARBITRARY GEOMETRIES USING THE DIFFRACTION-ENHANCED IMAGE SOURCE METHOD |
1156 | SOUND FIELD ESTIMATION IN REGION INCLUDING SCATTERING OBJECTS BASED ON KERNEL INTERPOLATION: EVALUATION FOR VARIOUS SCATTERERS |
1127 | Sound Field Estimation Using Deep Kernel Learning Regularized by the Wave Equation |
1050 | SOUND FIELD SYNTHESIS WITH ACOUSTIC WAVES |
1100 | Source Localization by Multidimensional Steered Response Power Mapping with Sparse Bayesian Learning |
1109 | SOURCE SIGNAL CAPTURE IN ACOUSTIC SENSOR NETWORKS BASED ON ROBUST BEAMFORMING AND SOURCE-RELATED CLUSTER ESTIMATION |
1143 | SPHERICAL MAPPING OF SHORT-TIME SPECTRAL COMPONENTS |
1015 | Split-Attention Mechanisms with Graph Convolutional Network for Multi-Channel Speech Separation |
1123 | SUPPRESSING NOISE DISPARITY IN TRAINING DATA FOR AUTOMATIC PATHOLOGICAL SPEECH DETECTION |
1082 | TF-LOCOFORMER: TRANSFORMER WITH LOCAL MODELING BY CONVOLUTION FOR SPEECH SEPARATION AND ENHANCEMENT |
1098 | The acoustic velocity vectors of the outgoing sound field |
1161 | THIRD-ORDER TENSOR DECOMPOSITION BASED MULTICHANNEL LINEAR PREDICTION FOR ROBUST DEREVERBERATION |
1029 | TINY NEURAL-NETWORK CONTROL OF FREQUENCY-DOMAIN ADAPTIVE FILTERING FOR LINEAR SYSTEM IDENTIFICATION IN ACOUSTIC ECHO CANCELLATION |
1069 | UNCERTAINTY-BASED REMIXING FOR UNSUPERVISED DOMAIN ADAPTATION IN DEEP SPEECH ENHANCEMENT |
1106 | UTILIZING HEAD ROTATION DATA IN DNN-BASED MULTI-CHANNEL SPEECH ENHANCEMENT FOR HEARING AIDS |
1032 | WEAKLY DOA GUIDED SPEAKER SEPARATION WITH RANDOM LOOK DIRECTIONS AND ITERATIVELY REFINED TARGET AND INTERFERENCE PRIORS |
1092 | XANE Background Acoustic Embeddings: Ablation and Clustering Analysis |