Technical Program
Paper Search
Program Booklet
Sunday, April 14
Sun 08:30 - 12:00
Tutorial
T-1: Advances in Objective Speech Intelligibility and Quality Assessment: From Psychoacoustics to Machine Learning
Room 103
Tutorial
T-2: Understanding Deep Representation Learning via Neural Collapse
Room E5
Tutorial
T-3: Quantum Tensor Networks in Machine Learning and Signal Processing
Room E6
Tutorial
T-5: Variational Inference, (Not So) Approximate Bayesian Techniques, and Applications
Room 300
Tutorial
T-6: Safe and Trustworthy Large Language Models
Room 201
Workshop
WS-6: Timely and Private Machine Learning over Networks
Room 209A
Sun 08:30 - 17:30
Industry
Entrepreneurship Forum
Room 101
PROGRess Workshop
Room 102
Workshop
WS-3: Self-supervision in Audio, Speech and Beyond (SASB)
Room 104
Workshop
WS-8: Revolutionizing Interaction: Embodied Intelligence and the New Era of Human-Robot Collaboration
Room 206
Workshop
WS-10: 1st Workshop on Integration of Sensing, Communication, and Computation (ISCC)
Room 105
Sun 13:00 - 17:30
Workshop
WS-11: Signal Processing and Machine Learning Advances in Automotive Radars
Room 205
Sun 14:00 - 17:30
Tutorial
T-7: Deep generative model for inference
Room 103
Tutorial
T-8: Learning with Multiple Objectives Beyond Bilevel Optimization - New Foundations and Speech Applications
Room E5
Tutorial
T-9: Quantum Error Correcting Codes and Circuits: Algorithm-Architecture Codesign
Room E6
Tutorial
T-10: Building White-Box Deep Neural Networks
Room 300
Tutorial
T-11: Reinforcement Learning and Bandits for Speech and Language Processing
Room 201
Workshop
WS-1: Deep Neural Network Model Compression
Room 209A
Workshop
WS-9: SPID-CPS: Signal Processing for resilient Intrusion Detection in Cyber-Physical Systems
Room 209B
Sun 18:00 - 19:00
Industry
Entrepreneurship Forum After-Party
Room 101 Lobby
Monday, April 15
Mon 07:00 - 09:00
Micro Mentoring Experience Program (MiME)
Room E5-E6
Mon 08:30 - 12:00
Industry
Spotlight Talks
Room 101
Workshop
WS-12: Workshop on Radio Maps and Their Applications (RMA)
Room 209A
Mon 08:30 - 12:30
Tutorial
T-12: Tropical Geometry for Machine Learning and Optimization
Room E7
Tutorial
T-13: Hearables: Real World Applications of Interpretable AI for eHealth
Room 300
Tutorial
T-14: Fundamentals of Transformers: A Signal-processing View
Room 103
Tutorial
T-15: Sparsity in Large Language Models: The New Odyssey
Room MR1
Tutorial
T-16: Localization-of-Things in Beyond 5G Networks: New Opportunities in Signal Processing
Room MR2
Tutorial
T-17: Parameter-Efficient and Prompt Learning for Speech and Language Foundation Models
Room 201
Workshop
WS-14: Fearless Steps APOLLO: A Naturalistic Team based Speech Communications Community Resource (FS-APOLLO)
Room 209B
Mon 08:30 - 17:30
Workshop
WS-4: XAI-SA: ICASSP 2024 Workshop on Explainable AI for Speech and Audio
Room 105
Workshop
WS-7: Second Workshop on Signal Processing for Autonomous Systems (SPAS)
Room 104
Workshop
WS-13: Super-resolution integrated communications, localization, vision and radio mapping (SUPER-CLAM)
Room 206
Workshop
WS-15: Hands-free Speech Communication and Microphone Arrays (HSCMA 2024): Efficient and Personalized Speech Processing through Data Science
Room 205
Mon 14:00 - 17:30
Industry
Industry Colloquiums
Room 101
Tutorial
T-18: Adaptive and Flexible Model-Based AI for Wireless Systems
Room 300
Tutorial
T-20: Zeroth-Order Machine Learning: Fundamental Principles and Emerging Applications in Foundation Models
Room 201
Tutorial
T-21: Foundational Problems in Neural Speech Recognition
Room 103
Tutorial
T-22: Positioning, Navigation, and Timing Using LEO Satellite Signals: Models, Methods, and Challenges
Room MR1
Tutorial
T-23: Generative AI Models for Signal and Data Processing: Theory, Methods, and Applications
Room E7
Workshop
WS-2: Trustworthy Speech Processing (TSP)
Room 209A
Workshop
WS-5: Workshop on Computational Imaging Using Synthetic Apertures
Room 209B
Mon 15:00 - 18:00
Signal Processing Cup Competition
Room E5-E6
Mon 18:00 - 20:30
Welcome Reception
Hall D1 + 3F Lobby
Tuesday, April 16
Tue 08:20 - 18:00
Industry Exhibitions
Hall D2
Tue 08:40 - 10:20
Opening Ceremony
Auditorium
Tue 10:40 - 11:40
Plenary
PLEN-1: Seong-Whan Lee "Brain-To-Speech: Neural Speech Synthesis from Brain Signals"
Auditorium
Tue 11:40 - 14:00
PROGRESS: The impact of NRF/NSF early career faculty programs
Room E5-E6
Tue 13:10 - 15:10
DEMO-1A: Show and Tell Demos I-A
Hall D2: Podium Pitch Room A
DEMO-1B: Show and Tell Demos I-B
Hall D2: Podium Pitch Room B
IVMSP-L1: Vision and language
Room 103
AASP-L1: Acoustic Signal Processing
Room 101
MLSP-L1: Deep Learning Techniques I
Room 102
SLP-L1: Speech enhancement and separation - Diffusion and other probabilistic models
Room 104
ASPS-L1: ASPS Algorithm and Architecture Design and Synthesis Lecture Session
Room 105
SPCOM-L1: Distributed and Federated Learning
Room E1
MLSP-L2: Transfer Learning I
Room E3
SLP-L2: Voice Conversion I
Room 201
MLSP-L3: Graph Neural Networks I
Room E2
SLP-L3: Language resources, metrics and systems
Room E4
IFS-L1: Watermarking and Data Hiding
Room 205A
SPTM-L1: Signal and Information Processing over Graphs
Room 205B
Special Session
SS-L1: Model based machine learning for wireless communications and sensing
Room 209A
SAM-L1: Integrated Sensing and Communications
Room E8
GC-L1: Auditory EEG decoding challenge 2024
Room 209B
AASP-P1: Audio events detection and classification; Music Information Retrieval 1
Poster Zone 1A
SLP-P1: Language understanding and computational semantics III - NLP Tasks
Poster Zone 1B
BISP-P1: Physiological and wearable signal processing
Poster Zone 1C
AASP-P2: Speech enhancement 1; Music information retrieval 2
Poster Zone 2A
BISP-P2: Multimodal medical image fusion and analysis
Poster Zone 2B
SPTM-P1: Sparse/Low-Dimensional Signal Processing
Poster Zone 2C
MLSP-P1: Robust and Sustainable Machine Learning
Poster Zone 3A
MLSP-P2: Machine Learning for Image and Video Processing II
Poster Zone 3B
MLSP-P3: Deep Learning Generalization
Poster Zone 3C
SPCOM-P1: Distributed Processing and Federated Learning
Poster Zone 4A
BISP-P3: Biological image analysis
Poster Zone 4B
MLSP-P4: Learning from Multimodal Data II
Poster Zone 4C
IFS-P1: Biometrics
Poster Zone 5A
SPTM-P2: Detection and classification
Poster Zone 5B
MMSP-P1: Multimedia Coding
Poster Zone 5C
IFS-P2: Anonymisation, Data Privacy and Hiding
Poster Zone 6A
IVMSP-P1: Quality assessment and anomaly detection I
Poster Zone 6B
SPTM-P3: Signal Filtering, Reconstruction, Restoration and Enhancement
Poster Zone 6C
WS-10: 1st Workshop on Integration of Sensing, Communication, and Computation (ISCC)
Workshop Poster
WS-15a: Hands-free Speech Communication and Microphone Arrays (HSCMA 2024): Efficient and Personalized Speech Processing through Data Science I
Workshop Poster
WS-15b: Hands-free Speech Communication and Microphone Arrays (HSCMA 2024): Efficient and Personalized Speech Processing through Data Science II
Workshop Poster
WS-15c: Hands-free Speech Communication and Microphone Arrays (HSCMA 2024): Efficient and Personalized Speech Processing through Data Science III
Workshop Poster
WS-3a: Self-supervision in Audio, Speech and Beyond I
Workshop Poster
WS-3b: Self-supervision in Audio, Speech and Beyond II
Workshop Poster
WS-3c: Self-supervision in Audio, Speech and Beyond III
Workshop Poster
WS-7: Second Workshop on Signal Processing for Autonomous Systems (SPAS)
Workshop Poster
WS-8: Revolutionizing Interaction: Embodied Intelligence and the New Era of Human-Robot Collaboration
Workshop Poster
Tue 14:00 - 15:00
AI-ML Panel Series
Room E7
Tue 15:10 - 16:10
Industry
Plenary
IPLEN-1: Songyee Yoon "Exploring Game & AI With the Application of AI in MMORPGs"
Auditorium
Tue 16:10 - 18:00
Industry
Industry Workshop by MathWorks - Building Data-Centric AI and Signal Processing Applications with MATLAB & Simulink for Industrial Innovation
Auditorum
Tue 16:30 - 18:30
SLP-L4: Speech Emotion Recognition and Analysis I
Room 103
MLSP-L4: Deep Generative Models I
Room 101
SLP-L5: Context and LLM speech recognition
Room 102
SLP-L6: Language understanding and computational semantics I - NLP Tasks
Room 104
AASP-L2: Music Information Retrieval
Room 105
ASPS-L2: ASPS Systems Lecture Session
Room E1
MLSP-L5: Machine Learning for Image and Video Processing I
Room E3
MMSP-L1: Multimodal Processing: Vision + Language 1
Room 201
AASP-L3: Environmental Sound Synthesis and Generation
Room E2
IVMSP-L2: Biomedical and biological image processing
Room E4
IVMSP-L3: Quality assessment and anomaly detection
Room 205A
Special Session
SS-L2: Exploiting Diversities in Advanced Array Systems: New Applications and Trends
Room 205B
SAM-L2: DoA Estimation
Room 209A
SPTM-L2: Tracking
Room E8
GC-L2: The 2nd e-Prevention challenge: Psychotic and Non-Psychotic Relapse Detection using Wearable-Based Digital Phenotyping
Room 209B
SPCOM-P2: Machine Learning for Communications
Poster Zone 1A
IVMSP-P2: Image and video processing for watermarking and security
Poster Zone 1B
SLP-P2: Self-supervised learning for speech processing II
Poster Zone 1C
MLSP-P5: Deep Learning Techniques II
Poster Zone 2A
IVMSP-P3: Deep Learning for Image and Video Processing I
Poster Zone 2B
IVMSP-P4: Image, video, and 3D content generation I
Poster Zone 2C
AASP-P3: Classification of acoustic scenes and events
Poster Zone 3A
MLSP-P6: Reinforcement Learning II
Poster Zone 3B
MLSP-P7: Subspace and Manifold Learning
Poster Zone 3C
AASP-P4: Active noise control and echo cancellation; Source separation 1
Poster Zone 4A
SPTM-P4: Machine learning, detection and classification
Poster Zone 4B
MLSP-P8: Machine Learning for Audio, Speech and Music Processing
Poster Zone 4C
MMSP-P2: Multimedia Generation and Synthesis
Poster Zone 5A
BISP-P4: Medical image detection and segmentation-1
Poster Zone 5B
IVMSP-P5: Vision and language I
Poster Zone 5C
IFS-P3: Multimedia Forensics and Cybersecurity
Poster Zone 6A
SPTM-P5: Estimation Theory and Methods
Poster Zone 6B
BISP-P5: Emerging methods for biomedical image and signal processing
Poster Zone 6C
Wednesday, April 17
Wed 08:20 - 10:20
SLP-L7: Text to Speech Generation - O1
Room 103
AASP-L4: Audio Classification, Detection and Localization
Room 101
MLSP-L6: Self-Supervised and Semi-Supervised Learning I
Room 102
SLP-L8: Multichannel/Multimodal Speech Recognition
Room 104
SLP-L9: Speaker verification I
Room 105
MMSP-L2: Audio-Visual Speech Processing
Room E1
SLP-L10: Speaker Diarization I
Room E3
MLSP-L7: Adversarial Machine Learning I
Room 201
SLP-L11: Machine learning methods for language
Room E2
SPED-L1: SPED: Signal Processing Education
Room E4
MMSP-L3: Multimedia Quality of Experience
Room 205A
Special Session
SS-L3: Generative Semantic Communication: How Generative Models Enhance Semantic Communications
Room 205B
Special Session
SS-L4: Quantum Machine Learning Algorithms and Applications on NISQ Devices
Room 209A
BISP-L1: Domain-enriched learning for medical image processing
Room E8
GC-L3: In-Car Multi-Channel Automatic Speech Recognition Challenge (ICMC-ASR)
Room 209B
SLP-P3: Speech enhancement and separation I
Poster Zone 1A
IVMSP-P6: Image denoising
Poster Zone 1B
ASPS-P1: ASPS Systems Poster Session
Poster Zone 1C
SLP-P4: ASR - New algorithms and approaches
Poster Zone 2A
SLP-P5: Voice Conversion II
Poster Zone 2B
IVMSP-P7: Image, video, and 3D content generation II
Poster Zone 2C
MLSP-P9: Deep Learning Techniques III
Poster Zone 3A
MLSP-P10: Distributed and Federated Learning II
Poster Zone 3B
MLSP-P11: Data Mining and Big Data II
Poster Zone 3C
SLP-P6: Language understanding and computational semantics IV - Machine Learning
Poster Zone 4A
MLSP-P12: Explainable and Interpretable Machine Learning II
Poster Zone 4B
BISP-P6: Neuroimaging and brain/human-computer interfaces-1
Poster Zone 4C
AASP-P5: Localization, DOA estimation, Spatial audio recording and reproduction
Poster Zone 5A
MMSP-P3: Perception and Processing for Autonomous Systems and Applications
Poster Zone 5B
CI-P1: Computational Imaging III
Poster Zone 5C
AASP-P6: Audio and speech quality and intelligibility measures; Music analysis 1
Poster Zone 6A
IVMSP-P8: Quality assessment and anomaly detection II
Poster Zone 6B
BISP-P7: Medical image formation, reconstruction and restoration
Poster Zone 6C
Wed 08:20 - 18:00
Industry Exhibitions
Hall D2
Wed 10:00 - 12:00
Short Course
SC-2a: Practical Guide to Computational Imaging: From Basics to Brilliance (Part 1)
Room 300-A
Short Course
SC-3a: RF Sensing for Wireless AI Perception: Theories, Algorithms, and Applications (Part 1)
Room 300-B
Short Course
SC-4a: Multi-Agent Optimization and Learning (Part 1)
Room E7
Wed 10:40 - 11:40
Plenary
PLEN-2: Bhaskar D. Rao "Classical versus Modern Signal Processing Algorithms: A Contrast Study"
Auditorium
Wed 11:40 - 13:40
Women in Signal Processing Luncheon
Room E5-E6
Wed 13:10 - 15:10
SLP-L12: Speech Emotion Recognition and Analysis II
Room 103
AASP-L5: Audio and Speech Source Separation
Room 101
SLP-L13: Text-based customization for speech-to-text
Room 102
MLSP-L8: Deep Learning Models I
Room 104
SPCOM-L2: Next-Gen Communication Systems
Room 105
IVMSP-L4: Image restoration
Room E1
MLSP-L9: Robustness and Trustworthy Machine Learning I
Room E3
SPTM-L3: Signal Processing over Networks
Room 201
MLSP-L10: Learning from Multimodal Data I
Room E2
IVMSP-L5: 3D understanding
Room E4
SLP-L14: Self-supervised learning for speech processing I
Room 205A
BISP-L2: Medical image detection and segmentation I
Room 205B
Special Session
SS-L5: Robust Reconstruction Methods in Computational Imaging
Room 209A
SAM-L3: Compressed sensing and machine learning for multi-sensor systems
Room E8
GC-L4: LIMMITS'24: Multi-speaker, Multi-lingual Indic TTS with voice cloning
Room 209B
SLP-P7: Natural Language Processing for speech-to-text
Poster Zone 1A
SLP-P8: Resource Constrained Acoustic and Langugage Modeling I
Poster Zone 1B
SLP-P9: Language resources, metrics and systems
Poster Zone 1C
AASP-P7: Dereverberation and RIR estimation 1; Speech enhancement and restoration
Poster Zone 2A
IVMSP-P9: Image/video super-resolution
Poster Zone 2B
ASPS-P2: ASPS IOT Poster Session
Poster Zone 2C
MLSP-P13: Transfer Learning II
Poster Zone 3A
MLSP-P14: Machine Learning for Image and Video Processing III
Poster Zone 3B
MLSP-P15: Matrix Factorization and Source Separation
Poster Zone 3C
AASP-P8: Beamforming for audio and speech; Music signal analysis, processing and synthesis
Poster Zone 4A
SLP-P10: Summarization, retrieval and language learning
Poster Zone 4B
MLSP-P16: Sequential Learning and Sequential Decision Methods
Poster Zone 4C
SPCOM-P3: MIMO and Massive MIMO Communication Systems
Poster Zone 5A
MMSP-P4: Multimodal Emotion/Sentiment Analysis
Poster Zone 5B
IVMSP-P10: Vision and language II
Poster Zone 5C
SLP-P11: Text to Speech Generation - P1
Poster Zone 6A
IVMSP-P11: Human understanding I
Poster Zone 6B
SPTM-P6: Signal and Information Processing over Graphs
Poster Zone 6C
Wed 14:00 - 15:30
Short Course
SC-2b: Practical Guide to Computational Imaging: From Basics to Brilliance (Part 2)
Room 300-A
Short Course
SC-3b: RF Sensing for Wireless AI Perception: Theories, Algorithms, and Applications (Part 2)
Room 300-B
Short Course
SC-4b: Multi-Agent Optimization and Learning (Part 2)
Room E7
Wed 15:10 - 16:10
Industry
Innovation Forum
Auditorium
Wed 16:10 - 18:00
Industry
Industry Workshop by Meta - Meta industry workshop – open problems in Speech, Audio, Video and Signal Processing
Auditorum
Wed 16:30 - 18:30
Signal Processing on Climate Change Interactive Session
Room 2B+2C
IVMSP-L6: Image and video synthesis
Room 103
SPCOM-L3: MIMO and High-frequency Communications
Room 101
IVMSP-L7: Image and video super-resolution
Room 102
MLSP-L11: Deep Generative Models II
Room 104
AASP-L6: Spatial Audio Recording and Reproduction
Room 105
AASP-L7: Audio Signal Restoration and Speech Enhancement
Room E1
SLP-L15: Discourse and dialog I
Room E3
SPTM-L4: Bayesian Signal Processing
Room 201
MLSP-L12: Pattern Recognition and Classification I
Room E2
IVMSP-L8: Human understanding
Room E4
SLP-L16: Key Word Spotting
Room 205A
Special Session
SS-L6: Graphical Inference and Modeling in Dynamical Systems
Room 205B
BISP-L3: Physiological and wearable signal processing I
Room 209A
SLP-L17: Speech analysis - Pitch, Spectrum and Voice disorders
Room E8
GC-L5: Grand Challenge on Hyperspectral Skin Vision
Room 209B
SLP-P12: Robust speech recognition and adaptation I
Poster Zone 1A
SLP-P13: Speech analysis and language disorder analysis
Poster Zone 1B
IVMSP-P12: Aspects in image/video processing and analysis
Poster Zone 1C
MLSP-P17: Deep Learning Models III
Poster Zone 2A
IVMSP-P13: Deep Learning for Image and Video Processing II
Poster Zone 2B
SAM-P1: DoA Estimation and Source Localization I
Poster Zone 2C
SLP-P14: Multimodal processing of language
Poster Zone 3A
MLSP-P18: Graph Neural Networks III
Poster Zone 3B
ASPS-P3: ASPS Neuromorphic, Quantum, and Software Poster Session
Poster Zone 3C
AASP-P9: Source separation 2; Music analysis 2
Poster Zone 4A
MLSP-P19: Machine Learning for Time Series Analysis II
Poster Zone 4B
MLSP-P20: Adversarial Machine Learning II
Poster Zone 4C
SLP-P15: Speech Emotion Recognition and Analysis III
Poster Zone 5A
MMSP-P5: Multimedia Search and Retrieval
Poster Zone 5B
CI-P2: Computational Imaging IV
Poster Zone 5C
AASP-P10: Anomaly detection; Sound event detection and localization
Poster Zone 6A
BISP-P8: Medical image detection and segmentation-2
Poster Zone 6B
SAM-P2: Acoustic array and signal processing
Poster Zone 6C
Wed 17:30 - 18:30
MiME Catch-up
Room E5-E6
Wed 19:30 - 22:00
Conference Banquet
Hall D1
Thursday, April 18
Thu 08:00 - 10:00
Author Ethics and IEEE Author Tools
Room MR2-AB
Thu 08:20 - 10:20
SLP-L18: Text to Speech Generation -O2
Room 103
AASP-L8: Music Signal Analysis and Processing
Room 101
SLP-L19: Language understanding and computational semantics II - Language Models
Room 102
MLSP-L13: Self-Supervised and Semi-Supervised Learning II
Room 104
IVMSP-L9: Deep learning theory
Room 105
SPTM-L5: Estimation theory and methods
Room E1
MLSP-L14: Distributed and Federated Learning I
Room E3
CI-L1: Computational Imaging I
Room 201
SLP-L20: Anti-spoofing I
Room E2
Special Session
SS-L7: Advancements in Integrated Sensing and Communication for Next-Generation Wireless Networks
Room E4
MMSP-L4: Pose, Gesture, and Action in Multimedia
Room 205A
SPTM-L6: Sampling Theory, Compressed and Non-uniform Sampling
Room 205B
SAM-L4: MIMO and Massive MIMO systems
Room 209A
BISP-L4: Multimodal and emerging medical signal analysis
Room E8
GC-L6: The RF Signal Separation Challenge
Room 209B
SLP-P16: Robust speech recognition and adaptation II
Poster Zone 1A
SPCOM-P4: Signal Processing for Communications
Poster Zone 1B
IVMSP-P14: Machine learning for image and video processing I
Poster Zone 1C
AASP-P11: Audio and speech modeling, coding and transmission; Spatial audio recording and reproduction
Poster Zone 2A
SLP-P17: Voice Conversion: Singing, accent and emotion
Poster Zone 2B
SLP-P18: machine learning methods for language
Poster Zone 2C
MLSP-P21: Deep Learning Models IV
Poster Zone 3A
SLP-P19: Speaker Diarization II
Poster Zone 3B
MLSP-P22: Other Machine Learning Applications I
Poster Zone 3C
SLP-P20: Speaker Recognition and Anonymization
Poster Zone 4A
MLSP-P23: Feature Extraction Selection and Learning
Poster Zone 4B
ASPS-P4: ASPS Emerging Topics Poster Session
Poster Zone 4C
AASP-P12: Music information retrieval 3; Quality and intelligibility measures
Poster Zone 5A
MLSP-P24: Learning Theory and Performance Bound
Poster Zone 5B
MMSP-P6: Human-Centric Multimedia
Poster Zone 5C
SLP-P21: Multilingual speech recognition and identification
Poster Zone 6A
IVMSP-P15: Image recognition and detection I
Poster Zone 6B
SPTM-P7: Signal Processing over Graphs and Networks
Poster Zone 6C
Thu 08:20 - 18:00
Industry Exhibitions
Hall D2
Thu 10:00 - 12:00
Short Course
SC-2c: Practical Guide to Computational Imaging: From Basics to Brilliance (Part 3)
Room 300-A
Short Course
SC-3c: RF Sensing for Wireless AI Perception: Theories, Algorithms, and Applications (Part 3)
Room 300-B
Short Course
SC-4c: Multi-Agent Optimization and Learning (Part 3)
Room E7
Thu 10:40 - 11:40
Plenary
PLEN-3: Daniel D. Lee "Geometry and Latent Signal Representations in Machine Learning"
Auditorium
Thu 11:40 - 13:40
Student Job Fair & Luncheon
Room E5-E6
Thu 13:10 - 15:10
DEMO-2A: Show and Tell Demos II-A
Hall D2: Podium Pitch Room A
DEMO-2B: Show and Tell Demos II-B
Hall D2: Podium Pitch Room B
SLP-L21: End-to-end modeling for automatic speech recognition
Room 103
SLP-L22: Segmentation, tagging, and parsing of language
Room 101
IVMSP-L10: Detection
Room 102
MLSP-L15: Deep Learning Models II
Room 104
MLSP-L16: Robustness and Trustworthy Machine Learning II
Room 105
AASP-L9: Audio-Language Processing and Audio Captioning
Room E1
IVMSP-L11: Action recognition
Room E3
IFS-L2: Biometrics-1
Room 201
MLSP-L17: Explainable and Interpretable Machine Learning I
Room E2
IVMSP-L12: Image, video and other applications
Room E4
Special Session
SS-L8: Signal and Graph Processing for Autonomous Agents - 1
Room 205A
Special Session
SS-L9: Next-Generation Wi-Fi Sensing: Part I
Room 205B
BISP-L5: Medical image detection and segmentation II
Room 209A
Special Session
SS-L10: Signal processing theory for covert communication and cybersecurity
Room E8
GC-L11: The ICASSP 2024 Audio Deep Packet Loss Concealment Grand Challenge
Room 209B
SPCOM-P5: Next-Gen Communications and PHY Security
Poster Zone 1A
IFS-P4: Network and System Security
Poster Zone 1B
AASP-P13: Target source extraction; Active noise control, echo reduction and feedback reduction
Poster Zone 1C
SLP-P22: Text to Speech Generation - P2
Poster Zone 2A
SLP-P23: Machine translation for spoken and written language II
Poster Zone 2B
AASP-P14: Sound events detection, description and generation
Poster Zone 2C
MLSP-P25: Deep Generative Models III
Poster Zone 3A
MLSP-P26: Learning from Multimodal Data III
Poster Zone 3B
MLSP-P27: Transfer Learning III
Poster Zone 3C
IFS-P5: Applied Cryptography
Poster Zone 4A
SLP-P24: Speaker verification II
Poster Zone 4B
BISP-P9: Neuroimaging and brain/human-computer interfaces-2
Poster Zone 4C
SLP-P25: Speech Emotion Recognition and Analysis IV
Poster Zone 5A
SLP-P26: Language understanding and computational semantics V - NLP Tasks
Poster Zone 5B
CI-P3: Computational Imaging V
Poster Zone 5C
MMSP-P7: Machine/Deep Learning Methodologies for Multimedia
Poster Zone 6A
IVMSP-P16: Human understanding II
Poster Zone 6B
SAM-P3: DoA Estimation and Source Localization II
Poster Zone 6C
Thu 14:00 - 15:30
Short Course
SC-2d: Practical Guide to Computational Imaging: From Basics to Brilliance (Part 4)
Room 300-A
Short Course
SC-3d: RF Sensing for Wireless AI Perception: Theories, Algorithms, and Applications (Part 4)
Room 300-B
Short Course
SC-4d: Multi-Agent Optimization and Learning (Part 4)
Room E7
Thu 15:10 - 16:10
Industry
Plenary
IPLEN-2: Johan Schalkwyk "Multi Modal Large Language models as the path towards Language Inclusivity"
Auditorium
Thu 16:30 - 18:30
Special Session
SS-L11: In-Context Learning Methods for Speech and Spoken Language Processing
Room 103
MMSP-L5: Multimodal Processing: Vision + Language 2
Room 101
MLSP-L18: Graph Neural Networks II
Room 102
SLP-L23: Speech separation and extraction
Room 104
SPCOM-L4: Signal Processing and Machine Learning for Communications
Room 105
AASP-L10: Audio Coding
Room E1
SLP-L24: Resource Constrained Acoustic and Langugage Modeling
Room E3
AASP-L11: Active noise control and echo cancellation
Room 201
MLSP-L19: Bayesian Machine Learning
Room E2
BISP-L6: Neuroimaging and brain/human-computer interfaces
Room E4
Special Session
SS-L12: Signal and Graph Processing for Autonomous Agents - 2
Room 205A
Special Session
SS-L13: Next-Generation Wi-Fi Sensing: Part II
Room 205B
BISP-L7: Physiological and wearable signal processing II
Room 209A
Special Session
SS-L14: Topological Signal Processing over Higher-order Networks
Room E8
GC-L8: Advancing the frontiers of deep learning for low-dose 3D cone-beam CT reconstruction
Room 209B
AASP-P15: Bioacoustics and medical acoustics; Audio security
Poster Zone 1A
SLP-P27: Acoustic modeling for automatic speech recognition
Poster Zone 1B
SLP-P28: Discourse and dialog II
Poster Zone 1C
SLP-P29: Multimodal processing of speech
Poster Zone 2A
IFS-P6: IFS General
Poster Zone 2B
IVMSP-P17: 3D image and video processing and analysis I
Poster Zone 2C
MLSP-P28: Deep Learning Training Methods II
Poster Zone 3A
MLSP-P29: Pattern Recognition and Classification II
Poster Zone 3B
SLP-P30: Key Word Spotting and Acoustic Event Detection
Poster Zone 3C
SPCOM-P6: Coding, Information Theory, and Applications of Signal Processing for Communications
Poster Zone 4A
SLP-P31: Speech analysis
Poster Zone 4B
AASP-P16: Music separation; Audio for multimedia and audio processing systems
Poster Zone 4C
IFS-P7: Adversarial Machine Learning
Poster Zone 5A
MLSP-P30: Machine Learning for Time Series Analysis III
Poster Zone 5B
MLSP-P31: Machine Learning for Communications and Wireless Networks
Poster Zone 5C
SLP-P32: Speech enhancement and separation II
Poster Zone 6A
IVMSP-P18: Image recognition and detection II
Poster Zone 6B
IVMSP-P19: Image and video coding/compression I
Poster Zone 6C
Thu 18:30 - 20:30
Young Professionals Networking Event
Room E5-E6
Friday, April 19
Fri 08:20 - 10:20
Special Session
SS-L15: Deepfakes and AI-Generated Content (AIGC) Detection and Forensics: Recent Advances
Room 103
BISP-L8: Bioinformatics and biomedical signal processing
Room 101
SLP-L25: Audio-visual speech/intent recognition
Room 102
IVMSP-L13: Image and video coding/compression
Room 104
MLSP-L20: Reinforcement Learning I
Room 105
MMSP-L6: Multimodal Clustering, Segmentation, and Summarization
Room E1
MLSP-L21: Learning Theory and Methods
Room E3
CI-L2: Computational Imaging II
Room 201
Special Session
SS-L16: Recent Advances in AI-Powered Visual Computing and Multimodal Signal Processing for Metaverse Era
Room E2
Special Session
SS-L17: Algorithm-Hardware Co-Design of Neuromorphic Solutions for Signal Processing Applications
Room E4
Special Session
SS-L18: Automotive Radar Signal Processing for Autonomous Driving
Room 205A
Special Session
SS-L19: Learning with Incomplete Medical Data
Room 205B
Special Session
SS-L20: Signal Processing and Machine Learning for Collective Intelligence
Room 209A
Special Session
SS-L21: Variational Inference and Approximate Bayesian Techniques
Room E8
GC-L9: ICASSP SP Cadenza Challenge: Music demixing/remixing for hearing aids
Room 209B
SLP-P33: Language understanding and computational semantics VI - Machine Learning
Poster Zone 1A
SLP-P34: Resource Constrained Acoustic and Langugage Modeling II
Poster Zone 1B
SAM-P4: Radar Signal Processing
Poster Zone 1C
MLSP-P32: Deep Learning Training Methods III
Poster Zone 2A
BISP-P10: Biological and medical signal and image processing
Poster Zone 2B
IVMSP-P20: 3D image and video processing and analysis II
Poster Zone 2C
SLP-P35: Text to Speech Generation - P3
Poster Zone 3A
MLSP-P33: Distributed and Federated Learning III
Poster Zone 3B
SLP-P36: machine learning methods for language
Poster Zone 3C
MLSP-P34: Self-Supervised and Semi-Supervised Learning III
Poster Zone 4A
SLP-P37: Speech enhancement and separation III
Poster Zone 4B
SLP-P38: Anti-spoofing and Speaker Embedding
Poster Zone 4C
SLP-P39: Speech Emotion Recognition and Analysis V
Poster Zone 5A
BISP-P11: Medical image detection and segmentation-3
Poster Zone 5B
AASP-P17: Speech enhancement 2; Dereverberation and RIR estimation 2
Poster Zone 5C
BISP-P12: Bioinformatics
Poster Zone 6A
IVMSP-P21: Segmentation
Poster Zone 6B
MLSP-P35: Other Machine Learning Applications II
Poster Zone 6C
Fri 08:20 - 18:00
Industry Exhibitions
Hall D2
Fri 10:00 - 12:00
Short Course
SC-2e: Practical Guide to Computational Imaging: From Basics to Brilliance (Part 5)
Room E5
Short Course
SC-3e: RF Sensing for Wireless AI Perception: Theories, Algorithms, and Applications (Part 5)
Room E6
Short Course
SC-4e: Multi-Agent Optimization and Learning (Part 5)
Room E7
Fri 10:40 - 11:40
Plenary
PLEN-4: Jitendra Malik "Reconstructing and Recognizing Human Actions in Video"
Auditorium
Fri 13:10 - 15:10
MLSP-L22: Deep Learning Training Methods I
Room 103
IVMSP-L14: Segmentation
Room 101
IVMSP-L15: 3D generation
Room 102
Special Session
SS-L22: Efficient Modeling of Long Sequences with Applications to Speech and Audio
Room 104
IFS-L3: Adversarial Machine Learning
Room 105
IFS-L4: Biometrics-2
Room E1
IFS-L5: Multimedia Forensics
Room E3
SLP-L26: Machine translation for spoken and written language I
Room 201
MLSP-L23: Machine Learning for Time Series Analysis I
Room E2
MLSP-L24: Data Mining and Big Data I
Room E4
Special Session
SS-L23: Decentralized learning with resource-constrained communication
Room 205A
Special Session
SS-L24: Localization and Sensing based on Signals from Terrestrial and Non-Terrestrial Networks
Room 205B
Special Session
SS-L25: Signal Processing and Machine Learning for Understanding Brain Dynamics
Room 209A
GC-L7: Multimodal Information Based Speech Processing (MISP) 2023 Challenge
Room E8
GC-L10: ICASSP 2024 SPEECH SIGNAL IMPROVEMENT CHALLENGE
Room 209B
XAI-SA: ICASSP 2024 Workshop on Explainable AI for Speech and Audio
Workshop Lectures
WS-1: Deep Neural Network Model Compression
Workshop Lectures
WS-11: Signal Processing and Machine Learning Advances in Automotive Radars
Workshop Lectures
WS-12: Radio Maps and Their Applications (RMA)
Workshop Lectures
WS-13: Super-resolution integrated communications, localization, vision and radio mapping (SUPER-CLAM)
Workshop Lectures
WS-14: Fearless Steps APOLLO: A Naturalistic Team based Speech Communications Community Resource (FS-APOLLO)
Workshop Lectures
WS-2: Trustworthy Speech Processing (TSP)
Workshop Lectures
WS-5: Computational Imaging Using Synthetic Apertures
Workshop Lectures
WS-6: Timely and Private Machine Learning over Networks
Workshop Lectures
WS-9: SPID-CPS: Signal Processing for resilient Intrusion Detection in Cyber-Physical Systems
Workshop Lectures
IVMSP-P22: Action recognition
Poster Zone 1A
SPTM-P8: Signal processing theory and methods journal papers
Poster Zone 1B
IVMSP-P23: Machine learning for image and video processing II
Poster Zone 1C
SAM-P5: Multi-sensor and multichannel signal processing
Poster Zone 2A
SAM-P6: Array processing and beamforming
Poster Zone 2B
AASP-P18: Sound event classification and generation; Active noise control, echo reduction and feedback reduction
Poster Zone 2C
MLSP-P36: Graph Neural Networks IV
Poster Zone 3A
MLSP-P37: Deep Learning Fairness and Privacy
Poster Zone 3B
MLSP-P38: Pattern Recognition and Classification III
Poster Zone 3C
SPTM-P9: Sparsity and Low-Rank Models
Poster Zone 4C
SPTM-P10: Optimization methods for signal processing
Poster Zone 5A
MMSP-P8: Multimodal Processing
Poster Zone 5B
IVMSP-P24: Image and video coding/compression II
Poster Zone 5C
Fri 14:00 - 15:00
Short Course
SC-2f: Practical Guide to Computational Imaging: From Basics to Brilliance (Part 6)
Room E5
Short Course
SC-3f: RF Sensing for Wireless AI Perception: Theories, Algorithms, and Applications (Part 6)
Room E6
Short Course
SC-4f: Multi-Agent Optimization and Learning (Part 6)
Room E7
Fri 15:10 - 16:10
Industry
Plenary
IPLEN-3: Joohyung Lee
Auditorium
Fri 16:30 - 18:00
Closing Ceremony
Auditorium