AASP-L3: Environmental Sound Synthesis and Generation
Tue, 16 Apr, 16:30 - 18:30 (UTC +9)
Location: Room E2
Session Type: Lecture
Session Co-Chairs: Francois Germain, Mitsubishi Electric Research Laboratories and Prem Seetharaman, Adobe Research
Track: Audio and Acoustic Signal Processing
Tue, 16 Apr, 16:30 - 16:50 (UTC +9)
AASP-L3.1: Environmental sound synthesis from vocal imitations and sound event labels
Tue, 16 Apr, 17:10 - 17:30 (UTC +9)
AASP-L3.3: SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation
Tue, 16 Apr, 17:30 - 17:50 (UTC +9)
AASP-L3.4: MTDiffusion: Multi-Task Diffusion Model with Dual-UNet for Foley Sound Generation
Tue, 16 Apr, 17:50 - 18:10 (UTC +9)
AASP-L3.5: Generation or Replication: Auscultating Audio Latent Diffusion Models
Tue, 16 Apr, 18:10 - 18:30 (UTC +9)