AASP-L5: Audio Understanding and Generation
Wed, 6 May, 16:30 - 18:30
Location: Room 127+128
Session Type: Oral
Track: Audio and Acoustic Signal Processing [AA]
Wed, 6 May, 16:50 - 17:10
AASP-L5.2: LAMB: LLM-BASED AUDIO CAPTIONING WITH MODALITY GAP BRIDGING VIA CAUCHY-SCHWARZ DIVERGENCE
Wed, 6 May, 17:10 - 17:30
AASP-L5.3: PICOAUDIO2: TEMPORAL CONTROLLABLE TEXT-TO-AUDIO GENERATION WITH NATURAL LANGUAGE DESCRIPTION
Wed, 6 May, 18:10 - 18:30