AASP-L3.3

Residual Tokens Enhance Masked Autoencoders for Speech Modeling

Samir Sadok, Stéphane Lathuilière, Xavier Alameda-Pineda, INRIA, France

Session:
AASP-L3: Neural Speech and Audio Coding Oral

Track:
Audio and Acoustic Signal Processing [AA]

Location:
Room 127+128

Presentation Time:
Wed, 6 May, 09:40 - 10:00

Presentation
Discussion
Resources
No resources available.
Contacts