SPE-L9: Multimodal Processing of Language |
Session Type: Lecture |
Time: Wednesday, 6 May, 11:30 - 13:30 |
Location: On-Demand |
Virtual Session: View on Virtual Platform |
Session Chair: Theodora Chaspari, Texas A&M University |
SPE-L9.1: COGANS FOR UNSUPERVISED VISUAL SPEECH ADAPTATION TO NEW SPEAKERS |
Adriana Fernandez-Lopez; Pompeu Fabra University |
Ali Karaali; Trinity College Dublin |
Naomi Harte; Trinity College Dublin |
Federico M. Sukno; Pompeu Fabra University |
SPE-L9.2: VISUALLY GUIDED SELF SUPERVISED LEARNING OF SPEECH REPRESENTATIONS |
Abhinav Shukla; Imperial College London |
Konstantinos Vougioukas; Imperial College London |
Pingchuan Ma; Imperial College London |
Stavros Petridis; Imperial College London |
Maja Pantic; Imperial College London |
SPE-L9.3: LOOKING ENHANCES LISTENING: RECOVERING MISSING SPEECH USING IMAGES |
Tejas Srinivasan; Carnegie Mellon University |
Ramon Sanabria; University of Edinburgh |
Florian Metze; Carnegie Mellon University |
SPE-L9.4: TOWARDS MULTILINGUAL SIGN LANGUAGE RECOGNITION |
Sandrine Tornay; Idiap Research Institute |
Marzieh Razavi; Telepathy Labs GmbH |
Mathew Magimai.-Doss; Idiap Research Institute |
SPE-L9.5: AUTOMATIC IDENTIFICATION OF SPEAKERS FROM HEAD GESTURES IN A NARRATION |
Sanjeev Kadagathur Vadiraj; Indian Institute of Science, Bangalore |
Achuth Rao M V; Indian Institute of Science, Bangalore |
Prasanta Kumar Ghosh; Indian Institute of Science, Bangalore |
SPE-L9.6: LIPREADING USING TEMPORAL CONVOLUTIONAL NETWORKS |
Brais Martinez; Samsung |
Pingchuan Ma; Imperial College London |
Stavros Petridis; Imperial College London |
Maja Pantic; Imperial College London and Samsung |