MMSP-L2: Audio-Visual Speech Processing
      Wed, 17 Apr, 08:20 - 10:20  (UTC +9)
            
          Location: Room E1
            Session Type: Lecture
            Session Co-Chairs: Li Liu, HKUST Guangzhou, China and Prasanta Ghosh, Indian Institute of Science (IISc), Bangalore
                        Track: Multimedia Signal Processing
          
        Click the  to view the manuscript on IEEE Xplore Open Preview
      
    Wed, 17 Apr, 08:20 - 08:40  (UTC +9)
               MMSP-L2.1: THE MULTIMODAL INFORMATION BASED SPEECH PROCESSING (MISP) 2023 CHALLENGE: AUDIO-VISUAL TARGET SPEAKER EXTRACTION
Wed, 17 Apr, 08:40 - 09:00  (UTC +9)
               MMSP-L2.2: HOURGLASS-AVSR: DOWN-UP SAMPLING-BASED COMPUTATIONAL EFFICIENCY MODEL FOR AUDIO-VISUAL SPEECH RECOGNITION
Wed, 17 Apr, 09:00 - 09:20  (UTC +9)
               MMSP-L2.3: TALKNCE: IMPROVING ACTIVE SPEAKER DETECTION WITH TALK-AWARE CONTRASTIVE LEARNING
Wed, 17 Apr, 09:20 - 09:40  (UTC +9)
               MMSP-L2.4: MLCA-AVSR: MULTI-LAYER CROSS ATTENTION FUSION BASED AUDIO-VISUAL SPEECH RECOGNITION
Wed, 17 Apr, 09:40 - 10:00  (UTC +9)
               MMSP-L2.5: AUDIO-VISUAL SPEECH RECOGNITION IN-THE-WILD: MULTI-ANGLE VEHICLE CABIN CORPUS AND ATTENTION-BASED METHOD
Wed, 17 Apr, 10:00 - 10:20  (UTC +9)