| MMSP-P3: Multimedia Signal Processing |
| Session Type: Poster |
| Time: Thursday, 7 May, 16:30 - 18:30 |
| Location: On-Demand |
| Session Chair: Shrikanth Narayanan, University of Southern California |
| MMSP-P3.1: AVA ACTIVE SPEAKER: AN AUDIO-VISUAL DATASET FOR ACTIVE SPEAKER DETECTION |
| Joseph Roth; Google, Inc. |
| Sourish Chaudhuri; Google, Inc. |
| Ondrej Klejch; Google, Inc. |
| Radhika Marvin; Google, Inc. |
| Andrew Gallagher; Google, Inc. |
| Liat Kaver; Google, Inc. |
| Sharadh Ramaswamy; Google, Inc. |
| Arkadiusz Stopczynski; Google, Inc. |
| Cordelia Schmid; Google, Inc. |
| Zhonghua Xi; Google, Inc. |
| Caroline Pantofaru; Google, Inc. |
| MMSP-P3.2: SUPERVISED DEEP HASHING FOR EFFICIENT AUDIO EVENT RETRIEVAL |
| Arindam Jati; University of Southern California |
| Dimitra Emmanouilidou; Microsoft Research |
| MMSP-P3.3: AN LSTM-BASED DYNAMIC CHORD PROGRESSION GENERATION SYSTEM FOR INTERACTIVE MUSIC PERFORMANCE |
| Christos Garoufis; National Technical University of Athens |
| Athanasia Zlatintsi; National Technical University of Athens |
| Petros Maragos; National Technical University of Athens |
| MMSP-P3.5: ENSEMBLE NETWORK FOR RANKING IMAGES BASED ON VISUAL APPEAL |
| Sachin Singh; Indian Institute of Technology Kanpur |
| Victor Sanchez; University of Warwick |
| Tanaya Guha; University of Warwick |
| MMSP-P3.6: TRAPEZOIDAL SEGMENT SEQUENCING: A NOVEL APPROACH FOR FUSION OF HUMAN-PRODUCED CONTINUOUS ANNOTATIONS |
| Brandon Booth; University of Southern California |
| Shrikanth Narayanan; University of Southern California |
| MMSP-P3.7: SEQUENCE-TO-SEQUENCE LABANOTATION GENERATION BASED ON MOTION CAPTURE DATA |
| Min Li; Beijing Jiaotong University |
| Zhenjiang Miao; Beijing Jiaotong University |
| Cong Ma; Beijing Jiaotong University |
| MMSP-P3.8: POSE REFINEMENT: BRIDGING THE GAP BETWEEN UNSUPERVISED LEARNING AND GEOMETRIC METHODS FOR VISUAL ODOMETRY |
| Lanqing Zhang; Peking University Shenzhen Graduate School |
| Ge Li; Peking University Shenzhen Graduate School |
| Thomas H. Li; Peking University |
| MMSP-P3.9: MULTIMODAL ACTIVE SPEAKER DETECTION AND VIRTUAL CINEMATOGRAPHY FOR VIDEO CONFERENCING |
| Ross Cutler; Microsoft |
| Ramin Mehran; Zillow |
| Sam Johnson; Facebook |
| Cha Zhang; Microsoft |
| Adam Kirk; Omnivor |
| Oliver Whyte; Omnivor |
| Adarsh Kowdle; perceptiveIO |
| MMSP-P3.10: A NEW VARIATIONAL METHOD FOR DEEP SUPERVISED SEMANTIC IMAGE HASHING |
| Furen Zhuang; University of Illinois at Urbana-Champaign |
| Pierre Moulin; University of Illinois at Urbana-Champaign |