MMSP-P3: Multimedia Signal Processing |
Session Type: Poster |
Time: Thursday, 7 May, 16:30 - 18:30 |
Location: On-Demand |
Virtual Session: View on Virtual Platform |
Session Chair: Shrikanth Narayanan, University of Southern California
|
|
MMSP-P3.1: AVA ACTIVE SPEAKER: AN AUDIO-VISUAL DATASET FOR ACTIVE SPEAKER DETECTION |
Joseph Roth; Google, Inc. |
Sourish Chaudhuri; Google, Inc. |
Ondrej Klejch; Google, Inc. |
Radhika Marvin; Google, Inc. |
Andrew Gallagher; Google, Inc. |
Liat Kaver; Google, Inc. |
Sharadh Ramaswamy; Google, Inc. |
Arkadiusz Stopczynski; Google, Inc. |
Cordelia Schmid; Google, Inc. |
Zhonghua Xi; Google, Inc. |
Caroline Pantofaru; Google, Inc. |
|
MMSP-P3.2: SUPERVISED DEEP HASHING FOR EFFICIENT AUDIO EVENT RETRIEVAL |
Arindam Jati; University of Southern California |
Dimitra Emmanouilidou; Microsoft Research |
|
MMSP-P3.3: AN LSTM-BASED DYNAMIC CHORD PROGRESSION GENERATION SYSTEM FOR INTERACTIVE MUSIC PERFORMANCE |
Christos Garoufis; National Technical University of Athens |
Athanasia Zlatintsi; National Technical University of Athens |
Petros Maragos; National Technical University of Athens |
|
MMSP-P3.5: ENSEMBLE NETWORK FOR RANKING IMAGES BASED ON VISUAL APPEAL |
Sachin Singh; Indian Institute of Technology Kanpur |
Victor Sanchez; University of Warwick |
Tanaya Guha; University of Warwick |
|
MMSP-P3.6: TRAPEZOIDAL SEGMENT SEQUENCING: A NOVEL APPROACH FOR FUSION OF HUMAN-PRODUCED CONTINUOUS ANNOTATIONS |
Brandon Booth; University of Southern California |
Shrikanth Narayanan; University of Southern California |
|
MMSP-P3.7: SEQUENCE-TO-SEQUENCE LABANOTATION GENERATION BASED ON MOTION CAPTURE DATA |
Min Li; Beijing Jiaotong University |
Zhenjiang Miao; Beijing Jiaotong University |
Cong Ma; Beijing Jiaotong University |
|
MMSP-P3.8: POSE REFINEMENT: BRIDGING THE GAP BETWEEN UNSUPERVISED LEARNING AND GEOMETRIC METHODS FOR VISUAL ODOMETRY |
Lanqing Zhang; Peking University Shenzhen Graduate School |
Ge Li; Peking University Shenzhen Graduate School |
Thomas H. Li; Peking University |
|
MMSP-P3.9: MULTIMODAL ACTIVE SPEAKER DETECTION AND VIRTUAL CINEMATOGRAPHY FOR VIDEO CONFERENCING |
Ross Cutler; Microsoft |
Ramin Mehran; Zillow |
Sam Johnson; Facebook |
Cha Zhang; Microsoft |
Adam Kirk; Omnivor |
Oliver Whyte; Omnivor |
Adarsh Kowdle; perceptiveIO |
|
MMSP-P3.10: A NEW VARIATIONAL METHOD FOR DEEP SUPERVISED SEMANTIC IMAGE HASHING |
Furen Zhuang; University of Illinois at Urbana-Champaign |
Pierre Moulin; University of Illinois at Urbana-Champaign |
|