Hao Shi, Tatsuya Kawahara, Graduate School of Informatics, Kyoto University, Japan; Longbiao Wang, Tianjin University, China; Sheng Li, National Institute of Information and Communications Technology (NICT), Japan; Cunhang Fan, Anhui Province Key Laboratory of Multimodal Cognitive Computation, School of Computer Science and Technology, Anhui University, China; Jianwu Dang, Japan Advanced Institute of Science and Technology, Ishikawa, Japan