MLSP-L20: Reinforcement Learning I
Fri, 19 Apr, 08:20 - 10:20 (UTC +9)
Location: Room 105
Session Type: Lecture
Session Co-Chairs: Ville Hautamäki, University of Eastern Finland and Che Lin, National Taiwan University
Track: Machine Learning for Signal Processing
Click the to view the manuscript on IEEE Xplore Open Preview
Fri, 19 Apr, 08:20 - 08:40 (UTC +9)
 

MLSP-L20.1: MULTI-AGENT EXPLORATION VIA SELF-LEARNING AND SOCIAL LEARNING

Shaokang Dong, Chao Li, Wubing Chen, Hongye Cao, Wenbin Li, Yang Gao, Nanjing University, China
Fri, 19 Apr, 08:40 - 09:00 (UTC +9)
 

MLSP-L20.2: M$^3$ARL: Moment-Embedded Mean-Field Multi-Agent Reinforcement Learning for Continuous Action Space

Huaze Tang, Yuanquan Hu, Fanfan Zhao, Junji Yan, Ting Dong, Wenbo Ding, Tsinghua University, China
Fri, 19 Apr, 09:00 - 09:20 (UTC +9)
 

MLSP-L20.3: Zero-shot Imitation Policy via Search in Demonstration Dataset

Federico Malato, University of Eastern Finland, Finland; Florian Leopold, Andrew Melnik, Bielefeld University, Germany; Ville Hautamäki, University of Eastern Finland, Finland
Fri, 19 Apr, 09:20 - 09:40 (UTC +9)
 

MLSP-L20.4: Adaptive parameter sharing for multi-agent reinforcement learning

Dapeng Li, Institute of Automation, Chinese Academy of Sciences. School of Artificial Intelligence, University of Chinese Academy of Sciences., China; Na Lou, Institute of Automation, Chinese Academy of Sciences, China; Bin Zhang, Zhiwei Xu, Guoliang Fan, Institute of Automation, Chinese Academy of Sciences, China
Fri, 19 Apr, 09:40 - 10:00 (UTC +9)
 

MLSP-L20.5: A META-PRECONDITIONING APPROACH FOR DEEP Q-LEARNING

Spilios Evmorfos, Athina Petropulu, RUTGERS UNIVERSITY, United States of America
Fri, 19 Apr, 10:00 - 10:20 (UTC +9)
 

MLSP-L20.6: MEPE: A Minimalist Ensemble Policy Evaluation Operator for Deep Reinforcement Learning

Qiang He, Xinwen Hou, Institute of Automation, Chinese Academy of Sciences, Germany