Technical Program

Paper Detail

Paper IDC-2-3.4
Paper Title MULTIPLE TARGET PREDICTION FOR DEEP REINFORCEMENT LEARNING
Authors Jen-Tzung Chien, Po-Yen Hung, National Chiao Tung University, Taiwan
Session C-2-3: Machine Learning and Data Analysis 1
TimeWednesday, 09 December, 17:15 - 19:15
Presentation Time:Wednesday, 09 December, 18:00 - 18:15 Check your Time Zone
All times are in New Zealand Time (UTC +13)
Topic Machine Learning and Data Analytics (MLDA):
Abstract This paper presents the multiple target prediction for model-free deep reinforcement learning. Traditionally, the learning algorithm based on deep Q network (DQN) suffers from slow convergence in learning process which usually constrains the system performance. To speed up the learning process, this study incorporates the prediction network and the auxiliary replay memory in DQN which allows us to predict multiple target values over different actions rather than relying on a single target value from an action. The prediction network is trained in a sequential manner by using the samples from replay memory while the auxiliary replay memory is introduced to store the states and the predicted targets which are related to individual possible actions that are taken. With these two additional components functioned in DQN algorithm, the resulting approach can efficiently predict Q values and rewards to train an agent with comprehensive target values. The experiments on different tasks demonstrate the merit of multiple target prediction in reinforcement learning.