MLSP-P3: Reinforcement and Sequential Learning |
Session Type: Poster |
Time: Tuesday, 5 May, 16:30 - 18:30 |
Location: On-Demand |
Virtual Session: View on Virtual Platform |
Session Chair: Jie Ding, University of Minnesota |
MLSP-P3.1: HIERARCHICAL CACHING VIA DEEP REINFORCEMENT LEARNING |
Alireza Sadeghi; University of Minnesota |
Gang Wang; University of Minnesota |
Georgios B. Giannakis; University of Minnesota |
MLSP-P3.2: LEARNING NETWORK REPRESENTATION THROUGH REINFORCEMENT LEARNING |
Siqi Shen; National University of Defense Technology |
Yongquan Fu; National University of Defense Technology |
Adele Lu Jia; China Agricultural University |
Huayou Su; National University of Defense Technology |
Qinglin Wang; National University of Defense Technology |
Chengsong Wang; National University of Defense Technology |
Yong Dou; National University of Defense Technology |
MLSP-P3.3: ATTENTION-BASED CURIOSITY-DRIVEN EXPLORATION IN DEEP REINFORCEMENT LEARNING |
Patrik Reizinger; Budapest University of Technology and Economics |
Márton Szemenyei; Budapest University of Technology and Economics |
MLSP-P3.4: STABILIZING MULTI-AGENT DEEP REINFORCEMENT LEARNING BY IMPLICITLY ESTIMATING OTHER AGENTS’ BEHAVIORS |
Yue Jin; Tsinghua University |
Shuangqing Wei; Louisiana State University |
Jian Yuan; Tsinghua University |
Xudong Zhang; Tsinghua University |
Chao Wang; Tsinghua University |
MLSP-P3.5: QOS-AWARE FLOW CONTROL FOR POWER-EFFICIENT DATA CENTER NETWORKS WITH DEEP REINFORCEMENT LEARNING |
Penghao Sun; National Digital Switching System Engineering & Technological R&D Center |
Zehua Guo; Beijing Institute of Technology |
Sen Liu; Central South University |
Julong Lan; Central South University |
Yuxiang Hu; Central South University |
MLSP-P3.6: IMPROVING THE SCALABILITY OF DEEP REINFORCEMENT LEARNING-BASED ROUTING WITH CONTROL ON PARTIAL NODES |
Penghao Sun; National Digital Switching System Engineering & Technological R&D Center |
Julong Lan; National Digital Switching System Engineering & Technological R&D Center |
Zehua Guo; Beijing Institute of Technology |
Yang Xu; Fudan University |
Yuxiang Hu; National Digital Switching System Engineering & Technological R&D Center |
MLSP-P3.7: GENERALIZED LINEAR BANDITS WITH SAFETY CONSTRAINTS |
Sanae Amani; University of California, Santa Barbara |
Mahnoosh Alizadeh; University of California, Santa Barbara |
Christos Thrampoulidis; University of California, Santa Barbara |
MLSP-P3.8: FROM VIDEO GAME TO REAL ROBOT: THE TRANSFER BETWEEN ACTION SPACES |
Janne Karttunen; Karelics Oy |
Anssi Kanervisto; University of Eastern Finland |
Ville Kyrki; Aalto University |
Ville Hautamäki; University of Eastern Finland |
MLSP-P3.9: CORRELATED MULTI-ARMED BANDITS WITH A LATENT RANDOM SOURCE |
Samarth Gupta; Carnegie Mellon University |
Gauri Joshi; Carnegie Mellon University |
Osman Yagan; Carnegie Mellon University |
MLSP-P3.10: ADAPTIVE SEQUENTIAL INTERPOLATOR USING ACTIVE LEARNING FOR EFFICIENT EMULATION OF COMPLEX SYSTEMS |
Luca Martino; Universidad Rey Juan Carlos |
Daniel Heestermans Svendsen; Universitat de Valencia |
Jorge Vicent; Universitat of Valencia and Magellium Company in Geoinformation and Image Processing |
Gustau Camps-Valls; Universitat de Valencia |
MLSP-P3.11: CONTINUAL LEARNING FOR INFINITE HIERARCHICAL CHANGE-POINT DETECTION |
Pablo Moreno-Muñoz; Universidad Carlos III de Madrid |
David Ramírez; Universidad Carlos III de Madrid |
Antonio Artés-Rodríguez; Universidad Carlos III de Madrid |