Login Paper Search My Schedule Paper Index Help

My ICASSP 2020 Schedule

Note: Your custom schedule will not be saved unless you create a new account or login to an existing account.
  1. Create a login based on your email (takes less than one minute)
  2. Perform 'Paper Search'
  3. Select papers that you desire to save in your personalized schedule
  4. Click on 'My Schedule' to see the current list of selected papers
  5. Click on 'Printable Version' to create a separate window suitable for printing (the header and menu will appear, but will not actually print)
Click on the icon to view the manuscript on IEEE XPlore in the IEEE ICASSP 2020 Open Preview.

Clicking on the Add button next to a paper title will add that paper to your custom schedule.
Clicking on the Remove button next to a paper will remove that paper from your custom schedule.

TU2.I: Reinforcement and Sequential Learning

Session Type: Poster
Time: Tuesday, 5 May, 16:30 - 18:30
Location: On-Demand
Session Chair: Jie Ding, University of Minnesota
 
   TU2.I.1: HIERARCHICAL CACHING VIA DEEP REINFORCEMENT LEARNING
         Alireza Sadeghi; University of Minnesota
         Gang Wang; University of Minnesota
         Georgios B. Giannakis; University of Minnesota
 
   TU2.I.2: LEARNING NETWORK REPRESENTATION THROUGH REINFORCEMENT LEARNING
         Siqi Shen; National University of Defense Technology
         Yongquan Fu; National University of Defense Technology
         Adele Lu Jia; China Agricultural University
         Huayou Su; National University of Defense Technology
         Qinglin Wang; National University of Defense Technology
         Chengsong Wang; National University of Defense Technology
         Yong Dou; National University of Defense Technology
 
   TU2.I.3: ATTENTION-BASED CURIOSITY-DRIVEN EXPLORATION IN DEEP REINFORCEMENT LEARNING
         Patrik Reizinger; Budapest University of Technology and Economics
         Márton Szemenyei; Budapest University of Technology and Economics
 
   TU2.I.4: STABILIZING MULTI-AGENT DEEP REINFORCEMENT LEARNING BY IMPLICITLY ESTIMATING OTHER AGENTS’ BEHAVIORS
         Yue Jin; Tsinghua University
         Shuangqing Wei; Louisiana State University
         Jian Yuan; Tsinghua University
         Xudong Zhang; Tsinghua University
         Chao Wang; Tsinghua University
 
   TU2.I.5: QOS-AWARE FLOW CONTROL FOR POWER-EFFICIENT DATA CENTER NETWORKS WITH DEEP REINFORCEMENT LEARNING
         Penghao Sun; National Digital Switching System Engineering & Technological R&D Center
         Zehua Guo; Beijing Institute of Technology
         Sen Liu; Central South University
         Julong Lan; Central South University
         Yuxiang Hu; Central South University
 
   TU2.I.6: IMPROVING THE SCALABILITY OF DEEP REINFORCEMENT LEARNING-BASED ROUTING WITH CONTROL ON PARTIAL NODES
         Penghao Sun; National Digital Switching System Engineering & Technological R&D Center
         Julong Lan; National Digital Switching System Engineering & Technological R&D Center
         Zehua Guo; Beijing Institute of Technology
         Yang Xu; Fudan University
         Yuxiang Hu; National Digital Switching System Engineering & Technological R&D Center
 
   TU2.I.7: GENERALIZED LINEAR BANDITS WITH SAFETY CONSTRAINTS
         Sanae Amani; University of California, Santa Barbara
         Mahnoosh Alizadeh; University of California, Santa Barbara
         Christos Thrampoulidis; University of California, Santa Barbara
 
   TU2.I.8: FROM VIDEO GAME TO REAL ROBOT: THE TRANSFER BETWEEN ACTION SPACES
         Janne Karttunen; Karelics Oy
         Anssi Kanervisto; University of Eastern Finland
         Ville Kyrki; Aalto University
         Ville Hautamäki; University of Eastern Finland
 
   TU2.I.9: CORRELATED MULTI-ARMED BANDITS WITH A LATENT RANDOM SOURCE
         Samarth Gupta; Carnegie Mellon University
         Gauri Joshi; Carnegie Mellon University
         Osman Yagan; Carnegie Mellon University
 
   TU2.I.10: ADAPTIVE SEQUENTIAL INTERPOLATOR USING ACTIVE LEARNING FOR EFFICIENT EMULATION OF COMPLEX SYSTEMS
         Luca Martino; Universidad Rey Juan Carlos
         Daniel Heestermans Svendsen; Universitat de Valencia
         Jorge Vicent; Universitat of Valencia and Magellium Company in Geoinformation and Image Processing
         Gustau Camps-Valls; Universitat de Valencia
 
   TU2.I.11: CONTINUAL LEARNING FOR INFINITE HIERARCHICAL CHANGE-POINT DETECTION
         Pablo Moreno-Muñoz; Universidad Carlos III de Madrid
         David Ramírez; Universidad Carlos III de Madrid
         Antonio Artés-Rodríguez; Universidad Carlos III de Madrid