MLSP-P2: Reinforcement and Transfer Learning |
| Session Type: Poster |
| Time: Tuesday, May 14, 13:30 - 15:30 |
| Location: Poster Area H, East Bar, First Floor |
| Session Chair: Namrata Vaswani, Iowa State University |
| MLSP-P2.1: REINFORCEMENT LEARNING WITH SAFE EXPLORATION FOR NETWORK SECURITY |
| Canhuang Dai; Xiamen University |
| Liang Xiao; Xiamen University |
| Xiaoyue Wan; Xiamen University |
| Ye Chen; Xiamen University |
| MLSP-P2.2: TEAM POLICY LEARNING FOR MULTI-AGENT REINFORCEMENT LEARNING |
| Lucas Cassano; University of California, Los Angeles |
| Sulaiman Alghunaim; University of California, Los Angeles |
| Ali H. Sayed; École Polytechnique Fédérale de Lausanne |
| MLSP-P2.3: DEEP REINFORCEMENT LEARNING FOR FINANCIAL TRADING USING PRICE TRAILING |
| Konstantinos Saitas Zarkias; Aristotle University of Thessaloniki |
| Nikolaos Passalis; Aristotle University of Thessaloniki |
| Avraam Tsantekidis; Aristotle University of Thessaloniki |
| Anastasios Tefas; Aristotle University of Thessaloniki |
| MLSP-P2.4: JOINT ON-LINE LEARNING OF A ZERO-SHOT SPOKEN SEMANTIC PARSER AND A REINFORCEMENT LEARNING DIALOGUE MANAGER |
| Matthieu Riou; University of Avignon |
| Bassam Jabaian; University of Avignon |
| Stéphane Huet; University of Avignon |
| Fabrice Lefèvre; University of Avignon |
| MLSP-P2.5: GRAPH SIGNAL SAMPLING VIA REINFORCEMENT LEARNING |
| Oleksii Abramenko; Aalto University |
| Alexander Jung; Aalto University |
| MLSP-P2.6: BHATTACHARYYA DISTANCE-BASED TRANSFER LEARNING FOR A HYBRID EEG-FTCD BRAIN-COMPUTER INTERFACE |
| Elise Dagois; University of Pittsburgh |
| Aya Khalaf; University of Pittsburgh |
| Ervin Sejdic; University of Pittsburgh |
| Murat Akçakaya; University of Pittsburgh |
| MLSP-P2.7: A SUBJECT-TO-SUBJECT TRANSFER LEARNING FRAMEWORK BASED ON JENSEN-SHANNON DIVERGENCE FOR IMPROVING BRAIN-COMPUTER INTERFACE |
| Joshua Giles; University of Sheffield |
| Kai Keng Ang; Institute for Infocomm Research |
| Lyudmila Mihaylova; University of Sheffield |
| Mahnaz Arvaneh; University of Sheffield |
| MLSP-P2.8: CONTENT PLACEMENT LEARNING FOR SUCCESS PROBABILITY MAXIMIZATION IN WIRELESS EDGE CACHING NETWORKS |
| Navneet Garg; University of Edinburgh |
| Mathini Sellathurai; Heriot-Watt University |
| Tharmalingam Ratnarajah; University of Edinburgh |
| MLSP-P2.9: ACTIVE LEARNING WITH LABEL PROPORTIONS |
| Rafael Poyiadzis; University of Bristol |
| Raul Santos-Rodriguez; University of Bristol |
| Niall Twomey; University of Bristol |
| MLSP-P2.10: COVER: A CLUSTER-BASED VARIANCE REDUCED METHOD FOR ONLINE LEARNING |
| Kun Yuan; University of California, Los Angeles |
| Bicheng Ying; University of California, Los Angeles |
| Ali H. Sayed; École Polytechnique Fédérale de Lausanne |
| MLSP-P2.11: FLEXIBLE NON-NEGATIVE MATRIX FACTORIZATION WITH ADAPTIVELY LEARNED GRAPH REGULARIZATION |
| Yong Peng; Hangzhou Dianzi University |
| Yanfang Long; Hangzhou Dianzi University |
| Feiwei Qin; Hangzhou Dianzi University |
| Wanzeng Kong; Hangzhou Dianzi University |
| Feiping Nie; Northwestern Polytechnical University |
| Andrzej Cichocki; Skolkovo Institute of Science and Technology |