MLSP-P2: Reinforcement and Transfer Learning |
Session Type: Poster |
Time: Tuesday, May 14, 13:30 - 15:30 |
Location: Poster Area H, East Bar, First Floor |
Session Chair: Namrata Vaswani, Iowa State University |
MLSP-P2.1: REINFORCEMENT LEARNING WITH SAFE EXPLORATION FOR NETWORK SECURITY |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Canhuang Dai; Xiamen University |
Liang Xiao; Xiamen University |
Xiaoyue Wan; Xiamen University |
Ye Chen; Xiamen University |
MLSP-P2.2: TEAM POLICY LEARNING FOR MULTI-AGENT REINFORCEMENT LEARNING |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Lucas Cassano; University of California, Los Angeles |
Sulaiman Alghunaim; University of California, Los Angeles |
Ali H. Sayed; École Polytechnique Fédérale de Lausanne |
MLSP-P2.3: DEEP REINFORCEMENT LEARNING FOR FINANCIAL TRADING USING PRICE TRAILING |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Konstantinos Saitas Zarkias; Aristotle University of Thessaloniki |
Nikolaos Passalis; Aristotle University of Thessaloniki |
Avraam Tsantekidis; Aristotle University of Thessaloniki |
Anastasios Tefas; Aristotle University of Thessaloniki |
MLSP-P2.4: JOINT ON-LINE LEARNING OF A ZERO-SHOT SPOKEN SEMANTIC PARSER AND A REINFORCEMENT LEARNING DIALOGUE MANAGER |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Matthieu Riou; University of Avignon |
Bassam Jabaian; University of Avignon |
Stéphane Huet; University of Avignon |
Fabrice Lefèvre; University of Avignon |
MLSP-P2.5: GRAPH SIGNAL SAMPLING VIA REINFORCEMENT LEARNING |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Oleksii Abramenko; Aalto University |
Alexander Jung; Aalto University |
MLSP-P2.6: BHATTACHARYYA DISTANCE-BASED TRANSFER LEARNING FOR A HYBRID EEG-FTCD BRAIN-COMPUTER INTERFACE |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Elise Dagois; University of Pittsburgh |
Aya Khalaf; University of Pittsburgh |
Ervin Sejdic; University of Pittsburgh |
Murat Akçakaya; University of Pittsburgh |
MLSP-P2.7: A SUBJECT-TO-SUBJECT TRANSFER LEARNING FRAMEWORK BASED ON JENSEN-SHANNON DIVERGENCE FOR IMPROVING BRAIN-COMPUTER INTERFACE |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Joshua Giles; University of Sheffield |
Kai Keng Ang; Institute for Infocomme Research |
Lyudmila Mihaylova; University of Sheffield |
Mahnaz Arvaneh; University of Sheffield |
MLSP-P2.8: CONTENT PLACEMENT LEARNING FOR SUCCESS PROBABILITY MAXIMIZATION IN WIRELESS EDGE CACHING NETWORKS |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Navneet Garg; University of Edinburgh |
Mathini Sellathurai; Heriot-Watt University |
Tharmalingam Ratnarajah; The University of Edinburgh |
MLSP-P2.9: ACTIVE LEARNING WITH LABEL PROPORTIONS |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Rafael Poyiadzis; University of Bristol |
Raul Santos-Rodriguez; University of Bristol |
Niall Twomey; University of Bristol |
MLSP-P2.10: COVER: A CLUSTER-BASED VARIANCE REDUCED METHOD FOR ONLINE LEARNING |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Kun Yuan; University of California, Los Angeles |
Bicheng Ying; University of California, Los Angeles |
Ali H. Sayed; École Polytechnique Fédérale de Lausanne |
MLSP-P2.11: FLEXIBLE NON-NEGATIVE MATRIX FACTORIZATION WITH ADAPTIVELY LEARNED GRAPH REGULARIZATION |
Manuscript Link: Click here to view manuscript on IEEE Xplore |
Yong Peng; Hangzhou Dianzi University |
Yanfang Long; Hangzhou Dianzi University |
Feiwei Qin; Hangzhou Dianzi University |
Wanzeng Kong; Hangzhou Dianzi University |
Feiping Nie; Northwestern Polytechnical University |
Andrzej Cichocki; Skolkovo Institute of Science and Technology |