MLSP-P6.4

CAUSAL-BOOTSTRAPPED MULTI-AGENT REINFORCEMENT LEARNING FOR MITIGATING THE COLD-START PROBLEM

Ruobing Zhang, Yiping Song, National University of Defense Technology, China

Session:
MLSP-P6: Reinforcement Learning Algorithms and Applications I Poster

Track:
Machine Learning for Signal Processing [ML]

Location:
Poster Area 9

Presentation Time:
Tue, 5 May, 14:00 - 16:00

Presentation
Discussion
Resources
No resources available.
Session MLSP-P6
MLSP-P6.1: Cognitive Attention and Dual Residual Networks for Offline Regularized Multi-Agent Reinforcement Learning
Hongzhe Liu, Quan Liu, Yukang Cao, Soochow University, China
MLSP-P6.2: RAME: ROLE-AWARE MULTI-VIEW EMBEDDING FOR TRANSFERABLE MULTI-AGENT REINFORCEMENT LEARNING
Yang Zhou, Wenyu Chen, University of Electronic Science and Technology of China, China; Siying Wang, Southwest Minzu University, China; Ruoning Zhang, University of Electronic Science and Technology of China, China; Zhitong Zhao, Chengdu University of Technology, China; Jiawei Cui, Chao Zhai, University of Electronic Science and Technology of China, China
MLSP-P6.3: PROSE: Probabilistic Reinforcement Learning Optimized by Success Estimation for Stage-Aware Cotton Irrigation Scheduling
Zhongyan Yi, Zhihua Fang, Ruifeng Xu, Sai Yuan, Xinjiang University, China; Liang He, Tsinghua University, China
MLSP-P6.4: CAUSAL-BOOTSTRAPPED MULTI-AGENT REINFORCEMENT LEARNING FOR MITIGATING THE COLD-START PROBLEM
Ruobing Zhang, Yiping Song, National University of Defense Technology, China
MLSP-P6.5: NO VERIFIABLE REWARD FOR PROSODY: TOWARD PREFERENCE-GUIDED PROSODY LEARNING IN TTS
Seungyoun Shin, Channel Corporation, Korea, Republic of; Dongha Ahn, Kernelspace Co. Ltd., Korea, Republic of; Jiwoo Kim, Sungkyunkwan University, Korea, Republic of; Sungwook Jeon, NAVER Cloud, Korea, Republic of
MLSP-P6.6: HACG: Contribution-Based Dynamic Grouping with Hierarchical Graph Attention for Multi-Agent Cooperation
Tingting Wei, Zhangling Wang, Chushu Yi, Shaofei Chen, Lina Lu, Xueqiang Gu, National University of Defense Technology, China
MLSP-P6.7: DYNAMIC SEQUENCING AND GNN-BASED POSTED-PRICE DESIGN FOR COMBINATORIAL AUCTIONS
Yue Dong, Guozheng Rao, Tianjin University, China; Shuyuan You, Griffith University, Australia
MLSP-P6.8: EFFICIENT OFFLINE REINFORCEMENT LEARNING WITH PROGRESSIVE HEURISTIC BLENDING IN COMPLEX ENVIRONMENTS
Yi Yang, Mingfeng Lv, Hanlei Li, Xiamen University, China; Ruize Yan, Zhengsen Ruan, Zhejiang Gongshang University, China; Lvqing Yang, Xiamen University, China
MLSP-P6.9: KGER: Knowledge Graph Error Detection and Refinement with Reinforcement Learning
Aishan Maoliniyazi, Renmin University of China, China; Chaohong Ma, HEBEI Normal University, China; Xiaofeng Meng, Renmin University of China, China
Contacts