no code implementations • 12 Mar 2024 • Yunpeng Qing, Shunyu Liu, Jingyuan Cong, KaiXuan Chen, Yihe Zhou, Mingli Song
Offline Reinforcement Learning (RL) endeavors to leverage offline datasets to craft effective agent policy without online interaction, which imposes proper conservative constraints with the support of behavior policies to tackle the Out-Of-Distribution (OOD) problem.
no code implementations • 5 Jan 2024 • KaiXuan Chen, Wei Luo, Shunyu Liu, Yaoquan Wei, Yihe Zhou, Yunpeng Qing, Quan Zhang, Jie Song, Mingli Song
In this paper, we present a novel transformer architecture tailored for learning robust power system state representations, which strives to optimize power dispatch for the power flow adjustment across different transmission sections.
1 code implementation • 14 Jun 2023 • Shunyu Liu, Yunpeng Qing, Shuqi Xu, Hongyan Wu, Jiangtao Zhang, Jingyuan Cong, Tianhao Chen, YunFu Liu, Mingli Song
Inverse Reinforcement Learning (IRL) aims to reconstruct the reward function from expert demonstrations to facilitate policy learning, and has demonstrated its remarkable success in imitation learning.
1 code implementation • 27 May 2023 • Yihe Zhou, Shunyu Liu, Yunpeng Qing, KaiXuan Chen, Tongya Zheng, Yanhao Huang, Jie Song, Mingli Song
Despite the encouraging results achieved, CTDE makes an independence assumption on agent policies, which limits agents to adopt global cooperative information from each other during centralized training.
Multi-agent Reinforcement Learning reinforcement-learning +2
1 code implementation • 12 Nov 2022 • Yunpeng Qing, Shunyu Liu, Jie Song, Huiqiong Wang, Mingli Song
In this survey, we provide a comprehensive review of existing works on eXplainable RL (XRL) and introduce a new taxonomy where prior works are clearly categorized into model-explaining, reward-explaining, state-explaining, and task-explaining methods.