Search Results for author: Yunpeng Qing

Found 5 papers, 3 papers with code

Advantage-Aware Policy Optimization for Offline Reinforcement Learning

no code implementations • 12 Mar 2024 • Yunpeng Qing, Shunyu Liu, Jingyuan Cong, KaiXuan Chen, Yihe Zhou, Mingli Song

Offline Reinforcement Learning (RL) endeavors to leverage offline datasets to craft effective agent policy without online interaction, which imposes proper conservative constraints with the support of behavior policies to tackle the Out-Of-Distribution (OOD) problem.

D4RL reinforcement-learning +1

Paper
Add Code

Powerformer: A Section-adaptive Transformer for Power Flow Adjustment

no code implementations • 5 Jan 2024 • KaiXuan Chen, Wei Luo, Shunyu Liu, Yaoquan Wei, Yihe Zhou, Yunpeng Qing, Quan Zhang, Jie Song, Mingli Song

In this paper, we present a novel transformer architecture tailored for learning robust power system state representations, which strives to optimize power dispatch for the power flow adjustment across different transmission sections.

Paper
Add Code

Curricular Subgoals for Inverse Reinforcement Learning

1 code implementation • 14 Jun 2023 • Shunyu Liu, Yunpeng Qing, Shuqi Xu, Hongyan Wu, Jiangtao Zhang, Jingyuan Cong, Tianhao Chen, YunFu Liu, Mingli Song

Inverse Reinforcement Learning (IRL) aims to reconstruct the reward function from expert demonstrations to facilitate policy learning, and has demonstrated its remarkable success in imitation learning.

Autonomous Driving D4RL +2

Paper
Code

Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?

1 code implementation • 27 May 2023 • Yihe Zhou, Shunyu Liu, Yunpeng Qing, KaiXuan Chen, Tongya Zheng, Yanhao Huang, Jie Song, Mingli Song

Despite the encouraging results achieved, CTDE makes an independence assumption on agent policies, which limits agents to adopt global cooperative information from each other during centralized training.

Multi-agent Reinforcement Learning reinforcement-learning +2

Paper
Code

A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges

1 code implementation • 12 Nov 2022 • Yunpeng Qing, Shunyu Liu, Jie Song, Huiqiong Wang, Mingli Song

In this survey, we provide a comprehensive review of existing works on eXplainable RL (XRL) and introduce a new taxonomy where prior works are clearly categorized into model-explaining, reward-explaining, state-explaining, and task-explaining methods.

reinforcement-learning Reinforcement Learning (RL)

174

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.