Search Results for author: Yunpeng Qing

Found 5 papers, 3 papers with code

Advantage-Aware Policy Optimization for Offline Reinforcement Learning

no code implementations12 Mar 2024 Yunpeng Qing, Shunyu Liu, Jingyuan Cong, KaiXuan Chen, Yihe Zhou, Mingli Song

Offline Reinforcement Learning (RL) endeavors to leverage offline datasets to craft effective agent policy without online interaction, which imposes proper conservative constraints with the support of behavior policies to tackle the Out-Of-Distribution (OOD) problem.

D4RL reinforcement-learning +1

Powerformer: A Section-adaptive Transformer for Power Flow Adjustment

no code implementations5 Jan 2024 KaiXuan Chen, Wei Luo, Shunyu Liu, Yaoquan Wei, Yihe Zhou, Yunpeng Qing, Quan Zhang, Jie Song, Mingli Song

In this paper, we present a novel transformer architecture tailored for learning robust power system state representations, which strives to optimize power dispatch for the power flow adjustment across different transmission sections.

Curricular Subgoals for Inverse Reinforcement Learning

1 code implementation14 Jun 2023 Shunyu Liu, Yunpeng Qing, Shuqi Xu, Hongyan Wu, Jiangtao Zhang, Jingyuan Cong, Tianhao Chen, YunFu Liu, Mingli Song

Inverse Reinforcement Learning (IRL) aims to reconstruct the reward function from expert demonstrations to facilitate policy learning, and has demonstrated its remarkable success in imitation learning.

Autonomous Driving D4RL +2

Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?

1 code implementation27 May 2023 Yihe Zhou, Shunyu Liu, Yunpeng Qing, KaiXuan Chen, Tongya Zheng, Yanhao Huang, Jie Song, Mingli Song

Despite the encouraging results achieved, CTDE makes an independence assumption on agent policies, which limits agents to adopt global cooperative information from each other during centralized training.

Multi-agent Reinforcement Learning reinforcement-learning +2

A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges

1 code implementation12 Nov 2022 Yunpeng Qing, Shunyu Liu, Jie Song, Huiqiong Wang, Mingli Song

In this survey, we provide a comprehensive review of existing works on eXplainable RL (XRL) and introduce a new taxonomy where prior works are clearly categorized into model-explaining, reward-explaining, state-explaining, and task-explaining methods.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.