Search Results for author: Yilang Guo

Found 1 papers, 1 papers with code

Behavior Proximal Policy Optimization

2 code implementations • 22 Feb 2023 • Zifeng Zhuang, Kun Lei, Jinxin Liu, Donglin Wang, Yilang Guo

Offline reinforcement learning (RL) is a challenging setting where existing off-policy actor-critic methods perform poorly due to the overestimation of out-of-distribution state-action pairs.

D4RL Offline RL +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.