Offline RL

226 papers with code • 2 benchmarks • 6 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Offline RL models and implementations
14 papers
38
7 papers
395
4 papers
2,574
See all 10 libraries.

MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator

xiaoyinliu0714/micro 7 Dec 2023

This method trades off performance and robustness via introducing the robust Bellman operator into the algorithm.

0
07 Dec 2023

SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation

hakuhodo-technologies/scope-rl 30 Nov 2023

This paper introduces SCOPE-RL, a comprehensive open-source Python software designed for offline reinforcement learning (offline RL), off-policy evaluation (OPE), and selection (OPS).

97
30 Nov 2023

Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees

yifeizhou02/hnpg 14 Nov 2023

In this work, we propose a new hybrid RL algorithm that combines an on-policy actor-critic method with offline data.

1
14 Nov 2023

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning

srzer/LaMo-2023 31 Oct 2023

Offline reinforcement learning (RL) aims to find a near-optimal policy using pre-collected datasets.

28
31 Oct 2023

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning

zhaoyizhou1123/mbrcsl 30 Oct 2023

Off-policy dynamic programming (DP) techniques such as $Q$-learning have proven to be important in sequential decision-making problems.

8
30 Oct 2023

Robust Offline Reinforcement learning with Heavy-Tailed Rewards

mamba413/room 28 Oct 2023

This paper endeavors to augment the robustness of offline reinforcement learning (RL) in scenarios laden with heavy-tailed rewards, a prevalent circumstance in real-world applications.

4
28 Oct 2023

Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage

zaiyan-x/drqi 27 Oct 2023

The goal of an offline reinforcement learning (RL) algorithm is to learn optimal polices using historical (offline) data, without access to the environment for online exploration.

1
27 Oct 2023

CROP: Conservative Reward for Model-based Offline Policy Optimization

g0k0ururi/crop 26 Oct 2023

Offline reinforcement learning (RL) aims to optimize policy using collected data without online interactions.

5
26 Oct 2023

Corruption-Robust Offline Reinforcement Learning with General Function Approximation

yangrui2015/uwmsg NeurIPS 2023

Notably, under the assumption of single policy coverage and the knowledge of $\zeta$, our proposed algorithm achieves a suboptimality bound that is worsened by an additive factor of $\mathcal{O}(\zeta (C(\widehat{\mathcal{F}},\mu)n)^{-1})$ due to the corruption.

2
23 Oct 2023

Towards Robust Offline Reinforcement Learning under Diverse Data Corruption

zzmtsvv/ORL 19 Oct 2023

Offline reinforcement learning (RL) presents a promising approach for learning reinforced policies from offline datasets without the need for costly or unsafe interactions with the environment.

38
19 Oct 2023