Offline RL

225 papers with code • 2 benchmarks • 6 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Offline RL models and implementations
14 papers
35
7 papers
385
4 papers
2,515
See all 10 libraries.

DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning

sirabas369/diffclone 17 Jan 2024

The Train-Offline-Test-Online (TOTO) Benchmark provides a well-curated open-source dataset for offline training comprised mostly of expert data and also benchmark scores of the common offline-RL and behaviour cloning agents.

3
17 Jan 2024

Learning from Sparse Offline Datasets via Conservative Density Estimation

czp16/cde-offline-rl 16 Jan 2024

Offline reinforcement learning (RL) offers a promising direction for learning policies from pre-collected datasets without requiring further interactions with the environment.

0
16 Jan 2024

SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning

dohyeoklee/SPQR NeurIPS 2023

Alleviating overestimation bias is a critical challenge for deep reinforcement learning to achieve successful performance on more complex tasks or offline datasets containing out-of-distribution data.

2
06 Jan 2024

Policy-regularized Offline Multi-objective Reinforcement Learning

qianlin04/prmorl 4 Jan 2024

In this paper, we aim to utilize only offline trajectory data to train a policy for multi-objective RL.

3
04 Jan 2024

Online Symbolic Music Alignment with Offline Reinforcement Learning

sildater/parangonar 31 Dec 2023

First, in its capacity to identify correct score positions for sampled test contexts; second, as the core technique of a complete algorithm for symbolic online note-wise alignment; and finally, as a real-time symbolic score follower.

16
31 Dec 2023

PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning

maohangyu/TIT_open_source 26 Dec 2023

Designing better deep networks and better reinforcement learning (RL) algorithms are both important for deep RL.

46
26 Dec 2023

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning

opendilab/DI-engine 12 Dec 2023

In this paper, from a novel perspective, we systematically study the challenges that remain in O2O RL and identify that the reason behind the slow improvement of the performance and the instability of online finetuning lies in the inaccurate Q-value estimation inherited from offline pretraining.

2,515
12 Dec 2023

Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach

xingshuaihuang/dtlight 12 Dec 2023

In this work, we propose DTLight, a simple yet powerful lightweight Decision Transformer-based TSC method that can learn policy from easily accessible offline datasets.

9
12 Dec 2023

The Generalization Gap in Offline Reinforcement Learning

facebookresearch/gen_dgrl 10 Dec 2023

Our experiments reveal that existing offline learning algorithms struggle to match the performance of online RL on both train and test environments.

20
10 Dec 2023

MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator

xiaoyinliu0714/micro 7 Dec 2023

This method trades off performance and robustness via introducing the robust Bellman operator into the algorithm.

0
07 Dec 2023