Offline RL

227 papers with code • 2 benchmarks • 6 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Offline RL models and implementations
14 papers
38
7 papers
395
4 papers
2,574
See all 10 libraries.

Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model

zhengyinan-air/fisor 19 Jan 2024

Interestingly, we discover that via reachability analysis of safe-control theory, the hard safety constraint can be equivalently translated to identifying the largest feasible region given the offline dataset.

44
19 Jan 2024

DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning

sirabas369/diffclone 17 Jan 2024

The Train-Offline-Test-Online (TOTO) Benchmark provides a well-curated open-source dataset for offline training comprised mostly of expert data and also benchmark scores of the common offline-RL and behaviour cloning agents.

3
17 Jan 2024

Learning from Sparse Offline Datasets via Conservative Density Estimation

czp16/cde-offline-rl 16 Jan 2024

Offline reinforcement learning (RL) offers a promising direction for learning policies from pre-collected datasets without requiring further interactions with the environment.

0
16 Jan 2024

SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning

dohyeoklee/SPQR NeurIPS 2023

Alleviating overestimation bias is a critical challenge for deep reinforcement learning to achieve successful performance on more complex tasks or offline datasets containing out-of-distribution data.

2
06 Jan 2024

Policy-regularized Offline Multi-objective Reinforcement Learning

qianlin04/prmorl 4 Jan 2024

In this paper, we aim to utilize only offline trajectory data to train a policy for multi-objective RL.

3
04 Jan 2024

Online Symbolic Music Alignment with Offline Reinforcement Learning

sildater/parangonar 31 Dec 2023

First, in its capacity to identify correct score positions for sampled test contexts; second, as the core technique of a complete algorithm for symbolic online note-wise alignment; and finally, as a real-time symbolic score follower.

16
31 Dec 2023

PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning

maohangyu/TIT_open_source 26 Dec 2023

Designing better deep networks and better reinforcement learning (RL) algorithms are both important for deep RL.

48
26 Dec 2023

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning

opendilab/DI-engine 12 Dec 2023

In this paper, from a novel perspective, we systematically study the challenges that remain in O2O RL and identify that the reason behind the slow improvement of the performance and the instability of online finetuning lies in the inaccurate Q-value estimation inherited from offline pretraining.

2,574
12 Dec 2023

Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach

xingshuaihuang/dtlight 12 Dec 2023

In this work, we propose DTLight, a simple yet powerful lightweight Decision Transformer-based TSC method that can learn policy from easily accessible offline datasets.

10
12 Dec 2023

The Generalization Gap in Offline Reinforcement Learning

facebookresearch/gen_dgrl 10 Dec 2023

Our experiments reveal that existing offline learning algorithms struggle to match the performance of online RL on both train and test environments.

22
10 Dec 2023