Offline RL
227 papers with code • 2 benchmarks • 6 datasets
Libraries
Use these libraries to find Offline RL models and implementations.
Latest papers
Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning
Our automatic and human evaluations show that our framework improves both the persona consistency and dialogue quality of a state-of-the-art social chatbot.
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Can we leverage offline RL to recover better policies from online interaction?
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning
Constrained policy search (CPS) is a fundamental problem in offline reinforcement learning that is generally solved by advantage-weighted regression (AWR).
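To make the AWR baseline mentioned above concrete, the sketch below fits a linear policy to offline data by weighting a behavior-cloning regression with exponentiated advantages. This is a minimal toy illustration, not the paper's method: the data, the temperature `beta`, and the use of a closed-form linear policy are all assumptions made for the example.

```python
import numpy as np

# Hypothetical toy dataset: offline transitions with precomputed advantages.
rng = np.random.default_rng(0)
states = rng.normal(size=(256, 4))  # state features
true_w = np.array([[1.0], [-0.5], [0.2], [0.0]])
actions = states @ true_w + 0.1 * rng.normal(size=(256, 1))
advantages = rng.normal(size=256)   # stand-in for A(s, a) from a learned critic

beta = 1.0  # temperature; larger beta moves AWR toward plain behavior cloning
weights = np.exp(np.clip(advantages / beta, -5.0, 5.0))  # clip for stability

# Advantage-weighted regression: argmin_W sum_i w_i * ||a_i - s_i W||^2,
# which has a closed form (weighted least squares) for a linear policy.
W = np.linalg.solve(
    states.T @ (weights[:, None] * states),
    states.T @ (weights[:, None] * actions),
)
pred = states @ W  # policy mean on the dataset states
```

The exponential weighting is what distinguishes AWR from behavior cloning: transitions with higher advantage contribute more to the regression, so the learned policy is pulled toward the better actions in the dataset while still staying within its support.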
Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL
We first identify a fundamental pattern, self-excitation, as the primary cause of Q-value estimation divergence in offline RL.
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
We argue this is due to the assumption, made by current offline RL algorithms, that the policy should stay close to the trajectories in the dataset.
Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning
We propose to apply the consistency model as an efficient yet expressive policy representation, namely consistency policy, with an actor-critic style algorithm for three typical RL settings: offline, offline-to-online and online.
Zero-Shot Reinforcement Learning from Low Quality Data
Zero-shot reinforcement learning (RL) promises to provide agents that can perform any task in an environment after an offline, reward-free pre-training phase.
Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning
Offline multi-agent reinforcement learning is challenging due to the coupling of the distribution-shift issue common in the offline setting with the high-dimensionality issue common in the multi-agent setting, which makes out-of-distribution (OOD) actions and value overestimation especially severe.
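The standard conservative remedy for the OOD-action overestimation described above is a CQL-style penalty: push Q-values down on all actions (via a log-sum-exp) while pushing them up on the actions actually present in the dataset. The sketch below computes that penalty for a discrete-action toy example; the random Q-table and action indices are assumptions for illustration, and this is the single-agent CQL regularizer, not the counterfactual multi-agent variant proposed in the paper.

```python
import numpy as np

rng = np.random.default_rng(1)
n, num_actions = 128, 5
q = rng.normal(size=(n, num_actions))             # Q(s, a) for every discrete action
data_actions = rng.integers(num_actions, size=n)  # actions actually taken in the dataset

# Numerically stable log-sum-exp over the action dimension.
q_max = q.max(axis=1, keepdims=True)
logsumexp = np.log(np.exp(q - q_max).sum(axis=1)) + q_max[:, 0]

# Conservative penalty: (soft-)maximum over all actions minus Q on dataset actions.
# Adding this to the critic loss drives down values of OOD actions relative to
# in-dataset ones, counteracting overestimation.
penalty = (logsumexp - q[np.arange(n), data_actions]).mean()
```

Because the log-sum-exp upper-bounds the maximum Q-value in each state, the penalty is always non-negative, and minimizing it shrinks exactly the gap that OOD overestimation opens up.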
VAPOR: Legged Robot Navigation in Outdoor Vegetation Using Offline Reinforcement Learning
We present VAPOR, a novel method for autonomous legged robot navigation in unstructured, densely vegetated outdoor environments using offline Reinforcement Learning (RL).
Reasoning with Latent Diffusion in Offline Reinforcement Learning
However, a key challenge in offline RL lies in effectively stitching together portions of suboptimal trajectories from the static dataset while avoiding the extrapolation errors that arise from a lack of support in the dataset.