Offline RL

234 papers with code • 2 benchmarks • 7 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Offline RL

Trend	Dataset	Best Model	Paper	Code	Compare
	D4RL	KFC			See all
	Walker2d	ParPI			See all

Libraries

Use these libraries to find Offline RL models and implementations

zzmtsvv/rl_task

14 papers

yihaosun1124/OfflineRL-Kit

8 papers

241

corl-team/CORL

7 papers

405

takuseno/d3rlpy

5 papers

1,231

See all 10 libraries.

Datasets

Subtasks

DQN Replay Dataset

Latest papers

Most implemented Social Latest No code

DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning

felix-thu/DiffCPS • • 9 Oct 2023

Constrained policy search (CPS) is a fundamental problem in offline reinforcement learning, which is generally solved by advantage weighted regression (AWR).

09 Oct 2023

Paper
Code

Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL

zzmtsvv/ORL • • NeurIPS 2023

We first identify a fundamental pattern, self-excitation, as the primary cause of Q-value estimation divergence in offline RL.

06 Oct 2023

Paper
Code

Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets

Improbable-AI/dw-offline-rl • • NeurIPS 2023

We argue this is due to an assumption made by current offline RL algorithms of staying close to the trajectories in the dataset.

06 Oct 2023

Paper
Code

Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning

quantumiracle/consistency_model_for_reinforcement_learning • • 29 Sep 2023

We propose to apply the consistency model as an efficient yet expressive policy representation, namely consistency policy, with an actor-critic style algorithm for three typical RL settings: offline, offline-to-online and online.

29 Sep 2023

Paper
Code

Zero-Shot Reinforcement Learning from Low Quality Data

enjeeneer/conservative-world-models • • 26 Sep 2023

Zero-shot reinforcement learning (RL) promises to provide agents that can perform any task in an environment after an offline, reward-free pre-training phase.

26 Sep 2023

Paper
Code

Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning

thu-rllab/CFCQL • • NeurIPS 2023

Offline multi-agent reinforcement learning is challenging due to the coupling effect of both distribution shift issue common in offline setting and the high dimension issue common in multi-agent setting, making the action out-of-distribution (OOD) and value overestimation phenomenon excessively severe.

22 Sep 2023

Paper
Code

VAPOR: Legged Robot Navigation in Outdoor Vegetation Using Offline Reinforcement Learning

kasunweerkoon/VAPOR • • 14 Sep 2023

We present VAPOR, a novel method for autonomous legged robot navigation in unstructured, densely vegetated outdoor environments using offline Reinforcement Learning (RL).

14 Sep 2023

Paper
Code

Reasoning with Latent Diffusion in Offline Reinforcement Learning

ldcq/ldcq • • 12 Sep 2023

However, a key challenge in offline RL lies in effectively stitching portions of suboptimal trajectories from the static dataset while avoiding extrapolation errors arising due to a lack of support in the dataset.

12 Sep 2023

Paper
Code

ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement Learning

link-zju/orl-auditor • • 6 Sep 2023

In safety-critical domains such as autonomous vehicles, offline deep reinforcement learning (offline DRL) is frequently used to train models on pre-collected datasets, as opposed to training these models by interacting with the real-world environment as the online DRL.

06 Sep 2023

Paper
Code

Model-based Offline Policy Optimization with Adversarial Network

junming-yang/moan • • 5 Sep 2023

Model-based offline reinforcement learning (RL), which builds a supervised transition model with logging dataset to avoid costly interactions with the online environment, has been a promising approach for offline policy optimization.

05 Sep 2023

Paper
Code

Offline RL

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result