Offline RL

225 papers with code • 2 benchmarks • 6 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Offline RL

Trend	Dataset	Best Model	Paper	Code	Compare
	D4RL	KFC			See all
	Walker2d	ParPI			See all

Libraries

Use these libraries to find Offline RL models and implementations

zzmtsvv/rl_task

14 papers

yihaosun1124/OfflineRL-Kit

8 papers

227

corl-team/CORL

7 papers

387

opendilab/DI-engine

4 papers

2,523

See all 10 libraries.

Datasets

Subtasks

DQN Replay Dataset

Latest papers

Most implemented Social Latest No code

Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning

faceonlive/ai-research • 6 Apr 2024

Our COCOA seeks both in-distribution anchors and differences by utilizing the learned reverse dynamics model, encouraging conservatism in the compositional input space for the policy or value function.

140

06 Apr 2024

Paper
Code

Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings

kvfrans/fre • • 27 Feb 2024

Can we pre-train a generalist agent from a large amount of unlabeled offline trajectories such that it can be immediately adapted to any new downstream tasks in a zero-shot manner?

27 Feb 2024

Paper
Code

Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions

lucascjysdl/dgms-for-offline-policy-learning • 21 Feb 2024

This work offers a hands-on reference for the research progress in deep generative models for offline policy learning, and aims to inspire improved DGM-based offline RL or IL algorithms.

21 Feb 2024

Paper
Code

MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces

zheng0428/more_ • • 20 Feb 2024

Drawing upon the intuition that aligning different modalities to the same semantic embedding space would allow models to understand states and actions more easily, we propose a new perspective to the offline reinforcement learning (RL) challenge.

20 Feb 2024

Paper
Code

Stitching Sub-Trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL

rlatjddbs/ssd • • 11 Feb 2024

In this paper, we propose SSD (Sub-trajectory Stitching with Diffusion), a model-based offline GCRL method that leverages the conditional diffusion model to address these limitations.

11 Feb 2024

Paper
Code

SEABO: A Simple Search-Based Method for Offline Imitation Learning

dmksjfl/seabo • • 6 Feb 2024

Offline reinforcement learning (RL) has attracted much attention due to its ability in learning from static offline datasets and eliminating the need of interacting with the environment.

06 Feb 2024

Paper
Code

Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning

ruoqizzz/entropy-regularized-diffusion-policy-with-qensemble • • 6 Feb 2024

We show that such an SDE has a solution that we can use to calculate the log probability of the policy, yielding an entropy regularizer that improves the exploration of offline datasets.

06 Feb 2024

Paper
Code

ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update

maoliyuan/odice-pytorch • • 1 Feb 2024

To resolve this issue, we propose a simple yet effective modification that projects the backward gradient onto the normal plane of the forward gradient, resulting in an orthogonal-gradient update, a new learning rule for DICE-based methods.

01 Feb 2024

Paper
Code

Differentiable Tree Search in Latent State Space

openai/procgen • 22 Jan 2024

In this work, we introduce Differentiable Tree Search (DTS), a novel neural network architecture that significantly strengthens the inductive bias by embedding the algorithmic structure of a best-first online search algorithm.

972

22 Jan 2024

Paper
Code

Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model

zhengyinan-air/fisor • • 19 Jan 2024

Interestingly, we discover that via reachability analysis of safe-control theory, the hard safety constraint can be equivalently translated to identifying the largest feasible region given the offline dataset.

19 Jan 2024

Paper
Code

Offline RL

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result