Offline RL

234 papers with code • 2 benchmarks • 7 datasets

DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning

felix-thu/DiffCPS 9 Oct 2023

Constrained policy search (CPS) is a fundamental problem in offline reinforcement learning, which is generally solved by advantage weighted regression (AWR).

6
09 Oct 2023

Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL

zzmtsvv/ORL NeurIPS 2023

We first identify a fundamental pattern, self-excitation, as the primary cause of Q-value estimation divergence in offline RL.

38
06 Oct 2023

Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets

Improbable-AI/dw-offline-rl NeurIPS 2023

We argue this is due to an assumption made by current offline RL algorithms of staying close to the trajectories in the dataset.

16
06 Oct 2023

Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning

quantumiracle/consistency_model_for_reinforcement_learning 29 Sep 2023

We propose to apply the consistency model as an efficient yet expressive policy representation, namely consistency policy, with an actor-critic style algorithm for three typical RL settings: offline, offline-to-online and online.

19
29 Sep 2023

Zero-Shot Reinforcement Learning from Low Quality Data

enjeeneer/conservative-world-models 26 Sep 2023

Zero-shot reinforcement learning (RL) promises to provide agents that can perform any task in an environment after an offline, reward-free pre-training phase.

5
26 Sep 2023

Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning

thu-rllab/CFCQL NeurIPS 2023

Offline multi-agent reinforcement learning is challenging because it couples the distribution shift common in the offline setting with the high dimensionality common in the multi-agent setting, which makes the out-of-distribution (OOD) action and value overestimation problems especially severe.

17
22 Sep 2023

VAPOR: Legged Robot Navigation in Outdoor Vegetation Using Offline Reinforcement Learning

kasunweerkoon/VAPOR 14 Sep 2023

We present VAPOR, a novel method for autonomous legged robot navigation in unstructured, densely vegetated outdoor environments using offline Reinforcement Learning (RL).

2
14 Sep 2023

Reasoning with Latent Diffusion in Offline Reinforcement Learning

ldcq/ldcq 12 Sep 2023

A key challenge in offline RL lies in effectively stitching portions of suboptimal trajectories from the static dataset while avoiding extrapolation errors that arise from a lack of support in the dataset.

22
12 Sep 2023

ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement Learning

link-zju/orl-auditor 6 Sep 2023

In safety-critical domains such as autonomous vehicles, offline deep reinforcement learning (offline DRL) is frequently used to train models on pre-collected datasets, as opposed to training them by interacting with the real-world environment, as in online DRL.

5
06 Sep 2023

Model-based Offline Policy Optimization with Adversarial Network

junming-yang/moan 5 Sep 2023

Model-based offline reinforcement learning (RL), which builds a supervised transition model from a logged dataset to avoid costly interactions with the online environment, has been a promising approach for offline policy optimization.

3
05 Sep 2023