Offline RL

227 papers with code • 2 benchmarks • 6 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Offline RL models and implementations
14 papers
38
7 papers
400
5 papers
1,216
See all 10 libraries.

Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning

ryanshea10/personachat_offline_rl 16 Oct 2023

Our automatic and human evaluations show that our framework improves both the persona consistency and dialogue quality of a state-of-the-art social chatbot.

3
16 Oct 2023

Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias

MaxSobolMark/OOO 12 Oct 2023

Can we leverage offline RL to recover better policies from online interaction?

15
12 Oct 2023

DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning

felix-thu/DiffCPS 9 Oct 2023

Constrained policy search (CPS) is a fundamental problem in offline reinforcement learning, which is generally solved by advantage weighted regression (AWR).

6
09 Oct 2023

Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL

zzmtsvv/ORL NeurIPS 2023

We first identify a fundamental pattern, self-excitation, as the primary cause of Q-value estimation divergence in offline RL.

38
06 Oct 2023

Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets

Improbable-AI/dw-offline-rl NeurIPS 2023

We argue this is due to an assumption made by current offline RL algorithms of staying close to the trajectories in the dataset.

14
06 Oct 2023

Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning

quantumiracle/consistency_model_for_reinforcement_learning 29 Sep 2023

We propose to apply the consistency model as an efficient yet expressive policy representation, namely consistency policy, with an actor-critic style algorithm for three typical RL settings: offline, offline-to-online and online.

19
29 Sep 2023

Zero-Shot Reinforcement Learning from Low Quality Data

enjeeneer/conservative-world-models 26 Sep 2023

Zero-shot reinforcement learning (RL) promises to provide agents that can perform any task in an environment after an offline, reward-free pre-training phase.

5
26 Sep 2023

Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning

thu-rllab/CFCQL NeurIPS 2023

Offline multi-agent reinforcement learning is challenging due to the coupling effect of both distribution shift issue common in offline setting and the high dimension issue common in multi-agent setting, making the action out-of-distribution (OOD) and value overestimation phenomenon excessively severe.

14
22 Sep 2023

VAPOR: Legged Robot Navigation in Outdoor Vegetation Using Offline Reinforcement Learning

kasunweerkoon/VAPOR 14 Sep 2023

We present VAPOR, a novel method for autonomous legged robot navigation in unstructured, densely vegetated outdoor environments using offline Reinforcement Learning (RL).

2
14 Sep 2023

Reasoning with Latent Diffusion in Offline Reinforcement Learning

ldcq/ldcq 12 Sep 2023

However, a key challenge in offline RL lies in effectively stitching portions of suboptimal trajectories from the static dataset while avoiding extrapolation errors arising due to a lack of support in the dataset.

21
12 Sep 2023