Offline RL

227 papers with code • 2 benchmarks • 6 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Offline RL

Trend	Dataset	Best Model	Paper	Code	Compare
	D4RL	KFC			See all
	Walker2d	ParPI			See all

Libraries

Use these libraries to find Offline RL models and implementations

zzmtsvv/rl_task

14 papers

yihaosun1124/OfflineRL-Kit

8 papers

232

corl-team/CORL

7 papers

395

opendilab/DI-engine

4 papers

2,582

See all 10 libraries.

Datasets

Subtasks

DQN Replay Dataset

Latest papers with no code

Most implemented Social Latest No code

The Value of Reward Lookahead in Reinforcement Learning

no code yet • 18 Mar 2024

In particular, we measure the ratio between the value of standard RL agents and that of agents with partial future-reward lookahead.

Paper
Add Code

Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning

no code yet • 14 Mar 2024

Distributionally robust offline reinforcement learning (RL), which seeks robust policy training against environment perturbation by modeling dynamics uncertainty, calls for function approximations when facing large state-action spaces.

Paper
Add Code

Towards Optimizing Human-Centric Objectives in AI-Assisted Decision-Making With Offline Reinforcement Learning

no code yet • 9 Mar 2024

Across two experiments (N=316 and N=964), our results demonstrated that people interacting with policies optimized for accuracy achieve significantly better accuracy -- and even human-AI complementarity -- compared to those interacting with any other type of AI support.

Paper
Add Code

Why Online Reinforcement Learning is Causal

no code yet • 7 Mar 2024

Our main argument is that in online learning, conditional probabilities are causal, and therefore offline RL is the setting where causal learning has the most potential to make a difference.

Paper
Add Code

Offline Fictitious Self-Play for Competitive Games

no code yet • 29 Feb 2024

Firstly, unaware of the game structure, it is impossible to interact with the opponents and conduct a major learning paradigm, self-play, for competitive games.

Paper
Add Code

Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding

no code yet • 23 Feb 2024

The trained policy can subsequently be deployed for further data collection, resulting in an iterative training framework, which we refer to as iterative offline RL.

Paper
Add Code

Align Your Intents: Offline Imitation Learning via Optimal Transport

no code yet • 20 Feb 2024

We report that AILOT outperforms state-of-the art offline imitation learning algorithms on D4RL benchmarks and improves the performance of other offline RL algorithms in the sparse-reward tasks.

Paper
Add Code

Offline Multi-task Transfer RL with Representational Penalization

no code yet • 19 Feb 2024

We study the problem of representation transfer in offline Reinforcement Learning (RL), where a learner has access to episodic data from a number of source tasks collected a priori, and aims to learn a shared representation to be used in finding a good policy for a target task.

Paper
Add Code

Goal-Conditioned Offline Reinforcement Learning via Metric Learning

no code yet • 16 Feb 2024

Experimentally, we show how our method consistently outperforms other offline RL baselines in learning from sub-optimal offline datasets.

Paper
Add Code

Reward Poisoning Attack Against Offline Reinforcement Learning

no code yet • 15 Feb 2024

To the best of our knowledge, we propose the first black-box reward poisoning attack in the general offline RL setting.

Paper
Add Code

Offline RL

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result