Unsupervised Reinforcement Learning

24 papers with code • 8 benchmarks • 2 datasets

This task has no description! Would you like to contribute one?

Benchmarks

Add a Result

These leaderboards are used to track progress in Unsupervised Reinforcement Learning

Dataset	Best Model	Compare
URLB (pixels, 10^5 frames)	ProtoRL	See all
URLB (pixels, 2*10^6 frames)	Mastering URLB	See all
URLB (states, 10^5 frames)	Disagreement	See all
URLB (states, 5*10^5 frames)	RND	See all
URLB (states, 10^6 frames)	RND	See all
URLB (states, 2*10^6 frames)	RND	See all
URLB (pixels, 5*10^5 frames)	Disagreement	See all
URLB (pixels, 10^6 frames)	Disagreement	See all

Libraries

Use these libraries to find Unsupervised Reinforcement Learning models and implementations

rle-foundation/rlexplore

2 papers

318

Datasets

Most implemented papers

Most implemented Social Latest No code

Exploration by Random Network Distillation

openai/random-network-distillation • • ICLR 2019

In particular we establish state of the art performance on Montezuma's Revenge, a game famously difficult for deep reinforcement learning methods.

Paper
Code

Curiosity-driven Exploration by Self-supervised Prediction

pathak22/noreward-rl • • ICML 2017

In many real-world scenarios, rewards extrinsic to the agent are extremely sparse, or absent altogether.

Paper
Code

Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning

facebookresearch/drqv2 • • ICLR 2022

We present DrQ-v2, a model-free reinforcement learning (RL) algorithm for visual continuous control.

Paper
Code

Diversity is All You Need: Learning Skills without a Reward Function

navneet-nmk/Hierarchical-Meta-Reinforcement-Learning • • ICLR 2019

On a variety of simulated robotic tasks, we show that this simple objective results in the unsupervised emergence of diverse skills, such as walking and jumping.

Paper
Code

Self-Supervised Exploration via Disagreement

pathak22/exploration-by-disagreement • • 10 Jun 2019

In this paper, we propose a formulation for exploration inspired by the work in active learning literature.

Paper
Code

Emergent Real-World Robotic Skills via Unsupervised Off-Policy Reinforcement Learning

google-research/dads • • 27 Apr 2020

Can we instead develop efficient reinforcement learning methods that acquire diverse skills without any reward function, and then repurpose these skills for downstream tasks?

Paper
Code

Unsupervised Reinforcement Learning in Multiple Environments

muttimirco/alphamepol • • 16 Dec 2021

Along this line, we address the problem of unsupervised reinforcement learning in a class of multiple environments, in which the policy is pre-trained with interactions from the whole class, and then fine-tuned for several tasks in any environment of the class.

Paper
Code

Variational Intrinsic Control

jbinas/gym-mnist • • 22 Nov 2016

In this paper we introduce a new unsupervised reinforcement learning method for discovering the set of intrinsic options available to an agent.

Paper
Code

Efficient Exploration via State Marginal Matching

RLAgent/state-marginal-matching • • 12 Jun 2019

The SMM objective can be viewed as a two-player, zero-sum game between a state density model and a parametric policy, an idea that we use to build an algorithm for optimizing the SMM objective.

Paper
Code

SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments

chenziku/train-procgen • • ICLR 2021

Every living organism struggles against disruptive environmental forces to carve out and maintain an orderly niche.

Paper
Code

Unsupervised Reinforcement Learning

Benchmarks Add a Result

Libraries

Datasets

Most implemented papers

Content

Benchmarks

Add a Result