Montezuma's Revenge

28 papers with code • 1 benchmarks • 1 datasets

Montezuma's Revenge is an ATARI 2600 Benchmark game that is known to be difficult to perform on for reinforcement learning algorithms. Solutions typically employ algorithms that incentivise environment exploration in different ways.

For the state-of-the art tables, please consult the parent Atari Games task.

( Image credit: Q-map )

Benchmarks

Add a Result

These leaderboards are used to track progress in Montezuma's Revenge

Trend	Dataset	Best Model	Paper	Code	Compare
	Atari 2600 Montezuma's Revenge	Rainbow (tuned)			See all

Datasets

Arcade Learning Environment

Latest papers

Most implemented Social Latest No code

A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs

facebookresearch/minihack • • 5 Jun 2023

This results in an algorithm which sets a new state of the art across 16 tasks from the MiniHack suite used in prior work, and also performs robustly on Habitat and Montezuma's Revenge.

451

05 Jun 2023

Paper
Code

Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning

samlobel/cfn • • 5 Jun 2023

We propose a new method for count-based exploration in high-dimensional state spaces.

05 Jun 2023

Paper
Code

Redeeming Intrinsic Rewards via Constrained Optimization

improbable-ai/eipo • • 14 Nov 2022

However, on easy exploration tasks, the agent gets distracted by intrinsic rewards and performs unnecessary exploration even when sufficient task (also called extrinsic) reward is available.

14 Nov 2022

Paper
Code

Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient

yudasong/hyq • • 13 Oct 2022

We consider a hybrid reinforcement learning setting (Hybrid RL), in which an agent has access to an offline dataset and the ability to collect experience via real-world online interaction.

13 Oct 2022

Paper
Code

Cell-Free Latent Go-Explore

qgallouedec/lge • • 31 Aug 2022

In this paper, we introduce Latent Go-Explore (LGE), a simple and general approach based on the Go-Explore paradigm for exploration in reinforcement learning (RL).

31 Aug 2022

Paper
Code

Open-Ended Reinforcement Learning with Neural Reward Functions

amujika/open-ended-reinforcement-learning-with-neural-reward-functions • • 16 Feb 2022

Inspired by the great success of unsupervised learning in Computer Vision and Natural Language Processing, the Reinforcement Learning community has recently started to focus more on unsupervised discovery of skills.

16 Feb 2022

Paper
Code

NovelD: A Simple yet Effective Exploration Criterion

tianjunz/NovelD • • NeurIPS 2021

We analyze NovelD thoroughly in MiniGrid and found that empirically it helps the agent explore the environment more uniformly with a focus on exploring beyond the boundary.

01 Dec 2021

Paper
Code

Reinforcement Learning with Latent Flow

WendyShang/flare • • NeurIPS 2021

Temporal information is essential to learning effective policies with Reinforcement Learning (RL).

06 Jan 2021

Paper
Code

Learning Abstract Models for Strategic Exploration and Fast Reward Transfer

google-research/google-research • • 12 Jul 2020

Model-based reinforcement learning (RL) is appealing because (i) it enables planning and thus more strategic exploration, and (ii) by decoupling dynamics from rewards, it enables fast transfer to new reward functions.

32,808

12 Jul 2020

Paper
Code

First return, then explore

uber-research/go-explore • • 27 Apr 2020

The promise of reinforcement learning is to solve complex sequential decision problems autonomously by specifying a high-level reward function only.

547

27 Apr 2020

Paper
Code

Montezuma's Revenge

Benchmarks Add a Result

Datasets

Latest papers

Content

Benchmarks

Add a Result