Montezuma's Revenge
28 papers with code • 1 benchmark • 1 dataset
Montezuma's Revenge is an Atari 2600 benchmark game that is known to be difficult for reinforcement learning algorithms, owing to its sparse rewards and long horizon. Solutions typically employ algorithms that incentivise environment exploration in different ways.
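One classic way to incentivise exploration is a count-based bonus: the agent receives extra reward inversely proportional to how often a state has been visited. A minimal sketch (class and parameter names are illustrative, not from any specific paper on this page):

```python
from collections import defaultdict
import math

class CountBonus:
    """Exploration bonus of beta / sqrt(N(s)) on the N-th visit to state s."""

    def __init__(self, beta=1.0):
        self.beta = beta
        self.counts = defaultdict(int)  # visit counts per (hashable) state

    def bonus(self, state):
        """Record a visit and return the intrinsic reward for it."""
        self.counts[state] += 1
        return self.beta / math.sqrt(self.counts[state])
```

In training, the agent would optimise the sum of the environment reward and this bonus, so rarely visited states stay attractive while familiar ones fade. Raw Atari frames are not hashable states, which is why practical methods replace the count with a learned novelty estimate.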
For the state-of-the-art tables, please consult the parent Atari Games task.
(Image credit: Q-map)
Latest papers with no code
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem
Fine-tuning is a widespread technique that allows practitioners to transfer pre-trained capabilities, as recently showcased by the successful applications of foundation models.
Int-HRL: Towards Intention-based Hierarchical Reinforcement Learning
We show that intentions of human players, i.e., the precursors of goal-oriented decisions, can be robustly predicted from eye gaze even for the long-horizon, sparse-reward task of Montezuma's Revenge - one of the most challenging RL tasks in the Atari 2600 game suite.
Sample Efficient Deep Reinforcement Learning via Local Planning
One useful property of simulators is that it is typically easy to reset the environment to a previously observed state.
Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments
In this work, we study a natural solution derived from structural causal models of the world: Our key idea is to learn representations of the future that capture precisely the unpredictable aspects of each outcome -- which we use as additional input for predictions, such that intrinsic rewards only reflect the predictable aspects of world dynamics.
Paused Agent Replay Refresh
Paused Agent Replay Refresh (PARR) is a drop-in replacement for target networks that supports more complex learning algorithms without the need for approximation.
GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning
In this study, we address the problem of efficient exploration in reinforcement learning.
Parametrically Retargetable Decision-Makers Tend To Seek Power
We show that a range of qualitatively dissimilar decision-making procedures incentivize agents to seek power.
Understanding and Preventing Capacity Loss in Reinforcement Learning
The reinforcement learning (RL) problem is rife with sources of non-stationarity, making it a notoriously difficult problem domain for the application of neural networks.
Generative Adversarial Exploration for Reinforcement Learning
Exploration is crucial for training the optimal reinforcement learning (RL) policy, where the key is to discriminate whether a visited state is novel.
Exploration by Random Network Distillation
In particular we establish state of the art performance on Montezuma's Revenge, a game famously difficult for deep reinforcement learning methods.
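The core of Random Network Distillation is to use the prediction error against a fixed, randomly initialised target network as the intrinsic reward: states the predictor has seen often become easy to predict and earn little bonus. A minimal linear-network sketch (dimensions, learning rate, and function names are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
OBS_DIM, EMB_DIM = 8, 4

# Fixed, randomly initialised target network (never trained).
W_target = rng.normal(size=(EMB_DIM, OBS_DIM))
# Predictor network, trained to match the target's embeddings.
W_pred = np.zeros((EMB_DIM, OBS_DIM))

def intrinsic_reward(obs):
    """Squared prediction error of the predictor vs. the fixed target."""
    err = W_pred @ obs - W_target @ obs
    return float(err @ err)

def train_predictor(obs, lr=0.01):
    """One SGD step on 0.5 * ||W_pred @ obs - W_target @ obs||^2."""
    global W_pred
    err = W_pred @ obs - W_target @ obs
    W_pred -= lr * np.outer(err, obs)  # exact gradient for the linear case

# A repeatedly visited state becomes "familiar": its bonus shrinks.
obs = rng.normal(size=OBS_DIM)
before = intrinsic_reward(obs)
for _ in range(200):
    train_predictor(obs)
after = intrinsic_reward(obs)
```

The paper's actual networks are convolutional and trained on Atari frames; the linear maps here only illustrate why the error, and hence the exploration bonus, decays on frequently visited states while staying high on novel ones.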