Montezuma's Revenge

28 papers with code • 1 benchmarks • 1 datasets

Montezuma's Revenge is an ATARI 2600 Benchmark game that is known to be difficult to perform on for reinforcement learning algorithms. Solutions typically employ algorithms that incentivise environment exploration in different ways.

For the state-of-the art tables, please consult the parent Atari Games task.

( Image credit: Q-map )

Benchmarks

Add a Result

These leaderboards are used to track progress in Montezuma's Revenge

Trend	Dataset	Best Model	Paper	Code	Compare
	Atari 2600 Montezuma's Revenge	Rainbow (tuned)			See all

Datasets

Arcade Learning Environment

Latest papers

Most implemented Social Latest No code

Exploring Unknown States with Action Balance

NeteaseFuxiRL/action-balance-exploration • • 10 Mar 2020

In this paper, we focus on improving the effectiveness of finding unknown states and propose action balance exploration, which balances the frequency of selecting each action at a given state and can be treated as an extension of upper confidence bound (UCB) to deep reinforcement learning.

10 Mar 2020

Paper
Code

Uncertainty-sensitive Learning and Planning with Ensembles

learningandplanningICLR/learningandplanning • • 19 Dec 2019

The former manifests itself through the use of value function, while the latter is powered by a tree search planner.

19 Dec 2019

Paper
Code

DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning

grockious/deepsynth • • 22 Nov 2019

This paper proposes DeepSynth, a method for effective training of deep Reinforcement Learning (RL) agents when the reward is sparse and non-Markovian, but at the same time progress towards the reward requires achieving an unknown sequence of high-level objectives.

22 Nov 2019

Paper
Code

Uncertainty - sensitive learning and planning with ensembles

learningandplanningICLR/learningandplanning • • 25 Sep 2019

Notably, our method performs well in environments with sparse rewards where standard $TD(1)$ backups fail.

25 Sep 2019

Paper
Code

Combining Experience Replay with Exploration by Random Network Distillation

Francesco-Sovrano/Combining--experience-replay--with--exploration-by-random-network-distillation- • • 18 May 2019

Our work is a simple extension of the paper "Exploration by Random Network Distillation".

18 May 2019

Paper
Code

Using Natural Language for Reward Shaping in Reinforcement Learning

prasoongoyal/rl-learn • • 5 Mar 2019

A common approach to reduce interaction time with the environment is to use reward shaping, which involves carefully designing reward functions that provide the agent intermediate rewards for progress towards the goal.

05 Mar 2019

Paper
Code