TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Montezuma's Revenge	Atari 2600 Montezuma's Revenge	Rainbow	Average Return (NoOp)	384	# 3
Atari Games	Atari 2600 Ms. Pacman	Rainbow	Score	2,570.2	# 47
Atari Games	Atari 2600 Space Invaders	Rainbow	Score	12,629.0	# 54
Atari Games	Atari-57	Rainbow DQN	Human World Record Breakthrough	4	# 8
Atari Games	Atari-57	Rainbow DQN	Mean Human Normalized Score	873.97%	# 9
Atari Games	atari game	Rainbow	Human World Record Breakthrough	4	# 9
Atari Games	Atari games	Rainbow DQN	Mean Human Normalized Score	873.97%	# 10

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/rainbow-combining-improvements-in-deep/montezuma-s-revenge-on-atari-2600-montezuma-s)](https://paperswithcode.com/sota/montezuma-s-revenge-on-atari-2600-montezuma-s?p=rainbow-combining-improvements-in-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/rainbow-combining-improvements-in-deep/atari-games-on-atari-57)](https://paperswithcode.com/sota/atari-games-on-atari-57?p=rainbow-combining-improvements-in-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/rainbow-combining-improvements-in-deep/atari-games-on-atari-game)](https://paperswithcode.com/sota/atari-games-on-atari-game?p=rainbow-combining-improvements-in-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/rainbow-combining-improvements-in-deep/atari-games-on-atari-games)](https://paperswithcode.com/sota/atari-games-on-atari-games?p=rainbow-combining-improvements-in-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/rainbow-combining-improvements-in-deep/atari-games-on-atari-2600-ms-pacman)](https://paperswithcode.com/sota/atari-games-on-atari-2600-ms-pacman?p=rainbow-combining-improvements-in-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/rainbow-combining-improvements-in-deep/atari-games-on-atari-2600-space-invaders)](https://paperswithcode.com/sota/atari-games-on-atari-2600-space-invaders?p=rainbow-combining-improvements-in-deep)`

Rainbow: Combining Improvements in Deep Reinforcement Learning

6 Oct 2017 · Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Dan Horgan, Bilal Piot, Mohammad Azar, David Silver ·

The deep reinforcement learning community has made several independent improvements to the DQN algorithm. However, it is unclear which of these extensions are complementary and can be fruitfully combined. This paper examines six extensions to the DQN algorithm and empirically studies their combination. Our experiments show that the combination provides state-of-the-art performance on the Atari 2600 benchmark, both in terms of data efficiency and final performance. We also provide results from a detailed ablation study that shows the contribution of each component to overall performance.

PDF Abstract

Code

Add Remove Mark official

thu-ml/tianshou

7,370

facebookresearch/ReAgent

3,521

facebookresearch/Horizon

3,521

opendilab/DI-engine

2,500

Curt-Park/rainbow-is-all-you-need

↳ Quickstart in

Colab

1,751

See all 32 implementations

Tasks

Add Remove

Atari Games

Montezuma's Revenge

reinforcement-learning

Reinforcement Learning (RL)

Datasets

Arcade Learning Environment

DQN Replay Dataset

Results from the Paper

Edit

Ranked #9 on Atari Games on atari game

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Montezuma's Revenge	Atari 2600 Montezuma's Revenge	Rainbow	Average Return (NoOp)	384	# 3	Compare
Atari Games	Atari 2600 Ms. Pacman	Rainbow	Score	2,570.2	# 47	Compare
Atari Games	Atari 2600 Space Invaders	Rainbow	Score	12,629.0	# 54	Compare
Atari Games	Atari-57	Rainbow DQN	Human World Record Breakthrough	4	# 8	Compare
Atari Games	Atari-57	Rainbow DQN	Mean Human Normalized Score	873.97%	# 9	Compare
Atari Games	atari game	Rainbow	Human World Record Breakthrough	4	# 9	Compare
Atari Games	Atari games	Rainbow DQN	Mean Human Normalized Score	873.97%	# 10	Compare

Methods

Add Remove

Adam • Convolution • Dense Connections • Double Q-learning • DQN • Dueling Network • Noisy Linear Layer • N-step Returns • Prioritized Experience Replay • Q-Learning • Rainbow DQN

Edit Social Preview

Rainbow: Combining Improvements in Deep Reinforcement Learning

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove