TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Atari Games	Atari 2600 Alien	FQF	Score	16754.6	# 9
Atari Games	Atari 2600 Amidar	FQF	Score	3165.3	# 6
Atari Games	Atari 2600 Asterix	FQF	Score	578388.5	# 7
Atari Games	Atari 2600 Asteroids	FQF	Score	4553.0	# 15
Atari Games	Atari 2600 Battle Zone	FQF	Score	87928.6	# 8
Atari Games	Atari 2600 Berzerk	FQF	Score	12422.2	# 8
Atari Games	Atari 2600 Bowling	FQF	Score	102.3	# 11
Atari Games	Atari 2600 Breakout	FQF	Score	854.2	# 6
Atari Games	Atari 2600 Chopper Command	FQF	Score	876460.0	# 7
Atari Games	Atari 2600 Crazy Climber	FQF	Score	223470.6	# 7
Atari Games	Atari 2600 Fishing Derby	FQF	Score	52.7	# 12
Atari Games	Atari 2600 Frostbite	Fearlessmrx	Score	214060	# 5
Atari Games	Atari 2600 Gravitar	FQF	Score	1406.0	# 20
Atari Games	Atari 2600 HERO	FQF	Score	30926.2	# 14
Atari Games	Atari 2600 Ice Hockey	FQF	Score	17.3	# 11
Atari Games	Atari 2600 James Bond	FQF	Score	87291.7	# 5
Atari Games	Atari 2600 Kung-Fu Master	FQF	Score	111138.5	# 8
Atari Games	Atari 2600 Ms. Pacman	FQF	Score	7631.9	# 11
Atari Games	Atari 2600 Phoenix	FQF	Score	174077.5	# 11
Atari Games	Atari 2600 River Raid	FQF	Score	23560.7	# 10
Atari Games	Atari 2600 Robotank	FQF	Score	75.7	# 9
Atari Games	Atari 2600 Skiing	FQF	Score	-9085.3	# 3
Atari Games	Atari 2600 Space Invaders	FQF	Score	46498.3	# 8
Atari Games	Atari 2600 Star Gunner	FQF	Score	131981.2	# 12
Atari Games	Atari 2600 Wizard of Wor	FQF	Score	44782.6	# 9

Fully Parameterized Quantile Function for Distributional Reinforcement Learning

NeurIPS 2019 · Derek Yang, Li Zhao, Zichuan Lin, Tao Qin, Jiang Bian, Tie-Yan Liu

Distributional Reinforcement Learning (RL) differs from traditional RL in that, rather than the expectation of total returns, it estimates distributions and has achieved state-of-the-art performance on Atari Games. The key challenge in practical distributional RL algorithms lies in how to parameterize estimated distributions so as to better approximate the true continuous distribution. Existing distributional RL algorithms parameterize either the probability side or the return value side of the distribution function, leaving the other side uniformly fixed as in C51 and QR-DQN, or randomly sampled as in IQN. In this paper, we propose a fully parameterized quantile function that parameterizes both the quantile fraction axis (i.e., the x-axis) and the value axis (i.e., the y-axis) for distributional RL. Our algorithm contains a fraction proposal network that generates a discrete set of quantile fractions and a quantile value network that gives corresponding quantile values. The two networks are jointly trained to find the best approximation of the true distribution. Experiments on 55 Atari Games show that our algorithm significantly outperforms existing distributional RL algorithms and creates a new record for the Arcade Learning Environment for non-distributed agents.
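
The interplay between the two axes described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the function names `propose_fractions` and `expected_return` are hypothetical, producing fractions with a softmax followed by a cumulative sum is an assumption about one way to obtain an increasing set of fractions, and a fixed Gaussian quantile function stands in for a learned quantile value network.

```python
import math
from statistics import NormalDist


def propose_fractions(logits):
    """Map N unconstrained logits to N+1 increasing fractions in [0, 1].

    Assumption for illustration: softmax gives N positive interval widths,
    and their running sum gives tau_0 = 0 < tau_1 < ... < tau_N = 1.
    """
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]  # shift by max for stability
    total = sum(exps)
    taus = [0.0]
    for e in exps:
        taus.append(taus[-1] + e / total)
    taus[-1] = 1.0  # guard against floating-point drift in the last entry
    return taus


def expected_return(taus, quantile_fn):
    """Approximate the mean return with a step function.

    Each interval [tau_i, tau_{i+1}] contributes its width times the
    quantile value evaluated at the interval midpoint.
    """
    total = 0.0
    for lo, hi in zip(taus[:-1], taus[1:]):
        mid = 0.5 * (lo + hi)
        total += (hi - lo) * quantile_fn(mid)
    return total


# Hypothetical stand-in for a learned quantile value network:
# the inverse CDF of a Normal(mean=2, std=1) return distribution.
toy_quantile_fn = NormalDist(mu=2.0, sigma=1.0).inv_cdf

taus = propose_fractions([0.0] * 8)  # equal logits give uniform fractions
q_value = expected_return(taus, toy_quantile_fn)
```

With equal logits the fractions are uniform, which corresponds to the fixed-fraction setting the abstract attributes to QR-DQN; unequal logits would concentrate fractions where the quantile function changes fastest, which is the extra freedom a learned fraction proposal provides.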

Code

opendilab/DI-engine (2,585 stars)

ku2482/fqf-iqn-qrdqn.pytorch (148 stars)

ku2482/rljax

microsoft/FQF

BY571/FQF-and-Extensions

See all 6 implementations

Tasks

Atari Games

Distributional Reinforcement Learning

reinforcement-learning

Reinforcement Learning (RL)

Datasets

Arcade Learning Environment

Results from the Paper

Ranked #3 on Atari Games on Atari 2600 Skiing (using extra training data)
