TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Atari Games 100k	Atari 100k	HarmonyDream	Mean Human-Normalized Score	1.365	# 4
Atari Games 100k	Atari 100k	HarmonyDream	Medium Human-Normalized Score	0.671	# 4

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/harmony-world-models-boosting-sample/atari-games-100k-on-atari-100k)](https://paperswithcode.com/sota/atari-games-100k-on-atari-100k?p=harmony-world-models-boosting-sample)`

HarmonyDream: Task Harmonization Inside World Models

30 Sep 2023 · Haoyu Ma, Jialong Wu, Ningya Feng, Chenjun Xiao, Dong Li, Jianye Hao, Jianmin Wang, Mingsheng Long ·

Model-based reinforcement learning (MBRL) holds the promise of sample-efficient learning by utilizing a world model, which models how the environment works and typically encompasses components for two tasks: observation modeling and reward modeling. In this paper, through a dedicated empirical investigation, we gain a deeper understanding of the role each task plays in world models and uncover the overlooked potential of sample-efficient MBRL by mitigating the domination of either observation or reward modeling. Our key insight is that while prevalent approaches of explicit MBRL attempt to restore abundant details of the environment via observation models, it is difficult due to the environment's complexity and limited model capacity. On the other hand, reward models, while dominating implicit MBRL and adept at learning compact task-centric dynamics, are inadequate for sample-efficient learning without richer learning signals. Motivated by these insights and discoveries, we propose a simple yet effective approach, HarmonyDream, which automatically adjusts loss coefficients to maintain task harmonization, i.e. a dynamic equilibrium between the two tasks in world model learning. Our experiments show that the base MBRL method equipped with HarmonyDream gains 10%-69% absolute performance boosts on visual robotic tasks and sets a new state-of-the-art result on the Atari 100K benchmark.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Atari Games 100k

Model-based Reinforcement Learning

reinforcement-learning

Datasets

DeepMind Control Suite

RLBench Atari 100k

Results from the Paper

Edit

Ranked #4 on Atari Games 100k on Atari 100k

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Result	Benchmark
Atari Games 100k	Atari 100k	HarmonyDream	Mean Human-Normalized Score	1.365	# 4		Compare
Atari Games 100k	Atari 100k	HarmonyDream	Medium Human-Normalized Score	0.671	# 4		Compare

Methods

Add Remove

BASE

Edit Social Preview

HarmonyDream: Task Harmonization Inside World Models

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove