Model-based Reinforcement Learning
195 papers with code • 0 benchmarks • 1 dataset
Benchmarks
These leaderboards are used to track progress in Model-based Reinforcement Learning
Libraries
Use these libraries to find Model-based Reinforcement Learning models and implementations.
Most implemented papers
Context-aware Dynamics Model for Generalization in Model-Based Reinforcement Learning
Model-based reinforcement learning (RL) enjoys several benefits, such as data-efficiency and planning, by learning a model of the environment's dynamics.
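The core idea above, learning a model of the environment's dynamics from transitions and then using it in place of the real environment, can be illustrated with a toy sketch. Everything here is hypothetical: a linear environment is assumed so the dynamics model can be fit by plain least squares.

```python
import numpy as np

# Toy sketch of model-based RL's core idea (hypothetical linear environment):
# fit a dynamics model s' ≈ W^T [s; a] from observed transitions, then use it
# to predict rollouts instead of querying the real environment.

rng = np.random.default_rng(0)

# True (unknown to the agent) linear dynamics: s' = S s + B a
S_true = np.array([[0.9, 0.1], [0.0, 0.95]])
B_true = np.array([[0.0], [0.5]])

# Collect transitions (s, a, s') from random interaction.
states = rng.normal(size=(500, 2))
actions = rng.normal(size=(500, 1))
next_states = states @ S_true.T + actions @ B_true.T

# Fit the dynamics model by least squares on [s, a] -> s'.
X = np.hstack([states, actions])
W, *_ = np.linalg.lstsq(X, next_states, rcond=None)

# The learned model can now predict transitions for planning.
s, a = np.array([1.0, -1.0]), np.array([0.3])
pred = np.hstack([s, a]) @ W
true = S_true @ s + B_true @ a
print(np.allclose(pred, true, atol=1e-6))  # noiseless linear data, so the fit is exact
```

In practice the dynamics model is a neural network trained by regression on replayed transitions, but the data-efficiency argument is the same: once the model is accurate, rollouts are cheap.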
The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning
For example, the common single-task sample-efficiency metric conflates improvements due to model-based learning with various other aspects, such as representation learning, making it difficult to assess true progress on model-based RL.
Understanding and Mitigating the Limitations of Prioritized Experience Replay
Prioritized Experience Replay (ER) has been empirically shown to improve sample efficiency across many domains and has attracted great attention; however, there is little theoretical understanding of why such prioritized sampling helps, or of its limitations.
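The prioritized sampling being analyzed can be sketched in a few lines. This is a minimal illustration, not the paper's exact scheme: transitions are drawn with probability proportional to |TD error|^alpha, and importance-sampling weights correct the induced bias; the TD errors and exponents below are hypothetical.

```python
import numpy as np

# Minimal sketch of prioritized experience replay sampling (assumed exponents).
rng = np.random.default_rng(0)
alpha, beta = 0.6, 0.4

td_errors = np.array([0.1, 2.0, 0.5, 0.05, 1.0])  # hypothetical per-transition TD errors
priorities = np.abs(td_errors) ** alpha
probs = priorities / priorities.sum()

# High-error transitions are replayed more often...
idx = rng.choice(len(td_errors), size=1000, p=probs)

# ...but their gradient updates are down-weighted by importance-sampling
# weights to (approximately) correct the sampling bias.
weights = (len(td_errors) * probs) ** (-beta)
weights /= weights.max()

print(probs.argmax())    # transition 1 (largest |TD error|) is sampled most
print(weights.argmin())  # and receives the smallest importance weight
```

The bias this weighting only partially corrects is exactly the kind of limitation the paper above studies theoretically.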
Integrated Decision and Control: Towards Interpretable and Computationally Efficient Driving Intelligence
In this paper, we present an interpretable and computationally efficient framework called integrated decision and control (IDC) for automated vehicles, which decomposes the driving task into static path planning and dynamic optimal tracking that are structured hierarchically.
Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow
This paves the way for new research directions, e.g. investigating uncertainty-aware environment models that are not necessarily neural-network-based, or developing algorithms to solve industrially-motivated benchmarks that share characteristics with real-world problems.
Online and Offline Reinforcement Learning by Planning with a Learned Model
Combining Reanalyse with the MuZero algorithm, we introduce MuZero Unplugged, a single unified algorithm for any data budget, including offline RL.
Gradient Information Matters in Policy Optimization by Back-propagating through Model
We then propose a two-model-based learning method to control both the prediction error and the gradient error.
Planning in Stochastic Environments with a Learned Model
However, previous instantiations of this approach were limited to the use of deterministic models.
Planning with Diffusion for Flexible Behavior Synthesis
Model-based reinforcement learning methods often use learning only for the purpose of estimating an approximate dynamics model, offloading the rest of the decision-making work to classical trajectory optimizers.
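The division of labor described above, where a learned dynamics model handles prediction and a classical trajectory optimizer handles decision-making, is often realized with simple random-shooting MPC. The sketch below assumes hypothetical scalar dynamics and reward functions standing in for a learned model.

```python
import numpy as np

# Sketch of a classical trajectory optimizer (random shooting, MPC-style)
# running on top of a dynamics model. Both functions are hypothetical
# stand-ins for a learned model.

rng = np.random.default_rng(0)

def dynamics(s, a):
    # A learned model's one-step prediction would go here.
    return 0.9 * s + a

def reward(s, a):
    # Drive the state toward zero, penalize control effort.
    return -(s ** 2) - 0.1 * (a ** 2)

def plan(s0, horizon=10, n_candidates=256):
    # Sample candidate action sequences, roll each out under the model,
    # and return the first action of the highest-return sequence.
    seqs = rng.uniform(-1.0, 1.0, size=(n_candidates, horizon))
    returns = np.zeros(n_candidates)
    s = np.full(n_candidates, s0)
    for t in range(horizon):
        a = seqs[:, t]
        returns += reward(s, a)
        s = dynamics(s, a)
    return seqs[returns.argmax(), 0]

a0 = plan(s0=2.0)
print(-1.0 <= a0 <= 1.0)  # a valid first action within the sampled range
```

Diffusion-based planners like the paper above replace this sample-and-score loop with a generative model over whole trajectories, so planning and sampling become the same operation.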
The Effectiveness of World Models for Continual Reinforcement Learning
World models power some of the most efficient reinforcement learning algorithms.