Distributional Reinforcement Learning

31 papers with code • 0 benchmarks • 0 datasets

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Benchmarks

Add a Result

These leaderboards are used to track progress in Distributional Reinforcement Learning

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Latest papers

Most implemented Social Latest No code

A Distributional Analogue to the Successor Representation

jessefarebro/distributional-sr • • 13 Feb 2024

This paper contributes a new approach for distributional reinforcement learning which elucidates a clean separation of transition structure and reward in the learning process.

13 Feb 2024

Paper
Code

A Robust Quantile Huber Loss With Interpretable Parameter Adjustment In Distributional Reinforcement Learning

pmalekzadeh/A-robust-quantile-huber-loss • • 4 Jan 2024

Distributional Reinforcement Learning (RL) estimates return distribution mainly by learning quantile values via minimizing the quantile Huber loss function, entailing a threshold parameter often selected heuristically or via hyperparameter search, which may not generalize well and can be suboptimal.

04 Jan 2024

Paper
Code

Distributional Bellman Operators over Mean Embeddings

google-deepmind/sketch_dqn • • 9 Dec 2023

We propose a novel algorithmic framework for distributional reinforcement learning, based on learning finite-dimensional mean embeddings of return distributions.

09 Dec 2023

Paper
Code

Estimation and Inference in Distributional Reinforcement Learning

zhangliangyu32/estimationandinferencedistributionalrl • • 29 Sep 2023

This implies the distributional policy evaluation problem can be solved with sample efficiency.

29 Sep 2023

Paper
Code

Variance Control for Distributional Reinforcement Learning

kuangqi927/qem • • 30 Jul 2023

Although distributional reinforcement learning (DRL) has been widely examined in the past few years, very few studies investigate the validity of the obtained Q-function estimator in the distributional setting.

30 Jul 2023

Paper
Code

Distributional Model Equivalence for Risk-Sensitive Reinforcement Learning

tylerkastner/distribution-equivalence • • NeurIPS 2023

We consider the problem of learning models for risk-sensitive reinforcement learning.

04 Jul 2023

Paper
Code

Distributional constrained reinforcement learning for supply chain optimization

jaimesabalimperial/jaisalab • • 3 Feb 2023

We introduce Distributional Constrained Policy Optimization (DCPO), a novel approach for reliable constraint satisfaction in RL.

03 Feb 2023

Paper
Code

Trust Region-Based Safe Distributional Reinforcement Learning for Multiple Constraints

rllab-snu/safe-distributional-actor-critic • • NeurIPS 2023

In safety-critical robotic tasks, potential failures must be reduced, and multiple constraints must be met, such as avoiding collisions, limiting energy consumption, and maintaining balance.

26 Jan 2023

Paper
Code

Risk-Sensitive Policy with Distributional Reinforcement Learning

thibauttheate/risk-sensitive-policy-with-distributional-reinforcement-learning • • 30 Dec 2022

Classical reinforcement learning (RL) techniques are generally concerned with the design of decision-making policies driven by the maximisation of the expected outcome.

30 Dec 2022

Paper
Code

Intelligent Resource Allocation in Joint Radar-Communication With Graph Neural Networks

joleeson/Directional-JRC • • IEEE Transactions on Vehicular Technology 2022

In this paper, we propose a framework for intelligent vehicles to conduct JRC, with minimal prior knowledge of the system model and a tunable performance balance, in an environment where surrounding vehicles execute radar detection periodically, which is typical in contemporary protocols.

17 Oct 2022

Paper
Code

Distributional Reinforcement Learning

Benchmarks Add a Result

Latest papers

Content

Benchmarks

Add a Result