Search Results for author: Adrien Bolland

Found 8 papers, 3 papers with code

Behind the Myth of Exploration in Policy Gradients

no code implementations31 Jan 2024 Adrien Bolland, Gaspard Lambrechts, Damien Ernst

To compute near-optimal policies, it is essential in practice to include exploration terms in the learning objective.

Informed POMDP: Leveraging Additional Information in Model-Based RL

1 code implementation20 Jun 2023 Gaspard Lambrechts, Adrien Bolland, Damien Ernst

We then show that this informed objective consists of learning an environment model from which we can sample latent trajectories.

Policy Gradient Algorithms Implicitly Optimize by Continuation

no code implementations11 May 2023 Adrien Bolland, Gilles Louppe, Damien Ernst

First, we formulate direct policy optimization in the optimization by continuation framework.

Recurrent networks, hidden states and beliefs in partially observable environments

no code implementations6 Aug 2022 Gaspard Lambrechts, Adrien Bolland, Damien Ernst

In summary, this work shows that in its hidden states, a recurrent neural network approximating the Q-function of a partially observable environment reproduces a sufficient statistic from the history that is correlated to the relevant part of the belief for taking optimal actions.

Distributional Reinforcement Learning with Unconstrained Monotonic Neural Networks

1 code implementation6 Jun 2021 Thibaut Théate, Antoine Wehenkel, Adrien Bolland, Gilles Louppe, Damien Ernst

The results highlight the main strengths and weaknesses associated with each probability metric together with an important limitation of the Wasserstein distance.

Distributional Reinforcement Learning reinforcement-learning +2

Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent

1 code implementation2 Jun 2020 Adrien Bolland, Ioannis Boukas, Mathias Berger, Damien Ernst

We assess the performance of our algorithm in three environments concerned with the design and control of a mass-spring-damper system, a small-scale off-grid power system and a drone, respectively.

Policy Gradient Methods reinforcement-learning +1

A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding

no code implementations13 Apr 2020 Ioannis Boukas, Damien Ernst, Thibaut Théate, Adrien Bolland, Alexandre Huynen, Martin Buchwald, Christelle Wynants, Bertrand Cornélusse

In this paper, we propose a novel modelling framework for the strategic participation of energy storage in the European continuous intraday market where exchanges occur through a centralized order book.

Decision Making reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.