Search Results for author: Gaspard Lambrechts

Found 4 papers, 1 papers with code

Behind the Myth of Exploration in Policy Gradients

no code implementations31 Jan 2024 Adrien Bolland, Gaspard Lambrechts, Damien Ernst

To compute near-optimal policies, it is essential in practice to include exploration terms in the learning objective.

Informed POMDP: Leveraging Additional Information in Model-Based RL

1 code implementation20 Jun 2023 Gaspard Lambrechts, Adrien Bolland, Damien Ernst

We then show that this informed objective consists of learning an environment model from which we can sample latent trajectories.

Recurrent networks, hidden states and beliefs in partially observable environments

no code implementations6 Aug 2022 Gaspard Lambrechts, Adrien Bolland, Damien Ernst

In summary, this work shows that in its hidden states, a recurrent neural network approximating the Q-function of a partially observable environment reproduces a sufficient statistic from the history that is correlated to the relevant part of the belief for taking optimal actions.

Warming up recurrent neural networks to maximise reachable multistability greatly improves learning

no code implementations2 Jun 2021 Gaspard Lambrechts, Florent De Geeter, Nicolas Vecoven, Damien Ernst, Guillaume Drion

This insight leads to the design of a novel way to initialise any recurrent cell connectivity through a procedure called "warmup" to improve its capability to learn arbitrarily long time dependencies.

Time Series Analysis

Cannot find the paper you are looking for? You can Submit a new open access paper.