Search Results for author: Vincent Francois-Lavet

Found 9 papers, 4 papers with code

Improving generalization in reinforcement learning through forked agents

no code implementations13 Dec 2022 Olivier Moulin, Vincent Francois-Lavet, Mark Hoogendoorn

An eco-system of agents each having their own policy with some, but limited, generalizability has proven to be a reliable approach to increase generalization across procedurally generated environments.

reinforcement-learning Reinforcement Learning (RL) +1

A Machine With Human-Like Memory Systems

1 code implementation4 Apr 2022 Taewoon Kim, Michael Cochez, Vincent Francois-Lavet, Mark Neerincx, Piek Vossen

Inspired by the cognitive science theory, we explicitly model an agent with both semantic and episodic memory systems, and show that it is better than having just one of the two memory systems.

OpenAI Gym

Component Transfer Learning for Deep RL Based on Abstract Representations

1 code implementation22 Nov 2021 Geoffrey van Driessel, Vincent Francois-Lavet

We learn a low-dimensional encoding of the environment, meant to capture summarizing abstractions, from which the internal dynamics and value functions are learned.

Transfer Learning

Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey

no code implementations28 Sep 2021 Amjad Yousef Majid, Serge Saaybi, Tomas van Rietbergen, Vincent Francois-Lavet, R Venkatesha Prasad, Chris Verhoeven

Deep Reinforcement Learning (DRL) and Evolution Strategies (ESs) have surpassed human-level control in many sequential decision-making problems, yet many open challenges still exist.

Decision Making reinforcement-learning +1

RandomNet: Towards Fully Automatic Neural Architecture Design for Multimodal Learning

no code implementations2 Mar 2020 Stefano Alletto, Shenyang Huang, Vincent Francois-Lavet, Yohei Nakata, Guillaume Rabusseau

Almost all neural architecture search methods are evaluated in terms of performance (i. e. test accuracy) of the model structures that it finds.

Neural Architecture Search

On overfitting and asymptotic bias in batch reinforcement learning with partial observability

no code implementations22 Sep 2017 Vincent Francois-Lavet, Guillaume Rabusseau, Joelle Pineau, Damien Ernst, Raphael Fonteneau

This paper provides an analysis of the tradeoff between asymptotic bias (suboptimality with unlimited data) and overfitting (additional suboptimality due to limited data) in the context of reinforcement learning with partial observability.

reinforcement-learning Reinforcement Learning (RL)

Cannot find the paper you are looking for? You can Submit a new open access paper.