Search Results for author: Thomas Pierrot

Found 19 papers, 8 papers with code

Multi-Objective Quality-Diversity for Crystal Structure Prediction

1 code implementation • 25 Mar 2024 • Hannah Janmohamed, Marta Wolinska, Shikha Surana, Thomas Pierrot, Aron Walsh, Antoine Cully

This approach overlooks other potentially interesting materials that lie in neighbouring local minima and have different material properties such as conductivity or resistance to deformation.

Paper
Code

QDax: A Library for Quality-Diversity and Population-based Algorithms with Hardware Acceleration

1 code implementation • 7 Aug 2023 • Felix Chalumeau, Bryan Lim, Raphael Boige, Maxime Allard, Luca Grillotti, Manon Flageat, Valentin Macé, Arthur Flajolet, Thomas Pierrot, Antoine Cully

QDax is an open-source library with a streamlined and modular API for Quality-Diversity (QD) optimization algorithms in Jax.

Continuous Control reinforcement-learning +1

243

Paper
Code

PASTA: Pretrained Action-State Transformer Agents

no code implementations • 20 Jul 2023 • Raphael Boige, Yannis Flet-Berliac, Arthur Flajolet, Guillaume Richard, Thomas Pierrot

Self-supervised learning has brought about a revolutionary paradigm shift in various computing domains, including NLP, vision, and biology.

Language Modelling Masked Language Modeling +3

Paper
Add Code

Gradient-Informed Quality Diversity for the Illumination of Discrete Spaces

no code implementations • 8 Jun 2023 • Raphael Boige, Guillaume Richard, Jérémie Dona, Thomas Pierrot, Antoine Cully

While early QD algorithms view the objective and descriptor functions as black-box functions, novel tools have been introduced to use gradient information to accelerate the search and improve overall performance of those algorithms over continuous input spaces.

Drug Discovery Image Generation +1

Paper
Add Code

The Quality-Diversity Transformer: Generating Behavior-Conditioned Trajectories with Decision Transformers

no code implementations • 27 Mar 2023 • Valentin Macé, Raphaël Boige, Felix Chalumeau, Thomas Pierrot, Guillaume Richard, Nicolas Perrin-Gilbert

In the context of neuroevolution, Quality-Diversity algorithms have proven effective in generating repertoires of diverse and efficient policies by relying on the definition of a behavior space.

Paper
Add Code

Evolving Populations of Diverse RL Agents with MAP-Elites

1 code implementation • 9 Mar 2023 • Thomas Pierrot, Arthur Flajolet

Quality Diversity (QD) has emerged as a powerful alternative optimization paradigm that aims at generating large and diverse collections of solutions, notably with its flagship algorithm MAP-ELITES (ME) which evolves solutions through mutations and crossovers.

Reinforcement Learning (RL)

243

Paper
Code

Improving the Data Efficiency of Multi-Objective Quality-Diversity through Gradient Assistance and Crowding Exploration

1 code implementation • 24 Feb 2023 • Hannah Janmohamed, Thomas Pierrot, Antoine Cully

We show that MOME-PGX is between 4. 3 and 42 times more data-efficient than MOME and doubles the performance of MOME, NSGA-II and SPEA2 in challenging environments.

Evolutionary Algorithms

Paper
Code

Assessing Quality-Diversity Neuro-Evolution Algorithms Performance in Hard Exploration Problems

no code implementations • 24 Nov 2022 • Felix Chalumeau, Thomas Pierrot, Valentin Macé, Arthur Flajolet, Karim Beguir, Antoine Cully, Nicolas Perrin-Gilbert

Exploration is at the heart of several domains trying to solve control problems such as Reinforcement Learning and QD methods are promising candidates to overcome the challenges associated.

Evolutionary Algorithms

Paper
Add Code

Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery

1 code implementation • 6 Oct 2022 • Felix Chalumeau, Raphael Boige, Bryan Lim, Valentin Macé, Maxime Allard, Arthur Flajolet, Antoine Cully, Thomas Pierrot

Recent work has shown that training a mixture of policies, as opposed to a single one, that are driven to explore different regions of the state-action space can address this shortcoming by generating a diverse set of behaviors, referred to as skills, that can be collectively used to great effect in adaptation tasks or for hierarchical planning.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Fast Population-Based Reinforcement Learning on a Single Machine

no code implementations • 17 Jun 2022 • Arthur Flajolet, Claire Bizon Monroc, Karim Beguir, Thomas Pierrot

Training populations of agents has demonstrated great promise in Reinforcement Learning for stabilizing training, improving exploration and asymptotic performance, and generating a diverse set of solutions.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Multi-Objective Quality Diversity Optimization

1 code implementation • 7 Feb 2022 • Thomas Pierrot, Guillaume Richard, Karim Beguir, Antoine Cully

In this work, we consider the problem of Quality-Diversity (QD) optimization with multiple objectives.

Paper
Code

Factored Action Spaces in Deep Reinforcement Learning

no code implementations • 1 Jan 2021 • Thomas Pierrot, Valentin Macé, Jean-Baptiste Sevestre, Louis Monier, Alexandre Laterre, Nicolas Perrin, Karim Beguir, Olivier Sigaud

Very large action spaces constitute a critical challenge for deep Reinforcement Learning (RL) algorithms.

Continuous Control Dota 2 +3

Paper
Add Code

Sample efficient Quality Diversity for neural continuous control

no code implementations • 1 Jan 2021 • Thomas Pierrot, Valentin Macé, Geoffrey Cideron, Nicolas Perrin, Karim Beguir, Olivier Sigaud

The QD part contributes structural biases by decoupling the search for diversity from the search for high return, resulting in efficient management of the exploration-exploitation trade-off.

Continuous Control Management +1

Paper
Add Code

Designing a Prospective COVID-19 Therapeutic with Reinforcement Learning

no code implementations • 3 Dec 2020 • Marcin J. Skwark, Nicolás López Carranza, Thomas Pierrot, Joe Phillips, Slim Said, Alexandre Laterre, Amine Kerkeni, Uğur Şahin, Karim Beguir

This suggests that combining leading protein design methods with modern deep reinforcement learning is a viable path for discovering a Covid-19 cure and may accelerate design of peptide-based therapeutics for other diseases.

Protein Design reinforcement-learning +1

Paper
Add Code

Offline Reinforcement Learning Hands-On

no code implementations • 29 Nov 2020 • Louis Monier, Jakub Kmec, Alexandre Laterre, Thomas Pierrot, Valentin Courgeau, Olivier Sigaud, Karim Beguir

Offline Reinforcement Learning (RL) aims to turn large datasets into powerful decision-making engines without any online interactions with the environment.

Behavioural cloning Decision Making +3

Paper
Add Code

Learning Compositional Neural Programs for Continuous Control

no code implementations • 27 Jul 2020 • Thomas Pierrot, Nicolas Perrin, Feryal Behbahani, Alexandre Laterre, Olivier Sigaud, Karim Beguir, Nando de Freitas

Third, the self-models are harnessed to learn recursive compositional programs with multiple levels of abstraction.

Continuous Control

Paper
Add Code

Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization

1 code implementation • NeurIPS 2021 • Thomas Pierrot, Valentin Macé, Félix Chalumeau, Arthur Flajolet, Geoffrey Cideron, Karim Beguir, Antoine Cully, Olivier Sigaud, Nicolas Perrin-Gilbert

This paper proposes a novel algorithm, QDPG, which combines the strength of Policy Gradient algorithms and Quality Diversity approaches to produce a collection of diverse and high-performing neural policies in continuous control environments.

Continuous Control Evolutionary Algorithms

Paper
Code

Learning Compositional Neural Programs with Recursive Tree Search and Planning

1 code implementation • NeurIPS 2019 • Thomas Pierrot, Guillaume Ligner, Scott Reed, Olivier Sigaud, Nicolas Perrin, Alexandre Laterre, David Kas, Karim Beguir, Nando de Freitas

AlphaZero contributes powerful neural network guided search algorithms, which we augment with recursion.

Paper
Code

First-order and second-order variants of the gradient descent in a unified framework

no code implementations • 18 Oct 2018 • Thomas Pierrot, Nicolas Perrin, Olivier Sigaud

In this paper, we provide an overview of first-order and second-order variants of the gradient descent method that are commonly used in machine learning.

BIG-bench Machine Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.