Search Results for author: Julien Pérolat

Found 9 papers, 2 papers with code

Learning in Mean Field Games: A Survey

no code implementations25 May 2022 Mathieu Laurière, Sarah Perrin, Julien Pérolat, Sertan Girgin, Paul Muller, Romuald Élie, Matthieu Geist, Olivier Pietquin

Non-cooperative and cooperative games with very large numbers of players have many applications, but they generally become intractable as the number of players grows.

Reinforcement Learning (RL)

Scalable Deep Reinforcement Learning Algorithms for Mean Field Games

no code implementations22 Mar 2022 Mathieu Laurière, Sarah Perrin, Sertan Girgin, Paul Muller, Ayush Jain, Theophile Cabannes, Georgios Piliouras, Julien Pérolat, Romuald Élie, Olivier Pietquin, Matthieu Geist

One factor limiting further scaling with RL is that existing algorithms for solving MFGs require mixing approximated quantities such as strategies or $q$-values.

Reinforcement Learning (RL)
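The "mixing" bottleneck mentioned in the abstract can be illustrated with a toy sketch. In fictitious-play-style MFG algorithms, each iteration computes a best-response $q$-function and the method must keep a uniform running average of all past ones — cheap for a small table, but hard to scale once each $q$-function is a neural network. The one-state, three-action setting and the random stand-in $q$-values below are entirely hypothetical, not from the paper:

```python
import numpy as np

def mix_q(avg_q, new_q, t):
    """Uniform running average of q-values after t previous iterations:
    avg_{t+1} = avg_t + (new - avg_t) / (t + 1)."""
    return avg_q + (new_q - avg_q) / (t + 1)

# Hypothetical loop: one state, three actions; each iteration's "learned"
# best-response q-values are simulated with random numbers.
avg_q = np.zeros(3)
for t in range(5):
    new_q = np.random.default_rng(t).normal(size=3)  # stand-in for a learned q-function
    avg_q = mix_q(avg_q, new_q, t)
```

With tabular $q$-values this average is exact and cheap; with deep networks, averaging (or storing) every past iterate is precisely the scaling obstacle the paper addresses.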

Generalization in Mean Field Games by Learning Master Policies

no code implementations20 Sep 2021 Sarah Perrin, Mathieu Laurière, Julien Pérolat, Romuald Élie, Matthieu Geist, Olivier Pietquin

Mean Field Games (MFGs) can potentially scale multi-agent systems to extremely large populations of agents.

Mean Field Games Flock! The Reinforcement Learning Way

no code implementations17 May 2021 Sarah Perrin, Mathieu Laurière, Julien Pérolat, Matthieu Geist, Romuald Élie, Olivier Pietquin

We present a method enabling a large number of agents to learn how to flock, which is a natural behavior observed in large populations of animals.

Reinforcement Learning (RL)

On the Convergence of Model Free Learning in Mean Field Games

no code implementations4 Jul 2019 Romuald Élie, Julien Pérolat, Mathieu Laurière, Matthieu Geist, Olivier Pietquin

To design scalable algorithms for systems with a large population of interacting agents (e.g., swarms), this paper focuses on Mean Field MAS, where the number of agents is asymptotically infinite.

Computing Approximate Equilibria in Sequential Adversarial Games by Exploitability Descent

no code implementations13 Mar 2019 Edward Lockhart, Marc Lanctot, Julien Pérolat, Jean-Baptiste Lespiau, Dustin Morrill, Finbarr Timbers, Karl Tuyls

In this paper, we present exploitability descent, a new algorithm to compute approximate equilibria in two-player zero-sum extensive-form games with imperfect information, by direct policy optimization against worst-case opponents.

counterfactual
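The core idea of exploitability descent — each player ascends the value of its policy against a worst-case (best-responding) opponent — can be sketched on a zero-sum matrix game. The paper targets imperfect-information extensive-form games; the 2x2 payoff matrix, step size, and softmax parametrization below are illustrative choices, not the paper's setup:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def nash_conv(A, x, y):
    """Sum of best-response gains; zero exactly at a Nash equilibrium."""
    return (A @ y).max() - (x @ A).min()

def exploitability_descent(A, steps=3000, lr=0.1):
    n, m = A.shape
    zx, zy = np.zeros(n), np.zeros(m)          # policy logits
    best = (softmax(zx), softmax(zy), np.inf)  # best iterate seen so far
    for _ in range(steps):
        x, y = softmax(zx), softmax(zy)
        nc = nash_conv(A, x, y)
        if nc < best[2]:
            best = (x, y, nc)
        # One-hot best responses to the current policies.
        y_br = np.eye(m)[np.argmin(x @ A)]     # column player minimises x.A.y
        x_br = np.eye(n)[np.argmax(A @ y)]     # row player maximises x.A.y
        # Each player takes a gradient step on its value against the opponent's
        # best response, holding that best response fixed (envelope theorem).
        gx = A @ y_br                          # gradient of x.A.y_br w.r.t. x
        gy = A.T @ x_br                        # gradient of x_br.A.y w.r.t. y
        zx += lr * x * (gx - x @ gx)           # softmax Jacobian, ascend
        zy -= lr * y * (gy - y @ gy)           # descend (column minimises)
    return best
```

On the example game `A = [[3, 0], [1, 2]]` (mixed equilibrium at x = (1/4, 3/4), y = (1/2, 1/2)), the NashConv of the returned iterate shrinks toward zero, matching the paper's point that direct policy optimization against worst-case opponents drives exploitability down.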
