Search Results for author: Bruno Lacerda

Found 13 papers, 7 papers with code

Monte Carlo Tree Search with Boltzmann Exploration

2 code implementations • NeurIPS 2023 • Michael Painter, Mohamed Baioumy, Nick Hawes, Bruno Lacerda

Monte-Carlo Tree Search (MCTS) methods, such as Upper Confidence Bound applied to Trees (UCT), are instrumental to automated planning techniques.

Game of Go

186

Paper
Code

JaxMARL: Multi-Agent RL Environments in JAX

2 code implementations • 16 Nov 2023 • Alexander Rutherford, Benjamin Ellis, Matteo Gallici, Jonathan Cook, Andrei Lupu, Gardar Ingvarsson, Timon Willi, Akbir Khan, Christian Schroeder de Witt, Alexandra Souly, Saptarashmi Bandyopadhyay, Mikayel Samvelyan, Minqi Jiang, Robert Tjarko Lange, Shimon Whiteson, Bruno Lacerda, Nick Hawes, Tim Rocktaschel, Chris Lu, Jakob Nicolaus Foerster

This not only enables GPU acceleration, but also provides a more flexible MARL environment, unlocking the potential for self-play, meta-learning, and other future applications in MARL.

Meta-Learning Multi-agent Reinforcement Learning +3

328

Paper
Code

A Framework for Learning from Demonstration with Minimal Human Effort

1 code implementation • 15 Jun 2023 • Marc Rigter, Bruno Lacerda, Nick Hawes

In this setting we address reinforcement learning, and learning from demonstration, where there is a cost associated with human time.

reinforcement-learning

Paper
Code

Formal Modelling for Multi-Robot Systems Under Uncertainty

no code implementations • 26 May 2023 • Charlie Street, Masoumeh Mansouri, Bruno Lacerda

Purpose of Review: To effectively synthesise and analyse multi-robot behaviour, we require formal task-level models which accurately capture multi-robot execution.

Paper
Add Code

One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning

1 code implementation • NeurIPS 2023 • Marc Rigter, Bruno Lacerda, Nick Hawes

Our model-based approach is risk-averse to both epistemic and aleatoric uncertainty.

Decision Making Offline RL +1

Paper
Code

RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning

2 code implementations • 26 Apr 2022 • Marc Rigter, Bruno Lacerda, Nick Hawes

Model-based algorithms, which learn a model of the environment from the dataset and perform conservative policy optimisation within that model, have emerged as a promising approach to this problem.

Offline RL reinforcement-learning +1

231

Paper
Code

Planning for Risk-Aversion and Expected Value in MDPs

1 code implementation • 25 Oct 2021 • Marc Rigter, Paul Duckworth, Bruno Lacerda, Nick Hawes

This motivates us to propose a lexicographic approach which minimises the expected cost subject to the constraint that the CVaR of the total cost is optimal.

Paper
Code

On Solving a Stochastic Shortest-Path Markov Decision Process as Probabilistic Inference

no code implementations • 13 Sep 2021 • Mohamed Baioumy, Bruno Lacerda, Paul Duckworth, Nick Hawes

Previous work on planning as active inference addresses finite horizon problems and solutions valid for online planning.

valid

Paper
Add Code

Risk-Averse Bayes-Adaptive Reinforcement Learning

no code implementations • NeurIPS 2021 • Marc Rigter, Bruno Lacerda, Nick Hawes

In this work, we address risk-averse Bayes-adaptive reinforcement learning.

Bayesian Optimisation reinforcement-learning +1

Paper
Add Code

Minimax Regret Optimisation for Robust Planning in Uncertain Markov Decision Processes

no code implementations • 8 Dec 2020 • Marc Rigter, Bruno Lacerda, Nick Hawes

We propose a dynamic programming algorithm that utilises the regret Bellman equation, and show that it optimises minimax regret exactly for UMDPs with independent uncertainties.

Paper
Add Code

Active Inference for Integrated State-Estimation, Control, and Learning

1 code implementation • 12 May 2020 • Mohamed Baioumy, Paul Duckworth, Bruno Lacerda, Nick Hawes

This work presents an approach for control, state-estimation and learning model (hyper)parameters for robotic manipulators.

Robotics

Paper
Code

Convex Hull Monte-Carlo Tree Search

no code implementations • 9 Mar 2020 • Michael Painter, Bruno Lacerda, Nick Hawes

This work investigates Monte-Carlo planning for agents in stochastic environments, with multiple objectives.

Multi-Armed Bandits

Paper
Add Code

Simultaneous Task Allocation and Planning Under Uncertainty

no code implementations • 7 Mar 2018 • Fatma Faruq, Bruno Lacerda, Nick Hawes, David Parker

We propose novel techniques for task allocation and planning in multi-robot systems operating in uncertain environments.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.