Search Results for author: Haitham Bou Ammar

Found 21 papers, 7 papers with code

Framework and Benchmarks for Combinatorial and Mixed-variable Bayesian Optimization

1 code implementation • NeurIPS 2023 • Kamil Dreczkowski, Antoine Grosnit, Haitham Bou Ammar

This paper introduces a modular framework for Mixed-variable and Combinatorial Bayesian Optimization (MCBO) to address the lack of systematic benchmarking and standardized evaluation in the field.

Bayesian Optimization Benchmarking

2,952

Paper
Code

End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes

2 code implementations • NeurIPS 2023 • Alexandre Maraval, Matthieu Zimmer, Antoine Grosnit, Haitham Bou Ammar

We enable this end-to-end framework with reinforcement learning (RL) to tackle the lack of labelled acquisition data.

Bayesian Optimisation Inductive Bias +2

2,953

Paper
Code

Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions

no code implementations • 16 May 2023 • Desong Du, Shaohang Han, Naiming Qi, Haitham Bou Ammar, Jun Wang, Wei Pan

Reinforcement learning (RL) exhibits impressive performance when managing complicated control tasks for robots.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Structured Q-learning For Antibody Design

no code implementations • 10 Sep 2022 • Alexander I. Cowen-Rivers, Philip John Gorinski, Aivar Sootla, Asif Khan, Liu Furui, Jun Wang, Jan Peters, Haitham Bou Ammar

Optimizing combinatorial structures is core to many real-world problems, such as those encountered in life sciences.

Combinatorial Optimization Molecular Docking +1

Paper
Add Code

Effects of Safety State Augmentation on Safe Exploration

1 code implementation • 6 Jun 2022 • Aivar Sootla, Alexander I. Cowen-Rivers, Jun Wang, Haitham Bou Ammar

We further show that Simmer can stabilize training and improve the performance of safe RL with average constraints.

Reinforcement Learning (RL) Safe Exploration +1

2,952

Paper
Code

Sample-Efficient Optimisation with Probabilistic Transformer Surrogates

no code implementations • 27 May 2022 • Alexandre Maraval, Matthieu Zimmer, Antoine Grosnit, Rasul Tutunov, Jun Wang, Haitham Bou Ammar

First, we notice that these models are trained on uniformly distributed inputs, which impairs predictive accuracy on non-uniform data - a setting arising from any typical BO loop due to exploration-exploitation trade-offs.

Bayesian Optimisation Gaussian Processes

Paper
Add Code

BOiLS: Bayesian Optimisation for Logic Synthesis

no code implementations • 11 Nov 2021 • Antoine Grosnit, Cedric Malherbe, Rasul Tutunov, Xingchen Wan, Jun Wang, Haitham Bou Ammar

Optimising the quality-of-results (QoR) of circuits during logic synthesis is a formidable challenge necessitating the exploration of exponentially sized search spaces.

Bayesian Optimisation Navigate

Paper
Add Code

Viscos Flows: Variational Schur Conditional Sampling With Normalizing Flows

no code implementations • 6 Jul 2021 • Vincent Moens, Aivar Sootla, Haitham Bou Ammar, Jun Wang

We present a method for conditional sampling for pre-trained normalizing flows when only part of an observation is available.

Paper
Add Code

Online Double Oracle

1 code implementation • 13 Mar 2021 • Le Cong Dinh, Yaodong Yang, Stephen Mcaleer, Zheng Tian, Nicolas Perez Nieves, Oliver Slumbers, David Henry Mguni, Haitham Bou Ammar, Jun Wang

Solving strategic games with huge action space is a critical yet under-explored topic in economics, operations research and artificial intelligence.

Paper
Code

Diverse Auto-Curriculum is Critical for Successful Real-World Multiagent Learning Systems

no code implementations • 15 Feb 2021 • Yaodong Yang, Jun Luo, Ying Wen, Oliver Slumbers, Daniel Graves, Haitham Bou Ammar, Jun Wang, Matthew E. Taylor

Multiagent reinforcement learning (MARL) has achieved a remarkable amount of success in solving various types of video games.

Autonomous Driving

Paper
Add Code

HEBO Pushing The Limits of Sample-Efficient Hyperparameter Optimisation

3 code implementations • 7 Dec 2020 • Alexander I. Cowen-Rivers, Wenlong Lyu, Rasul Tutunov, Zhi Wang, Antoine Grosnit, Ryan Rhys Griffiths, Alexandre Max Maraval, Hao Jianye, Jun Wang, Jan Peters, Haitham Bou Ammar

Our results on the Bayesmark benchmark indicate that heteroscedasticity and non-stationarity pose significant challenges for black-box optimisers.

Ranked #1 on Hyperparameter Optimization on Bayesmark

Bayesian Optimisation BIG-bench Machine Learning +1

2,952

Paper
Code

SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving

3 code implementations • 19 Oct 2020 • Ming Zhou, Jun Luo, Julian Villella, Yaodong Yang, David Rusu, Jiayu Miao, Weinan Zhang, Montgomery Alban, Iman Fadakar, Zheng Chen, Aurora Chongxi Huang, Ying Wen, Kimia Hassanzadeh, Daniel Graves, Dong Chen, Zhengbang Zhu, Nhat Nguyen, Mohamed Elsayed, Kun Shao, Sanjeevan Ahilan, Baokuan Zhang, Jiannan Wu, Zhengang Fu, Kasra Rezaee, Peyman Yadmellat, Mohsen Rohani, Nicolas Perez Nieves, Yihan Ni, Seyedershad Banijamali, Alexander Cowen Rivers, Zheng Tian, Daniel Palenicek, Haitham Bou Ammar, Hongbo Zhang, Wulong Liu, Jianye Hao, Jun Wang

We open-source the SMARTS platform and the associated benchmark tasks and evaluation metrics to encourage and empower research on multi-agent learning for autonomous driving.

Autonomous Driving Multi-agent Reinforcement Learning +2

880

Paper
Code

Multi-View Reinforcement Learning

1 code implementation • NeurIPS 2019 • Minne Li, Lisheng Wu, Haitham Bou Ammar, Jun Wang

This paper is concerned with multi-view reinforcement learning (MVRL), which allows for decision making when agents share common dynamics but adhere to different observation models.

Decision Making reinforcement-learning +1

Paper
Code

Derivative-Free & Order-Robust Optimisation

no code implementations • 9 Oct 2019 • Victor Gabillon, Rasul Tutunov, Michal Valko, Haitham Bou Ammar

In this paper, we formalise order-robust optimisation as an instance of online learning minimising simple regret, and propose Vroom, a zero'th order optimisation algorithm capable of achieving vanishing regret in non-stationary environments, while recovering favorable rates under stochastic reward-generating processes.

Paper
Add Code

$α^α$-Rank: Practically Scaling $α$-Rank through Stochastic Optimisation

no code implementations • 25 Sep 2019 • Yaodong Yang, Rasul Tutunov, Phu Sakulwongtana, Haitham Bou Ammar

Furthermore, we also show successful results on large joint strategy profiles with a maximum size in the order of $\mathcal{O}(2^{25})$ ($\approx 33$ million joint strategies) -- a setting not evaluable using $\alpha$-Rank with reasonable computational budget.

Stochastic Optimization

Paper
Add Code

Wasserstein Robust Reinforcement Learning

no code implementations • 30 Jul 2019 • Mohammed Amin Abdullah, Hang Ren, Haitham Bou Ammar, Vladimir Milenkovic, Rui Luo, Mingtian Zhang, Jun Wang

Reinforcement learning algorithms, though successful, tend to over-fit to training environments hampering their application to the real-world.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Distributed Multitask Reinforcement Learning with Quadratic Convergence

no code implementations • NeurIPS 2018 • Rasul Tutunov, Dongho Kim, Haitham Bou Ammar

Multitask reinforcement learning (MTRL) suffers from scalability issues when the number of tasks or trajectories grows large.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Learning to Communicate Implicitly By Actions

no code implementations • 10 Oct 2018 • Zheng Tian, Shihao Zou, Ian Davies, Tim Warr, Lisheng Wu, Haitham Bou Ammar, Jun Wang

The auxiliary reward for communication is integrated into the learning of the policy module.

Paper
Add Code

Estimating 3D Trajectories from 2D Projections via Disjunctive Factored Four-Way Conditional Restricted Boltzmann Machines

no code implementations • 20 Apr 2016 • Decebal Constantin Mocanu, Haitham Bou Ammar, Luis Puig, Eric Eaton, Antonio Liotta

Estimation, recognition, and near-future prediction of 3D trajectories based on their two dimensional projections available from one camera source is an exceptionally difficult problem due to uncertainty in the trajectories and environment, high dimensionality of the specific trajectory states, lack of enough labeled data and so on.

Future prediction Time Series +1

Paper
Add Code

Theoretically-Grounded Policy Advice from Multiple Teachers in Reinforcement Learning Settings with Applications to Negative Transfer

no code implementations • 13 Apr 2016 • Yusen Zhan, Haitham Bou Ammar, Matthew E. Taylor

This paper formally defines a setting where multiple teacher agents can provide advice to a student and introduces an algorithm to leverage both autonomous exploration and teacher's advice.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Safe Policy Search for Lifelong Reinforcement Learning with Sublinear Regret

no code implementations • 21 May 2015 • Haitham Bou Ammar, Rasul Tutunov, Eric Eaton

Lifelong reinforcement learning provides a promising framework for developing versatile agents that can accumulate knowledge over a lifetime of experience and rapidly learn new tasks by building upon prior knowledge.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.