no code implementations • 24 Apr 2023 • Haneya Naeem Qureshi, Usama Masood, Marvin Manalastas, Syed Muhammad Asad Zaidi, Hasan Farooq, Julien Forgeat, Maxime Bouton, Shruti Bothe, Per Karlsson, Ali Rizwan, Ali Imran
The extensive survey of training data scarcity addressing techniques combined with proposed framework to select a suitable technique for given type of data, can assist researchers and network operators in choosing appropriate methods to overcome the data scarcity challenge in leveraging AI to radio access network automation.
no code implementations • 20 Jan 2023 • Maxime Bouton, Jaeseong Jeong, Jose Outes, Adriano Mendo, Alexandros Nikou
Future generations of mobile networks are expected to contain more and more antennas with growing complexity and more parameters.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 16 Nov 2022 • Viktor Eriksson Möllerstedt, Alessio Russo, Maxime Bouton
Non-differentiable controllers and rule-based policies are widely used for controlling real systems such as telecommunication networks and robots.
no code implementations • 27 Dec 2021 • Yifei Jin, Filippo Vannella, Maxime Bouton, Jaeseong Jeong, Ezeddin Al Hakim
GAQ relies on a graph attention mechanism to select relevant neighbors information, improve the agent state representation, and update the tilt control policy based on a history of observations using a Deep Q-Network (DQN).
no code implementations • 30 Sep 2021 • Maxime Bouton, Hasan Farooq, Julien Forgeat, Shruti Bothe, Meral Shirazipour, Per Karlsson
In this work, we demonstrate how to use coordination graphs and reinforcement learning in a complex application involving hundreds of cooperating agents.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 25 May 2020 • Maxime Bouton, Alireza Nakhaei, David Isele, Kikuo Fujimura, Mykel J. Kochenderfer
This approach exposes the agent to a broad variety of behaviors during training, which promotes learning policies that are robust to model discrepancies.
1 code implementation • 11 Jan 2020 • Maxime Bouton, Jana Tumova, Mykel J. Kochenderfer
Autonomous systems are often required to operate in partially observable environments.
1 code implementation • 26 Jun 2019 • Maxime Bouton, Alireza Nakhaei, Kikuo Fujimura, Mykel J. Kochenderfer
In this work, we present a reinforcement learning approach to learn how to interact with drivers with different cooperation levels.
2 code implementations • 25 Apr 2019 • Maxime Bouton, Alireza Nakhaei, Kikuo Fujimura, Mykel J. Kochenderfer
Navigating urban environments represents a complex task for automated vehicles.
1 code implementation • 25 Apr 2019 • Markus Schratter, Maxime Bouton, Mykel J. Kochenderfer, Daniel Watzenig
We show that combining the two approaches provides a robust autonomous braking system that reduces unnecessary braking caused by using the AEB system on its own.
no code implementations • 15 Apr 2019 • Maxime Bouton, Jesper Karlsson, Alireza Nakhaei, Kikuo Fujimura, Mykel J. Kochenderfer, Jana Tumova
We propose a generic approach to enforce probabilistic guarantees on an RL agent.
1 code implementation • 6 Feb 2018 • Maxime Bouton, Kyle Julian, Alireza Nakhaei, Kikuo Fujimura, Mykel J. Kochenderfer
In contexts where an agent interacts with multiple entities, utility decomposition can be used to separate the global objective into local tasks considering each individual entity independently.
no code implementations • 14 Apr 2017 • Maxime Bouton, Akansel Cosgun, Mykel J. Kochenderfer
Urban intersections represent a complex environment for autonomous vehicles with many sources of uncertainty.
Robotics