1 code implementation • Autonomous Agents and Multi Agent Systems (AAMAS) 2023 • Sumedh Pendurkar, Chris Chow, Luo Jie, Guni Sharon
We address a mechanism design problem where the goal of the designer is to maximize the entropy of a player’s mixed strategy at a Nash equilibrium.
1 code implementation • 20 Oct 2022 • Vaibhav Bajaj, Guni Sharon, Peter Stone
Applying reinforcement learning (RL) to sparse reward domains is notoriously challenging due to insufficient guiding signals.
1 code implementation • 20 Sep 2022 • Sheelabhadra Dey, Sumedh Pendurkar, Guni Sharon, Josiah P. Hanna
The learning process in JIRL assumes the availability of a baseline policy and is designed with two objectives in mind \textbf{(a)} leveraging the baseline's online demonstrations to minimize the regret w. r. t the baseline policy during training, and \textbf{(b)} eventually surpassing the baseline performance.
1 code implementation • 7 Sep 2022 • Sumedh Pendurkar, Taoan Huang, Sven Koenig, Guni Sharon
Our first experimental results for three representative NP-hard minimum-cost path problems suggest that using neural networks to approximate completely informed heuristic functions with high precision might result in network sizes that scale exponentially in the instance sizes.
no code implementations • 16 Apr 2022 • Aaron Parks-Young, Guni Sharon
This document provides technical details regarding the Hybrid-AIM simulator that was used in Sharon and Stone (2017) and Parks-Young and Sharon (2022).
1 code implementation • 23 Dec 2019 • James Ault, Josiah P. Hanna, Guni Sharon
Given such a safety-critical domain, the affiliated actuation policy is required to be interpretable in a way that can be understood and regulated by a human.
no code implementations • 27 Sep 2017 • Guni Sharon, Michael Albert, Tarun Rambha, Stephen Boyles, Peter Stone
This paper focuses on two commonly used path assignment policies for agents traversing a congested network: self-interested routing, and system-optimum routing.
no code implementations • 17 Feb 2017 • Hang Ma, Sven Koenig, Nora Ayanian, Liron Cohen, Wolfgang Hoenig, T. K. Satish Kumar, Tansel Uras, Hong Xu, Craig Tovey, Guni Sharon
Multi-agent path finding (MAPF) is well-studied in artificial intelligence, robotics, theoretical computer science and operations research.