1 code implementation • 7 Feb 2024 • Jannis Weil, Zhenghua Bao, Osama Abboud, Tobias Meuser
The size of the observed neighborhood limits the generalizability to different graphs and affects the reactivity of agents, the quality of the selected actions, and the communication overhead.
no code implementations • 24 Nov 2023 • Jannis Weil, Gizem Ekinci, Heinz Koeppl, Tobias Meuser
In addition to this message size selection, agents learn to encode and decode messages to improve their jointly trained policies.
1 code implementation • 22 May 2023 • Jannis Weil, Johannes Czech, Tobias Meuser, Kristian Kersting
In combination with Reinforcement Learning, Monte-Carlo Tree Search has shown to outperform human grandmasters in games such as Chess, Shogi and Go with little to no prior domain knowledge.