1 code implementation • 21 Apr 2023 • Shanchuan Wan, Yujin Tang, Yingtao Tian, Tomoyuki Kaneko
Exploration is a fundamental aspect of reinforcement learning (RL), and its effectiveness is a deciding factor in the performance of RL algorithms, especially when facing sparse extrinsic rewards.
1 code implementation • 6 Oct 2020 • Yuji Kanagawa, Tomoyuki Kaneko
We consider the problem of autonomously learning reusable temporally extended actions, or options, in reinforcement learning.
no code implementations • 17 Aug 2020 • Quentin Gendre, Tomoyuki Kaneko
Catan is a strategic board game having interesting properties, including multi-player, imperfect information, stochastic, complex state space structure (hexagonal board where each vertex, edge and face has its own features, cards for each player, etc), and a large action space (including negotiation).
2 code implementations • 17 Apr 2019 • Yuji Kanagawa, Tomoyuki Kaneko
Following these studies, we propose the use of roguelikes as a benchmark for evaluating the generalization ability of RL agents.