no code implementations • ICML 2018 • Eugenio Bargiacchi, Timothy Verstraeten, Diederik Roijers, Ann Nowé, Hado Hasselt
Learning to coordinate between multiple agents is an important problem in many reinforcement learning problems.
Multi-Armed Bandits Q-Learning