1 code implementation • 8 Dec 2020 • Griffin Adams, Sarguna Janani Padmanabhan, Shivang Shekhar
We address two major challenges of implicit coordination in multi-agent deep reinforcement learning, non-stationarity and the exponential growth of the state-action space, by combining Deep Q-Networks for policy learning with Nash equilibrium for action selection.
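To make the idea of Nash-equilibrium action selection concrete, the following is a minimal sketch and not the authors' implementation. It assumes two agents, a single state, and per-agent joint-action Q-value tables `q1` and `q2` (hypothetical names standing in for the outputs of each agent's Q-network). It considers only pure-strategy equilibria, breaks ties by the highest summed Q-value, and falls back to the joint action with the highest summed Q-value when no pure equilibrium exists. All of these choices are illustrative assumptions.

```python
import numpy as np


def pure_nash_equilibria(q1, q2):
    """Return all pure-strategy Nash equilibria of a two-agent matrix game.

    q1[i, j] and q2[i, j] are hypothetical Q-value estimates for agents 1
    and 2 when agent 1 plays action i and agent 2 plays action j.
    """
    q1 = np.asarray(q1, dtype=float)
    q2 = np.asarray(q2, dtype=float)
    # Agent 1 picks the row: mark its best responses within each column.
    best1 = q1 == q1.max(axis=0, keepdims=True)
    # Agent 2 picks the column: mark its best responses within each row.
    best2 = q2 == q2.max(axis=1, keepdims=True)
    # A joint action is an equilibrium when both are best-responding.
    return [(int(i), int(j)) for i, j in np.argwhere(best1 & best2)]


def select_joint_action(q1, q2):
    """Pick a joint action from the equilibria (assumed tie-break: max sum)."""
    q1 = np.asarray(q1, dtype=float)
    q2 = np.asarray(q2, dtype=float)
    total = q1 + q2
    equilibria = pure_nash_equilibria(q1, q2)
    if not equilibria:
        # Assumed fallback: no pure equilibrium, so maximize the summed Q-value.
        i, j = np.unravel_index(np.argmax(total), total.shape)
        return int(i), int(j)
    return max(equilibria, key=lambda a: total[a])


if __name__ == "__main__":
    # Coordination game: both agents prefer matching actions.
    q = [[2.0, 0.0], [0.0, 1.0]]
    print(pure_nash_equilibria(q, q))
    print(select_joint_action(q, q))
```

In a full multi-agent setting the tables would be recomputed per state from the learned networks, and a mixed-strategy solver would be needed for games such as matching pennies that have no pure equilibrium.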