no code implementations • 23 May 2019 • Or Raveh, Ron Meir
We develop model free PAC performance guarantees for multiple concurrent MDPs, extending recent works where a single learner interacts with multiple non-interacting agents in a noise free environment.
Multi-agent Reinforcement Learning reinforcement-learning +1