no code implementations • 25 Nov 2021 • Xiaoxiao Zhao, Jinlong Lei, Li Li, Jie Chen
This paper studies a distributed policy gradient in collaborative multi-agent reinforcement learning (MARL), where agents over a communication network aim to find the optimal policy to maximize the average of all agents' local returns.
Multi-agent Reinforcement Learning reinforcement-learning +2
no code implementations • ECCV 2020 • Zhen Zhao, Miaojing Shi, Xiaoxiao Zhao, Li Li
To learn a reliable people counter from crowd images, head center annotations are normally required.