no code implementations • 16 Aug 2019 • Jiancheng Long, Hongming Zhang, Tianyang Yu, Bo Xu
In this method, iterative update can greatly alleviate the nonstationarity of the environment, unified representation can speed up the interaction with environment and avoid the linear growth of memory usage.
Multi-agent Reinforcement Learning reinforcement-learning +1