no code implementations • 5 Mar 2024 • Liangzhou Wang, Kaiwen Zhu, Fengming Zhu, Xinghu Yao, Shujie Zhang, Deheng Ye, Haobo Fu, Qiang Fu, Wei Yang
The common goal is an achievable state with high value, which is obtained by sampling from the distribution of future states.
1 code implementation • 11 Nov 2019 • Xinghu Yao, Chao Wen, Yuhui Wang, Xiaoyang Tan
Learning a stable and generalizable centralized value function (CVF) is a crucial but challenging task in multi-agent reinforcement learning (MARL), as it has to deal with the issue that the joint action space increases exponentially with the number of agents in such scenarios.