no code implementations • ICLR 2019 • David Earl Hostallero, Daewoo Kim, Kyunghwan Son, Yung Yi
Under these semi-cooperative scenarios, popular methods of centralized training with decentralized execution for inducing cooperation and removing the non-stationarity problem do not work well due to lack of a common shared reward as well as inscalability in centralized training.
Multi-agent Reinforcement Learning Reinforcement Learning (RL)
no code implementations • 6 May 2022 • Maximilian Igl, Daewoo Kim, Alex Kuefler, Paul Mougin, Punit Shah, Kyriacos Shiarlis, Dragomir Anguelov, Mark Palatucci, Brandyn White, Shimon Whiteson
The beam search refines these policies on the fly by pruning branches that are unfavourably evaluated by a discriminator.
3 code implementations • 14 May 2019 • Kyunghwan Son, Daewoo Kim, Wan Ju Kang, David Earl Hostallero, Yung Yi
We explore value-based solutions for multi-agent reinforcement learning (MARL) tasks in the centralized training with decentralized execution (CTDE) regime popularized recently.
Ranked #1 on SMAC+ on Off_Superhard_parallel
Multi-agent Reinforcement Learning reinforcement-learning +2
1 code implementation • ICLR 2019 • Daewoo Kim, Sangwoo Moon, David Hostallero, Wan Ju Kang, Taeyoung Lee, Kyunghwan Son, Yung Yi
Many real-world reinforcement learning tasks require multiple agents to make sequential decisions under the agents' interaction, where well-coordinated actions among the agents are crucial to achieve the target goal better at these tasks.
Multi-agent Reinforcement Learning reinforcement-learning +2