no code implementations • ICLR 2019 • David Earl Hostallero, Daewoo Kim, Kyunghwan Son, Yung Yi
Under these semi-cooperative scenarios, popular methods of centralized training with decentralized execution for inducing cooperation and removing the non-stationarity problem do not work well due to lack of a common shared reward as well as inscalability in centralized training.
Multi-agent Reinforcement Learning Reinforcement Learning (RL)
no code implementations • 29 Oct 2022 • Roben Delos Reyes, Kyunghwan Son, Jinhwan Jung, Wan Ju Kang, Yung Yi
First, we develop a two-headed curiosity module that is trained to predict the corresponding agent's next observation in the first head and the next joint observation in the second head.
no code implementations • 29 Sep 2021 • Kyunghwan Son, Junsu Kim, Yung Yi, Jinwoo Shin
Although these two sources are both important factors for learning robust policies of agents, prior works do not separate them or deal with only a single risk source, which could lead to suboptimal equilibria.
Ranked #1 on SMAC+ on Off_Near_parallel
Multi-agent Reinforcement Learning reinforcement-learning +3
no code implementations • 22 Jun 2020 • Kyunghwan Son, Sung-Soo Ahn, Roben Delos Reyes, Jinwoo Shin, Yung Yi
QTRAN is a multi-agent reinforcement learning (MARL) algorithm capable of learning the largest class of joint-action value functions up to date.
no code implementations • 19 Feb 2020 • Hai H. Tran, Sumyeong Ahn, Taeyoung Lee, Yung Yi
In this paper, we propose an idea of empowering the discriminativeness: Adding a new, artificial class and training the model on the data together with the GAN-generated samples of the new class.
no code implementations • 30 Jan 2020 • Se-eun Yoon, HyungSeok Song, Kijung Shin, Yung Yi
Hypergraphs provide a natural way of representing group relations, whose complexity motivates an extensive array of prior work to adopt some form of abstraction and simplification of higher-order interactions.
no code implementations • 9 Sep 2019 • Hyungseok Song, Hyeryung Jang, Hai H. Tran, Se-eun Yoon, Kyunghwan Son, Donggyu Yun, Hyoju Chung, Yung Yi
IS-MDP decomposes a joint action of selecting K items simultaneously into K iterative selections resulting in the decrease of actions at the expense of an exponential increase of states.
3 code implementations • 14 May 2019 • Kyunghwan Son, Daewoo Kim, Wan Ju Kang, David Earl Hostallero, Yung Yi
We explore value-based solutions for multi-agent reinforcement learning (MARL) tasks in the centralized training with decentralized execution (CTDE) regime popularized recently.
Ranked #1 on SMAC+ on Off_Superhard_parallel
Multi-agent Reinforcement Learning reinforcement-learning +2
1 code implementation • ICLR 2019 • Daewoo Kim, Sangwoo Moon, David Hostallero, Wan Ju Kang, Taeyoung Lee, Kyunghwan Son, Yung Yi
Many real-world reinforcement learning tasks require multiple agents to make sequential decisions under the agents' interaction, where well-coordinated actions among the agents are crucial to achieve the target goal better at these tasks.
Multi-agent Reinforcement Learning reinforcement-learning +2
no code implementations • 29 Apr 2018 • Hyeryung Jang, HyungSeok Song, Yung Yi
In this paper, we consider the problem of recovering a graph that represents the statistical data dependency among nodes for a set of data samples generated by nodes, which provides the basic structure to perform an inference task, such as MAP (maximum a posteriori).
no code implementations • 28 Feb 2017 • Jungseul Ok, Sewoong Oh, Yunhun Jang, Jinwoo Shin, Yung Yi
Crowdsourcing platforms emerged as popular venues for purchasing human intelligence at low cost for large volume of tasks.
no code implementations • 26 May 2016 • Hyeryung Jang, Hyungwon Choi, Yung Yi, Jinwoo Shin
This paper studies the problem of parameter learning in probabilistic graphical models having latent variables, where the standard approach is the expectation maximization algorithm alternating expectation (E) and maximization (M) steps.
no code implementations • 11 Feb 2016 • Jungseul Ok, Sewoong Oh, Jinwoo Shin, Yung Yi
Crowdsourcing systems are popular for solving large-scale labelling tasks with low-paid workers.