Search Results for author: Yung Yi

Found 13 papers, 2 papers with code

Inducing Cooperation via Learning to reshape rewards in semi-cooperative multi-agent reinforcement learning

no code implementations ICLR 2019 David Earl Hostallero, Daewoo Kim, Kyunghwan Son, Yung Yi

Under these semi-cooperative scenarios, popular methods of centralized training with decentralized execution for inducing cooperation and removing the non-stationarity problem do not work well due to lack of a common shared reward as well as inscalability in centralized training.

Multi-agent Reinforcement Learning Reinforcement Learning (RL)

Curiosity-Driven Multi-Agent Exploration with Mixed Objectives

no code implementations29 Oct 2022 Roben Delos Reyes, Kyunghwan Son, Jinhwan Jung, Wan Ju Kang, Yung Yi

First, we develop a two-headed curiosity module that is trained to predict the corresponding agent's next observation in the first head and the next joint observation in the second head.

Disentangling Sources of Risk for Distributional Multi-Agent Reinforcement Learning

no code implementations29 Sep 2021 Kyunghwan Son, Junsu Kim, Yung Yi, Jinwoo Shin

Although these two sources are both important factors for learning robust policies of agents, prior works do not separate them or deal with only a single risk source, which could lead to suboptimal equilibria.

Multi-agent Reinforcement Learning reinforcement-learning +3

QTRAN++: Improved Value Transformation for Cooperative Multi-Agent Reinforcement Learning

no code implementations22 Jun 2020 Kyunghwan Son, Sung-Soo Ahn, Roben Delos Reyes, Jinwoo Shin, Yung Yi

QTRAN is a multi-agent reinforcement learning (MARL) algorithm capable of learning the largest class of joint-action value functions up to date.

reinforcement-learning Reinforcement Learning (RL) +2

Enlarging Discriminative Power by Adding an Extra Class in Unsupervised Domain Adaptation

no code implementations19 Feb 2020 Hai H. Tran, Sumyeong Ahn, Taeyoung Lee, Yung Yi

In this paper, we propose an idea of empowering the discriminativeness: Adding a new, artificial class and training the model on the data together with the GAN-generated samples of the new class.

Unsupervised Domain Adaptation

How Much and When Do We Need Higher-order Information in Hypergraphs? A Case Study on Hyperedge Prediction

no code implementations30 Jan 2020 Se-eun Yoon, HyungSeok Song, Kijung Shin, Yung Yi

Hypergraphs provide a natural way of representing group relations, whose complexity motivates an extensive array of prior work to adopt some form of abstraction and simplification of higher-order interactions.

Hyperedge Prediction

Solving Continual Combinatorial Selection via Deep Reinforcement Learning

no code implementations9 Sep 2019 Hyungseok Song, Hyeryung Jang, Hai H. Tran, Se-eun Yoon, Kyunghwan Son, Donggyu Yun, Hyoju Chung, Yung Yi

IS-MDP decomposes a joint action of selecting K items simultaneously into K iterative selections resulting in the decrease of actions at the expense of an exponential increase of states.

reinforcement-learning Reinforcement Learning (RL)

QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning

3 code implementations14 May 2019 Kyunghwan Son, Daewoo Kim, Wan Ju Kang, David Earl Hostallero, Yung Yi

We explore value-based solutions for multi-agent reinforcement learning (MARL) tasks in the centralized training with decentralized execution (CTDE) regime popularized recently.

Multi-agent Reinforcement Learning reinforcement-learning +2

Learning to Schedule Communication in Multi-agent Reinforcement Learning

1 code implementation ICLR 2019 Daewoo Kim, Sangwoo Moon, David Hostallero, Wan Ju Kang, Taeyoung Lee, Kyunghwan Son, Yung Yi

Many real-world reinforcement learning tasks require multiple agents to make sequential decisions under the agents' interaction, where well-coordinated actions among the agents are crucial to achieve the target goal better at these tasks.

Multi-agent Reinforcement Learning reinforcement-learning +2

Learning Data Dependency with Communication Cost

no code implementations29 Apr 2018 Hyeryung Jang, HyungSeok Song, Yung Yi

In this paper, we consider the problem of recovering a graph that represents the statistical data dependency among nodes for a set of data samples generated by nodes, which provides the basic structure to perform an inference task, such as MAP (maximum a posteriori).

Iterative Bayesian Learning for Crowdsourced Regression

no code implementations28 Feb 2017 Jungseul Ok, Sewoong Oh, Yunhun Jang, Jinwoo Shin, Yung Yi

Crowdsourcing platforms emerged as popular venues for purchasing human intelligence at low cost for large volume of tasks.

regression

Adiabatic Persistent Contrastive Divergence Learning

no code implementations26 May 2016 Hyeryung Jang, Hyungwon Choi, Yung Yi, Jinwoo Shin

This paper studies the problem of parameter learning in probabilistic graphical models having latent variables, where the standard approach is the expectation maximization algorithm alternating expectation (E) and maximization (M) steps.

Cannot find the paper you are looking for? You can Submit a new open access paper.