no code implementations • NeurIPS 2023 • Sungho Choi, Seungyul Han, Woojun Kim, Jongseong Chae, Whiyoung Jung, Youngchul Sung
In this paper, we consider domain-adaptive imitation learning with visual observation, where an agent in a target domain learns to perform a task by observing expert demonstrations in a source domain.
no code implementations • 1 Mar 2023 • Woojun Kim, Whiyoung Jung, Myungsik Cho, Youngchul Sung
In this paper, we propose a new mutual information framework for multi-agent reinforcement learning to enable multiple agents to learn coordinated behaviors by regularizing the accumulated return with the simultaneous mutual information between multi-agent actions.
Multi-agent Reinforcement Learning reinforcement-learning +1
1 code implementation • 28 Nov 2022 • Whiyoung Jung, Myungsik Cho, Jongeui Park, Youngchul Sung
This paper proposes a framework, named Quantile Constrained RL (QCRL), to constrain the quantile of the distribution of the cumulative sum cost that is a necessary and sufficient condition to satisfy the outage constraint.
1 code implementation • 20 Jun 2022 • Jeewon Jeon, Woojun Kim, Whiyoung Jung, Youngchul Sung
In this paper, we consider cooperative multi-agent reinforcement learning (MARL) with sparse reward.
1 code implementation • 19 Jun 2022 • Jongseong Chae, Seungyul Han, Whiyoung Jung, Myungsik Cho, Sungho Choi, Youngchul Sung
In this paper, we propose a robust imitation learning (IL) framework that improves the robustness of IL when environment dynamics are perturbed.
no code implementations • 1 Jan 2021 • Giseung Park, Whiyoung Jung, Sungho Choi, Youngchul Sung
In this paper, we consider intrinsic reward generation for sparse-reward reinforcement learning based on model prediction errors.
no code implementations • 4 Jun 2020 • Woojun Kim, Whiyoung Jung, Myungsik Cho, Youngchul Sung
In this paper, we propose a maximum mutual information (MMI) framework for multi-agent reinforcement learning (MARL) to enable multiple agents to learn coordinated behaviors by regularizing the accumulated return with the mutual information between actions.
Multiagent Systems
1 code implementation • ICLR 2020 • Whiyoung Jung, Giseung Park, Youngchul Sung
In the proposed scheme, multiple identical learners with their own value-functions and policies share a common experience replay buffer, and search a good policy in collaboration with the guidance of the best policy information.
no code implementations • 25 Sep 2019 • Giseung Park, Whiyoung Jung, Sungho Choi, Youngchul Sung
In this paper, a new intrinsic reward generation method for sparse-reward reinforcement learning is proposed based on an ensemble of dynamics models.
no code implementations • 27 Sep 2018 • Whiyoung Jung, Giseung Park, Youngchul Sung
In this paper, a new interactive parallel learning scheme is proposed to enhance the performance of off-policy continuous-action reinforcement learning.