no code implementations • 19 Feb 2024 • Han-Dong Lim, HyeAnn Lee, Donghwan Lee
Reinforcement learning has witnessed significant advancements, particularly with the emergence of model-based approaches.
no code implementations • 1 Oct 2023 • Han-Dong Lim, Donghwan Lee
The goal of this paper is to investigate distributed temporal difference (TD) learning for a networked multi-agent Markov decision process.
no code implementations • 31 Jul 2023 • Donghwan Lee, Han-Dong Lim, Do Wan Kim
The main goal of this paper is to investigate continuous-time distributed dynamic programming (DP) algorithms for networked multi-agent Markov decision problems (MAMDPs).
no code implementations • 16 Jun 2023 • Han-Dong Lim, Donghwan Lee
Temporal-difference (TD) learning is widely regarded as one of the most popular algorithms in reinforcement learning (RL).
no code implementations • 20 Feb 2023 • Han-Dong Lim, Donghwan Lee
Off-policy learning ability is an important feature of reinforcement learning (RL) for practical applications.
no code implementations • 25 Jul 2022 • Han-Dong Lim, Donghwan Lee
Q-learning has long been one of the most popular reinforcement learning algorithms, and theoretical analysis of Q-learning has been an active research topic for decades.
no code implementations • 11 Feb 2022 • Han-Dong Lim, Do Wan Kim, Donghwan Lee
This paper develops a new Q-learning algorithm that converges when linear function approximation is used.
no code implementations • 9 Sep 2021 • Donghwan Lee, Han-Dong Lim, Jihoon Park, Okyong Choi
Sutton, Szepesv\'{a}ri and Maei introduced the first gradient temporal-difference (GTD) learning algorithms compatible with both linear function approximation and off-policy training.