Search Results for author: Kuekyeng Kim

Found 2 papers, 0 papers with code

Variational Reward Estimator Bottleneck: Learning Robust Reward Estimator for Multi-Domain Task-Oriented Dialog

no code implementations · 31 May 2020 · Jeiyoon Park, Chanhee Lee, Kuekyeng Kim, Heuiseok Lim

Despite the notable success of adversarial learning approaches to multi-domain task-oriented dialog systems, training the dialog policy via adversarial inverse reinforcement learning often fails to balance the performance of the policy generator and the reward estimator.

Reinforcement Learning (RL)
