no code implementations • ICML 2020 • Somdeb Majumdar, Shauharda Khadka, Santiago Miret, Stephen Mcaleer, Kagan Tumer
Training policies solely on the team-based reward is often difficult due to its sparsity.
no code implementations • 18 Jun 2019 • Shauharda Khadka, Somdeb Majumdar, Santiago Miret, Stephen Mcaleer, Kagan Tumer
Training policies solely on the team-based reward is often difficult due to its sparsity.
1 code implementation • 2 May 2019 • Shauharda Khadka, Somdeb Majumdar, Tarek Nassar, Zach Dwiel, Evren Tumer, Santiago Miret, Yinyin Liu, Kagan Tumer
Deep reinforcement learning algorithms have been successfully applied to a range of challenging control tasks.
6 code implementations • NeurIPS 2018 • Shauharda Khadka, Kagan Tumer
However, these methods typically suffer from three core difficulties: temporal credit assignment with sparse rewards, lack of effective exploration, and brittle convergence properties that are extremely sensitive to hyperparameters.