no code implementations • 23 Feb 2023 • Takuma Yoneda, Luzhe Sun, and Ge Yang, Bradly Stadie, Matthew Walter
Traditional approaches to shared autonomy rely on knowledge of the environment dynamics, a discrete space of user goals that is known a priori, or knowledge of the user's policy -- assumptions that are unrealistic in many domains.
no code implementations • 15 Dec 2021 • Takuma Yoneda, Ge Yang, Matthew R. Walter, Bradly Stadie
A robot's deployment environment often involves perceptual changes that differ from what it has experienced during training.
2 code implementations • ICML 2020 • Silviu Pitis, Harris Chan, Stephen Zhao, Bradly Stadie, Jimmy Ba
What goals should a multi-goal reinforcement learning agent pursue during training in long-horizon tasks?
no code implementations • ICLR 2020 • Matthew Shunshi Zhang, Bradly Stadie
Yet, these same techniques often falter when applied to the problem of recovering sparse recurrent networks.
no code implementations • NeurIPS 2018 • Bradly Stadie, Ge Yang, Rein Houthooft, Peter Chen, Yan Duan, Yuhuai Wu, Pieter Abbeel, Ilya Sutskever
Results are presented on a new environment we call `Krazy World': a difficult high-dimensional gridworld which is designed to highlight the importance of correctly differentiating through sampling distributions in meta-reinforcement learning.