no code implementations • 2 Feb 2021 • Tom Everitt, Ryan Carey, Eric Langlois, Pedro A Ortega, Shane Legg
We propose a new graphical criterion for value of control, establishing its soundness and completeness.
no code implementations • 20 Jan 2020 • Ryan Carey, Eric Langlois, Tom Everitt, Shane Legg
Which variables does an agent have an incentive to control with its decision, and which variables does it have an incentive to respond to?
2 code implementations • 3 Jul 2019 • Tingwu Wang, Xuchan Bao, Ignasi Clavera, Jerrick Hoang, Yeming Wen, Eric Langlois, Shunshi Zhang, Guodong Zhang, Pieter Abbeel, Jimmy Ba
Model-based reinforcement learning (MBRL) is widely seen as having the potential to be significantly more sample efficient than model-free RL.