ICLR 2019 • Rohin Shah, Dmitrii Krasheninnikov, Jordan Alexander, Pieter Abbeel, Anca Dragan
We find that information from the initial state can be used to infer both side effects that should be avoided and preferences for how the environment should be organized.