no code implementations • 29 Dec 2021 • Chris Beeler, Xinkai Li, Colin Bellinger, Mark Crowley, Maia Fraser, Isaac Tamblyn
Using a novel toy nautical navigation environment, we show that dynamic programming can be used when only incomplete information about a partially observed Markov decision process (POMDP) is known.
no code implementations • NeurIPS 2021 • Haoye Lu, Yongyi Mao, Maia Fraser
In particular, we describe a convex optimization problem that arises in a family of estimation tasks commonly appearing in the design of deep learning models.
no code implementations • NeurIPS 2016 • Maia Fraser
We use this to give conditions on the marginal and conditional models which imply an advantage for multi-step learning approaches.