no code implementations • 9 May 2017 • Steven Stenberg Hansen
We present a new deep meta reinforcement learner, which we call Deep Episodic Value Iteration (DEVI).
Meta Reinforcement Learning • Model-based Reinforcement Learning
no code implementations • 14 Jan 2017 • Steven Stenberg Hansen
This means that vanishing gradients aren't a problem, as all of the necessary gradient paths are short.
no code implementations • 14 Jan 2017 • Steven Stenberg Hansen
The position of this article is that only by aligning our agents' abilities and environments with those of humans do we stand a chance at developing general artificial intelligence (GAI).