no code implementations • 29 Dec 2018 • Yiming Shen, Kehan Yang, Yufeng Yuan, Simon Cheng Liu
In this paper, we propose a novel meta-learning method in a reinforcement learning setting, based on evolution strategies (ES), exploration in parameter space and deterministic policy gradients.