no code implementations • 29 Sep 2021 • John Kenton Moore, Junier Oliva
This work develops a novel black-box optimization technique for learning robust policies for stochastic environments.
OpenAI Gym reinforcement-learning