2 code implementations • 10 Sep 2018 • Michał Garmulewicz, Henryk Michalewski, Piotr Miłoś
We propose an expert-augmented actor-critic algorithm, which we evaluate on two environments with sparse rewards: Montezumas Revenge and a demanding maze from the ViZDoom suite.