1 code implementation • 3 Jul 2022 • Edoardo Cetin, Philip J. Ball, Steve Roberts, Oya Celiktutan
Off-policy reinforcement learning (RL) from pixel observations is notoriously unstable.
Data Augmentation reinforcement-learning +1