no code implementations • 21 Apr 2020 • Somjit Nath, Richa Verma, Abhik Ray, Harshad Khadilkar
We propose a generic reward shaping approach for improving the rate of convergence in reinforcement learning (RL), called Self Improvement Based REwards, or SIBRE.