no code implementations • 11 Jan 2022 • Jesus Bujalance Martin, Fabien Moutarde
Our method is based on a reward bonus given to demonstrations and successful episodes (via relabeling), encouraging expert imitation and self-imitation.
no code implementations • 27 Oct 2021 • Jesus Bujalance Martin, Raphael Chekroun, Fabien Moutarde
We also present a new method for sparse-reward tasks, based on a reward bonus given to demonstrations and successful episodes.
no code implementations • 20 May 2021 • Maxence Mahe, Pierre Belamri, Jesus Bujalance Martin
The pipeline is divided into two parts: the first one is to capture the relevant information from the RGB video with a Computer Vision algorithm.