no code implementations • 1 Nov 2023 • Alain Andres, Daochen Zha, Javier Del Ser
Exploration poses a fundamental challenge in Reinforcement Learning (RL) with sparse rewards, limiting an agent's ability to learn optimal decision-making due to a lack of informative feedback signals.
no code implementations • 18 Apr 2023 • Alain Andres, Lukas Schäfer, Esther Villar-Rodriguez, Stefano V. Albrecht, Javier Del Ser
Motivated by the recent success of Offline RL and Imitation Learning (IL), we conduct a study to investigate whether agents can leverage offline data in the form of trajectories to improve the sample-efficiency in procedurally generated environments.
1 code implementation • 30 Nov 2022 • Alain Andres, Esther Villar-Rodriguez, Javier Del Ser
Unfortunately, in a broad range of problems the design of a good reward function is not trivial, so in such cases sparse reward signals are instead adopted.
1 code implementation • 23 May 2022 • Alain Andres, Esther Villar-Rodriguez, Javier Del Ser
In the last few years, the research activity around reinforcement learning tasks formulated over environments with sparse rewards has been especially notable.
1 code implementation • 24 Feb 2022 • Alain Andres, Esther Villar-Rodriguez, Javier Del Ser
In this work we combine ideas from intrinsic motivation and transfer learning.