3 code implementations • 6 Feb 2023 • Thomas Carta, Clément Romac, Thomas Wolf, Sylvain Lamprier, Olivier Sigaud, Pierre-Yves Oudeyer
Using an interactive textual environment designed to study higher-level forms of functional grounding, and a set of spatial and navigation tasks, we study several scientific questions: 1) Can LLMs boost sample efficiency for online learning of various RL tasks?
1 code implementation • 20 Jun 2022 • Thomas Carta, Pierre-Yves Oudeyer, Olivier Sigaud, Sylvain Lamprier
Reinforcement learning (RL) in long horizon and sparse reward tasks is notoriously difficult and requires a lot of training steps.
no code implementations • 26 Oct 2020 • Thomas Carta, Subhajit Chaudhury, Kartik Talamadupula, Michiaki Tatsubori
The goal is to force an RL agent to use both text and visual features to predict natural language action commands for solving the final task of cooking a meal.