Search Results for author: Juan Rocamonde

Found 2 papers, 2 papers with code

Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning

1 code implementation • 19 Oct 2023 • Juan Rocamonde, Victoriano Montesinos, Elvis Nava, Ethan Perez, David Lindner

We find that VLM-RMs are remarkably robust as long as the VLM is large enough.

Prompt Engineering reinforcement-learning +2

19

imitation: Clean Imitation Learning Implementations

2 code implementations • 22 Nov 2022 • Adam Gleave, Mohammad Taufeeque, Juan Rocamonde, Erik Jenner, Steven H. Wang, Sam Toyer, Maximilian Ernestus, Nora Belrose, Scott Emmons, Stuart Russell

imitation provides open-source implementations of imitation and reward learning algorithms in PyTorch.

Imitation Learning reinforcement-learning +1

1,136
