1 code implementation • 19 Oct 2023 • Juan Rocamonde, Victoriano Montesinos, Elvis Nava, Ethan Perez, David Lindner
We find that VLM-RMs (vision-language models used as reward models) are remarkably robust as long as the VLM is large enough.
2 code implementations • 22 Nov 2022 • Adam Gleave, Mohammad Taufeeque, Juan Rocamonde, Erik Jenner, Steven H. Wang, Sam Toyer, Maximilian Ernestus, Nora Belrose, Scott Emmons, Stuart Russell
imitation provides open-source implementations of imitation and reward learning algorithms in PyTorch.