Search Results for author: Edward Beeching

Found 7 papers, 5 papers with code

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

1 code implementation15 Feb 2024 Quentin Gallouédec, Edward Beeching, Clément Romac, Emmanuel Dellandréa

The search for a general model that can operate seamlessly across multiple domains remains a key goal in machine learning research.

Decision Making Reinforcement Learning (RL)

Zephyr: Direct Distillation of LM Alignment

1 code implementation25 Oct 2023 Lewis Tunstall, Edward Beeching, Nathan Lambert, Nazneen Rajani, Kashif Rasul, Younes Belkada, Shengyi Huang, Leandro von Werra, Clémentine Fourrier, Nathan Habib, Nathan Sarrazin, Omar Sanseviero, Alexander M. Rush, Thomas Wolf

Starting from a dataset of outputs ranked by a teacher model, we apply distilled direct preference optimization (dDPO) to learn a chat model with significantly improved intent alignment.

2D Cyclist Detection Language Modelling

Graph augmented Deep Reinforcement Learning in the GameRLand3D environment

no code implementations22 Dec 2021 Edward Beeching, Maxim Peter, Philippe Marcotte, Jilles Debangoye, Olivier Simonin, Joshua Romoff, Christian Wolf

We address planning and navigation in challenging 3D video games featuring maps with disconnected regions reachable by agents using special actions.

reinforcement-learning Reinforcement Learning (RL)

Godot Reinforcement Learning Agents

1 code implementation7 Dec 2021 Edward Beeching, Jilles Debangoye, Olivier Simonin, Christian Wolf

We present Godot Reinforcement Learning (RL) Agents, an open-source interface for developing environments and agents in the Godot Game Engine.

reinforcement-learning Reinforcement Learning (RL)

Learning to plan with uncertain topological maps

1 code implementation ECCV 2020 Edward Beeching, Jilles Dibangoye, Olivier Simonin, Christian Wolf

We train an agent to navigate in 3D environments using a hierarchical strategy including a high-level graph based planner and a local policy.

Inductive Bias Navigate

EgoMap: Projective mapping and structured egocentric memory for Deep RL

no code implementations24 Jan 2020 Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin

The EgoMap architecture incorporates several inductive biases including a differentiable inverse projection of CNN feature vectors onto a top-down spatially structured map.

Memorization reinforcement-learning +1

Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer

1 code implementation3 Apr 2019 Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin

In this paper we argue that research on training agents capable of complex reasoning can be simplified by decoupling from the requirement of high fidelity photographic observations.

Reinforcement Learning (RL) Scene Understanding

Cannot find the paper you are looking for? You can Submit a new open access paper.