Search Results for author: Edward Beeching

Found 7 papers, 5 papers with code

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

1 code implementation • 15 Feb 2024 • Quentin Gallouédec, Edward Beeching, Clément Romac, Emmanuel Dellandréa

The search for a general model that can operate seamlessly across multiple domains remains a key goal in machine learning research.

Decision Making Reinforcement Learning (RL)

103

Paper
Code

Zephyr: Direct Distillation of LM Alignment

1 code implementation • 25 Oct 2023 • Lewis Tunstall, Edward Beeching, Nathan Lambert, Nazneen Rajani, Kashif Rasul, Younes Belkada, Shengyi Huang, Leandro von Werra, Clémentine Fourrier, Nathan Habib, Nathan Sarrazin, Omar Sanseviero, Alexander M. Rush, Thomas Wolf

Starting from a dataset of outputs ranked by a teacher model, we apply distilled direct preference optimization (dDPO) to learn a chat model with significantly improved intent alignment.

2D Cyclist Detection Language Modelling

3,846

Paper
Code

Graph augmented Deep Reinforcement Learning in the GameRLand3D environment

no code implementations • 22 Dec 2021 • Edward Beeching, Maxim Peter, Philippe Marcotte, Jilles Debangoye, Olivier Simonin, Joshua Romoff, Christian Wolf

We address planning and navigation in challenging 3D video games featuring maps with disconnected regions reachable by agents using special actions.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Godot Reinforcement Learning Agents

1 code implementation • 7 Dec 2021 • Edward Beeching, Jilles Debangoye, Olivier Simonin, Christian Wolf

We present Godot Reinforcement Learning (RL) Agents, an open-source interface for developing environments and agents in the Godot Game Engine.

reinforcement-learning Reinforcement Learning (RL)

754

Paper
Code

Learning to plan with uncertain topological maps

1 code implementation • ECCV 2020 • Edward Beeching, Jilles Dibangoye, Olivier Simonin, Christian Wolf

We train an agent to navigate in 3D environments using a hierarchical strategy including a high-level graph based planner and a local policy.

Inductive Bias Navigate

Paper
Code

EgoMap: Projective mapping and structured egocentric memory for Deep RL

no code implementations • 24 Jan 2020 • Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin

The EgoMap architecture incorporates several inductive biases including a differentiable inverse projection of CNN feature vectors onto a top-down spatially structured map.

Memorization reinforcement-learning +1

Paper
Add Code

Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer

1 code implementation • 3 Apr 2019 • Edward Beeching, Christian Wolf, Jilles Dibangoye, Olivier Simonin

In this paper we argue that research on training agents capable of complex reasoning can be simplified by decoupling from the requirement of high fidelity photographic observations.

Reinforcement Learning (RL) Scene Understanding

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.